23 research outputs found
A pilot study for channel catfish whole genome sequencing and de novo assembly
<p>Abstract</p> <p>Background</p> <p>Recent advances in next-generation sequencing technologies have drastically increased throughput and significantly reduced sequencing costs. However, the average read lengths in next-generation sequencing technologies are short as compared with that of traditional Sanger sequencing. The short sequence reads pose great challenges for <it>de novo </it>sequence assembly. As a pilot project for whole genome sequencing of the catfish genome, here we attempt to determine the proper sequence coverage, the proper software for assembly, and various parameters used for the assembly of a BAC physical map contig spanning approximately a million of base pairs.</p> <p>Results</p> <p>A combination of low sequence coverage of 454 and Illumina sequencing appeared to provide effective assembly as reflected by a high N50 value. Using 454 sequencing alone, a sequencing depth of 18 X was sufficient to obtain the good quality assembly, whereas a 70 X Illumina appeared to be sufficient for a good quality assembly. Additional sequencing coverage after 18 X of 454 or after 70 X of Illumina sequencing does not provide significant improvement of the assembly. Considering the cost of sequencing, a 2 X 454 sequencing, when coupled to 70 X Illumina sequencing, provided an assembly of reasonably good quality. With several software tested, Newbler with a seed length of 16 and ABySS with a K-value of 60 appear to be appropriate for the assembly of 454 reads alone and Illumina paired-end reads alone, respectively. Using both 454 and Illumina paired-end reads, a hybrid assembly strategy using Newbler for initial 454 sequence assembly, Velvet for initial Illumina sequence assembly, followed by a second step assembly using MIRA provided the best assembly of the physical map contig, resulting in 193 contigs with a N50 value of 13,123 bp.</p> <p>Conclusions</p> <p>A hybrid sequencing strategy using low sequencing depth of 454 and high sequencing depth of Illumina provided the good quality assembly with high N50 value and relatively low cost. A combination of Newbler, Velvet, and MIRA can be used to assemble the 454 sequence reads and the Illumina reads effectively. The assembled sequence can serve as a resource for comparative genome analysis. Additional long reads using the third generation sequencing platforms are needed to sequence through repetitive genome regions that should further enhance the sequence assembly.</p
Immunoglobulin Genomics in the Guinea Pig (Cavia porcellus)
In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be further enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. 507 VH segments (94 potentially functional genes and 413 pseudogenes), 41 DH segments, six JH segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many VH pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffold 37 and 24 is composed of 349 Vκ (111 potentially functional genes and 238 pseudogenes), three Jκ and one Cκ genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 Vλ (58 potentially functional genes and 84 pseudogenes) and 11 Jλ -Cλ clusters. Phylogenetic analysis suggested the guinea pig’s large germline VH gene segments appear to form limited gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves
Comprehensive survey and genomic characterization of Toll-Like Receptors (TLRs) in channel catfish, <em>Ictalurus punctatus</em>: Identification of novel fish TLRs
International audienc
Comparative genomics of transcription factors driving expression of the immunoglobulin heavy chain locus in teleost fish
Immune genes of the Atlantic salmon, Salmo salar L.: application of modern molecular genetic techniques to the study of antibodies and major histocompatibility complex antigens
Discovery of a unique Ig heavy-chain isotype (IgT) in rainbow trout: Implications for a distinctive B cell developmental pathway in teleost fish
During the analysis of Ig superfamily members within the available rainbow trout (Oncorhynchus mykiss) EST gene index, we identified a unique Ig heavy-chain (IgH) isotype. cDNAs encoding this isotype are composed of a typical IgH leader sequence and a VDJ rearranged segment followed by four Ig superfamily C-1 domains represented as either membrane-bound or secretory versions. Because teleost fish were previously thought to encode and express only two IgH isotypes (IgM and IgD) for their humoral immune repertoire, we isolated all three cDNA isotypes from a single homozygous trout (OSU-142) to confirm that all three are indeed independent isotypes. Bioinformatic and phylogenetic analysis indicates that this previously undescribed divergent isotype is restricted to bony fish, thus we have named this isotype “IgT” (τ) for teleost fish. Genomic sequence analysis of an OSU-142 bacterial artificial chromosome (BAC) clone positive for all three IgH isotypes revealed that IgT utilizes the standard rainbow trout V(H) families, but surprisingly, the IgT isotype possesses its own exclusive set of D(H) and J(H) elements for the generation of diversity. The IgT D and J segments and τ constant (C) region genes are located upstream of the D and J elements for IgM, representing a genomic IgH architecture that has not been observed in any other vertebrate class. All three isotypes are primarily expressed in the spleen and pronephros (bone marrow equivalent), and ontogenically, expression of IgT is present 4 d before hatching in developing embryos
