Search CORE

144 research outputs found

On the Origin and Evolution of Vertebrate Olfactory Receptor Genes: Comparative Genome Analysis Among 23 Chordate Species

Olfaction is a primitive sense in organisms. Both vertebrates and insects have receptors for detecting odor molecules in the environment, but the evolutionary origins of these genes are different. Among studied vertebrates, mammals have ∼1,000 olfactory receptor (OR) genes, whereas teleost fishes have much smaller (∼100) numbers of OR genes. To investigate the origin and evolution of vertebrate OR genes, I attempted to determine near-complete OR gene repertoires by searching whole-genome sequences of 14 nonmammalian chordates, including cephalochordates (amphioxus), urochordates (ascidian and larvacean), and vertebrates (sea lamprey, elephant shark, five teleost fishes, frog, lizard, and chicken), followed by a large-scale phylogenetic analysis in conjunction with mammalian OR genes identified from nine species. This analysis showed that the amphioxus has >30 vertebrate-type OR genes though it lacks distinctive olfactory organs, whereas all OR genes appear to have been lost in the urochordate lineage. Some groups of genes (θ, κ, and λ) that are phylogenetically nested within vertebrate OR genes showed few gene gains and losses, which is in sharp contrast to the evolutionary pattern of OR genes, suggesting that they are actually non-OR genes. Moreover, the analysis demonstrated a great difference in OR gene repertoires between aquatic and terrestrial vertebrates, reflecting the necessity for the detection of water-soluble and airborne odorants, respectively. However, a minor group (β) of genes that are atypically present in both aquatic and terrestrial vertebrates was also found. These findings should provide a critical foundation for further physiological, behavioral, and evolutionary studies of olfaction in various organisms

Crossref

PubMed Central

Law of Genome Evolution Direction : Coding Information Quantity Grows

Author: A. F. A. Smit
A. G. Matera
A. Mira
B. Charlesworth
C. L. Organ
C. Nusbaum
D. A. Petrov
D. L. Marais Des
D. R. Scannell
E. Schrodinger
E. T. Dermitzakis
F. Clark
G. Bejerano
G. Liu
G. Storz
H. H. Chou
H. H. Kazazian
H. Ozkan
H. Winter
I. J. Leitch
I. Wapinski
I. Wickelgren
International Human Genome Sequencing Consortium
J. Filkowski
J.M. Aury
K. M. Devos
L. F. Luo
L. F. Luo
L. F. Luo
L. He
L. Patthy
L. R. Zhang
Liao-fu Luo
R. J. Taft
R. P. Bininda-Edmonds
S. E. Peters
T. C. Stadtman
T. Kouzarides
T. R. Gregory
The ENCODE Project Consortium
W. Deng
W. Enard
W. H. Li
W. Makalowski
X. Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/08/2008
Field of study

The problem of the directionality of genome evolution is studied. Based on the analysis of C-value paradox and the evolution of genome size we propose that the function-coding information quantity of a genome always grows in the course of evolution through sequence duplication, expansion of code, and gene transfer from outside. The function-coding information quantity of a genome consists of two parts, p-coding information quantity which encodes functional protein and n-coding information quantity which encodes other functional elements except amino acid sequence. The evidences on the evolutionary law about the function-coding information quantity are listed. The needs of function is the motive force for the expansion of coding information quantity and the information quantity expansion is the way to make functional innovation and extension for a species. So, the increase of coding information quantity of a genome is a measure of the acquired new function and it determines the directionality of genome evolution.Comment: 16 page

arXiv.org e-Print Archive

Crossref

Comparative studies of glycosylphosphatidylinositol-anchored high-density lipoprotein-binding protein 1: evidence for a eutherian mammalian origin for the GPIHBP1 gene from an LY6-like gene

Glycosylphosphatidylinositol-anchored high-density lipoprotein-binding protein 1 (GPIHBP1) functions as a platform and transport agent for lipoprotein lipase (LPL) which functions in the hydrolysis of chylomicrons, principally in heart, skeletal muscle and adipose tissue capillary endothelial cells. Previous reports of genetic deficiency for this protein have described severe chylomicronemia. Comparative GPIHBP1 amino acid sequences and structures and GPIHBP1 gene locations were examined using data from several mammalian genome projects. Mammalian GPIHBP1 genes usually contain four coding exons on the positive strand. Mammalian GPIHBP1 sequences shared 41–96% identities as compared with 9–32% sequence identities with other LY6-domain-containing human proteins (LY6-like). The human N-glycosylation site was predominantly conserved among other mammalian GPIHBP1 proteins except cow, dog and pig. Sequence alignments, key amino acid residues and conserved predicted secondary structures were also examined, including the N-terminal signal peptide, the acidic amino acid sequence region which binds LPL, the glycosylphosphatidylinositol linkage group, the Ly6 domain and the C-terminal α-helix. Comparative and phylogenetic studies of mammalian GPIHBP1 suggested that it originated in eutherian mammals from a gene duplication event of an ancestral LY6-like gene and subsequent integration of exon 2, which may have been derived from BCL11A (B-cell CLL/lymphoma 11A gene) encoding an extended acidic amino acid sequence

Crossref

Springer - Publisher Connector

PubMed Central

Griffith Research Online

Recommended from our members

Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences

Author: Alonso
Aranda
Aretz
Ashburner
Babtie
Bell
Ben-Ari Fuchs
Bender
Berg
Bernfield
Betzen
Breda
Brum
Bulik-Sullivan
Bumgarner
Burgess
Byron
Cerami
Chatr-Aryamontri
Cho
Claudia Manzoni
Colantuoni
Collins
Consortium GT
Consortium UK
Cook
Croft
Cusick
Darmanis
Demis A Kia
Dilthey
Eichler
ENCODE Project Consortium
Finkbeiner
Fonseca
Gaul
Genomes
Gentleman
Gholami
Gillis
Giordano
Gonzaga-Jauregui
Gostev
Gudbjartsson
Guinney
Harrow
Harrow
Haug
Homer
Horaitis
Huang
Huang
Huang da
International HapMap Consortium
International Human Genome Sequencing Consortium
Jana Vandrovcova
Jin
John Hardy
Jostins
Kaiser
Kanehisa
Kang
Kannan
Kartashov
Kellis
Kerrien
Khatri
Koh
Kohler
Koufaris
Kristensen
Kukurba
Kulkarni
Lamb
Langfelder
Lappalainen
Law
Li
Liang
Londin
Manolio
Manolio
Manzoni
Marchini
Martens
Martens
Marx
Mattick
McKenna
McWilliam
Menzel
Metzker
Mi
Modelska
MSI Board Menbers
Nagalakshmi
Naifang
Nalls
Nicholas W Wood
Orchard
Orchard
Orchard
Orchard
Pantaleo
Pathan
Patrick A Lewis
Pearson
Perez-Riverol
Pible
Pop
Pritchard
Protein
Purcell
Raffaele Ferrari
Ramasamy
Reimand
Rhee
Rivas
Roider
Salek
Sales
Sanger
Scalbert
Schaefer
Schneider
Searls
Shendure
Shi
Smith
Speir
Szklarczyk
Tasan
Turing
Uda
UniProt Consortium
van Dijk
van Karnebeek
Vaughan
Venter
Vizcaino
Vizcaino
Vogelzang
Von Bertalanffy
Wain
Wang
Wang
Wanichthanarak
Warde-Farley
Watson
Westra
Wild
Williams
Wingender
Wishart
Wishart
Wu
Yang
Yao
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/03/2018
Field of study

Advances in the technologies and informatics used to generate and process large biological data sets (omics data) are promoting a critical shift in the study of biomedical sciences. While genomics, transcriptomics and proteinomics, coupled with bioinformatics and biostatistics, are gaining momentum, they are still, for the most part, assessed individually with distinct approaches generating monothematic rather than integrated knowledge. As other areas of biomedical sciences, including metabolomics, epigenomics and pharmacogenomics, are moving towards the omics scale, we are witnessing the rise of inter-disciplinary data integration strategies to support a better understanding of biological systems and eventually the development of successful precision medicine. This review cuts across the boundaries between genomics, transcriptomics and proteomics, summarizing how omics data are generated, analysed and shared, and provides an overview of the current strengths and weaknesses of this global approach. This work intends to target students and researchers seeking knowledge outside of their field of expertise and fosters a leap from the reductionist to the global-integrative analytical approach in research

Central Archive at the University of Reading

Crossref

UCL Discovery

Large publishing consortia produce higher citation impact research but co-author contributions are hard to evaluate

Author: AACR Project GENIE Consortium
Bernstein B. E.
Bornmann L.
Buniello A.
Clifford G. M.
Costa M. R.
Defazio D.
Fortunato S.
Garner C.
Gene Ontology Consortium
Gibbons M. R.
Hagen N. T.
Hoekman J.
Hu F.
Hu F. P.
International Human Genome Sequencing Consortium
Ioannidis J. P.
Iso H.
Kawashima H.
Larivière V.
Larivière V.
Levitt J. M.
Liu X. Z.
Moed H. F.
Mongeon P.
Moore F. A.
Munafò M. R.
Olds J. L.
Pe’er I.
Psaty B. M.
Riedel T.
Roberts L.
Sjögårde P.
Sud P.
Thelwall M.
Thelwall M.
van Raan A.
Vermeulen N.
Wagner C. S.
Waltman L.
Waltman L.
Welter D.
Wu D.
Wuchty S.
Xie Z.
Publication venue: 'MIT Press - Journals'
Publication date: 05/06/2019
Field of study

This paper introduces a simple agglomerative clustering method to identify large publishing consortia with at least 20 authors and 80% shared authorship between articles. Based on Scopus journal articles 1996-2018, under these criteria, nearly all (88%) of the large consortia published research with citation impact above the world average, with the exceptions being mainly the newer consortia for which average citation counts are unreliable. On average, consortium research had almost double (1.95) the world average citation impact on the log scale used (Mean Normalised Log Citation Score). At least partial alphabetical author ordering was the norm in most consortia. The 250 largest consortia were for nuclear physics and astronomy around expensive equipment, and for predominantly health-related issues in genomics, medicine, public health, microbiology and neuropsychology. For the health-related issues, except for the first and last few authors, authorship seem to primary indicate contributions to the shared project infrastructure necessary to gather the raw data. It is impossible for research evaluators to identify the contributions of individual authors in the huge alphabetical consortia of physics and astronomy, and problematic for the middle and end authors of health-related consortia. For small scale evaluations, authorship contribution statements could be used, when available

arXiv.org e-Print Archive

Crossref

Wolverhampton Intellectual Repository and E-theses

Multivariate Analysis and Visualization of Splicing Correlations in Single-Gene Transcriptomes

BACKGROUND: RNA metabolism, through 'combinatorial splicing', can generate enormous structural diversity in the proteome. Alternative domains may interact, however, with unpredictable phenotypic consequences, necessitating integrated RNA-level regulation of molecular composition. Splicing correlations within transcripts of single genes provide valuable clues to functional relationships among molecular domains as well as genomic targets for higher-order splicing regulation. RESULTS: We present tools to visualize complex splicing patterns in full-length cDNA libraries. Developmental changes in pair-wise correlations are presented vectorially in 'clock plots' and linkage grids. Higher-order correlations are assessed statistically through Monte Carlo analysis of a log-linear model with an empirical-Bayes estimate of the true probabilities of observed and unobserved splice forms. Log-linear coefficients are visualized in a 'spliceprint,' a signature of splice correlations in the transcriptome. We present two novel metrics: the linkage change index, which measures the directional change in pair-wise correlation with tissue differentiation, and the accuracy index, a very simple goodness-of-fit metric that is more sensitive than the integrated squared error when applied to sparsely populated tables, and unlike chi-square, does not diverge at low variance. Considerable attention is given to sparse contingency tables, which are inherent to single-gene libraries. CONCLUSION: Patterns of splicing correlations are revealed, which span a broad range of interaction order and change in development. The methods have a broad scope of applicability, beyond the single gene – including, for example, multiple gene interactions in the complete transcriptome

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Collection Of Biostatistics Research Archive

A genome-wide study of preferential amplification/hybridization in microarray-based pooled DNA experiments

Author: Akey
Arnheim
Bansal
Barcellos
Barratt
Buetow
Butcher
C.-H. Lin
C.S.J. Fann
Dubreuil
Guo
H.-C. Yang
Hillel
Hinds
Hinds
Holm
Hoogendoorn
Huang
J.-Y. Wu
Jawadi
Johnson
Kennedy
L.-H. Li
Le Hellard
Lindroos
Liu
M.-C. Huang
Macgregor
Matsuzaki
Meaburn
Meaburn
Mohlke
Moskvina
Nelson
Norton
Pan
Pusch
Sham
Shapiro
Shaw
Simpson
The ENCODE Project Consortium
The International HapMap Consortium
The International Human Genome Mapping Consortium
Uhl
Visscher
Werner
Wolford
Xu
Y.-J. Liang
Y.-T. Chen
Yang
Yang
Yang
Yang
Zou
Publication venue: Oxford University Press
Publication date: 23/08/2006
Field of study

Microarray-based pooled DNA methods overcome the cost bottleneck of simultaneously genotyping more than 100 000 markers for numerous study individuals. The success of such methods relies on the proper adjustment of preferential amplification/hybridization to ensure accurate and reliable allele frequency estimation. We performed a hybridization-based genome-wide single nucleotide polymorphisms (SNPs) genotyping analysis to dissect preferential amplification/hybridization. The majority of SNPs had less than 2-fold signal amplification or suppression, and the lognormal distributions adequately modeled preferential amplification/hybridization across the human genome. Comparative analyses suggested that the distributions of preferential amplification/hybridization differed among genotypes and the GC content. Patterns among different ethnic populations were similar; nevertheless, there were striking differences for a small proportion of SNPs, and a slight ethnic heterogeneity was observed. To fulfill appropriate and gratuitous adjustments, databases of preferential amplification/hybridization for African Americans, Caucasians and Asians were constructed based on the Affymetrix GeneChip Human Mapping 100 K Set. The robustness of allele frequency estimation using this database was validated by a pooled DNA experiment. This study provides a genome-wide investigation of preferential amplification/hybridization and suggests guidance for the reliable use of the database. Our results constitute an objective foundation for theoretical development of preferential amplification/hybridization and provide important information for future pooled DNA analyses

Crossref

PubMed Central

Genome-wide comparison of Asian and African rice reveals high recent activity of DNA transposons

Author: AH Paterson
AJ Hartlerode
B Edlinger
B Piegu
C Feschotte
F Sabot
G Yang
G Yang
G Yang
GTH Vu
International Human Genome Sequencing Consortium
International Rice Genome Sequencing Project
JM Richardson
JP Buchmann
K Fujino
K Kikuchi
L Duret
LS Symington
M Kimura
M Wang
N Jiang
P Cao
P SanMiguel
R Kalendar
RH Plasterk
S Moon
S Ouyang
T Nakazaki
T Wicker
T Wicker
T Wicker
T Wicker
TD Wu
TE Bureau
TE Bureau
The International Brachypodium Initiative
V Robert
WR Engels
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The bacterial and mitochondrial ribosomal A-site molecular switches possess different conformational substates

Author: Anderson
Barrell
Brünger
Brünger
Cannone
Carter
Collaborative Computational Project Number 4.
Dunham
Echols
Entelis
Eric Westhof
Fischel-Ghodsian
Florentz
Fokine
Fokine
François
François
Heckman
Hutchin
Hutchin
International Human Genome Sequencing Consortium.
Jiro Kondo
Jones
Koc
Koc
Kohanski
Komine
Kondo
Kondo
Kondo
Kondo
Krebs
Leontis
Leslie
Murphy
Navaza
Ogle
Ogle
Ogle
Ogle
O’Brien
Pfister
Prezant
Rossmann
Sanbonmatsu
Schneider
Selmer
Sengupta
Shandrick
Sprinzl
Suzuki
Suzuki
Terwilliger
Vicens
Vicens
Vicens
Weixlbaumer
Wimberly
Yokoyama
Zhao
Publication venue: Oxford University Press
Publication date: 01/05/2008
Field of study

The A site of the small ribosomal subunit participates in the fidelity of decoding by switching between two states, a resting ‘off’ state and an active decoding ‘on’ state. Eight crystal structures of RNA duplexes containing two minimal decoding A sites of the Homo sapiens mitochondrial wild-type, the A1555G mutant or bacteria have been solved. The resting ‘off’ state of the mitochondrial wild-type A site is surprisingly different from that of the bacterial A site. The mitochondrial A1555G mutant has two types of the ‘off’ states; one is similar to the mitochondrial wild-type ‘off’ state and the other is similar to the bacterial ‘off’ state. Our present results indicate that the dynamics of the A site in bacteria and mitochondria are different, a property probably related to the small number of tRNAs used for decoding in mitochondria. Based on these structures, we propose a hypothesis for the molecular mechanism of non-syndromic hearing loss due to the mitochondrial A1555G mutation

Physical mapping and BAC-end sequence analysis provide initial insights into the flax (Linum usitatissimum L.) genome

Author: A Diederichsen
ACJ Frijters
AH Paterson
AP Chan
Arabidopsis Genome Initiative
B Ewing
BC Meyers
C Soderlund
C Vitte
C Wu
CA Cullis
CA Cullis
CA Mathewson
CMC Bassett
CP Hong
CP Hong
CT Kelleher
CWJ Lai
D Zohary
DE Soltis
E Coe
E Datema
E Hribova
E Kvavadze
E Lerat
E Paux
E Paux
F Cheung
FM McCarthy
French-Italian Consortium for Grapevine Genome Characterization
GA Tuskan
GM Evans
HB Zhang
HBM Ali
International Brachypodium Initiative
International Rice Genome Sequencing Project
J Jurka
J Messing
J Schmutz
J Terol
J-H Mun
JA Schlueter
JA Shapiro
JC Venter
JE Frelichowski Jr
JE Stajih
JL Bennetzen
JL Shultz
L Mao
M Chen
M Delseny
M Febrer
M Marra
N Huo
P Smýkal
PB Goldsbrough
PB Goldsbrough
PF Cavagnaro
PS Schnable
Q Yu
R Ming
R Velasco
Raja Ragupathy
Rajkumar Rathinavelu
RE Pruitt
RG Schneeberger
RK Varshney
RL Warren
S Cloutier
S Cloutier
S Fenart
S Huang
S Ide
S McGinnis
S Ouyang
S Scalabrin
S Tucker
SY Rhee
Sylvie Cloutier
T Mozo
T Thiel
T Wicker
The Gene Ontology Consortium
The International Human Genome Mapping Consortium
The Universal Protein Resource Consortium
VM Gonzalez
WM Nelson
X Cheng
X Huang
Y Han
Y Han
YQ Gu
ZX Xu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Flax (<it>Linum usitatissimum </it>L.) is an important source of oil rich in omega-3 fatty acids, which have proven health benefits and utility as an industrial raw material. Flax seeds also contain lignans which are associated with reducing the risk of certain types of cancer. Its bast fibres have broad industrial applications. However, genomic tools needed for molecular breeding were non existent. Hence a project, Total Utilization Flax GENomics (TUFGEN) was initiated. We report here the first genome-wide physical map of flax and the generation and analysis of BAC-end sequences (BES) from 43,776 clones, providing initial insights into the genome. Results The physical map consists of 416 contigs spanning ~368 Mb, assembled from 32,025 fingerprints, representing roughly 54.5% to 99.4% of the estimated haploid genome (370-675 Mb). The N50 size of the contigs was estimated to be ~1,494 kb. The longest contig was ~5,562 kb comprising 437 clones. There were 96 contigs containing more than 100 clones. Approximately 54.6 Mb representing 8-14.8% of the genome was obtained from 80,337 BES. Annotation revealed that a large part of the genome consists of ribosomal DNA (~13.8%), followed by known transposable elements at 6.1%. Furthermore, ~7.4% of sequence was identified to harbour novel repeat elements. Homology searches against flax-ESTs and NCBI-ESTs suggested that ~5.6% of the transcriptome is unique to flax. A total of 4064 putative genomic SSRs were identified and are being developed as novel markers for their use in molecular breeding. Conclusion The first genome-wide physical map of flax constructed with BAC clones provides a framework for accessing target loci with economic importance for marker development and positional cloning. Analysis of the BES has provided insights into the uniqueness of the flax genome. Compared to other plant genomes, the proportion of rDNA was found to be very high whereas the proportion of known transposable elements was low. The SSRs identified from BES will be valuable in saturating existing linkage maps and for anchoring physical and genetic maps. The physical map and paired-end reads from BAC clones will also serve as scaffolds to build and validate the whole genome shotgun assembly.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central