By Kevin E. Noonan --
The
coelacanth, an aquatic animal described as a "living fossil" when discovered in 1938, was thought to have gone
extinct during late Cretaceous period, ~70 million years ago. Only about 300 specimens of the African
coelacanth, Latimeria chalumnae, are known to exist and a second species, Latimeria menadoensis, was discovered
in 1997. These animals are morphologically
primitive, resembling fossils dating ~300 million years ago, which has
suggested that these are "slowly evolving" species (although this is
more a descriptive than an informed characterization). There have been sporadic reports of
coelacanth gene sequencing over the past decade, but last week Nature
published the first report on whole genome sequencing (Ameniya et al., "The
African coelacanth genome provides insighted into tetrapod evolution," Nature
496: 311-16, April 18, 2013).
The article reports that the coelacanth genome
comprises 2.68 gigabases, on 48 chromosomes, with 19,033 protein-coding genes
comprising 21,817 transcripts, 2,894 "short" non-coding RNAs, and 1,214 "long"
non-coding RNAs. 336 of protein-encoding
genes were found to have undergone gene duplication. The authors examined transcripts from 251
genes from coelacanth and compared these genes with Protopterus annectens
(the African lungfish) expressed in brain, gonad, and kidney and with 15
terrestrial vertebrate lineages (dog, human, mouse, elephant, armadillo, Tammar
wallaby, opossum, platypus, chicken, turkey, zebra finch, lizard, Western
clawed frog, Chinese brown frog) and five modern fish species (tilapia,
pufferfish, zebra fish, spotted catshark, little skate, and elephant shark). The results of these analysis, comprising
100,583 concatenated amino acid positions, was consistent with the lungfish
being the earliest relative of modern tetrapods (four-limbed animals),
answering a previously unresolved uncertainty as to the role of the coelacanth
in vertebrate evolution.
Turning to the question of the status of the coelacanth as a "living fossil," the authors confirmed earlier conclusions (based on Hox genes and protocadherins) that this species evolves slowly, the authors assessed the data from the 251 genes used in their phylogenetic analyses. This was done as follows:
Pair-wise distances between taxa were calculated from the branch lengths of the tree using the two-cluster test proposed previously to test for equality of average substitution rates. Then, for each of the following species and species clusters (coelacanth, lungfish, chicken and mammals), we ascertained their respective mean distance to an outgroup consisting of three cartilaginous fishes (elephant shark, little skate and spotted catshark). Finally, we tested whether there was any significant difference in the distance to the outgroup of cartilaginous fish for every pair of species and species clusters, using a Z statistic.
The coelacanth genes showed 0.89 substitutions per site, compared with 1.05 substitutions/site in the lungfish, 1.09 substitutions/site in chicken, and 1.21 substitutions/site in mammals.
The authors also ascertained the "abundance" of transposable elements in the coelacanth genome, because these elements are believed to provide "templates for exaptation," i.e., to facilitate formation of novel protein exons and regulatory elements, as well as providing targets for genomic rearrangement. Transposable element content was "high" (~25%, which the authors consider an underestimate) and also showed a "wide variety of transposable-element superfamilies," with 14 such families being "currently active." The authors acknowledge that these results "contrast[] with the slow protein evolution observed."
"[E]xtensive conservation of synteny" was observed in a comparison of chromosomal breakpoints in coelacanth and tetrapod genomes, and "indicate that large-scale rearrangements have occurred at a generally low rate in the coelacanth lineage." Interchromosomal rearrangements indicated that "karyotypic evolution in the coelacanth lineage has occurred at a relatively slow rate, similar to that of non-mammalian tetrapods" (31 in coelacanth, 20 in salamander and 21 in Xenopus species). Comparison of the two coelacanth species, L. chalumnae (Africa) and L. menadoensis (Indonesia), showed divergence rates similar to those found between humans and chimpanzees, and the authors estimated that these species diverged "slightly more than" 6-8 million years ago, based on the slower substitution rates found in coelacanth species.
The authors then looked at estimates of how vertebrates adapted to the terrestrial environment. They identified 50 genes found in coelacanth but not terrestrial tetrapods, presumably because these genes were not needed when the animal left water for land. These genes included "components of fibroblast growth factor (FGF) signalling, TGF-β and bone morphogenic protein (BMP) signalling, and WNT signalling pathways, as well as many transcription factor genes," and specifically that 4 genes (And1, And2, Fgf24 and Asip2) not present in tetrapod genomes were indeed present in the coelacanth genome. These genes also included 13 genes involved in fin development, 8 genes in otolith and ear development, 7 genes for kidney development, 13 genes for eye development, and 23 genes for brain development. In contrast, there were only slight differences in homeobox genes. There were also changes in gene regulation, wherein 6% of "conserved non-coding elements (CNEs)" ("promoters, enhancers, repressors and insulators") had originated after divergence of the coelacanth from the ancestral lineage. Further analysis showed that tetrapod-specific CNEs were most closely (genetically) linked to genes involved in smell perception (consistent with the recognized expansion of olfactory receptor genes in the evolution of tetrapods from teleost fishes) and "morphogenesis (radial pattern formation, hind limb morphogenesis, kidney morphogenesis) and cell differentiation (endothelial cell fate commitment, epithelial cell fate commitment)" and immunoglobulin VDJ recombination.
A particular set of genes compared in the study are genes for digits, "[a] major innovation in tetrapod evolution," and specifically Hox genes for regulating limb development in ray-finned fish, coelacanths and tetrapods (mouse). Three of the six "cis-regulatory elements" showed sequence conservation limited to tetrapods, with one element being shared by tetrapods and coelacanth but not the ray-finned fish; this latter element could function in transgenic mouse assays to "drive reporter gene expression in a limb-specific pattern." Another particular set of genes compared between tetrapod and coelacanth lineages were genes for the urea cycle, because "[e]xcretion of nitrogen is a major physiological challenge for terrestrial vertebrates." Urea cycle genes involved in producing urea for nitrogenous waste disposal (such as carbamoyl phosphate synthase I) showed strong evidence of selection whereas genes (such as mitochondrial arginase) involved in arginine metabolism but not excretion did not show such selection. The authors conclude that this is evidence of adaptive evolution in the transition from water to land. Hox gene studies also indicated changes from the coelacanth and tetrapod lineages implicated in placental and other reproductive structures not found in animals living in an aquatic environment. Finally, the coelacanth genome lacks genes for immunoglobuins of the M class but did possess two IgW genes previously found only in lungfish and certain cartilaginous fish.
In addition to establishing that lungfish not the coelacanth was the common ancestor to all terrestrial vertebrate, the authors also established that the coelacanth also showed a slow rate of evolutionary change, which they speculate might be due to "a static habitat and lack of predation" and promising that "[f]urther study of these changes between tetrapods and the coelacanth may provide important insights into how a complex organism like a vertebrate can markedly change its way of life."
The authors were affiliated with the following institutions: The Broad Institute at MIT; Benaroya Research Institute and University of Washington; University of Konstanz, Germany; Universite de Montreal; University of Oregon; Institute of Molecular and Cell Biology, Singapore; Universidade Federal do Para, Brazil; Harvard University; University of Utah; Ecole Normale Superieure de Lyon; University of Kentucky; Rhodes University, South Africa; Wellcome Trust Sanger Institute; University of Trieste; University of Liege; Victoria University, Australia; University of Tuscia; University of Hamburg; Polytechnic University of Marche, Italy; University of South Florida; University of Western Cape, South Africa; Woods Hole Oceanographic Institution; Oxford University; Universitat Leipzig; Keio University, Japan; The Graduate University for Advanced Studies, Japan; European Molecular Biology Laboratory; University of Wuerzburg, Germany; University of Illinois at Chicago; National Institute of Genetics, Japan; and Uppsala University.
Image of Latimeria chalumnae (above) by Alberto Fernandez Fernandez, from the Wikipedia Commons under the Creative Commons Attribution 3.0 Unported license.



(Once Again) It Ain't Necessarily So
By Kevin E. Noonan --
A large part of the debate on patenting genetic diagnostic method and isolated genes has revolved around the effects of such patents on what is loosely termed "personalized medicine." Personalized medicine can be summarized as a dream/holy grail/GATTACA future of universal genetic information -- every infant having her genomic DNA sequence determined at birth and contained on a medical identity card, to determine what diseases she may get, what drugs she should not and even who she should or should not marry (or at least mate with). It is the most recent of the promises of the biotechnology or genetic revolution, made possible (in theory) by the fruits of the Human Genome Project; advances in genomic sequencing technology (as described in The $1,000 Genome and elsewhere) have quickened the expectations surrounding the technology. But many have begun to wonder why this genetic fruit is taking so long to ripen (see, for example, "How Bright Promise of Genetic Testing Fell Apart"), and a recent study published in Science Translational Medicine may provide some clues.
The paper notes that there is an estimate of three million sequence variants per person (K. A. Frazer et al., 2009, "Human genetic variation and its contribution to complex traits," Nat. Rev. Genet. 10: 241-51); of these, thousands of genetic variants have been associated with human disease, including Mendelian traits (e.g., sickle cell anemia), SNPs (e.g., Huntington's s disease), or by genome-wide association studies (GWAS) (examples include familial pancreatic cancer and Miller syndrome).
The paper addresses the question of what the benefit of such information would be, defining "benefit" as "receiving information indicating that the risk of disease is increased or decreased to a degree that would alter an individual's lifestyle or medical management." The authors recognize that it is impossible to assess these benefits generally, but monozygotic twins present the possibility to make this assessment: as they have in many other instances, the identity of genetic information make it possible that diseases and other traits with a large genetic component should be experienced in common between twins: "If one twin of the pair has a disease, then the probability of the other twin developing that disease is dependent on the genome whenever that disease has some genetic component." However, the paper also notes that "[t]he general public does not appear to be aware that, despite their very similar height and appearance, monozygotic twins in general do not always develop or die from the same
maladies," citing Wong et al., 2005, "Phenotypic differences in genetically identical organisms: The epigenetic perspective," Hum. Mol. Genet. 14 Spec No 1, R11-R18, and "Identical Twins Not As Identical As Believed," ScienceDaily, reflecting a limitation in the predictive power of genetic disease assessment even between individuals having almost identical genetic complements. (Interestingly, even monozygotic twins are not necessarily identical genetic copies of one another, there being copy number variants between them; Bruder et al., 2008, "Phenotypically concordant and discordant monozygotic twins display different DNA copy-number variation profiles," Am. J. Hum. Genet. 82: 763-71.) The twin populations were selected from "the Swedish Twin Registry, Danish Twin Registry, Finnish Twin Cohort, Norwegian National Birth Registry and the National Academy of Science – National Research World War II Veteran Twins Registry."
The authors also define "heritability" as the difference between the incidence of disease in monozygotic twins compared with dizygotic twins, "reflect[ing] the average genetic contribution to disease" in the population of twins studied. Averages not as informative as distributions in this regard, since a given average incidence of disease could reflect either "a small fraction of twin-pairs with genometypes [(i.e., a complete genomic DNA sequence from an individual)] conferring high genetic risk or a larger fraction of twin-pairs with genometypes conferring a moderate genetic risk." This challenge is illustrated by an example:
Suppose a woman receives a whole-genome test result indicating that she has a 90% lifetime risk (the total risk over her entire life) of developing breast cancer. She may decide to have a prophylactic double mastectomy to prevent this outcome. Similarly, if the test indicated an 80% or even a 50% lifetime risk of developing breast cancer, she may consider mastectomy. On the other hand, if the test indicated only a 14% risk of developing breast cancer, then mastectomies would be considered by very few women, given that most women today do not opt for prophylactic mastectomies even though the lifetime risk of developing breast cancer in the general population is 12%.
The authors adopt the threshold of a "positive predictive value" of 10%, meaning that 10% of patients with a "positive" test result are expected to develop a disease, according to Clarke-Pearson, 2009, "Clinical practice. Screening for ovarian cancer," N. Engl. J. Med. 361: 170-77. However, for several diseases, including chronic fatigue syndrome, gastro-esophageal reflux disorder, coronary heart disease-related death and general dystocia, this threshold is inappropriate due in part to the prevalence of these diseases in the population; for these diseases the threshold is a two-fold higher risk of disease compared with the general population. In addition to these diseases, the study assessed the risk for coronary artery disease, stroke, cancer (bladder, breast, colorectal, lung, leukemia, ovarian, pancreatic, prostate and stomach), thyroid autoimmunity, diabetes (types 1 and 2), Alzheimer's disease, dementia, Parkinson's disease, irritable bowel syndrome, pelvic organ prolapse, and stress urinary incontinence.
The bulk of paper set forth a mathematical treatment of these biostatistics that is beyond the scope of the discussion here; the results, on the other hand, are informative. These include:
• "The fraction of patients that would receive a positive test varies widely from disease to disease."
• "The majority of patients (>50%) who would ultimately develop 13 of the 27 disease categories would not test positive, even in the best-case scenario."
• "There were four disease categories -- thyroid autoimmunity, type I diabetes, Alzheimer's disease, and coronary heart disease-related deaths in males -- for which genetic tests might identify more than 75% of the patients who ultimately develop the disease."
• "The fraction of individuals in the population who would receive positive test results for each disease is small."
• For 22 of the 27 disease categories studied, "a negative test would not indicate a risk that is less than half that in the general population, even in the best-case scenario" (which is probably not sufficient to warrant changes in behavior, lifestyle, or preventative medical practices).
• An exception is Alzheimer's disease, where a negative test "could indicate as little as a ~12% relative risk of disease compared to the entire twin cohort . . . . Knowledge of such a reduced risk might be comforting and relieve anxiety, particularly to those with a family history of Alzheimer's disease."
• ">95% of men and >90% of women could receive at least one positive test result."
• Many of these results represent the best-case scenarios and thus the true benefits of genetic disease testing may be overestimated.
The authors' conclusions are a dose of cold water on the hopes and expectations of many in the field (and even more laypersons outside the field, including policymakers, judges, and even Supreme Court justices):
[O]ur results suggest that genetic testing, at its best, will not be the dominant determinant of patient care and will not be a substitute for preventative medicine strategies incorporating routine checkups and risk management based on the history, physical status and life style of the patient.
The authors best state the significance of an accurate assessment of the likelihood of predictive genetic testing: "Recognition of these merits and limits [for genetic diagnostic testing] can be useful to consumers, researchers, and industry, as they can minimize unrealistic expectations and foster fruitful investigations." These are words policymakers should no doubt heed when considering changes to established patent and other policies based on perhaps unrealistic prospects for a brighter genetic future.
Posted at 11:44 PM in Biotech/Pharma News, Media Commentary | Permalink | Comments (11) | TrackBack (0)