Posted: Feb 19, 2013 2:16 pm
by Calilasseia
Meanwhile courtesy of Abbie Smith's ERV blog, we have this development:

Abbie Smith wrote:Paleovirology seems to work the opposite of the way the Creationist want. The more information we have, the further back the timeline shifts, not vice versa.

For example, we used to think HIV-1 started in humans ~1930, but after we found more ‘old’ HIV sequences, the clock got pushed back to 1902-1921.

We used to think Simian Immunodeficiency Virus emerged 1266-1685, but after we found more ‘old’ SIV sequences, the clock got pushed back 76,794 years.

After a recent finding in fish, the evolutionary history of retroviruses got pushed back… 400 million years.

An Endogenous Foamy-like Viral Element in the Coelacanth Genome

They found an endogenous retrovirus in a fishy genome. But not just any ol ERV… a foamy virus ERV. Foamy viruses are complex, and only found in land mammals, thus assumed to be relatively recent inventions of nature.

But here is a foamy virus, plopped in the middle of a fishy genome, itself about 19 million years old.

When they compared the fishy foamy ERV to mammalian foamy viruses, and lots of different kinds of exogenous and endogenous retroviruses, they found:

… the most parsimonious explanation of this phylogenetic pattern is that foamy viruses infecting land mammals originated ultimately from a prehistoric virus circulating in lobe-finned fishes.

… The common ancestor of coelacanths and tetrapods must have existed prior to the earliest known coelacanth fossil, which is 407–409 million years old.


This means that complex retroviruses are at least ~400 million years old (how old are the simple ones??? HA!). They followed us out of the water, and onto the land.

Its like every great event in our evolutionary history, we were walking hand-in-hand with our retroviruses… footprints in the sand…


The paper in question is this one:

An Endogenous Foamy-like Viral Element in the Coelacanth Genome by Guan-Zhu Han and Michael Worobey, PLoS Pathogens, 8(6): e1002790 (28th June 2012) [Full paper downloadable via the above link from the ERV blog]

Han & Worobey, 2012 wrote:Abstract

Little is known about the origin and long-term evolutionary mode of retroviruses. Retroviruses can integrate into their hosts' genomes, providing a molecular fossil record for studying their deep history. Here we report the discovery of an endogenous foamy virus-like element, which we designate ‘coelacanth endogenous foamy-like virus’ (CoeEFV), within the genome of the coelacanth (Latimeria chalumnae). Phylogenetic analyses place CoeEFV basal to all known foamy viruses, strongly suggesting an ancient ocean origin of this major retroviral lineage, which had previously been known to infect only land mammals. The discovery of CoeEFV reveals the presence of foamy-like viruses in species outside the Mammalia. We show that foamy-like viruses have likely codiverged with their vertebrate hosts for more than 407 million years and underwent an evolutionary transition from water to land with their vertebrate hosts. These findings suggest an ancient marine origin of retroviruses and have important implications in understanding foamy virus biology.


From the paper in more detail:

Han & Worobey, 2012 wrote:Introduction

Foamy viruses are complex retroviruses thought exclusively to infect mammalian species, including cats, cows, horses, and nonhuman primates [1]. Although human-specific foamy viruses have not been found, humans can be naturally infected by foamy viruses of non-human primate origin [2–4]. Comparing the phylogenies of simian foamy viruses (SFVs) and Old World primates suggests they co-speciated with each other for more than 30 million years [5]. Retroviruses can invade their hosts’ genomes in the form of endogenous retroviral elements (ERVs), providing ‘molecular fossils’ for studying the deep history of retroviruses and the long-term arms races between retroviruses and their hosts [6,7]. Although ERVs are common components of vertebrate genomes (for example, ERVs constitute around 8% of the human genome) [8], germline invasion by foamy virus seems to be very rare [9,10]. To date, endogenous foamy virus-like elements have been discovered only within the genomes of sloths (SloEFV) [9] and the aye-aye (PSFVaye) [10]. The discovery of SloEFV extended the co-evolutionary history between foamy viruses and their mammal hosts at least to the origin of placental mammals [9]. However, the ultimate origin of foamy virus
and other retroviruses remains elusive.

The continual increase in eukaryotic genome-scale sequence data is facilitating the discovery of additional ERVs, providing important insights into the origin and long-term evolution of this important lineage of viruses. In this study, we report the discovery and analysis of an endogenous foamy virus-like element in the genome of the coelacanth (Latimeria chalumnae), which we designate ‘coelacanth endogenous foamy-like virus’ (CoeEFV). The discovery CoeEFV offers unique insights into the origin and evolution of foamy viruses and the retroviruses as a whole.

Results/Discussion

Discovery of foamy virus-like elements within the genome of coelacanth

We screened all available animal whole genome shotgun (WGS) sequences using the tBLASTn algorithm using the protein sequences of representative foamy viruses (Table S1) and identified several foamy virus-like insertions (Table S2 and Fig. S1) within the genome of L. chalumnae, one of only two surviving species of an ancient Devonian lineage of lobe-finned fishes that branched off near the root of all tetrapods [11–15]. There are numerous inframe stop codons and frame-shift mutations present in these CoeEFV elements, suggesting that the CoeEFV elements might be functionally defective. Although more than 230 vertebrate genome scale sequences are currently available, endogenous foamy virus elements have been only found in the aye-aye, sloths, and coelacanth, indicating that germline invasion of foamy virus is a rare process [9,10]. We extracted all contigs containing significant matches and reconstructed a consensus CoeEFV genomic sequence (Fig. S2). The resulting consensus genome shows recognizable and typical foamy virus characteristics (Fig. 1). Its genome has long terminal repeat (LTR) sequences at both 5' and 3' ends and encodes the three main open reading frames (ORFs), gag, pol, and env, in positions similar to those of exogenous foamy viruses (Fig. 1). Two additional putative ORFs were found at positions similar to known foamy virus accessory genes but exhibit no significant similarity (Fig. 1). Notably, we found that the Env protein is conserved among foamy viruses and the coelacanth virus-like element (Fig. 2). A Conserved Domain search [16] identified a conserved foamy virus envelope protein domain (pfam03408) spanning most (887 of 1016 residues) of the CoeEFV Env protein, with an E-value of 1.36 × 10-69 (Fig. 2). The CoeEFV Env protein shares no detectable similarity with other (non-foamy
virus) retroviral Env proteins or with retroviral elements within available genomic sequences of other fishes, such as the zebrafish (Danio rerio). Hence, it provides decisive evidence that CoeEFV originated from a foamy-like virus.

To exclude the possibility that these CoeEFV elements result from laboratory contamination, we obtained a tissue sample of L. chalumnae and succeeded in amplifying CoeEFV insertions within the genome of L. chalumnae via PCR with degenerate primers designed for conserved regions of foamy virus pol and env genes.

To establish the position of CoeEFV on the retrovirus phylogeny, conserved regions of the Pol protein sequences of CoeEFV and various representative endogenous and exogenous retroviruses were used to reconstruct a phylogenetic tree with a Bayesian approach. The phylogenetic tree shows that CoeEFV groups with the foamy viruses with strong support (posterior probability = 1.00; Figs. 3 and S3), confirming that CoeEFV is indeed an endogenous form of a close relative of extant foamy viruses. The discovery of CoeEFV establishes that a distinct lineage of exogenous foamy-like viruses existed (and may still exist) in species outside the Mammalia.

CoeEFV likely invaded the coelacanth genome more than 19 million years ago

Endogenous retroviruses are likely to undergo a gradual accumulation of neutral mutations with host genome replication after endogenization [17]. To date the invasion of CoeEFV into coelacanth genome, we identified two sets of sequences, each of which arose by segmental duplication because each set of sequences shares nearly identical flanking regions (Fig. S4). The two sets contain five and two sequences, respectively. Because the divergence time of the two extant coelacanth species (L. chalumnae and L. menadoensis) is uncertain [11], it is impossible to obtain a reliable neutral evolutionary rate of coelacanth species. Nevertheless, even using the mammalian neutral evolutionary rate [18] as a proxy for the coelacanth rate, the invasion dates were conservatively estimated at 19.3 (95% highest posterior density [HPD]: 15.3–23.6) million years ago for the dataset of five sequences. For the dataset containing two sequences, the divergence between the pair is estimated to be 4.1% and the invasion time is estimated to be approximately 9.3 million years ago. Because the CoeEFV invasion almost certainly occurred earlier than the duplication events within the host genome and because the evolutionary rate of coelacanth species is thought to be lower than other vertebrate species [19,20], the time of CoeEFV integration might be much more than 19 million years. Additional phylogenetic evidence (see below) suggests that its exogenous progenitors likely infected coelacanths for hundreds of millions of years prior to the event that fossilized CoeEFV within its host’s genome.

Foamy-like viruses have likely codiverged with their vertebrate hosts for at least 407 million years

To further evaluate the relationship of foamy viruses, we reconstructed phylogenetic trees based on the conserved region of Pol proteins of foamy viruses and Class III retroviruses, the conserved region of foamy virus Pol and Env protein concatenated alignment, and the conserved region of foamy virus Env protein alignment, respectively. The three phylogenies have the same topology in terms of foamy viruses (Figs. 4, S5, and S6). CoeEFV was positioned basal to the known foamy viruses (Fig. 4), suggesting a remarkably ancient ocean origin of foamy-like viruses: the most parsimonious explanation of this phylogenetic pattern is that foamy viruses infecting land mammals originated ultimately from a prehistoric virus circulating in lobe-finned fishes. The branching order of the three foamy virus phylogenies (Fig. 4, S5, and S6) is completely congruent with the known relationships of their hosts, and each node on the three virus trees is supported by a posterior probability of 1.0 (except the node leading to equine, bovine, and feline foamy viruses on the Env phylogeny, which is supported by a posterior probability of 0.94; Fig. S6). The common ancestor of coelacanths and tetrapods must have existed prior to the earliest known coelacanth fossil, which is 407–409 million years old [21]. The completely congruent virus topology, therefore, strongly indicates that an ancestral foamy-like virus infected this ancient animal. Crucially, the foamy viral branch lengths of the three phylogenies are highly significantly correlated with host divergence times (R2= 0.7115, p= 1.10 × 10-5, Fig. 5; R2 = 0.7024, p= 1.41 × 10-5, Fig. S5; and R2= 0.7429, p= 4.26 ×10-6, Fig. S6), a pattern that can reasonably be expected only if the viruses and hosts codiverged. It is worth emphasizing that we used a consensus sequence to represent CoeEFV in these analyses, so its branch length should correspond roughly to that of the exogenous virus that integrated >19 million years ago, rather than within-host mutations since that time.

There are two alternative explanations for these phylogenetic patterns. One is that the exogenous progenitor of CoeEFV is not truly the sister taxon to the mammalian foamy viruses, but a more distant relative. The robust posterior probability (1.00) placing them in the same clade and the absence of evidence for viruses or virus-like elements from other species disrupting this clade argue against this view, as does the significant similarity between the Env proteins of CoeEFV and the foamy viruses (Fig. 2). Moreover, its branch length would be difficult to explain under such a scenario. If the coelacanth foamy-like virus lineage and the mammalian foamy virus lineage did not share a most recent common ancestor in their ancestral host, why is CoeEFV neither more nor less divergent from the mammalian foamy viruses than one might expect if they did?

The other alternative to the hypothesis that these viruses have co-diverged over more than 407 million years is that they somehow moved, in more recent times, from terrestrial hosts to sarcopterygian hosts that inhabited the deep sea, and that the similarity of the coelacanth virus to the mammalian viruses is due to cross-species (in fact cross-class) transmission, rather than shared history. However, as illustrated by the significant correlation between host divergence times and viral distances (Figs. 5, S5, and S6), the long branches leading to CoeEFV and the clade of mammal foamy viruses suggest the virus had already circulated in vertebrates for an extremely long time before the origin of mammal foamy virus. Given that there is strong evidence that placental mammals were already being infected with foamy viruses by about 100 million years ago [9], the distinctness of the coelacanth virus suggests that it would have to have crossed from some other unidentified host, one whose foamy-like virus was already hundreds of millions of years divergent from the mammalian viruses. This seems highly unlikely. Although crossspecies transmission of SFVs has been observed [2–5,22], foamy viruses seem to mainly follow a pattern of co-diversification with their hosts [5,9]. If one accepts that the endogenous foamy viruses within the genomes of sloths indicate more than 100 million years of host-virus co-divergence, it seems plausible that CoeEFV extends that timeline by an additional 300 million years.

Moreover, the habitat isolation of the coelacanth and terrestrial vertebrates would have provided limited opportunities for direct transfer of foamy viruses to coelacanths. Taken together, these lines of evidence strongly suggest that foamy viruses and their vertebrate hosts have codiverged for more than 407 million years, and that foamy viruses underwent a remarkable evolutionary transition from water to land simultaneously with the conquest of land by their vertebrate hosts.

Our analyses provide compelling evidence for the existence of retroviruses going back at least to the Early Devonian. This is the oldest estimate, to our knowledge, for any group of viruses, significantly older than the previous estimates for hepadnaviruses (19 million years) [23] and large dsDNA viruses of insects (310 million years) [24]. Although highly cytopathic in tissue culture, foamy viruses do not seem to cause any recognizable disease in their natural hosts [1,25,26]. Such long-term virus-host coevolution may help explain the low pathogenicity of foamy viruses. The fact that the Env is well conserved between CoeEFV and foamy viruses is consistent with the fact that these viruses are asymptomatic and mainly co-evolve with their hosts in a relatively conflict-free relationship. It is easy to imagine that previously overlooked examples of such a non-pathogenic virus may yet be found in hosts that fill in some of the gaps in the phylogeny, namely amphibians, reptiles, and birds. It will be of interest to screen these hosts, but also various fish species, for evidence of exogenous and/or endogenous foamy-like viruses.

An ancient marine origin of retroviruses

Dating analyses provide the clearest evidence for when and where retroviruses originated. There is strong evidence that foamy viruses shared a common, exogenous retroviral ancestor more than 400 million years ago (since Env was present in both terrestrial and marine lineages). The discovery of endogenous lentiviruses demonstrates that lentiviruses, a distinct retroviral lineage that includes HIV, are also millions of years old [27–30]. Foamy viruses and lentiviruses share a distantly related ancestor (Figs. 3, S3) and the foamy virus clade alone almost certainly accounts for more than 407 million years of retroviral evolution. It follows that the origin of at least some retroviruses is older than 407 million years ago. As with the coelacanth lineage in the foamy virus clade, we found that retroviruses of fishes occupy the most basal positions within both the Class I and Class III retroviral clades (walleye dermal sarcoma virus (WSDV) and snakehead retrovirus (SnRV), respectively, blue asterisks), (Figs. 3, S3). This pattern provides additional evidence of a marine origin and longterm coevolution of these major retroviral lineages. However, to be specific, the phylogenetic reconstruction in Fig. 3 reflects the history of only of the Pol protein, not a comprehensive history of retroviral genomic evolution. Nevertheless, our analyses support a very ancient marine origin of retroviruses.


To sum up, what the above paper tells us is this.

[1] An endogenous retroviral insertion was found in the genome of the coelacanth. This in itself is not necessarily remarkable, as such an insertion could have occured relatively recently (say, in the past 5 million years).

[2] However, when the insertion in question was examined in detail, it was found to be remarkable indeed. Because the closest genetic match for this retroviral insertion came from a group of retroviruses known as the foamy viruses. The foamy viruses were previously known only from mammals, and furthermore, endogenous retroviral insertions by foamy viruses are extremely rare in mammalian genomes, as far is is currently understood - only two such insertions, one in sloths, and one in aye-ayes, has been detected to date.

[3] Consequently, this leads to the question of how the foamy virus endogenous insertion appeared in the coelacanth genome. Cross infection from a mammal is considered highly unlikely, because the foamy viruses are only known currently from terrestrial mammals, and the coelacanth is a deep water ocean dweller. Which leads to the possibility that the foamy virus ERV in the coelacanth is a left-over from the evolutionary history of the entire virus clade.

[4] If this is the case, then the foamy virus that left the ERV in the coelacanth genome must have had a common ancestor with the mammalian foamy viruses. It's highly unlikely that an entirely different fish virus, with an entirely different ancestry, evolved to become a close phylogenetic match to the mammalian foamy viruses. We would expect to see the real ancestry emerge from detailed genetic analysis of the ERV.

[5] As a corollary of [4] above, the common ancestor of the coelacanth ERV and the foamy viruses that left ERVs in sloths and aye-ayes, must have existed before those two clades diverged. This means that those viruses must have existed in fish ancestors of the two sarcopterygian clades, which dates them to the Devonian era. Consequently, the foamy viruses have to be at least 400 million years old.

[6] When the phylogeny of the coelacanth ERV is examined, the coeclacanth ERV is found to be basal with respect to the mammalian foamy viruses, matching similar results for other retrovirus clades, in which fish retroviruses are basal to those found in later emerging vertebrate clades.

I think Abbie should present a TV show - Walking With Retroviruses. :)