<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1759-8753-1-12</ui><ji>1759-8753</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>A subtelomeric non-LTR retrotransposon <it>Hebe </it>in the bdelloid rotifer <it>Adineta vaga </it>is subject to inactivation by deletions but not 5' truncations</p>
</title>
<aug>
<au id="A1"><snm>Gladyshev</snm><mi>A</mi><fnm>Eugene</fnm><insr iid="I1"/><insr iid="I2"/><email>gladysh@mcb.harvard.edu</email></au>
<au ca="yes" id="A2"><snm>Arkhipova</snm><mi>R</mi><fnm>Irina</fnm><insr iid="I1"/><email>iarkhipova@mbl.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA</p></ins>
<ins id="I2"><p>Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA</p></ins>
</insg>
<source>Mobile DNA</source>
<issn>1759-8753</issn>
<pubdate>2010</pubdate>
<volume>1</volume>
<issue>1</issue>
<fpage>12</fpage>
<url>http://www.mobilednajournal.com/content/1/1/12</url>
<xrefbib><pubidlist><pubid idtype="pmpid">20359339</pubid><pubid idtype="doi">10.1186/1759-8753-1-12</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>1</day><month>7</month><year>2009</year></date></rec><acc><date><day>1</day><month>4</month><year>2010</year></date></acc><pub><date><day>1</day><month>4</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Gladyshev and Arkhipova; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Rotifers of the class Bdelloidea are microscopic freshwater invertebrates best known for: their capacity for anhydrobiosis; the lack of males and meiosis; and for the ability to capture genes from other non-metazoan species. Although genetic exchange between these animals might take place by non-canonical means, the overall lack of meiosis and syngamy should greatly impair the ability of transposable elements (TEs) to spread in bdelloid populations. Previous studies demonstrated that bdelloid chromosome ends, in contrast to gene-rich regions, harbour various kinds of TEs, including specialized telomere-associated retroelements, as well as DNA TEs and retrovirus-like retrotransposons which are prone to horizontal transmission. Vertically-transmitted retrotransposons have not previously been reported in bdelloids and their identification and studies of the patterns of their distribution and evolution could help in the understanding of the high degree of TE compartmentalization within bdelloid genomes.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>We identified and characterized a non-long terminal repeat (LTR) retrotransposon residing primarily in subtelomeric regions of the genome in the bdelloid rotifer <it>Adineta vaga</it>. Contrary to the currently prevailing views on the mode of proliferation of non-LTR retrotransposons, which results in frequent formation of 5'-truncated ('dead-on-arrival') copies due to the premature disengagement of the element-encoded reverse transcriptase from its template, this non-LTR element, <it>Hebe</it>, is represented only by non-5'-truncated copies. Most of these copies, however, were subject to internal deletions associated with microhomologies, a hallmark of non-homologous end-joining events.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The non-LTR retrotransposon <it>Hebe </it>from the bdelloid rotifer <it>A. vaga </it>was found to undergo frequent microhomology-associated deletions, rather than 5'-terminal truncations characteristic of this class of retrotransposons, and to exhibit preference for telomeric localization. These findings represent the first example of a vertically transmitted putatively deleterious TE in bdelloids, and may indicate the involvement of microhomology-mediated non-homologous end-joining in desiccation-induced double-strand break repair at the genome periphery.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Mobile genetic elements are divided into two types according to their mode of transposition: retrotransposons, which require an RNA intermediate to synthesize a new copy with the aid of the element-encoded reverse transcriptase (RT), and DNA transposons, which do not require an RNA intermediate for transposition. Retrotransposons, in turn, are divided into two large classes according to the presence, or lack, of long terminal repeats (LTRs): LTR retrotransposons are framed by LTRs, while non-LTR retrotransposons are not (reviewed in <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
</abbrgrp>). When cDNA synthesis is primed extrachromosomally, template jumps during reverse transcription lead to the formation of LTRs. In contrast, if cDNA synthesis is primed directly at the insertion site by the 3'OH at the nick in chromosomal DNA (target-primed reverse transcription, or TPRT), no LTRs are formed. The nick is introduced by the non-LTR retrotransposon-encoded endonuclease (EN), which may or may not exhibit insertion preferences. RT then uses the endonuclease-generated 3' hydroxyl to prime cDNA synthesis and is believed to be highly prone to premature termination of reverse transcription, which results in formation of numerous 5' truncated copies of non-LTR retrotransposons (often called 'dead-on-arrival') <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Typically, while the overall copy number of non-LTR retrotransposons in eukaryotic genomes tends to be high rather than low, there usually exist relatively few master copies which have the capacity to give rise to the new copies <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B8">8</abbr>
</abbrgrp>. These master copies, however, need to persist in their corresponding host genomes for extended evolutionary times, as horizontal transfer of non-LTR elements is believed to be exceptionally rare <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>.</p>
<p>Bdelloid rotifers are small freshwater invertebrates with the ability to reproduce entirely asexually and to undergo cycles of desiccation and rehydration at any stage of their life cycle. These features may be related to their peculiar genome structure: bdelloids are degenerate tetraploids, with chromosomes present in quartets, each comprising two co-linear pairs, with only a minority of genes common to both pairs, in the same order and orientation <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>. Gene copies from different co-linear pairs exhibit very high levels of divergence, which was initially interpreted as inter-allelic divergence accumulated following an ancient loss of sex <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp> but which, in fact, reflects the divergence between homeologs. Variable and much smaller levels of divergence within a co-linear pair (0%-6%) presumably reflect the occasional operation of homogenizing processes such as gene conversion and mitotic crossing-over. The extraordinary resistance of bdelloid rotifers to ionizing radiation may have evolved as an adaptation to frequent desiccation/rehydration cycles to protect the genomes from DNA damage <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>.</p>
<p>Retrotransposons in bdelloid rotifers, including non-LTR retrotransposons, have remained elusive for some time. Initial screens employing degenerate polymerase chain reaction (PCR) primers targeted to multicopy LINE-like and <it>gypsy</it>-like elements turned up negative, despite yielding positive results in 39 diverse species from 23 animal phyla <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. DNA transposons, however, were easily detectable even in early PCR screens, and, like in other species, exhibited patchy distribution, in agreement with their ability to transfer laterally and to evolve <it>via </it>multiple rounds of invasion, amplification, decay and horizontal escape <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
</abbrgrp>. The presence of vertically-transmitted non-LTR elements, however, might pose a problem in asexual species, which could eventually be overcome by the load of deleterious mutations, lacking the capacity to get rid of harmful transposable element (TE) insertions <it>via </it>meiotic recombination <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>.</p>
<p>Analysis of about 1.5 Mb of gene-rich DNA from two bdelloid species, <it>Adineta vaga </it>and <it>Philodina roseola </it>
<abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp> (JL Mark Welch, personal communication) also failed to reveal the presence of mobile elements, either intact or decayed. Several 40 kb - 70 kb co-linear contigs including <it>hsp82</it>, histone, <it>Hox </it>genes, and their genomic environment, were obtained by sequencing of overlapping fosmid library clones and contained only a single large indel polymorphism which was tentatively ascribed to a foldback-like DNA TE insertion <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>. The overall gene density, however, is quite high, with coding sequences occupying about 50% of genomic DNA (see Figure Three in <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>). Such a conspicuous lack of mobile DNA in gene-rich regions of the genome is quite intriguing, since genomic DNA from gene-rich regions - even of those model eukaryotes which are regarded as relatively TE-poor - contains, on average, 7.7-12.3 retrotransposons and 2.3-3.6 DNA TEs per Mb (<it>Drosophila melanogaster </it>
<abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>), or ~ 7 retrotransposons and ~ 19 DNA TEs per Mb (<it>Caenorhabditis elegans </it>
<abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>).</p>
<p>We were able, however, to find genomic regions which do not appear refractory to TE insertion, but, in contrast, are highly enriched in TEs. Our efforts aimed at cloning and sequencing telomeres, which are a lot less conserved than the core genome and are typically rich in repetitive and mobile DNA, revealed several types of TEs inhabiting bdelloid chromosome end regions. These included <it>Athena </it>retroelements specialized for terminal transposition <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, low copy-number retrovirus-like elements <it>Juno </it>and <it>Vesta </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>, numerous DNA TEs of various kinds <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp> and R9 insertions into 28S ribosomal genes <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. In the present study, we describe a non-LTR retrotransposon belonging to the jockey clade, which may be (or has recently been) active, is located preferentially in subtelomeric regions and is characterized by several unique features such as the lack of 5' terminal truncation and a high frequency of internal deletions associated with microhomologies.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>Structural organization and copy number</p>
</st>
<p>In an extended genome walk directed from the chromosome end inwards, one of the telomeres from the bdelloid rotifer <it>A. vaga </it>(telomere O.4; <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>) was found to carry a very long chain of telomere-associated retrotransposons (EF485020; Figure <figr fid="F1">1a</figr>). In addition to two consecutive <it>Athena </it>retroelements, the most proximal of which was 3'-truncated by fusion with the oppositely-oriented retrovirus-like LTR retrotransposon <it>Juno</it>, the head-to-tail retrotransposon chain continued with a non-LTR retrotransposon encoding two open reading frames (ORFs). The first ORF contained three Zn-knuckle motifs (Figure <figr fid="F1">1a</figr> and <figr fid="F1">1c</figr>; Additional File <supplr sid="S1">1</supplr>) and appeared most similar to the <it>gag</it>-like ORFs from two <it>Drosophila </it>telomere-associated retrotransposons, <it>TART </it>and <it>HeT-A</it>, while the second ORF had homology to the apurinic/apyrimidinic (AP) endonuclease and RT domains from other representatives of the jockey clade, with the highest degree of similarity to ORF2 of the retrotransposon <it>Syrinx </it>from the putatively asexual ostracod crustacean, <it>Darwinula stevensoni </it>
<abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. We named this element <it>Hebe</it>, as it appeared to be relatively young and capable of giving rise to new copies.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Structure and polymorphism of the <it>Adineta vaga Hebe </it>retrotransposon</p></caption><text>
   <p><b>Structure and polymorphism of the <it>Adineta vaga Hebe </it>retrotransposon</b>. (a) Genomic environment of the subtelomeric <it>Hebe </it>copy (K in panel b), including two head-to-tail <it>Athena </it>retroelements and a retrovirus-like long terminal repeat (LTR) retrotransposon <it>Juno</it>. The region containing <it>Athena </it>elements has likely been acquired from another telomere, O3 (reference <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>), possibly by break-induced replication. T, telomeric repeats. Scale bar, 1 kb. (b) Structure of a full-length <it>Hebe </it>consensus copy, and alignment of genomic copies to the consensus. APE and RT denote the AP-like endonuclease and RT domains, respectively; vertical black bars denote the three CCHC zinc knuckle motifs; (A)<sub>n</sub>, poly(A) stretch;//, copies that were truncated by cloning and could not be sequenced to completion. Deletions are indicated by brackets; in-frame stop codons, by vertical lines; probes used for library screening, by double-headed arrows; 72-bp tandem repeat units in the 3' untranslated region are in green. Also shown are the base coordinates for the start and stop of open reading frame (ORF) in the consensus sequence and the position of the unique <it>Spe</it>I site. Scale bar, 0.1 kb. (c) Three zinc knuckle motifs in <it>Hebe </it>ORF1 (panel a) and comparison with selected non-LTR elements from Figure 4b. Highly conserved residues are designated by asterisks.</p>
</text><graphic file="1759-8753-1-12-1"/></fig>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>BoxShade alignment of amino acid sequences from the most conserved regions of open reading frame (ORF) 1 (p.1), endonuclease (p.2), and reverse transcriptase (p.3) domains from selected non-long terminal repeat retrotransposons analysed in Figure <figr fid="F4">4</figr>.</p>
</text>
<file name="1759-8753-1-12-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>In order to determine the exact boundaries of the <it>Hebe </it>element, we designed a 1-kb PCR probe spanning the entire core RT domain (Figure <figr fid="F1">1b</figr>) and used it to screen the <it>A. vaga </it>genomic library to obtain additional <it>Hebe </it>copies. A second 0.6-kb probe spanning the 3' untranslated region (UTR) was also used in subsequent library screens in order to find out whether <it>Hebe </it>might give rise to a large number of short 3' truncated copies which would have been missed by the RT probe. Interestingly, hybridization with the second probe yielded few, if any, additional hybridizing spots, indicating a lack of copies which are 5' truncated in the region between the two probes. The number of fosmids obtained from screening <it>~ </it>4 <it>A. vaga </it>genome equivalents is shown in Table <tblr tid="T1">1</tblr>.</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Characteristics of <it>Hebe </it>copies.</p></caption><tblbdy cols="7">
      <r>
         <c ca="center">
            <p>
               <b>Copy</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>No. of</b>
            </p>
            <p>
               <b>fosmids</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Stop</b>
            </p>
            <p>
               <b>codon</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>5'</b>
            </p>
            <p>
               <b>truncation</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>3'</b>
            </p>
            <p>
               <b>truncation</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Internal</b>
            </p>
            <p>
               <b>deletion</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Genomic environment</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>A</b>
            </p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Tandem repeats</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>B</b>
            </p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Leucine rich repeat protein</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>C</b>
            </p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>?</p>
         </c>
         <c ca="center">
            <p>?</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>No significant hits</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>D</b>
            </p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>DNA TE, piggyBac-like</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>E</b>
            </p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>No significant hits</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>F</b>
            </p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Tandem repeats (=G)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>G</b>
            </p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Tandem repeats (=F)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>H</b>
            </p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>TPR protein</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>I</b>
            </p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>?</p>
         </c>
         <c ca="center">
            <p>?</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Kelch repeat; Fungal ORF</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>J</b>
            </p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>TPR repeat protein</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>K</b>
            </p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="left">
            <p>Telomere; telomeric repeats</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>L</b>
            </p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>LTR TE (=M)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>M</b>
            </p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>LTR TE (=L)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>N</b>
            </p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>?</p>
         </c>
         <c ca="center">
            <p>?</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>No significant hits</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>O</b>
            </p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="left">
            <p>Telomeric repeats</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>See Additional File <supplr sid="S2">2</supplr> for sequences of copies A-O.</p>
      <p><b>?</b>, copies truncated by cloning.</p>
      <p>TE, transposable element; TPR, tetratricopeptide repeat; ORF, open reading frame; LTR, long terminal repeat.</p>
   </tblfn></tbl>
<p>All hybridizing fosmids were first sequenced with the primer located at the C-terminus of ORF2 and directed outwards, in order to find out how many independent insertions (flanked by differing genomic sequences) can be identified on these fosmids. The sequences fall into 13 groups defined by the adjacent flanking regions, which correspond to 13 independent insertion events. One group (L/M) consists of two subgroups which share the same deletions and flanking sequences, but differ by six point mutations, indicating that one was recently copied from the other (for example, in the course of segmental duplication, break-induced replication or gene conversion between two members of a co-linear pair). Another group (F/G) shares the 5' flank and 100% identity in sequence, but differs by a deletion involving the 3' end of copy F. Eleven of the 15 sequenced copies do not exhibit 3' truncation and have a characteristic poly(A) tail, which varies in length between 5 and 25 nucleotides and is located 11 bp downstream of the AATAAA signal. A peculiar feature of the 3' UTR is the presence of a 72-bp tandem repeat, the copy number of which varies from 1 to 4 between different copies, yielding variation in the 3' UTR length between 0.8 and 1 kb (Figure <figr fid="F1">1b</figr>).</p>
<p>In order to obtain an independent estimate of the copy number, we performed a Southern analysis of <it>A. vaga </it>genomic DNA digested with restriction endonucleases <it>Spe</it>I and <it>Sac</it>II. The latter does not have a recognition site in any of the sequenced <it>Hebe </it>copies, while the former cuts only once (Figure <figr fid="F1">1b</figr>). We used two enzymes to achieve a better resolution of the individual bands on the gel, by digesting away larger amounts of flanking sequences. The use of the 1-kb probe spanning the RT domain yielded a set of bands corresponding to each genomic <it>Hebe </it>copy plus variable amounts of adjacent flanking sequences (Figure <figr fid="F2">2</figr>). The results are in excellent agreement with the estimates obtained from genomic library screening: there are 11 hybridizing bands on the gel, four of which are of double intensity, therefore yielding a total of 15 different copies.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Southern blot analysis of <it>Hebe </it>genomic copies</p></caption><text>
   <p><b>Southern blot analysis of <it>Hebe </it>genomic copies</b>. Genomic DNA from <it>Adineta vaga </it>was digested with <it>Spe</it>I/<it>Sac</it>II (centre lane) and hybridized with the RT probe (right lane). Marker sizes in kbp are indicated on the left.</p>
</text><graphic file="1759-8753-1-12-2"/></fig>
</sec>
<sec>
<st>
<p>Preference for subtelomeric regions and lack of 5' truncations</p>
</st>
<p>The number of fosmids in each group, corresponding to a single genomic copy, turned out to be lower than that expected on the basis of genome coverage (typically between 1 and 4, with the exception of groups F/G) (Table <tblr tid="T1">1</tblr>). In comparison, for the same membranes, the corresponding number of fosmids carrying the single-copy <it>hsp82 </it>gene was between four and six for each haplotype <abbrgrp>
<abbr bid="B10">10</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp> and the number of fosmids carrying the histone cluster was similarly high (five to seven for each haplotype) <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>. Such under-representation is highly indicative of subterminal localization of the <it>Hebe</it>-containing fosmids, which would be present in the genomic library in lower numbers as the size-selection step in the library construction protocol puts the terminal regions at a disadvantage (see <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>).</p>
<p>Another line of indirect evidence pointing at localization in subterminal regions is the nature of the surrounding flanking sequences: fosmid end-sequences and the immediately adjacent genomic flanking regions are characterized by features previously found in other bdelloid telomeric fosmids, such as tandem repeats, other TEs, ORFs coding for proteins of repetitive nature or of foreign origin and short stretches of telomeric repeats (Table <tblr tid="T1">1</tblr>). Finally, telomeric retrotransposons often possess 3' UTRs which are prone to formation of tandem repeats <abbrgrp>
<abbr bid="B25">25</abbr>
<abbr bid="B26">26</abbr>
</abbrgrp> and this is also the case for the <it>Hebe </it>element (Figure <figr fid="F1">1b</figr>; Additional File <supplr sid="S2">2</supplr>).</p>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>Nucleotide sequences of <it>Hebe </it>elements obtained in this study.</p>
</text>
<file name="1759-8753-1-12-S2.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>We also sought to confirm that the 5' and 3' ends of <it>Hebe </it>are equally represented in the genomic library, as was indicated by our initial screening. To this end, we probed two additional membranes with the 5' and 3' <it>Hebe </it>probes of approximately equal length (Figure <figr fid="F1">1b</figr>) and counted the number of hybridizing spots: a total of 101 spots were shared between the two probes; the 5' probe revealed 15 additional spots not detected by the 3' probe; and the 3' probe revealed 26 additional spots not detected by the 5' probe. Thus, there is no significant excess of <it>Hebe </it>copies containing only the 3' end and additional spots in both directions can be explained either by deletions involving one of the termini, as seen in Figure <figr fid="F1">1b</figr>, or by the presence of incomplete copies truncated by cloning.</p>
</sec>
<sec>
<st>
<p>Divergence between copies</p>
</st>
<p>In order to evaluate the intactness of <it>Hebe </it>copies and the degree of divergence between them, we sequenced these copies by primer walking. The results are shown in Figure <figr fid="F1">1b</figr> (see also Additional File <supplr sid="S2">2</supplr>). <it>Hebe </it>exhibits a number of peculiar features which are not in agreement with the currently prevailing views on proliferation of non-LTR retrotransposons. First, we could not find any copies exhibiting 5'-terminal truncation, which normally results in formation of a large number of inactive copies and is believed to occur due to the premature dissociation of RT from its template. Second, it is evident that inactivation of individual copies occurred mostly <it>via </it>deletions. Nine copies (D, F/G, J, H, L/M, N, O) carried internal deletions 12-200 bp in length affecting the integrity of their ORFs, and comparison of the sequences at deletion boundaries (Figure <figr fid="F3">3c</figr>) reveals that most of them contain characteristic microhomologies (5-14 bp), which are typically regarded as a hallmark of non-homologous end-joining events resulting in imprecise repair of double strand DNA breaks (DSBs) (reviewed in <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>). Four copies (F, H, L/M) exhibit 3' terminal truncation, which could have also occurred by deletion, although in this case it is not possible to compare the sequence with its original non-deleted version to reveal the presence of microhomologies. Copies D, F and G carry in-frame stop codons. Copy K contains a 2.3-kb insertion of unknown nature 26 bp upstream from the poly(A) tract. Overall, three copies (A, B and E) may be considered intact, because they carry no obvious defects in their ORFs and possess intact 5' and 3' termini. These copies are flanked by 7-12 bp target site duplications (Figure <figr fid="F3">3a</figr> and <figr fid="F3">3b</figr>) and differ from each other by 29-33 nucleotide substitutions.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Sequences of <it>Hebe </it>5' and 3' termini and internally deleted regions</p></caption><text>
   <p><b>Sequences of <it>Hebe </it>5' and 3' termini and internally deleted regions</b>. (a) 5'-terminal regions from 12 sequenced copies; (b) 3'-terminal regions from 11 sequenced copies, designated as in Figure 1b. Copies represented in both a and b sets, and the corresponding target site duplications, are underlined. The CATT initiator sequence and the polyadenylation signal are italicized. (c) Comparison between the <it>Hebe </it>consensus sequence (Cons) and the corresponding deleted regions from copies with internal deletions shown in Figure 1b. Copies G, F, L and M share the same deletion boundaries. Four deletions in copy O (O.1, O.2, O.3, O.4, from left to right) are shown. Microhomologies are underlined.</p>
</text><graphic file="1759-8753-1-12-3"/></fig>
<p>Notably, the <it>Hebe </it>element begins with the sequence CATT, which is the canonical initiator motif in many eukaryotes and is characteristic of internal RNA pol II promoters found in <it>Drosophila </it>non-LTR retrotransposons which ensure that a full-length copy does not lose its promoter after retrotransposition <abbrgrp>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
<abbr bid="B30">30</abbr>
</abbrgrp>. The 5' UTR is rather short, being only 220 bp in length. There are two more copies (C and I) which we could not sequence in its entirety because fosmids containing these copies were truncated by cloning, but they appeared to be intact in their sequenced part. If these copies are given the benefit of the doubt, this may bring up to five the number of potentially intact <it>Hebe </it>copies in the genome.</p>
<p>The genealogy of <it>Hebe </it>genomic copies is depicted in Figure <figr fid="F4">4a</figr>. Overall, this pattern is characteristic for non-LTR elements, with inactive copies yielding long terminal branches, as they accumulate numerous point mutations in addition to deletions and the potentially active elements yielding much shorter branches, which are indicative of relatively recent activity. Pairwise all-by-all comparison of ORF1 and ORF2 coding sequences reveals an overall excess of synonymous substitutions over non-synonymous ones, which gradually fades away as copies become more decayed (Additional File <supplr sid="S3">3</supplr>). Again, this is in agreement with relatively recent activity of the element, although no two independent insertions were found which differed by less than 10 nucleotide substitutions (copies A+J and C+E, which potentially represent the most recent retrotransposition events; F/G and L/M are not independent insertions).</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Genealogy of <it>Hebe </it>genomic copies and phylogenetic placement of <it>Hebe </it>open reading frames (ORFs)</p></caption><text>
   <p><b>Genealogy of <it>Hebe </it>genomic copies and phylogenetic placement of <it>Hebe </it>open reading frames (ORFs)</b>. (a) Neighbour-joining phylogram of 15 <it>Hebe </it>genomic copies (Table 1), designated as in Figure 1b, plus the majority-rule consensus sequence (Cons). Copies with deletions are indicated by &#916;. Scale bar, nucleotide substitutions per site. (b, c) Neighbour-joining phylograms of (b) ORF2 from representatives of major non-long terminal repeat (non-LTR) retrotransposon clades, combining EN and RT domain sequences about 900 amino acids in length and (c) ORF1 coding for <it>gag</it>-like proteins from diverse non-LTR retrotransposons, about 500 aa in length. Clades are designated with square brackets. Scale bar, amino acid substitutions per site. Bootstrap support values from 1000 replications are indicated at the nodes.</p>
</text><graphic file="1759-8753-1-12-4"/></fig>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>Analysis of non-synonymous to synonymous substitution ratios in open reading frame (ORF) 1 and ORF2 of <it>Hebe</it>.</p>
</text>
<file name="1759-8753-1-12-S3.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Similarities to other retrotransposons</p>
</st>
<p>Phylogenetic analysis of the 896-aa ORF2 (including EN and RT domains) places <it>Hebe </it>as a basal member of the jockey clade of non-LTR retrotransposons, which also includes <it>TART </it>and <it>HeT-A/TAHRE </it>(Figure <figr fid="F4">4b</figr>), while its closest known RT relative is the <it>Syrinx </it>element from <it>D. stevensoni </it>
<abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. The 485-aa ORF1, which codes for a <it>gag</it>-like protein with three zinc knuckle motifs (CCHC; Figure <figr fid="F1">1b</figr> and <figr fid="F1">1c</figr>), is expected to evolve faster than ORF2 and, indeed, it exhibits much lower levels of sequence identity (&#8804;25%, compared to 34% for RT and 32% for EN) to <it>gag</it>-like proteins from other non-LTR retrotransposons (Figure <figr fid="F4">4c</figr>). It is intriguing that the top BLASTP hits included the corresponding <it>gag</it>-like ORFs of <it>Drosophila </it>telomere-associated retrotransposons, some of which have evolved special properties targeting them to telomeric heterochromatin <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>. However, telomeric targeting is known to have evolved independently in members of other non-LTR clades such as R1 <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<p>In this study, we report, for the first time, a non-LTR retrotransposon in the genome of a bdelloid rotifer <it>A. vaga</it>, which exhibits characteristics of an active (or recently active) vertically-transmitted retrotransposon without apparent preference for a specific target sequence and may contribute to understanding the reasons behind the conspicuous lack of TE insertions in the gene-rich regions of bdelloid genomes <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>. It appears that, despite relatively recent activity, the element has not reached high copy numbers and most of its copies are riddled by deletions. In addition, the majority of insertions appear to be concentrated in subtelomeric regions. We believe that we were able to clone and sequence most, if not all, of the genomic copies. Even though two sequence variants (C and N) were found in the library only once, and it is formally possible that a few telomere-proximal copies were not represented in the library, the Southern blot analysis is in good agreement with our original estimates from library screening and indicates that it was exhaustive.</p>
<p>Concentration near telomeres may have two possible explanations, which are not necessarily mutually exclusive: either the element preferentially inserts into subtelomeric regions (for example, after having developed an affinity to certain epigenetic marks in subterminal chromatin) or it inserts randomly throughout the genome, but insertions in gene-rich regions are eliminated by selection against deleterious effects of such insertions on nearby genes and/or against deleterious chromosomal rearrangements caused by ectopic recombination between insertions. A currently, or recently, active non-LTR retrotransposon can be expected to serve as a good model system with which to discriminate between these two possibilities, because any short 5'-terminally truncated insertions would have had a better chance of being found near genes, as they would constitute less efficient targets for ectopic recombination events <abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
</abbrgrp>. However, these predictions could not be fulfilled because, surprisingly, we were unable to find any 5' truncated retrotransposed insertions in an exhaustive screen of the genomic library. If the element's RT is not at all prone to premature termination of cDNA synthesis, its chances of survival may increase if there is any insertional specificity disfavouring insertion into gene-rich regions by recognizing certain chromatin features. Although the molecular determinants for telomeric targeting by <it>gag</it>-like proteins in <it>Drosophila </it>are not known, the similarity between ORF1s from organisms as distant as fruit flies and rotifers is intriguing. While telomeric targeting may be a possibility, the paucity of TE insertions in gene-rich regions most probably results from synergistic selection against TE-mediated deleterious rearrangements following DSB repair (see <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B35">35</abbr>
</abbrgrp>).</p>
<p>While we have previously observed little or no deletions in other telomere-associated TEs, most <it>Hebe </it>copies contain inactivating deletions, apparently formed <it>via </it>joining of microhomologies in the vicinity of a DSB. Bdelloids are known for their ability to survive multiple rounds of desiccation and rehydration <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp> and for their extraordinary resistance to ionizing radiation <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>, which is accompanied by extensive DNA breakage and rejoining and has likely evolved as an adaptation to the desiccation-prone bdelloid lifestyle. DSB repair in bdelloids most likely occurs by homologous repair, which does not leave lesions in DNA. Indeed, examination of &gt;1 Mb of gene-rich co-linear pairs of bdelloid genomic DNA does not reveal any molecular footprints of non-homologous end joining (NHEJ) repair events. Repair, however, could also occur by error-prone NHEJ (also called microhomology-mediated end joining or MMEJ), which would seal the break after resection using short microhomologous stretches of DNA in the vicinity, resulting in deletion of the intervening DNA sequence (reviewed in <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>). Deletions in <it>Hebe </it>copies were likely formed by this mechanism. It should be noted that similar deletions were previously seen in non-LTR retrotransposons of <it>Giardia</it>: of three retrotransposon families, one was preferentially disrupted by microhomology-mediated deletions <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>; members of this family, <it>GilD</it>, are found in gene-poor genomic regions next to variant-specific surface proteins. We also observed microhomology-mediated deletions in two out of a few dozen PCR-amplified fragments of <it>A. vaga mariner </it>DNA transposons, although in this case their chromosomal location was unknown (<abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp> and I R Arkhipova, unpublished data). Such deletions are not uncommon among TEs (<abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>; reviewed in <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>) and tend to be correlated with heterochromatic environment (see <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp> for discussion).</p>
<p>We hypothesized that, in otherwise asexual bdelloid populations, genetic exchange might take place without conventional meiotic sex, based on the observation that the bdelloid germ line is susceptible to invasion of foreign DNA, accumulated mostly at telomeres <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>. Such penetrability of the germ line could potentially allow any lost TEs to be regained from bdelloid DNA released into the environment, even in the absence of meiosis and syngamy. Alternatively, the presence of non-LTR elements may constitute evidence of a cryptic sexual process occurring in bdelloids. Further investigations into the mechanisms by which bdelloids combat repetitive elements and repair their DNA, as well as comparative analyses of bdelloid whole-genome sequences and identification of additional non-LTR retrotransposon families, may be expected to shed more light on the remarkable compartmentalization of bdelloid TEs.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The non-LTR retrotransposon from the bdelloid rotifer <it>A. vaga</it>, named <it>Hebe</it>, was found to undergo frequent microhomology-associated deletions, rather than 5'-terminal truncations characteristic of this class of retrotransposons. In combination with the tendency for telomeric localization, these findings may indicate the involvement of the MMEJ pathway in the repair of double-strand breaks at the genome periphery and may eventually help to explain the overall under-representation of TEs in the bdelloid core genomic regions and their abundance at telomeres. It remains to be seen whether the presence of vertically-transmitted TEs in bdelloids may be indicative of sexual exchange.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Library screening and fosmid analysis</p>
</st>
<p>The <it>A. vaga </it>genomic fosmid library <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp> was screened with the <sup>32</sup>P-labelled 1-kb RT domain fragment amplified by PCR using a pair of primers F1 (CCAGTGGTTTGATGATGGTGT) and R1 (CTGCTGATACGTTGCCACTTC), and the 0.6-kb 3'UTR fragment amplified with primers 3'UTR-F1 (ATGTCACATACAATCCAGCTTC) and 3'UTR-R1 (GTAACATAAAGTCAACGGAAGG). Selected fosmids were end-sequenced with standard T7 and ccFos primers, split into different groups with the primer seq1 (CAACAAACAACGACATTACACTG) directed into the flanking host sequences, and several fosmids from each group were sequenced by genome walking with custom primers (F1; R1; seq3, AGCCTTTTTCCACATTGCTGG; seq 4, AAAGTTGGACTATCATCTTCG; seq5, GTTGGTGCAAGTCATGGAAAT; seq6, TCGATCTTCTTGATCTTCTGATG; seq7, TGTCATGGATATTGACTTCAGCA). The 5' probe for additional membrane screening to compare representation of 5' and 3' ends was obtained using primers 5p (GATCAGTCGCATTCGTCCAA) and seq7, and the 3' probe - using primers seq1 and 3'UTR-R1. Entire fosmid sequences were obtained by shotgun subcloning into pBluescript II SK- and sequenced on the ABI3730XL at the W M Keck Ecological and Evolutionary Genetics Facility at the Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory. Sequences were deposited in GenBank under accession numbers <ext-link ext-link-id="EF485020" ext-link-type="gen">EF485020</ext-link> and <ext-link ext-link-id="GU176366+GU176367+GU176368+GU176369+GU176370+GU176371+GU176372+GU176373+GU176374+GU176375+GU176376+GU176377+GU176378+GU176379" ext-link-type="gen">GU176366-GU176379</ext-link>.</p>
</sec>
<sec>
<st>
<p>Southern blotting</p>
</st>
<p>
<it>A. vaga </it>genomic DNA was sequentially digested with restriction endonucleases <it>Spe</it>I and <it>Sac</it>II, fractionated on 0.7% agarose gel, and transferred to Hybond+ membrane (Amersham). The RT probe (about 1 kbp) was amplified from <it>A. vaga </it>genomic DNA using the primer pair F1/R1 described above, gel-purified using the Qiagen Gel Extraction Kit and labelled with <sup>32</sup>P-dCTP using random primers (Invitrogen, CA, USA). The probe was hybridized at high stringency (2&#215;SSC, 65&#176;C overnight).</p>
</sec>
<sec>
<st>
<p>Phylogenetic analysis</p>
</st>
<p>Alignment (ClustalW) and phylogenetic analysis was done with MEGA4 <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>, using either nucleotide sequences (maximum composite likelihood; pairwise deletion; 1000 bootstrap replications) or amino acid sequences (neighbour-joining or minimum evolution; Poisson correction or <it>P</it>-distance; pairwise deletion; 1000 bootstrap replications). Amino acid sequence alignments in BoxShade format are presented in Additional File <supplr sid="S1">1</supplr>. Pairwise Ka/Ks ratios were calculated by the program DIVERGE from the Wisconsin package (Accelrys Inc., San Diego, CA, USA).</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>DSB: double-strand DNA break; LTR: long terminal repeats; MMEJ: microhomology-mediated end joining; NHEJ: non-homologous end joining; ORF: open reading frame; PCR: polymerase chain reaction; RT: reverse transcriptase; EN: endonuclease; TE: transposable elements; TPRT: target-primedreverse transcription; UTR: untranslated region.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>EG and IA designed and performed experiments and analysed the data. IA wrote the manuscript. Both authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We thank J Hur for providing the <it>A. vaga </it>genomic fosmid library and JL Mark Welch for communicating unpublished data on Hox fosmids. This research was supported by the NSF grant MCB-0821956 to I.A.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Origins and evolution of retrotransposons</p></title><aug><au><snm>Eickbush</snm><fnm>TH</fnm></au><au><snm>Malik</snm><fnm>H</fnm></au></aug><source>Mobile DNA II</source><publisher>Washington DC: ASM Press</publisher><editor>Craig NL, Craigie R, Gellert M, Lambowitz AM</editor><pubdate>2002</pubdate></bibl><bibl id="B2"><title><p>The diversity of retrotransposons and the properties of their reverse transcriptases</p></title><aug><au><snm>Eickbush</snm><fnm>TH</fnm></au><au><snm>Jamburuthugoda</snm><fnm>VK</fnm></au></aug><source>Virus Res</source><pubdate>2008</pubdate><volume>134</volume><fpage>221</fpage><lpage>34</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.virusres.2007.12.010</pubid><pubid idtype="pmcid">2695964</pubid><pubid idtype="pmpid">18261821</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The take and give between retrotransposable elements and their hosts</p></title><aug><au><snm>Beauregard</snm><fnm>A</fnm></au><au><snm>Curcio</snm><fnm>MJ</fnm></au><au><snm>Belfort</snm><fnm>M</fnm></au></aug><source>Annu Rev Genet</source><pubdate>2008</pubdate><volume>42</volume><fpage>587</fpage><lpage>617</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.genet.42.110807.091549</pubid><pubid idtype="pmcid">2665727</pubid><pubid idtype="pmpid">18680436</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>The I factor and I-R hybrid dysgenesis in <it>Drosophila melanogaster </it></p></title><aug><au><snm>Finnegan</snm><fnm>DJ</fnm></au></aug><source>Mobile DNA</source><publisher>Washington DC: American Society for Microbiology</publisher><editor>Berg D, Howe M</editor><pubdate>1989</pubdate></bibl><bibl id="B5"><title><p>High intrinsic rate of DNA loss in Drosophila</p></title><aug><au><snm>Petrov</snm><fnm>DA</fnm></au><au><snm>Lozovskaya</snm><fnm>ER</fnm></au><au><snm>Hartl</snm><fnm>DL</fnm></au></aug><source>Nature</source><pubdate>1996</pubdate><volume>384</volume><fpage>346</fpage><lpage>349</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/384346a0</pubid><pubid idtype="pmpid" link="fulltext">8934517</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Dynamics of R1 and R2 elements in the rDNA locus of <it>Drosophila simulans</it></p></title><aug><au><snm>P&#233;rez-Gonz&#225;lez</snm><fnm>CE</fnm></au><au><snm>Eickbush</snm><fnm>TH</fnm></au></aug><source>Genetics</source><pubdate>2001</pubdate><volume>158</volume><fpage>1557</fpage><lpage>1567</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1461747</pubid><pubid idtype="pmpid">11514447</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Hot L1s account for the bulk of retrotransposition in the human population</p></title><aug><au><snm>Brouha</snm><fnm>B</fnm></au><au><snm>Schustak</snm><fnm>J</fnm></au><au><snm>Badge</snm><fnm>RM</fnm></au><au><snm>Lutz-Prigge</snm><fnm>S</fnm></au><au><snm>Farley</snm><fnm>AH</fnm></au><au><snm>Moran</snm><fnm>JV</fnm></au><au><snm>Kazazian</snm><fnm>HH</fnm><suf>Jr</suf></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2003</pubdate><volume>100</volume><fpage>5280</fpage><lpage>5285</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0831042100</pubid><pubid idtype="pmcid">154336</pubid><pubid idtype="pmpid">12682288</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Differences in non-LTR retrotransposons within <it>C. elegans </it>and <it>C. briggsae </it>genomes</p></title><aug><au><snm>Zagrobelny</snm><fnm>M</fnm></au><au><snm>Jeffares</snm><fnm>DC</fnm></au><au><snm>Arctander</snm><fnm>P</fnm></au></aug><source>Gene</source><pubdate>2004</pubdate><volume>330</volume><fpage>61</fpage><lpage>66</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2004.01.003</pubid><pubid idtype="pmpid" link="fulltext">15087124</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Evidence for degenerate tetraploidy in bdelloid rotifers</p></title><aug><au><snm>Mark Welch</snm><fnm>DB</fnm></au><au><snm>Mark Welch</snm><fnm>JL</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>5145</fpage><lpage>5149</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0800972105</pubid><pubid idtype="pmcid">2278229</pubid><pubid idtype="pmpid">18362354</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Degenerate tetraploidy was established before bdelloid rotifer families diverged</p></title><aug><au><snm>Hur</snm><fnm>JH</fnm></au><au><snm>Van Doninck</snm><fnm>K</fnm></au><au><snm>Mandigo</snm><fnm>ML</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2009</pubdate><volume>26</volume><fpage>375</fpage><lpage>383</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msn260</pubid><pubid idtype="pmpid" link="fulltext">18996928</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Evidence for the evolution of bdelloid rotifers without sexual reproduction or genetic exchange</p></title><aug><au><snm>Mark Welch</snm><fnm>D</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>288</volume><fpage>1211</fpage><lpage>1215</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.288.5469.1211</pubid><pubid idtype="pmpid" link="fulltext">10817991</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Extreme resistance of bdelloid rotifers to ionizing radiation</p></title><aug><au><snm>Gladyshev</snm><fnm>E</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>5139</fpage><lpage>5144</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0800966105</pubid><pubid idtype="pmcid">2278216</pubid><pubid idtype="pmpid">18362355</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Transposable elements in sexual and ancient asexual taxa</p></title><aug><au><snm>Arkhipova</snm><fnm>I</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2000</pubdate><volume>97</volume><fpage>14473</fpage><lpage>14477</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.97.26.14473</pubid><pubid idtype="pmcid">18943</pubid><pubid idtype="pmpid">11121049</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Diverse DNA transposons in rotifers of the class Bdelloidea</p></title><aug><au><snm>Arkhipova</snm><fnm>IR</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2005</pubdate><volume>102</volume><fpage>11781</fpage><lpage>11786</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0505333102</pubid><pubid idtype="pmcid">1188004</pubid><pubid idtype="pmpid">16081532</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Deleterious transposable elements and the extinction of asexuals</p></title><aug><au><snm>Arkhipova</snm><fnm>I</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>Bioessays</source><pubdate>2005</pubdate><volume>27</volume><fpage>76</fpage><lpage>85</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bies.20159</pubid><pubid idtype="pmpid" link="fulltext">15612027</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Massive horizontal gene transfer in bdelloid rotifers</p></title><aug><au><snm>Gladyshev</snm><fnm>EA</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>320</volume><fpage>1210</fpage><lpage>1213</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1156407</pubid><pubid idtype="pmpid" link="fulltext">18511688</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>The transposable elements of the <it>Drosophila melanogaster </it>euchromatin: a genomics perspective</p></title><aug><au><snm>Kaminker</snm><fnm>JS</fnm></au><au><snm>Bergman</snm><fnm>CM</fnm></au><au><snm>Kronmiller</snm><fnm>B</fnm></au><au><snm>Carlson</snm><fnm>J</fnm></au><au><snm>Svirskas</snm><fnm>R</fnm></au><au><snm>Patel</snm><fnm>S</fnm></au><au><snm>Frise</snm><fnm>E</fnm></au><au><snm>Wheeler</snm><fnm>DA</fnm></au><au><snm>Lewis</snm><fnm>SE</fnm></au><au><snm>Rubin</snm><fnm>GM</fnm></au><etal/></aug><source>Genome Biol</source><pubdate>2002</pubdate><volume>3</volume><fpage>RESEARCH0084</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2002-3-12-research0084</pubid><pubid idtype="pmcid">151186</pubid><pubid idtype="pmpid">12537573</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Transposons but not retrotransposons are located preferentially in regions of high recombination rate in <it>Caenorhabditis elegans</it></p></title><aug><au><snm>Duret</snm><fnm>L</fnm></au><au><snm>Marais</snm><fnm>G</fnm></au><au><snm>Biemont</snm><fnm>C</fnm></au></aug><source>Genetics</source><pubdate>2000</pubdate><volume>156</volume><fpage>1661</fpage><lpage>1669</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1461346</pubid><pubid idtype="pmpid">11102365</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Telomere-associated endonuclease-deficient <it>Penelope </it>-like retroelements in diverse eukaryotes</p></title><aug><au><snm>Gladyshev</snm><fnm>EA</fnm></au><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>9352</fpage><lpage>9357</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0702741104</pubid><pubid idtype="pmcid">1890498</pubid><pubid idtype="pmpid">17483479</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>A deep-branching clade of retrovirus-like retrotransposons in bdelloid rotifers</p></title><aug><au><snm>Gladyshev</snm><fnm>EA</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Gene</source><pubdate>2007</pubdate><volume>390</volume><fpage>136</fpage><lpage>145</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2006.09.025</pubid><pubid idtype="pmcid">1839950</pubid><pubid idtype="pmpid">17129685</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Rotifer rDNA-specific R9 retrotransposable elements generate an exceptionally long target site duplication upon insertion</p></title><aug><au><snm>Gladyshev</snm><fnm>EA</fnm></au><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Gene</source><pubdate>2009</pubdate><volume>448</volume><fpage>145</fpage><lpage>150</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2009.08.016</pubid><pubid idtype="pmpid" link="fulltext">19744548</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Two families of non-LTR retrotransposons, <it>Syrinx </it>and <it>Daphne</it>, from the Darwinulid ostracod, <it>Darwinula stevensoni</it></p></title><aug><au><snm>Sch&#246;n</snm><fnm>I</fnm></au><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Gene</source><pubdate>2006</pubdate><volume>371</volume><fpage>296</fpage><lpage>307</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2005.12.007</pubid><pubid idtype="pmpid" link="fulltext">16469453</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Duplication and divergence of regions containing <it>hsp82 </it>before the separation of two bdelloid families, Adinetidae and Phlodinidae</p></title><aug><au><snm>Hur</snm><fnm>J</fnm></au></aug><source>PhD Dissertation</source><publisher>Harvard University, Cambridge, MA, USA</publisher><pubdate>2006</pubdate></bibl><bibl id="B24"><title><p>Phylogenomics of unusual histone H2A Variants in Bdelloid rotifers</p></title><aug><au><snm>Van Doninck</snm><fnm>K</fnm></au><au><snm>Mandigo</snm><fnm>ML</fnm></au><au><snm>Hur</snm><fnm>JH</fnm></au><au><snm>Wang</snm><fnm>P</fnm></au><au><snm>Guglielmini</snm><fnm>J</fnm></au><au><snm>Milinkovitch</snm><fnm>MC</fnm></au><au><snm>Lane</snm><fnm>WS</fnm></au><au><snm>Meselson</snm><fnm>M</fnm></au></aug><source>PLoS Genet</source><pubdate>2009</pubdate><volume>5</volume><fpage>e1000401</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1000401</pubid><pubid idtype="pmcid">2642717</pubid><pubid idtype="pmpid">19266019</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Retrotransposons provide an evolutionarily robust non-telomerase mechanism to maintain telomeres</p></title><aug><au><snm>Pardue</snm><fnm>ML</fnm></au><au><snm>DeBaryshe</snm><fnm>PG</fnm></au></aug><source>Annu Rev Genet</source><pubdate>2003</pubdate><volume>37</volume><fpage>485</fpage><lpage>511</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.genet.38.072902.093115</pubid><pubid idtype="pmpid" link="fulltext">14616071</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Three retrotransposon families in the genome of <it>Giardia lamblia:</it> two telomeric, one dead</p></title><aug><au><snm>Arkipova</snm><fnm>IR</fnm></au><au><snm>Morrison</snm><fnm>HG</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2001</pubdate><volume>98</volume><fpage>14497</fpage><lpage>14502</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.231494798</pubid><pubid idtype="pmcid">64710</pubid><pubid idtype="pmpid">11734649</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>MMEJ repair of double-strand breaks (director's cut): deleted sequences and alternative endings</p></title><aug><au><snm>McVey</snm><fnm>M</fnm></au><au><snm>Lee</snm><fnm>SE</fnm></au></aug><source>Trends Genet</source><pubdate>2008</pubdate><volume>24</volume><fpage>529</fpage><lpage>538</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2008.08.007</pubid><pubid idtype="pmpid" link="fulltext">18809224</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p><it>jockey</it>, a mobile Drosophila element similar to mammalian LINEs, is transcribed from the internal promoter by RNA polymerase II</p></title><aug><au><snm>Mizrokhi</snm><fnm>LJ</fnm></au><au><snm>Georgieva</snm><fnm>SG</fnm></au><au><snm>Ilyin</snm><fnm>YV</fnm></au></aug><source>Cell</source><pubdate>1988</pubdate><volume>54</volume><fpage>685</fpage><lpage>691</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(88)80013-8</pubid><pubid idtype="pmpid" link="fulltext">2842063</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>The 5' untranslated region of the I factor, a long interspersed nuclear element-like retrotransposon of <it>Drosophila melanogaster</it>, contains an internal promoter and sequences that regulate expression</p></title><aug><au><snm>McLean</snm><fnm>C</fnm></au><au><snm>Bucheton</snm><fnm>A</fnm></au><au><snm>Finnegan</snm><fnm>DJ</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1993</pubdate><volume>13</volume><fpage>1042</fpage><lpage>1050</lpage><xrefbib><pubidlist><pubid idtype="pmcid">358989</pubid><pubid idtype="pmpid">8380889</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Promoter elements in <it>Drosophila melanogaster</it> revealed by sequence analysis</p></title><aug><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Genetics</source><pubdate>1995</pubdate><volume>139</volume><fpage>1359</fpage><lpage>1369</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1206462</pubid><pubid idtype="pmpid">7768444</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Intracellular targeting of telomeric retrotransposon Gag proteins of distantly related Drosophila species</p></title><aug><au><snm>Casacuberta</snm><fnm>E</fnm></au><au><snm>Mar&#237;n</snm><fnm>FA</fnm></au><au><snm>Pardue</snm><fnm>ML</fnm></au></aug><source>Proc Natl Acad</source><pubdate>2007</pubdate><volume>104</volume><fpage>8391</fpage><lpage>8396</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.0702566104</pubid></xrefbib></bibl><bibl id="B32"><title><p>Evolution of target specificity in R1 clade non-LTR retrotransposons</p></title><aug><au><snm>Kojima</snm><fnm>KK</fnm></au><au><snm>Fujiwara</snm><fnm>H</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2003</pubdate><volume>20</volume><fpage>351</fpage><lpage>361</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msg031</pubid><pubid idtype="pmpid" link="fulltext">12644555</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Selection against deleterious LINE-1-containing loci in the human lineage</p></title><aug><au><snm>Boissinot</snm><fnm>S</fnm></au><au><snm>Entezam</snm><fnm>A</fnm></au><au><snm>Furano</snm><fnm>AV</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2001</pubdate><volume>18</volume><fpage>926</fpage><lpage>935</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11371580</pubid></xrefbib></bibl><bibl id="B34"><title><p>Fitness cost of LINE-1 (L1) activity in humans</p></title><aug><au><snm>Boissinot</snm><fnm>S</fnm></au><au><snm>Davis</snm><fnm>J</fnm></au><au><snm>Entezam</snm><fnm>A</fnm></au><au><snm>Petrov</snm><fnm>D</fnm></au><au><snm>Furano</snm><fnm>AV</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><fpage>9590</fpage><lpage>9594</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0603334103</pubid><pubid idtype="pmcid">1480451</pubid><pubid idtype="pmpid">16766655</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Double-strand breaks associated with repetitive DNA can reshape the genome</p></title><aug><au><snm>Argueso</snm><fnm>JL</fnm></au><au><snm>Westmoreland</snm><fnm>J</fnm></au><au><snm>Mieczkowski</snm><fnm>PA</fnm></au><au><snm>Gawel</snm><fnm>M</fnm></au><au><snm>Petes</snm><fnm>TD</fnm></au><au><snm>Resnick</snm><fnm>MA</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><fpage>11845</fpage><lpage>11850</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0804529105</pubid><pubid idtype="pmcid">2515620,2515620</pubid><pubid idtype="pmpid">18701715</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Anhydrobiotic capabilities of bdelloid rotifers</p></title><aug><au><snm>Ricci</snm><fnm>C</fnm></au></aug><source>Hydrobiologia</source><pubdate>1998</pubdate><volume>387/388</volume><fpage>321</fpage><lpage>326</lpage><xrefbib><pubid idtype="doi">10.1023/A:1017086425934</pubid></xrefbib></bibl><bibl id="B37"><title><p>Mechanisms and rates of genome expansion and contraction in flowering plants</p></title><aug><au><snm>Bennetzen</snm><fnm>JL</fnm></au></aug><source>Genetica</source><pubdate>2002</pubdate><volume>115</volume><fpage>29</fpage><lpage>36</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1023/A:1016015913350</pubid><pubid idtype="pmpid" link="fulltext">12188046</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0</p></title><aug><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Dudley</snm><fnm>J</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>1596</fpage><lpage>1599</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm092</pubid><pubid idtype="pmpid" link="fulltext">17488738</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>
