<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1759-8753-1-10</ui><ji>1759-8753</ji><fm>
<dochead>Research</dochead>
<bibl>
<title>
<p>The structure, organization and radiation of <it>Sadhu </it>non-long terminal repeat retroelements in <it>Arabidopsis </it>species</p>
</title>
<aug>
<au id="A1"><snm>Rangwala</snm><mi>H</mi><fnm>Sanjida</fnm><insr iid="I1"/><insr iid="I2"/><email>sanjida@mail.med.upenn.edu</email></au>
<au ca="yes" id="A2"><snm>Richards</snm><mi>J</mi><fnm>Eric</fnm><insr iid="I1"/><insr iid="I3"/><email>ejr77@cornell.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Biology, Washington University in St Louis, St Louis, MO, USA</p></ins>
<ins id="I2"><p>Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, PA, USA</p></ins>
<ins id="I3"><p>Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, USA</p></ins>
</insg>
<source>Mobile DNA</source>
<issn>1759-8753</issn>
<pubdate>2010</pubdate>
<volume>1</volume>
<issue>1</issue>
<fpage>10</fpage>
<url>http://www.mobilednajournal.com/content/1/1/10</url>
<xrefbib><pubidlist><pubid idtype="pmpid">20226007</pubid><pubid idtype="doi">10.1186/1759-8753-1-10</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>23</day><month>7</month><year>2009</year></date></rec><acc><date><day>1</day><month>3</month><year>2010</year></date></acc><pub><date><day>1</day><month>3</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Rangwala and Richards; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>
<it>Sadhu </it>elements are non-autonomous retroposons first recognized in <it>Arabidopsis thaliana</it>. There is a wide degree of divergence among different elements, suggesting that these sequences are ancient in origin. Here we report the results of several lines of investigation into the genomic organization and evolutionary history of this element family.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>We present a classification scheme for <it>Sadhu </it>elements in <it>A. thaliana</it>, describing derivative elements related to the full-length elements we reported previously. We characterized <it>Sadhu5 </it>elements in a set of <it>A. thaliana </it>strains in order to trace the history of radiation in this subfamily. Sequences surrounding the target sites of different <it>Sadhu </it>insertions are consistent with mobilization by LINE retroelements. Finally, we identified <it>Sadhu </it>elements grouping into distinct subfamilies in two related species, <it>Arabidopsis arenosa </it>and <it>Arabidopsis lyrata</it>.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Our analyses suggest that the <it>Sadhu </it>retroelement family has undergone target primed reverse transcription-driven retrotransposition during the divergence of different <it>A. thaliana </it>strains. In addition, <it>Sadhu </it>elements can be found at moderate copy number in three distinct <it>Arabidopsis </it>species, indicating that the evolutionary history of these sequences can be traced back at least several millions of years.</p>
</sec>
</sec>
</abs>
</fm><meta>
<classifications>
<classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
</classifications>
</meta><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>We previously reported a novel family of <it>Arabidopsis </it>retroposons, <it>Sadhu </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. The typical <it>Sadhu </it>element contains a poly(A) tract and is flanked by a direct 7 to 16 base pair (bp) target site duplication (TSD). Similar to small interspersed nuclear elements (SINEs), <it>Sadhu </it>elements are non-protein coding and do not contain long terminal repeats (LTRs); they are therefore expected to be non-autonomous. Although plant SINEs are thought to be mobilized by autonomous long interspersed nuclear elements (LINEs), the source of the transposase for <it>Sadhu </it>is not clear.</p>
<p>Structurally, <it>Sadhu </it>elements resemble SINEs (non-coding, poly(A) tract), but unlike known SINEs, they do not contain sequence similarity to known non-coding RNAs (for example, 5SrRNA, tRNA) <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. Nor do <it>Sadhu </it>elements carry conserved sequences similar to RNA polymerase II TATA boxes or RNA polymerase III promoter motifs (for example, A and B boxes). However, <it>Sadhu </it>elements share a motif near the 5' end (consensus 5' CAATCGTTSC 3') and an approximately 20 bp polypyrimidine region that we hypothesize might attract GAGA-repeat binding transcription factors <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>. <it>Sadhu </it>elements in different <it>Arabidopsis thaliana </it>accessions are expressed, often at high levels. Sense transcription begins at or near the start of the element <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>, consistent with the hypothesis that these elements carry their own internal promoter sequences. Expression can also occur in the antisense direction, presumably from promoters in the flanking DNA sequence. Whether sense or antisense, transcription of <it>Sadhu </it>elements is epigenetically regulated; silenced elements are associated with cytosine methylation and packaged in chromatin containing the dimethylated isoform of lysine 9 of histone H3 <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. There is variation in the modes of silencing of various <it>Sadhu </it>family members highlighted by differential susceptibility to epigenetic modifier mutations and distinct cytosine methylation profiles. These findings suggest that <it>Sadhu </it>elements are silenced independently and individually, not coordinately <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>. For these diverse reasons, <it>Sadhu </it>represents a unique family of non-LTR retroelements.</p>
<p>Related families of the same transposable element class can often be detected by sequence similarity in widely divergent species (see for example, <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B8">8</abbr>
</abbrgrp>). <it>Sadhu </it>elements within <it>A. thaliana </it>are highly divergent in terms of nucleotide sequence, with an average pairwise identity of less than 75%, suggestive of an ancient origin. However, these sequences cannot be identified in any of the current public genome databases outside of the Brassicaceae. There are only 39 <it>Sadhu</it>-related sequences in the <it>A. thaliana </it>genome, showing a dispersed distribution pattern across all five chromosomes. This moderate copy number is typical of <it>Arabidopsis </it>non-LTR retroelements: there are approximately 130 SINE elements in the <it>A. thaliana </it>reference genome and less than 1,500 LINEs <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. The relatively low copy number of non-LTR retroelements in <it>A. thaliana </it>suggests that the transposition rate of these elements is low and/or that new insertions have been effectively removed during the evolutionary history of the species.</p>
<p>Here, we describe a classification scheme for this retroelement family. In addition, we investigate the organization and radiation of <it>Sadhu </it>sequences both in different <it>A. thaliana </it>accessions and related <it>Arabidopsis </it>species.</p>
</sec>
<sec>
<st>
<p>Results and Discussion</p>
</st>
<sec>
<st>
<p>Classification of <it>Sadhu </it>elements</p>
</st>
<p>We designed a classification scheme for <it>Sadhu </it>elements reflecting the phylogenetic grouping of these elements into 10 distinct subfamilies in the <it>A. thaliana </it>genome (Table <tblr tid="T1">1</tblr>, Figure <figr fid="F1">1</figr>, Additional file <supplr sid="S1">1</supplr>) <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. Table <tblr tid="T1">1</tblr> lists the new nomenclature side by side with locus ID numbers (for full-length elements) or locus position (for partial elements). <it>Sadhu </it>elements that extend from the 5' conserved motif 5' CAATCGTTSC 3' to a 3' poly(A) tract approximately 900 bp downstream have been designated 'full length'. Full-length elements on the same branch of the phylogeny share a family name (<it>Sadhu#</it>), but have different element names (<it>SadhuX-#</it>). Elements that closely align (&gt;75% identity) to a unique full-length element are designated 'd' indicating derived; for example, <it>Sadhu5-1d1 </it>is likely to be derived from <it>Sadhu5-1</it>. <it>Sadhu</it>-related sequences that are not similar to a unique full-length element are assigned to the nearest full-length element on a pairwise BLAST search with the designation 'L' for 'like' (for example, <it>Sadhu3L</it>). See Additional file <supplr sid="S1">1</supplr> for divergence matrices among elements within different subfamilies and among subfamilies.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Phylogenetic analysis of <it>Arabidopsis thaliana Sadhu </it>sequences</p></caption><text>
   <p><b>Phylogenetic analysis of <it>Arabidopsis thaliana Sadhu </it>sequences</b>. Maximum parsimony phylogram of full-length <it>Sadhu </it>elements. Taxa are color coded according to ontological grouping. See Table 1 for gene ID numbers corresponding to <it>Sadhu </it>numbers. Bootstrap values (percentages) were calculated from 500 bootstrap replicates.</p>
</text><graphic file="1759-8753-1-10-1" hint_layout="double"/></fig>
<tbl id="T1"><title><p>Table 1</p></title><caption><p><it>Sadhu</it>-related sequences in <it>Arabidopsis thaliana</it>.</p></caption><tblbdy cols="3">
      <r>
         <c ca="left">
            <p>
               <b><it>Sadhu </it>number</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>ID number (or position)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Nucleotide position in <it>A. thaliana </it>genome (TAIR 9.0)</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1-1</p>
         </c>
         <c ca="left">
            <p>At2g10410</p>
         </c>
         <c ca="left">
            <p>Chr2: 4014110-4013202</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1-2</p>
         </c>
         <c ca="left">
            <p>At1g30835</p>
         </c>
         <c ca="left">
            <p>Chr1: 10967854-10966931</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1-3</p>
         </c>
         <c ca="left">
            <p>At5g28626</p>
         </c>
         <c ca="left">
            <p>Chr5: 10632245-10632826; 10633568-10633948<sup>a</sup></p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1L1</p>
         </c>
         <c ca="left">
            <p>At1g66795</p>
         </c>
         <c ca="left">
            <p>Chr1: 24926769-24927016</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>2-1</p>
         </c>
         <c ca="left">
            <p>At1g35112</p>
         </c>
         <c ca="left">
            <p>Chr1: 12841125-12840206</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>2-1d1</p>
         </c>
         <c ca="left">
            <p>At2g18535</p>
         </c>
         <c ca="left">
            <p>Chr2: 8048795-8048610</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3-1</p>
         </c>
         <c ca="left">
            <p>At3g44042</p>
         </c>
         <c ca="left">
            <p>Chr3: 15825096-15824141</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3-2</p>
         </c>
         <c ca="left">
            <p>At3g42658</p>
         </c>
         <c ca="left">
            <p>Chr3: 14761424-14760388</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3-1d1</p>
         </c>
         <c ca="left">
            <p>At2g21905</p>
         </c>
         <c ca="left">
            <p>Chr2: 9345876-9346247</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3-1d2</p>
         </c>
         <c ca="left">
            <p>At5g03205</p>
         </c>
         <c ca="left">
            <p>Chr5: 762065-761915</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3L1</p>
         </c>
         <c ca="left">
            <p>At4g04925</p>
         </c>
         <c ca="left">
            <p>Chr4: 2506188-2506806</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>4-1</p>
         </c>
         <c ca="left">
            <p>At5g28913</p>
         </c>
         <c ca="left">
            <p>Chr5: 10934749-10933814</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>4-2</p>
         </c>
         <c ca="left">
            <p>At1g03420</p>
         </c>
         <c ca="left">
            <p>Chr1: 846815-847698</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>4-2d1</p>
         </c>
         <c ca="left">
            <p>At2g05027</p>
         </c>
         <c ca="left">
            <p>Chr2: 1781076-1781386</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5-1</p>
         </c>
         <c ca="left">
            <p>At4g01525</p>
         </c>
         <c ca="left">
            <p>Chr4: 660768-661723</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5-1d1</p>
         </c>
         <c ca="left">
            <p>At1g18195</p>
         </c>
         <c ca="left">
            <p>Chr1: 6262595-6263382</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5-1d2</p>
         </c>
         <c ca="left">
            <p>At4g00953</p>
         </c>
         <c ca="left">
            <p>Chr4: 410383-411018</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5-2</p>
         </c>
         <c ca="left">
            <p>At5g27927</p>
         </c>
         <c ca="left">
            <p>Chr5: 9957820-9956864</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6-1</p>
         </c>
         <c ca="left">
            <p>At3g02515</p>
         </c>
         <c ca="left">
            <p>Chr3: 525338-526263</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6-1d1</p>
         </c>
         <c ca="left">
            <p>At5g42095</p>
         </c>
         <c ca="left">
            <p>Chr5: 16845951-16846349</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6-1d2</p>
         </c>
         <c ca="left">
            <p>At5g44565</p>
         </c>
         <c ca="left">
            <p>Chr5: 17981087-17980529</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6-1d3</p>
         </c>
         <c ca="left">
            <p>At5g42237</p>
         </c>
         <c ca="left">
            <p>Chr5: 16987448-16988447</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6L1</p>
         </c>
         <c ca="left">
            <p>At2g10935</p>
         </c>
         <c ca="left">
            <p>Chr2: 4312891-4312205</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7-1</p>
         </c>
         <c ca="left">
            <p>At3g13438</p>
         </c>
         <c ca="left">
            <p>Chr3: 4377991-4377083</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7-2</p>
         </c>
         <c ca="left">
            <p>At3g31442</p>
         </c>
         <c ca="left">
            <p>Chr3: 12807354-12806392</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7L1</p>
         </c>
         <c ca="left">
            <p>At1g36745</p>
         </c>
         <c ca="left">
            <p>Chr1: 13912120-13913031</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7L2</p>
         </c>
         <c ca="left">
            <p>At3g61625</p>
         </c>
         <c ca="left">
            <p>Chr3: 22815058-22814684</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7L3</p>
         </c>
         <c ca="left">
            <p>At5g52140</p>
         </c>
         <c ca="left">
            <p>Chr5: 21206508-21206698</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8-1</p>
         </c>
         <c ca="left">
            <p>At1g50735</p>
         </c>
         <c ca="left">
            <p>Chr1: 18811080-18810175</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8L1</p>
         </c>
         <c ca="left">
            <p>At5g38915</p>
         </c>
         <c ca="left">
            <p>Chr5: 15597647-15597844</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8L2</p>
         </c>
         <c ca="left">
            <p>At2g24745</p>
         </c>
         <c ca="left">
            <p>Chr2: 10540693-10541337</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8L3</p>
         </c>
         <c ca="left">
            <p>At1g52615</p>
         </c>
         <c ca="left">
            <p>Chr1: 19607182-19606826</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>9-1</p>
         </c>
         <c ca="left">
            <p>At1g44935</p>
         </c>
         <c ca="left">
            <p>Chr1: 16904928-16905344</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>9L1</p>
         </c>
         <c ca="left">
            <p>At1g32455</p>
         </c>
         <c ca="left">
            <p>Chr1: 11733481-11733785</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>9L2</p>
         </c>
         <c ca="left">
            <p>At1g69365</p>
         </c>
         <c ca="left">
            <p>Chr1: 26083199-26083479</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>10-1</p>
         </c>
         <c ca="left">
            <p>At3g58865</p>
         </c>
         <c ca="left">
            <p>Chr3: 21776975-21777729</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>10L1</p>
         </c>
         <c ca="left">
            <p>At5g46395</p>
         </c>
         <c ca="left">
            <p>Chr5: 18836667-18837050</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>10L2</p>
         </c>
         <c ca="left">
            <p>At5g42945</p>
         </c>
         <c ca="left">
            <p>Chr5: 17240081-17240294</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>10L3</p>
         </c>
         <c ca="left">
            <p>At1g35255</p>
         </c>
         <c ca="left">
            <p>Chr1: 12935906-12936263</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a</sup>Position is discontinuous due to insertion of AtLANTYS2_LTR sequence.</p>
      <p>Chr = chromosome.</p>
   </tblfn></tbl>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Divergence matrices of <it>Arabidopsis thaliana Sadhu </it>elements</b>. Additional file <supplr sid="S1">1</supplr> is a spreadsheet file containing divergence matrices of <it>A. thaliana Sadhu </it>elements, both within subfamilies and of consensus sequences across subfamilies. These matrices are based on ClustalX multiple sequence alignment.</p>
</text>
<file name="1759-8753-1-10-S1.XLS">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Partial <it>Sadhu </it>elements</p>
</st>
<p>The <it>Sadhu2</it>, <it>Sadhu3</it>, <it>Sadhu4</it>, <it>Sadhu5</it>, and <it>Sadhu6 </it>subfamilies feature derivative sequences that are greater than 80% identical to a particular full-length element (Figure <figr fid="F2">2</figr>, Table <tblr tid="T1">1</tblr>, Additional file <supplr sid="S1">1</supplr>). Many of the partial elements sequences are 5' truncated: that is, the region of similarity shared with the most closely related full-length element does not extend to the 5' end, but contains remnants of 3' poly(A) tracts (recognizably A-rich regions) and, in some cases, flanking direct repeats that represent TSDs. This pattern is consistent with abortive retrotransposition. Other partial sequences align to internal sections of full-length elements. In the case of <it>Sadhu2-1d</it>, a 3' poly(A) tract is detectable, but is preceded by a stretch of DNA sequence (19 bp) that does not align to the prospective progenitor <it>Sadhu </it>element (Figure <figr fid="F2">2c</figr>; <it>Sadhu7L1 </it>and <it>Sadhu10L3 </it>also have this structure). This type of chimeric retrotransposon structure can result from template switching during retrotransposition <abbrgrp>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>. In contrast, the <it>Sadhu8L3 </it>derivative terminates in a poly(A) tract at a position earlier than its closest full-length element (Figure <figr fid="F2">2e</figr>). This structure might arise from abortive transcription and early polyadenylation of the precursor sequence or through subsequent internal deletion of the element. If partial elements arose by segmental duplication, we would expect to see DNA sequence similarity extending beyond the <it>Sadhu</it>-related sequence. However, none of the <it>Sadhu </it>elements in the Columbia (Col) reference genome shares significant sequence similarity in flanking genomic regions with their derivative elements. Therefore, it is more likely that the partial elements are remnants of ancestral retrotransposition followed by template switching, deletion and/or divergence.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Schematic alignment of selected <it>Sadhu </it>subfamilies in strain Col</p></caption><text>
   <p><b>Schematic alignment of selected <it>Sadhu </it>subfamilies in strain Col</b>. TSD sequences are different at different elements. Sizes of TSDs: TSD1, 11 base pairs (bp); TSD2, 12 bp; TSD3, 12 bp; TSD4, 10 bp; TSD5, 13 bp. Percentages correspond to sequence identity to the longest element in the subfamily. Sizes marked above each line represent positions relative to the gapped alignment and might be slightly different from the nucleotide length of element. (a) <it>Sadhu5</it>; (b) <it>Sadhu6</it>; (c) <it>Sadhu2</it>; (d) <it>Sadhu3</it>; (e) <it>Sadhu8-1 </it>versus <it>Sadhu8L3</it>. TSD = target site duplication.</p>
</text><graphic file="1759-8753-1-10-2" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>Radiation of the <it>Sadhu5 </it>subfamily in <it>A. thaliana</it>
</p>
</st>
<p>A comparison of the genome sequences of two <it>Arabidopsis </it>strains, Col and Ler, revealed over 150 indels caused by differential activity of transposable elements between the strains <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>. We previously reported that several <it>Sadhu </it>elements from different subfamilies are also polymorphic in terms of presence/absence among different <it>Arabidopsis </it>strains <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Below, we examine closely related elements from a single subfamily in a set of 24 <it>A. thaliana </it>strains in order to trace the retrotranspositional history of these elements. The <it>Sadhu5 </it>subfamily contains four elements that are all greater than 80% identical to one another in the Col reference genome and close to full-length or full-length (&gt;600 bp) (Figure <figr fid="F2">2a</figr>). <it>Sadhu5-1 </it>and <it>Sadhu5-2 </it>are 83% identical to one another, while the two derivative elements, <it>Sadhu5-1d1 </it>and <it>Sadhu5-1d2</it>, are greater than 95% identical to <it>Sadhu5-1</it>. This family therefore represents a closely related group of sequences that might have expanded during the recent evolutionary history of the species.</p>
<p>We began by examining the <it>Sadhu5-2 </it>element. A polymerase chain reaction (PCR) product corresponding to an internal region of this element was present in every strain examined (Table <tblr tid="T2">2</tblr>). We investigated whether <it>Sadhu5-2 </it>elements in different strains were present in the same genomic location: using an outward facing forward primer in the element and reverse primers designed based on the Col reference genome 5' and 3' adjacent sequence, we attempted to amplify PCR products spanning the flanks of the elements. In every case, we were successful in amplifying products of the expected size (Table <tblr tid="T2">2</tblr>). Therefore, it is likely that <it>Sadhu5-2 </it>represents a single insertion event in the ancestor of the <it>A. thaliana </it>lineage.</p>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Distribution of <it>Sadhu5 </it>subfamily members in natural strains.</p></caption><tblbdy cols="14">
      <r>
         <c ca="left">
            <p>
               <b>Accession number</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Stock number</b>
            </p>
         </c>
         <c ca="left" cspan="3">
            <p>
               <b>
                  <it>Sadhu5-1</it>
               </b>
            </p>
         </c>
         <c ca="left" cspan="3">
            <p>
               <b>
                  <it>Sadhu5-1d1</it>
               </b>
            </p>
         </c>
         <c ca="left" cspan="3">
            <p>
               <b>
                  <it>Sadhu5-1d2</it>
               </b>
            </p>
         </c>
         <c ca="left" cspan="3">
            <p>
               <b>
                  <it>Sadhu5-2</it>
               </b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="12">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>Int</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>5'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>3'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Int</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>5'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>3'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Int</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>5'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>3'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Int</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>5'</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>3'</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="14">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Br-0</p>
         </c>
         <c ca="left">
            <p>CS22628</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Bur-0</p>
         </c>
         <c ca="left">
            <p>CS22656</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C24</p>
         </c>
         <c ca="left">
            <p>CS22620</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Col</p>
         </c>
         <c ca="left">
            <p>Lehle WT-2</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ct-1</p>
         </c>
         <c ca="left">
            <p>CS22639</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Cvi</p>
         </c>
         <c ca="left">
            <p>Lehle WT-18</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Cvi-0</p>
         </c>
         <c ca="left">
            <p>CS22614</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Fei-0</p>
         </c>
         <c ca="left">
            <p>CS22645</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Hi-0</p>
         </c>
         <c ca="left">
            <p>CS6736</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kn-0</p>
         </c>
         <c ca="left">
            <p>CS6762</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>Short</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kondara</p>
         </c>
         <c ca="left">
            <p>CS22651</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>Long</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Kz-1</p>
         </c>
         <c ca="left">
            <p>CS22606</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ler</p>
         </c>
         <c ca="left">
            <p>Lehle WT-4</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N13</p>
         </c>
         <c ca="left">
            <p>CS22491</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Po-0</p>
         </c>
         <c ca="left">
            <p>CS6839</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Pro-0</p>
         </c>
         <c ca="left">
            <p>CS22649</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Pu2-7</p>
         </c>
         <c ca="left">
            <p>CS22592</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ra-0</p>
         </c>
         <c ca="left">
            <p>CS22632</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Tamm-27</p>
         </c>
         <c ca="left">
            <p>CS22605</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ts-1</p>
         </c>
         <c ca="left">
            <p>CS22647</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Tsu-1</p>
         </c>
         <c ca="left">
            <p>CS22641</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Van-0</p>
         </c>
         <c ca="left">
            <p>CS22627</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Wei-0</p>
         </c>
         <c ca="left">
            <p>CS22622</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>ES</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X*</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ws-2</p>
         </c>
         <c ca="left">
            <p>CS22659</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>Long</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
         <c ca="left">
            <p>X</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Empty cells signify no PCR product amplified with the corresponding primers.</p>
      <p>3' = PCR product with one primer located in the 3' flank and the other in the element; 5' = PCR product with one primer located in the 5' flank and the other in the element; ES = negative for int PCR, but empty site amplified with 5' and 3' flanking primers; int = internal PCR product, both primers located within the element; PCR = polymerase chain reaction; X* = PCR product with more distal but not with more proximal primers.</p>
   </tblfn></tbl>
<p>In contrast to our finding for <it>Sadhu5-2</it>, we were unable to amplify PCR products from several strains using primers specific to the <it>Sadhu5-1</it>, <it>Sadhu5-1d1 </it>or <it>Sadhu5-1d2 </it>insertion sites in the Col strain (Table <tblr tid="T2">2</tblr>). To investigate the structure of putative deletions or 'empty' sites for these elements, we amplified PCR products from these strains using primers located 5' and 3' of the element in the Col reference genome. We identified 2 strains for <it>Sadhu5-1 </it>and 17 strains for <it>Sadhu5-1d1 </it>that amplified a specific, shorter PCR product than would be predicted from the reference genome. We obtained DNA sequence for these PCR products: in every case, there was a clean retrotransposition 'empty site', with a single, identical copy of the target site duplication of the element in strain Col (Figure <figr fid="F3">3</figr>). The structure of the 'empty' versus the 'filled' sites are typical of retroelements that undergo target primed reverse transcription (TPRT) <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. The Col strain carries the most common haplotype for the region surrounding the <it>Sadhu5-1d1 </it>insertion (Figure <figr fid="F3">3</figr>). Therefore, the most parsimonious explanation is that the element inserted relatively recently in the history of these strains, after the divergence of different haplotypes in this region.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Empty sites detected in <it>Arabidopsis thaliana </it>strains at positions occupied in Col by (a) <it>Sadhu5-1 </it>and (b)<it>Sadhu5-1d1</it></p></caption><text>
   <p><b>Empty sites detected in <it>Arabidopsis thaliana </it>strains at positions occupied in Col by (a) <it>Sadhu5-1 </it>and (b)<it>Sadhu5-1d1</it></b>. Multiple sequence alignments of Col 5' and 3' sequences flanking the site of insertion along with sequences of strains that do not contain the insertion. Sequences corresponding to <it>Sadhu </it>element insertions have been removed. Genbank accession numbers for <it>Sadhu5-1 </it>sequences are <ext-link ext-link-id="EF535531" ext-link-type="gen">EF535531</ext-link> and <ext-link ext-link-id="EF535532" ext-link-type="gen">EF535532</ext-link>. Genbank accession numbers for <it>Sadhu5-1d1 </it>sequences are <ext-link ext-link-id="EF535533" ext-link-type="gen">EF535533</ext-link>, <ext-link ext-link-id="EF535534" ext-link-type="gen">EF535534</ext-link>, <ext-link ext-link-id="EF535535" ext-link-type="gen">EF535535</ext-link>, <ext-link ext-link-id="EF535536" ext-link-type="gen">EF535536</ext-link>, <ext-link ext-link-id="EF535537" ext-link-type="gen">EF535537</ext-link>, <ext-link ext-link-id="EF535538" ext-link-type="gen">EF535538</ext-link>, <ext-link ext-link-id="EF535539" ext-link-type="gen">EF535539</ext-link>, <ext-link ext-link-id="EF535540" ext-link-type="gen">EF535540</ext-link>, <ext-link ext-link-id="EF535541" ext-link-type="gen">EF535541</ext-link>, <ext-link ext-link-id="EF535542" ext-link-type="gen">EF535542</ext-link>, <ext-link ext-link-id="EF535543" ext-link-type="gen">EF535543</ext-link>, <ext-link ext-link-id="EF535544" ext-link-type="gen">EF535544</ext-link>, <ext-link ext-link-id="EF535545" ext-link-type="gen">EF535545</ext-link>, <ext-link ext-link-id="EF535546" ext-link-type="gen">EF535546</ext-link>, <ext-link ext-link-id="EF535547" ext-link-type="gen">EF535547</ext-link>, <ext-link ext-link-id="EF535548" ext-link-type="gen">EF535548</ext-link>, and <ext-link ext-link-id="EF535549" ext-link-type="gen">EF535549</ext-link>.</p>
</text><graphic file="1759-8753-1-10-3" hint_layout="double"/></fig>
<p>The identification of clean presence/absence polymorphisms among <it>Arabidopsis </it>strains also lends support to the model that <it>Sadhu5-1 </it>and <it>Sadhu5-1d1 </it>are relatively recent retrotransposition events. In contrast, we could not find polymorphic insertion sites for <it>Sadhu5-1d2 </it>and <it>Sadhu5-2</it>, suggesting that these elements represent older, ancestral insertion events. <it>Sadhu5-2 </it>appears to be a truncated retrotransposition product relative to <it>Sadhu5-1</it>, as it is missing sequence that would align with the 5' portion of <it>Sadhu5-1 </it>(Figure <figr fid="F2">2a</figr>). Therefore, while the <it>Sadhu5-2 </it>sequence itself appears more prevalent than <it>Sadhu5-1</it>, the latter element could not be derived by retrotransposition or gene duplication from the former without invoking a subsequent deletion of the 5' region of the element, which is unlikely given that the same structure appears to exist in all strains based on PCR of the flanking regions (Table <tblr tid="T2">2</tblr>). An alternate hypothesis is that the full-length ancestor to this subfamily has been deleted or lost from the <it>A. thaliana </it>Col reference strain.</p>
</sec>
<sec>
<st>
<p>Target site consensus</p>
</st>
<p>TSDs are typical of most transposable elements. Non-LTR retroelements mobilized by the LINE enzymatic machinery feature TSDs of 7 to 20 bp in length. These TSDs result from the target primed reverse transcription mechanism, where two staggered cuts are made on the target strand <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. In mammals, the consensus for the LINE 5' endonuclease cleavage site contains two thymines, whereas the duplicated target site often starts with a string of four adenines <abbrgrp>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
</abbrgrp>. This string of adenines (thymines on the opposite strand) within the target site are hypothesized to act in priming reverse transcription from the poly(A) tail of the LINE transcript. SINEs, which are mobilized by hijacking of the LINE machinery <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, have a similar target site preference as LINEs. While plant LINEs are predicted to move in a similar manner to mammalian LINEs, the consensus site has not yet been studied in a comprehensive manner. However, a study of <it>Arabidopsis </it>SINEs indicated a similar consensus sequence as mammalian LINEs; a string of adenines within the target site duplication, as well as a thymine at the 3' nicking site <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>.</p>
<p>A total of 14 <it>Sadhu </it>sequences containing target site duplications of between 7 and 16 bp were identified in the <it>A. thaliana </it>genome (Table <tblr tid="T3">3</tblr>). We examined the region around these target sites to determine whether 5' and 3' nicking site consensus patterns could be identified and, if so, whether they resembled patterns previously reported for LINEs and SINEs. As shown in Figure <figr fid="F4">4</figr>, the 5' nicking site does appear to favor a thymine (preceded by adenines), while the target site duplication also began with a stretch of adenines. There is no strong consensus at the 3' nicking site. These data are consistent with a model in which <it>Sadhu </it>elements, similar to SINEs, are mobilized by the LINE-encoded target primed reverse transcription machinery.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Logo diagrams of consensus sequences at <it>Sadhu </it>insertion sites, based on 14 insertions in the Col reference genome</p></caption><text>
   <p><b>Logo diagrams of consensus sequences at <it>Sadhu </it>insertion sites, based on 14 insertions in the Col reference genome</b>. Nine nucleotides proximal to the target site were examined as the 5' nicking site, while nine nucleotides distal to the target site were examined as the 3' nicking site. The first seven nucleotides within the target site duplication were examined.</p>
</text><graphic file="1759-8753-1-10-4" hint_layout="double"/></fig>
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Target site sequences of <it>Arabidopsis thaliana Sadhu </it>elements.</p></caption><tblbdy cols="4">
      <r>
         <c ca="left">
            <p>
               <b>
                  <it>Sadhu</it>
               </b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>5' Nicking site</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Target site duplication</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>3' Nicking site</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1-1</p>
         </c>
         <c ca="left">
            <p>tacaaaagt</p>
         </c>
         <c ca="left">
            <p>aaatgactagtagga</p>
         </c>
         <c ca="left">
            <p>taataaaca</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1-2</p>
         </c>
         <c ca="left">
            <p>acttgacat</p>
         </c>
         <c ca="left">
            <p>agctatgaaaatcgt</p>
         </c>
         <c ca="left">
            <p>tggaccatc</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3-2</p>
         </c>
         <c ca="left">
            <p>tttatgaag</p>
         </c>
         <c ca="left">
            <p>aatcttcgtt</p>
         </c>
         <c ca="left">
            <p>cagtcctgc</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>4-2</p>
         </c>
         <c ca="left">
            <p>acaacattt</p>
         </c>
         <c ca="left">
            <p>aaagatatctcgtttg</p>
         </c>
         <c ca="left">
            <p>tggagaacg</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5-1</p>
         </c>
         <c ca="left">
            <p>ctgcaatat</p>
         </c>
         <c ca="left">
            <p>agtactactact</p>
         </c>
         <c ca="left">
            <p>aatgttatc</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>5-1d1</p>
         </c>
         <c ca="left">
            <p>ttaagaaag</p>
         </c>
         <c ca="left">
            <p>aaatgtgtctcaacg</p>
         </c>
         <c ca="left">
            <p>cgaccaacg</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6-1</p>
         </c>
         <c ca="left">
            <p>gaagagacc</p>
         </c>
         <c ca="left">
            <p>aaaacctagtctggag</p>
         </c>
         <c ca="left">
            <p>tacaaagta</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>6-1d2</p>
         </c>
         <c ca="left">
            <p>ttataaaag</p>
         </c>
         <c ca="left">
            <p>aaaactaatcttaa</p>
         </c>
         <c ca="left">
            <p>gaaaaatac</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7-1</p>
         </c>
         <c ca="left">
            <p>atggaagat</p>
         </c>
         <c ca="left">
            <p>aaagaatctggcttt</p>
         </c>
         <c ca="left">
            <p>ttgtaaaac</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7-2</p>
         </c>
         <c ca="left">
            <p>ctatggaag</p>
         </c>
         <c ca="left">
            <p>aagaaggtaa</p>
         </c>
         <c ca="left">
            <p>ccaactact</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7L1</p>
         </c>
         <c ca="left">
            <p>agggagttt</p>
         </c>
         <c ca="left">
            <p>ttaagag</p>
         </c>
         <c ca="left">
            <p>ttttattat</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>7L2</p>
         </c>
         <c ca="left">
            <p>tcatataat</p>
         </c>
         <c ca="left">
            <p>aattacctagca</p>
         </c>
         <c ca="left">
            <p>cgaaatcta</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>8-1</p>
         </c>
         <c ca="left">
            <p>gaacataac</p>
         </c>
         <c ca="left">
            <p>aaaagatccaa</p>
         </c>
         <c ca="left">
            <p>acgtatggt</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>9L3</p>
         </c>
         <c ca="left">
            <p>caatcaacc</p>
         </c>
         <c ca="left">
            <p>ccgtatt</p>
         </c>
         <c ca="left">
            <p>gtagatttt</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>An examination of the <it>A. thaliana </it>Col reference genome <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp> reveals less than 1,500 LINE superfamily-related elements spanning 12 different lineages, including both LINE1, LINE2, TA11 and TA12 families <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
<abbr bid="B21">21</abbr>
</abbrgrp>. However, less than 50 LINEs in the <it>A. thaliana </it>reference genome are greater than 5,000 bp in length, and almost none contain intact open reading frames. Therefore, while it is evident that <it>Sadhu </it>elements have been mobile during the divergence of different <it>Arabidopsis </it>strains, their low copy number might be a consequence of the sheer rarity of active autonomous LINE driver elements.</p>
</sec>
<sec>
<st>
<p>Sadhu elements can be identified in taxa outside of <it>A. thaliana</it>
</p>
</st>
<p>In order to explore the evolutionary distribution of the <it>Sadhu </it>sequence family, we sought to identify <it>Sadhu </it>homologs in two related species of the <it>Brassicaceae </it>family, <it>A. arenosa </it>and <it>A. lyrata</it>. These species are estimated to have diverged from <it>A. thaliana </it>approximately 5 million years ago. The genomes of the three species have changed significantly in that interval: <it>Arabidopsis arenosa </it>and <it>Arabidopsis lyrata </it>maintain the ancestral complement of eight chromosomes, while <it>A. thaliana </it>has condensed its chromosome number to five <abbrgrp>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>. Molecular evolutionary studies have determined that the average sequence divergence at silent sites between <it>A. thaliana </it>and <it>A. arenosa </it>or <it>A. lyrata </it>is 12% to 15% <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>.</p>
<p>We attempted to isolate <it>Sadhu </it>elements from <it>A. arenosa</it>. DNA sequence was obtained from specific PCR products that were generated using <it>A. arenosa </it>genomic templates and primers corresponding to the <it>A. thaliana </it>elements <it>Sadhu5-1</it>, <it>Sadhu1-3</it>, <it>Sadhu3-1</it>, and <it>Sadhu8-1 </it>(Table <tblr tid="T4">4</tblr>; Additional file <supplr sid="S2">2</supplr>). In a phylogenetic analysis, the <it>A. arenosa Sadhu </it>sequences that we obtained cluster within the previously defined subfamilies (Figure <figr fid="F5">5a</figr>).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Phylogenetic analysis of <it>Sadhu </it>sequences from <it>Arabidopsis arenosa </it>and <it>Arabidopsis lyrata </it>relative to <it>Arabidopsis thaliana</it></p></caption><text>
   <p><b>Phylogenetic analysis of <it>Sadhu </it>sequences from <it>Arabidopsis arenosa </it>and <it>Arabidopsis lyrata </it>relative to <it>Arabidopsis thaliana</it></b>. (a) Maximum parsimony phylogram of <it>A. arenosa </it>(Aa) internal <it>Sadhu </it>sequence clones and related <it>A. thaliana </it>elements. <it>A. arenosa </it>sequences are in blue. (b) Maximum parsimony phylogram of <it>A. lyrata Sadhu </it>sequences &gt;350 bp (Al, purple) and full-length <it>A. thaliana </it>elements. Shaded large numbers indicate <it>Sadhu </it>subfamilies. See to Additional file <supplr sid="S4">4</supplr> for DNA sequences of <it>A. lyrata </it>elements. Bootstrap values (percentages) were calculated from 500 bootstrap replicates.</p>
</text><graphic file="1759-8753-1-10-5" hint_layout="double"/></fig>
<tbl id="T4"><title><p>Table 4</p></title><caption><p><it>Sadhu </it>sequences from <it>Arabidopsis arenosa</it>.</p></caption><tblbdy cols="6">
      <r>
         <c ca="left">
            <p>
               <b>Sequence name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Length (bp) of <it>Sadhu </it>sequence</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Genbank accession number</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b><it>Arabidopsis thaliana </it>primer origin</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Closest <it>A. thaliana </it>homolog (pairwise blast)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Percentage identity to <it>A. thaliana </it>ortholog</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1</p>
         </c>
         <c ca="left">
            <p>283</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="DQ680035" ext-link-type="gen">DQ680035</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1-3</p>
         </c>
         <c ca="left">
            <p>Sadhu1-2</p>
         </c>
         <c ca="left">
            <p>81</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1FP1</p>
         </c>
         <c ca="left">
            <p>204</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535557" ext-link-type="gen">EF535557</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 5' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-1</p>
         </c>
         <c ca="left">
            <p>74</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1FP2</p>
         </c>
         <c ca="left">
            <p>117</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535558" ext-link-type="gen">EF535558</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 5' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-3</p>
         </c>
         <c ca="left">
            <p>85</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1FP3</p>
         </c>
         <c ca="left">
            <p>45</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535559" ext-link-type="gen">EF535559</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 5' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-3</p>
         </c>
         <c ca="left">
            <p>82</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1TP1</p>
         </c>
         <c ca="left">
            <p>470</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535560" ext-link-type="gen">EF535560</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-2</p>
         </c>
         <c ca="left">
            <p>80</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1TP2</p>
         </c>
         <c ca="left">
            <p>232</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535561" ext-link-type="gen">EF535561</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-3</p>
         </c>
         <c ca="left">
            <p>84</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1TP3</p>
         </c>
         <c ca="left">
            <p>478</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535565" ext-link-type="gen">EF535565</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-3</p>
         </c>
         <c ca="left">
            <p>85</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu1TP4</p>
         </c>
         <c ca="left">
            <p>480</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535564" ext-link-type="gen">EF535564</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu1 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu1-3</p>
         </c>
         <c ca="left">
            <p>84</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu3</p>
         </c>
         <c ca="left">
            <p>686</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="DQ680038" ext-link-type="gen">DQ680038</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu3-1</p>
         </c>
         <c ca="left">
            <p>Sadhu3-2</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu3FP1</p>
         </c>
         <c ca="left">
            <p>49</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535567" ext-link-type="gen">EF535567</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu3 5' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu3-1</p>
         </c>
         <c ca="left">
            <p>91</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu3TP1</p>
         </c>
         <c ca="left">
            <p>188</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535566" ext-link-type="gen">EF535566</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu3 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu3-2</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu5</p>
         </c>
         <c ca="left">
            <p>344</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="DQ680036" ext-link-type="gen">DQ680036</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu5-1, Sadhu5-2</p>
         </c>
         <c ca="left">
            <p>Sadhu5-1</p>
         </c>
         <c ca="left">
            <p>88</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu5FP1</p>
         </c>
         <c ca="left">
            <p>94</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535550" ext-link-type="gen">EF535550</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu5 5' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu5-1</p>
         </c>
         <c ca="left">
            <p>87</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu5TP1</p>
         </c>
         <c ca="left">
            <p>384</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535551" ext-link-type="gen">EF535551</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu5 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu5-2</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu8</p>
         </c>
         <c ca="left">
            <p>472</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="DQ680033" ext-link-type="gen">DQ680033</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu8-1</p>
         </c>
         <c ca="left">
            <p>Sadhu8-1</p>
         </c>
         <c ca="left">
            <p>79</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu8FP1</p>
         </c>
         <c ca="left">
            <p>202</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535553" ext-link-type="gen">EF535553</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu8 5' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu8-1</p>
         </c>
         <c ca="left">
            <p>76</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AaSadhu8TP1</p>
         </c>
         <c ca="left">
            <p>149</p>
         </c>
         <c ca="left">
            <p>
               <ext-link ext-link-id="EF535556" ext-link-type="gen">EF535556</ext-link>
            </p>
         </c>
         <c ca="left">
            <p>Sadhu8 3' TAIL</p>
         </c>
         <c ca="left">
            <p>Sadhu8-1</p>
         </c>
         <c ca="left">
            <p>79</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>TAIL PCR = thermal asymmetric interlaced polymerase chain reaction.</p>
   </tblfn></tbl>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Polymerase chain reaction (PCR) primers</b>. Additional file <supplr sid="S2">2</supplr> is a table listing PCR primers used in this study.</p>
</text>
<file name="1759-8753-1-10-S2.DOC">
   <p>Click here for file</p>
</file>
</suppl>
<p>We conducted TAIL PCR using <it>A. arenosa </it>genomic templates to identify more complete sequences for the <it>Sadhu </it>elements identified by PCR. Three 5' and four 3' flanking sequences homologous to <it>Sadhu1 </it>were amplified and cloned from <it>A. arenosa </it>genomic DNA template (Table <tblr tid="T4">4</tblr>, Additional file <supplr sid="S3">3</supplr>). Several of the 3' <it>Sadhu1 </it>portions were &gt;95% identical to one another, indicative of recent retrotransposition in this subfamily. Two 5' flanking clones (<it>AaSadhu1FP3 </it>and <it>AlSadhu1FP1</it>) shared a stretch of 150 bp of sequence that does not correspond to known <it>Sadhu1 </it>sequence in <it>A. thaliana</it>. This extra sequence may have been transduced by the <it>Sadhu </it>element resulting in a chimeric retroposon.</p>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>DNA sequence information for <it>Sadhu </it>sequences greater than 350 base pairs (bp) in the <it>Arabidopsis lyrata </it>genome assembly</b>. Additional file <supplr sid="S4">4</supplr> provides DNA sequence information for <it>Sadhu </it>sequences greater than 350 bp in the <it>Arabidopsis lyrata </it>genome assembly. Target site duplications are indicated in purple and the conserved CAATCGTTSC motif is italicized and underlined. Non-<it>Sadhu </it>sequence inserted in the elements is in gray and italicized</p>
</text>
<file name="1759-8753-1-10-S3.DOC">
   <p>Click here for file</p>
</file>
</suppl>
<p>Both 3' and 5' flanking sequences were obtained by TAIL PCR corresponding to <it>A. arenosa Sadhu3 </it>(Table <tblr tid="T4">4</tblr> and Additional file <supplr sid="S3">3</supplr>). Because these sequences could not be joined by PCR, there are likely to be at least two members of this subfamily in <it>A. arenosa</it>. <it>Sadhu5 </it>TAIL PCR sequences isolated from <it>A. arenosa </it>were 85% to 88% identical to <it>A. thaliana Sadhu5 </it>subfamily members (5' and 3' portions) (Table <tblr tid="T4">4</tblr> and Additional file <supplr sid="S3">3</supplr>). 5' and 3' sequences were also obtained corresponding to <it>Sadhu8 </it>subfamily members from <it>A. arenosa </it>(Table <tblr tid="T4">4</tblr> and Additional file <supplr sid="S3">3</supplr>). These sequences were greater than 90% identical to one another and 75% to 79% identical to <it>A. thaliana Sadhu8-1</it>, indicating that retrotransposition occurred more recently than the divergence of the two species. In summary, <it>A. arenosa </it>contains several members of at least four <it>Sadhu </it>subfamilies. Examination of sequences flanking the <it>Sadhu </it>elements suggests that these elements are located in non-orthologous positions in <it>A. arenosa </it>relative to <it>A. thaliana </it>(Additional file <supplr sid="S3">3</supplr>).</p>
<p>
<it>A. lyrata Sadhu </it>elements were identified from iterative BLAST searches of the recent <it>A. lyrata </it>genome sequence assembly (JGI V. 1.0; Joint Genome Institute, Walnut Creek, CA, USA). We used <it>A. thaliana </it>full-length <it>Sadhu </it>sequences as queries in a primary search to identify a set of <it>A. lyrata </it>sequences, which were subsequently used as queries in secondary searches. This method is expected to identify all full-length or near full-length sequences, although shorter <it>Sadhu</it>-related partial elements might have been overlooked. In total, we found 21 full-length and 4 partial <it>Sadhu </it>elements greater than 350 bp in length (Table <tblr tid="T5">5</tblr>, Additional file <supplr sid="S4">4</supplr>). The number of full-length elements (21) is similar to that in <it>A. thaliana </it>(16), indicating that the element family is relatively small in both species. Full-length <it>A. lyrata </it>elements are structurally similar to <it>Sadhu </it>elements in <it>A. thaliana</it>: they begin with a conserved motif (5' CAATCGTTSC 3' followed by a polypyrimidine patch) and terminate approximately 900 bp downstream in a poly(A) tract. Of the 21 full-length elements, 15 feature direct target site duplications of between 8 and 18 bp in length, suggesting that they originated via retrotransposition. There are no discernable conserved open reading frames. None of the elements appear in orthologous locations to <it>A. thaliana </it>elements, indicating that <it>Sadhu </it>elements have mobilized considerably since the divergence of the two species, and that related elements are similar through retrotransposition and not through direct inheritance of the genomic region.</p>
<suppl id="S4">
<title>
<p>Additional file 4</p>
</title>
<text>
<p>
<b>Partial <it>Sadhu </it>elements and flanking genomic sequences identified in <it>Arabidopsis arenosa</it>
</b>. Additional file <supplr sid="S3">3</supplr> contains diagrams of partial <it>Sadhu </it>elements and flanking genomic sequences identified in <it>A. arenosa</it>. (a) <it>Sadhu1</it>; (b) <it>Sadhu3</it>; (c) <it>Sadhu5</it>; (d) <it>Sadhu8</it>. The scale is indicated. Internal polymerase chain reaction (PCR) sequences used specific primers based on the <it>Arabidopsis thaliana </it>sequence, while 5' and 3' sequences were obtained by thermal asymmetric interlaced (TAIL) PCR (see Table <tblr tid="T4">4</tblr> for details). 5' <it>Sadhu </it>sequences are in blue, 3' <it>Sadhu </it>sequences are orange. Gray dotted arrows indicate the extent of <it>Sadhu </it>sequence homology. Features in flanking sequences are marked as green boxes. The inverted arrow in the annotation of the Aa5FP1 clone indicates the direction of transcription of the flanking gene-related sequence. <it>Sadhu5 </it>and <it>Sadhu8 </it>3' sequences feature poly(A) tracts at the <it>Sadhu </it>boundary, consistent with retrotransposition.</p>
</text>
<file name="1759-8753-1-10-S4.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<tbl id="T5"><title><p>Table 5</p></title><caption><p><it>Sadhu </it>elements &gt;350 base pairs (bp) in the <it>Arabidopsis lyrata </it>genome.</p></caption><tblbdy cols="7">
      <r>
         <c ca="left">
            <p>
               <b>Sequence name</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>JGI scaffold coordinates (approximate)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Orientation</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Length (bp) of <it>Sadhu </it>sequence</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Target site duplication (bp)</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Full length?</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Percentage identity to nearest <it>A. thaliana Sadhu</it></b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-1</p>
         </c>
         <c ca="left">
            <p>7:7309496-7310418</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>948</p>
         </c>
         <c ca="left">
            <p>18</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-2</p>
         </c>
         <c ca="left">
            <p>8:11697563-11698467</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>922</p>
         </c>
         <c ca="left">
            <p>12</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-3</p>
         </c>
         <c ca="left">
            <p>6:22517373-22518173</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>957</p>
         </c>
         <c ca="left">
            <p>14</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>84</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-4</p>
         </c>
         <c ca="left">
            <p>3:1662010-1662618</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>1009</p>
         </c>
         <c ca="left">
            <p>14</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>81</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-5</p>
         </c>
         <c ca="left">
            <p>1:24954753-24955365</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>924</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>84</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-6</p>
         </c>
         <c ca="left">
            <p>4:841417-842023</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>965</p>
         </c>
         <c ca="left">
            <p>16</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>82</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1-6</p>
         </c>
         <c ca="left">
            <p>3:13675822-13676434</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>927</p>
         </c>
         <c ca="left">
            <p>16</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>84</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu1d</p>
         </c>
         <c ca="left">
            <p>2:14298861-14299465</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>827</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>81</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu3-1</p>
         </c>
         <c ca="left">
            <p>7:12620425-12621361</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>928</p>
         </c>
         <c ca="left">
            <p>18</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>86</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu5-1</p>
         </c>
         <c ca="left">
            <p>7:17697122-17698008</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>879</p>
         </c>
         <c ca="left">
            <p>11</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>85</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu5d</p>
         </c>
         <c ca="left">
            <p>6:5062639-5062768</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>791</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>72</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu5d2</p>
         </c>
         <c ca="left">
            <p>6:25041036-25041746</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>804</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>73</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu5d3</p>
         </c>
         <c ca="left">
            <p>5:4156620-4157046</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>395</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>No</p>
         </c>
         <c ca="left">
            <p>71</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu6-1</p>
         </c>
         <c ca="left">
            <p>3:21898960-21899646</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>899</p>
         </c>
         <c ca="left">
            <p>14</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu6-2</p>
         </c>
         <c ca="left">
            <p>2:14183205-14186662</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>887*</p>
         </c>
         <c ca="left">
            <p>15</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu6-3</p>
         </c>
         <c ca="left">
            <p>8:5795744-5796493</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>927</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>78</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu7-1</p>
         </c>
         <c ca="left">
            <p>7:4360460-4360769</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>901</p>
         </c>
         <c ca="left">
            <p>16</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>79</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu8-1</p>
         </c>
         <c ca="left">
            <p>1:13276396-13277158</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>920</p>
         </c>
         <c ca="left">
            <p>13</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu8-2</p>
         </c>
         <c ca="left">
            <p>3:807549-808169</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>865</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>79</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu8-3</p>
         </c>
         <c ca="left">
            <p>2:25942-26642</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>908</p>
         </c>
         <c ca="left">
            <p>8</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu8-4</p>
         </c>
         <c ca="left">
            <p>8:14602238-14602861</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>875</p>
         </c>
         <c ca="left">
            <p>15</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>75</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu8-5</p>
         </c>
         <c ca="left">
            <p>7:17882685-17884241</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>930**</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>78</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu8-6</p>
         </c>
         <c ca="left">
            <p>6:17227119-17227908</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c ca="left">
            <p>910</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>77</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu10-1</p>
         </c>
         <c ca="left">
            <p>3:4473616-4474232</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>918</p>
         </c>
         <c ca="left">
            <p>17</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>80</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>AlSadhu10-2</p>
         </c>
         <c ca="left">
            <p>2:9675105-9675721</p>
         </c>
         <c ca="left">
            <p>+</p>
         </c>
         <c ca="left">
            <p>895</p>
         </c>
         <c ca="left">
            <p>ND</p>
         </c>
         <c ca="left">
            <p>Yes</p>
         </c>
         <c ca="left">
            <p>80</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>*interrupted by 2,028 nt non-<it>Sadhu </it>sequence; **Interrupted by 518 nt non-<it>Sadhu </it>sequence; bp = base pairs; JGI = Joint Genome Institute; ND = not detected; nt = nucleotides.</p>
   </tblfn></tbl>
<p>
<it>A. lyrata </it>elements are between 71% and 86% identical to the most similar <it>A. thaliana </it>element (Table <tblr tid="T5">5</tblr>). Figure <figr fid="F5">5b</figr> shows a phylogenetic tree showing the relationships among the 25 <it>A. lyrata </it>and 16 full-length <it>A. thaliana </it>elements. All <it>A. lyrata </it>elements clustered within previously defined subfamilies, indicating that the divergence of the different subfamilies predated the split of these two species. Most of the <it>Sadhu </it>subfamilies previously identified in <it>A. thaliana </it>have representatives in <it>A. lyrata</it>; however, there is a dramatic expansion of elements within certain subfamilies relative to others (Figure <figr fid="F5">5b</figr>, Table <tblr tid="T5">5</tblr>). For instance, the <it>Sadhu1 </it>subfamily contains three members in <it>A. thaliana </it>but has expanded to seven full-length members in <it>A. lyrata</it>. The <it>Sadhu8 </it>and <it>Sadhu6 </it>subfamilies are represented by only a single member in <it>A. thaliana</it>, but feature six and three full-length elements, respectively, in <it>A. lyrata</it>. These genome comparisons suggest that, while multiple distinct <it>Sadhu </it>subfamilies have been active since the divergence of these two taxa, different subfamilies have proliferated more in certain species than in others. Alternatively, certain subfamilies may have been pared down by deletion and elimination in one species relative to the other.</p>
</sec>
<sec>
<st>
<p>Perspective</p>
</st>
<p>We have identified <it>Sadhu </it>sequences corresponding to multiple subfamilies in the related species <it>A. lyrata </it>and <it>A. arenosa</it>. The presence of target site duplications and poly(A) tracts, along with the absence of orthologous sites, strongly suggests that <it>Sadhu </it>elements in these other taxa arose via retrotransposition. In a few cases, elements within a given species are greater than 95% identical to one another, indicating that these sequences have mobilized more recently than the divergence of the different species. The partial sequence available for the <it>Brassica </it>genome <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp> does not contain <it>Sadhu-</it>related sequences. While these sequences may have been lost from some taxa, the high degree of divergence amongst elements in the <it>Arabidopsis </it>genus strongly suggests an ancient origin for these elements. Therefore, we predict that some sequences related to <it>Sadhu </it>elements might be present in other plants, perhaps even those quite distantly related to <it>Arabidopsis</it>. These presumably more divergent <it>Sadhu </it>relatives might share little overall primary nucleotide sequence with the <it>A. thaliana </it>elements, but might have maintained other recognizable diagnostic features, such as length, conserved 5' motif(s), a 3' poly (A) tract, and target site duplications.</p>
<p>Low copy number and high divergence among element subfamilies is not a phenomenon unique to <it>Sadhu </it>elements. Indeed, because only 10% of the <it>Arabidopsis </it>genome is composed of transposable elements <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>, lower than other sequenced plant genomes, there may be a general tendency for genome size reduction in this species through progressive loss of repetitive DNA. A comparison of the <it>A. thaliana </it>genome with the five times larger <it>Brassica oleracea </it>genome revealed that while most element families were present in both species, some (for example, <it>CACTA </it>elements) had contributed more than others to the relative expansion of the <it>Brassica </it>genome <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. As with the different <it>Sadhu </it>subfamilies, different SINE non-LTR subfamilies appear to be more active in each of the two species <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. The lack of orthologous <it>Sadhu </it>insertion sites among different <it>Arabidopsis </it>species is also reminiscent of the case with SINEs, which similarly featured no shared sites in <it>B. oleracea </it>
<abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>. Both types of non-LTR elements are therefore subject to frequent loss over evolutionary time. This susceptibility may be a consequence of the dispersed pattern of localization of <it>Sadhus </it>and SINEs: elements that target heterochromatic regions, such as <it>Athila </it>LTR elements, appear to be relatively protected from this winnowing process <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>.</p>
<p>Although retroelement superfamilies can typically be found in widely differing plant taxa <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>, certain families show longer phylogenetic branch lengths and low copy numbers more similar to the case with <it>Sadhu</it>. In particular, <it>copia/Ty1 </it>families in <it>Arabidopsis </it>are highly divergent from one another <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
<abbr bid="B30">30</abbr>
</abbrgrp>. Non-LTR TA elements are also present in few copies per genome from distinct, evolutionarily ancient lineages <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>. This high divergence among element subfamilies and lack of orthologous sites in related species stands in stark contrast to primate non-LTR elements: L1s and Alus crowd mammalian genomes, with both currently active lineages as well as many defunct ancestral sites shared among humans and their most recent relatives (for example, <abbrgrp>
<abbr bid="B31">31</abbr>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
</abbrgrp>). Therefore, while the evolutionary trajectory of <it>Sadhu </it>elements is not dramatically different from that exhibited by some plant retroelements, it is unlike many more well-studied elements.</p>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>
<it>Sadhu </it>elements represent a previously little characterized retrotransposon family. We have generated a comprehensive classification scheme for these sequences based on phylogenetic analysis. Partial elements often contain 3' poly(A) tracts and target site duplications, consistent with an origin by target primed reverse transcription-driven retrotransposition. An examination of the <it>Sadhu5 </it>subfamily among different <it>A. thaliana </it>strains indicates that subfamily members arose through retrotransposition; the presence of polymorphic insertion sites provides evidence for retrotransposition in the recent history of the species. In addition, sequences at the target site are similar to the <it>Arabidopsis </it>SINE consensus, consistent with the hypothesis that the LINE machinery is responsible for the mobilization of both of these types of elements. <it>Sadhu</it>-related sequences identified in <it>A. lyrata </it>and <it>A. arenosa </it>cluster within specific <it>A. thaliana </it>subfamilies, indicating that the radiation of this element family preceded the divergence of the <it>Arabidopsis </it>genus. These <it>A. lyrata </it>and <it>A. arenosa </it>elements often contain poly(A) tracts and target site duplications, consistent with the model that these sequences also arose via retrotransposition. Taken together, these studies indicate that <it>Sadhu </it>elements have been active since the divergence of different <it>Arabidopsis </it>species, and through the differentiation of different <it>A. thaliana </it>strains. Further research is warranted to resolve the molecular origin and potential impact of this unique class of DNA sequence on genome structure and organization.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Plant materials</p>
</st>
<p>
<it>A. thaliana </it>strains were obtained from the <it>Arabidopsis </it>Biological Resource Center (ABRC, Columbus, OH, USA). Stock numbers are listed in Table <tblr tid="T2">2</tblr>. <it>A. arenosa </it>seeds were obtained from Craig Pikaard (Department of Biology, Indiana University, Bloomington, IN, USA). Plants were grown on soil or on 1 &#215; MS media with 1% sucrose. DNA was isolated using previously described methods <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Molecular biology</p>
</st>
<p>PCR was performed using standard conditions with <it>Taq </it>DNA polymerase (QIAGEN, Valencia, CA, USA) or KT1 polymerase (Clontech, Mountain View, CA, USA). Two rounds of TAIL PCR were performed on <it>A. arenosa </it>template using protocols and degenerate AD primers described previously <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. Products from the second round of TAIL PCR were isolated from agarose gel and TA cloned into pGEM-T Easy (Promega, Madison, WI, USA) before sequencing. All other PCR products were directly sequenced without an additional cloning step following purification through Performa DTR gel filtration cartridges (Edge BioSystems, Gaithersburg, MD, USA). DNA sequencing was performed using Big Dye Terminator Cycle Sequencing (PerkinElmer, Waltham, MA,, USA) protocols/reagents; sequences were processed at the Washington University Department of Biology sequencing facility. PCR primers used to generate the data in Tables <tblr tid="T2">2</tblr> and <tblr tid="T4">4</tblr> are described in Additional file <supplr sid="S2">2</supplr>. 'Internal' PCR primers were used to amplify sequence from different <it>A. thaliana </it>strains and to amplify homologs from <it>A. arenosa</it>. All sequences in this study have been deposited in the National Center for Biotechnology Information (NCBI) database. Genbank accession numbers are listed in Table <tblr tid="T3">3</tblr> (for <it>A. arenosa </it>sequences) and in the legend to Figure <figr fid="F3">3</figr> (for <it>A. thaliana </it>strain specific sequences).</p>
</sec>
<sec>
<st>
<p>Computational analysis</p>
</st>
<p>Full-length and partial <it>Sadhu </it>elements were identified based on sequence similarity to <it>At2 g01410 </it>as previously described <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. The maximum parsimony and neighbor joining trees in Figures <figr fid="F1">1</figr> and <figr fid="F5">5</figr> were generated using the software PAUP* V. 4.0 (Sinauer Associates, Sunderland, MA, USA) based on a ClustalX alignment <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Divergence matrices in Additional file <supplr sid="S1">1</supplr> were generated based on a ClustalX alignment using the European Molecular Biology Open Software Suite (EMBOSS) program 'distmat' <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp> run without corrections. Consensus sequences of different subfamilies were generated from full-length and derivative sequences using the EMBOSS program 'cons' <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>. Alignments in Figure <figr fid="F3">3</figr> were visualized by ClustalX <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. WebLogo <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp> was used to create the logo images in Figure <figr fid="F4">4</figr> that describe the retrotransposition target consensus sites. Annotations of features within TAIL PCR products in Additional file <supplr sid="S3">3</supplr> were aided by the repeat masker feature on the Censor server <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp> and the TAIR WU-BLAST server <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>. <it>A. lyrata </it>sequence information was obtained using the database, browser, and BLAST tools at the Joint Genome Institute (JGI) <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>. <it>A. lyrata Sadhu </it>elements were identified by iterative BLAST searches of the JGI assembly using, initially, <it>A. thaliana </it>and then <it>A. lyrata Sadhu </it>sequences as queries until a self-referencing set of sequences was identified. The classification scheme in Table <tblr tid="T1">1</tblr> and locus ID and nucleotide positions for full-length elements have been submitted to both The <it>Arabidopsis </it>Information Resource (TAIR) <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp> as well as the repeat database at the Genetic Information Research Institute (GIRI) <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>BLAST: basic local alignment search tool; GIRI: Genetic Information Research Institute; JGI: Joint Genome Institute; LINE: long interspersed nuclear element; LTR: long terminal repeat; SINE: short interspersed nuclear element; TAIL PCR: thermal asymmetric interlaced polymerase chain reaction; TAIR: The <it>Arabidopsis </it>Information Resource; TSD: target site duplication.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>SHR designed and performed all experiments, conducted analysis and drafted the manuscript. EJR conducted analysis and revised and approved the manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We thank members of the Richards lab and the transposable element community for stimulating discussions on <it>Sadhu </it>elements over the years. This research was supported by a grant to EJR from the National Science Foundation (MCB-0548597).</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Meiotically stable natural epialleles of <it>Sadhu</it>, a novel <it>Arabidopsis </it>retroposon</p></title><aug><au><snm>Rangwala</snm><fnm>SH</fnm></au><au><snm>Elumalai</snm><fnm>R</fnm></au><au><snm>Vanier</snm><fnm>C</fnm></au><au><snm>Ozkan</snm><fnm>H</fnm></au><au><snm>Galbraith</snm><fnm>DW</fnm></au><au><snm>Richards</snm><fnm>EJ</fnm></au></aug><source>PLoS Genet</source><pubdate>2006</pubdate><volume>2</volume><fpage>e36</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0020036</pubid><pubid idtype="pmcid">1401498,1401498</pubid><pubid idtype="pmpid">16552445</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>SINEs and LINEs: the art of biting the hand that feeds you</p></title><aug><au><snm>Weiner</snm><fnm>AM</fnm></au></aug><source>Curr Opin Cell Biol</source><pubdate>2002</pubdate><volume>14</volume><fpage>343</fpage><lpage>350</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0955-0674(02)00338-1</pubid><pubid idtype="pmpid" link="fulltext">12067657</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The GA octodinucleotide repeat binding factor BBR participates in the transcriptional regulation of the homeobox gene Bkn3</p></title><aug><au><snm>Santi</snm><fnm>L</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Stile</snm><fnm>MR</fnm></au><au><snm>Berendzen</snm><fnm>K</fnm></au><au><snm>Wanke</snm><fnm>D</fnm></au><au><snm>Roig</snm><fnm>C</fnm></au><au><snm>Pozzi</snm><fnm>C</fnm></au><au><snm>M&#252;ller</snm><fnm>K</fnm></au><au><snm>M&#252;ller</snm><fnm>J</fnm></au><au><snm>Rohde</snm><fnm>W</fnm></au><au><snm>Salamini</snm><fnm>F</fnm></au></aug><source>Plant J</source><pubdate>2003</pubdate><volume>34</volume><fpage>813</fpage><lpage>826</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-313X.2003.01767.x</pubid><pubid idtype="pmpid" link="fulltext">12795701</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Identification of a soybean protein that interacts with GAGA element dinucleotide repeat DNA</p></title><aug><au><snm>Sangwan</snm><fnm>I</fnm></au><au><snm>O'Brian</snm><fnm>MR</fnm></au></aug><source>Plant Physiol</source><pubdate>2002</pubdate><volume>129</volume><fpage>1788</fpage><lpage>1794</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1104/pp.002618</pubid><pubid idtype="pmcid">166767</pubid><pubid idtype="pmpid">12177492</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Chromatin. Ga-ga over GAGA factor</p></title><aug><au><snm>Granok</snm><fnm>H</fnm></au><au><snm>Leibovitch</snm><fnm>BA</fnm></au><au><snm>Shaffer</snm><fnm>CD</fnm></au><au><snm>Elgin</snm><fnm>SC</fnm></au></aug><source>Curr Biol</source><pubdate>1995</pubdate><volume>5</volume><fpage>238</fpage><lpage>241</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0960-9822(95)00048-0</pubid><pubid idtype="pmpid" link="fulltext">7780729</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Differential epigenetics regulation within an <it>Arabidopsis </it>retroposon family</p></title><aug><au><snm>Rangwala</snm><fnm>SH</fnm></au><au><snm>Richards</snm><fnm>EJ</fnm></au></aug><source>Genetics</source><pubdate>2007</pubdate><volume>176</volume><fpage>151</fpage><lpage>160</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.107.071092</pubid><pubid idtype="pmcid">1893068</pubid><pubid idtype="pmpid">17339215</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Penelope-like elements - a new class of retroelements: distribution, function and possible evolutionary significance</p></title><aug><au><snm>Evgen'ev</snm><fnm>MB</fnm></au><au><snm>Arkhipova</snm><fnm>IR</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><fpage>510</fpage><lpage>521</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084984</pubid><pubid idtype="pmpid" link="fulltext">16093704</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>copia-like retrotransposons are ubiquitous among plants</p></title><aug><au><snm>Voytas</snm><fnm>DF</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Koniczny</snm><fnm>A</fnm></au><au><snm>Ausubel</snm><fnm>FM</fnm></au><au><snm>Rodermel</snm><fnm>SR</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1992</pubdate><volume>89</volume><fpage>7124</fpage><lpage>7128</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.89.15.7124</pubid><pubid idtype="pmcid">49658</pubid><pubid idtype="pmpid">1379734</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>The <it>Arabidopsis </it>Information Resource (TAIR): gene structure and function annotation</p></title><aug><au><snm>Swarbreck</snm><fnm>D</fnm></au><au><snm>Wilks</snm><fnm>C</fnm></au><au><snm>Lamesch</snm><fnm>P</fnm></au><au><snm>Berardini</snm><fnm>TZ</fnm></au><au><snm>Garcia-Hernandez</snm><fnm>M</fnm></au><au><snm>Foerster</snm><fnm>H</fnm></au><au><snm>Li</snm><fnm>D</fnm></au><au><snm>Meyer</snm><fnm>T</fnm></au><au><snm>Muller</snm><fnm>R</fnm></au><au><snm>Ploetz</snm><fnm>L</fnm></au><au><snm>Radenbaugh</snm><fnm>A</fnm></au><au><snm>Singh</snm><fnm>S</fnm></au><au><snm>Swing</snm><fnm>V</fnm></au><au><snm>Tissier</snm><fnm>C</fnm></au><au><snm>Zhang</snm><fnm>P</fnm></au><au><snm>Huala</snm><fnm>E</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><volume>36</volume><fpage>D1009</fpage><lpage>1014</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkm965</pubid><pubid idtype="pmcid">2238962</pubid><pubid idtype="pmpid">17986450</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>End-to-end template jumping by the reverse transcriptase encoded by the R2 retrotransposon</p></title><aug><au><snm>Bibillo</snm><fnm>A</fnm></au><au><snm>Eickbush</snm><fnm>TH</fnm></au></aug><source>J Biol Chem</source><pubdate>2004</pubdate><volume>279</volume><fpage>14945</fpage><lpage>14953</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M310450200</pubid><pubid idtype="pmpid" link="fulltext">14752111</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Retroelements and formation of chimeric retrogenes</p></title><aug><au><snm>Buzdin</snm><fnm>AA</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2004</pubdate><volume>61</volume><fpage>2046</fpage><lpage>2059</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00018-004-4041-z</pubid><pubid idtype="pmpid" link="fulltext">15316654</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Genome sequence comparison of Col and Ler lines reveals the dynamic nature of <it>Arabidopsis </it>chromosomes</p></title><aug><au><snm>Ziolkowski</snm><fnm>PA</fnm></au><au><snm>Koczyk</snm><fnm>G</fnm></au><au><snm>Galganski</snm><fnm>L</fnm></au><au><snm>Sadowski</snm><fnm>J</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>3189</fpage><lpage>3201</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp183</pubid><pubid idtype="pmcid">2691826</pubid><pubid idtype="pmpid">19305000</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition</p></title><aug><au><snm>Luan</snm><fnm>DD</fnm></au><au><snm>Korman</snm><fnm>MH</fnm></au><au><snm>Jakubczak</snm><fnm>JL</fnm></au><au><snm>Eickbush</snm><fnm>TH</fnm></au></aug><source>Cell</source><pubdate>1993</pubdate><volume>72</volume><fpage>595</fpage><lpage>605</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(93)90078-5</pubid><pubid idtype="pmpid" link="fulltext">7679954</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition</p></title><aug><au><snm>Feng</snm><fnm>Q</fnm></au><au><snm>Moran</snm><fnm>JV</fnm></au><au><snm>Kazazian</snm><fnm>HH</fnm><suf>Jr</suf></au><au><snm>Boeke</snm><fnm>JD</fnm></au></aug><source>Cell</source><pubdate>1996</pubdate><volume>87</volume><fpage>905</fpage><lpage>916</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(00)81997-2</pubid><pubid idtype="pmpid" link="fulltext">8945517</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons</p></title><aug><au><snm>Jurka</snm><fnm>J</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1997</pubdate><volume>94</volume><fpage>1872</fpage><lpage>1877</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.94.5.1872</pubid><pubid idtype="pmcid">20010</pubid><pubid idtype="pmpid">9050872</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Molecular archeology of L1 insertions in the human genome</p></title><aug><au><snm>Szak</snm><fnm>ST</fnm></au><au><snm>Pickeral</snm><fnm>OK</fnm></au><au><snm>Makalowski</snm><fnm>W</fnm></au><au><snm>Boguski</snm><fnm>MS</fnm></au><au><snm>Landsman</snm><fnm>D</fnm></au><au><snm>Boeke</snm><fnm>JD</fnm></au></aug><source>Genome Biol</source><pubdate>2002</pubdate><volume>3</volume><fpage>0052</fpage><xrefbib><pubid idtype="doi">10.1186/gb-2002-3-10-research0052</pubid></xrefbib></bibl><bibl id="B17"><title><p>LINEs, SINEs and processed pseudogenes: parasitic strategies for genome modeling</p></title><aug><au><snm>Dewannieux</snm><fnm>M</fnm></au><au><snm>Heidmann</snm><fnm>T</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><fpage>35</fpage><lpage>48</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084936</pubid><pubid idtype="pmpid" link="fulltext">16093656</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Identification and structural analysis of SINE elements in the <it>Arabidopsis thaliana </it>genome</p></title><aug><au><snm>Myouga</snm><fnm>F</fnm></au><au><snm>Tsuchimoto</snm><fnm>S</fnm></au><au><snm>Noma</snm><fnm>K</fnm></au><au><snm>Ohtsubo</snm><fnm>H</fnm></au><au><snm>Ohtsubo</snm><fnm>E</fnm></au></aug><source>Genes Genet Syst</source><pubdate>2001</pubdate><volume>76</volume><fpage>169</fpage><lpage>179</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1266/ggs.76.169</pubid><pubid idtype="pmpid" link="fulltext">11569500</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Molecular paleontology of transposable elements from <it>Arabidopsis thaliana</it></p></title><aug><au><snm>Kapitonov</snm><fnm>VV</fnm></au><au><snm>Jurka</snm><fnm>J</fnm></au></aug><source>Genetica</source><pubdate>1999</pubdate><volume>107</volume><fpage>27</fpage><lpage>37</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1023/A:1004030922447</pubid><pubid idtype="pmpid" link="fulltext">10952195</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Multiple non-LTR retrotransposons in the genome of <it>Arabidopsis thaliana</it></p></title><aug><au><snm>Wright</snm><fnm>DA</fnm></au><au><snm>Ke</snm><fnm>N</fnm></au><au><snm>Smalle</snm><fnm>J</fnm></au><au><snm>Hauge</snm><fnm>BM</fnm></au><au><snm>Goodman</snm><fnm>HM</fnm></au><au><snm>Voytas</snm><fnm>DF</fnm></au></aug><source>Genetics</source><pubdate>1996</pubdate><volume>142</volume><fpage>569</fpage><lpage>578</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1206989</pubid><pubid idtype="pmpid">8852854</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Genome-wide comparative analysis of the transposable elements in the related species <it>Arabidopsis thaliana </it>and <it>Brassica oleracea</it></p></title><aug><au><snm>Zhang</snm><fnm>X</fnm></au><au><snm>Wessler</snm><fnm>SR</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2004</pubdate><volume>101</volume><fpage>5589</fpage><lpage>5594</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0401243101</pubid><pubid idtype="pmcid">397431</pubid><pubid idtype="pmpid">15064405</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Poorly known relatives of <it>Arabidopsis thaliana</it></p></title><aug><au><snm>Clauss</snm><fnm>MJ</fnm></au><au><snm>Koch</snm><fnm>MA</fnm></au></aug><source>Trends Plant Sci</source><pubdate>2006</pubdate><volume>11</volume><fpage>449</fpage><lpage>459</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tplants.2006.07.005</pubid><pubid idtype="pmpid" link="fulltext">16893672</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Evolution and genetic differentiation among relatives of <it>Arabidopsis thaliana</it></p></title><aug><au><snm>Koch</snm><fnm>MA</fnm></au><au><snm>Matschinger</snm><fnm>M</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><fpage>6272</fpage><lpage>6277</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0701338104</pubid><pubid idtype="pmcid">1851049</pubid><pubid idtype="pmpid">17404224</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p><it>Brassica </it>sequence</p></title><url>http://brassica.bbsrc.ac.uk/</url></bibl><bibl id="B25"><title><p>Analysis of the genome sequence of the flowering plant <it>Arabidopsis thaliana</it></p></title><aug><au><cnm>Arabidopsis Genome Initiative</cnm></au></aug><source>Nature</source><pubdate>2000</pubdate><volume>408</volume><fpage>796</fpage><lpage>815</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35048692</pubid><pubid idtype="pmpid" link="fulltext">11130711</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Comparative evolution history of SINEs in <it>Arabidopsis thaliana </it>and <it>Brassica oleracea</it>: evidence for a high rate of SINE loss</p></title><aug><au><snm>Lenoir</snm><fnm>A</fnm></au><au><snm>Pelissier</snm><fnm>T</fnm></au><au><snm>Bousquet-Antonelli</snm><fnm>C</fnm></au><au><snm>Deragon</snm><fnm>JM</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><fpage>441</fpage><lpage>447</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084976</pubid><pubid idtype="pmpid" link="fulltext">16093696</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Insertion bias and purifying selection of retrotransposons in the <it>Arabidopsis thaliana </it>genome</p></title><aug><au><snm>Pereira</snm><fnm>V</fnm></au></aug><source>Genome Biol</source><pubdate>2004</pubdate><volume>5</volume><fpage>R79</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2004-5-10-r79</pubid><pubid idtype="pmcid">545599</pubid><pubid idtype="pmpid">15461797</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>A superfamily of <it>Arabidopsis thaliana </it>retrotransposons</p></title><aug><au><snm>Konieczny</snm><fnm>A</fnm></au><au><snm>Voytas</snm><fnm>DF</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Ausubel</snm><fnm>FM</fnm></au></aug><source>Genetics</source><pubdate>1991</pubdate><volume>127</volume><fpage>801</fpage><lpage>809</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1204407</pubid><pubid idtype="pmpid">1709409</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Structural and evolutionary analysis of the copia-like elements in the <it>Arabidopsis thaliana </it>genome</p></title><aug><au><snm>Terol</snm><fnm>J</fnm></au><au><snm>Castillo</snm><fnm>MC</fnm></au><au><snm>Bargues</snm><fnm>M</fnm></au><au><snm>Perez-Alonso</snm><fnm>M</fnm></au><au><snm>de Frutos</snm><fnm>R</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2001</pubdate><volume>18</volume><fpage>882</fpage><lpage>892</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11319272</pubid></xrefbib></bibl><bibl id="B30"><title><p>The structure, distribution and evolution of the Ta1 retrotransposable element family of <it>Arabidopsis thaliana</it></p></title><aug><au><snm>Voytas</snm><fnm>DF</fnm></au><au><snm>Konieczny</snm><fnm>A</fnm></au><au><snm>Cummings</snm><fnm>MP</fnm></au><au><snm>Ausubel</snm><fnm>FM</fnm></au></aug><source>Genetics</source><pubdate>1990</pubdate><volume>126</volume><fpage>713</fpage><lpage>721</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1204225</pubid><pubid idtype="pmpid">2174394</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Initial sequence of the chimpanzee genome and comparison with the human genome</p></title><aug><au><cnm>Chimpanzee Sequencing and Analysis Consortium</cnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>437</volume><fpage>69</fpage><lpage>87</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04072</pubid><pubid idtype="pmpid" link="fulltext">16136131</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Different evolutionary fates of recently integrated human and chimpanzee LINE-1 retrotransposons</p></title><aug><au><snm>Lee</snm><fnm>J</fnm></au><au><snm>Cordaux</snm><fnm>R</fnm></au><au><snm>Han</snm><fnm>K</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Hedges</snm><fnm>DJ</fnm></au><au><snm>Liang</snm><fnm>P</fnm></au><au><snm>Batzer</snm><fnm>MA</fnm></au></aug><source>Gene</source><pubdate>2007</pubdate><volume>390</volume><fpage>18</fpage><lpage>27</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2006.08.029</pubid><pubid idtype="pmcid">1847406</pubid><pubid idtype="pmpid">17055192</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Comparative analysis of Alu repeats in primate genomes</p></title><aug><au><snm>Liu</snm><fnm>GE</fnm></au><au><snm>Alkan</snm><fnm>C</fnm></au><au><snm>Jiang</snm><fnm>L</fnm></au><au><snm>Zhao</snm><fnm>S</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>876</fpage><lpage>885</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.083972.108</pubid><pubid idtype="pmcid">2675976</pubid><pubid idtype="pmpid">19411604</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Pl-Bh, an anthocyanin regulatory gene of maize that leads to variegated pigmentation</p></title><aug><au><snm>Cocciolone</snm><fnm>SM</fnm></au><au><snm>Cone</snm><fnm>KC</fnm></au></aug><source>Genetics</source><pubdate>1993</pubdate><volume>135</volume><fpage>575</fpage><lpage>588</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1205657</pubid><pubid idtype="pmpid">7694886</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Efficient isolation and mapping of <it>Arabidopsis thaliana </it>T-DNA insert junctions by thermal asymmetric interlaced PCR</p></title><aug><au><snm>Liu</snm><fnm>YG</fnm></au><au><snm>Mitsukawa</snm><fnm>N</fnm></au><au><snm>Oosumi</snm><fnm>T</fnm></au><au><snm>Whittier</snm><fnm>RF</fnm></au></aug><source>Plant J</source><pubdate>1995</pubdate><volume>8</volume><fpage>457</fpage><lpage>463</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-313X.1995.08030457.x</pubid><pubid idtype="pmpid" link="fulltext">7550382</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools</p></title><aug><au><snm>Thompson</snm><fnm>JD</fnm></au><au><snm>Gibson</snm><fnm>TJ</fnm></au><au><snm>Plewniak</snm><fnm>F</fnm></au><au><snm>Jeanmougin</snm><fnm>F</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1997</pubdate><volume>25</volume><fpage>4876</fpage><lpage>4882</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/25.24.4876</pubid><pubid idtype="pmcid">147148</pubid><pubid idtype="pmpid">9396791</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>EMBOSS: the European Molecular Biology Open Software Suite</p></title><aug><au><snm>Rice</snm><fnm>P</fnm></au><au><snm>Longden</snm><fnm>I</fnm></au><au><snm>Bleasby</snm><fnm>A</fnm></au></aug><source>Trends Genet</source><pubdate>2000</pubdate><volume>16</volume><fpage>276</fpage><lpage>277</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0168-9525(00)02024-2</pubid><pubid idtype="pmpid" link="fulltext">10827456</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>WebLogo: a sequence logo generator</p></title><aug><au><snm>Crooks</snm><fnm>GE</fnm></au><au><snm>Hon</snm><fnm>G</fnm></au><au><snm>Chandonia</snm><fnm>JM</fnm></au><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Genome Res</source><pubdate>2004</pubdate><volume>14</volume><fpage>1188</fpage><lpage>1190</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.849004</pubid><pubid idtype="pmcid">419797</pubid><pubid idtype="pmpid">15173120</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Censor server</p></title><url>http://www.girinst.org/censor/</url></bibl><bibl id="B40"><title><p>TAIR WU-BLAST server</p></title><url>http://www.arabidopsis.org/wublast/index2.jsp</url></bibl><bibl id="B41"><title><p>Joint Genome Institute BLAST tools</p></title><url>http://genome.jgi-psf.org/Araly1/Araly1.home.html</url></bibl><bibl id="B42"><title><p>Repbase Update, a database of eukaryotic repetitive elements</p></title><aug><au><snm>Jurka</snm><fnm>J</fnm></au><au><snm>Kapitonov</snm><fnm>VV</fnm></au><au><snm>Pavlicek</snm><fnm>A</fnm></au><au><snm>Klonowski</snm><fnm>P</fnm></au><au><snm>Kohany</snm><fnm>O</fnm></au><au><snm>Walichiewicz</snm><fnm>J</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><fpage>462</fpage><lpage>467</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084979</pubid><pubid idtype="pmpid" link="fulltext">16093699</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>