BLASTX nr result

ID: Cocculus22_contig00008921 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00008921
         (1712 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   233   2e-58
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   232   3e-58
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               229   2e-57
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   224   1e-55
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   219   3e-54
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...   219   4e-54
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       218   9e-54
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   212   4e-52
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   211   6e-52
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           211   8e-52
gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]       211   1e-51
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   208   7e-51
gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali...   194   8e-47
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               193   2e-46
gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA...   192   4e-46
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   189   4e-45
emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   189   4e-45
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...   185   5e-44
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   184   8e-44
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   183   2e-43

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  233 bits (594), Expect = 2e-58
 Identities = 141/456 (30%), Positives = 222/456 (48%), Gaps = 23/456 (5%)
 Frame = +3

Query: 90   TGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYI 269
            +GQLPV++LGLP+I  +LS  DC PL+    +++  W ++ LSYAGRL L+ SVL S   
Sbjct: 764  SGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICN 823

Query: 270  FWTGAFPIPYSVCSKLESLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMN 437
            FW  AF +P     +LE +  +FL   +++      ISW  +C+P +EGGLG+R +K+ N
Sbjct: 824  FWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEAN 883

Query: 438  KAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIW-TATIPNDVSWVYRRILKIRNQFAHH 614
                 KL+W I S   SLWV+W+    LRN S W      +  SW+++++LK R      
Sbjct: 884  DVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTL 943

Query: 615  CFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV----REAGYWNSP 782
                VGNG  T FW   W   G LL+  G+   +   I RR T++E     R+  + N  
Sbjct: 944  SKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDV 1003

Query: 783  PS-SSPMVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEW 950
             +     ++ +W           + ED+ +W   S      FS    W   R       W
Sbjct: 1004 YNVIEDALKKSW-------DTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVPW 1056

Query: 951  TELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECP 1124
             +++WF    PK SF  W     +LPT D++  +  G  + C  C   +E+ DHLFF C 
Sbjct: 1057 HKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTCS 1116

Query: 1125 FSSEVWMRIKVKCWRNVQVVRGRFQESQT--------ILRSFGLNRADGVIRKLCYTVTV 1280
            F+S +W          V + RG F+   T         + +   +R +  +R+  +  T+
Sbjct: 1117 FTSVIW----------VDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRRYVFQATI 1166

Query: 1281 HFIWWERNMRLFNKGWRSATRLAEEIIQLVHQKVST 1388
            + +W ERN R   +   +A++L   I + +  ++S+
Sbjct: 1167 YIVWRERNGRRHGEPPNTASQLVGWIDKQIRNQLSS 1202


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  232 bits (592), Expect = 3e-58
 Identities = 139/452 (30%), Positives = 224/452 (49%), Gaps = 17/452 (3%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G+LPV++LGLP++  +L+ +D +PL+    R++  W ++ LS+AGRL L+ SVL S   F
Sbjct: 486  GKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLWSITNF 545

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMNK 440
            W  AF +P    +++  +  + L    +L      +SW  IC+P +EGGLG++ +++ NK
Sbjct: 546  WMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSLREANK 605

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617
                KL+W + S + SLWV+W     L+  S W+    + + SW++RR+LK R      C
Sbjct: 606  VSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREVAKSFC 665

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797
               V NG  T FW   W  +G L++  G    +   I R  T+ E      W+       
Sbjct: 666  KIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAEA-----WSRRRRKRH 720

Query: 798  MVRTAWRQFQQI-----PKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWT 953
             V     +F++I          + ED  +W          FS    W  IR   +   W 
Sbjct: 721  RVEIL-NEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWH 779

Query: 954  ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPF 1127
            + VWF    PK SF  W  + ++L T D++  +  G  + C  C + +E+ DHLFF+C +
Sbjct: 780  KGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCY 839

Query: 1128 SSEVWMRIKVKCWRNVQVVRGRFQESQTI--LRSFGLNRADGVIRKLCYTVTVHFIWWER 1301
            SSE+W  I     +NV   R   + S  +  +     +R    + +  + V++H IW ER
Sbjct: 840  SSEIWTSIA----KNVYKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHSIWRER 895

Query: 1302 NMRLFNKGWRSATRLAEEIIQLVHQKVSTSKK 1397
            N R   +  RSA+ L  +I + +  ++ST KK
Sbjct: 896  NSRRHGEKSRSASNLIRQIDKTIRNQLSTIKK 927


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  229 bits (585), Expect = 2e-57
 Identities = 129/418 (30%), Positives = 207/418 (49%), Gaps = 12/418 (2%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            GQLPV++LGLP++  +L+ AD +PL+    +++  W  +  S+AGR  L+KSVL S   F
Sbjct: 412  GQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSICNF 471

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHGKSKL----RLISWATICRPLEEGGLGIRRIKDMNK 440
            W  AF +P     +++ L  SFL   S++      ISW  +C+P  EGGLG+R +K+ N 
Sbjct: 472  WLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNLKEAND 531

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617
                KL+W I S+  SLW +W+    +R  SIW+      + SW++R+ILKIR+      
Sbjct: 532  VSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDVAKSFS 591

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797
               VGNG++  FW   W   G L+DT G+   +   IPR A++ +           +S  
Sbjct: 592  RVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRRHRTSLL 651

Query: 798  MVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWF 968
                    +Q+I     D ED  +W   +      FS    W  I+       W + VWF
Sbjct: 652  NEIEEMMAYQRIHH--SDAEDTVLWRGKNDVFKPHFSTRDTWHLIKATSSTVSWHKGVWF 709

Query: 969  YDKIPKCSFTCWRMLLSKLPTKDKLTRFGA----QSRCELCWAGVESEDHLFFECPFSSE 1136
                PK +   W  + ++LPT D++ ++ +       C LC    ++ +HLFF C ++S 
Sbjct: 710  RHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSCSYAST 769

Query: 1137 VWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMR 1310
            VW  +    W+       R+    T + +   +R +G + +  +  T++ +W ERN R
Sbjct: 770  VWAALAKGIWKTRYST--RWSHLLTHISTHFQDRVEGFLTRYIFQATIYHVWRERNGR 825


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  224 bits (570), Expect = 1e-55
 Identities = 142/441 (32%), Positives = 216/441 (48%), Gaps = 21/441 (4%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G LPVK+LGLP++  +++ +D  PLV     ++  W  + LS+AGRL+L+KSVL S   F
Sbjct: 912  GTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNF 971

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440
            W   F +P +   ++E +  +FL       +K   I+W+ +C+  EEGGLG++ +K+ N+
Sbjct: 972  WLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANE 1031

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617
              L KL+W I S++ SLWV+W++   +R  + W+      + SW++R+ILK R++     
Sbjct: 1032 VSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFH 1091

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV--------REAGYW 773
               V +G  T FW   W P G L    G    +   IP  AT+ EV          A + 
Sbjct: 1092 RMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKRHRADFL 1151

Query: 774  NSPPSSSPMVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHYDVWEWT 953
            N   S   + R   R       L   +ED F  S      FS +  W+QIR      +W 
Sbjct: 1152 NQIKSQIELARQD-RSTDGDRSLWKQKEDTFKSS------FSSSKTWQQIRSISLRCDWY 1204

Query: 954  ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPF 1127
              VWF    PK SF  W    ++L T DK+ ++  GA+  C  C   +E+ DHLFF CP+
Sbjct: 1205 RGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRDHLFFSCPY 1264

Query: 1128 SSEVWMRIK--VKCWRNV----QVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFI 1289
            SS VW  +   +   RN+     +       S+  L  F L  A        +  ++H +
Sbjct: 1265 SSHVWFSLTKGLLNGRNILNWNLITPHLLDSSRPYLHVFTLRYA--------FQASIHSL 1316

Query: 1290 WWERNMRLFNKGWRSATRLAE 1352
            W ERN R   +    A +LA+
Sbjct: 1317 WRERNCRRHGETAIPAAKLAK 1337


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  219 bits (558), Expect = 3e-54
 Identities = 137/452 (30%), Positives = 218/452 (48%), Gaps = 21/452 (4%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            GQLPV++LGLP++  +L+  D +PL      ++  W ++ LS+AGRL L+ SVL S+  F
Sbjct: 170  GQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNF 229

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMNK 440
            W  AF +P +   ++ S+  +FL    +L      +SW  IC+P +EGGLG+R + + N 
Sbjct: 230  WMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANV 289

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV--SWVYRRILKIRNQFAHH 614
              + KL+W + S+  SLWV+W     L+  S W+ T PN    SW+++++LK R      
Sbjct: 290  VSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLT-PNSSLGSWMWKKMLKYRETAKPF 348

Query: 615  CFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSS 794
                V NG  T FW   W   G L+D  G+  ++   I R  T+ E      W++     
Sbjct: 349  SRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEA-----WSNRRRRK 403

Query: 795  PM------VRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWE 947
                    +  A  Q  Q   L    ED  +W          FS    W Q+R   +   
Sbjct: 404  HRTEQLNDIEAALNQKYQTRNL--LREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVA 461

Query: 948  WTELVWFYDKIPKCSFTCWRMLLSKLPT--KDKLTRFGAQSRCELCWAGVESEDHLFFEC 1121
            W + VWF    PK  F  W  L ++L T  + +L   G+  +C  C   +E+ DHLFF C
Sbjct: 462  WYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSC 521

Query: 1122 PFSSEVWMRIKVKCWRNVQVVRGRFQ-ESQTILRSFGLNRADGV---IRKLCYTVTVHFI 1289
             ++S +W  I         V++ RF  + QTI+      + D +   + +  + +TVH +
Sbjct: 522  SYASAIWTAIA------KNVLQHRFSTDWQTIVNYISETQTDRIRSFLSRYIFQLTVHTV 575

Query: 1290 WWERNMRLFNKGWRSATRLAEEIIQLVHQKVS 1385
            W ERN R   +  R++  L   + + +  ++S
Sbjct: 576  WKERNDRRHGEEPRTSANLISWMDKQIRNQLS 607


>emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1|
            putative protein [Arabidopsis thaliana]
          Length = 473

 Score =  219 bits (557), Expect = 4e-54
 Identities = 143/453 (31%), Positives = 214/453 (47%), Gaps = 19/453 (4%)
 Frame = +3

Query: 141  LSIADCAPLVGMFAR-KLEGWQAKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKL 317
            L +  C  +V MF+R K+  W A+ LSYAGRL L+ SVL S   FW GAF +P     ++
Sbjct: 30   LDVTHCN-IVTMFSRQKICSWSARFLSYAGRLNLISSVLWSICNFWMGAFRLPRDCIREI 88

Query: 318  ESLMGSFLHGKSKLRL----ISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKK 485
            + +  ++L    +L      I+WA +C+P EEGGLG+R +K+ N     KL+W I S   
Sbjct: 89   DKMCSAYLWSGGELNTSKAKITWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHAD 148

Query: 486  SLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHCFNLVGNGDATKFWLH 662
            SLWV+WI S  L+  S W       + SW++R+ILK R+     C   + NG  T FW  
Sbjct: 149  SLWVKWIQSSLLKKVSFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGARTSFWYD 208

Query: 663  RWHPQGMLLDTFGENCRLFTRIPRRATIKEV-----REAGYWNSPPSSSPMVRTAWRQFQ 827
             W   G L+D+ G+   +   I + AT+ E      R     N        +  +W    
Sbjct: 209  DWSDLGRLIDSAGDRGAIDLGINKHATVVEAWGNRRRRRHRTNFLNRVEERLILSWNSRN 268

Query: 828  QIPKLGCDEEDQFVWSPCPS---GLFSVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFT 998
            Q        ED+ +W    +    +FS    W  IR   +   W + VWF   IPK +F 
Sbjct: 269  Q-------AEDRALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFC 321

Query: 999  CWRMLLSKLPTKDKLT--RFGAQSRCELCWAGVESEDHLFFECPFSSEVWMRIKVKCWRN 1172
             W  + ++L T D++T    G  + C LC   +ES DHLFF CPF++E+W  +    +  
Sbjct: 322  MWLAVHNRLSTGDRMTLWNMGVDATCILCNKALESRDHLFFSCPFATEIWEPLAKTIYNT 381

Query: 1173 VQVVRGRFQESQTILRSFGLN---RADGVIRKLCYTVTVHFIWWERNMRLFNKGWRSATR 1343
                   + + QTI+ +   N   R  G + +    VT++ +W ERN R       S++R
Sbjct: 382  C-----FYTDWQTIINNVSRNWPDRIAGFLARCILQVTIYTLWRERNERKHGASPNSSSR 436

Query: 1344 LAEEIIQLVHQKVSTSKKLLSNSLDGEFLAQFW 1442
            L   I + +   +   K+      D  F  Q W
Sbjct: 437  LISWIDKHIRNHLMAIKQSGDRRFDRGF--QVW 467


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  218 bits (554), Expect = 9e-54
 Identities = 140/446 (31%), Positives = 209/446 (46%), Gaps = 16/446 (3%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G LP+++LGLP++  KL IA+  PL+     +   W  K LS+AGR++L+ SV+  S  F
Sbjct: 758  GTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINF 817

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440
            W   F +P     ++ESL   FL      ++K   +SWA +C P  EGGLG+RR+ + NK
Sbjct: 818  WMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNK 877

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCF 620
                +L+W ++ +K SLW  W H   L   S W        SW ++R+L +R        
Sbjct: 878  TLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLV 937

Query: 621  NLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPM 800
              VGNG    +W   W   G L    G+      R+P  A +        W  P S S  
Sbjct: 938  CKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAP 997

Query: 801  VRTAWRQF--QQIPKLGCDEEDQFVWSP----CPSGLFSVASAWEQIRHHYDVWEWTELV 962
             +          +P    ++ D++ WS     C    FS A  WE IR    V  W   +
Sbjct: 998  AKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQG--FSAAKTWEAIRPKATVKSWASSI 1055

Query: 963  WFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRCELCWAGVESEDHLFFECPFSSE 1136
            WF   +PK +F  W   L++L T+ +L  +G      C LC    ES DHL   C FS++
Sbjct: 1056 WFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASESRDHLLLICEFSAQ 1115

Query: 1137 VWMRIKVKCWRNVQVVRGRFQESQTILRSF---GLNRADGVIRKLCYTVTVHFIWWERNM 1307
            VW  +    +R +   R R   S + L S+       A  ++RK+   V V+ +W +RN 
Sbjct: 1116 VWRLV----FRRI-CPRQRLFSSWSELLSWVRQSSPEAPPLLRKIVSQVVVYNLWRQRNN 1170

Query: 1308 RLFNKGWRSATRLAEEII-QLVHQKV 1382
             L N     + RLA  +I +LV +++
Sbjct: 1171 LLHN-----SLRLAPAVIFKLVDREI 1191


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  212 bits (540), Expect = 4e-52
 Identities = 127/445 (28%), Positives = 201/445 (45%), Gaps = 13/445 (2%)
 Frame = +3

Query: 87   ITGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSY 266
            + G  P ++LGLP++  KL  +D + L+   A +   W  K LS+AGRL+L+ SV+ S+ 
Sbjct: 755  VNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTV 814

Query: 267  IFWTGAFPIPYSVCSKLESLMGSFLHGKSKLRL----ISWATICRPLEEGGLGIRRIKDM 434
             FW  +F +P      +E +   FL G    R     +SW   C P  EGGLG+R     
Sbjct: 815  NFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTW 874

Query: 435  NKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHH 614
            NK    +L+W +++ + SLWV W H+  LR+ + W A   +  SW+++ IL +R      
Sbjct: 875  NKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGLRPLAKRF 934

Query: 615  CFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPS-- 788
                VGNG    +W   W   G L++  G +    T I   A + E   +  W  P +  
Sbjct: 935  LRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEASSSTGWILPSART 994

Query: 789  -SSPMVRTAWRQFQQIPKLGCDEEDQFVW--SPCPSGLFSVASAWEQIRHHYDVWEWTEL 959
             ++ +              G   ED + W      S  FS    WE +R       W   
Sbjct: 995  RNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWECLRQRDTTKLWAAA 1054

Query: 960  VWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGAQ--SRCELCWAGVESEDHLFFECPFSS 1133
            VW+   IPK +F  W   L++LP + + T +     S C +C    E+ DHLF  C   S
Sbjct: 1055 VWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETETRDHLFIHCTLGS 1114

Query: 1134 EVWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRA--DGVIRKLCYTVTVHFIWWERNM 1307
             +W ++  +  R+       F+E + I+     N+    G ++KL     +  IW ERN 
Sbjct: 1115 LIWQQVLARFGRSQM-----FREWKDIIEWMLSNQGSFSGTLKKLAVQTAIFHIWKERNS 1169

Query: 1308 RLFNKGWRSATRLAEEIIQLVHQKV 1382
            RL +    S T + ++I + +   +
Sbjct: 1170 RLHSAMSASHTAIFKQIDRSIRDSI 1194


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  211 bits (538), Expect = 6e-52
 Identities = 136/442 (30%), Positives = 203/442 (45%), Gaps = 12/442 (2%)
 Frame = +3

Query: 24   MTLIYKITNKSWGF*CKNMKNITGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQ 203
            + L  +IT+ ++GF         G  P+++LGLP++  KL IAD  PL+   + +L  W 
Sbjct: 602  LDLSERITSAAYGF-------PAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWV 654

Query: 204  AKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKLESLMGSFLHGKS----KLRLIS 371
            +K LS+AGR +L+ SV+     FW   F +P     K+ESL   FL   S    K   +S
Sbjct: 655  SKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVS 714

Query: 372  WATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATI 551
            W   C P  EGGLG R   + NK  L +L+W ++    SLW QW     L + S W    
Sbjct: 715  WVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNA 774

Query: 552  PNDVSWVYRRILKIRNQFAHHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIP 731
                 W ++ +L +R          VGNG    FW   W   G L+   G+      RIP
Sbjct: 775  LQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIP 834

Query: 732  RRATIKEVREAGYWNSPPSSSPMVRTAWRQFQQIPKLG-CDEEDQFVWSPCPSGL----F 896
              A + +  +   W  P S S    +       +P        D + W  C   +    F
Sbjct: 835  FSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSW--CVDDVDCQGF 892

Query: 897  SVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRC 1070
            S A  WE +R    V  W + VWF   +PK +F  W   L++LPT+ +L  +G  + + C
Sbjct: 893  SAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAEC 952

Query: 1071 ELCWAGVESEDHLFFECPFSSEVWMRIKVK-CWRNVQVVRGRFQESQTILRSFGLNRADG 1247
             LC    E+ DHL   C FSS+VW  + ++ C R  Q +   + E  +  R      A  
Sbjct: 953  CLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPR--QRLLCTWAELLSWTRQ-STAAAPS 1009

Query: 1248 VIRKLCYTVTVHFIWWERNMRL 1313
            ++RK+   + V+ +W +RN+ L
Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVL 1031


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  211 bits (537), Expect = 8e-52
 Identities = 136/442 (30%), Positives = 202/442 (45%), Gaps = 12/442 (2%)
 Frame = +3

Query: 24   MTLIYKITNKSWGF*CKNMKNITGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQ 203
            + L  +IT+ ++GF         G  P+++LGLP++  KL IAD  PL+   + +L  W 
Sbjct: 602  LDLSERITSAAYGF-------PAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWV 654

Query: 204  AKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKLESLMGSFLHGKS----KLRLIS 371
            +K LS+AGR +L+ SV+     FW   F +P     K+ESL   FL   S    K   +S
Sbjct: 655  SKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVS 714

Query: 372  WATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATI 551
            W   C P  EGGLG R   + NK  L +L+W ++    SLW QW     L + S W    
Sbjct: 715  WVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNA 774

Query: 552  PNDVSWVYRRILKIRNQFAHHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIP 731
                 W ++ +L +R          VGNG    FW   W   G L+   G+      RIP
Sbjct: 775  LQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIP 834

Query: 732  RRATIKEVREAGYWNSPPSSSPMVRTAWRQFQQIPKLG-CDEEDQFVWSPCPSGL----F 896
              A + +  +   W  P S S    +       +P        D + W  C   +    F
Sbjct: 835  FSAKVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSW--CVDDVDCQGF 892

Query: 897  SVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRC 1070
            S A  WE +R    V  W   VWF   +PK +F  W   L++LPT+ +L  +G  + + C
Sbjct: 893  SAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAEC 952

Query: 1071 ELCWAGVESEDHLFFECPFSSEVWMRIKVK-CWRNVQVVRGRFQESQTILRSFGLNRADG 1247
             LC    E+ DHL   C FSS+VW  + ++ C R  Q +   + E  +  R      A  
Sbjct: 953  CLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCPR--QRLLCTWAELLSWTRQ-STAAAPS 1009

Query: 1248 VIRKLCYTVTVHFIWWERNMRL 1313
            ++RK+   + V+ +W +RN+ L
Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVL 1031


>gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]
          Length = 438

 Score =  211 bits (536), Expect = 1e-51
 Identities = 136/415 (32%), Positives = 200/415 (48%), Gaps = 17/415 (4%)
 Frame = +3

Query: 99   LPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWT 278
            LP+++LGLP++  KL I++  PLV     KL  W  K LS+AGRL+L+ SV+    +FW 
Sbjct: 31   LPIRYLGLPLMSRKLKISEFEPLVVKIKAKLNFWAVKSLSFAGRLQLLSSVISGIVVFWM 90

Query: 279  GAFPIPYSVCSKLESLMGSFL-------HGKSKLRLISWATICRPLEEGGLGIRRIKDMN 437
              F +P     ++ES+   FL       H K+K   +SW+T+C P  EGGLG+R+  + N
Sbjct: 91   STFRLPKGCIREIESMCARFLWSGGTDEHHKAK---VSWSTVCLPKAEGGLGVRKFTEWN 147

Query: 438  KAGLCKLLWWIYSSKKSLWVQW--IHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAH 611
             A   KL+W ++S+  SLWV W   H+     ++ W        SW +R +L++R   + 
Sbjct: 148  TALNLKLIWLLFSNSGSLWVAWHLFHNLSTSVSNFWLIKEGTTDSWNWRCLLRLRPLASK 207

Query: 612  HCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYW--NSPP 785
              F  +GNG    FW   W P G LL   G +     RIP  + + +V     W   SP 
Sbjct: 208  FLFCSIGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSKVADVVNGNRWLLPSPR 267

Query: 786  SSSPMVRTAWRQFQQIPKLGCDEEDQFVW--SPCPSGLFSVASAWEQIRHHYDVWEWTEL 959
            SS+ +   A+     IP L    ED ++W    C    FS A  W  +RH      W   
Sbjct: 268  SSNALNLHAFLTTLSIP-LQPLVEDSYLWKVENCSDIGFSSAHTWNALRHKEVEKPWVSS 326

Query: 960  VWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRCELCWAGVESEDHLFFECPFSS 1133
            VWF    PK +F  W     +L TK ++  +G      C LC  G E+ DHL   C FS 
Sbjct: 327  VWFKGVTPKNAFNMWITHQDRLRTKLRMIAWGFLVSPVCALCQVGFETRDHLMLSCDFSV 386

Query: 1134 EVWMRIKVKCWRNVQVVRGRFQE-SQTILRSFGLNR-ADGVIRKLCYTVTVHFIW 1292
             VW  ++ +    + +    FQ  S+ IL +   ++ A   +RKL     V+ +W
Sbjct: 387  SVWALVRQRIGTPLTI----FQNWSELILWTQNRSKAAPSTLRKLVAQAVVYALW 437


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  208 bits (529), Expect = 7e-51
 Identities = 133/443 (30%), Positives = 215/443 (48%), Gaps = 11/443 (2%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            GQLPV++LGLP++  +++ AD +PL+     K+  W A+ LSYAGRL L+ SV+ S   F
Sbjct: 1062 GQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIVSIANF 1121

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440
            W  A+ +P     ++E L  +FL        K   I+W++IC+P +EGGLGI+ + + NK
Sbjct: 1122 WMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSLAEANK 1181

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617
                KL+W + S++ SLWV WI +  +R  + W+A   + + SW+++++LK R       
Sbjct: 1182 VSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYRELAKSMH 1241

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV-REAGYWNSPPSSS 794
               V NG +T FW   W   G LLD  G    +   IP    ++ V R   +     +  
Sbjct: 1242 KVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLETVLRTHQHRQHRAAIY 1301

Query: 795  PMVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGL---FSVASAWEQIRHHYDVWEWTELVW 965
              +    ++ QQ  +      D  +W    +     F     W  +R H     W + VW
Sbjct: 1302 NRINAEIQRLQQQEREA--GPDISLWRSLKNDFNKRFITKVTWNNVRTHQPQQNWYKGVW 1359

Query: 966  FYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPFSSEV 1139
            F    PK SF  W  + ++L T D++  +  G    C LC    E+ DHLFF C ++S V
Sbjct: 1360 FPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDHLFFSCQYTSYV 1419

Query: 1140 WMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLFN 1319
            W  +  +   +    R  +    T+L +  L R    + +  +  +++ IW ERN R   
Sbjct: 1420 WEALTQRL-LSTNYSRD-WNRLFTLLCTSNLPRDHLFLFRYVFQASIYHIWRERNARRHG 1477

Query: 1320 KGWRSATRLAEEIIQLVHQKVST 1388
            +      RL + I + V  ++S+
Sbjct: 1478 EISSPTNRLIKLIDKTVRNRISS 1500


>gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  194 bits (494), Expect = 8e-47
 Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 15/354 (4%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G LPV++LGLP++  + S  D  PL+    +K+  W A+ LSY GRL L+ S+L S   F
Sbjct: 95   GTLPVRYLGLPLVTKQFSSTDYLPLIDHIKQKICSWSARFLSYTGRLNLISSILWSICNF 154

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHGKSKLRL----ISWATICRPLEEGGLGIRRIKDMNK 440
            W GAF +P     +++ +  ++L    +L      I+WA +C+P EEGGLG+R +K+ N 
Sbjct: 155  WMGAFRLPRDCIREIDKMCSAYLWSGGELNTSKAKIAWAFVCKPKEEGGLGLRSLKEAND 214

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHC 617
                KL+W I S   SLWV+WI S  L+    W       + SW++R+ILK R+     C
Sbjct: 215  VCCLKLIWRIISHADSLWVKWIQSSLLKKVFFWAVRENTSLGSWMWRKILKFRDIARTLC 274

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV-----REAGYWNSP 782
               + NG  T FW   W   G L+++ G+   +   I + AT+ E      R     N  
Sbjct: 275  KVEINNGAQTSFWYDDWSDLGRLIESAGDRGAIDLGINKHATVVEAWGNRRRRRHRANFL 334

Query: 783  PSSSPMVRTAWRQFQQIPKLGCDEEDQFVWSPCPS---GLFSVASAWEQIRHHYDVWEWT 953
                  +  +W    Q        ED  +W    +    +FS    W  IR   +   W 
Sbjct: 335  NRVEERLVLSWNSRNQ-------AEDCALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWY 387

Query: 954  ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLT--RFGAQSRCELCWAGVESEDHL 1109
            + VWF   IPK +F  W  + ++L T D++T    G  + C LC   +ES DHL
Sbjct: 388  KGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCILCNNALESRDHL 441


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  193 bits (490), Expect = 2e-46
 Identities = 109/351 (31%), Positives = 168/351 (47%), Gaps = 10/351 (2%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G+LP+++LGLP++  +LS  D APL+    +++  W ++ LS+AGR  L+ S++ SS  F
Sbjct: 318  GELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNF 377

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440
            W  AF +P +   ++E L  SFL       SK   ISW  +C+P  EGGLG+R +K+ N 
Sbjct: 378  WLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEAND 437

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIW-TATIPNDVSWVYRRILKIRNQFAHHC 617
                KL+W I S   SLWV+W+    L+    W      N  SW++++ILK R      C
Sbjct: 438  VCCLKLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFC 497

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797
               VGNG++T FW   W   G L+D  G    +   I R  ++ +   +           
Sbjct: 498  KAEVGNGESTSFWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEIL 557

Query: 798  MVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWF 968
                     Q   +    ++ + +W   +      FS  + W  +R   +   W + VWF
Sbjct: 558  NTIEEVLSTQHQKRTQQQQQGRVLWKGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWF 617

Query: 969  YDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFF 1115
                PK SF  W     +L T  ++ ++  G    C  C  G+E+ DHLFF
Sbjct: 618  PHATPKYSFCLWLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDHLFF 668


>gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490
            [Arabidopsis thaliana]
          Length = 657

 Score =  192 bits (488), Expect = 4e-46
 Identities = 111/362 (30%), Positives = 175/362 (48%), Gaps = 12/362 (3%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G LPV++LG+P++  K+   D  PLV     +   W A+ LS+AGRL+L+KSV+ S+  F
Sbjct: 306  GSLPVRYLGVPLMSQKMKKHDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINF 365

Query: 273  WTGAFPIPYSVCSKLESLMGSFL----HGKSKLRLISWATICRPLEEGGLGIRRIKDMNK 440
            W   F +P     KLE +  +FL       ++   ISW  +C   E GGLG++R+   NK
Sbjct: 366  WASIFILPNQCLHKLEQMCNAFLWSGAPNSAREAKISWDIVCSSKESGGLGLKRLSSWNK 425

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCF 620
                KL+W ++++  SLWV W                   V WV+R++ K+R        
Sbjct: 426  VLALKLIWLLFTASGSLWVSW-------------------VRWVWRKLCKLREVARPFVI 466

Query: 621  NLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKE-VREAGYW-NSPPSSS 794
              VG+G   +FW   W   G L+   G        +   + +++ +R   +W  S  S +
Sbjct: 467  CEVGSGITARFWQDNWTGHGPLIHLTGLTGPQLVGLSITSVVRDAIRNDDWWIASSRSRN 526

Query: 795  PMVRTAWRQFQQIPKL-GCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELV 962
            P++         +  L  C+ +D ++W      PS  FS A  W  ++       W + V
Sbjct: 527  PVILLLKSLLPPVGNLVDCEHDDSYLWKVGDRVPSSKFSTADTWRALQPFSVSVSWHKAV 586

Query: 963  WFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSRCELCWAGVESEDHLFFECPFSSE 1136
            WF +++PK +F  W    ++L T+D+L  +G    + C LC    E+ DHLFF C FSS 
Sbjct: 587  WFTNQVPKHAFISWVTAWNRLHTRDRLRSWGLIVPAECVLCNLVDETRDHLFFACRFSSR 646

Query: 1137 VW 1142
            +W
Sbjct: 647  IW 648


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  189 bits (479), Expect = 4e-45
 Identities = 111/359 (30%), Positives = 168/359 (46%), Gaps = 12/359 (3%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G LPV++LGLP++  KL+IA+ APL+     +   W  ++LS+AGR++L+ SV+     F
Sbjct: 655  GSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNF 714

Query: 273  WTGAFPIPYSVCSKLESLMGSFLHG----KSKLRLISWATICRPLEEGGLGIRRIKDMNK 440
            W  +F +P     K+ESL   FL      K  +  ++W+ +C P  EGG+G+RR    N+
Sbjct: 715  WISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNR 774

Query: 441  AGLCKLLWWIYSSKKSLWVQWIHSRFL-RNNSIWTATIPNDVSWVYRRILKIRNQFAHHC 617
                +++W ++S+  SLWV W     L ++ S W        SW ++ +L++R       
Sbjct: 775  TLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVVAERFI 834

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797
               VGNG    FW   W P G L+   G       R+   A I +V  +  W+     S 
Sbjct: 835  RCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAKISDVCTSEGWSIADPRSD 894

Query: 798  MVRTAWRQFQQIP-KLGCDEEDQFVW----SPCPSGLFSVASAWEQIRHHYDVWEWTELV 962
               +       I       + D + W      C    FS A+ W  +R       W   V
Sbjct: 895  QALSLHTHLTNISMPSDAQDLDSYDWVVDNKVCQG--FSAAATWSALRPSSAPVPWARAV 952

Query: 963  WFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGAQ--SRCELCWAGVESEDHLFFECPFSS 1133
            WF    PK +F  W   L +LPTK +L  +G Q  + C LC    E+ DHLF  C F++
Sbjct: 953  WFKGATPKHAFHLWTAHLDRLPTKVRLASWGMQIDTTCGLCSLHPETRDHLFLSCDFAN 1011


>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score =  189 bits (479), Expect = 4e-45
 Identities = 156/542 (28%), Positives = 234/542 (43%), Gaps = 16/542 (2%)
 Frame = +3

Query: 108  KHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWTGAF 287
            K+LG  ++P KL   D   L+      + GWQAK L+ AGR  L+KSV+ S  ++   + 
Sbjct: 752  KYLGCNILPNKLRRGDYDGLLEKVKSAINGWQAKYLNMAGRCTLIKSVVSSFPVYGMQSS 811

Query: 288  PIPYSVCSKLESLMGSFLHGKSK----LRLISWATICRPLEEGGLGIRRIKDMNKAGLCK 455
             +P SV +++E     FL  K      L  +SW  IC P  +GGLG RR+ + N A + K
Sbjct: 812  LLPVSVMNEIEKDCRKFLWNKMDKSHYLARMSWDRICSPTGKGGLGFRRLHNWNLAFMAK 871

Query: 456  LLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCFNLVGN 635
            L W I   +  LWV+ + +R+    S  +A   N  S ++R I+K R          +GN
Sbjct: 872  LGWMIIKDETKLWVRILKARYWERGSFLSAVGKNHHSPIWRDIVKGRELLEKGLVRRIGN 931

Query: 636  GDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPMVRTAW 815
            G +T  W H W   G L+D  G N   F        +  + + G W++   S  +     
Sbjct: 932  GRSTSLWYHWWVGGGPLVDVMGSNIPEFM---SHWQVSNIIKRGRWDTKKISHLLPPDIL 988

Query: 816  RQFQQIPKLGCDE-EDQFVWSPCPSGLFSVASAWEQIRHHYD----VWEWTELVWFYDKI 980
            +Q ++IP     E ED F W+   +G FSV SA+  I    +       W  L W  +  
Sbjct: 989  KQIKEIPLASMSEVEDDFTWNFEKNGTFSVKSAYYLINRREEETGGKGSWRGL-WRKNIP 1047

Query: 981  PKCSFTCWRMLLSKLPTKDKLTR--FGAQSRCELCWAGVESEDHLFFECPFSSEVWMRIK 1154
             K     W  + + LPT   L +       +C  C   +E   HLF +C  +S VW+ I 
Sbjct: 1048 FKYKLLIWNGIHNILPTALFLAKRIHNFNPQCVACDHPIEDMIHLFRDCCVASSVWIEIL 1107

Query: 1155 VKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLFNKGWRS 1334
                 N Q +    +  + I   F LN+ D  + K  +T     IW  RN  +F      
Sbjct: 1108 KHHKPNNQNLFFNLEWEEWI--DFNLNQHDYWVTK--FTTAFWHIWCSRNKTVF------ 1157

Query: 1335 ATRLAEEIIQLVHQKVSTSKKLLSNSLDGEFLAQF--WHVNFDQLR-DPIVCQWSPPPEG 1505
                         +      K   N +  +F      + VN  Q     +V +W PP +G
Sbjct: 1158 -------------ECAVNHPKFTYNRVVADFFTNIRAFQVNNTQGNGSKVVLRWKPPHQG 1204

Query: 1506 ELVLNSDGSLSA--RGCFFGGVIRNHMGEIILGYSGGTSQGSVLLLEALGMYHGLKVAKE 1679
             L LN+DG+  A       GGV R+ +G   LG++     GS    E + +  GL+VA +
Sbjct: 1205 FLKLNTDGAWKADWENAGIGGVFRDAVGNWELGFAKRVDAGSPEAAELMAIREGLQVAWD 1264

Query: 1680 RN 1685
             N
Sbjct: 1265 CN 1266


>gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata]
          Length = 441

 Score =  185 bits (470), Expect = 5e-44
 Identities = 129/440 (29%), Positives = 200/440 (45%), Gaps = 24/440 (5%)
 Frame = +3

Query: 141  LSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWTGAFPIPYSVCSKLE 320
            ++ +D  PL+     ++  W A+ LS+AGRL+L+ SV+ S   FW  AF +P +   +++
Sbjct: 1    MTTSDYIPLIERIRERISCWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPNACIKEID 60

Query: 321  SLMGSFLHGKSKLR----LISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIYSSKKS 488
             L  +FL    +L      +SW  +C P EEGGLG+R + + NK    KL+W + SS  S
Sbjct: 61   GLCSAFLWSGPELNRKKAKVSWNDVCMPKEEGGLGLRSLTEANKVCCLKLIWRLLSS-SS 119

Query: 489  LWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQFAHHCFNLVGNGDATKFWLHR 665
            LWVQW+    +R  S W+    + + SW++R++LK R+  +      + NG    FW   
Sbjct: 120  LWVQWLRQYVIRKGSFWSLRDTSTLGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDN 179

Query: 666  WHPQGMLLDTFGENCRLFTRIPRRATIKEV-------REAGYWNSPPSSSPMVRTAWRQF 824
            W P G L+   G    +   I   AT+ E          A + N   +    +RT     
Sbjct: 180  WSPLGPLIAISGTRGCIDMGIDIHATVAEALTHRRRRHRADHLNQMEAQLEELRT----- 234

Query: 825  QQIPKLGCDEEDQFVWSP-----CPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPKC 989
                K   + ED  +W        PS  FS    W   R      EW + +WF    PK 
Sbjct: 235  ----KGLVETEDVVLWKGKGGRFKPS--FSTKETWADTREQKPRNEWYQGIWFSHATPKY 288

Query: 990  SFTCWRMLLSKLPTKDKLTRF--GAQSRCELCWAGVESEDHLFFECPFSSEVWMRIKVKC 1163
            SF  W    ++L T D++  +  G    C  C    E+ +HLFF C +S EVW  +  K 
Sbjct: 289  SFITWLATKNRLSTGDRMMSWNAGVNLSCVFCQEQTETRNHLFFTCRYSREVWSGLTSKL 348

Query: 1164 WRNVQVVRGRFQESQTIL-----RSFGLNRADGVIRKLCYTVTVHFIWWERNMRLFNKGW 1328
                 + R    +  TIL     ++ G NR    + +  + + V+ IW ERN R   +  
Sbjct: 349  -----LTRHYSTDWTTILKLLTDKTLGNNRL--FLLRYAFQILVYSIWKERNSRRHGEEP 401

Query: 1329 RSATRLAEEIIQLVHQKVST 1388
              +  L + + + V  K+ST
Sbjct: 402  LPSALLLKRLDKEVRNKLST 421


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  184 bits (468), Expect = 8e-44
 Identities = 119/416 (28%), Positives = 181/416 (43%), Gaps = 8/416 (1%)
 Frame = +3

Query: 93   GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIF 272
            G  PV++LG+P+I  KL + DC+PL+     +++ W+ K+LS+AGRL+L++SVL S  ++
Sbjct: 587  GTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVY 646

Query: 273  WTGAFPIPYSVCSKLESLMGSFL-----HGKSKLRLISWATICRPLEEGGLGIRRIKDMN 437
            W     +P  V   +E  +  FL      G++  + ++W+ IC P  EGGLGI+ +   N
Sbjct: 647  WASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATK-VAWSEICLPKCEGGLGIKDLHCWN 705

Query: 438  KAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHC 617
            KA +   +W + SS  + W  W+    L+ NS W A +P+  SW +R++LKIR       
Sbjct: 706  KALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIRELCCSFF 765

Query: 618  FNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSP 797
             N++G+G AT  W   WHP G L   +  N               + E+G      S S 
Sbjct: 766  VNIIGDGRATSLWFDNWHPLGPLTLRWSSNI--------------IGESGL-----SKSA 806

Query: 798  MVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDK 977
            M+                          P+G +S +SAW  +R    +  W  LVWF   
Sbjct: 807  ML-------------------------TPNGFYSTSSAWNTLRPSRFIVPWYRLVWFV-- 839

Query: 978  IPKCSFTCWRMLLSKLPTKDKLTRFGAQSRCELCWAGVESEDHLFFECPFSSEVWMRIKV 1157
                                                  E+ +HLFF+C +S  +W  +  
Sbjct: 840  -------------------------------------AETHNHLFFDCAYSFGIWTHVLS 862

Query: 1158 KCWRNVQVVRGRFQESQTIL---RSFGLNRADGVIRKLCYTVTVHFIWWERNMRLF 1316
            KC     V +     S  I     ++  N    VI KL     V+ IW ERN R F
Sbjct: 863  KC----DVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRRF 914


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  183 bits (465), Expect = 2e-43
 Identities = 154/548 (28%), Positives = 242/548 (44%), Gaps = 18/548 (3%)
 Frame = +3

Query: 99   LPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKSVLQSSYIFWT 278
            LP+ +LG P+  G   +     LV     ++ GW+ KILS  GR+ L++SVL S  I+  
Sbjct: 1629 LPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSVLASLPIYLL 1688

Query: 279  GAFPIPYSVCSKLESLMGSFLHGKS----KLRLISWATICRPLEEGGLGIRRIKDMNKAG 446
                 P  V  ++  L  SFL G S    ++   SWA I  P+ EGGL IR + ++ +A 
Sbjct: 1689 QVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIRSLAEVFEAF 1748

Query: 447  LCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFAHHCFNL 626
              K LWW + +  SLW +++  ++ R             S  ++R+L        H    
Sbjct: 1749 SMK-LWWRFRTTDSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWR 1807

Query: 627  VGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPMVR 806
            VG G+   FW   W  +  L+ +  E    FT       + +      WN     + + +
Sbjct: 1808 VGQGNVF-FWHDCWMGEAPLISSNQE----FT--SSMVQVCDFFTNNSWNIEKLKTVLQQ 1860

Query: 807  TAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPK 986
                +  +IP +    +D+  W+P P+G FS  SAW+ IR    V      +W       
Sbjct: 1861 EVVDEIAKIP-IDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLT 1919

Query: 987  CSFTCWRMLLSKLPTKDKLTRFGAQ--SRCELCWAGVESEDHLFFECPFSSEVW------ 1142
             SF  WR+L   +P + K+   G Q  SRC  C    ES  H+ ++ P + +VW      
Sbjct: 1920 TSFFLWRLLHDWIPVELKMKSKGLQLASRCRCC-KSEESIMHVMWDNPVAMQVWNYFAKL 1978

Query: 1143 --MRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLF 1316
              + I   C  N Q++   F          G     G IR L     + F+W ERN    
Sbjct: 1979 FQILIINPCTIN-QIIGAWFYS--------GDYCKPGHIRTLVPLFILWFLWVERNDAKH 2029

Query: 1317 NKGWRSATRLAEEIIQLVHQKVSTSKKLLSNSLDGE-FLAQFWHVNF--DQLRDPIVCQW 1487
                    R+   +++L+ Q++S  ++LL     G+  +AQ W + F  + L  P V  W
Sbjct: 2030 RNLGMYPNRVVWRVLKLI-QQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSW 2088

Query: 1488 SPPPEGELVLNSDGSL-SARGCFFGGVIRNHMGEIILGYSGGTSQGSVLLLEALGMYHGL 1664
              P  GE  LN DGS   +     GG++R+H GE++ G+S      + L  E L +Y GL
Sbjct: 2089 HKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGL 2148

Query: 1665 KVAKERNV 1688
             + ++ N+
Sbjct: 2149 ILCRDYNI 2156


Top