BLASTX nr result

ID: Cocculus23_contig00009173 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00009173
         (1974 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   332   4e-88
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   328   4e-87
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           327   9e-87
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               326   3e-86
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   326   3e-86
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       322   3e-85
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   319   3e-84
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   313   1e-82
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   312   4e-82
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...   298   5e-78
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   295   7e-77
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               291   6e-76
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   273   3e-70
gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA...   270   2e-69
gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]                262   4e-67
emb|CAN78577.1| hypothetical protein VITISV_020585 [Vitis vinifera]   246   4e-62
emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulga...   241   7e-61
gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thali...   238   1e-59
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   237   1e-59
dbj|BAB01344.1| reverse transcriptase-like protein [Arabidopsis ...   237   1e-59

>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  332 bits (851), Expect = 4e-88
 Identities = 188/543 (34%), Positives = 287/543 (52%), Gaps = 5/543 (0%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  AG+F+S RG+RQG  +SPYLF I M+VLS +        H      CK  G
Sbjct: 638  FSVQVNGELAGYFQSSRGLRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMG 697

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL+V       ++   + +   FA++SGL I+ +KS ++LAG+S+   + + 
Sbjct: 698  LTHLSFADDLMVLSDGKIRSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVA 757

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
            +   F  G  PV+YLGLPLI+ RL  + C PL++ +  ++ SW SR LS+AGR  L+ SV
Sbjct: 758  DRFPFSSGQLPVRYLGLPLITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSV 817

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717
            L S+  +W  AF +P     +LE +   FL SG+  +   + ISW  + +P +EGGL +R
Sbjct: 818  LWSICNFWLAAFRLPRKCIRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLR 877

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLK-GEXXXXXXXXXXXXXXXRRIFKLR 894
              K+ N    LKL+WK++++ +SLWV+WV    L+                  +++ K R
Sbjct: 878  SLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYR 937

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
            ++AK      VGNG+ TSFW D W + G L     +   I LGI R   V     N   R
Sbjct: 938  EVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQR 997

Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTS---GLFSTKSAWEITRFKHPKCS 1245
                R  +   I +     +   T  ED+V+W+  +      FST+  W  TR    +  
Sbjct: 998  --RHRNDVYNVIEDALKKSWDTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVP 1055

Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425
            W K+IWF    PK++   W     ++PT  ++ +     ++ CIFC    ET  HLFF+C
Sbjct: 1056 WHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTC 1115

Query: 1426 HFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNERN 1605
             F+  +W  +A+  F + + + W++ +  I      + ++  + +  FQA+IY VW ERN
Sbjct: 1116 SFTSVIWVDLARGIFKTQYTSHWQSIIEAITNS-QHHRVEWFLRRYVFQATIYIVWRERN 1174

Query: 1606 RRR 1614
             RR
Sbjct: 1175 GRR 1177


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  328 bits (842), Expect = 4e-87
 Identities = 191/541 (35%), Positives = 280/541 (51%), Gaps = 6/541 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            F++ VNG+  GFFRS +G+RQGDP+SPYLF +AMEV S +       G++          
Sbjct: 492  FTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLS 551

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ISHLMFADD+++F    S++++     +  FA++SGL++NK KS +F AG+  +L + + 
Sbjct: 552  ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL--DLSERIT 609

Query: 361  EAS-GFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537
             A+ GF  G FP++YLGLPL+  +L I+   PL++ + ++L+SW S++LS+AGRT L+ S
Sbjct: 610  SAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISS 669

Query: 538  VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNI 714
            V+  L  +W   F +P     K+ES+  +FL +GS    K S +SW D   P  EGGL  
Sbjct: 670  VIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGF 729

Query: 715  RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894
            R   + NK  LL+LIW L     SLW +W     L                  + +  LR
Sbjct: 730  RSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLR 789

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
             LA+  I   VGNG + SFW D W + G L   + ++    L I  +A VA      GWR
Sbjct: 790  PLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWR 849

Query: 1075 FPYSRIPLIREIWNQCSSL-YCLPTLEEDEVVWKATTSGL--FSTKSAWEITRFKHPKCS 1245
             P SR      I +  +SL    P +  D   W         FS    WE+ R + P   
Sbjct: 850  LPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEVLRPRRPVKR 909

Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425
            WAK +WF   +PKHA   W   L ++PT  +L    +V S+ C  C    ET  HL   C
Sbjct: 910  WAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDHLLLLC 969

Query: 1426 HFSLGVWSLI-AQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602
             FS  VW ++  + C       +W   L    +  S+ +  S++ K+  Q  +Y++W +R
Sbjct: 970  DFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQ--STAAAPSLLRKVVAQLVVYNLWRQR 1027

Query: 1603 N 1605
            N
Sbjct: 1028 N 1028


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  327 bits (839), Expect = 9e-87
 Identities = 190/541 (35%), Positives = 280/541 (51%), Gaps = 6/541 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            F++ VNG+  GFFRS +G+RQGDP+SPYLF +AMEV S +       G++          
Sbjct: 492  FTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLS 551

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ISHLMFADD+++F    S++++     +  FA++SGL++NK KS +F AG+  +L + + 
Sbjct: 552  ISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGL--DLSERIT 609

Query: 361  EAS-GFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537
             A+ GF  G FP++YLGLPL+  +L I+   PL++ + ++L+SW S++LS+AGRT L+ S
Sbjct: 610  SAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISS 669

Query: 538  VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNI 714
            V+  L  +W   F +P     K+ES+  +FL +GS    K S +SW D   P  EGGL  
Sbjct: 670  VIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGF 729

Query: 715  RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894
            R   + NK  LL+LIW L     SLW +W     L                  + +  LR
Sbjct: 730  RSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLR 789

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
             LA+  I   VGNG + SFW D W + G L   + ++    L I  +A VA      GWR
Sbjct: 790  PLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWR 849

Query: 1075 FPYSRIPLIREIWNQCSSL-YCLPTLEEDEVVWKATTSGL--FSTKSAWEITRFKHPKCS 1245
             P SR      I +  +SL    P +  D   W         FS    WE+ R + P   
Sbjct: 850  LPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWEVLRPRRPVKR 909

Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425
            WA+ +WF   +PKHA   W   L ++PT  +L    +V S+ C  C    ET  HL   C
Sbjct: 910  WARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDHLLLLC 969

Query: 1426 HFSLGVWSLI-AQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602
             FS  VW ++  + C       +W   L    +  S+ +  S++ K+  Q  +Y++W +R
Sbjct: 970  DFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQ--STAAAPSLLRKVVAQLVVYNLWRQR 1027

Query: 1603 N 1605
            N
Sbjct: 1028 N 1028


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  326 bits (835), Expect = 3e-86
 Identities = 193/552 (34%), Positives = 287/552 (51%), Gaps = 14/552 (2%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG   G+F+S+RG+RQG  +SPYLF I M+VLS +      +        C+  G
Sbjct: 285  FSVQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLG 344

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL+V     + ++   L +   F + SGL I+ +KS +++AG+S  ++  + 
Sbjct: 345  LTHLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIA 404

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
                F  G  PV+YLGLPL++ RL  +   PL++ ++ ++ +W  R  S+AGR  L++SV
Sbjct: 405  AKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSV 464

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS--SKHKFSLISWKDIARPLEEGGLNI 714
            L S+  +W  AF +P     +++ +   FL SGS  S HK + ISW  + +P  EGGL +
Sbjct: 465  LWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHK-AKISWDIVCKPKAEGGLGL 523

Query: 715  RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL--KGEXXXXXXXXXXXXXXXRRIFK 888
            R  K+ N    LKL+W++I+N +SLW +WV   YL  K                 R+I K
Sbjct: 524  RNLKEANDVSCLKLVWRIISNSNSLWTKWV-AEYLIRKKSIWSLKQSTSMGSWIWRKILK 582

Query: 889  LRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDG 1068
            +RD+AK      VGNG S SFW D W  HG L   + +   I LGI R A VA     D 
Sbjct: 583  IRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVA-----DA 637

Query: 1069 W---RFPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGL---FSTKSAWEITRFK 1230
            W        R  L+ EI    +      +  ED V+W+         FST+  W + +  
Sbjct: 638  WTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTWHLIKAT 697

Query: 1231 HPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSS----CIFCWTGEE 1398
                SW K +WF    PK+A   W     ++PT  ++  LK   S S    C+ C    +
Sbjct: 698  SSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRM--LKWNSSGSVSGNCVLCTNNSK 755

Query: 1399 TEKHLFFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQAS 1578
            T +HLFFSC ++  VW+ +A+  + + +   W   L  I   F  + ++  + +  FQA+
Sbjct: 756  TLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTHF-QDRVEGFLTRYIFQAT 814

Query: 1579 IYHVWNERNRRR 1614
            IYHVW ERN RR
Sbjct: 815  IYHVWRERNGRR 826


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  326 bits (835), Expect = 3e-86
 Identities = 192/544 (35%), Positives = 278/544 (51%), Gaps = 6/544 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  AG+FRS RGIRQG  +SPYLF I+MEVLS +               CK  G
Sbjct: 43   FSVQVNGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLG 102

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL++       +++  + +M  FA+ SGL+IN +K+ ++ AG+S +    ++
Sbjct: 103  LTHLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMI 162

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
                F  G  PV+YLGLPL++ RL      PL + + +++ +W SR LS+AGR  L+ SV
Sbjct: 163  SRYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSV 222

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717
            L S   +W  AF +PS+   ++ SI   FL SG   H + + +SW DI +P +EGGL +R
Sbjct: 223  LWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLR 282

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894
               + N   +LKLIW++ +N DSLWV+W     LK E                +++ K R
Sbjct: 283  SLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLTPNSSLGSWMWKKMLKYR 342

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
            + AK      V NG  TSFW D W   G L     +  QI LGI RN  VA    N   R
Sbjct: 343  ETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNR--R 400

Query: 1075 FPYSRIPLIREIWNQCSSLY-CLPTLEEDEVVWKA---TTSGLFSTKSAWEITRFKHPKC 1242
                R   + +I    +  Y     L ED  +W+         FSTK  W   R K  + 
Sbjct: 401  RRKHRTEQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEV 460

Query: 1243 SWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFS 1422
            +W K +WF    PK+    W     ++ T ++++         C FC T  ET  HLFFS
Sbjct: 461  AWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFS 520

Query: 1423 CHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602
            C ++  +W+ IA+      F   W+  +  I  +  ++ I S + +  FQ +++ VW ER
Sbjct: 521  CSYASAIWTAIAKNVLQHRFSTDWQTIVNYI-SETQTDRIRSFLSRYIFQLTVHTVWKER 579

Query: 1603 NRRR 1614
            N RR
Sbjct: 580  NDRR 583


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  322 bits (826), Expect = 3e-85
 Identities = 183/541 (33%), Positives = 281/541 (51%), Gaps = 6/541 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            F+V +NG   GFF+S +G+RQGDP+SPYLF +AME  S++       G +          
Sbjct: 632  FTVSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLS 691

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ISHLMFADD+++F    S +L+     +  FA +SGL++NK KS+++LAG+ + LE +  
Sbjct: 692  ISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGL-NQLESNAN 750

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
             A GF  G  P++YLGLPL++ +L I+  +PL++ + ++ +SW ++ LS+AGR  L+ SV
Sbjct: 751  AAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSV 810

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNIR 717
            +     +W   F +P     ++ES+  RFL SG+  + K   +SW  +  P  EGGL +R
Sbjct: 811  IFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLR 870

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLRD 897
               + NK   ++LIW+L   KDSLW  W H  +L                  +R+  LR 
Sbjct: 871  RLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRP 930

Query: 898  LAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWRF 1077
            LA   +   VGNG    +W D W + G L   I ++   SL +   A VA     DGWR 
Sbjct: 931  LAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASAFSEDGWRL 990

Query: 1078 PYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGL----FSTKSAWEITRFKHPKCS 1245
            P SR    + I +   ++    T +ED   ++ + +G     FS    WE  R K    S
Sbjct: 991  PVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFSAAKTWEAIRPKATVKS 1050

Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425
            WA  IWF   +PK+A  +W   L ++ T  +L     + S +C+ C    E+  HL   C
Sbjct: 1051 WASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASESRDHLLLIC 1110

Query: 1426 HFSLGVWSLIAQK-CFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602
             FS  VW L+ ++ C      +SW   L  + +  SS     ++ K+  Q  +Y++W +R
Sbjct: 1111 EFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQ--SSPEAPPLLRKIVSQVVVYNLWRQR 1168

Query: 1603 N 1605
            N
Sbjct: 1169 N 1169


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  319 bits (818), Expect = 3e-84
 Identities = 186/553 (33%), Positives = 277/553 (50%), Gaps = 9/553 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FS+ VNG  AG+FRS RG+RQG  +SPYLF I+M+VLS +               CK  G
Sbjct: 359  FSIQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLG 418

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL++       +++  + ++  FA   GL+I  +K+ ++LAG+S +    + 
Sbjct: 419  LTHLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMS 478

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
                F  G  PV+YLGLPL++ RL  S   PLID +  ++  W SR LS+AGR  L+ SV
Sbjct: 479  SRYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSV 538

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717
            L S+  +W  AF +P     ++  I    L SG   + K + +SW +I +P +EGGL ++
Sbjct: 539  LWSITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQ 598

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894
              ++ NK   LKLIW+L++ +DSLWV+W     LK E                RR+ K R
Sbjct: 599  SLREANKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHR 658

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGW- 1071
            ++AK      V NG +TSFW D W   G L         I +GI R     H  + + W 
Sbjct: 659  EVAKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISR-----HMTLAEAWS 713

Query: 1072 --RFPYSRIPLIREIWNQCSSLYCLPTLE-EDEVVWKA---TTSGLFSTKSAWEITRFKH 1233
              R    R+ ++ E        Y    +E ED ++W+         FSTK  W   R   
Sbjct: 714  RRRRKRHRVEILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSS 773

Query: 1234 PKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHL 1413
             + +W K +WF    PK +   W     ++ T  ++        ++C+FC +  ET  HL
Sbjct: 774  NQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHL 833

Query: 1414 FFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVW 1593
            FF C +S  +W+ IA+  +   F   W A +  I  D   + I S + +  FQ SI+ +W
Sbjct: 834  FFQCCYSSEIWTSIAKNVYKDRFSTKWSAVVNYI-SDSQPDRIQSFLSRYTFQVSIHSIW 892

Query: 1594 NERNRRRFQSRGR 1632
             ERN RR   + R
Sbjct: 893  RERNSRRHGEKSR 905


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  313 bits (803), Expect = 1e-82
 Identities = 187/549 (34%), Positives = 287/549 (52%), Gaps = 11/549 (2%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  AGFF S RG+RQG  +SPYLF I M VLS +     +  ++     C+  G
Sbjct: 935  FSVQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIG 994

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL+VF+     ++   + + + FA  SGL+I+ +KS I+LAG+S++     +
Sbjct: 995  LTHLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTL 1054

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
             +  F  G  PV+YLGLPL++ ++  +   PLI+ +++K+ SW +RSLS+AGR  LL SV
Sbjct: 1055 SSFPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSV 1114

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717
            + S+  +W  A+ +P+    ++E +   FL SG   + K + I+W  I +P +EGGL I+
Sbjct: 1115 IVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIK 1174

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL-KGEXXXXXXXXXXXXXXXRRIFKLR 894
               + NK   LKLIW+L++ + SLWV W+    + KG                +++ K R
Sbjct: 1175 SLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYR 1234

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGI----DRNALVAHFRIN 1062
            +LAK      V NG STSFW D W + G L         I LGI    +   ++   +  
Sbjct: 1235 ELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLETVLRTHQHR 1294

Query: 1063 DGWRFPYSRIPL-IREIWNQCSSLYCLPTLEEDEVVWKATTSGL---FSTKSAWEITRFK 1230
                  Y+RI   I+ +  Q            D  +W++  +     F TK  W   R  
Sbjct: 1295 QHRAAIYNRINAEIQRLQQQERE------AGPDISLWRSLKNDFNKRFITKVTWNNVRTH 1348

Query: 1231 HPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKH 1410
             P+ +W K +WFP   PK++ L+W     ++ T  ++K        +C  C   EET  H
Sbjct: 1349 QPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDH 1408

Query: 1411 LFFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSI-VVKLAFQASIYH 1587
            LFFSC ++  VW  + Q+  ++ +   W     L+    S+   D + + +  FQASIYH
Sbjct: 1409 LFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFTLLCT--SNLPRDHLFLFRYVFQASIYH 1466

Query: 1588 VWNERNRRR 1614
            +W ERN RR
Sbjct: 1467 IWRERNARR 1475


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  312 bits (799), Expect = 4e-82
 Identities = 187/544 (34%), Positives = 286/544 (52%), Gaps = 6/544 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  +GFFRS+RG+RQG  +SPYL+ I M VLS +     +   +     C+   
Sbjct: 785  FSVQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMN 844

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADD++VF   +S ++   L +   FA  S L+I+ +KS IF+AGIS N + S++
Sbjct: 845  LTHLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGISPNAKTSIL 904

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
            +   F  G  PVKYLGLPL++ R+  S   PL++ + +++ SW +R LS+AGR  L++SV
Sbjct: 905  QQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSV 964

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717
            L S+  +W   F +P +   ++E +   FL SG   + K + I+W ++ +  EEGGL ++
Sbjct: 965  LSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLK 1024

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894
              K+ N+  LLKLIW++++ +DSLWV+WV+   ++ E                R+I K R
Sbjct: 1025 PLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQR 1084

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
            D A+      V +G  TSFW D W   G L   +     I LGI  NA VA   + +  R
Sbjct: 1085 DKARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVA--EVMNTHR 1142

Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKA---TTSGLFSTKSAWEITRFKHPKCS 1245
                R   + +I +Q        + + D  +WK    T    FS+   W+  R    +C 
Sbjct: 1143 RKRHRADFLNQIKSQIELARQDRSTDGDRSLWKQKEDTFKSSFSSSKTWQQIRSISLRCD 1202

Query: 1246 WAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSC 1425
            W + +WF    PK++ + W     ++ T  K+          C+FC    ET  HLFFSC
Sbjct: 1203 WYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRDHLFFSC 1262

Query: 1426 HFSLGVWSLIAQKCFNSTFHASWE-ASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNER 1602
             +S  VW  + +   N     +W   +  L+D   S   +    ++ AFQASI+ +W ER
Sbjct: 1263 PYSSHVWFSLTKGLLNGRNILNWNLITPHLLDS--SRPYLHVFTLRYAFQASIHSLWRER 1320

Query: 1603 NRRR 1614
            N RR
Sbjct: 1321 NCRR 1324


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
            subsp. vesca]
          Length = 958

 Score =  298 bits (764), Expect = 5e-78
 Identities = 177/583 (30%), Positives = 270/583 (46%), Gaps = 2/583 (0%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLN-HCKPF 177
            FSV VNG  AGFF  +RG+RQGDP+SPYLF IAMEVLS   +  +        +  C   
Sbjct: 459  FSVCVNGELAGFFARRRGLRQGDPLSPYLFVIAMEVLSLCIQRRINCSPCFRYHWRCDQL 518

Query: 178  GISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSL 357
             +SHL FADDLL+F     N++        +F   S L+ N  +S IFLAG+  N  DS+
Sbjct: 519  NLSHLCFADDLLMFCNGDENSVRTLHDAFSNFESLSSLKANVSESKIFLAGVDGNSSDSV 578

Query: 358  VEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537
            ++ + F  G  PV+YLG+PLI+++L +  C PL+D +E++++SW+++ LS+AGR  L++S
Sbjct: 579  LQVTNFSLGTCPVRYLGIPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQS 638

Query: 538  VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNI 714
            VL S+ +YW+    +P  +   +E  +R FL +G+ S    + ++W +I  P  EGGL I
Sbjct: 639  VLSSIQVYWASHLILPKKVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGI 698

Query: 715  RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894
            +     NKA ++  IW L+++  + W  WV    LKG                R++ K+R
Sbjct: 699  KDLHCWNKALMISHIWNLVSSSSNFWTDWVKVYLLKGNSFWNAPLPSICSWNWRKLLKIR 758

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
            +L       I+G+GR+TS W D WH  G L L     I    G+ ++A++          
Sbjct: 759  ELCCSFFVNIIGDGRATSLWFDNWHPLGPLTLRWSSNIIGESGLSKSAML---------- 808

Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGLFSTKSAWEITRFKHPKCSWAK 1254
                                              T +G +ST SAW   R       W +
Sbjct: 809  ----------------------------------TPNGFYSTSSAWNTLRPSRFIVPWYR 834

Query: 1255 LIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSCHFS 1434
            L+WF                                           ET  HLFF C +S
Sbjct: 835  LVWFV-----------------------------------------AETHNHLFFDCAYS 853

Query: 1435 LGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWNERNRRR 1614
             G+W+ +  KC  S     W   +  +  ++  NS+  +++KLA QA +Y +W ERN RR
Sbjct: 854  FGIWTHVLSKCDVSKPLLPWSDFIFWVATNWKGNSLPVVILKLALQAVVYAIWRERNNRR 913

Query: 1615 FQSRGRXXXXXXXXXXXXVMSKLKNVSSNLSTSNRFLAENWGL 1743
            F++               +   L +     + SN ++   W L
Sbjct: 914  FRNESLPPAVVFKGIVESIRLCLLSWKIPHTPSNAYIFHEWRL 956


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  295 bits (754), Expect = 7e-77
 Identities = 180/545 (33%), Positives = 267/545 (48%), Gaps = 8/545 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FS+ V+GS  G+F+  +G+RQGDP+SP LF IAME+LS + +++   G +          
Sbjct: 631  FSINVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVR 690

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGI-SSNLEDSL 357
            IS L FADDL++F    +++L     ++ SF   SGLE+N +KS ++ AG+  ++ ED+L
Sbjct: 691  ISSLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTL 750

Query: 358  VEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRS 537
              A GF+ G FP +YLGLPL+  +L  S    LID + ++   W +++LS+AGR  L+ S
Sbjct: 751  --AFGFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISS 808

Query: 538  VLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKHKFSL-ISWKDIARPLEEGGLNI 714
            V+ S   +W  +F +P      +E +  RFL       +  + +SW++   P  EGGL +
Sbjct: 809  VIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGL 868

Query: 715  RCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLR 894
            R     NK   L+LIW L   +DSLWV W H   L+                 + I  LR
Sbjct: 869  RNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGLR 928

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
             LAK  +   VGNG+  S+W D W N G L   I        GI  +A+V     + GW 
Sbjct: 929  PLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEASSSTGWI 988

Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLE----EDEVVW--KATTSGLFSTKSAWEITRFKHP 1236
             P +R      + N  S+L   P       ED   W  + ++S  FS+K  WE  R +  
Sbjct: 989  LPSAR-TRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWECLRQRDT 1047

Query: 1237 KCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLF 1416
               WA  +W+   IPK+A   W   L ++P   +  H      S C  C    ET  HLF
Sbjct: 1048 TKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETETRDHLF 1107

Query: 1417 FSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIYHVWN 1596
              C     +W  +  +   S     W+  +  +    +  S    + KLA Q +I+H+W 
Sbjct: 1108 IHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLS--NQGSFSGTLKKLAVQTAIFHIWK 1165

Query: 1597 ERNRR 1611
            ERN R
Sbjct: 1166 ERNSR 1170


>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score =  291 bits (746), Expect = 6e-76
 Identities = 173/483 (35%), Positives = 253/483 (52%), Gaps = 10/483 (2%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  AGFF+S RG+RQG  +SPYLF I M+VLS +    + +G +    HCK  G
Sbjct: 191  FSVQVNGELAGFFQSSRGLRQGCALSPYLFVICMDVLSKLLDKVVGIGRIGYHPHCKRMG 250

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL++       ++   + +   F+++SGL+I+ +KS IF AG+SS     L 
Sbjct: 251  LTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLH 310

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
                F  G  P++YLGLPL++ RL      PLI+ +  ++ SW SR LS+AGR  L+ S+
Sbjct: 311  THFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSI 370

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSS-KHKFSLISWKDIARPLEEGGLNIR 717
            + S   +W  AF +P +   ++E +   FL SG++   K + ISW  + +P  EGGL +R
Sbjct: 371  IWSSCNFWLSAFQLPRACIQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLR 430

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894
              K+ N    LKL+W++I++ DSLWV+WV    LK E                ++I K R
Sbjct: 431  SLKEANDVCCLKLVWRIISHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYR 490

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGW- 1071
             +AK      VGNG STSFW D W   G L         I +GI R   VA     D W 
Sbjct: 491  GVAKRFCKAEVGNGESTSFWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVA-----DAWT 545

Query: 1072 --RFPYSRIPLIREIWNQCSSLYCLPTLEEDE--VVWKATT---SGLFSTKSAWEITRFK 1230
              R  + R  ++  I    S+ +   T ++ +  V+WK         FSTK+ W   R  
Sbjct: 546  SRRRRHHRQEILNTIEEVLSTQHQKRTQQQQQGRVLWKGKNDIYKDKFSTKNTWNYLRTT 605

Query: 1231 HPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKH 1410
              + +W K +WFP   PK++  +W     ++ T  ++      ++  C FC  G ET  H
Sbjct: 606  SNEVAWHKGVWFPHATPKYSFCLWLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDH 665

Query: 1411 LFF 1419
            LFF
Sbjct: 666  LFF 668


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  273 bits (697), Expect = 3e-70
 Identities = 163/484 (33%), Positives = 243/484 (50%), Gaps = 6/484 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV++NG  AG F S +G+RQGDP+SPYLF +AMEV S + +     G++          
Sbjct: 529  FSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLE 588

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ISHLMFADD+++F    S++L+  +  +  FA +SGL +N  K+ ++ AG+S +  DS+ 
Sbjct: 589  ISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMA 648

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
             + GF  G  PV+YLGLPL+S +L I+   PLI+ + ++  SW  R LS+AGR  LL SV
Sbjct: 649  -SYGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASV 707

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNIR 717
            +  +  +W  +F +P     K+ES+  RFL S    K   + ++W  +  P  EGG+ +R
Sbjct: 708  ISGIVNFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLR 767

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL-KGEXXXXXXXXXXXXXXXRRIFKLR 894
                 N+   L++IW L +N  SLWV W     L K                 + + +LR
Sbjct: 768  RFAVSNRTLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLR 827

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
             +A+  I   VGNGR  SFW D W   G L   +       L +  NA ++    ++GW 
Sbjct: 828  VVAERFIRCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAKISDVCTSEGWS 887

Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEE----DEVVWKATTSGLFSTKSAWEITRFKHPKC 1242
                R      +    +++      ++    D VV      G FS  + W   R      
Sbjct: 888  IADPRSDQALSLHTHLTNISMPSDAQDLDSYDWVVDNKVCQG-FSAAATWSALRPSSAPV 946

Query: 1243 SWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFS 1422
             WA+ +WF    PKHA  +W   L ++PT  +L    M   ++C  C    ET  HLF S
Sbjct: 947  PWARAVWFKGATPKHAFHLWTAHLDRLPTKVRLASWGMQIDTTCGLCSLHPETRDHLFLS 1006

Query: 1423 CHFS 1434
            C F+
Sbjct: 1007 CDFA 1010


>gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490
            [Arabidopsis thaliana]
          Length = 657

 Score =  270 bits (689), Expect = 2e-69
 Identities = 164/496 (33%), Positives = 251/496 (50%), Gaps = 13/496 (2%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            +S+  NG   GFF  ++GIRQGDP+S +LF + M++L+       + G   L   C    
Sbjct: 179  YSIAYNGELIGFFVGKKGIRQGDPMSSHLFVLVMDILARSLDLGAVEGRFVLHPKCLAPM 238

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            I+HL FADD+LVF   S ++L A L ++  F + SGL IN +K+ + L G +      + 
Sbjct: 239  ITHLSFADDILVFCDGSLSSLVAILDILDVFKKGSGLGINLQKTALLLDGGNFERNRIMA 298

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
             + G  +G  PV+YLG+PL+S +++    QPL+D + S+  SW +R LS+AGR  LL+SV
Sbjct: 299  ASLGVSQGSLPVRYLGVPLMSQKMKKHDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSV 358

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGS-SKHKFSLISWKDIARPLEEGGLNIR 717
            + S   +W+  F +P+    KLE +   FL SG+ +  + + ISW  +    E GGL ++
Sbjct: 359  IYSTINFWASIFILPNQCLHKLEQMCNAFLWSGAPNSAREAKISWDIVCSSKESGGLGLK 418

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFKLRD 897
                 NK   LKLIW L T   SLWV WV                       R++ KLR+
Sbjct: 419  RLSSWNKVLALKLIWLLFTASGSLWVSWVR-------------------WVWRKLCKLRE 459

Query: 898  LAKDCICTIVGNGRSTSFWVDIWHNHGVL----ALCIPELIQISLGIDRNALVAHFRIND 1065
            +A+  +   VG+G +  FW D W  HG L     L  P+L+ +S+     ++V     ND
Sbjct: 460  VARPFVICEVGSGITARFWQDNWTGHGPLIHLTGLTGPQLVGLSI----TSVVRDAIRND 515

Query: 1066 GWRFPYSR-----IPLIREIWNQCSSLYCLPTLEEDEVVWKA---TTSGLFSTKSAWEIT 1221
             W    SR     I L++ +     +L  +    +D  +WK      S  FST   W   
Sbjct: 516  DWWIASSRSRNPVILLLKSLLPPVGNL--VDCEHDDSYLWKVGDRVPSSKFSTADTWRAL 573

Query: 1222 RFKHPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEET 1401
            +      SW K +WF   +PKHA + W     ++ T  +L+   ++  + C+ C   +ET
Sbjct: 574  QPFSVSVSWHKAVWFTNQVPKHAFISWVTAWNRLHTRDRLRSWGLIVPAECVLCNLVDET 633

Query: 1402 EKHLFFSCHFSLGVWS 1449
              HLFF+C FS  +W+
Sbjct: 634  RDHLFFACRFSSRIWT 649


>gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana]
          Length = 740

 Score =  262 bits (670), Expect = 4e-67
 Identities = 153/454 (33%), Positives = 238/454 (52%), Gaps = 6/454 (1%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  AGFF S+RG+RQG  +SPYLF I M VLS +     +  ++     CK   
Sbjct: 211  FSVQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLS 270

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL+VF+     ++   + + + FA  SGL I+ +KS ++LAG+S    ++++
Sbjct: 271  LTHLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNIL 330

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
             A  F  G  PV+YLGLPL++ ++  +   PL+D + SK+ SW +RSLS+AGR  L+ SV
Sbjct: 331  SAFPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSV 390

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKH-KFSLISWKDIARPLEEGGLNIR 717
            + SL  +W  A+ +P+    ++E +   FL SG   + K + I+W  + +  +EGGL I+
Sbjct: 391  IVSLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIK 450

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYL-KGEXXXXXXXXXXXXXXXRRIFKLR 894
               + NK   LKLIW+L++ + SLWV WV    + KG                +++ K R
Sbjct: 451  SLLEANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLKYR 510

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
            D+AK      + +G STSFW D W   G L         I +GI   A VA   +    R
Sbjct: 511  DVAKSMCKVEIKSGSSTSFWYDNWSQLGQLVDVTNARRTIDMGIPLAATVA--TVLASHR 568

Query: 1075 FPYSRIPLIREIWNQCSSLYCLPTLEEDEV-VWKATTSGL---FSTKSAWEITRFKHPKC 1242
              + R  +  +I  +  S+         ++ +W+++       F TK  W   R  H   
Sbjct: 569  TKHHRTAIYNKIEAEIQSILQRERSGAPDIFLWRSSGDNFRQSFITKVTWHNIRVIHTHR 628

Query: 1243 SWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLK 1344
             W K +WF    PK++ L+W     ++ T  ++K
Sbjct: 629  QWYKGVWFSYNTPKYSFLLWLAIHDRLSTGDRIK 662


>emb|CAN78577.1| hypothetical protein VITISV_020585 [Vitis vinifera]
          Length = 1848

 Score =  246 bits (627), Expect = 4e-62
 Identities = 189/577 (32%), Positives = 268/577 (46%), Gaps = 37/577 (6%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSVL+NG+P GFF+S RG+RQGDP+SPYLF I MEV SS     +  G+   ++ C+  G
Sbjct: 1252 FSVLINGTPKGFFQSSRGLRQGDPLSPYLFVIXMEVFSSFLNRAVDNGY---ISGCQVKG 1308

Query: 181  -------ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISS 339
                   ISHL+FADD LVF QAS + L     L+  F   SG+ IN  KS +   G   
Sbjct: 1309 RNEGGIQISHLLFADDTLVFCQASQDQLTYLSWLLMWFEAXSGMRINLDKSELIPVGRVV 1368

Query: 340  NLEDSLVEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGR 519
            +++D  ++  G   G  P  YLGLPL +    ++    + +    +L  WK + LS  GR
Sbjct: 1369 DIDDLALD-FGCKVGSLPSTYLGLPLGAPFKSVAMWDGVEERFRKRLTMWKRQYLSKGGR 1427

Query: 520  TVLLRSVLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSS-KHKFSLISWKDIARPLE 696
              L+RS L +L IY+     +PSS+  +LE I R FL  G S + K  L+ WK +    +
Sbjct: 1428 ATLIRSTLSNLPIYYMSVLRLPSSVRSRLEQIQRDFLWGGGSLERKPHLVRWKVVCLSKK 1487

Query: 697  EGGLNIRCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRY--LKGEXXXXXXXXXXXXXX 870
            +GGL I+C  ++NKA L K  W+    +++LW + + G+Y   +G               
Sbjct: 1488 KGGLGIKCLSNLNKALLSKWNWRYANEREALWNQVIRGKYGEDRGGWSTREVREAHGVGL 1547

Query: 871  XRRIFKLRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAH 1050
             + I    DL    I   VGNGR  SFW D W     L    P +   +L I++ A VA 
Sbjct: 1548 WKGIRMDWDLVGARISFSVGNGRRVSFWRDRWCGXAPLCDSFPSI--YALSIEKEAWVA- 1604

Query: 1051 FRINDGWRFPYSRIPLI---REIWNQCSS--------------LYCLPTL-----EEDEV 1164
                D W       PL+   R  WN C S              L CL        E+D+V
Sbjct: 1605 ----DVWD------PLVQGGRGGWNPCFSRALNDWEMEEAELFLGCLHGKRVIGDEDDKV 1654

Query: 1165 VWKATTSGLFSTKSAWEITRFKHPKCSWAKLIWFPQMIPKHASLVWRLCLKKIPTMHKLK 1344
            VW  T SG+FS KS +       P    +  IW   + PK +   W     K  T+  ++
Sbjct: 1655 VWTETKSGIFSAKSLYLALEADCPSSFPSSCIWKVWVQPKISFFAWEAAWGKALTLDLVQ 1714

Query: 1345 HLKMVDSSSCIFCWTGEETEKHLFFSCHFSLGVWSLIAQKCFNSTFHASW--EASLRLID 1518
                  ++ C  C   EET  HL   C  +  +W L+      S F  SW    S+R   
Sbjct: 1715 RRGWSLANRCYMCMEKEETIDHLLLHCSKTRVLWELLF-----SLFGVSWVMPCSVRETL 1769

Query: 1519 KDFSSNSIDSIVVKLAFQASI---YHVWNERNRRRFQ 1620
              + ++S+     K+   A +   + VW  RNR  F+
Sbjct: 1770 LSWQTSSVGKKHRKVWRAAPLHIFWTVWKARNRLAFK 1806


>emb|CCA66153.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1381

 Score =  241 bits (616), Expect = 7e-61
 Identities = 197/695 (28%), Positives = 306/695 (44%), Gaps = 40/695 (5%)
 Frame = +1

Query: 4    SVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKP-FG 180
            S+L+NGSP    +  RG+RQGDP+SP+LF + +E L+ + K  + L     +  C+    
Sbjct: 613  SILINGSPTPPIKLHRGLRQGDPLSPFLFDLVVEPLNLLIKKAVSLKLWDGIETCRNGLR 672

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            I+HL +ADD ++F       L+     +  F   SGL++N  KS++    +  NL +   
Sbjct: 673  ITHLQYADDTIIFCPPKLEFLSNIKKTLILFQLASGLQVNFHKSSLLGVNVHENLLNDFA 732

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
            +      G  P  YLGLP+      +S   P+I  +E KL SWKS  LS  GR  L+++ 
Sbjct: 733  KHLLCKVGKLPFTYLGLPIGGNITRLSLWDPVISKLEKKLASWKSNLLSIGGRLTLIKAC 792

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSG-SSKHKFSLISWKDIARPLEEGGLNIR 717
            L +L +Y+   F IP  +  K+ +I RRFL SG SSK    L+SW  IA P   GGL + 
Sbjct: 793  LSNLPLYYMSLFPIPKGVLGKIVAIQRRFLWSGNSSKKGMPLVSWDLIALPKHLGGLGLG 852

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRY-LKGEXXXXXXXXXXXXXXXRRIF--- 885
                 N A L K IW+ +    +LW + VHG+Y LK                   I    
Sbjct: 853  NLHHKNTALLFKWIWRFLNEPHALWRQVVHGKYGLKDSFTTRDLSLSSYGGPWNGICNAI 912

Query: 886  ----KLRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHF 1053
                + + LA   +   +G+G +T FW D+W     L    P L ++SL  D    +  F
Sbjct: 913  LKSPQAKKLAFHQVRVQIGDGSNTLFWHDVWVGANPLKTECPRLFRLSLQQDAYVSLCGF 972

Query: 1054 RINDGWRFP--YSRIPLIREIWNQCSSL-----YCLPTLEEDEVVWKATTSGLFSTKS-A 1209
                 WR+   +SR    R++  Q + L       L    +D ++W  + SG+FS KS +
Sbjct: 973  WDGLCWRWSLLWSRPLRQRDLHEQATLLNIINRAVLQKDGKDHLIWAPSKSGIFSVKSFS 1032

Query: 1210 WEITRFKHPKCSWAKLIWFPQMIPKHASL-VWRLCLKKIPTMHKLKHLKMV--DSSSCIF 1380
             E+   +  +   A    +  ++P    + VW + L ++ T  KL +LK++  + SSCIF
Sbjct: 1033 LELANMEESRSFEATKELWKGLVPFRIEIFVWFVILGRLNTKEKLLNLKLISNEDSSCIF 1092

Query: 1381 CWTGEETEKHLFFSCHFSLGVWSLIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVK 1560
            C +  E+  HLF  C +S  +W    Q      ++ +W     +  K+  ++ I     K
Sbjct: 1093 CSSSIESTNHLFLECSYSKELWHWWFQ-----IWNVAWVLPSSI--KELFTHWIPPFKGK 1145

Query: 1561 L-------AFQASIYHVWNERNRRRFQSRGRXXXXXXXXXXXXVMSKLKNVSSNLSTS-- 1713
                     F   ++ +W ERN R FQ +              +   +K  +     S  
Sbjct: 1146 FFKKVWMSCFFIILWTIWKERNSRIFQEKPNSKLQLKELILLRLGWWIKGWNEPFPYSAE 1205

Query: 1714 ---NRFLAENWGLPLKLNVEFLEV----KWKPP---EREWQLACDGSFSSSRASCGGLLR 1863
                  L  NW  P+K     +       W PP     +W +      S  ++S GG+LR
Sbjct: 1206 DIVRNPLCLNWLTPVKPQKAIMPAPFPQHWSPPSIGSLKWNVDASIKSSLQKSSIGGVLR 1265

Query: 1864 SKKGELKLAFHSDCQIESSLRSEVKGLLFGLRIVA 1968
              KG     F S         +EV  +   L+I A
Sbjct: 1266 DHKGNFICMFSSPIPFMEINNAEVLAIHRALKISA 1300


>gb|AAG51098.1|AC025295_6 hypothetical protein [Arabidopsis thaliana]
          Length = 504

 Score =  238 bits (606), Expect = 1e-59
 Identities = 143/449 (31%), Positives = 222/449 (49%), Gaps = 11/449 (2%)
 Frame = +1

Query: 100  MEVLSSIFKHELMLGHLQLLNHCKPFGISHLMFADDLLVFLQASSNNLNAFLGLMRSFAE 279
            M+VLS +               CK  G++HL FADDL+V       ++   + +  +FA+
Sbjct: 1    MDVLSKLLDKAAGQRKFGYHPRCKQIGLTHLSFADDLMVLSDGKVRSIEGIVDVFDTFAK 60

Query: 280  YSGLEINKKKSNIFLAGISSNLEDSLVEASGFLKGLFPVKYLGLPLISARLEISHCQPLI 459
             S L+I+ +KS ++LAG+S      +++   F  G  PV+YLGLPL++ +   +   PLI
Sbjct: 61   CSDLKISMEKSTVYLAGLSHTTRQEVIDRFSFAVGTLPVRYLGLPLVTKQFSSTDYLPLI 120

Query: 460  DLMESKLQSWKSRSLSWAGRTVLLRSVLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSG 639
            D ++ K+ SW +R LS+ GR  L+ S+L S+  +W GAF +P     +++ +   +L SG
Sbjct: 121  DHIKQKICSWSARFLSYTGRLNLISSILWSICNFWMGAFRLPRDCIREIDKMCSAYLWSG 180

Query: 640  ----SSKHKFSLISWKDIARPLEEGGLNIRCPKDMNKAGLLKLIWKLITNKDSLWVRWVH 807
                +SK K   I+W  + +P EEGGL +R  K+ N    LKLIW++I++ DSLWV+W+ 
Sbjct: 181  GELNTSKAK---IAWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQ 237

Query: 808  GRYLKGE-XXXXXXXXXXXXXXXRRIFKLRDLAKDCICTIVGNGRSTSFWVDIWHNHGVL 984
               LK                  R+I K RD+A+      + NG  TSFW D W + G L
Sbjct: 238  SSLLKKVFFWAVRENTSLGSWMWRKILKFRDIARTLCKVEINNGAQTSFWYDDWSDLGRL 297

Query: 985  ALCIPELIQISLGIDRNALVAHFRINDGW---RFPYSRIPLIREIWNQCSSLYCLPTLEE 1155
                 +   I LGI+++A V      + W   R    R   +  +  +    +      E
Sbjct: 298  IESAGDRGAIDLGINKHATVV-----EAWGNRRRRRHRANFLNRVEERLVLSWNSRNQAE 352

Query: 1156 DEVVWKATTS---GLFSTKSAWEITRFKHPKCSWAKLIWFPQMIPKHASLVWRLCLKKIP 1326
            D  +WK   +    +FSTK  W   R    K +W K +WF Q IPKHA  +W     ++ 
Sbjct: 353  DCALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLS 412

Query: 1327 TMHKLKHLKMVDSSSCIFCWTGEETEKHL 1413
            T  ++    M   ++CI C    E+  HL
Sbjct: 413  TGDRMTLWNMGVDATCILCNNALESRDHL 441


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  237 bits (605), Expect = 1e-59
 Identities = 191/678 (28%), Positives = 307/678 (45%), Gaps = 20/678 (2%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCK--P 174
            FS+L+NG   G+F+S+RG+RQGD ISP LF +A E LS       +      L++    P
Sbjct: 1499 FSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLSRGLN--ALYDQYPSLHYSSGVP 1556

Query: 175  FGISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFL-AGISSNLED 351
              +SHL FADD+L+F   S + L   L  ++ + E SG  IN +KS       I ++   
Sbjct: 1557 LSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQ 1616

Query: 352  SLVEASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLL 531
             + +A+GF   L P+ YLG PL     ++     L+  +E ++  W+++ LS  GR  LL
Sbjct: 1617 IIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLL 1676

Query: 532  RSVLQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSSKHK-FSLISWKDIARPLEEGGL 708
            RSVL SL IY       P  +  ++  +   FL  GS+  K     SW  IA P+ EGGL
Sbjct: 1677 RSVLASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGL 1736

Query: 709  NIRCPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGEXXXXXXXXXXXXXXXRRIFK 888
            +IR   ++ +A  +KL W+  T  DSLW R++  +Y +G+               +R+  
Sbjct: 1737 DIRSLAEVFEAFSMKLWWRFRTT-DSLWTRFMRMKYCRGQLPMQTQPKLHDSQTWKRMLT 1795

Query: 889  LRDLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDG 1068
               + +  +   VG G +  FW D W     L     E     +       V  F  N+ 
Sbjct: 1796 SSTITEQHMRWRVGQG-NVFFWHDCWMGEAPLISSNQEFTSSMV------QVCDFFTNNS 1848

Query: 1069 WRFPYSRIPLIREIWNQCSSLYCLPTLEEDEVVWKATTSGLFSTKSAWEITRFKHPKCSW 1248
            W     +  L +E+ ++ + +  + T+ +DE  W  T +G FSTKSAW++ R +      
Sbjct: 1849 WNIEKLKTVLQQEVVDEIAKI-PIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPV 1907

Query: 1249 AKLIWFPQMIPKHASLVWRLCLKKIPTMHKLKHLKMVDSSSCIFCWTGEETEKHLFFSCH 1428
               IW   +    +  +WRL    IP   K+K   +  +S C  C   EE+  H+ +   
Sbjct: 1908 FNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRC-RCCKSEESIMHVMWDNP 1966

Query: 1429 FSLGVWS--------LIAQKCFNSTFHASWEASLRLIDKDFSSNSIDSIVVKLAFQASIY 1584
             ++ VW+        LI   C  +    +W         D+        +V L     ++
Sbjct: 1967 VAMQVWNYFAKLFQILIINPCTINQIIGAW-----FYSGDYCKPGHIRTLVPLFI---LW 2018

Query: 1585 HVWNERNRRRFQSRGRXXXXXXXXXXXXV--MSKLKNVSSNLSTSNRFLAENWGLPLKLN 1758
             +W ERN  + ++ G             +  +S  + +       ++ +A+ WG+  +  
Sbjct: 2019 FLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQ-- 2076

Query: 1759 VEFLE----VKW-KPPEREWQLACDGSFSSS-RASCGGLLRSKKGELKLAFHSDCQIESS 1920
             E L       W KP   E++L  DGS   S  A+ GG+LR   GE+   F  +   ++S
Sbjct: 2077 AESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLGTQNS 2136

Query: 1921 LRSEVKGLLFGLRIVADY 1974
            L++E+  L  GL +  DY
Sbjct: 2137 LQAELLALYRGLILCRDY 2154


>dbj|BAB01344.1| reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1115

 Score =  237 bits (605), Expect = 1e-59
 Identities = 146/395 (36%), Positives = 208/395 (52%), Gaps = 3/395 (0%)
 Frame = +1

Query: 1    FSVLVNGSPAGFFRSQRGIRQGDPISPYLFSIAMEVLSSIFKHELMLGHLQLLNHCKPFG 180
            FSV VNG  AG+FRS RGIRQG  +SPYLF I+MEVLS +               CK  G
Sbjct: 555  FSVQVNGELAGYFRSARGIRQGCALSPYLFVISMEVLSKMLDQAAGAKRFGFHPKCKNLG 614

Query: 181  ISHLMFADDLLVFLQASSNNLNAFLGLMRSFAEYSGLEINKKKSNIFLAGISSNLEDSLV 360
            ++HL FADDL++       +++  + +M  FA+ SGL+IN +K+ ++ AG+S +    ++
Sbjct: 615  LTHLCFADDLMILTDGKVRSVDGIVEVMNLFAKRSGLKINMEKTTLYTAGVSDHNRHMMI 674

Query: 361  EASGFLKGLFPVKYLGLPLISARLEISHCQPLIDLMESKLQSWKSRSLSWAGRTVLLRSV 540
                F     PV+YLGLPL++ RL      PL + + +++ +W SR LS+AGR  L+ SV
Sbjct: 675  SRYPFGLAQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSV 734

Query: 541  LQSLYIYWSGAFAIPSSIFCKLESIMRRFLVSGSS-KHKFSLISWKDIARPLEEGGLNIR 717
            L S   +W  AF +PS+   ++ SI   FL SG     + + +SW DI +P ++GGL +R
Sbjct: 735  LWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELNRRKAKVSWDDICKP-KQGGLGLR 793

Query: 718  CPKDMNKAGLLKLIWKLITNKDSLWVRWVHGRYLKGE-XXXXXXXXXXXXXXXRRIFKLR 894
               + N   +LKLIW++ +N DSLWV+W     LK E                +++ K R
Sbjct: 794  SLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWSLKPNSSLGSWMWKKMLKYR 853

Query: 895  DLAKDCICTIVGNGRSTSFWVDIWHNHGVLALCIPELIQISLGIDRNALVAHFRINDGWR 1074
            + AK      V NG  TSFW D W   G L     +  QI LGI RN  VA    N   R
Sbjct: 854  ETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEAWSNR--R 911

Query: 1075 FPYSRIPLIREIWNQCSSLY-CLPTLEEDEVVWKA 1176
                R   + +I    +  Y     L ED  +W+A
Sbjct: 912  RRKHRTEQLNDIEAALNQKYQTRILLREDAALWRA 946

