BLASTX nr result

ID: Akebia27_contig00030210 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00030210
         (1321 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   235   4e-59
dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thal...   217   8e-54
gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]              216   1e-53
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   213   2e-52
emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-li...   213   2e-52
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   212   3e-52
ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, part...   211   8e-52
gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza ...   211   8e-52
gb|EEC76169.1| hypothetical protein OsI_13484 [Oryza sativa Indi...   210   1e-51
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   209   3e-51
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...   208   5e-51
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...   207   6e-51
emb|CAN68838.1| hypothetical protein VITISV_030956 [Vitis vinifera]   207   6e-51
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   207   8e-51
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   206   1e-50
ref|XP_004298219.1| PREDICTED: uncharacterized protein LOC101304...   205   4e-50
ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247...   205   4e-50
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   205   4e-50
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   203   2e-49
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   202   2e-49

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  235 bits (599), Expect = 4e-59
 Identities = 138/441 (31%), Positives = 225/441 (51%), Gaps = 5/441 (1%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIA---TTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDF 172
            VYA      R  LW +++ +A   TT   PW++LGD N +L+  +   G       +E+F
Sbjct: 108  VYAVNCRYGRRRLWSELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEF 167

Query: 173  RECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLS 352
            RECL  + + DL     H TW N Q    I  K+DR+LVN+ W+     ++ SF     S
Sbjct: 168  RECLLTSNISDLPFRGNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFS 227

Query: 353  DHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWE-IKVSGNPMFKLIMKLKNV 529
            DH P+ V IS +     +PFK  NF     EF+  ++  W+ +   G+ MF L  K K +
Sbjct: 228  DHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFL 287

Query: 530  KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709
            K  ++T+++  +  ++  +     +L   Q N+  +P +  LA  E+   +  + L+  E
Sbjct: 288  KGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAP-SSYLAGLEKEAHRSWAELALAE 346

Query: 710  ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889
            E    QKSRV WLK GDS+T  FH+ M  RRA N+I  +  + G  + +  +++   +  
Sbjct: 347  ERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDF 406

Query: 890  FKATFGTPAK-CNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGP 1066
            FK  FG+ +   +    SQ       + +E  R                F +  +K+ GP
Sbjct: 407  FKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGP 466

Query: 1067 DGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPIS 1246
            DG+++ FF++ W I+    + A++    +G++  + N+TA+T++ K     R+ EFRPIS
Sbjct: 467  DGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKKPNADRITEFRPIS 526

Query: 1247 CCNVVCKAISKVLANRLKPLL 1309
            CCN + K ISK+LA RL+ +L
Sbjct: 527  CCNAIYKVISKLLARRLENIL 547


>dbj|BAF00918.1| putative reverse transcriptase [Arabidopsis thaliana]
          Length = 910

 Score =  217 bits (553), Expect = 8e-54
 Identities = 139/448 (31%), Positives = 223/448 (49%), Gaps = 8/448 (1%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDE--KIEGRMPSKNSIE 166
            VY       R  LW DI  ++ T  +   PW++LGD N      E   I   + +   +E
Sbjct: 107  VYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGME 166

Query: 167  DFRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSG 346
            D + CL +++L DL +     TW N Q +  I+ KLDR L N EW   F  A + F P G
Sbjct: 167  DLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG 226

Query: 347  LSDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIK-VSGNPMFKLIMKLK 523
             SDH+P ++ I  +     + FK+F+F +    ++  +  AWE   + G+ MF L   LK
Sbjct: 227  DSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEANTLVGSHMFSLRQHLK 286

Query: 524  NVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK 703
              KL  +T ++ +F N+        + L ++Q+ +  SP +  L   E    K+  F + 
Sbjct: 287  VAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-SDTLFRREHVARKQWIFFAA 345

Query: 704  CEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVI 883
              ES  +QKSR+ WL  GD++TR FH+++   +A N I  ++ ++G  + +   I+  +I
Sbjct: 346  ALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLI 405

Query: 884  RHFKATFGTPAKCNTSVFS--Q*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKA 1057
             ++    G P++ N + FS  + +  L  + +                    F M  +KA
Sbjct: 406  AYYSHLLGIPSE-NVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKA 464

Query: 1058 LGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFR 1237
             GPDGF   FF  AW I+    + AI+    +G + R  NATAITLI KV    RL +FR
Sbjct: 465  PGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFR 524

Query: 1238 PISCCNVVCKAISKVLANRLKPLLHKLV 1321
            P++CC  + K I+++++ RLK  + + V
Sbjct: 525  PVACCTTIYKVITRIISRRLKLFIDQAV 552


>gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana]
          Length = 1161

 Score =  216 bits (551), Expect = 1e-53
 Identities = 139/448 (31%), Positives = 223/448 (49%), Gaps = 8/448 (1%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDE--KIEGRMPSKNSIE 166
            VY       R  LW DI  ++ T  +   PW++LGD N      E   I   + +   +E
Sbjct: 150  VYGRNSELDRRSLWEDILVLSRTSPLSVTPWLLLGDFNQIAAASEHYSINQSLLNLRGME 209

Query: 167  DFRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSG 346
            D + CL +++L DL +     TW N Q +  I+ KLDR L N EW   F  A + F P G
Sbjct: 210  DLQCCLRDSQLSDLPSRGVFFTWSNHQQDNPILRKLDRALANGEWFAVFPSALAVFDPPG 269

Query: 347  LSDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIK-VSGNPMFKLIMKLK 523
             SDH+P ++ I  +     + FK+F+F +    ++  +  AWE   + G+ MF L   LK
Sbjct: 270  DSDHAPCIILIDNQPPPSKKSFKYFSFLSSHPSYLAALSTAWEENTLVGSHMFSLRQHLK 329

Query: 524  NVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK 703
              KL  +T ++ +F N+        + L ++Q+ +  SP +  L   E    K+  F + 
Sbjct: 330  VAKLCCRTLNRLRFSNIQQRTAQSLTRLEDIQVELLTSP-SDTLFRREHVARKQWIFFAA 388

Query: 704  CEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVI 883
              ES  +QKSR+ WL  GD++TR FH+++   +A N I  ++ ++G  + +   I+  +I
Sbjct: 389  ALESFFRQKSRIRWLHEGDANTRFFHRAVIAHQATNLIKFLRGDDGFRVENVDQIKGMLI 448

Query: 884  RHFKATFGTPAKCNTSVFS--Q*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKA 1057
             ++    G P++ N + FS  + +  L  + +                    F M  +KA
Sbjct: 449  AYYSHLLGIPSE-NVTPFSVEKIKGLLPFRCDSFLASQLTTIPSEEEITQVLFSMPRNKA 507

Query: 1058 LGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFR 1237
             GPDGF   FF  AW I+    + AI+    +G + R  NATAITLI KV    RL +FR
Sbjct: 508  PGPDGFPVEFFIEAWAIVKSSVVAAIREFFISGNLPRGFNATAITLIPKVTGADRLTQFR 567

Query: 1238 PISCCNVVCKAISKVLANRLKPLLHKLV 1321
            P++CC  + K I+++++ RLK  + + V
Sbjct: 568  PVACCTTIYKVITRIISRRLKLFIDQAV 595


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  213 bits (541), Expect = 2e-52
 Identities = 135/449 (30%), Positives = 235/449 (52%), Gaps = 9/449 (2%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMPSKNSIEDF 172
            VYAS +   R ELW ++  +A +  V    W+VLGD N  LN +  I   +  K  I  F
Sbjct: 109  VYASNEEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRK--IRAF 166

Query: 173  RECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLS 352
            R CL ++ L DL  +    TW N+     +  K+DR+LVN+ W   F  A+++F     S
Sbjct: 167  RSCLLDSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFS 226

Query: 353  DHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKNV 529
            DHS   V +        RPF+FFN++  + +F+ +++E W    VSG+ M+++  KLK++
Sbjct: 227  DHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHL 286

Query: 530  KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709
            KL +  +S+  + +++  + +  + + + Q     +P +   A  E    ++   L+K E
Sbjct: 287  KLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAE 345

Query: 710  ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889
            ES   QKS ++WL  GD++T  FH+    R++ N I  +  + GE +  +  I+ E I+ 
Sbjct: 346  ESFFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIK-EGIKE 404

Query: 890  FKATFGTPAKCNTS-VFSQ*RDDLHHQLN---EEDRMXXXXXXXXXXXXXXXFQMMP-DK 1054
                F     C      S  + D++  L+     D++               F  +P +K
Sbjct: 405  HSCNFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNK 464

Query: 1055 ALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEF 1234
            A GPDG+S+ FF+  W ++  +  +A++    +G++ ++ NAT + LI K+   S++ +F
Sbjct: 465  ASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDF 524

Query: 1235 RPISCCNVVCKAISKVLANRLKPLLHKLV 1321
            RPISC N + K I+K+L +RLK LL++++
Sbjct: 525  RPISCLNTLYKVIAKLLTSRLKKLLNEVI 553


>emb|CAA66812.1| non-ltr retrotransposon reverse transcriptase-like protein
            [Arabidopsis thaliana]
          Length = 893

 Score =  213 bits (541), Expect = 2e-52
 Identities = 135/449 (30%), Positives = 235/449 (52%), Gaps = 9/449 (2%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMPSKNSIEDF 172
            VYAS +   R ELW ++  +A +  V    W+VLGD N  LN +  I   +  K  I  F
Sbjct: 109  VYASNEEGTRKELWNELVQLALSPVVVGRSWIVLGDFNQILNPESAINANIGRK--IRAF 166

Query: 173  RECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLS 352
            R CL ++ L DL  +    TW N+     +  K+DR+LVN+ W   F  A+++F     S
Sbjct: 167  RSCLLDSDLYDLVYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFS 226

Query: 353  DHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKNV 529
            DHS   V +        RPF+FFN++  + +F+ +++E W    VSG+ M+++  KLK++
Sbjct: 227  DHSSCEVVLDPAVLKAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHL 286

Query: 530  KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709
            KL +  +S+  + +++  + +  + + + Q     +P +   A  E    ++   L+K E
Sbjct: 287  KLPICCFSRENYSDIEKRVSEAHAIVLHRQRITLTNP-SVVHATLELEATRKWQILAKAE 345

Query: 710  ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889
            ES   QKS ++WL  GD++T  FH+    R++ N I  +  + GE +  +  I+ E I+ 
Sbjct: 346  ESFFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIK-EGIKE 404

Query: 890  FKATFGTPAKCNTS-VFSQ*RDDLHHQLN---EEDRMXXXXXXXXXXXXXXXFQMMP-DK 1054
                F     C      S  + D++  L+     D++               F  +P +K
Sbjct: 405  HSCNFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNK 464

Query: 1055 ALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEF 1234
            A GPDG+S+ FF+  W ++  +  +A++    +G++ ++ NAT + LI K+   S++ +F
Sbjct: 465  ASGPDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDF 524

Query: 1235 RPISCCNVVCKAISKVLANRLKPLLHKLV 1321
            RPISC N + K I+K+L +RLK LL++++
Sbjct: 525  RPISCLNTLYKVIAKLLTSRLKKLLNEVI 553


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score =  212 bits (539), Expect = 3e-52
 Identities = 138/449 (30%), Positives = 222/449 (49%), Gaps = 9/449 (2%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMT---VPWMVLGDMNVTLNHDEKIEGRMPSKNS-IED 169
            VYA+ +A  R ELW ++  ++ +++    PW++LGD N  L   E  +    + N  ++ 
Sbjct: 58   VYAANEAITRKELWEELLLLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATSLNVNRRMKV 117

Query: 170  FRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGL 349
            FR+CLFEA L DL  +    TW N+     +  KLDR+LVN  W   F  A++ F     
Sbjct: 118  FRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSAYAVFGEPDF 177

Query: 350  SDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKN 526
            SDH+   V I+       RPF+F+NF   + +F+++V E W  I V G+ MFK+  KLK 
Sbjct: 178  SDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSMFKMSKKLKA 237

Query: 527  VKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKC 706
            +K  ++T+S   F N++  +K+  + +   Q      P   + A +  A  K +  L K 
Sbjct: 238  LKNPIRTFSMENFSNLEKRVKEAHNLVLYRQNKTLSDPTIPNAALEMEAQRKWL-ILVKA 296

Query: 707  EESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIR 886
            EES   Q+SRV W+  GDS+T  FH+    R+A N I  I  +NG  +  +  I+   I 
Sbjct: 297  EESFFCQRSRVTWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEHCIE 356

Query: 887  HFKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGP 1066
            +F    G        +       L  + + + +                F    +K  GP
Sbjct: 357  YFSNLLGGEVGPPMLIQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGP 416

Query: 1067 DGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPIS 1246
            DGF   FF+  W +I  +   A+     +  + ++ NAT + LI K+   S++N+FRPIS
Sbjct: 417  DGFPVEFFKETWSVIGTEVTDAVSEFFTSSVLLKQWNATTLVLIPKITNASKMNDFRPIS 476

Query: 1247 CCN----VVCKAISKVLANRLKPLLHKLV 1321
            C +     + K I+++L NRL+ LL +++
Sbjct: 477  CNDFGPITLYKVIARLLTNRLQCLLSQVI 505


>ref|XP_007224193.1| hypothetical protein PRUPE_ppa017155mg, partial [Prunus persica]
            gi|462421129|gb|EMJ25392.1| hypothetical protein
            PRUPE_ppa017155mg, partial [Prunus persica]
          Length = 916

 Score =  211 bits (536), Expect = 8e-52
 Identities = 134/425 (31%), Positives = 203/425 (47%), Gaps = 4/425 (0%)
 Frame = +2

Query: 59   IATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFRECLFEARLQDLKAERCHLTWC 238
            +  T  +PW+  GD N  L  DEK+ GR   +  +  FR+ +     +D+       TW 
Sbjct: 1    LEATNYLPWLCCGDFNEILRADEKLGGRRRREGQMLGFRQAIDTCGFKDMGYTGPKYTWW 60

Query: 239  -NRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHSPAVVTISKKRKICGRPFK 415
             N  +E RI  +LDRVL   +W   F       L    SDH P  VTIS++  + GR  K
Sbjct: 61   RNNPMEIRI--RLDRVLATADWCSRFLGTKVIHLNPTKSDHLPLKVTISERMLLNGRRKK 118

Query: 416  FFNF---WADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDLKTWSKNKFGNMDNSI 586
             F F   WA+    M  +Q+ W+    G+  F    KLK  +  L  WSK  FG++ N I
Sbjct: 119  LFRFEEMWAEHVNCMQTIQDGWQRTCRGSAPFTTTEKLKCTRHQLLGWSKCNFGHLPNQI 178

Query: 587  KDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESAAKQKSRVNWLKLGDSH 766
            K  +  L  L     ++P +        A+ K++  L    E   +Q+SR  WLK GD +
Sbjct: 179  KITREKLGELL----DAPPSHHTVELRNALTKQLDSLMAKNEVYWRQRSRATWLKAGDRN 234

Query: 767  TRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKATFGTPAKCNTSVFSQ* 946
            ++ FH      R RN I++++ E+G +   E  +   V+ +F+  F +     +S +++ 
Sbjct: 235  SKFFHYKASSCRRRNTISALEDEHGHWQTTEQGLTQTVVNYFQHLFSS---IGSSDYTEV 291

Query: 947  RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGFSACFFQRAWIIINRDFL 1126
             D +  ++ EE                  FQM P KA GPD FS  F+Q+ W I+  D +
Sbjct: 292  VDGVRGRVTEEMNQALLAEFTPEEIKIALFQMHPSKAPGPDDFSPFFYQKYWQIVGEDMV 351

Query: 1127 KAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCNVVCKAISKVLANRLKPL 1306
             A+      GK+ +++N T + LI KV  P  + + RPIS CNV  K  +KVLA  LK +
Sbjct: 352  AAVLHFFKTGKLLKKINFTHVALIPKVHEPKNMTQLRPISLCNVFNKIGAKVLATHLKAI 411

Query: 1307 LHKLV 1321
            L  L+
Sbjct: 412  LPTLI 416


>gb|AAK71569.2|AC087852_29 putative reverse transcriptase [Oryza sativa Japonica Group]
          Length = 1833

 Score =  211 bits (536), Expect = 8e-52
 Identities = 137/444 (30%), Positives = 220/444 (49%), Gaps = 4/444 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            VY    AQ R  +W  ++ I +    PW+++GD N  +   E    R  S++ + DFRE 
Sbjct: 85   VYGEPRAQDRHLMWSLLRRIRSNSGDPWLMIGDFNEAMWQTEHKSHRKRSESQMRDFREV 144

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGR-IMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358
            L E  L D+  +    T+CN Q EGR +  +LDR + +  W   F +A  + L +  SDH
Sbjct: 145  LSECDLHDIGFQGAPWTFCNMQREGRNVKVRLDRGVASPAWSSRFPQAVITHLTTPSSDH 204

Query: 359  SPAVVTISKKRKICGRPFKFFNF---WADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNV 529
            +P +  + ++     RP K   +   W  +S    V+QEAW +    + +  +  K+K  
Sbjct: 205  APLL--LEREETTLARPMKIMRYEEVWERESSLPEVIQEAWTMGADASTLGDINDKMKVT 262

Query: 530  KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709
               L +WSK+K GN+   IKDL+  L  L+ NI       D   +  ++ KE+  +   E
Sbjct: 263  MTKLVSWSKDKIGNVRKKIKDLREKLGELR-NI----GLLDTDNEVHSVKKELEEMLHRE 317

Query: 710  ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889
            E   KQ+SR+ WLK GD +TR FH     R  +NKI  +K  +G    ++ +++ E+ R 
Sbjct: 318  EIWWKQRSRITWLKEGDLNTRYFHLKASWRAKKNKIKKLKKNDGSTTMNKKEMK-EINRS 376

Query: 890  FKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPD 1069
            F     T       V     +  H +++E+                  FQ+ P KA GPD
Sbjct: 377  FFQQLYTKDDNLNPV--NLLNMFHEKISEQMNADLIKPFTNEEISDALFQIGPLKAPGPD 434

Query: 1070 GFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISC 1249
            GF A F QR W ++  + + A+++   +  +   VN T I +I K  +   + +FRPIS 
Sbjct: 435  GFPARFLQRNWGLLKGEVIAAVRNFFEDEVMQEGVNDTVIVMIPKKNLAEDMKDFRPISL 494

Query: 1250 CNVVCKAISKVLANRLKPLLHKLV 1321
            CNVV K ++K L NR++P+L +++
Sbjct: 495  CNVVYKVVAKCLVNRMRPMLQEII 518


>gb|EEC76169.1| hypothetical protein OsI_13484 [Oryza sativa Indica Group]
          Length = 1874

 Score =  210 bits (534), Expect = 1e-51
 Identities = 136/444 (30%), Positives = 220/444 (49%), Gaps = 4/444 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            +Y    AQ R  +W  ++ I +    PW+++GD N  +   E    R  S++ + DFRE 
Sbjct: 60   LYGEPRAQDRHLMWSLLRRIRSNSGDPWLMIGDFNEAMWQTEHKSHRKRSESQMRDFREV 119

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGR-IMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358
            L E  L D+  +    T+CN Q EGR +  +LDR + +  W   F +A  + L +  SDH
Sbjct: 120  LSECDLHDIGFQGAPWTFCNMQREGRNVKVRLDRGVASPAWSSRFPQAVITHLTTPSSDH 179

Query: 359  SPAVVTISKKRKICGRPFKFFNF---WADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNV 529
            +P +  + ++     RP K   +   W  +S    V+QEAW +    + +  +  K+K  
Sbjct: 180  APLL--LEREETTLARPMKIMRYEEVWERESSLPEVIQEAWTMGADASTLGDINDKMKVT 237

Query: 530  KLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCE 709
               L +WSK+K GN+   IKDL+  L  L+ NI       D   +  ++ KE+  +   E
Sbjct: 238  MTKLVSWSKDKIGNVRKKIKDLREKLGELR-NI----GLLDTDNEVHSVKKELEEMLHRE 292

Query: 710  ESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRH 889
            E   KQ+SR+ WLK GD +TR FH     R  +NKI  +K  +G    ++ +++ E+ R 
Sbjct: 293  EIWWKQRSRITWLKEGDLNTRYFHLKASWRAKKNKIKKLKKNDGSTTMNKKEMK-EISRS 351

Query: 890  FKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPD 1069
            F     T       V     +  H +++E+                  FQ+ P KA GPD
Sbjct: 352  FFQQLYTKDDNLNPV--NLLNMFHEKISEQMNADLIKPFTDEEISDALFQIGPLKAPGPD 409

Query: 1070 GFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISC 1249
            GF A F QR W ++  + + A+++   +  +   VN T I +I K  +   + +FRPIS 
Sbjct: 410  GFPARFLQRNWGLLKGEVIAAVRNFFEDEVMQEGVNDTVIVMIPKKNLAEDMKDFRPISL 469

Query: 1250 CNVVCKAISKVLANRLKPLLHKLV 1321
            CNVV K ++K L NR++P+L +++
Sbjct: 470  CNVVYKVVAKCLVNRMRPMLQEII 493


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  209 bits (531), Expect = 3e-51
 Identities = 130/443 (29%), Positives = 215/443 (48%), Gaps = 3/443 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            VYA      R  LW  ++ +A  M  PW+V GD N+ L  +E++ G  P + SIEDF   
Sbjct: 951  VYAKCTRSERTPLWNCLRNLAADMEGPWIVGGDFNIILKREERLYGADPHEGSIEDFASV 1010

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361
            L +  L D   E    TW N     R+  +LDR++ N +WI++F       L    SDH 
Sbjct: 1011 LLDCGLLDGGFEGNPFTWTNN----RMFQRLDRMVYNQQWINKFPITRIQHLNRDGSDHC 1066

Query: 362  PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDL 541
            P +++ S   +     F+F + WA    F   V+  W + ++G+ +     K K +K  L
Sbjct: 1067 PLLLSCSNSSEKAPSSFRFLHAWALHHNFNASVEGNWNLPINGSGLMAFWSKQKRLKQHL 1126

Query: 542  KTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK---CEE 712
            K W+K  FG++ ++IK+ +  +   ++        +        + K  + L+K    EE
Sbjct: 1127 KWWNKTVFGDIFSNIKEAEKRVEECEI----LHQQEQTIGSRIQLNKSYAQLNKQLSMEE 1182

Query: 713  SAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHF 892
               KQKS V W+  G+ +T+ FH  M+++R R+ I  I+ ++G ++ D   ++   I  F
Sbjct: 1183 IFWKQKSGVKWVVEGERNTKFFHMRMQKKRIRSHIFKIQEQDGNWIEDPEQLQQSAIDFF 1242

Query: 893  KATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDG 1072
             +     +  +T   S     +   +++ D                 F + P+ A GPDG
Sbjct: 1243 SSLLKAESCDDTRFQSSLCPSI---ISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDG 1299

Query: 1073 FSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCC 1252
            FS+ F+Q+ W II  D  +A+K       I + + +T + LI K    S+ +EFRPIS C
Sbjct: 1300 FSSHFYQQCWDIIAHDLFEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLC 1359

Query: 1253 NVVCKAISKVLANRLKPLLHKLV 1321
             V+ K I+K+LANRL  +L  ++
Sbjct: 1360 TVMNKIITKILANRLAKILPSII 1382


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score =  208 bits (529), Expect = 5e-51
 Identities = 132/444 (29%), Positives = 218/444 (49%), Gaps = 13/444 (2%)
 Frame = +2

Query: 29   RWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMP--SKNSIEDFRECLFEA 193
            R ELW D++  + +  +   PW++ GD N  L+ +E    R    +   + DF+  +   
Sbjct: 4    RKELWNDLRDHSDSPIIRSKPWIIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNHC 63

Query: 194  RLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHSPAVV 373
             + DL       TW N++    I  KLDRVLVN+ W+  F  ++S F   G SDH    +
Sbjct: 64   SITDLAYHGPLFTWSNKRENDLIAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLRCRI 123

Query: 374  TISKKRKIC---GRPFKFFNFWADDSEFMTVVQEAWE----IKVSGNPMFKLIMKLKNVK 532
             ++          RPFKF N   +   F+  V+  W     I +S + +F+   KLK +K
Sbjct: 124  NLNVGAGAVVKGKRPFKFVNVITEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLK 183

Query: 533  LDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEE 712
              L+   K + GN+    K+   +L   Q     +P    +  +  A  K    ++  EE
Sbjct: 184  PLLRNLGKERLGNLVKQTKEAFETLCQKQAMKMANPSPSSMQEENEAYAKW-DHIAVLEE 242

Query: 713  SAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHF 892
               KQ+S+++WL +GD + ++FH+++  R A+N I  I   +G   + E  I+ E   HF
Sbjct: 243  KFLKQRSKLHWLDIGDRNNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHF 302

Query: 893  KATFGT-PAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPD 1069
            +      P         + +D L ++ ++ D+                F M  DK+ GPD
Sbjct: 303  REFLQLIPNDFEGIAVEELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPGPD 362

Query: 1070 GFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISC 1249
            G++A F++ AW II  +F+ AI+S    G + + +N+T + LI K +    + ++RPISC
Sbjct: 363  GYTAEFYKGAWNIIGAEFILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRPISC 422

Query: 1250 CNVVCKAISKVLANRLKPLLHKLV 1321
            CNV+ K ISK++ANRLK +L K +
Sbjct: 423  CNVLYKVISKIIANRLKLVLPKFI 446


>ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum
            lycopersicum]
          Length = 1333

 Score =  207 bits (528), Expect = 6e-51
 Identities = 134/442 (30%), Positives = 213/442 (48%), Gaps = 2/442 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            VYA    Q+R  LW DI    +    PW ++GD NV  +  EK+ GR  + N   +F   
Sbjct: 50   VYAKCKDQLRKPLW-DIMLKRSETMYPWSIIGDFNVITSTSEKLGGRDYNINKSLEFINI 108

Query: 182  LFEARLQDLKAERCHLTWCNRQIEG-RIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358
            +    L D+       TWCN + +G RI  +LDR + N++WI+    +  + LPS  SDH
Sbjct: 109  IEACGLVDMGYHGQDYTWCNHRKDGARIWKRLDRGMTNDKWIETIPHSSITHLPSVGSDH 168

Query: 359  SPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLD 538
             P ++ I   +    + FKF N W ++  F+  V++ W+  V GNPM+    KL+ +   
Sbjct: 169  CPLLMEICDIQSNTIKYFKFLNCWTENDSFLETVEKCWKRDVIGNPMWNFHTKLRRLTKT 228

Query: 539  LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESA 718
            L+ WSK ++G++   +K L   L     NI    ++   +    AI  E    SK E   
Sbjct: 229  LRIWSKQEYGDVFEKVK-LYEDLVKKAENIIIDNYSAKNSEKLNAINAEYIKFSKMEYKI 287

Query: 719  AKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKA 898
             +QK++++WL+ GD++T+ FH  ++ +R R  I  +  E+G ++  E +I      +++ 
Sbjct: 288  LQQKTQLHWLQEGDANTKYFHTVIRGKRNRMSIHKLMDESGNWIKGEEEIAKHACDYYEK 347

Query: 899  TF-GTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGF 1075
             F G   K    +       ++  + +E                    M P  A GPDGF
Sbjct: 348  IFTGMNGKIKEDIL----QCINPMITQEQNKDLDRIPDMDELRRTIMSMNPHSAPGPDGF 403

Query: 1076 SACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCN 1255
               F+Q  + II  D L A+K       + R +    +TLI K++ P RL +FRPIS  N
Sbjct: 404  GGKFYQVCFDIIKEDLLAAVKHFYVGNIMPRYLTHACLTLIPKIDHPCRLKDFRPISLSN 463

Query: 1256 VVCKAISKVLANRLKPLLHKLV 1321
               K ISK+L+ RL  +L  +V
Sbjct: 464  FTNKIISKILSTRLALILPSIV 485


>emb|CAN68838.1| hypothetical protein VITISV_030956 [Vitis vinifera]
          Length = 1881

 Score =  207 bits (528), Expect = 6e-51
 Identities = 142/443 (32%), Positives = 220/443 (49%), Gaps = 3/443 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            VY   ++ +R +LW ++  IA   +  W V GD NV     EK+ G   +  S++DF + 
Sbjct: 937  VYGPNNSALRKDLWVELSDIAGLASPRWCVGGDFNVIRRSSEKLGGSRLTP-SMKDFDDF 995

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361
            + +  L DL       TW N Q+   +  +LDR L +NEW   F ++    LP   SDH 
Sbjct: 996  ISDCELIDLPLRSASFTWSNMQVNP-VCKRLDRFLYSNEWEQTFPQSIQGVLPRWTSDHW 1054

Query: 362  PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKNVKLD 538
            P V+  +   K    PF+F N W     F       W E + +G    K + KL+ VK  
Sbjct: 1055 PIVLE-TNPFKWGPTPFRFENMWLQHPSFKENFGRWWREFQGNGWEGHKFMRKLQFVKAK 1113

Query: 539  LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIK-EISFLSKCEES 715
            LK W+K  FG +    +D+ S+L N      E   + +L A +RAI K E+  L   EE 
Sbjct: 1114 LKVWNKASFGELSKRKEDILSALVNFDSLEQEGGLSHELLA-QRAIKKGELEELILREEI 1172

Query: 716  AAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFK 895
              +QK+RV W+K GD +++ FH+    RR R  I  ++ ENG+ + +   I+ E++R+F+
Sbjct: 1173 HWRQKARVKWVKEGDCNSKFFHKVANGRRNRKFIKELENENGQMMNNSESIKEEILRYFE 1232

Query: 896  ATFGTPAKCNTSVFSQ*RDDLH-HQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDG 1072
              + +P+  +  V     + L    ++ E  +               FQM  DKA GPDG
Sbjct: 1233 KLYTSPSGESWRV-----EGLDWSPISGESAVRLESPFTEEEICKAIFQMDRDKAPGPDG 1287

Query: 1073 FSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCC 1252
            F+   FQ  W +I  D +K       +G I +  NA+ I L+ K  M  R+++FRPIS  
Sbjct: 1288 FTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQSTNASFIVLLPKKSMSRRISDFRPISLI 1347

Query: 1253 NVVCKAISKVLANRLKPLLHKLV 1321
              + K I+KVLA R++ +LH+ +
Sbjct: 1348 TSLYKIIAKVLAGRIREVLHETI 1370


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  207 bits (527), Expect = 8e-51
 Identities = 128/440 (29%), Positives = 211/440 (47%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            VYA      R ELW  ++ I+  M  PW+V GD N  ++ DE++ G +P   S+ED    
Sbjct: 952  VYAKCTRIERRELWTSLRIISDGMQAPWLVGGDFNSIVSCDERLNGAIPHDGSMEDLSST 1011

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361
            LF+  L D   E    TW N     R+  +LDRV+ N EW + F+      L    SDH 
Sbjct: 1012 LFDCGLLDAGFEGNSFTWTNN----RMFQRLDRVVYNQEWAEFFSSTRVQHLNRDGSDHC 1067

Query: 362  PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDL 541
            P +++ S   +     F+F + W    +F++ V+++W   +    +     K + +K DL
Sbjct: 1068 PLLISCSNTNQRGPATFRFLHAWTKHHDFISFVEKSWNTPIHAEGLNAFWTKQQRLKRDL 1127

Query: 542  KTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESAA 721
            K W+K+ FG++   ++  +      ++N  ++P   +     +A  K    LS  EE   
Sbjct: 1128 KWWNKHIFGDIFKILRLAEVEAEQRELNFQQNPSAANRELMHKAYAKLNRQLS-IEELFW 1186

Query: 722  KQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKAT 901
            +QKS V WL  G+ +T+ FH  M+++R RN I  I+ + G  L +   I+   +  F+  
Sbjct: 1187 QQKSGVKWLVEGERNTKFFHMRMRKKRMRNHIFRIQDQEGNVLEEPHLIQNSGVEFFQNL 1246

Query: 902  FGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGFSA 1081
                 +C+ S F          ++  D                 F +  D   GPDGFS+
Sbjct: 1247 LKAE-QCDISRFDP--SITPRIISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDGFSS 1303

Query: 1082 CFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCNVV 1261
             F+Q  W II +D  +A+        + R + +T + L+ K +  S+ +EFRPIS C V+
Sbjct: 1304 LFYQHCWDIIKQDLFEAVLDFFKGSPLPRGITSTTLVLLPKTQNVSQWSEFRPISLCTVL 1363

Query: 1262 CKAISKVLANRLKPLLHKLV 1321
             K ++K+LANRL  +L  ++
Sbjct: 1364 NKIVTKLLANRLSKILPSII 1383


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  206 bits (525), Expect = 1e-50
 Identities = 134/454 (29%), Positives = 231/454 (50%), Gaps = 14/454 (3%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLN---HDEKIEGRMPSKNSI 163
            VYAS   + R  LW ++K    +  +   PW +LGD N TL+   H +     M +   +
Sbjct: 107  VYASNYVEERKVLWSELKDHYDSPIIRHKPWTLLGDFNETLDIAEHSQSFVHPMVTPG-M 165

Query: 164  EDFRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPS 343
             DF++ +    L D+ A+    TWCN++  G IM KLDRVL+N+ W   F++++S F   
Sbjct: 166  RDFQQVINYCSLTDMAAQGPLFTWCNKREHGLIMKKLDRVLINDCWNQTFSQSYSVFEAG 225

Query: 344  GLSDHSPAVVTISKK--RKICG-RPFKFFNFWADDSEFMTVVQEAWE----IKVSGNPMF 502
            G SDH    ++++ +   K+ G +PFKF N   D  +F  +V   W+    + +S + +F
Sbjct: 226  GCSDHLRCRISLNSEAGNKVQGLKPFKFVNALTDMEDFKPMVSTYWKDTEPLILSTSTLF 285

Query: 503  KLIMKLKNVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIK 682
            +    LK +K  +++ ++++ GN+     +    L   Q     +P +  +  +E A   
Sbjct: 286  RFSKNLKGLKPKIRSMARDRLGNLSKKANEAYKILCAKQHVNLTNPSSMAME-EENAAYS 344

Query: 683  EISFLSKCEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEA 862
                ++  EE   KQKS+++W ++GD +T++FH++   R A N I  I   +G       
Sbjct: 345  RWDRVAILEEKYLKQKSKLHWCQVGDQNTKAFHRAAAAREAHNTIREILSNDGIVKTKGD 404

Query: 863  DIEMEVIRHFKATFGT-PAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQ 1039
            +I+ E  R F+      P        ++ +  L  + ++ D+                F+
Sbjct: 405  EIKAEAERFFREFLQLIPNDFEGVTITELQQLLPVRCSDADQQSLIRPVTAEEIRKVLFR 464

Query: 1040 MMPDKALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPS 1219
            M  DK+ GPDG+++ FF+  W II  +F  A++S    G + + +N+T + LI K     
Sbjct: 465  MPSDKSPGPDGYTSEFFKATWEIIGDEFTLAVQSFFTKGFLPKGINSTILALIPKKTEAR 524

Query: 1220 RLNEFRPISCCNVVCKAISKVLANRLKPLLHKLV 1321
             + ++RPISCCNV+ K ISK++ANRLK +L K +
Sbjct: 525  EMKDYRPISCCNVLYKVISKIIANRLKLVLPKFI 558


>ref|XP_004298219.1| PREDICTED: uncharacterized protein LOC101304768 [Fragaria vesca
            subsp. vesca]
          Length = 1687

 Score =  205 bits (521), Expect = 4e-50
 Identities = 143/456 (31%), Positives = 223/456 (48%), Gaps = 17/456 (3%)
 Frame = +2

Query: 5    YASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFRECL 184
            Y + D+Q+R   W  ++ IA ++  PW+V GD N  L   +K  G    +  I  FRE +
Sbjct: 106  YGNPDSQLRHFSWDLLRRIAKSVRGPWIVFGDFNELLCIGDKRGGGERPEAQIRRFREAV 165

Query: 185  FEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHSP 364
             E  LQ+++      TW      G ++ +LDR  +N E    F     + +  G SDH  
Sbjct: 166  DECGLQEVEFSGPTFTWKR----GTLLERLDRCFINEEAGVLFPRFHEAHVDVGASDHLS 221

Query: 365  AVVTISKKRKICGRP--------FKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKL 520
             V  +  +   CGR         F+F  FWA + E   VV +AW+    GN +  +  KL
Sbjct: 222  LV--LFSEGLNCGRKGGWKGLRRFQFEPFWAKEQESKQVVADAWQS--DGNQLNNVRAKL 277

Query: 521  KNVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNK--DLAADER-AIIKEIS 691
              V  +L+ W++NKFG +   I+ L   L        + PF+   ++  + R AI+ E++
Sbjct: 278  AGVSKELQRWNENKFGLIPKKIRQLNKELE-------QCPFDSSDEVVQNRRNAIVAELN 330

Query: 692  FLSKCEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIE 871
               + EES  +Q+SR+NWL+ GD +T+ FH   K R  +N++  I    GE++  E +I+
Sbjct: 331  KSLEIEESIWRQRSRINWLQEGDRNTKFFHGFAKGRGRKNRVLGIMSSTGEWIEQETEIQ 390

Query: 872  MEVIRHFKATFGTPAKCN------TSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXX 1033
                 HF   F T   C+       +V  +  DD++ +LN+                   
Sbjct: 391  QAFNTHFSQLF-TSEGCDHMELVLDTVQRKVTDDMNAKLNKPFTKLDIDEALK------- 442

Query: 1034 FQMMPDKALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEM 1213
             QM PDK+ G DGFSA F+Q  W I+  +        L  G   +++N T + LI K+E 
Sbjct: 443  -QMGPDKSPGEDGFSARFYQAYWEIVGDEVSNRCLQVLNEGASVKDLNHTLLALIPKIEN 501

Query: 1214 PSRLNEFRPISCCNVVCKAISKVLANRLKPLLHKLV 1321
            P  + +FRPIS CNV+ K ISK + NR+K LL +++
Sbjct: 502  PQGVADFRPISLCNVLYKLISKAMVNRMKVLLPEVI 537


>ref|XP_004250606.1| PREDICTED: uncharacterized protein LOC101247390 [Solanum
            lycopersicum]
          Length = 612

 Score =  205 bits (521), Expect = 4e-50
 Identities = 125/441 (28%), Positives = 211/441 (47%), Gaps = 1/441 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            +YA     +R  LW  + + A+  T PW  +GD NV  + DEK+ G   +     DF   
Sbjct: 77   IYAKCKEYLRRPLWDKLLHHASVSTNPWCAVGDYNVIFDVDEKLGGLPYNMRKSMDFIAL 136

Query: 182  LFEARLQDLKAERCHLTWCNRQ-IEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDH 358
            +    L D+       TW N++    RI  +LDR LVN+ W+++  +   + L +  SDH
Sbjct: 137  IEACGLVDIGFSGHRFTWSNKRGFNNRIWKRLDRALVNDLWLEKMPQTTITHLSTTGSDH 196

Query: 359  SPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLD 538
             P ++ +        + F+F N W D+  FM  V+  W+  + GN M+K   K+K +   
Sbjct: 197  CPYLLEMVSTEVDRIKYFRFLNCWVDNPNFMLTVKNCWDRPMEGNAMWKFHQKMKRLSNT 256

Query: 539  LKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKCEESA 718
            L  WS+N+FG++   ++  +  ++  + N      + +         + I FL K E++ 
Sbjct: 257  LSVWSRNEFGDIFQKVRMYEEQVHEAEENYIRDQTDSNRITLHELNAQYIKFL-KIEDTI 315

Query: 719  AKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHFKA 898
             KQK+++   K GD++ + FH  ++ RR +  I  I  ENG+++  E +I      HF A
Sbjct: 316  LKQKTQLQLFKDGDTNFKYFHSIIRARRRKLFIHKIITENGDWIQGENNIAQNACDHFNA 375

Query: 899  TFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDGFS 1078
             F +    N  +  Q  + +   +N++                  F M P+ A GPDG +
Sbjct: 376  IFTSE---NKHINEQNLECIPRMVNKDQNTQLTKLPDMDELKEVVFSMNPNSAAGPDGMN 432

Query: 1079 ACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCCNV 1258
              FF++   II  D ++ +        I +  + + I L+ KV   ++L EFRPIS  N 
Sbjct: 433  GYFFKKCLNIIKNDLVEVLHPFFSGQMIPKYFSHSCIVLLPKVNNTNKLTEFRPISLSNF 492

Query: 1259 VCKAISKVLANRLKPLLHKLV 1321
              K ISK+++NRL P+L  L+
Sbjct: 493  TSKIISKLVSNRLSPILLSLI 513


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
            [Arabidopsis thaliana]
          Length = 1164

 Score =  205 bits (521), Expect = 4e-50
 Identities = 135/450 (30%), Positives = 221/450 (49%), Gaps = 14/450 (3%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTV---PWMVLGDMNVTLNHDEKIEGRMPSKNSIED- 169
            VYASTD   R  LW +I   +    V   PW VLGD N  L+         PS++S  D 
Sbjct: 6    VYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILH---------PSEHSTSDG 56

Query: 170  ---------FRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEA 322
                     FRE +  A L DL       TW N++    +  KLDR+LVN++W   F  +
Sbjct: 57   FNVDRPTRIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSS 116

Query: 323  FSSFLPSGLSDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPM 499
               F     SDHS   +++        +PF+F NF   D  F++++   W    V+G+ M
Sbjct: 117  LGLFGEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAM 176

Query: 500  FKLIMKLKNVKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAII 679
            +++ +KLK +K  ++ +S++ + +++   K+   +L   Q  +  SP   + AA E    
Sbjct: 177  YRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSN-AAIEAETQ 235

Query: 680  KEISFLSKCEESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADE 859
            ++   L++ E S   Q+SRVNWL+ GD ++  FH+    R++ N I  +    G+ +  +
Sbjct: 236  RKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRIEGQ 295

Query: 860  ADIEMEVIRHFKATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQ 1039
             ++E   + +F++  G+         +   + L ++ +   ++               F 
Sbjct: 296  QNLENHCVEYFQSNLGSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFS 355

Query: 1040 MMPDKALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPS 1219
            +  +KA GPDGFS  FF   W II  +  +AI     +GK+ ++ NAT + LI K+   S
Sbjct: 356  LPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNAS 415

Query: 1220 RLNEFRPISCCNVVCKAISKVLANRLKPLL 1309
             +++FRPISC N V K ISK+L +RLK  L
Sbjct: 416  SMSDFRPISCLNTVYKVISKLLTDRLKDFL 445


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  203 bits (516), Expect = 2e-49
 Identities = 125/443 (28%), Positives = 217/443 (48%), Gaps = 3/443 (0%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIATTMTVPWMVLGDMNVTLNHDEKIEGRMPSKNSIEDFREC 181
            VYA      R  LW  ++ +A  + VPW+V GD N+ L  +E++ G  P + ++EDF   
Sbjct: 1158 VYAKCTRSERTLLWDCLRRLAADIEVPWLVGGDFNIILKREERLYGSAPHEGAMEDFAST 1217

Query: 182  LFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGLSDHS 361
            L +  L D   E    TW N     R+  +LDR++ N+ WI++F       L    SDH 
Sbjct: 1218 LLDCGLLDGGFEGNPFTWTNN----RMFQRLDRIVYNHHWINKFPITRIQHLNRDGSDHC 1273

Query: 362  PAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAWEIKVSGNPMFKLIMKLKNVKLDL 541
            P +++     +     F+F + W    +F T V+  W + ++G+ +     K   +K  L
Sbjct: 1274 PLLISCFNSSEKAPSSFRFQHAWVLHHDFKTSVESNWNLPINGSGLQAFWSKQHRLKQHL 1333

Query: 542  KTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSK---CEE 712
            K W+K  FG++ + +K+ +  +   ++       N+        + K  + L+K    EE
Sbjct: 1334 KWWNKVMFGDIFSKLKEAEKRVEECEI----LHQNEQTVESIIKLNKSYAQLNKQLNIEE 1389

Query: 713  SAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIRHF 892
               KQKS V W+  G+ +T+ FH  M+++R R+ I  ++  +G ++ D+  ++   I++F
Sbjct: 1390 IFWKQKSGVKWVVEGERNTKFFHTRMQKKRIRSHIFKVQEPDGRWIEDQEQLKQSAIKYF 1449

Query: 893  KATFGTPAKCNTSVFSQ*RDDLHHQLNEEDRMXXXXXXXXXXXXXXXFQMMPDKALGPDG 1072
             +       C+ S F   R  +   ++  +                 F + P+ A GPDG
Sbjct: 1450 SSLLKFEP-CDDSRFQ--RSLIPSIISNSENELLCAEPNLQEVKDAVFGIDPESAAGPDG 1506

Query: 1073 FSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEFRPISCC 1252
            FS+ F+Q+ W II  D L A++       I R V +T + L+ K    S+ ++FRPIS C
Sbjct: 1507 FSSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLC 1566

Query: 1253 NVVCKAISKVLANRLKPLLHKLV 1321
             V+ K I+K+L+NRL  +L  ++
Sbjct: 1567 TVMNKIITKLLSNRLAKILPSII 1589


>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
            putative protein [Arabidopsis thaliana]
          Length = 1141

 Score =  202 bits (515), Expect = 2e-49
 Identities = 135/445 (30%), Positives = 214/445 (48%), Gaps = 9/445 (2%)
 Frame = +2

Query: 2    VYASTDAQVRWELWRDIKYIAT---TMTVPWMVLGDMNVTLN-HDEKIEGRMPSKNSIED 169
            VYA+ +   R ELWR+I  +     T   PW++LGD N  L+ H+      +     I D
Sbjct: 101  VYAANEDDKRKELWREITALVASPVTFNRPWILLGDFNQVLHPHEHSRHVSLNVDRRIRD 160

Query: 170  FRECLFEARLQDLKAERCHLTWCNRQIEGRIMSKLDRVLVNNEWIDEFNEAFSSFLPSGL 349
            FRECL +A L DL  +    TW N+     +  K+DR+LVN  W + F  +F  F P   
Sbjct: 161  FRECLLDAELSDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPDF 220

Query: 350  SDHSPAVVTISKKRKICGRPFKFFNFWADDSEFMTVVQEAW-EIKVSGNPMFKLIMKLKN 526
            SDH+   V +        RPFKFFNF   + EF+ +V + W    V G+ MF++  KLK 
Sbjct: 221  SDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKA 280

Query: 527  VKLDLKTWSKNKFGNMDNSIKDLKSSLNNLQMNI*ESPFNKDLAADERAIIKEISFLSKC 706
            +K  +K +S+  + N++   ++   +L + Q    ++P + + AA E    ++   L+  
Sbjct: 281  LKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNLTLDNP-SLENAAHELEAQRKWQILATA 339

Query: 707  EESAAKQKSRVNWLKLGDSHTRSFHQSMKQRRARNKITSIKLENGEYLADEADIEMEVIR 886
            EES  +Q+SRV W   GD +TR FH+    R++ N IT++  ++G     + D +  +  
Sbjct: 340  EESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSG----TQIDSQQGIAD 395

Query: 887  HFKATFGTPAKCNTSVFSQ*RDDLH----HQLNEEDRMXXXXXXXXXXXXXXXFQMMPDK 1054
            H    F      +   +S  +DD++    ++                      F +  +K
Sbjct: 396  HCALYFENLLSDDNDPYSLEQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNK 455

Query: 1055 ALGPDGFSACFFQRAWIIINRDFLKAIKSTL*NGKIFREVNATAITLISKVEMPSRLNEF 1234
            A GPDGF      R + I              +G + ++ NAT I LI K    S  ++F
Sbjct: 456  ACGPDGFPVTAAVREFFI--------------SGNLLKQWNATTIVLIPKFPNASCTSDF 501

Query: 1235 RPISCCNVVCKAISKVLANRLKPLL 1309
            RPISC N + K I+++L +RL+ LL
Sbjct: 502  RPISCMNTLYKVIARLLTDRLQKLL 526

