BLASTX nr result

ID: Catharanthus22_contig00036109 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00036109
         (696 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   113   5e-23
ref|XP_006480844.1| PREDICTED: putative ribonuclease H protein A...    84   7e-23
gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis ...    77   4e-22
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...   105   1e-20
gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas...    65   2e-20
gb|AAD37021.1| putative non-LTR retrolelement reverse transcript...   103   4e-20
gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana]               67   7e-19
gb|EMJ14411.1| hypothetical protein PRUPE_ppb013620mg [Prunus pe...   100   7e-19
ref|XP_006490008.1| PREDICTED: uncharacterized protein LOC102624...    98   2e-18
gb|EMJ13914.1| hypothetical protein PRUPE_ppa018769mg, partial [...    97   4e-18
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]              95   2e-17
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]              95   2e-17
dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]              93   7e-17
gb|EOY32757.1| Uncharacterized protein TCM_040787 [Theobroma cacao]    78   8e-17
ref|XP_004305437.1| PREDICTED: uncharacterized protein LOC101296...    90   8e-16
ref|XP_004301578.1| PREDICTED: uncharacterized protein LOC101313...    89   1e-15
gb|EOY02376.1| LINE-type retrotransposon LIb DNA, Insertion at t...    87   7e-15
gb|EMJ21003.1| hypothetical protein PRUPE_ppa026469mg, partial [...    86   9e-15
ref|XP_002452318.1| hypothetical protein SORBIDRAFT_04g023610 [S...    45   2e-14
emb|CAB10337.1| reverse transcriptase like protein [Arabidopsis ...    83   7e-14

>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score =  113 bits (283), Expect = 5e-23
 Identities = 76/232 (32%), Positives = 118/232 (50%), Gaps = 1/232 (0%)
 Frame = +3

Query: 3    PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
            P+Q SF+  RQ +DN+II+ E+ HS+R+K GKKG   +K+D + +    R  F  E+   
Sbjct: 542  PTQCSFVPNRQITDNVIIVQEMFHSMRNKQGKKGFMAVKIDFEKAYDRLRWTFIRESLME 601

Query: 183  SGI*KE-VERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRPALLLFIFVMYGATWQSY 359
              I +  V+ V + +   N  I    + +  +C   GL    P       +         
Sbjct: 602  LRIPQHLVDIVMNCVSSANLQILWNGEPMEKICPTRGLRQGDPLSPYLYVICMERLAHLI 661

Query: 360  *PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKIN 539
                  G WKPVK SR+G  IS++  ADDLI+F EA  +QA +M   +  FC  +  K+N
Sbjct: 662  DQEVTNGNWKPVKASRNGPPISNLAFADDLILFSEASVEQAQVMKWCLDRFCEASGSKVN 721

Query: 540  I*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLYEF 695
              KSK++ S NT   +   + +   +  + D G YLG+P I+GR S+  Y++
Sbjct: 722  EDKSKIYFSANTHLDIRDAVCNTLAMEATADFGKYLGVPTINGRSSKREYQY 773


>ref|XP_006480844.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Citrus
           sinensis]
          Length = 768

 Score = 83.6 bits (205), Expect(2) = 7e-23
 Identities = 40/103 (38%), Positives = 62/103 (60%)
 Frame = +3

Query: 384 WKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKLFV 563
           WKP++ SR G  +SH+F  DDL++F EA   QA  +  ++ DFC  +  K+N  K+ ++ 
Sbjct: 94  WKPIRLSRLGTPLSHLFFTDDLLLFAEATSGQAQCINSVLGDFCLSSGTKVNQSKTHVYF 153

Query: 564 S*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLYE 692
           S N    VA ++    G  ++ DLG YLGMP++H R S+  Y+
Sbjct: 154 SKNVPDAVATRIWRDLGYTVTKDLGKYLGMPLLHSRVSQQTYQ 196



 Score = 50.4 bits (119), Expect(2) = 7e-23
 Identities = 29/89 (32%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
 Frame = +1

Query: 136 AYDRVD*NFLEKILKAVGFEKKLRELILFCIIKTKLS-KWNSNKLESFALQRGL*QGDLH 312
           AYDR+  NF+ + L  +     L +LI+ CI  T ++  W+    + F+  RG+ QGD  
Sbjct: 10  AYDRLSWNFIYETLTELALPIGLIQLIMECITSTSMNILWHGELTDDFSPSRGVRQGDPL 69

Query: 313 SSCLFLLCMELLGKAISRVVKKDVGNQLR 399
           S  +F+LC+E L   I + + +D    +R
Sbjct: 70  SPYIFVLCVERLSHGIYQSIHQDHWKPIR 98


>gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis cebennensis]
          Length = 799

 Score = 77.4 bits (189), Expect(3) = 4e-22
 Identities = 39/102 (38%), Positives = 64/102 (62%)
 Frame = +3

Query: 384 WKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKLFV 563
           WKP+  SR G  +SHI  ADDLI+F EA   Q  ++ ++++ FC  + QK+++ KSK+F 
Sbjct: 119 WKPISMSRGGPLLSHICFADDLILFAEASVAQIRVVRKVLEKFCIASGQKVSLEKSKIFF 178

Query: 564 S*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
           S N    + K +  + GI  + +LG YLGMP++  R +++ +
Sbjct: 179 SQNVHRDLEKFISDESGIKSTKELGKYLGMPVLQKRINKDTF 220



 Score = 38.9 bits (89), Expect(3) = 4e-22
 Identities = 21/41 (51%), Positives = 23/41 (56%)
 Frame = +1

Query: 250 WNSNKLESFALQRGL*QGDLHSSCLFLLCMELLGKAISRVV 372
           WN  K ESF   RGL QGD  S  LF+LC+E L   I   V
Sbjct: 74  WNGEKTESFIPSRGLRQGDPLSPYLFVLCLERLCHQIDLAV 114



 Score = 35.0 bits (79), Expect(3) = 4e-22
 Identities = 18/45 (40%), Positives = 29/45 (64%)
 Frame = +3

Query: 3   PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDS 137
           P+Q+SFI G  ++DN++++ E +HS+R    KKG T    +L  S
Sbjct: 15  PTQSSFIPGWLSADNIVVVQEAVHSMRR---KKGHTLHAANLPSS 56


>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1231

 Score =  105 bits (262), Expect = 1e-20
 Identities = 68/230 (29%), Positives = 118/230 (51%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
            P+QASFI GR + DN++++ E +HS+R K G+KG   +K+DL+ +    R +F  E  + 
Sbjct: 399  PAQASFIPGRLSIDNIVLVQEAVHSMRRKKGRKGWMLLKLDLEKAYDRVRWDFLQETLEA 458

Query: 183  SGI*KE-VERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRPALLLFIFVMYGATWQSY 359
            +G+ +    R+ + +   +  +    ++        GL    P       +         
Sbjct: 459  AGLSEGWTSRIMAGVTDPSMSVLWNGERTDSFVPARGLRQGDPLSPYLFVLCLERLCHLI 518

Query: 360  *PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKIN 539
              S  K  WKP+  S  G  +SH+  ADDLI+F EA   Q  ++  +++ FC  + QK++
Sbjct: 519  EASVGKREWKPIAVSCGGSKLSHVCFADDLILFAEASVAQIRIIRRVLERFCEASGQKVS 578

Query: 540  I*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
            + KSK+F S N    + + +  + GI  + +LG YLGMPI+  R ++  +
Sbjct: 579  LEKSKIFFSHNVSREMEQLISEESGIGCTKELGKYLGMPILQKRMNKETF 628


>gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase);
           Polynucleotidyl transferase, Ribonuclease H fold
           [Medicago truncatula]
          Length = 729

 Score = 64.7 bits (156), Expect(2) = 2e-20
 Identities = 37/99 (37%), Positives = 54/99 (54%)
 Frame = +3

Query: 384 WKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKLFV 563
           WKP++  R G  ISH+  ADDL++F EA  +QA  +   +  FC  + QKIN  K++++ 
Sbjct: 94  WKPMRAGRYGPPISHLLFADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINREKTQVYF 153

Query: 564 S*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSR 680
           S N +  + + +    G      LG YLG  I  GR SR
Sbjct: 154 SKNVDNHLREDIIQHTGFNQVNSLGKYLGANITPGRTSR 192



 Score = 60.8 bits (146), Expect(2) = 2e-20
 Identities = 37/90 (41%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
 Frame = +1

Query: 136 AYDRVD*NFLEKILKAVGFEKKLRELILFCIIKTKLS-KWNSNKLESFALQRGL*QGDLH 312
           AYD ++ NF+E+ LK   F  KL  +I  CI        WN +K ESF   RG+ QGD  
Sbjct: 10  AYDLLNWNFVEECLKECKFPSKLINIIHHCISTPSYKIMWNGDKSESFYPSRGIRQGDPL 69

Query: 313 SSCLFLLCMELLGKAISRVVKKDVGNQLRA 402
           S  LF++CME L   I+  V+ D    +RA
Sbjct: 70  SPYLFVICMERLSHIIADQVEADYWKPMRA 99


>gb|AAD37021.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 732

 Score =  103 bits (258), Expect = 4e-20
 Identities = 71/229 (31%), Positives = 117/229 (51%)
 Frame = +3

Query: 3   PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
           P+QASFISGR A+DN++I+ E +HS+R K G+KG   +K+DL+ +    R EF  +  + 
Sbjct: 96  PAQASFISGRLAADNIVIMQEAVHSMRRKKGRKGWMLLKLDLEKAYDRIRWEFLEDTLRA 155

Query: 183 SGI*KEVERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRPALLLFIFVMYGATWQSY* 362
                 V     ++    Q +                    P++ L         W  + 
Sbjct: 156 ------VRLPEKWIVWIMQCV------------------TEPSMSLL--------WNEH- 182

Query: 363 PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI 542
            S  +  WKP+  S+ G  +SHI  ADDLI+F EA   Q  ++  +++ FC  + QK+++
Sbjct: 183 -SIARKDWKPISLSQGGPKLSHICFADDLILFAEASVAQIRVIRRVLERFCVASGQKVSL 241

Query: 543 *KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
            KSK+F S N    + K +  + GI  + +LG YLGMP++  R +++ +
Sbjct: 242 EKSKIFFSENVSRDLGKLISDESGISSTRELGKYLGMPVLQRRINKDTF 290


>gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana]
          Length = 1055

 Score = 66.6 bits (161), Expect(2) = 7e-19
 Identities = 33/86 (38%), Positives = 52/86 (60%)
 Frame = +3

Query: 381 RWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKLF 560
           +WKP+  SR G  +SHI  ADDLI+F EA  +Q  ++  +++ FC  + QK+++ KSK+F
Sbjct: 103 QWKPINLSRGGPKLSHICFADDLILFAEASVEQVQIVRRVLEAFCTASGQKVSLEKSKIF 162

Query: 561 VS*NTEPWVAKKLHHKFGIPLSIDLG 638
            S N    + K +  + GI  + D G
Sbjct: 163 FSKNVSRELGKLISDESGIQSTCDWG 188



 Score = 53.9 bits (128), Expect(2) = 7e-19
 Identities = 34/80 (42%), Positives = 42/80 (52%), Gaps = 1/80 (1%)
 Frame = +1

Query: 136 AYDRVD*NFLEKILKAVGFEKKLRELILFCIIKTKLSK-WNSNKLESFALQRGL*QGDLH 312
           AYDR+  +FL   L A GF +     I+ C+    +S  WN  K   F   RGL QGDL 
Sbjct: 20  AYDRIRWDFLSDTLVAAGFSEVWVTWIMQCVSGPDMSLLWNGEKTTPFKPLRGLRQGDLL 79

Query: 313 SSCLFLLCMELLGKAISRVV 372
           S  LF+LCME L   I R +
Sbjct: 80  SPYLFVLCMERLCHLIERSI 99


>gb|EMJ14411.1| hypothetical protein PRUPE_ppb013620mg [Prunus persica]
          Length = 993

 Score = 99.8 bits (247), Expect = 7e-19
 Identities = 78/233 (33%), Positives = 120/233 (51%), Gaps = 7/233 (3%)
 Frame = +3

Query: 12   ASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKGSGI 191
            +SF+ GR  +DN++I  E++H  +   GKK     K+DL  +    RL +    F  S +
Sbjct: 753  SSFVPGRHITDNIMIAQELMHKFKLAKGKKRMFAWKIDLSKAY--DRLNW---GFIESVL 807

Query: 192  *KEVERVNSFLYH*NQII*ME*QQI---GVLC---TPTGLVTRRPALLLFIFVMYGATWQ 353
              EV   NSF+    Q +     QI   G L    +P   + +   L  ++FV+      
Sbjct: 808  -LEVGLPNSFIQLIMQCVSTMRYQICINGELTDPFSPGNGIRQGDPLSPYLFVLCIEKLS 866

Query: 354  SY*PSS*KGR-WKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQ 530
                 + K + WKP+K SR+G  +SH+F ADDL +F EA   QA +M   +  FC  + Q
Sbjct: 867  HIIVDAVKRKLWKPIKTSRNGPSVSHLFFADDLALFAEATPCQARVMKNCLDLFCSASGQ 926

Query: 531  KINI*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
             +N  KS +F S NT   VAK++    G P++ +LG YLG+P++H R ++  Y
Sbjct: 927  AVNFAKSVIFCSPNTCKMVAKEIGAICGSPITENLGKYLGLPLLHSRVTKVTY 979


>ref|XP_006490008.1| PREDICTED: uncharacterized protein LOC102624085 [Citrus sinensis]
          Length = 1635

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 71/237 (29%), Positives = 117/237 (49%), Gaps = 7/237 (2%)
 Frame = +3

Query: 3    PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
            P Q SF+ GR  ++N+I+  EIIHS+R K G+KG   IKVDL  +       F  E  + 
Sbjct: 1032 PHQTSFVPGRHITENIIVAQEIIHSMRRKKGRKGFMAIKVDLGKAYDRLSWTFIQETLQE 1091

Query: 183  SGI*KE-VERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRPALLLFIFVM------YG 341
              +    +  +   +      +    +     C   G+    P L  +IFV+      +G
Sbjct: 1092 LNLPTMLINLIMECITTATMNVLWNGELSSEFCPGRGVRQGDP-LSPYIFVLCIERLSHG 1150

Query: 342  ATWQSY*PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGV 521
             +      S  +G WKP++ +R G  +SH+F ADDL+   EA  +QA ++ +II +F   
Sbjct: 1151 IS-----RSIQQGHWKPIRLARMGTPLSHLFFADDLLFLSEASSQQAIIINKIIDEFSAS 1205

Query: 522  NEQKINI*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLYE 692
            +  K+N  K+ ++ S N     A ++    G  ++ +LG YLG+P+ H R S+  Y+
Sbjct: 1206 SGAKVNKSKTLVYFSANISAMEASRIGSDLGYSVTDNLGKYLGVPLCHSRISKQTYQ 1262


>gb|EMJ13914.1| hypothetical protein PRUPE_ppa018769mg, partial [Prunus persica]
          Length = 387

 Score = 97.4 bits (241), Expect = 4e-18
 Identities = 49/105 (46%), Positives = 68/105 (64%)
 Frame = +3

Query: 375 KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSK 554
           K RWK VK S SG  +SH+F ADDL++F EA  KQA +M + ++ FC V+ Q +N  KS 
Sbjct: 36  KKRWKCVKSSHSGPCVSHLFFADDLVLFAEASTKQAQIMRDCLEKFCSVSGQAVNFDKSA 95

Query: 555 LFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
           +F S NT   +A+ L    G PL+ +LG YLGMPI+H +  ++ Y
Sbjct: 96  IFCSPNTGNVLAQDLSRICGSPLTANLGNYLGMPILHNKVCKDTY 140


>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 70/239 (29%), Positives = 115/239 (48%), Gaps = 10/239 (4%)
 Frame = +3

Query: 3    PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
            P Q SF+ GR   DN+I+  E++HS+ +   KK    +KVDL+ +      ++  E  + 
Sbjct: 540  PHQNSFLPGRSTMDNVILTQEVVHSMNNPRRKKKQMILKVDLQKAYDSVSWDYLEETLED 599

Query: 183  SGI*KEVERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRP----------ALLLFIFV 332
             G  + +  ++  L+       ++   + +L     L   +P          A  LF  V
Sbjct: 600  FGFPRRL--IDLILFS------LQESSLAILWNGGRLPPFKPGRGLRQGDPLAPYLFNLV 651

Query: 333  MYGATWQSY*PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDF 512
            M           + +  WKPV  +R G GISH+F ADDL++F EA + QA +M + +  F
Sbjct: 652  MERLAHDIQTRVNAR-TWKPVHITRGGTGISHLFFADDLMLFGEASEHQAQIMFDCLDSF 710

Query: 513  CGVNEQKINI*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
               +  K+N  KS LF S N    + + +     +P++  LG YLG+P++  R SRN +
Sbjct: 711  SNASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPMLKERVSRNTF 769


>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
          Length = 1366

 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 70/239 (29%), Positives = 115/239 (48%), Gaps = 10/239 (4%)
 Frame = +3

Query: 3    PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
            P Q SF+ GR   DN+I+  E++HS+ +   KK    +KVDL+ +      ++  E  + 
Sbjct: 540  PHQNSFLPGRSTMDNVILTQEVVHSMNNPRRKKKQMILKVDLQKAYDSVSWDYLEETLED 599

Query: 183  SGI*KEVERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRP----------ALLLFIFV 332
             G  + +  ++  L+       ++   + +L     L   +P          A  LF  V
Sbjct: 600  FGFPRRL--IDLILFS------LQESSLAILWNGGRLPPFKPGRGLRQGDPLAPYLFNLV 651

Query: 333  MYGATWQSY*PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDF 512
            M           + +  WKPV  +R G GISH+F ADDL++F EA + QA +M + +  F
Sbjct: 652  MERLAHDIQTRVNAR-TWKPVHITRGGTGISHLFFADDLMLFGEASEHQAQIMFDCLDSF 710

Query: 513  CGVNEQKINI*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
               +  K+N  KS LF S N    + + +     +P++  LG YLG+P++  R SRN +
Sbjct: 711  SNASGLKVNFSKSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPMLKERVSRNTF 769


>dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]
          Length = 1898

 Score = 93.2 bits (230), Expect = 7e-17
 Identities = 66/230 (28%), Positives = 112/230 (48%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
            P Q SF+ GR   DN+I+  E++HS+ +   KK    +KVDL+ +      ++  E  + 
Sbjct: 1072 PHQNSFLPGRSTMDNVILTQEVVHSMNNPRRKKKQMILKVDLQKAYDSVSWDYLEETLED 1131

Query: 183  SGI*KEVERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRPALLLFIFVMYGATWQSY* 362
             G  + +  +  F    + +  +          P   + +   L+ ++F +         
Sbjct: 1132 FGFPRRLIDLILFSLQESSLAILWNGGRPPPFKPGRGLRQGDPLVPYLFNLVMERLAHDI 1191

Query: 363  PSS*KGR-WKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKIN 539
             +    R WKPV  +R G GISH+F ADDL++F EA + QA +M + +  F   +  K+N
Sbjct: 1192 QTRVNARTWKPVHITRGGTGISHLFFADDLMLFGEASEHQAQIMFDCLDSFSDASGLKVN 1251

Query: 540  I*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
              KS LF S N    + + +     +P++  LG YLG+P++  R SRN +
Sbjct: 1252 FSKSLLFCSSNVNAGLKRAIGSILQVPVAESLGTYLGIPMLKERVSRNTF 1301


>gb|EOY32757.1| Uncharacterized protein TCM_040787 [Theobroma cacao]
          Length = 178

 Score = 78.2 bits (191), Expect(2) = 8e-17
 Identities = 40/104 (38%), Positives = 60/104 (57%)
 Frame = +3

Query: 378 GRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKL 557
           G WKP+  +  G  ++H+  ADDL++F EA  KQ   +  ++  FC  + QK+++ KS++
Sbjct: 62  GNWKPLVVTTRGPYLTHVCFADDLMLFGEASVKQVQTIMRVLDKFCLASGQKVSLEKSRM 121

Query: 558 FVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
            VS N     A+ L     IPL+ D G YLG P+IHGR  +  Y
Sbjct: 122 LVSSNVPLSKARVLSSDAKIPLTKDFGKYLGSPVIHGRVLKTTY 165



 Score = 35.4 bits (80), Expect(2) = 8e-17
 Identities = 17/41 (41%), Positives = 24/41 (58%)
 Frame = +1

Query: 250 WNSNKLESFALQRGL*QGDLHSSCLFLLCMELLGKAISRVV 372
           WN    E+F   RG+ QGD  S  LF+LC+E L + ++  V
Sbjct: 19  WNGIPTETFIPTRGIRQGDPLSPYLFVLCLETLSQLVNEEV 59


>ref|XP_004305437.1| PREDICTED: uncharacterized protein LOC101296313 [Fragaria vesca
           subsp. vesca]
          Length = 449

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 46/104 (44%), Positives = 65/104 (62%)
 Frame = +3

Query: 378 GRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKL 557
           G WK VK S+SG  I H+F ADDLI+F EA  +Q SL+   + +FC ++ Q ++  KS +
Sbjct: 231 GYWKAVKASQSGPKILHLFFADDLILFVEASSQQTSLLKTCLDNFCALSRQTVSFEKSLV 290

Query: 558 FVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
           F S NT    A  + +  G PL+ DLG YLGMP+I+ R ++  Y
Sbjct: 291 FCSPNTSKSTASLISNVCGSPLTCDLGKYLGMPLIYDRVNKCTY 334


>ref|XP_004301578.1| PREDICTED: uncharacterized protein LOC101313223 [Fragaria vesca
           subsp. vesca]
          Length = 543

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 45/105 (42%), Positives = 65/105 (61%)
 Frame = +3

Query: 378 GRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKL 557
           G WK V  S+SG  ISH+F  DDL++F EA + QA  +   + +FC ++ Q I+  KS +
Sbjct: 237 GHWKSVNASQSGPRISHLFFVDDLMLFAEATEHQAYGLKTCLDNFCAISGQIISYEKSLI 296

Query: 558 FVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLYE 692
           F S NT   +A  +    G PL+ DLG YLGMP+IH R +++ Y+
Sbjct: 297 FCSPNTTKTMASSISATCGSPLTSDLGKYLGMPLIHSRVNKHTYD 341


>gb|EOY02376.1| LINE-type retrotransposon LIb DNA, Insertion at the S11 site-like
           protein [Theobroma cacao]
          Length = 620

 Score = 86.7 bits (213), Expect = 7e-15
 Identities = 64/236 (27%), Positives = 117/236 (49%), Gaps = 7/236 (2%)
 Frame = +3

Query: 6   SQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKGS 185
           +QASFI      DN+I++ E++HS   K G++G   +K+DL+ +    R EF  ++   +
Sbjct: 114 TQASFILETHIVDNIIVVQEVVHSFHEKQGRRGWMMVKIDLEKAYDRLRWEFIYDSLVEA 173

Query: 186 GI*KEVERV--NSFLYH*NQII*ME*QQIGVLCT----PTGLVTRRPALLLFIFVM-YGA 344
            I + +  +   S+  H + I+          C     P+  V     L  ++FV+    
Sbjct: 174 QIPENIIDILIRSWNAHSSHIL------WNGTCFEKFFPSRGVRLGDPLAPYLFVLCIEK 227

Query: 345 TWQSY*PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVN 524
                  +  +  WKP++  + G  ++++F  DDLI+  EA + Q  ++  +++DFC   
Sbjct: 228 LAHGIKQAVEQEMWKPIRLGKHGPPLTYLFFMDDLILLAEASESQMEVIKGVLEDFCACL 287

Query: 525 EQKINI*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLYE 692
             K+ I KS  F S N    +  K+    G   S  +G Y+G+P++HGRK+ ++Y+
Sbjct: 288 RGKVCIAKSTFFCSKNVPMELNIKVKDCSGFSYSDSMGKYIGVPLLHGRKTAHIYK 343


>gb|EMJ21003.1| hypothetical protein PRUPE_ppa026469mg, partial [Prunus persica]
          Length = 212

 Score = 86.3 bits (212), Expect = 9e-15
 Identities = 43/104 (41%), Positives = 68/104 (65%)
 Frame = +3

Query: 378 GRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKL 557
           G+WKPVK  ++G  +SH+F+ DDLI+F EA  ++A +M   +  FC  + Q ++  KS +
Sbjct: 1   GKWKPVKSFQTGPIVSHLFLVDDLILFTEASTQRARMMKGCLDLFCQASGQTVSFDKSTV 60

Query: 558 FVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
           F S NT   +A+++   +G PL+ +LG YLGM I+H R +R+ Y
Sbjct: 61  FCSPNTIRALAQEISFIYGSPLTDNLGKYLGMHILHSRVTRSTY 104


>ref|XP_002452318.1| hypothetical protein SORBIDRAFT_04g023610 [Sorghum bicolor]
           gi|241932149|gb|EES05294.1| hypothetical protein
           SORBIDRAFT_04g023610 [Sorghum bicolor]
          Length = 701

 Score = 45.4 bits (106), Expect(3) = 2e-14
 Identities = 27/87 (31%), Positives = 45/87 (51%)
 Frame = +3

Query: 420 ISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKINI*KSKLFVS*NTEPWVAKKL 599
           I  +  ADDLII  +A +++A+ +  I+++FC V+ Q  N+ KS +  S N +      +
Sbjct: 246 IHSLLFADDLIICGQATQEEANKINSILQNFCNVSGQTPNLAKSSIMFSRNADNSSRVAV 305

Query: 600 HHKFGIPLSIDLGLYLGMPIIHGRKSR 680
              F +P      +YLG P+I     R
Sbjct: 306 KSVFPVPDLTPNTIYLGHPLIFNHNDR 332



 Score = 40.0 bits (92), Expect(3) = 2e-14
 Identities = 29/72 (40%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
 Frame = +1

Query: 136 AYDRVD*NFLEKILKAVGFEKKLRELILFCIIKTKLSK-WNSNKLESFALQRGL*QGDLH 312
           A+DR++ NF+ K LK  GF     +LI   I  T LS   N     SF  QRGL QG   
Sbjct: 150 AFDRIEWNFIVKALKRQGFHDHFVDLIYKYISTTTLSVIINGESTPSFHPQRGLRQGCPL 209

Query: 313 SSCLFLLCMELL 348
           S  LF++ +  L
Sbjct: 210 SPYLFIIAVNEL 221



 Score = 39.7 bits (91), Expect(3) = 2e-14
 Identities = 19/42 (45%), Positives = 27/42 (64%)
 Frame = +3

Query: 3   PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDL 128
           PSQ +F+ GR  + N+II  EIIHS   K  K+ +  +K+DL
Sbjct: 106 PSQTAFVQGRYIASNIIIAQEIIHSFNLKSWKQKAFFLKIDL 147


>emb|CAB10337.1| reverse transcriptase like protein [Arabidopsis thaliana]
           gi|7268307|emb|CAB78601.1| reverse transcriptase like
           protein [Arabidopsis thaliana]
          Length = 929

 Score = 83.2 bits (204), Expect = 7e-14
 Identities = 62/230 (26%), Positives = 110/230 (47%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3   PSQASFISGRQASDNMIIL*EIIHSIRSKIGKKGSTTIKVDLKDSL*PSRLEFP*ENFKG 182
           P+QASFI GR + DN++++ E +HS+R K G+KG   +K+DL+ +    R +F  E  + 
Sbjct: 351 PAQASFIPGRLSFDNIVVVQEAVHSMRRKKGRKGWMLLKLDLEKAYDRIRWDFLAETLEA 410

Query: 183 SGI*KE-VERVNSFLYH*NQII*ME*QQIGVLCTPTGLVTRRPALLLFIFVMYGATWQSY 359
           +G+ +  ++R+   +      +    ++        GL    P       +         
Sbjct: 411 AGLSEGWIKRIMECVAGPEMSLLWNGEKTDSFTPERGLRQGDPISPYLFVLCIERLCHQI 470

Query: 360 *PSS*KGRWKPVKGSRSGLGISHIFIADDLIIFEEA*KKQASLMAEIIKDFCGVNEQKIN 539
             +  +G WK +  S+ G  +SH+  ADDLI+F EA                    QK++
Sbjct: 471 ETAVGRGDWKSISISQGGPKVSHVCFADDLILFAEA-----------------SVAQKVS 513

Query: 540 I*KSKLFVS*NTEPWVAKKLHHKFGIPLSIDLGLYLGMPIIHGRKSRNLY 689
           + KSK+F S N    +   +  + GI  + +LG YLGMP++  R +++ +
Sbjct: 514 LEKSKIFFSNNVSRDLEGLITAETGIGSTRELGKYLGMPVLQKRINKDTF 563

