BLASTX nr result

ID: Atropa21_contig00035737 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00035737
         (1217 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605...   149   1e-58
gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]   139   5e-58
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]     143   4e-54
ref|XP_004242076.1| PREDICTED: uncharacterized protein LOC101251...   137   3e-51
ref|XP_006347259.1| PREDICTED: uncharacterized protein LOC102584...   134   9e-50
ref|XP_004253493.1| PREDICTED: uncharacterized protein LOC101265...   134   5e-47
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   110   2e-43
gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao]   109   3e-43
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   110   5e-43
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   110   2e-42
gb|EOX93842.1| Uncharacterized protein TCM_002794 [Theobroma cacao]   106   3e-42
gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus pe...    99   9e-40
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]           101   2e-39
gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]         108   6e-39
ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502...    99   6e-37
gb|EOY19242.1| Uncharacterized protein TCM_044240 [Theobroma cacao]   103   2e-36
ref|XP_004514315.1| PREDICTED: uncharacterized protein LOC101498...    91   2e-35
gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]    94   2e-35
gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobrom...    94   2e-35
gb|EOY21657.1| DNA/RNA polymerases superfamily protein [Theobrom...    94   3e-35

>ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605741 [Solanum tuberosum]
          Length = 823

 Score =  149 bits (377), Expect(2) = 1e-58
 Identities = 79/130 (60%), Positives = 97/130 (74%)
 Frame = -2

Query: 628 MVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISFICARKLV 449
           MVDFDVILG+D  SPYH IL+CHAK VTLA+P +P V +RGS +   K +ISF+ AR  V
Sbjct: 1   MVDFDVILGMDWLSPYHAILNCHAKTVTLAMPGIPIVVWRGSLSHPPKGVISFLKARHFV 60

Query: 448 DKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDLDPVTRPI 269
           ++G LAYL+HIRD  VET  L+SI VV+ F EVFP +L  + PDRDID CID++P T+PI
Sbjct: 61  ERGCLAYLAHIRDTSVETPMLESISVVSEFSEVFPTDLPGLPPDRDIDFCIDIEPGTQPI 120

Query: 268 YIPP*CMALA 239
            IPP  MA A
Sbjct: 121 SIPPYRMAPA 130



 Score =  105 bits (262), Expect(2) = 1e-58
 Identities = 53/74 (71%), Positives = 59/74 (79%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD +M MCI YR LNK+TIRNKY IP IDD+FD LQG SIF KI L
Sbjct: 151 SVSPWGAPVLFVKKKDGSMRMCIDYRQLNKVTIRNKYPIPRIDDLFDQLQGASIFSKIDL 210

Query: 42  RYG*HHLKIRVEDV 1
           R G H LK+RVED+
Sbjct: 211 RSGYHQLKVRVEDI 224


>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score =  139 bits (350), Expect(3) = 5e-58
 Identities = 74/140 (52%), Positives = 99/140 (70%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T  DL++LDMVDFDVILG+D  SPY  +LDC +K VTLAIP +P V ++GS       +
Sbjct: 548 DTRADLILLDMVDFDVILGMDWLSPYRAVLDCFSKTVTLAIPGIPPVVWQGSRGSTPVGV 607

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           ISFI AR+LV  G L+YL+++RD+  E   ++S+ VV  FI+VFP +L  + P+RDID  
Sbjct: 608 ISFIRARRLVASGCLSYLAYVRDVSREVPPVESVPVVRDFIDVFPTDLPGLPPERDIDFP 667

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           I+L+P TRPI IPP  MA A
Sbjct: 668 IELEPGTRPISIPPYRMAPA 687



 Score = 97.8 bits (242), Expect(3) = 5e-58
 Identities = 46/74 (62%), Positives = 57/74 (77%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD TM MCI YR LNK+T++N+Y +P IDD+FD LQG S+F KI L
Sbjct: 708 SVSPWGAPVLFVKKKDGTMRMCIDYRQLNKVTVKNRYPLPRIDDLFDQLQGASVFSKIDL 767

Query: 42  RYG*HHLKIRVEDV 1
           R+  H L+IR  D+
Sbjct: 768 RFDYHQLRIRAADI 781



 Score = 37.0 bits (84), Expect(3) = 5e-58
 Identities = 20/58 (34%), Positives = 28/58 (48%)
 Frame = -3

Query: 951 GGQQGTQSEV*NTHCYAFSGRVETETSDAVIISIISIGHQSATVFFDLGSILACICLF 778
           GG+   Q    ++H YA   R E E SD VI   I +  Q A   FD GS  + + ++
Sbjct: 458 GGRSDGQGRGRHSHFYAAPARAEAEASDDVITGTILLCQQPALALFDPGSTFSYVSVY 515


>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score =  143 bits (360), Expect(2) = 4e-54
 Identities = 75/140 (53%), Positives = 102/140 (72%)
 Frame = -2

Query: 658  NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
            +T +DL++LDMVDFDVILG+D  SPYH +LDC+AK VTLA+P +  V ++G+ +     I
Sbjct: 722  DTRVDLILLDMVDFDVILGMDWLSPYHAVLDCYAKTVTLAMPGISPVLWQGAYSHTPTWI 781

Query: 478  ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
            ISF+ AR+LV  G LAYL+++RD+  + S++ S+ VV  F +VFP +L  + PDRDID  
Sbjct: 782  ISFMRARRLVASGCLAYLAYVRDVSRDDSSVDSVPVVREFADVFPIDLPGLPPDRDIDFA 841

Query: 298  IDLDPVTRPIYIPP*CMALA 239
            IDL+P TRPI IPP  MA A
Sbjct: 842  IDLEPDTRPISIPPYRMAPA 861



 Score = 97.1 bits (240), Expect(2) = 4e-54
 Identities = 46/74 (62%), Positives = 57/74 (77%)
 Frame = -3

Query: 222  SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
            SVS  GAPVLF+KKKD TM MCI YR LNK+T++N+Y +P IDD+FD LQG ++F KI L
Sbjct: 882  SVSPWGAPVLFVKKKDGTMRMCIDYRQLNKVTVKNRYPMPRIDDLFDQLQGAAVFSKIDL 941

Query: 42   RYG*HHLKIRVEDV 1
            R G H L+IR  D+
Sbjct: 942  RSGYHQLRIRAADI 955


>ref|XP_004242076.1| PREDICTED: uncharacterized protein LOC101251787 [Solanum
           lycopersicum]
          Length = 945

 Score =  137 bits (345), Expect(2) = 3e-51
 Identities = 74/134 (55%), Positives = 96/134 (71%)
 Frame = -2

Query: 640 VILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISFICA 461
           VI DM+DFDVILG+D  SPYHV+LDC+AKIVTL++P +P V ++ + +     IISFI A
Sbjct: 475 VISDMIDFDVILGMDWLSPYHVVLDCYAKIVTLSMPGVPPVLWKAAYSHTPTGIISFIRA 534

Query: 460 RKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDLDPV 281
           R LV  G LAYL+HIRD+  E  ++ S+ VV  + +VFP +L  + P+RDID  IDL+P 
Sbjct: 535 RWLVASGCLAYLAHIRDVSREGPSVDSVPVVREYADVFPTDLPGLPPERDIDFAIDLEPG 594

Query: 280 TRPIYIPP*CMALA 239
           TRPI IPP  MA A
Sbjct: 595 TRPISIPPYRMAPA 608



 Score = 93.2 bits (230), Expect(2) = 3e-51
 Identities = 45/74 (60%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+K KD T+ MCI YR LNK+T++N Y +P IDD+FD+LQG +IF KI L
Sbjct: 629 SVSPWGAPVLFVKNKDGTLRMCIDYRQLNKVTLKNCYPMPRIDDLFDHLQGATIFSKIDL 688

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+IR  D+
Sbjct: 689 RSGYHQLRIRAADI 702


>ref|XP_006347259.1| PREDICTED: uncharacterized protein LOC102584611 [Solanum tuberosum]
          Length = 1107

 Score =  134 bits (337), Expect(2) = 9e-50
 Identities = 72/140 (51%), Positives = 100/140 (71%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T +DL++LDMVDFDVILG+D  SPYH +LD +AK VTLA+P +  V ++ + +     I
Sbjct: 116 DTRVDLILLDMVDFDVILGMDWLSPYHAVLDFYAKTVTLAMPGISPVLWQSAYSHTPTGI 175

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           ISF+ AR+LV  G LAYL+++RD+  E S++ S+ VV  F +VFP +L  + P+RDID  
Sbjct: 176 ISFMRARRLVASGCLAYLAYVRDVSREGSSVDSVPVVREFADVFPTDLPGLPPERDIDFS 235

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           I+L+P TRPI IPP  MA A
Sbjct: 236 IELEPGTRPISIPPYRMAPA 255



 Score = 91.3 bits (225), Expect(2) = 9e-50
 Identities = 43/70 (61%), Positives = 55/70 (78%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  G+PVLF+KKKD TM +CI YR LNK+T++N+Y +P IDD+FD LQG ++F KI L
Sbjct: 276 SVSPWGSPVLFVKKKDGTMSLCIDYRQLNKVTVKNRYPMPRIDDLFDQLQGAAVFSKIDL 335

Query: 42  RYG*HHLKIR 13
           R G H L+IR
Sbjct: 336 RSGYHQLRIR 345


>ref|XP_004253493.1| PREDICTED: uncharacterized protein LOC101265119 [Solanum
           lycopersicum]
          Length = 518

 Score =  134 bits (338), Expect(2) = 5e-47
 Identities = 72/144 (50%), Positives = 94/144 (65%)
 Frame = -2

Query: 670 FIRYNT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPN 491
           F+   T +DLVIL M DF VILG+   SP   ILDC+AK VTLA P    + + G    N
Sbjct: 290 FVGSKTSVDLVILAMDDFGVILGMTCLSPQFAILDCNAKTVTLAKPGTDPLVWEGDYTSN 349

Query: 490 SKKIISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRD 311
             +I+SF+ ARK++ KG LA+L+H++D   +   ++S  VV  F++VFP  L  M PDRD
Sbjct: 350 PVRIVSFLRARKMISKGCLAFLAHLKDDTTQVPWIESFSVVREFLDVFPAELPGMPPDRD 409

Query: 310 IDLCIDLDPVTRPIYIPP*CMALA 239
           ID CIDL+P TRPI+IPP  MA A
Sbjct: 410 IDFCIDLEPGTRPIFIPPYRMAPA 433



 Score = 81.6 bits (200), Expect(2) = 5e-47
 Identities = 39/65 (60%), Positives = 47/65 (72%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           S S  GAP+LF+KKKD +  MCI YR LN +TI+NKY +P IDD+FD LQG  +F KI L
Sbjct: 454 SASPWGAPILFVKKKDGSFRMCIDYRQLNTVTIKNKYPLPRIDDLFDQLQGACVFSKIDL 513

Query: 42  RYG*H 28
           R G H
Sbjct: 514 RSGYH 518


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  110 bits (274), Expect(2) = 2e-43
 Identities = 57/140 (40%), Positives = 87/140 (62%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T ++LV+LD +DFDVILG++  SP H  +DC+ K+V    P  P    +G  +     +
Sbjct: 418 DTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNL 477

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS I AR+L+ +G + YL+ ++D   +   +  + VV  F++VFP  L S+ P+R+++ C
Sbjct: 478 ISVISARRLLRQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELPSLPPEREVEFC 537

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P TRPI IPP  MA A
Sbjct: 538 IDLIPDTRPISIPPYRMAPA 557



 Score = 94.0 bits (232), Expect(2) = 2e-43
 Identities = 45/74 (60%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD ++ +CI YR LNK+T++NKY +P IDD+FD LQG   F KI L
Sbjct: 578 SVSPWGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDL 637

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+IR ED+
Sbjct: 638 RSGYHQLRIRNEDI 651


>gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao]
          Length = 1052

 Score =  109 bits (273), Expect(2) = 3e-43
 Identities = 57/140 (40%), Positives = 87/140 (62%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T ++LV+LD +DFDVILG++  SP H  +DC+ K+V    P  P    +G  +     +
Sbjct: 376 DTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNL 435

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS I AR+L+ +G + YL+ ++D   +   +  + VV  F++VFP  L S+ P+R+++ C
Sbjct: 436 ISVISARRLLRQGCMGYLAVLKDSQAKIGDVTQVSVVKEFVDVFPEELPSLPPEREVEFC 495

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P TRPI IPP  MA A
Sbjct: 496 IDLIPDTRPISIPPYRMAPA 515



 Score = 94.0 bits (232), Expect(2) = 3e-43
 Identities = 45/74 (60%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD ++ +CI YR LNK+T++NKY +P IDD+FD LQG   F KI L
Sbjct: 536 SVSPWGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDL 595

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+IR ED+
Sbjct: 596 RSGYHQLRIRNEDI 609


>gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1515

 Score =  110 bits (274), Expect(2) = 5e-43
 Identities = 62/140 (44%), Positives = 86/140 (61%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T +DL+ LDMVDFDVILG+D     +  +DC  ++V    PS P +E+  S      + 
Sbjct: 534 STMVDLIELDMVDFDVILGMDWLHACYASIDCRTRVVKFQFPSEPILEWSSSSAVPKGRF 593

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS++ ARKLV KG + +L+ + D  VE    +S+ +V  F EVFP +L  + P+R+ID  
Sbjct: 594 ISYLKARKLVSKGCIYHLARVNDSSVEIPYFQSVPIVREFPEVFPNDLPGIPPEREIDFG 653

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P TRPI IPP  MA A
Sbjct: 654 IDLIPDTRPISIPPYRMAPA 673



 Score = 92.8 bits (229), Expect(2) = 5e-43
 Identities = 44/74 (59%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF++KKD ++ MCI YR LNK+TI+NKY +P IDD+FD LQG + F KI L
Sbjct: 690 SVSPWGAPVLFVRKKDGSLRMCIDYRQLNKVTIKNKYPLPRIDDLFDQLQGATCFSKIDL 749

Query: 42  RYG*HHLKIRVEDV 1
           R G H L++R  D+
Sbjct: 750 RSGYHQLRVRERDI 763


>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score =  110 bits (274), Expect(2) = 2e-42
 Identities = 62/140 (44%), Positives = 86/140 (61%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T +DL+ LDMVDFDVILG+D     +  +DC  ++V    PS P +E+  S      + 
Sbjct: 540 STMVDLIELDMVDFDVILGMDWLHACYASIDCRTRVVKFQFPSEPILEWSSSSAVPKGRF 599

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS++ ARKLV KG + +L+ + D  VE    +S+ +V  F EVFP +L  + P+R+ID  
Sbjct: 600 ISYLKARKLVSKGCIYHLARVNDSSVEIPYFQSVPIVREFPEVFPDDLPGIPPEREIDFG 659

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P TRPI IPP  MA A
Sbjct: 660 IDLIPDTRPISIPPYRMAPA 679



 Score = 91.3 bits (225), Expect(2) = 2e-42
 Identities = 43/74 (58%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF++KKD ++ +CI YR LNK+TI+NKY +P IDD+FD LQG + F KI L
Sbjct: 696 SVSPWGAPVLFVRKKDGSLRICIDYRQLNKVTIKNKYPLPRIDDLFDQLQGATCFSKIDL 755

Query: 42  RYG*HHLKIRVEDV 1
           R G H L++R  D+
Sbjct: 756 RSGYHQLRVRERDI 769


>gb|EOX93842.1| Uncharacterized protein TCM_002794 [Theobroma cacao]
          Length = 509

 Score =  106 bits (264), Expect(2) = 3e-42
 Identities = 55/140 (39%), Positives = 85/140 (60%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T ++LV+LD +DFDVILG++  SP H  +DC+ K+V    P  P    +G  +     +
Sbjct: 180 DTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNL 239

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS I AR+L+ +G + YL+ ++D   +   +  + VV  F++VFP  L  + P+R+++ C
Sbjct: 240 ISVISARRLLRQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELPGLPPEREVEFC 299

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P  RPI IPP  MA A
Sbjct: 300 IDLIPDIRPISIPPYRMAPA 319



 Score = 94.0 bits (232), Expect(2) = 3e-42
 Identities = 45/74 (60%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD ++ +CI YR LNK+T++NKY +P IDD+FD LQG   F KI L
Sbjct: 340 SVSPWGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDL 399

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+IR ED+
Sbjct: 400 RSGYHQLRIRNEDI 413


>gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica]
          Length = 1493

 Score = 99.0 bits (245), Expect(2) = 9e-40
 Identities = 59/136 (43%), Positives = 78/136 (57%)
 Frame = -2

Query: 646 DLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISFI 467
           +L+ LD+VD D+ILG+D    +H  +DC  K VTL  P  PKV +RG        +IS I
Sbjct: 439 NLIPLDLVDLDIILGMDWLEKHHASVDCFRKEVTLRSPGQPKVTFRGERRVLPTCLISAI 498

Query: 466 CARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDLD 287
            A+KL+ KG+  YL+HI D    T  L+ I VV  F  +FP +L  + P R+I+  ID  
Sbjct: 499 TAKKLLKKGYEGYLAHIIDTREITLNLEDIPVVCEFPNIFPDDLPGLPPKREIEFTIDFL 558

Query: 286 PVTRPIYIPP*CMALA 239
           P T PIY  P  MA A
Sbjct: 559 PGTNPIYQTPYRMAPA 574



 Score = 93.2 bits (230), Expect(2) = 9e-40
 Identities = 45/74 (60%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF++K+D TM +CI YR LNK+TIRN+Y +P IDD+FD L+G   F KI L
Sbjct: 595 SVSPWGAPVLFVRKQDGTMRLCIDYRQLNKVTIRNRYPLPRIDDLFDQLKGAKYFSKIDL 654

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+IR ED+
Sbjct: 655 RSGYHQLRIREEDI 668


>gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]
          Length = 1554

 Score =  101 bits (251), Expect(2) = 2e-39
 Identities = 59/138 (42%), Positives = 81/138 (58%)
 Frame = -2

Query: 658  NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
            +T  DLV LDMVDFDVILG++     +  LDC  ++V    P+ P  E+  S      + 
Sbjct: 615  STMADLVELDMVDFDVILGMNWLHACYASLDCRTRVVKFQFPNEPVFEWSSSSAVPKGRF 674

Query: 478  ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
            IS++ ARKLV KG + +L  + D  VE    +S+ +V  F +VFP +L  + P+R+ID  
Sbjct: 675  ISYLKARKLVSKGCIYHLVRVHDSSVEIPHFQSVPIVREFPKVFPDDLPGIPPEREIDFG 734

Query: 298  IDLDPVTRPIYIPP*CMA 245
            IDL P T PI IPP  MA
Sbjct: 735  IDLIPDTHPISIPPYRMA 752



 Score = 89.4 bits (220), Expect(2) = 2e-39
 Identities = 43/74 (58%), Positives = 56/74 (75%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF++KKD ++ MCI YR LNK+TI+NKY +P IDD+F+ LQG + F KI L
Sbjct: 775 SVSPWGAPVLFVRKKDGSLRMCIDYRQLNKVTIKNKYPLPRIDDLFNQLQGATCFSKIDL 834

Query: 42  RYG*HHLKIRVEDV 1
           R G H L++R  D+
Sbjct: 835 RSGYHQLRVRECDI 848


>gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]
          Length = 654

 Score =  108 bits (270), Expect(2) = 6e-39
 Identities = 56/140 (40%), Positives = 86/140 (61%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T ++LV+LD +DFDVILG++  SP H  +DC+ K+V    P  P    +G  +     +
Sbjct: 399 DTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNL 458

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS I AR+L+ +G + YL+ ++D   +   +  + VV  F++VFP  L  + P+R+++ C
Sbjct: 459 ISVISARRLLRQGCIGYLAVVKDSQAKIGDVTQVSVVKEFVDVFPEELPGLPPEREVEFC 518

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P TRPI IPP  MA A
Sbjct: 519 IDLIPDTRPISIPPYRMAPA 538



 Score = 80.9 bits (198), Expect(2) = 6e-39
 Identities = 38/61 (62%), Positives = 47/61 (77%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD ++ +CI YR LNK+T++NKY +P IDD+FD LQG   F KI L
Sbjct: 559 SVSPWGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDL 618

Query: 42  R 40
           R
Sbjct: 619 R 619


>ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502180 [Cicer arietinum]
          Length = 1235

 Score = 98.6 bits (244), Expect(2) = 6e-37
 Identities = 56/137 (40%), Positives = 79/137 (57%)
 Frame = -2

Query: 649  IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISF 470
            +DLV++D++DFDVILG+D  + +H  LDCH K+V   IP      ++G         I  
Sbjct: 634  VDLVVIDLIDFDVILGMDWLAFHHATLDCHDKVVKFEIPGQSVFSFQGERCWVPHNQILA 693

Query: 469  ICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDL 290
            + A KL+ +G  AY++ +RD  V    L+ I +   F +VFP  L  + PDR+I+  IDL
Sbjct: 694  LAASKLMRRGCQAYIALVRDTQVAEEKLEKIPIACEFPDVFPEELPGLPPDREIEFSIDL 753

Query: 289  DPVTRPIYIPP*CMALA 239
             P T PI IPP  MA A
Sbjct: 754  VPNTHPISIPPYRMAPA 770



 Score = 84.0 bits (206), Expect(2) = 6e-37
 Identities = 40/74 (54%), Positives = 53/74 (71%)
 Frame = -3

Query: 222  SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
            S S  GAPVLF+KKKD +M +C+ Y+ LNK+ ++NKY  P ID++FD LQG   F KI L
Sbjct: 791  SSSPWGAPVLFVKKKDGSMRLCVDYQQLNKVIVKNKYPSPRIDELFDQLQGAQCFPKIDL 850

Query: 42   RYG*HHLKIRVEDV 1
            R G H LKI+ +D+
Sbjct: 851  RSGYHQLKIKRDDI 864


>gb|EOY19242.1| Uncharacterized protein TCM_044240 [Theobroma cacao]
          Length = 721

 Score =  103 bits (256), Expect(2) = 2e-36
 Identities = 56/140 (40%), Positives = 83/140 (59%)
 Frame = -2

Query: 658 NT*IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKI 479
           +T ++LV+LD +DFDVILG+D  +P H  +DC+ K+V    P       +G  +     +
Sbjct: 171 DTLVNLVVLDTLDFDVILGMDWLAPCHASVDCYHKLVKFDFPCERSFSIQGDRSNAPTNL 230

Query: 478 ISFICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLC 299
           IS +  RKL+ +  L YL+ +RD  V+   +  + VVN F +VF   L  + P+R+I+ C
Sbjct: 231 ISVMSTRKLLRQDCLGYLAVVRDTQVKVGDISQVSVVNKFKDVFSEELPCLPPEREIEFC 290

Query: 298 IDLDPVTRPIYIPP*CMALA 239
           IDL P +RPI IPP  MA A
Sbjct: 291 IDLIPYSRPISIPPYRMAFA 310



 Score = 77.4 bits (189), Expect(2) = 2e-36
 Identities = 37/63 (58%), Positives = 46/63 (73%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           SVS  GAPVLF+KKKD ++ +CI Y  LNK+ ++NKY +P IDD+FD LQG   F KI L
Sbjct: 331 SVSPWGAPVLFVKKKDGSLRLCIDYLQLNKVMVKNKYPLPRIDDLFDQLQGAQCFSKIDL 390

Query: 42  RYG 34
           R G
Sbjct: 391 RSG 393


>ref|XP_004514315.1| PREDICTED: uncharacterized protein LOC101498372 [Cicer arietinum]
          Length = 1069

 Score = 90.5 bits (223), Expect(2) = 2e-35
 Identities = 42/77 (54%), Positives = 57/77 (74%)
 Frame = -3

Query: 231 LYLSVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLK 52
           ++ S S  GAPVLF+KKKD +M +C+ YR LNK+T++NKY +P ID++FD LQG   F K
Sbjct: 530 IHSSSSPWGAPVLFVKKKDGSMRLCVDYRQLNKVTVKNKYSLPRIDELFDQLQGAQCFSK 589

Query: 51  IGLRYG*HHLKIRVEDV 1
           I LR G H LKI+ +D+
Sbjct: 590 IDLRSGYHQLKIKRDDI 606



 Score = 87.4 bits (215), Expect(2) = 2e-35
 Identities = 50/131 (38%), Positives = 74/131 (56%)
 Frame = -2

Query: 649 IDLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISF 470
           + LV++D+++FDVILG+D  + +H  LD H K+V   IP      ++G+        I  
Sbjct: 396 VGLVVIDLINFDVILGMDWLALHHATLDFHNKVVKFEIPGQSVFSFQGAHCWVPHNQILA 455

Query: 469 ICARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDL 290
           + A KL+ +G   YL+ +RD  V    L+ I +   F +VFP  L  + PDR+I+  IDL
Sbjct: 456 LRASKLMRRGCQTYLALVRDTQVTEEELEKIPIACEFPDVFPEELPGLPPDREIEFSIDL 515

Query: 289 DPVTRPIYIPP 257
            P T PI IPP
Sbjct: 516 VPNTHPISIPP 526


>gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]
          Length = 649

 Score = 93.6 bits (231), Expect(2) = 2e-35
 Identities = 43/74 (58%), Positives = 58/74 (78%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           S+S  GAPVLF+KKKD T+ +CI YR LN++TI+NKY +P IDD+FD LQG ++F K+ L
Sbjct: 382 SISPWGAPVLFVKKKDGTLRLCIDYRQLNRMTIKNKYPLPRIDDLFDQLQGATVFSKVDL 441

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+I+ +DV
Sbjct: 442 RSGYHQLRIKEQDV 455



 Score = 84.3 bits (207), Expect(2) = 2e-35
 Identities = 51/136 (37%), Positives = 78/136 (57%)
 Frame = -2

Query: 646 DLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISFI 467
           DL+ L+++DFD+ILG+D  + +   +DC  K V L      ++ + G        +IS I
Sbjct: 226 DLIPLEILDFDLILGMDWLTAHRANVDCFRKEVVLRNSEGAEIVFVGKRRVLPSCVISAI 285

Query: 466 CARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDLD 287
            A KLV KG+  YL+++ D       L+ + +V+ F +VFP +L  + PDR+++  IDL 
Sbjct: 286 KASKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLL 345

Query: 286 PVTRPIYIPP*CMALA 239
           P T PI IPP  MA A
Sbjct: 346 PGTAPISIPPYRMAPA 361


>gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1336

 Score = 94.4 bits (233), Expect(2) = 2e-35
 Identities = 43/74 (58%), Positives = 58/74 (78%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           S+S  GAP+LF+KKKD T+ +CI YR LN++TI+NKY +P IDDIFD LQG ++F K+ L
Sbjct: 572 SISPWGAPILFVKKKDGTLRLCIDYRQLNRMTIKNKYPLPRIDDIFDQLQGATVFSKVNL 631

Query: 42  RYG*HHLKIRVEDV 1
           R G H L+I+ +DV
Sbjct: 632 RSGYHQLRIKEQDV 645



 Score = 83.2 bits (204), Expect(2) = 2e-35
 Identities = 51/136 (37%), Positives = 77/136 (56%)
 Frame = -2

Query: 646 DLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISFI 467
           DL+ L ++DFD+ILG+D  + +   +DC  K V L      ++ + G        +IS I
Sbjct: 416 DLIPLKILDFDLILGMDWLTTHRANVDCFRKEVVLRNSEGAEIVFVGKHRVLPSCVISAI 475

Query: 466 CARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDLD 287
            A KLV KG+  YL+++ D       L+ + +V+ F +VFP +L  + PDR+++  IDL 
Sbjct: 476 KASKLVQKGYPTYLAYVIDTSKGEPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLL 535

Query: 286 PVTRPIYIPP*CMALA 239
           P T PI IPP  MA A
Sbjct: 536 PGTAPISIPPYRMAPA 551


>gb|EOY21657.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1188

 Score = 94.0 bits (232), Expect(2) = 3e-35
 Identities = 43/74 (58%), Positives = 58/74 (78%)
 Frame = -3

Query: 222 SVSLLGAPVLFIKKKDTTMHMCIYYR*LNKITIRNKY*IPHIDDIFDYLQGVSIFLKIGL 43
           S+S  GAPVLF+KKKD T+ +CIYYR LN++TI+NKY +P IDD+FD L+G  +F KI L
Sbjct: 600 SISPWGAPVLFVKKKDGTLRLCIYYRQLNRVTIKNKYPLPRIDDLFDQLRGAMVFSKIDL 659

Query: 42  RYG*HHLKIRVEDV 1
           R G + L+I+ +DV
Sbjct: 660 RSGYYQLRIKEQDV 673



 Score = 82.8 bits (203), Expect(2) = 3e-35
 Identities = 51/136 (37%), Positives = 78/136 (57%)
 Frame = -2

Query: 646 DLVILDMVDFDVILGID*PSPYHVILDCHAKIVTLAIPSLPKVEYRGSPNPNSKKIISFI 467
           DL+ L+++DFD+ILG+D  + +   +DC  K V L      ++ + G        +IS I
Sbjct: 444 DLIPLEILDFDLILGMDWLTAHWANMDCFRKEVVLRNSEGAEIVFVGERRVLPSCVISAI 503

Query: 466 CARKLVDKGFLAYLSHIRDMIVETSTLKSILVVN*FIEVFP*NLSSMLPDRDIDLCIDLD 287
            A KLV KG+ AYL+++ D       L+ + +V+ F +VF  +L  + PDR+++  IDL 
Sbjct: 504 KASKLVQKGYPAYLAYVIDTSKGEPKLEDVPIVSEFPDVFSDDLPGLPPDRELEFPIDLL 563

Query: 286 PVTRPIYIPP*CMALA 239
           P T PI IPP  MA A
Sbjct: 564 PSTAPISIPPYRMAPA 579


Top