BLASTX nr result

ID: Catharanthus23_contig00009719

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009719
         (1402 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...    67   4e-20
gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptas...    65   4e-15
ref|XP_006354967.1| PREDICTED: uncharacterized protein LOC102599...    53   3e-14
ref|XP_002446678.1| hypothetical protein SORBIDRAFT_06g020403 [S...    52   2e-13
ref|XP_002450418.1| hypothetical protein SORBIDRAFT_05g005061 [S...    49   6e-13
gb|EOY08834.1| Uncharacterized protein TCM_024073 [Theobroma cacao]    55   1e-12
gb|EMJ22027.1| hypothetical protein PRUPE_ppb017095mg [Prunus pe...    69   1e-12
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]    74   3e-12
ref|XP_002452516.1| hypothetical protein SORBIDRAFT_04g027285 [S...    48   4e-12
gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]    74   5e-12
gb|EPS63383.1| hypothetical protein M569_11401 [Genlisea aurea]        55   6e-12
gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]    76   8e-12
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]    74   9e-12
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]    65   9e-12
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    67   1e-11
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]    74   1e-11
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    72   1e-11
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]    75   2e-11
ref|XP_002454313.1| hypothetical protein SORBIDRAFT_04g028482 [S...    47   2e-11
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]    67   3e-11

>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1378

 Score = 67.0 bits (162), Expect(3) = 4e-20
 Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 5/94 (5%)
 Frame = +3

Query: 954  IDLG-----HTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPLLVS 1118
            IDLG     HTWS           RLDR   N  W+ +F++  V++LP+  SDH P+L+S
Sbjct: 173  IDLGFTGPAHTWSRGLSPTTFKSARLDRGLANSEWKLKFTEGVVRNLPKSQSDHCPILIS 232

Query: 1119 CHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
              G +   + ++PF+ +  W +H  F +F+++ W
Sbjct: 233  TSGFAPVPRIIKPFRFQAAWLNHQVFCEFVRKNW 266



 Score = 54.3 bits (129), Expect(3) = 4e-20
 Identities = 31/91 (34%), Positives = 46/91 (50%)
 Frame = +2

Query: 500 LFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIVSL 679
           L +L+++N+P +L L ET I   Q         F      EA+GF GG+W+  KS  V++
Sbjct: 20  LRELMRINNPTVLALVETHISGDQAQRICDRIGFSGQTRVEAEGFRGGIWLFWKSEEVTV 79

Query: 680 VCVVVDFQTIT*FLLREGKVDWVLSTVYASP 772
                  Q +T  + R G   W+ S +YASP
Sbjct: 80  TPYGSHSQHLTVEIRRIGDDPWLFSAIYASP 110



 Score = 25.0 bits (53), Expect(3) = 4e-20
 Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
 Frame = +1

Query: 772 SPSFT--KSLWSYVKDMAAAISLPWLFLGDVNQ 864
           SP  T  K LW  ++ +    + PWL  GD N+
Sbjct: 109 SPDSTLRKELWRELEQIKNQYTGPWLLAGDFNE 141


>gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H
            [Medicago truncatula]
          Length = 1296

 Score = 65.1 bits (157), Expect(3) = 4e-15
 Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 5/112 (4%)
 Frame = +3

Query: 903  HWGSATAL*GMINVCRFIDLG-----HTWSNNRMDGA*IMERLDRFWENWLWRNQFSQAC 1067
            H   A      +N C  +DL       TW  N      + ++LDR   N  WR  F +A 
Sbjct: 153  HHNRAATFSNFMNNCNLLDLTTTGGRFTWHKNNNGIRILSKKLDRGMANVDWRLSFPEAF 212

Query: 1068 VQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGWA 1223
            V+ L R+HSDH+PLL+   G    T+  RPF+ E  W  H ++   ++R W+
Sbjct: 213  VEVLCRLHSDHNPLLLR-FGGLPLTRGPRPFRFEAAWIDHYDYGNVVKRSWS 263



 Score = 34.3 bits (77), Expect(3) = 4e-15
 Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 (3%)
 Frame = +2

Query: 620 EAQGFAGGLWVL*KSSIVSLVCVVVDFQ--TIT*FLLREGKVDWVLSTVYASPHLHLQ 787
           EA G +GG+W+L K S  ++   V+DF   +IT F++  G      + +YASP+  ++
Sbjct: 59  EANGHSGGVWLL-KHSTTNITSTVLDFNQYSIT-FIIGRGAAITTCTCIYASPNYSMR 114



 Score = 29.6 bits (65), Expect(3) = 4e-15
 Identities = 10/29 (34%), Positives = 19/29 (65%)
 Frame = +1

Query: 778 SFTKSLWSYVKDMAAAISLPWLFLGDVNQ 864
           S   +LW+Y+ ++   I+ PW+ +GD N+
Sbjct: 112 SMRPNLWNYLVNINDTITGPWMLIGDFNE 140


>ref|XP_006354967.1| PREDICTED: uncharacterized protein LOC102599840 [Solanum tuberosum]
          Length = 288

 Score = 53.1 bits (126), Expect(3) = 3e-14
 Identities = 28/55 (50%), Positives = 34/55 (61%), Gaps = 6/55 (10%)
 Frame = +3

Query: 954  IDLG-----HTWSN-NRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100
            IDLG     +TWSN ++ +   IMER+DRF  N  W N F  + V HLPR HSDH
Sbjct: 201  IDLGFTGQKYTWSNKHKNNNTLIMERIDRFLSNHSWLNLFPDSHVHHLPRTHSDH 255



 Score = 41.2 bits (95), Expect(3) = 3e-14
 Identities = 27/94 (28%), Positives = 51/94 (54%)
 Frame = +2

Query: 428 PDSKPMNLIY*NCRGASS*DFSRALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDR 607
           P+S  M +   NCRGA++  F   +  LI  ++P IL L ET++      +  ++  +  
Sbjct: 26  PESPLMKIFLWNCRGANNAKFMNNIRALIDSHNPTILALTETRMEDLDKIL--QALDYTD 83

Query: 608 MICSEAQGFAGGLWVL*KSSIVSLVCVVVDFQTI 709
           +I   A G++GG+ +L ++S +++   V+  Q I
Sbjct: 84  VIQVPAFGYSGGIALLWRNSEINVEPFVITEQEI 117



 Score = 32.0 bits (71), Expect(3) = 3e-14
 Identities = 13/27 (48%), Positives = 17/27 (62%)
 Frame = +1

Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867
           K LW  +K++ A I  PWL  GD N+V
Sbjct: 145 KILWENLKNLTARIKGPWLVCGDFNEV 171


>ref|XP_002446678.1| hypothetical protein SORBIDRAFT_06g020403 [Sorghum bicolor]
            gi|241937861|gb|EES11006.1| hypothetical protein
            SORBIDRAFT_06g020403 [Sorghum bicolor]
          Length = 633

 Score = 52.4 bits (124), Expect(3) = 2e-13
 Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
 Frame = +3

Query: 930  GMINVCR-----FIDLGHTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHS 1094
            G ++VC+     +I LG T+      G  +  RLDR   +  W  +F  A VQHL  V S
Sbjct: 496  GAVDVCQLRDIGYIGLGWTFEKKVAGGHYVRVRLDRALASVNWCARFPLAAVQHLTTVKS 555

Query: 1095 DHHPLLVSCHGPSQCTK----PVRPFKVEMDWFSHPEFPQFIQRGW 1220
            DH P+L+S H P +  +      +PF+ E+ W ++      I++ W
Sbjct: 556  DHCPILLS-HVPDERNEGGGCQGKPFRYELMWETNERLSSLIEQIW 600



 Score = 38.1 bits (87), Expect(3) = 2e-13
 Identities = 23/98 (23%), Positives = 46/98 (46%)
 Frame = +2

Query: 494 RALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIV 673
           + L DL K  +P+++ + ET+I   +V     +  FD      + G +GGL +   + ++
Sbjct: 351 KELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVNSSGRSGGLGLFWNNDVL 410

Query: 674 SLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPHLHLQ 787
             +    ++   T  +   GK  W +S +Y  P+  L+
Sbjct: 411 LSIQKYSNYHIDT-IISEHGKEPWRMSFIYGEPNRSLR 447



 Score = 32.7 bits (73), Expect(3) = 2e-13
 Identities = 11/27 (40%), Positives = 17/27 (62%)
 Frame = +1

Query: 796 WSYVKDMAAAISLPWLFLGDVNQVFRR 876
           W  +K M +   LPW+ +GD N++ RR
Sbjct: 451 WDIMKQMRSDTDLPWVCMGDFNEILRR 477


>ref|XP_002450418.1| hypothetical protein SORBIDRAFT_05g005061 [Sorghum bicolor]
            gi|241936261|gb|EES09406.1| hypothetical protein
            SORBIDRAFT_05g005061 [Sorghum bicolor]
          Length = 753

 Score = 48.9 bits (115), Expect(3) = 6e-13
 Identities = 31/104 (29%), Positives = 52/104 (50%), Gaps = 9/104 (8%)
 Frame = +3

Query: 936  INVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100
            +++C+  D+G+     T+      G  +  RLDR   +  W  +F  A VQHL  V SDH
Sbjct: 165  VDMCQLRDIGYIGLDWTFEKKVAGGHFVRVRLDRALASVNWCARFPLAAVQHLTAVKSDH 224

Query: 1101 HPLLVSCHGPSQCTK----PVRPFKVEMDWFSHPEFPQFIQRGW 1220
             P+L+S H P +  +      +PF+ E+ W ++      I++ W
Sbjct: 225  CPILLS-HVPDERNEGGGCQGKPFRYELMWETNERLSSLIEQIW 267



 Score = 38.1 bits (87), Expect(3) = 6e-13
 Identities = 25/105 (23%), Positives = 48/105 (45%)
 Frame = +2

Query: 461 NCRGASS*DFSRALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAG 640
           NCRG  +    + L DL K  +P+++ + ET+I   +V     +  FD      + G +G
Sbjct: 7   NCRGIGNPATVKELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVNSSGRSG 66

Query: 641 GLWVL*KSSIVSLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPH 775
           GL +   + ++  +    ++   T  +   GK    +S +Y  P+
Sbjct: 67  GLGLFWNNDVLLSIQKYSNYHIDT-IISEHGKEPRRMSFIYGEPN 110



 Score = 34.7 bits (78), Expect(3) = 6e-13
 Identities = 13/33 (39%), Positives = 19/33 (57%)
 Frame = +1

Query: 778 SFTKSLWSYVKDMAAAISLPWLFLGDVNQVFRR 876
           SF    W  +K M +   LPW+ +GD N++ RR
Sbjct: 112 SFRYRTWDIMKQMRSDTDLPWVCMGDFNEILRR 144


>gb|EOY08834.1| Uncharacterized protein TCM_024073 [Theobroma cacao]
          Length = 660

 Score = 54.7 bits (130), Expect(3) = 1e-12
 Identities = 29/81 (35%), Positives = 39/81 (48%)
 Frame = +3

Query: 969  TWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKP 1148
            TW N +        RLDR + N  W   F      +LPR HSDHHP+LV C  PS     
Sbjct: 319  TWWNKKEGLDYTQVRLDRVFVNDRWHVMFPNVVAINLPRTHSDHHPVLVRCSSPSM-LPD 377

Query: 1149 VRPFKVEMDWFSHPEFPQFIQ 1211
            +  F+ +    SHP F  +++
Sbjct: 378  LNKFRFKEARTSHPSFDAYLR 398



 Score = 38.1 bits (87), Expect(3) = 1e-12
 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 9/66 (13%)
 Frame = +2

Query: 602 DRMICS---------EAQGFAGGLWVL*KSSIVSLVCVVVDFQTIT*FLLREGKVDWVLS 754
           D+M C          +A G++GG+WV   + ++ +  +    Q +T  LL   K  W+L+
Sbjct: 183 DKMCCKYGFQNYFKVKANGYSGGIWVFWNAEVIEVEVLAYSSQ-LTHLLLNPSKEQWLLT 241

Query: 755 TVYASP 772
            +Y SP
Sbjct: 242 EIYGSP 247



 Score = 27.7 bits (60), Expect(3) = 1e-12
 Identities = 10/27 (37%), Positives = 16/27 (59%)
 Frame = +1

Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867
           K LW  +K  +    +PW+ +GD NQ+
Sbjct: 253 KHLWDSLKLASNDQDIPWMVIGDFNQI 279


>gb|EMJ22027.1| hypothetical protein PRUPE_ppb017095mg [Prunus persica]
          Length = 883

 Score = 68.6 bits (166), Expect(2) = 1e-12
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
 Frame = +3

Query: 954  IDLG-----HTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPLLVS 1118
            +DLG     +TW N +     + ER+DR      WR  ++ A V+HLPR  SDH+PL +S
Sbjct: 555  VDLGFSGPKYTWRNTK-----VSERIDRAICTMNWRGLYADAHVRHLPRTTSDHNPLKIS 609

Query: 1119 CHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGWAEV 1229
                   T  +RPF+ E  W  H +F  FI   W ++
Sbjct: 610  LQSCFHATPHLRPFRFEAMWLKHEKFGDFINNTWVKL 646



 Score = 32.3 bits (72), Expect(2) = 1e-12
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
 Frame = +1

Query: 754 YCIRLTSPSFTK--SLWSYVKDMAAAISLPWLFLGDVNQV 867
           + +   SP   K  SLW Y+K +     LPWL  GD N++
Sbjct: 488 FTVVYASPCIRKRASLWEYLKFVVECHHLPWLLAGDFNEM 527


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 73.6 bits (179), Expect(2) = 3e-12
 Identities = 44/119 (36%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
 Frame = +3

Query: 879  DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043
            ++L+G  PH GS   L   +  C  +D        TW+NNRM      +RLDR   N  W
Sbjct: 67   ERLNGAIPHDGSMEDLSSTLFDCGLLDASFEGNSFTWTNNRM-----FQRLDRVVYNQEW 121

Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
               FS   VQHL R  SDH PLL+SC   +Q  +   PF+    W  H +F  F+++ W
Sbjct: 122  AELFSSTRVQHLNRDGSDHCPLLISCSNTNQ--RGPAPFRFLHAWTKHHDFLSFVEKSW 178



 Score = 26.2 bits (56), Expect(2) = 3e-12
 Identities = 9/27 (33%), Positives = 16/27 (59%)
 Frame = +1

Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867
           + LWS ++ ++  +  PWL  GD N +
Sbjct: 36  RELWSSLRIISDGMQAPWLVGGDFNSI 62


>ref|XP_002452516.1| hypothetical protein SORBIDRAFT_04g027285 [Sorghum bicolor]
            gi|241932347|gb|EES05492.1| hypothetical protein
            SORBIDRAFT_04g027285 [Sorghum bicolor]
          Length = 689

 Score = 48.1 bits (113), Expect(3) = 4e-12
 Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 9/104 (8%)
 Frame = +3

Query: 936  INVCR-----FIDLGHTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100
            ++VC+     +I LG T+      G  +  RLDR   +  W  +F  A  QHL  V SDH
Sbjct: 498  VDVCQLRDIGYIGLGWTFEKKVAGGHYVRVRLDRALASVNWCARFPLAAGQHLTTVKSDH 557

Query: 1101 HPLLVSCHGPSQCTK----PVRPFKVEMDWFSHPEFPQFIQRGW 1220
             P+L+S H P++  +      +PF+ E+ W ++      I++ W
Sbjct: 558  CPILLS-HVPNERNEGGGCQGKPFRYELMWETNERLSSLIEQIW 600



 Score = 38.1 bits (87), Expect(3) = 4e-12
 Identities = 23/98 (23%), Positives = 46/98 (46%)
 Frame = +2

Query: 494 RALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIV 673
           + L DL K  +P+++ + ET+I   +V     +  FD      + G +GGL +   + ++
Sbjct: 351 KELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVNSSGRSGGLGLFWNNDVL 410

Query: 674 SLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPHLHLQ 787
             +    ++   T  +   GK  W +S +Y  P+  L+
Sbjct: 411 LSIQKYSNYHIDT-IISEHGKEPWRMSFIYGEPNRSLR 447



 Score = 32.7 bits (73), Expect(3) = 4e-12
 Identities = 11/27 (40%), Positives = 17/27 (62%)
 Frame = +1

Query: 796 WSYVKDMAAAISLPWLFLGDVNQVFRR 876
           W  +K M +   LPW+ +GD N++ RR
Sbjct: 451 WDIMKQMRSDTDLPWVCMGDFNEILRR 477


>gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
          Length = 1659

 Score = 73.9 bits (180), Expect(2) = 5e-12
 Identities = 47/120 (39%), Positives = 59/120 (49%), Gaps = 5/120 (4%)
 Frame = +3

Query: 876  ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040
            A++L+G  PH GS       +  C  ID G      TW+NN M      +RLDR   N  
Sbjct: 698  AERLNGAPPHGGSMEDFVATLFDCGLIDAGFEGNSFTWTNNHM-----FQRLDRVVYNPE 752

Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
            W + FS   VQHL R  SDH PLL+SC   SQ  K    F+    W  H +F  F++R W
Sbjct: 753  WAHCFSSTRVQHLNRDGSDHCPLLISCATASQ--KGPSTFRFLHAWTKHHDFLPFVERSW 810



 Score = 25.0 bits (53), Expect(2) = 5e-12
 Identities = 8/25 (32%), Positives = 16/25 (64%)
 Frame = +1

Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867
           LW+ ++ ++A +  PW+  GD N +
Sbjct: 670 LWNCLRSLSADMQGPWMVGGDFNTI 694


>gb|EPS63383.1| hypothetical protein M569_11401 [Genlisea aurea]
          Length = 1469

 Score = 55.5 bits (132), Expect(3) = 6e-12
 Identities = 35/100 (35%), Positives = 44/100 (44%), Gaps = 7/100 (7%)
 Frame = +3

Query: 945  CRFIDLGHT-----WSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPL 1109
            C+  D+G T     W N R     +  RLDR      W N F +A V+HLP   SDH PL
Sbjct: 536  CQLQDIGFTGFPFTWCNKRKAPDTVRARLDRAVATTTWNNLFPRAIVKHLPYGSSDHLPL 595

Query: 1110 LVSCH--GPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGWA 1223
            L+      P+      R FK E  W + P     I + WA
Sbjct: 596  LIFLDPAAPTSIRPNKRRFKFEAFWTTIPGCADVIHQSWA 635



 Score = 40.0 bits (92), Expect(3) = 6e-12
 Identities = 34/124 (27%), Positives = 60/124 (48%), Gaps = 3/124 (2%)
 Frame = +2

Query: 425 FPDSKP--MNLIY*NCRGASS*DFSRALFDLIKLNSPAILILAETKIHSSQVAVFPRSTH 598
           +P + P  M+L+  NCRG  S    R L D+I  ++P+++ L+ETK  +S V        
Sbjct: 361 YPKAPPSAMSLLAWNCRGLRSASTVRRLRDVISSDAPSMIFLSETKCLASHVEWLKECLS 420

Query: 599 FDRMICSEAQGFAGGLWVL*KSSI-VSLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPH 775
           +  +  S A G +GGL +  +  + VSL+     +  +    L     +W  +  Y +P 
Sbjct: 421 YFGVAVS-ATGLSGGLALFWRKDVCVSLLSFCSSYIDVL-VRLTPTLPEWRFTGFYGNPA 478

Query: 776 LHLQ 787
           + L+
Sbjct: 479 VQLR 482



 Score = 22.7 bits (47), Expect(3) = 6e-12
 Identities = 8/24 (33%), Positives = 12/24 (50%)
 Frame = +1

Query: 796 WSYVKDMAAAISLPWLFLGDVNQV 867
           W  ++ +      PWL  GD N+V
Sbjct: 486 WDLLRQIRHHSICPWLVAGDFNEV 509


>gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
          Length = 2606

 Score = 75.9 bits (185), Expect(2) = 8e-12
 Identities = 46/131 (35%), Positives = 65/131 (49%), Gaps = 5/131 (3%)
 Frame = +3

Query: 876  ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040
            A++L G HPH GS      M+  C  +D G+     TW+NN M      +RLDR   N  
Sbjct: 996  AERLHGAHPHSGSMEDFATMLLDCGLLDAGYEGNNFTWTNNHM-----FQRLDRVVYNHE 1050

Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
            W + F+   +QHL R  SDH PLL+SC+   Q  +    F+    W  H +F  F++R W
Sbjct: 1051 WADCFNNTRIQHLNRDGSDHCPLLISCNNTVQ--RGPSNFRFLHAWTHHHDFIPFVERSW 1108

Query: 1221 AEVWI*TGRVI 1253
                  TG ++
Sbjct: 1109 RVPMQATGMLV 1119



 Score = 22.3 bits (46), Expect(2) = 8e-12
 Identities = 7/25 (28%), Positives = 15/25 (60%)
 Frame = +1

Query: 793  LWSYVKDMAAAISLPWLFLGDVNQV 867
            LW+ ++ ++  +  PW+  GD N +
Sbjct: 968  LWNCLRSISWDMQGPWMVGGDFNSI 992


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 74.3 bits (181), Expect(2) = 9e-12
 Identities = 47/120 (39%), Positives = 60/120 (50%), Gaps = 5/120 (4%)
 Frame = +3

Query: 876  ADKLDGPHPHWGSATAL*GMINVCRFIDLG-----HTWSNNRMDGA*IMERLDRFWENWL 1040
            A++L+G  PH GS       +  C  ID G     +TW+NN M      +RLDR   N  
Sbjct: 387  AERLNGASPHEGSMEDFAATLLDCGLIDAGFEGNSYTWTNNHM-----FQRLDRVVYNPE 441

Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
            W + FS   VQHL R  SDH PLL+SC   SQ  K    F+    W  H +F  F++R W
Sbjct: 442  WVHFFSSTRVQHLNRDGSDHCPLLISCATASQ--KGPSTFRFLHAWTKHHDFLPFVERSW 499



 Score = 23.9 bits (50), Expect(2) = 9e-12
 Identities = 7/25 (28%), Positives = 16/25 (64%)
 Frame = +1

Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867
           LW+ ++ +++ +  PW+  GD N +
Sbjct: 359 LWNCLRSLSSDMQGPWMVDGDFNTI 383


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 65.5 bits (158), Expect(2) = 9e-12
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
 Frame = +3

Query: 879  DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043
            ++L G  PH GS      ++  C  +D G      TW+NNRM      +RLDR   N  W
Sbjct: 144  ERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNPFTWTNNRM-----FQRLDRVVYNHQW 198

Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
             N F    +QHL R  SDH PLL+SC   ++  K    F+ +  W  H +F   ++  W
Sbjct: 199  INMFPITRIQHLNRDGSDHCPLLISCFISNE--KSPSSFRFQHAWVLHHDFKTSVEGNW 255



 Score = 32.7 bits (73), Expect(2) = 9e-12
 Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 17/56 (30%)
 Frame = +1

Query: 760 IRLTSPSFTKS-----------------LWSYVKDMAAAISLPWLFLGDVNQVFRR 876
           +RLTSP   KS                 LW  ++ +AA I +PWL  GD N + +R
Sbjct: 87  VRLTSPWLEKSFFATFVYAKCTRSERTFLWDCLRRLAADIEVPWLVGGDFNIILKR 142


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 67.0 bits (162), Expect(2) = 1e-11
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
 Frame = +3

Query: 879  DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043
            ++L G  PH G+       +  C  +D G      TW+NNRM      +RLDR   N  W
Sbjct: 1199 ERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNPFTWTNNRM-----FQRLDRIVYNHHW 1253

Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
             N+F    +QHL R  SDH PLL+SC   S+  K    F+ +  W  H +F   ++  W
Sbjct: 1254 INKFPITRIQHLNRDGSDHCPLLISCFNSSE--KAPSSFRFQHAWVLHHDFKTSVESNW 1310



 Score = 30.8 bits (68), Expect(2) = 1e-11
 Identities = 12/28 (42%), Positives = 18/28 (64%)
 Frame = +1

Query: 793  LWSYVKDMAAAISLPWLFLGDVNQVFRR 876
            LW  ++ +AA I +PWL  GD N + +R
Sbjct: 1170 LWDCLRRLAADIEVPWLVGGDFNIILKR 1197


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 73.9 bits (180), Expect(2) = 1e-11
 Identities = 47/120 (39%), Positives = 59/120 (49%), Gaps = 5/120 (4%)
 Frame = +3

Query: 876  ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040
            A++L+G  PH GS       +  C  ID G      TW+NN M      +RLDR   N  
Sbjct: 731  AERLNGAPPHGGSMEDFVATLFDCGLIDAGFEGNSFTWTNNHM-----FQRLDRVVYNPE 785

Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
            W + FS   VQHL R  SDH PLL+SC   SQ  K    F+    W  H +F  F++R W
Sbjct: 786  WAHCFSSTRVQHLNRDGSDHCPLLISCATASQ--KGPSTFRFLHAWTKHHDFLPFVERSW 843



 Score = 23.9 bits (50), Expect(2) = 1e-11
 Identities = 7/25 (28%), Positives = 16/25 (64%)
 Frame = +1

Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867
           LW+ ++ +++ +  PW+  GD N +
Sbjct: 703 LWNCLRSLSSDMQGPWMVGGDFNTI 727


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 72.4 bits (176), Expect(2) = 1e-11
 Identities = 44/119 (36%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
 Frame = +3

Query: 879  DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043
            ++L+G  PH GS   L   +  C  +D G      TW+NNRM      +RLDR   N  W
Sbjct: 993  ERLNGAIPHDGSMEDLSSTLFDCGLLDAGFEGNSFTWTNNRM-----FQRLDRVVYNQEW 1047

Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
               FS   VQHL R  SDH PLL+SC   +Q  +    F+    W  H +F  F+++ W
Sbjct: 1048 AEFFSSTRVQHLNRDGSDHCPLLISCSNTNQ--RGPATFRFLHAWTKHHDFISFVEKSW 1104



 Score = 25.0 bits (53), Expect(2) = 1e-11
 Identities = 8/27 (29%), Positives = 16/27 (59%)
 Frame = +1

Query: 787  KSLWSYVKDMAAAISLPWLFLGDVNQV 867
            + LW+ ++ ++  +  PWL  GD N +
Sbjct: 962  RELWTSLRIISDGMQAPWLVGGDFNSI 988


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 74.7 bits (182), Expect(2) = 2e-11
 Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 5/131 (3%)
 Frame = +3

Query: 876  ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040
            A++L G HPH GS      M+  C  +D G+     TW+NN M      +RLDR   N  
Sbjct: 996  AERLHGAHPHSGSMEDFATMLLDCGLLDAGYEGNNFTWTNNHM-----FQRLDRVVYNHE 1050

Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
            W + F+   +QHL R  SDH PLL+SC+   Q  +    F+    W  H +F  F+++ W
Sbjct: 1051 WADCFNNTRIQHLNRDGSDHCPLLISCNNTVQ--RGPSNFRFLHAWTHHHDFIPFVEKSW 1108

Query: 1221 AEVWI*TGRVI 1253
                  TG ++
Sbjct: 1109 RVPMQATGMLV 1119



 Score = 22.3 bits (46), Expect(2) = 2e-11
 Identities = 7/25 (28%), Positives = 15/25 (60%)
 Frame = +1

Query: 793  LWSYVKDMAAAISLPWLFLGDVNQV 867
            LW+ ++ ++  +  PW+  GD N +
Sbjct: 968  LWNCLRSISWDMQGPWMVGGDFNSI 992


>ref|XP_002454313.1| hypothetical protein SORBIDRAFT_04g028482 [Sorghum bicolor]
            gi|241934144|gb|EES07289.1| hypothetical protein
            SORBIDRAFT_04g028482 [Sorghum bicolor]
          Length = 509

 Score = 47.0 bits (110), Expect(3) = 2e-11
 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 9/104 (8%)
 Frame = +3

Query: 936  INVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100
            ++VC+  D+G+     T+      G  +  RLDR   +  W + F  A ++HL  V SDH
Sbjct: 361  VDVCQLCDIGYMGLDWTFEKKVAGGHFVRVRLDRALASASWSSYFPFAVLRHLTAVKSDH 420

Query: 1101 HPLLVSC----HGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
             P+L+S         +C +  +PF+ E+ W ++      IQ  W
Sbjct: 421  CPILLSLQLDERSNFECGQG-KPFRYEIMWETNKGLRSLIQHKW 463



 Score = 35.0 bits (79), Expect(3) = 2e-11
 Identities = 22/96 (22%), Positives = 45/96 (46%)
 Frame = +2

Query: 524 SPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIVSLVCVVVDFQ 703
           +PA++ + ET+I+  +V     S  +D      + G +GGL +  K+ +   +     + 
Sbjct: 224 APAVIFIMETQINKYRVENLRYSLGYDDSFAVNSSGKSGGLGLFWKNDVNVSIKKFSKYH 283

Query: 704 TIT*FLLREGKVDWVLSTVYASPHLHLQNPYGPMLK 811
             T  +   GK  W +S +Y  P+  L++    ++K
Sbjct: 284 IDT-IIEENGKEPWRMSFIYGEPNRSLRHRTWDIMK 318



 Score = 34.3 bits (77), Expect(3) = 2e-11
 Identities = 12/27 (44%), Positives = 17/27 (62%)
 Frame = +1

Query: 796 WSYVKDMAAAISLPWLFLGDVNQVFRR 876
           W  +K M +   LPWL +GD N++ RR
Sbjct: 314 WDIMKQMRSDFDLPWLCIGDFNEILRR 340


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 67.4 bits (163), Expect(2) = 3e-11
 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
 Frame = +3

Query: 879  DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043
            ++L G  PH G+       +  C  +D G      TW+NNRM      +RLDR   N  W
Sbjct: 1027 ERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNSFTWTNNRM-----FQRLDRIVYNHHW 1081

Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220
             N+F    +QHL R  SDH PLL+SC   S+  K    F+ +  W  H +F   ++  W
Sbjct: 1082 INKFPVTRIQHLNRDGSDHCPLLISCFNSSE--KAPSSFRFQHAWVLHHDFKTSVESNW 1138



 Score = 28.9 bits (63), Expect(2) = 3e-11
 Identities = 11/28 (39%), Positives = 17/28 (60%)
 Frame = +1

Query: 793  LWSYVKDMAAAISLPWLFLGDVNQVFRR 876
            LW  ++ +A  I +PWL  GD N + +R
Sbjct: 998  LWDCLRRLADDIEVPWLVGGDFNVILKR 1025

