BLASTX nr result
ID: Catharanthus23_contig00009719
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009719
         (1402 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...    67   4e-20
gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptas...    65   4e-15
ref|XP_006354967.1| PREDICTED: uncharacterized protein LOC102599...    53   3e-14
ref|XP_002446678.1| hypothetical protein SORBIDRAFT_06g020403 [S...    52   2e-13
ref|XP_002450418.1| hypothetical protein SORBIDRAFT_05g005061 [S...    49   6e-13
gb|EOY08834.1| Uncharacterized protein TCM_024073 [Theobroma cacao]    55   1e-12
gb|EMJ22027.1| hypothetical protein PRUPE_ppb017095mg [Prunus pe...    69   1e-12
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]    74   3e-12
ref|XP_002452516.1| hypothetical protein SORBIDRAFT_04g027285 [S...    48   4e-12
gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]    74   5e-12
gb|EPS63383.1| hypothetical protein M569_11401 [Genlisea aurea]        55   6e-12
gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]    76   8e-12
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]    74   9e-12
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]    65   9e-12
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    67   1e-11
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]    74   1e-11
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    72   1e-11
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]    75   2e-11
ref|XP_002454313.1| hypothetical protein SORBIDRAFT_04g028482 [S...
47 2e-11 gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] 67 3e-11 >emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1378 Score = 67.0 bits (162), Expect(3) = 4e-20 Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 5/94 (5%) Frame = +3 Query: 954 IDLG-----HTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPLLVS 1118 IDLG HTWS RLDR N W+ +F++ V++LP+ SDH P+L+S Sbjct: 173 IDLGFTGPAHTWSRGLSPTTFKSARLDRGLANSEWKLKFTEGVVRNLPKSQSDHCPILIS 232 Query: 1119 CHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 G + + ++PF+ + W +H F +F+++ W Sbjct: 233 TSGFAPVPRIIKPFRFQAAWLNHQVFCEFVRKNW 266 Score = 54.3 bits (129), Expect(3) = 4e-20 Identities = 31/91 (34%), Positives = 46/91 (50%) Frame = +2 Query: 500 LFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIVSL 679 L +L+++N+P +L L ET I Q F EA+GF GG+W+ KS V++ Sbjct: 20 LRELMRINNPTVLALVETHISGDQAQRICDRIGFSGQTRVEAEGFRGGIWLFWKSEEVTV 79 Query: 680 VCVVVDFQTIT*FLLREGKVDWVLSTVYASP 772 Q +T + R G W+ S +YASP Sbjct: 80 TPYGSHSQHLTVEIRRIGDDPWLFSAIYASP 110 Score = 25.0 bits (53), Expect(3) = 4e-20 Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 2/33 (6%) Frame = +1 Query: 772 SPSFT--KSLWSYVKDMAAAISLPWLFLGDVNQ 864 SP T K LW ++ + + PWL GD N+ Sbjct: 109 SPDSTLRKELWRELEQIKNQYTGPWLLAGDFNE 141 >gb|ABD28627.2| RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula] Length = 1296 Score = 65.1 bits (157), Expect(3) = 4e-15 Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 5/112 (4%) Frame = +3 Query: 903 HWGSATAL*GMINVCRFIDLG-----HTWSNNRMDGA*IMERLDRFWENWLWRNQFSQAC 1067 H A +N C +DL TW N + ++LDR N WR F +A Sbjct: 153 HHNRAATFSNFMNNCNLLDLTTTGGRFTWHKNNNGIRILSKKLDRGMANVDWRLSFPEAF 212 Query: 1068 VQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGWA 1223 V+ L R+HSDH+PLL+ G T+ RPF+ E W H ++ ++R W+ Sbjct: 213 VEVLCRLHSDHNPLLLR-FGGLPLTRGPRPFRFEAAWIDHYDYGNVVKRSWS 263 Score = 34.3 bits (77), Expect(3) = 4e-15 Identities = 20/58 (34%), Positives = 34/58 (58%), Gaps = 2/58 
(3%) Frame = +2 Query: 620 EAQGFAGGLWVL*KSSIVSLVCVVVDFQ--TIT*FLLREGKVDWVLSTVYASPHLHLQ 787 EA G +GG+W+L K S ++ V+DF +IT F++ G + +YASP+ ++ Sbjct: 59 EANGHSGGVWLL-KHSTTNITSTVLDFNQYSIT-FIIGRGAAITTCTCIYASPNYSMR 114 Score = 29.6 bits (65), Expect(3) = 4e-15 Identities = 10/29 (34%), Positives = 19/29 (65%) Frame = +1 Query: 778 SFTKSLWSYVKDMAAAISLPWLFLGDVNQ 864 S +LW+Y+ ++ I+ PW+ +GD N+ Sbjct: 112 SMRPNLWNYLVNINDTITGPWMLIGDFNE 140 >ref|XP_006354967.1| PREDICTED: uncharacterized protein LOC102599840 [Solanum tuberosum] Length = 288 Score = 53.1 bits (126), Expect(3) = 3e-14 Identities = 28/55 (50%), Positives = 34/55 (61%), Gaps = 6/55 (10%) Frame = +3 Query: 954 IDLG-----HTWSN-NRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100 IDLG +TWSN ++ + IMER+DRF N W N F + V HLPR HSDH Sbjct: 201 IDLGFTGQKYTWSNKHKNNNTLIMERIDRFLSNHSWLNLFPDSHVHHLPRTHSDH 255 Score = 41.2 bits (95), Expect(3) = 3e-14 Identities = 27/94 (28%), Positives = 51/94 (54%) Frame = +2 Query: 428 PDSKPMNLIY*NCRGASS*DFSRALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDR 607 P+S M + NCRGA++ F + LI ++P IL L ET++ + ++ + Sbjct: 26 PESPLMKIFLWNCRGANNAKFMNNIRALIDSHNPTILALTETRMEDLDKIL--QALDYTD 83 Query: 608 MICSEAQGFAGGLWVL*KSSIVSLVCVVVDFQTI 709 +I A G++GG+ +L ++S +++ V+ Q I Sbjct: 84 VIQVPAFGYSGGIALLWRNSEINVEPFVITEQEI 117 Score = 32.0 bits (71), Expect(3) = 3e-14 Identities = 13/27 (48%), Positives = 17/27 (62%) Frame = +1 Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867 K LW +K++ A I PWL GD N+V Sbjct: 145 KILWENLKNLTARIKGPWLVCGDFNEV 171 >ref|XP_002446678.1| hypothetical protein SORBIDRAFT_06g020403 [Sorghum bicolor] gi|241937861|gb|EES11006.1| hypothetical protein SORBIDRAFT_06g020403 [Sorghum bicolor] Length = 633 Score = 52.4 bits (124), Expect(3) = 2e-13 Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 9/106 (8%) Frame = +3 Query: 930 GMINVCR-----FIDLGHTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHS 1094 G ++VC+ +I LG T+ G + RLDR + W +F A VQHL V S Sbjct: 496 GAVDVCQLRDIGYIGLGWTFEKKVAGGHYVRVRLDRALASVNWCARFPLAAVQHLTTVKS 555 Query: 1095 
DHHPLLVSCHGPSQCTK----PVRPFKVEMDWFSHPEFPQFIQRGW 1220 DH P+L+S H P + + +PF+ E+ W ++ I++ W Sbjct: 556 DHCPILLS-HVPDERNEGGGCQGKPFRYELMWETNERLSSLIEQIW 600 Score = 38.1 bits (87), Expect(3) = 2e-13 Identities = 23/98 (23%), Positives = 46/98 (46%) Frame = +2 Query: 494 RALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIV 673 + L DL K +P+++ + ET+I +V + FD + G +GGL + + ++ Sbjct: 351 KELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVNSSGRSGGLGLFWNNDVL 410 Query: 674 SLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPHLHLQ 787 + ++ T + GK W +S +Y P+ L+ Sbjct: 411 LSIQKYSNYHIDT-IISEHGKEPWRMSFIYGEPNRSLR 447 Score = 32.7 bits (73), Expect(3) = 2e-13 Identities = 11/27 (40%), Positives = 17/27 (62%) Frame = +1 Query: 796 WSYVKDMAAAISLPWLFLGDVNQVFRR 876 W +K M + LPW+ +GD N++ RR Sbjct: 451 WDIMKQMRSDTDLPWVCMGDFNEILRR 477 >ref|XP_002450418.1| hypothetical protein SORBIDRAFT_05g005061 [Sorghum bicolor] gi|241936261|gb|EES09406.1| hypothetical protein SORBIDRAFT_05g005061 [Sorghum bicolor] Length = 753 Score = 48.9 bits (115), Expect(3) = 6e-13 Identities = 31/104 (29%), Positives = 52/104 (50%), Gaps = 9/104 (8%) Frame = +3 Query: 936 INVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100 +++C+ D+G+ T+ G + RLDR + W +F A VQHL V SDH Sbjct: 165 VDMCQLRDIGYIGLDWTFEKKVAGGHFVRVRLDRALASVNWCARFPLAAVQHLTAVKSDH 224 Query: 1101 HPLLVSCHGPSQCTK----PVRPFKVEMDWFSHPEFPQFIQRGW 1220 P+L+S H P + + +PF+ E+ W ++ I++ W Sbjct: 225 CPILLS-HVPDERNEGGGCQGKPFRYELMWETNERLSSLIEQIW 267 Score = 38.1 bits (87), Expect(3) = 6e-13 Identities = 25/105 (23%), Positives = 48/105 (45%) Frame = +2 Query: 461 NCRGASS*DFSRALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAG 640 NCRG + + L DL K +P+++ + ET+I +V + FD + G +G Sbjct: 7 NCRGIGNPATVKELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVNSSGRSG 66 Query: 641 GLWVL*KSSIVSLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPH 775 GL + + ++ + ++ T + GK +S +Y P+ Sbjct: 67 GLGLFWNNDVLLSIQKYSNYHIDT-IISEHGKEPRRMSFIYGEPN 110 Score = 34.7 bits (78), Expect(3) = 6e-13 Identities = 13/33 (39%), Positives = 19/33 (57%) 
Frame = +1 Query: 778 SFTKSLWSYVKDMAAAISLPWLFLGDVNQVFRR 876 SF W +K M + LPW+ +GD N++ RR Sbjct: 112 SFRYRTWDIMKQMRSDTDLPWVCMGDFNEILRR 144 >gb|EOY08834.1| Uncharacterized protein TCM_024073 [Theobroma cacao] Length = 660 Score = 54.7 bits (130), Expect(3) = 1e-12 Identities = 29/81 (35%), Positives = 39/81 (48%) Frame = +3 Query: 969 TWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKP 1148 TW N + RLDR + N W F +LPR HSDHHP+LV C PS Sbjct: 319 TWWNKKEGLDYTQVRLDRVFVNDRWHVMFPNVVAINLPRTHSDHHPVLVRCSSPSM-LPD 377 Query: 1149 VRPFKVEMDWFSHPEFPQFIQ 1211 + F+ + SHP F +++ Sbjct: 378 LNKFRFKEARTSHPSFDAYLR 398 Score = 38.1 bits (87), Expect(3) = 1e-12 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 9/66 (13%) Frame = +2 Query: 602 DRMICS---------EAQGFAGGLWVL*KSSIVSLVCVVVDFQTIT*FLLREGKVDWVLS 754 D+M C +A G++GG+WV + ++ + + Q +T LL K W+L+ Sbjct: 183 DKMCCKYGFQNYFKVKANGYSGGIWVFWNAEVIEVEVLAYSSQ-LTHLLLNPSKEQWLLT 241 Query: 755 TVYASP 772 +Y SP Sbjct: 242 EIYGSP 247 Score = 27.7 bits (60), Expect(3) = 1e-12 Identities = 10/27 (37%), Positives = 16/27 (59%) Frame = +1 Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867 K LW +K + +PW+ +GD NQ+ Sbjct: 253 KHLWDSLKLASNDQDIPWMVIGDFNQI 279 >gb|EMJ22027.1| hypothetical protein PRUPE_ppb017095mg [Prunus persica] Length = 883 Score = 68.6 bits (166), Expect(2) = 1e-12 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 5/97 (5%) Frame = +3 Query: 954 IDLG-----HTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPLLVS 1118 +DLG +TW N + + ER+DR WR ++ A V+HLPR SDH+PL +S Sbjct: 555 VDLGFSGPKYTWRNTK-----VSERIDRAICTMNWRGLYADAHVRHLPRTTSDHNPLKIS 609 Query: 1119 CHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGWAEV 1229 T +RPF+ E W H +F FI W ++ Sbjct: 610 LQSCFHATPHLRPFRFEAMWLKHEKFGDFINNTWVKL 646 Score = 32.3 bits (72), Expect(2) = 1e-12 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%) Frame = +1 Query: 754 YCIRLTSPSFTK--SLWSYVKDMAAAISLPWLFLGDVNQV 867 + + SP K SLW Y+K + LPWL GD N++ Sbjct: 488 FTVVYASPCIRKRASLWEYLKFVVECHHLPWLLAGDFNEM 527 
>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao] Length = 926 Score = 73.6 bits (179), Expect(2) = 3e-12 Identities = 44/119 (36%), Positives = 59/119 (49%), Gaps = 5/119 (4%) Frame = +3 Query: 879 DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043 ++L+G PH GS L + C +D TW+NNRM +RLDR N W Sbjct: 67 ERLNGAIPHDGSMEDLSSTLFDCGLLDASFEGNSFTWTNNRM-----FQRLDRVVYNQEW 121 Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 FS VQHL R SDH PLL+SC +Q + PF+ W H +F F+++ W Sbjct: 122 AELFSSTRVQHLNRDGSDHCPLLISCSNTNQ--RGPAPFRFLHAWTKHHDFLSFVEKSW 178 Score = 26.2 bits (56), Expect(2) = 3e-12 Identities = 9/27 (33%), Positives = 16/27 (59%) Frame = +1 Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867 + LWS ++ ++ + PWL GD N + Sbjct: 36 RELWSSLRIISDGMQAPWLVGGDFNSI 62 >ref|XP_002452516.1| hypothetical protein SORBIDRAFT_04g027285 [Sorghum bicolor] gi|241932347|gb|EES05492.1| hypothetical protein SORBIDRAFT_04g027285 [Sorghum bicolor] Length = 689 Score = 48.1 bits (113), Expect(3) = 4e-12 Identities = 32/104 (30%), Positives = 52/104 (50%), Gaps = 9/104 (8%) Frame = +3 Query: 936 INVCR-----FIDLGHTWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100 ++VC+ +I LG T+ G + RLDR + W +F A QHL V SDH Sbjct: 498 VDVCQLRDIGYIGLGWTFEKKVAGGHYVRVRLDRALASVNWCARFPLAAGQHLTTVKSDH 557 Query: 1101 HPLLVSCHGPSQCTK----PVRPFKVEMDWFSHPEFPQFIQRGW 1220 P+L+S H P++ + +PF+ E+ W ++ I++ W Sbjct: 558 CPILLS-HVPNERNEGGGCQGKPFRYELMWETNERLSSLIEQIW 600 Score = 38.1 bits (87), Expect(3) = 4e-12 Identities = 23/98 (23%), Positives = 46/98 (46%) Frame = +2 Query: 494 RALFDLIKLNSPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIV 673 + L DL K +P+++ + ET+I +V + FD + G +GGL + + ++ Sbjct: 351 KELRDLAKDYAPSVMFIMETQISKYRVENLRYTLSFDNSFAVNSSGRSGGLGLFWNNDVL 410 Query: 674 SLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPHLHLQ 787 + ++ T + GK W +S +Y P+ L+ Sbjct: 411 LSIQKYSNYHIDT-IISEHGKEPWRMSFIYGEPNRSLR 447 Score = 32.7 bits (73), Expect(3) = 4e-12 Identities = 11/27 (40%), Positives = 17/27 (62%) Frame = 
+1 Query: 796 WSYVKDMAAAISLPWLFLGDVNQVFRR 876 W +K M + LPW+ +GD N++ RR Sbjct: 451 WDIMKQMRSDTDLPWVCMGDFNEILRR 477 >gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao] Length = 1659 Score = 73.9 bits (180), Expect(2) = 5e-12 Identities = 47/120 (39%), Positives = 59/120 (49%), Gaps = 5/120 (4%) Frame = +3 Query: 876 ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040 A++L+G PH GS + C ID G TW+NN M +RLDR N Sbjct: 698 AERLNGAPPHGGSMEDFVATLFDCGLIDAGFEGNSFTWTNNHM-----FQRLDRVVYNPE 752 Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 W + FS VQHL R SDH PLL+SC SQ K F+ W H +F F++R W Sbjct: 753 WAHCFSSTRVQHLNRDGSDHCPLLISCATASQ--KGPSTFRFLHAWTKHHDFLPFVERSW 810 Score = 25.0 bits (53), Expect(2) = 5e-12 Identities = 8/25 (32%), Positives = 16/25 (64%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867 LW+ ++ ++A + PW+ GD N + Sbjct: 670 LWNCLRSLSADMQGPWMVGGDFNTI 694 >gb|EPS63383.1| hypothetical protein M569_11401 [Genlisea aurea] Length = 1469 Score = 55.5 bits (132), Expect(3) = 6e-12 Identities = 35/100 (35%), Positives = 44/100 (44%), Gaps = 7/100 (7%) Frame = +3 Query: 945 CRFIDLGHT-----WSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDHHPL 1109 C+ D+G T W N R + RLDR W N F +A V+HLP SDH PL Sbjct: 536 CQLQDIGFTGFPFTWCNKRKAPDTVRARLDRAVATTTWNNLFPRAIVKHLPYGSSDHLPL 595 Query: 1110 LVSCH--GPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGWA 1223 L+ P+ R FK E W + P I + WA Sbjct: 596 LIFLDPAAPTSIRPNKRRFKFEAFWTTIPGCADVIHQSWA 635 Score = 40.0 bits (92), Expect(3) = 6e-12 Identities = 34/124 (27%), Positives = 60/124 (48%), Gaps = 3/124 (2%) Frame = +2 Query: 425 FPDSKP--MNLIY*NCRGASS*DFSRALFDLIKLNSPAILILAETKIHSSQVAVFPRSTH 598 +P + P M+L+ NCRG S R L D+I ++P+++ L+ETK +S V Sbjct: 361 YPKAPPSAMSLLAWNCRGLRSASTVRRLRDVISSDAPSMIFLSETKCLASHVEWLKECLS 420 Query: 599 FDRMICSEAQGFAGGLWVL*KSSI-VSLVCVVVDFQTIT*FLLREGKVDWVLSTVYASPH 775 + + S A G +GGL + + + VSL+ + + L +W + Y +P Sbjct: 421 YFGVAVS-ATGLSGGLALFWRKDVCVSLLSFCSSYIDVL-VRLTPTLPEWRFTGFYGNPA 478 Query: 776 LHLQ 787 + 
L+ Sbjct: 479 VQLR 482 Score = 22.7 bits (47), Expect(3) = 6e-12 Identities = 8/24 (33%), Positives = 12/24 (50%) Frame = +1 Query: 796 WSYVKDMAAAISLPWLFLGDVNQV 867 W ++ + PWL GD N+V Sbjct: 486 WDLLRQIRHHSICPWLVAGDFNEV 509 >gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 75.9 bits (185), Expect(2) = 8e-12 Identities = 46/131 (35%), Positives = 65/131 (49%), Gaps = 5/131 (3%) Frame = +3 Query: 876 ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040 A++L G HPH GS M+ C +D G+ TW+NN M +RLDR N Sbjct: 996 AERLHGAHPHSGSMEDFATMLLDCGLLDAGYEGNNFTWTNNHM-----FQRLDRVVYNHE 1050 Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 W + F+ +QHL R SDH PLL+SC+ Q + F+ W H +F F++R W Sbjct: 1051 WADCFNNTRIQHLNRDGSDHCPLLISCNNTVQ--RGPSNFRFLHAWTHHHDFIPFVERSW 1108 Query: 1221 AEVWI*TGRVI 1253 TG ++ Sbjct: 1109 RVPMQATGMLV 1119 Score = 22.3 bits (46), Expect(2) = 8e-12 Identities = 7/25 (28%), Positives = 15/25 (60%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867 LW+ ++ ++ + PW+ GD N + Sbjct: 968 LWNCLRSISWDMQGPWMVGGDFNSI 992 >gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao] Length = 1702 Score = 74.3 bits (181), Expect(2) = 9e-12 Identities = 47/120 (39%), Positives = 60/120 (50%), Gaps = 5/120 (4%) Frame = +3 Query: 876 ADKLDGPHPHWGSATAL*GMINVCRFIDLG-----HTWSNNRMDGA*IMERLDRFWENWL 1040 A++L+G PH GS + C ID G +TW+NN M +RLDR N Sbjct: 387 AERLNGASPHEGSMEDFAATLLDCGLIDAGFEGNSYTWTNNHM-----FQRLDRVVYNPE 441 Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 W + FS VQHL R SDH PLL+SC SQ K F+ W H +F F++R W Sbjct: 442 WVHFFSSTRVQHLNRDGSDHCPLLISCATASQ--KGPSTFRFLHAWTKHHDFLPFVERSW 499 Score = 23.9 bits (50), Expect(2) = 9e-12 Identities = 7/25 (28%), Positives = 16/25 (64%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867 LW+ ++ +++ + PW+ GD N + Sbjct: 359 LWNCLRSLSSDMQGPWMVDGDFNTI 383 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 
65.5 bits (158), Expect(2) = 9e-12 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%) Frame = +3 Query: 879 DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043 ++L G PH GS ++ C +D G TW+NNRM +RLDR N W Sbjct: 144 ERLYGSAPHEGSMEDFASVLLDCGLLDGGFEGNPFTWTNNRM-----FQRLDRVVYNHQW 198 Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 N F +QHL R SDH PLL+SC ++ K F+ + W H +F ++ W Sbjct: 199 INMFPITRIQHLNRDGSDHCPLLISCFISNE--KSPSSFRFQHAWVLHHDFKTSVEGNW 255 Score = 32.7 bits (73), Expect(2) = 9e-12 Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 17/56 (30%) Frame = +1 Query: 760 IRLTSPSFTKS-----------------LWSYVKDMAAAISLPWLFLGDVNQVFRR 876 +RLTSP KS LW ++ +AA I +PWL GD N + +R Sbjct: 87 VRLTSPWLEKSFFATFVYAKCTRSERTFLWDCLRRLAADIEVPWLVGGDFNIILKR 142 >gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 67.0 bits (162), Expect(2) = 1e-11 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%) Frame = +3 Query: 879 DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043 ++L G PH G+ + C +D G TW+NNRM +RLDR N W Sbjct: 1199 ERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNPFTWTNNRM-----FQRLDRIVYNHHW 1253 Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 N+F +QHL R SDH PLL+SC S+ K F+ + W H +F ++ W Sbjct: 1254 INKFPITRIQHLNRDGSDHCPLLISCFNSSE--KAPSSFRFQHAWVLHHDFKTSVESNW 1310 Score = 30.8 bits (68), Expect(2) = 1e-11 Identities = 12/28 (42%), Positives = 18/28 (64%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQVFRR 876 LW ++ +AA I +PWL GD N + +R Sbjct: 1170 LWDCLRRLAADIEVPWLVGGDFNIILKR 1197 >gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 73.9 bits (180), Expect(2) = 1e-11 Identities = 47/120 (39%), Positives = 59/120 (49%), Gaps = 5/120 (4%) Frame = +3 Query: 876 ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040 A++L+G PH GS + C ID G TW+NN M +RLDR N Sbjct: 731 
AERLNGAPPHGGSMEDFVATLFDCGLIDAGFEGNSFTWTNNHM-----FQRLDRVVYNPE 785 Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 W + FS VQHL R SDH PLL+SC SQ K F+ W H +F F++R W Sbjct: 786 WAHCFSSTRVQHLNRDGSDHCPLLISCATASQ--KGPSTFRFLHAWTKHHDFLPFVERSW 843 Score = 23.9 bits (50), Expect(2) = 1e-11 Identities = 7/25 (28%), Positives = 16/25 (64%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867 LW+ ++ +++ + PW+ GD N + Sbjct: 703 LWNCLRSLSSDMQGPWMVGGDFNTI 727 >gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 72.4 bits (176), Expect(2) = 1e-11 Identities = 44/119 (36%), Positives = 59/119 (49%), Gaps = 5/119 (4%) Frame = +3 Query: 879 DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043 ++L+G PH GS L + C +D G TW+NNRM +RLDR N W Sbjct: 993 ERLNGAIPHDGSMEDLSSTLFDCGLLDAGFEGNSFTWTNNRM-----FQRLDRVVYNQEW 1047 Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 FS VQHL R SDH PLL+SC +Q + F+ W H +F F+++ W Sbjct: 1048 AEFFSSTRVQHLNRDGSDHCPLLISCSNTNQ--RGPATFRFLHAWTKHHDFISFVEKSW 1104 Score = 25.0 bits (53), Expect(2) = 1e-11 Identities = 8/27 (29%), Positives = 16/27 (59%) Frame = +1 Query: 787 KSLWSYVKDMAAAISLPWLFLGDVNQV 867 + LW+ ++ ++ + PWL GD N + Sbjct: 962 RELWTSLRIISDGMQAPWLVGGDFNSI 988 >gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 74.7 bits (182), Expect(2) = 2e-11 Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 5/131 (3%) Frame = +3 Query: 876 ADKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWL 1040 A++L G HPH GS M+ C +D G+ TW+NN M +RLDR N Sbjct: 996 AERLHGAHPHSGSMEDFATMLLDCGLLDAGYEGNNFTWTNNHM-----FQRLDRVVYNHE 1050 Query: 1041 WRNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 W + F+ +QHL R SDH PLL+SC+ Q + F+ W H +F F+++ W Sbjct: 1051 WADCFNNTRIQHLNRDGSDHCPLLISCNNTVQ--RGPSNFRFLHAWTHHHDFIPFVEKSW 1108 Query: 1221 AEVWI*TGRVI 1253 TG ++ Sbjct: 1109 RVPMQATGMLV 1119 Score = 22.3 bits (46), Expect(2) = 2e-11 
Identities = 7/25 (28%), Positives = 15/25 (60%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQV 867 LW+ ++ ++ + PW+ GD N + Sbjct: 968 LWNCLRSISWDMQGPWMVGGDFNSI 992 >ref|XP_002454313.1| hypothetical protein SORBIDRAFT_04g028482 [Sorghum bicolor] gi|241934144|gb|EES07289.1| hypothetical protein SORBIDRAFT_04g028482 [Sorghum bicolor] Length = 509 Score = 47.0 bits (110), Expect(3) = 2e-11 Identities = 30/104 (28%), Positives = 50/104 (48%), Gaps = 9/104 (8%) Frame = +3 Query: 936 INVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLWRNQFSQACVQHLPRVHSDH 1100 ++VC+ D+G+ T+ G + RLDR + W + F A ++HL V SDH Sbjct: 361 VDVCQLCDIGYMGLDWTFEKKVAGGHFVRVRLDRALASASWSSYFPFAVLRHLTAVKSDH 420 Query: 1101 HPLLVSC----HGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 P+L+S +C + +PF+ E+ W ++ IQ W Sbjct: 421 CPILLSLQLDERSNFECGQG-KPFRYEIMWETNKGLRSLIQHKW 463 Score = 35.0 bits (79), Expect(3) = 2e-11 Identities = 22/96 (22%), Positives = 45/96 (46%) Frame = +2 Query: 524 SPAILILAETKIHSSQVAVFPRSTHFDRMICSEAQGFAGGLWVL*KSSIVSLVCVVVDFQ 703 +PA++ + ET+I+ +V S +D + G +GGL + K+ + + + Sbjct: 224 APAVIFIMETQINKYRVENLRYSLGYDDSFAVNSSGKSGGLGLFWKNDVNVSIKKFSKYH 283 Query: 704 TIT*FLLREGKVDWVLSTVYASPHLHLQNPYGPMLK 811 T + GK W +S +Y P+ L++ ++K Sbjct: 284 IDT-IIEENGKEPWRMSFIYGEPNRSLRHRTWDIMK 318 Score = 34.3 bits (77), Expect(3) = 2e-11 Identities = 12/27 (44%), Positives = 17/27 (62%) Frame = +1 Query: 796 WSYVKDMAAAISLPWLFLGDVNQVFRR 876 W +K M + LPWL +GD N++ RR Sbjct: 314 WDIMKQMRSDFDLPWLCIGDFNEILRR 340 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 67.4 bits (163), Expect(2) = 3e-11 Identities = 41/119 (34%), Positives = 57/119 (47%), Gaps = 5/119 (4%) Frame = +3 Query: 879 DKLDGPHPHWGSATAL*GMINVCRFIDLGH-----TWSNNRMDGA*IMERLDRFWENWLW 1043 ++L G PH G+ + C +D G TW+NNRM +RLDR N W Sbjct: 1027 ERLYGSAPHEGAMEDFASTLLDCGLLDGGFEGNSFTWTNNRM-----FQRLDRIVYNHHW 1081 Query: 1044 RNQFSQACVQHLPRVHSDHHPLLVSCHGPSQCTKPVRPFKVEMDWFSHPEFPQFIQRGW 1220 N+F +QHL R SDH PLL+SC S+ K F+ + W H +F ++ W Sbjct: 1082 
INKFPVTRIQHLNRDGSDHCPLLISCFNSSE--KAPSSFRFQHAWVLHHDFKTSVESNW 1138 Score = 28.9 bits (63), Expect(2) = 3e-11 Identities = 11/28 (39%), Positives = 17/28 (60%) Frame = +1 Query: 793 LWSYVKDMAAAISLPWLFLGDVNQVFRR 876 LW ++ +A I +PWL GD N + +R Sbjct: 998 LWDCLRRLADDIEVPWLVGGDFNVILKR 1025