BLASTX nr result
ID: Forsythia22_contig00031299
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia22_contig00031299 (2827 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628... 194 4e-46 emb|CDP14239.1| unnamed protein product [Coffea canephora] 188 2e-44 ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom... 182 1e-42 ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom... 182 1e-42 ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobrom... 176 7e-41 emb|CDP20930.1| unnamed protein product [Coffea canephora] 176 1e-40 ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172... 175 2e-40 ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobrom... 172 1e-39 ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobrom... 172 1e-39 ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom... 170 5e-39 ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom... 169 9e-39 ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom... 167 6e-38 ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179... 166 1e-37 ref|XP_007040951.1| Uncharacterized protein TCM_016760 [Theobrom... 166 1e-37 ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157... 165 2e-37 ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom... 164 3e-37 ref|XP_007026456.1| Uncharacterized protein TCM_021520 [Theobrom... 105 3e-37 ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobrom... 164 5e-37 ref|XP_012081344.1| PREDICTED: uncharacterized protein LOC105641... 162 1e-36 ref|XP_007046403.1| Uncharacterized protein TCM_011922 [Theobrom... 160 7e-36 >ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628933 [Jatropha curcas] Length = 397 Score = 194 bits (492), Expect = 4e-46 Identities = 102/259 (39%), Positives = 147/259 (56%) Frame = -3 Query: 2657 EDQSALNQGRQTYASAVSGRPTLSQAERNERKSFSAILDAPQASVIHSITHRPPGLHRGE 2478 + S+ Q Q + A + R T + E +R S I P +T + P +G Sbjct: 7 QQASSAKQYSQGISYAAAVRNTKGKTEI-DRSFMSTITHEP------CLTSKQPNRFKGV 59 Query: 2477 PSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLGLIDSKHILI 2298 PS + +E L+ F+F LVG F +P M +R K GF G++ LGL+DS HILI Sbjct: 60 PSISFSWDESMKLANQFRFALVGIFQSGRPNMKSLRQFMDKIGFKGEFSLGLLDSSHILI 119 Query: 2297 HLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLLPVHMRSKGV 2118 E D+ R +L+ ++Y G MR+ KWT +F P+ + I P W+ F LP+H+ +K Sbjct: 120 KFELEEDFHRCWLKQIWYFQGFSMRISKWTRNFRPNTDCSIVPTWILFEGLPIHLFAKAA 179 Query: 2117 IFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWIDRGDGRSFWQTVVYE 1938 +F +A ++G PL++D ATA L RPS ARVCVE++L LP ++WID GD F+Q V YE Sbjct: 180 LFPIANLIGKPLKVDAATATLSRPSVARVCVELDLSKDLPNKVWIDDGD-LGFFQPVNYE 238 Query: 1937 KPPLFCSKCRHMGHSLGQC 1881 PLFC+KC +GH + C Sbjct: 239 SLPLFCTKCCRIGHEILSC 257 >emb|CDP14239.1| unnamed protein product [Coffea canephora] Length = 587 Score = 188 bits (478), Expect = 2e-44 Identities = 84/204 (41%), Positives = 126/204 (61%) Frame = -3 Query: 2489 HRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLGLIDSK 2310 HRGEP+ + + + A ++ PF++TLVGKFS +P + +R +GL+D++ Sbjct: 4 HRGEPAVVFSAADIAVVAAPFRYTLVGKFSKGRPLLPDLRKFLSTLDLKDTATVGLLDAR 63 Query: 2309 HILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLLPVHMR 2130 H+L+ E D+ R++ RSL+Y++G PMRV KWT F + E+ + P+W P LP+H+ Sbjct: 64 HVLLKFQCEADFLRVWGRSLWYVNGSPMRVFKWTSKFHVNRESSLVPIWFRLPKLPIHLF 123 Query: 2129 SKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWIDRGDGRSFWQT 1950 +K +F L +G PL +D AT+ RP+ ARVCVEV+L +P R+W+D GDG FWQ Sbjct: 124 AKPCLFHLVSCLGTPLFVDAATSSFSRPNVARVCVEVDLLKSIPSRVWVDMGDGDGFWQV 183 Query: 1949 VVYEKPPLFCSKCRHMGHSLGQCR 1878 ++ E P +CS C GH QCR Sbjct: 184 LIPENLPNYCSHCYRQGHGEDQCR 207 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 182 bits (463), Expect = 1e-42 Identities = 109/292 (37%), Positives = 157/292 (53%), Gaps = 19/292 (6%) Frame = -3 Query: 2699 PVNPHPDPSSVFPQEDQSALNQGRQTYASAVSGRPTLSQ-------------AERNERKS 2559 P NP P P S F S L Q + +P ++ + R ++KS Sbjct: 9 PSNPPPPPVSSF-----SMLQGTNQNTKDPTNPQPPVNNVGLQATDVQKPPVSPRAQKKS 63 Query: 2558 FSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMA 2379 F ++ + +I T+R P +R P+ +E AL++PFK ++VGKFS R P + Sbjct: 64 FLSVAAGEKPPIIP--TNREPFWYRDRPAVAFFEDEIVALAQPFKHSMVGKFS-RMPKLN 120 Query: 2378 KVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDF 2199 +R F G G Y + +D KHILIHL++E D +RL++R ++I MRV KW+ DF Sbjct: 121 DIRAAFKGIGLVGVYEIRWLDYKHILIHLSNEQDLNRLWMRQAWFIANQKMRVFKWSPDF 180 Query: 2198 GPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEV 2019 P+ E+ + PVW+SFP L H+ K + +AK VG PL +DEATA+ RPS ARVCVE Sbjct: 181 QPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEY 240 Query: 2018 NLEHKLPERIWIDRGDGRS------FWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 + + E+IWI D R+ F Q V + K P +C+ C H+GHS C Sbjct: 241 DCQQPPLEQIWIVSRDRRTGDITGGFQQKVDFAKLPNYCTHCCHVGHSASTC 292 Score = 83.6 bits (205), Expect(2) = 3e-30 Identities = 39/91 (42%), Positives = 57/91 (62%) Frame = -2 Query: 558 DFVRRSVGFDIGVQNQSSKIWCFWNVGTTVSVIVDHPQFLHLMIEDPRLARPIFLSPVYA 379 ++VRR +GF+ + N S KIW F + +++DH Q+LH+ I P L+ PIF S VYA Sbjct: 899 EYVRRRLGFETVISNVSHKIWIFCSEEIGCEILLDHVQYLHVKITVPWLSHPIFSSLVYA 958 Query: 378 SCSVVGRRDLFEGLHHVSQTVDGPWLVGGDF 286 C+ R +L+ L +S + GPW+VGGDF Sbjct: 959 KCTRQERLELWNCLRSISWDMQGPWMVGGDF 989 Score = 79.0 bits (193), Expect(2) = 3e-30 Identities = 33/66 (50%), Positives = 47/66 (71%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF M+++CGL DAG+ G+ +TW N +++RLDRV+ +HE A F ++HLNR SDH Sbjct: 1011 DFATMLLDCGLLDAGYEGNNFTWTNNHMFQRLDRVVYNHEWADCFNNTRIQHLNRDGSDH 1070 Query: 41 SPLLLS 24 PLL+S Sbjct: 1071 CPLLIS 1076 >ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao] gi|508787493|gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 182 bits (462), Expect = 1e-42 Identities = 109/292 (37%), Positives = 156/292 (53%), Gaps = 19/292 (6%) Frame = -3 Query: 2699 PVNPHPDPSSVFPQEDQSALNQGRQTYASAVSGRPTLSQ-------------AERNERKS 2559 P NP P P S F S L Q + +P ++ + R ++KS Sbjct: 9 PSNPPPPPVSSF-----SMLQGTNQNTKDPKNSQPPVNNDGLQAIDFQKTPVSPRAQKKS 63 Query: 2558 FSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMA 2379 F ++ + +I T+R P +R P+ +E AL++PFK ++VGKFS R P + Sbjct: 64 FLSVAAGEKLQIIP--TNREPFWYRDRPAVAFFEDEIVALAQPFKHSMVGKFS-RMPKLN 120 Query: 2378 KVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDF 2199 +R F G Y + +D KHILIHL++E D +RL++R ++I MRV KWT DF Sbjct: 121 DIRAAFKGISLVGVYEIRWLDYKHILIHLSNEQDLNRLWMRQAWFIANQKMRVFKWTPDF 180 Query: 2198 GPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEV 2019 P+ E+ + PVW+SFP L H+ K + +AK VG PL +DEATA+ RPS ARVCVE Sbjct: 181 QPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEY 240 Query: 2018 NLEHKLPERIWIDRGDGRS------FWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 + + E+IWI D R+ F Q V + K P +C+ C H+GHS C Sbjct: 241 DCQQPPLEQIWIVTRDRRTGDITGGFQQKVDFAKLPNYCTHCCHVGHSASTC 292 Score = 167 bits (424), Expect = 3e-38 Identities = 95/274 (34%), Positives = 153/274 (55%), Gaps = 7/274 (2%) Frame = -3 Query: 2681 DPSSVFPQEDQSALNQG-RQTYASAVSGRPTLSQAERNERKSFSAILDAPQASVIHSITH 2505 +P S++ + + L+ G +QT + + P+ R+++KSF +I+ + VI Sbjct: 1652 NPPSIWTKNSRLPLSHGCQQTTPTQIQPPPS----PRSQKKSFLSIVSGDKPPVIP--LS 1705 Query: 2504 RPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLG 2325 R P + + P+ +E L++P K +LVGKFS R P + VR F G G Y + Sbjct: 1706 RDPLVFKDRPAAAFFEDEIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLTGAYEVR 1764 Query: 2324 LIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLL 2145 +D KH+LIHL++E D +R++ + +++I MRV KWT +F P+ E+ + PVW++FP L Sbjct: 1765 WLDYKHVLIHLSNEQDCNRVWTKQVWFIANQKMRVFKWTPEFEPEKESAVVPVWIAFPNL 1824 Query: 2144 PVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWI---DRG 1974 H+ K + +AK VG PL +DEATA+ RPS ARVC+E + +++WI +R Sbjct: 1825 KAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEFDCRRPPIDQVWIVVQNRE 1884 Query: 1973 DG---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 G + Q V + + P +C C H+GH C Sbjct: 1885 TGTVTSGYPQRVEFSQMPAYCDHCCHVGHKENDC 1918 Score = 81.3 bits (199), Expect(2) = 2e-29 Identities = 38/91 (41%), Positives = 56/91 (61%) Frame = -2 Query: 558 DFVRRSVGFDIGVQNQSSKIWCFWNVGTTVSVIVDHPQFLHLMIEDPRLARPIFLSPVYA 379 ++VR +GF+ + N S KIW F + +++DH Q+LH+ I P L+ PIF S VYA Sbjct: 899 EYVRMRLGFETVISNVSHKIWIFCSEEIGCEILLDHVQYLHVKITVPWLSHPIFSSLVYA 958 Query: 378 SCSVVGRRDLFEGLHHVSQTVDGPWLVGGDF 286 C+ R +L+ L +S + GPW+VGGDF Sbjct: 959 KCTRQERLELWNCLRSISWDMQGPWMVGGDF 989 Score = 79.0 bits (193), Expect(2) = 2e-29 Identities = 33/66 (50%), Positives = 47/66 (71%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF M+++CGL DAG+ G+ +TW N +++RLDRV+ +HE A F ++HLNR SDH Sbjct: 1011 DFATMLLDCGLLDAGYEGNNFTWTNNHMFQRLDRVVYNHEWADCFNNTRIQHLNRDGSDH 1070 Query: 41 SPLLLS 24 PLL+S Sbjct: 1071 CPLLIS 1076 >ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobroma cacao] gi|508781820|gb|EOY29076.1| Uncharacterized protein TCM_030494 [Theobroma cacao] Length = 876 Score = 176 bits (447), Expect = 7e-41 Identities = 100/281 (35%), Positives = 158/281 (56%), Gaps = 8/281 (2%) Frame = -3 Query: 2699 PVN--PHPDPSSVFPQEDQSALNQGRQTYASAVSGRPTLSQAERNERKSFSAILDAPQAS 2526 PVN HP P + ++QG Q +P + R +KSF ++++A + + Sbjct: 55 PVNWTKHPPPPTT------EGISQGFQV-------QPQPPASPRTAKKSFLSVVNAVKLA 101 Query: 2525 VIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGF 2346 ++ RP ++ +P+ +E AL++PFKF +VGKFS + P + ++R F+ G Sbjct: 102 LVPPT--RPTFRYKDKPAVRFFEDEIEALAQPFKFAIVGKFS-KMPRLTEIRQSFVSLGL 158 Query: 2345 CGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPV 2166 G Y + ++ KHILIHL++E D++R++ + ++I MRV KWT DF D E+PI PV Sbjct: 159 SGVYNIRWMNYKHILIHLSNEQDFNRIWTKQTWFITNQKMRVFKWTPDFETDKESPIVPV 218 Query: 2165 WLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIW 1986 W+SFP L H+ K + +AK +G PL IDEATA+ RPS ARVC+E + + +W Sbjct: 219 WISFPNLKAHLFEKSALLMIAKAIGNPLYIDEATANGTRPSVARVCIEYDCLKPPVDSVW 278 Query: 1985 I---DRGD---GRSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 I RG + Q V + P +C+ C H+GH++ +C Sbjct: 279 IVVSKRGSEDMSGGYLQKVEFAPMPEYCNHCCHVGHNVSKC 319 >emb|CDP20930.1| unnamed protein product [Coffea canephora] Length = 497 Score = 176 bits (445), Expect = 1e-40 Identities = 85/231 (36%), Positives = 135/231 (58%) Frame = -3 Query: 2570 ERKSFSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRK 2391 ++KSFS + P S IH + +++GE + + + + L+ PF++ LVGKFSH + Sbjct: 17 KKKSFSQLFSQPATSPIHI---QQASVYKGEAAVVFSKADADKLAAPFQWALVGKFSHGR 73 Query: 2390 PTMAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKW 2211 P++ +R F +GL+D +H+LI E D++R+++R ++ + PMRV +W Sbjct: 74 PSLEDIRKFFASLNLKDHVSIGLMDYRHVLIKCMAEADFNRIWMRGIWQLGKYPMRVFRW 133 Query: 2210 TCDFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARV 2031 T +F E+ +APVW+ P LP+H K +F++ VG PL +D ATA RPS ARV Sbjct: 134 TREFHVLRESSLAPVWVVLPALPIHYFDKHSLFSILSPVGRPLFLDSATAAGTRPSLARV 193 Query: 2030 CVEVNLEHKLPERIWIDRGDGRSFWQTVVYEKPPLFCSKCRHMGHSLGQCR 1878 CVE+++ +R+W+ FWQ +V E PL+CS C +GHS QC+ Sbjct: 194 CVELDVAKSFTQRVWVAVEGESGFWQRIVPENMPLYCSSCSRLGHSQEQCK 244 >ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172985 [Sesamum indicum] Length = 470 Score = 175 bits (444), Expect = 2e-40 Identities = 83/192 (43%), Positives = 122/192 (63%) Frame = -3 Query: 2456 EEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHEND 2277 +E + LS PF++ LVGKFSH P+M +R L GF GD+ +G I+ +H+ I E D Sbjct: 18 DEISRLSLPFRYALVGKFSHGYPSMQNLRRWMLAQGFRGDFSVGAINVRHVFIKFALEED 77 Query: 2276 YSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKI 2097 Y++L+++S ++++G PMRV KWT F P E+PI PVW+ P LP+ + +F++A + Sbjct: 78 YTKLWIKSTWFVEGFPMRVFKWTPTFNPREESPIVPVWVRLPELPIQFFDREALFSIAHL 137 Query: 2096 VGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWIDRGDGRSFWQTVVYEKPPLFCS 1917 +G PLR D +TA L+RPS ARVCVE+NL L I + G Q V+YE+ P +C Sbjct: 138 LGTPLRTDVSTATLVRPSVARVCVEINLLEPLQTEIGLGIGT-EVIIQPVIYERLPKYCG 196 Query: 1916 KCRHMGHSLGQC 1881 C+H+GH +C Sbjct: 197 ACKHLGHDEDEC 208 >ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobroma cacao] gi|508727304|gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao] Length = 830 Score = 172 bits (437), Expect = 1e-39 Identities = 102/283 (36%), Positives = 157/283 (55%), Gaps = 15/283 (5%) Frame = -3 Query: 2684 PDPSSVFPQ-EDQSALNQGRQTYASAV-SGRPTLSQAE-------RNERKSFSAILDAPQ 2532 PDP P S L G A A + +P+LS R ++KSF A+ + Sbjct: 7 PDPLPTLPPVATPSMLQSGATPNALATENSKPSLSHGHTQAPVSPRTQKKSFLAVAAGEK 66 Query: 2531 ASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKF 2352 +S+I R P ++ P+ +E + L++PFKF++VGKFS R M ++R F Sbjct: 67 SSLIP--LDREPFWYKDRPAASFFDDEISTLAQPFKFSMVGKFS-RMLRMQEIRVAFKGI 123 Query: 2351 GFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIA 2172 G G Y + +D KHILI L++E+D +R++L+ +++I MRV KW+ +F P+ E+ + Sbjct: 124 GLIGAYEIRWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMRVFKWSPEFQPEKESSMV 183 Query: 2171 PVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPER 1992 PVW+SFP L H+ K + A+ K VG PL +DEATA+ RPS ARVCVE + + ++ Sbjct: 184 PVWISFPNLKAHLYEKSALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQ 243 Query: 1991 IWI---DRGDGR---SFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 +WI +R G + Q V + + FC+ C H+GH + C Sbjct: 244 VWIVTRNRQSGSVMGGYMQKVEFARLSEFCTHCSHVGHGVSSC 286 >ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobroma cacao] gi|508710348|gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao] Length = 1296 Score = 172 bits (437), Expect = 1e-39 Identities = 101/277 (36%), Positives = 153/277 (55%), Gaps = 6/277 (2%) Frame = -3 Query: 2693 NPHPDPSSVFPQEDQSALNQGRQTYASAVSGRPTLSQAERNERKSFSAILDAPQASVIHS 2514 NP P + FPQ A+ +S P +S R ++KSF +++ VI Sbjct: 34 NPPPPQNQDFPQ-------------ATNLSNYPPISP--RMQKKSFLSVVAGENPPVIP- 77 Query: 2513 ITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDY 2334 +R P +R P+ E A L+ FKF+++GKF+ R P + ++R F G G Y Sbjct: 78 -LNREPSWYRDRPAASFFDNEIATLALSFKFSMIGKFT-RMPKLQEIRTAFKGIGLVGAY 135 Query: 2333 RLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSF 2154 + +D KHILIHL++E+D +R++++ ++I MRV KWT +F P+ E+ + PVW+SF Sbjct: 136 NIRWLDYKHILIHLSNEHDLNRIWMKQNWFIVNKKMRVFKWTPEFHPEKESSLVPVWISF 195 Query: 2153 PLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWI--- 1983 P L H K + +AK VG PL +DEATA+ RP+ AR+CVE + + L ++IWI Sbjct: 196 PNLRAHFYEKSTLMMIAKSVGRPLFVDEATANGTRPNVARICVEYDCQKSLLDQIWIVTR 255 Query: 1982 DRGDGR---SFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 R G F Q V + K P +C+ C H+GH+ C Sbjct: 256 SRQTGEVTGGFIQKVEFVKMPDYCTHCCHVGHNASAC 292 Score = 76.6 bits (187), Expect(2) = 4e-14 Identities = 32/66 (48%), Positives = 46/66 (69%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF M+++CGL DA + G+ +TW N +++RLDRV+ +HE A F ++HLNR SDH Sbjct: 792 DFATMLLDCGLLDASYEGNNFTWTNNHMFQRLDRVVYNHEWADCFHHTRIQHLNRDGSDH 851 Query: 41 SPLLLS 24 PLL+S Sbjct: 852 CPLLIS 857 Score = 31.6 bits (70), Expect(2) = 4e-14 Identities = 14/32 (43%), Positives = 20/32 (62%) Frame = -2 Query: 381 ASCSVVGRRDLFEGLHHVSQTVDGPWLVGGDF 286 AS + R +L+ L +S + GPW+VGGDF Sbjct: 739 ASKPMEERLELWNCLRSISWDMQGPWMVGGDF 770 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 170 bits (431), Expect = 5e-39 Identities = 92/238 (38%), Positives = 138/238 (57%), Gaps = 6/238 (2%) Frame = -3 Query: 2576 RNERKSFSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSH 2397 R ++KSF +I+ + SV+ R P +++ P+ +E L++PFK +LVGKFS Sbjct: 81 RFQKKSFLSIVSGEKPSVVPLT--RDPFVYKDRPAAAFFEDEIHILAQPFKLSLVGKFS- 137 Query: 2396 RKPTMAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVL 2217 R P + +VR F G G Y + +D KHILIHL++E D++R + + ++I MRV Sbjct: 138 RMPKLQEVRSAFKGIGLAGSYEIRWLDYKHILIHLSNEQDFNRFWTKQAWFIANQKMRVF 197 Query: 2216 KWTCDFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEA 2037 KWT +F P+ E+ + PVW+SFP L H+ K + +AK VG PL IDEATA+ RPS A Sbjct: 198 KWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFIDEATANGSRPSVA 257 Query: 2036 RVCVEVNLEHKLPERIWI---DRGDG---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 RVC+E + +++WI +R G + Q V + + P +C C H+GH C Sbjct: 258 RVCIEYDCREPPVDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYCDHCCHVGHKEINC 315 Score = 73.9 bits (180), Expect(2) = 2e-11 Identities = 33/66 (50%), Positives = 44/66 (66%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF + +CGL DAGF G+ +TW N +++RLDRV+ + E A F V+HLNR SDH Sbjct: 920 DFASTLFDCGLLDAGFEGNSFTWTNNHMFQRLDRVVYNPEWAQCFSSTRVQHLNRDGSDH 979 Query: 41 SPLLLS 24 PLL+S Sbjct: 980 CPLLIS 985 Score = 25.4 bits (54), Expect(2) = 2e-11 Identities = 8/9 (88%), Positives = 9/9 (100%) Frame = -2 Query: 312 GPWLVGGDF 286 GPW+VGGDF Sbjct: 890 GPWMVGGDF 898 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 169 bits (429), Expect = 9e-39 Identities = 103/293 (35%), Positives = 157/293 (53%), Gaps = 20/293 (6%) Frame = -3 Query: 2699 PVNPHPDPSSVFPQEDQSALNQGRQTY---ASAVSGRPTLSQ-----------AERNERK 2562 P PDP+ +P Q L Q T+ ++A +P Q + R+++K Sbjct: 1700 PSGRPPDPNQAWPATHQ--LQQSTATHQQPSTAPLPQPHSCQQVNGSQIQRPSSPRSQKK 1757 Query: 2561 SFSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTM 2382 SF +I+ + SV+ R P + + P+ +E L++PFK +LVGKFS R P + Sbjct: 1758 SFLSIITGEKPSVVPLT--RDPFVFKDRPAAAFFEDEIQTLAKPFKLSLVGKFS-RMPKL 1814 Query: 2381 AKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCD 2202 VR F G G Y + +D KH+LIHL++E D++R++ + ++I MRV KWT + Sbjct: 1815 QDVRAAFKGIGLAGAYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKMRVFKWTPE 1874 Query: 2201 FGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVE 2022 F P+ E+ + PVW+SFP L H+ K + +AK VG PL +DEATA+ RPS ARVCVE Sbjct: 1875 FEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCVE 1934 Query: 2021 VNLEHKLPERIWI---DRGDG---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 + +++WI +R G + Q V + + P +C C H+GH C Sbjct: 1935 FDCRQPPLDQVWIVVQNRKTGEITNGYSQRVEFAQMPAYCDHCCHVGHKETDC 1987 Score = 150 bits (379), Expect = 6e-33 Identities = 77/176 (43%), Positives = 105/176 (59%), Gaps = 6/176 (3%) Frame = -3 Query: 2390 PTMAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKW 2211 P M ++R F G G Y + +D KHILIHL++E D++R++ + ++I MRV KW Sbjct: 2 PKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFKW 61 Query: 2210 TCDFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARV 2031 + DF + E+PI PVW+SFP L H+ K + +AK VG PL IDEAT++ RPS ARV Sbjct: 62 SPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVARV 121 Query: 2030 CVEVNLEHKLPERIWI---DRGDGR---SFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 CVE N + E IWI DR G + Q V + K P +C C H+GHS+ C Sbjct: 122 CVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTC 177 Score = 77.0 bits (188), Expect(2) = 7e-23 Identities = 32/66 (48%), Positives = 47/66 (71%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF +++CGL D GF G+P+TW N R+++RLDR++ +H+ + F ++HLNR SDH Sbjct: 2632 DFASALLDCGLLDGGFEGNPFTWTNNRMFQRLDRMVFNHQWINKFPITRIQHLNRDGSDH 2691 Query: 41 SPLLLS 24 PLLLS Sbjct: 2692 CPLLLS 2697 Score = 60.8 bits (146), Expect(2) = 7e-23 Identities = 28/60 (46%), Positives = 40/60 (66%) Frame = -2 Query: 465 VIVDHPQFLHLMIEDPRLARPIFLSPVYASCSVVGRRDLFEGLHHVSQTVDGPWLVGGDF 286 V++DHPQ LH+ + P L PIF + VYA C+ R L++ L ++ ++GPWLVGGDF Sbjct: 2551 VLLDHPQCLHVRLTIPWLDFPIFTTFVYAKCTRSERTPLWDSLRGLAADMEGPWLVGGDF 2610 Score = 79.0 bits (193), Expect(2) = 4e-19 Identities = 34/66 (51%), Positives = 47/66 (71%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF M+++CGL DAG+ G+ +TW N +++RLDRV+ +HE A F V+HLNR SDH Sbjct: 850 DFATMLLDCGLHDAGYEGNNFTWTNNHMFQRLDRVVYNHEWADCFNHTRVQHLNRDGSDH 909 Query: 41 SPLLLS 24 PLL+S Sbjct: 910 CPLLIS 915 Score = 46.2 bits (108), Expect(2) = 4e-19 Identities = 23/45 (51%), Positives = 29/45 (64%) Frame = -2 Query: 420 PRLARPIFLSPVYASCSVVGRRDLFEGLHHVSQTVDGPWLVGGDF 286 P L+ PIF S VYA C+ R +L+ L VS + GPW+VGGDF Sbjct: 784 PWLSHPIFSSFVYAKCTRQERIELWNFLRSVSWDMYGPWMVGGDF 828 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 167 bits (422), Expect = 6e-38 Identities = 96/273 (35%), Positives = 151/273 (55%), Gaps = 6/273 (2%) Frame = -3 Query: 2681 DPSSVFPQEDQSALNQGRQTYASAVSGRPTLSQAERNERKSFSAILDAPQASVIHSITHR 2502 +P +++ + Q + G Q A PT + R+++KSF +I+ + V+ R Sbjct: 55 NPPTIWTKNPQLPPSHGCQQAAPTQFQPPT---SPRSQKKSFLSIVSGQKPPVVP--LSR 109 Query: 2501 PPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLGL 2322 P + + P+ +E L++P K +LVGKFS R P + VR F G G Y + Sbjct: 110 DPFVFKDRPAAAFYEDEIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLAGAYEVRW 168 Query: 2321 IDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLLP 2142 +D KHILIHLT+E+D +R++ + +++I MRV KWT +F P+ E+ + PVW++FP L Sbjct: 169 LDYKHILIHLTNEHDCNRVWTKQVWFIANQKMRVFKWTPEFEPEKESAMVPVWIAFPNLK 228 Query: 2141 VHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWI---DRGD 1971 H+ K + +AK VG PL +DEATA+ RPS ARVC+E + +++WI +R Sbjct: 229 AHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCIEYDCRKPPIDQVWIVVQNRET 288 Query: 1970 G---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 G + Q V + + P +C C H+GH C Sbjct: 289 GTVTSGYPQKVEFSQMPAYCDHCCHVGHKEIDC 321 Score = 75.9 bits (185), Expect(2) = 6e-22 Identities = 31/66 (46%), Positives = 46/66 (69%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF +++CGL D GF G+P+TW N R+++RLDR++ +H + F ++HLNR SDH Sbjct: 1213 DFASTLLDCGLLDGGFEGNPFTWTNNRMFQRLDRIVYNHHWINKFPITRIQHLNRDGSDH 1272 Query: 41 SPLLLS 24 PLL+S Sbjct: 1273 CPLLIS 1278 Score = 58.9 bits (141), Expect(2) = 6e-22 Identities = 28/60 (46%), Positives = 39/60 (65%) Frame = -2 Query: 465 VIVDHPQFLHLMIEDPRLARPIFLSPVYASCSVVGRRDLFEGLHHVSQTVDGPWLVGGDF 286 VI DHPQ LH+ + P L PIF++ VYA C+ R L++ L ++ ++ PWLVGGDF Sbjct: 1132 VIFDHPQCLHVRLTSPWLEFPIFVTFVYAKCTRSERTLLWDCLRRLAADIEVPWLVGGDF 1191 >ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179909 [Sesamum indicum] Length = 733 Score = 166 bits (420), Expect = 1e-37 Identities = 84/214 (39%), Positives = 129/214 (60%), Gaps = 5/214 (2%) Frame = -3 Query: 2507 HRPPGL-----HRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFC 2343 H PP L ++G P+ T E L+ PF+F+LVGKFSH P +++ + G Sbjct: 67 HNPPPLGIKSVNQGRPTISFTNTETEELAAPFRFSLVGKFSHGAPPYSQMHQLIARLGIQ 126 Query: 2342 GDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVW 2163 G + + +I+SKH LI L+ E+DYSRL+LR ++++ G PMR+ KWT F P E+ + P++ Sbjct: 127 GAFTVSMINSKHTLISLSCESDYSRLWLRRIWFLQGFPMRIFKWTPTFTPTQESSVVPIF 186 Query: 2162 LSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWI 1983 + FP LP H+ K +F++A +VG PL+ID T + + S+ARVCVE++L + E + Sbjct: 187 VCFPKLPAHLFHKEALFSVASMVGSPLQIDALTLNKSKLSQARVCVEIDLLKPIIEEFDL 246 Query: 1982 DRGDGRSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 D + Q VV+E P +C C+H+GH C Sbjct: 247 HIND-VTIVQKVVFEYLPKYCFLCKHVGHKDSDC 279 Score = 85.5 bits (210), Expect(2) = 7e-15 Identities = 40/72 (55%), Positives = 47/72 (65%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 DF DM+M+ GLTDAGF G P+TW N+RVW+RLD VL S E A V HL R SDH Sbjct: 555 DFNDMVMDSGLTDAGFEGEPFTWTNKRVWRRLDGVLYSQEWADLLNSTRVSHLPRRLSDH 614 Query: 41 SPLLLSWVEAED 6 PLL++ ED Sbjct: 615 HPLLITATRTED 626 Score = 25.4 bits (54), Expect(2) = 7e-15 Identities = 13/25 (52%), Positives = 16/25 (64%) Frame = -2 Query: 360 RRDLFEGLHHVSQTVDGPWLVGGDF 286 RR L+E L +S PW+VGGDF Sbjct: 510 RRALWEELKRLSLN-KVPWIVGGDF 533 >ref|XP_007040951.1| Uncharacterized protein TCM_016760 [Theobroma cacao] gi|508778196|gb|EOY25452.1| Uncharacterized protein TCM_016760 [Theobroma cacao] Length = 1109 Score = 166 bits (419), Expect = 1e-37 Identities = 97/272 (35%), Positives = 146/272 (53%), Gaps = 6/272 (2%) Frame = -3 Query: 2678 PSSVFPQEDQSALNQGRQTYASAVSGRPTLSQAERNERKSFSAILDAPQASVIHSITHRP 2499 PS P +Q +Q Q + +P + R +KSF + + VI R Sbjct: 25 PSHAPPNTNQPPPHQNLQETTPTRNPQPP---SPRALKKSFLTVAVGERPPVIPP--SRD 79 Query: 2498 PGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLGLI 2319 P +++ P+ I +E L+ PF +LVGKFS R P + ++R F G G Y + + Sbjct: 80 PSVYKDRPAAIFYEDEIQTLARPFSHSLVGKFS-RMPKLQEIRHAFKGIGLSGAYEIRWM 138 Query: 2318 DSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLLPV 2139 D KH+LIHL++E D++R++++ ++I MRV KW DF + E+ + PVW+SFP L Sbjct: 139 DYKHVLIHLSNEQDFNRVWVKQQWFIVNQKMRVFKWAPDFEAEKESAMVPVWISFPNLKA 198 Query: 2138 HMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWI---DRGDG 1968 H+ K + +AK VG PL +DEATA+ RPS ARVCVE + + E IWI +R G Sbjct: 199 HLYEKSALLLIAKTVGKPLYVDEATANGSRPSVARVCVEYDCRKQPVEEIWIVIRNRETG 258 Query: 1967 R---SFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 + Q V + + P +C C H+GH +C Sbjct: 259 AVTGGYSQRVEFARMPDYCGYCSHVGHKENEC 290 >ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157045 [Sesamum indicum] Length = 507 Score = 165 bits (418), Expect = 2e-37 Identities = 82/199 (41%), Positives = 123/199 (61%) Frame = -3 Query: 2477 PSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKFGFCGDYRLGLIDSKHILI 2298 P+ + T +E L+ PFKF LVGKFSH P+ + + G + + +++++H+LI Sbjct: 112 PTLLFTDDETEVLAAPFKFALVGKFSHGAPSYSILHKLIAGTGIKNKFTVSMLNTRHVLI 171 Query: 2297 HLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIAPVWLSFPLLPVHMRSKGV 2118 L+ E D+SRL+LR ++YI G PMRV KWT F P E+ I PVW+SFP LP H+ K V Sbjct: 172 SLSCEADFSRLWLRRIWYIQGYPMRVFKWTPAFTPSKESSIVPVWVSFPELPAHLFRKEV 231 Query: 2117 IFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPERIWIDRGDGRSFWQTVVYE 1938 +F +A ++G PL+ID+AT + + S+AR C+E++L E I + G + Q + YE Sbjct: 232 LFTVASMIGTPLQIDDATLNQSKLSKARACIELDLLKPRLENFQI-QICGTTIVQRIEYE 290 Query: 1937 KPPLFCSKCRHMGHSLGQC 1881 P +CS C+H+GH C Sbjct: 291 DIPHYCSLCKHVGHQDSDC 309 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 164 bits (416), Expect = 3e-37 Identities = 87/238 (36%), Positives = 137/238 (57%), Gaps = 6/238 (2%) Frame = -3 Query: 2576 RNERKSFSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSH 2397 R ++KSF +I + VI +R P +++ P+ + +E L++PF LVGKF+ Sbjct: 52 RFQKKSFLSIAAGSKPPVIP--LNRDPAVYKDRPAAVFYEDEICILAKPFSLCLVGKFT- 108 Query: 2396 RKPTMAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVL 2217 R P + +VR F G G Y + +D KH+LIHL+++ D++R++ R ++I G MR+ Sbjct: 109 RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIWTRQQWFIVGQKMRIF 168 Query: 2216 KWTCDFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEA 2037 KW+ +F + E+P+ PVW+SFP L H+ K + +AK +G PL +DEATA RPS A Sbjct: 169 KWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPLFVDEATAKGSRPSVA 228 Query: 2036 RVCVEVNLEHKLPERIWI---DRGDG---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 RVCVE + +++WI R G + Q V + + P +C C H+GH+ C Sbjct: 229 RVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETTC 286 Score = 77.0 bits (188), Expect(2) = 2e-21 Identities = 34/66 (51%), Positives = 45/66 (68%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 D + +CGL DAGF G+ +TW N R+++RLDRV+ + E A FF V+HLNR SDH Sbjct: 1007 DLSSTLFDCGLLDAGFEGNSFTWTNNRMFQRLDRVVYNQEWAEFFSSTRVQHLNRDGSDH 1066 Query: 41 SPLLLS 24 PLL+S Sbjct: 1067 CPLLIS 1072 Score = 56.2 bits (134), Expect(2) = 2e-21 Identities = 29/75 (38%), Positives = 40/75 (53%) Frame = -2 Query: 510 SSKIWCFWNVGTTVSVIVDHPQFLHLMIEDPRLARPIFLSPVYASCSVVGRRDLFEGLHH 331 SS W N + + Q LH+ + P L P+F S VYA C+ + RR+L+ L Sbjct: 916 SSNNWNSLNASEPIEI-----QCLHVKLSLPWLPHPVFTSFVYAKCTRIERRELWTSLRI 970 Query: 330 VSQTVDGPWLVGGDF 286 +S + PWLVGGDF Sbjct: 971 ISDGMQAPWLVGGDF 985 >ref|XP_007026456.1| Uncharacterized protein TCM_021520 [Theobroma cacao] gi|508715061|gb|EOY06958.1| Uncharacterized protein TCM_021520 [Theobroma cacao] Length = 754 Score = 105 bits (261), Expect(2) = 3e-37 Identities = 48/142 (33%), Positives = 82/142 (57%) Frame = -2 Query: 711 FPLSVFVMSVLIWNVRGVRASKTRKRLRKLVQLHRIXXXXXXXXXXXEDSFDFVRRSVGF 532 F ++ +++ L+WNVRG+ + ++RL+KL +H++ ++++R +GF Sbjct: 195 FHPNLSMINCLLWNVRGIAGTAVQRRLKKLKLMHKVKLLVVLEPMVNTSRINYIKRRLGF 254 Query: 531 DIGVQNQSSKIWCFWNVGTTVSVIVDHPQFLHLMIEDPRLARPIFLSPVYASCSVVGRRD 352 D + N S KIW F + V++D Q LH+ + P L P++ S VYA C+ + RR+ Sbjct: 255 DNALSNCSHKIWLFCSNEICCEVVLDQIQCLHVKLSSPWLPHPVYTSFVYAKCTRLERRE 314 Query: 351 LFEGLHHVSQTVDGPWLVGGDF 286 L+ L +S ++ PWLVGGDF Sbjct: 315 LWSNLRIISDSMQAPWLVGGDF 336 Score = 80.9 bits (198), Expect(2) = 3e-37 Identities = 35/66 (53%), Positives = 47/66 (71%) Frame = -3 Query: 221 DFCDMMMECGLTDAGFSGSPYTWHNRRVWKRLDRVLMSHEAASFFRQFTVRHLNRSTSDH 42 D +++CGL DAGF G+ +TW N R+++RLDRV+ +HE A FF V+HLNR SDH Sbjct: 358 DLSSTLLDCGLLDAGFEGNSFTWTNNRMFQRLDRVVYNHEWAEFFSSTRVQHLNRDGSDH 417 Query: 41 SPLLLS 24 PLL+S Sbjct: 418 CPLLIS 423 >ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobroma cacao] gi|508715060|gb|EOY06957.1| Uncharacterized protein TCM_021519 [Theobroma cacao] Length = 667 Score = 164 bits (414), Expect = 5e-37 Identities = 96/283 (33%), Positives = 152/283 (53%), Gaps = 10/283 (3%) Frame = -3 Query: 2699 PVNPHPDPSSV----FPQEDQSALNQGRQTYASAVSGRPTLSQAERNERKSFSAILDAPQ 2532 P++P P+ SS P A G + + PT + R ++KSF +I + Sbjct: 10 PIHPLPESSSPPMMSTPTPSFMADKNGGLQASDNHTQPPT---SPRFQKKSFLSIAAGSK 66 Query: 2531 ASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSHRKPTMAKVRDCFLKF 2352 VI +R P +++ P+ + +E L++PF LVGKF+ R P + +VR F Sbjct: 67 PPVIP--LNRDPAVYKDRPAAVFYEDEICILAKPFSLCLVGKFT-RMPKLQEVRSAFKGI 123 Query: 2351 GFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTCDFGPDCETPIA 2172 G G Y + +D KH+LIHL+++ D++R++ R ++I G MR+ KW+ +F + E+P+ Sbjct: 124 GLSGAYEIKWLDYKHVLIHLSNDQDFNRIWTRQQWFIVGQKMRIFKWSPEFEAEKESPVV 183 Query: 2171 PVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCVEVNLEHKLPER 1992 PVW+SFP L H+ K + +AK +G PL +DE TA RPS ARVCVE + ++ Sbjct: 184 PVWISFPNLKAHLYEKSALLLIAKTIGKPLFVDEPTAKGSRPSVARVCVEYDCREPPIDQ 243 Query: 1991 IWI---DRGDG---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 +WI R G + Q V + + P +C C H+GH+ C Sbjct: 244 VWIVTQKRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETTC 286 >ref|XP_012081344.1| PREDICTED: uncharacterized protein LOC105641421 [Jatropha curcas] Length = 223 Score = 162 bits (411), Expect = 1e-36 Identities = 77/168 (45%), Positives = 106/168 (63%) Frame = -3 Query: 2384 MAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVLKWTC 2205 M +R K GF GD+ LGL+DS HILI+ + D+ R +L+ ++Y G MRV KW Sbjct: 1 MKALRQFMDKIGFKGDFSLGLLDSSHILINFDLDEDFHRCWLKQIWYFQGFLMRVSKWIR 60 Query: 2204 DFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEARVCV 2025 +F P+ + I P W+ F LP+H+ +K +F +A ++G PL+ID AT L RPS ARVCV Sbjct: 61 NFRPNTDCSIVPTWILFEGLPIHLFAKAALFPIANLIGKPLKIDSATTTLSRPSVARVCV 120 Query: 2024 EVNLEHKLPERIWIDRGDGRSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 E++L LP ++WID GD F+Q V YE PLFC KC +GH + C Sbjct: 121 ELDLSKDLPNKVWIDDGD-LGFFQPVNYESLPLFCPKCCRLGHEIPSC 167 >ref|XP_007046403.1| Uncharacterized protein TCM_011922 [Theobroma cacao] gi|508710338|gb|EOY02235.1| Uncharacterized protein TCM_011922 [Theobroma cacao] Length = 928 Score = 160 bits (404), Expect = 7e-36 Identities = 85/238 (35%), Positives = 135/238 (56%), Gaps = 6/238 (2%) Frame = -3 Query: 2576 RNERKSFSAILDAPQASVIHSITHRPPGLHRGEPSFILTVEEEAALSEPFKFTLVGKFSH 2397 R ++KSF +I + VI +R P +++ P+ + +E L++PF LVGKF+ Sbjct: 52 RFQKKSFLSITAGSKPPVIP--LNRNPVVYKDRPAAVFYEDEICILAKPFSLCLVGKFT- 108 Query: 2396 RKPTMAKVRDCFLKFGFCGDYRLGLIDSKHILIHLTHENDYSRLFLRSLYYIDGCPMRVL 2217 R P + +VR F G G Y + +D KH++IHL+++ D++R++ R ++I G MR+ Sbjct: 109 RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVIIHLSNDQDFNRIWTRQQWFIVGQKMRIF 168 Query: 2216 KWTCDFGPDCETPIAPVWLSFPLLPVHMRSKGVIFALAKIVGIPLRIDEATADLLRPSEA 2037 KW+ +F + E+P+ PVW+SFP L H+ K + +AK +G PL +DEATA RPS A Sbjct: 169 KWSPEFEAEKESPVVPVWISFPNLKAHLYEKFALLLIAKTIGRPLFVDEATAKGSRPSVA 228 Query: 2036 RVCVEVNLEHKLPERIWI---DRGDG---RSFWQTVVYEKPPLFCSKCRHMGHSLGQC 1881 RVC E + ++WI R G + Q V + + P +C C H+GH+ C Sbjct: 229 RVCAEYDCRKPPINQVWIVTQKRETGTVTNGYAQKVEFSQMPAYCDHCCHVGHNETNC 286