BLASTX nr result
ID: Rehmannia28_contig00009455
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00009455
         (1268 letters)

Database: ./nr
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179...  187   2e-49
ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157...  180   4e-48
ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172...  177   4e-47
ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966...  174   3e-45
emb|CDP20930.1| unnamed protein product [Coffea canephora]           149   1e-36
ref|XP_012841289.1| PREDICTED: uncharacterized protein LOC105961...  141   4e-34
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...  145   6e-34
ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobrom...  144   7e-34
ref|XP_011094921.1| PREDICTED: uncharacterized protein LOC105174...  129   3e-32
ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobrom...  139   7e-32
ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobrom...  138   8e-32
ref|XP_011075252.1| PREDICTED: uncharacterized protein LOC105159...  135   1e-31
ref|XP_007023857.1| Uncharacterized protein TCM_028230 [Theobrom...  137   2e-31
ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628...  132   4e-31
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...  135   1e-30
ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobrom...  135   1e-30
ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobrom...  133   3e-30
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...  133   4e-30
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...  132   1e-29
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...  131   2e-29

>ref|XP_011101871.1| PREDICTED: uncharacterized protein LOC105179909 [Sesamum indicum] Length = 733 Score = 187 bits (476), Expect = 2e-49 Identities = 89/246 (36%), Positives = 142/246 (57%), Gaps = 3/246 (1%) Frame = -1 Query: 905 PFGNSRKEDGQKVLGFSSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYL 726 P G G+ + F++ E + L A ++ +L+GKFS P + ++ L ++ + Sbjct: 71 PLGIKSVNQGRPTISFTNTETEELAAPFRFSLVGKFSHGAPPYSQMHQLIARLGIQGAFT 130 Query: 725 WSFANSSHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--P 552 S NS H +I L E DY++LW+ +W L PMR+F WTP F P E+ + +++ P Sbjct: 131 VSMINSKHTLISLSCESDYSRLWLRRIWFLQGFPMRIFKWTPTFTPTQESSVVPIFVCFP 190 Query: 551 GLPIHMFNYHALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAE 372 LP H+F+ AL+++ VG+PLQ+D+ T +++LS AR CVEI+LLK +EE L + Sbjct: 191 KLPAHLFHKEALFSVASMVGSPLQIDALTLNKSKLSQARVCVEIDLLKPIIEEFDLHIND 250 Query: 371 VRHVQKIIYERVPDYCLHCKHIGHNVDACYMNGNKIRPPPPVR-RPAEKKVVNRKDVVAN 195 V VQK+++E +P YC CKH+GH C+ GN +PPP + P ++ + A Sbjct: 251 VTIVQKVVFEYLPKYCFLCKHVGHKDSDCFSKGNAPKPPPRNKINPRHRQFAGIETKQAR 310 Query: 194 QKGTKV 177 +GTK+ Sbjct: 311 DRGTKM 316 >ref|XP_011071645.1| PREDICTED: uncharacterized protein LOC105157045 [Sesamum indicum] Length = 507 Score = 180 bits (457), Expect = 4e-48 Identities = 104/292 (35%), Positives = 153/292 (52%), Gaps = 14/292 (4%) Frame = -1 Query: 1019 PEGHVTSDNPIPPKSYANVTGSSFSSHVQL----SFNPKDVVPFGNSRKEDGQK--VLGF 858 P + + N P K++A V + +S + P D+ G G K L F Sbjct: 58 PSSSIPTSN-FPKKTFAEVLAPTRASKPATPAPHKYFPVDLPSPGIGTVLTGDKGPTLLF 116 Query: 857 SSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIE 678 + E + L A +K L+GKFS P + ++ +K + S N+ H++I L E Sbjct: 117 TDDETEVLAAPFKFALVGKFSHGAPSYSILHKLIAGTGIKNKFTVSMLNTRHVLISLSCE 176 Query: 677 EDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIY 504 D+++LW+ +W + PMRVF WTPAF P E+ I VW+ P LP H+F L+ + Sbjct: 177 ADFSRLWLRRIWYIQGYPMRVFKWTPAFTPSKESSIVPVWVSFPELPAHLFRKEVLFTVA 236 Query: 503
KEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAEVRHVQKIIYERVPDYC 324 +G PLQ+D T +++LS ARAC+E++LLK R+E ++ VQ+I YE +P YC Sbjct: 237 SMIGTPLQIDDATLNQSKLSKARACIELDLLKPRLENFQIQICGTTIVQRIEYEDIPHYC 296 Query: 323 LHCKHIGHNVDACYMNGNKIRPPPPVRRP---AEKKV---VNRKDVVANQKG 186 CKH+GH CY G+ +PPP R+P A KKV V R VA + G Sbjct: 297 SLCKHVGHQDSDCYTKGDAPKPPP--RKPSNRAGKKVAEEVGRGKAVAKETG 346 >ref|XP_011092915.1| PREDICTED: uncharacterized protein LOC105172985 [Sesamum indicum] Length = 470 Score = 177 bits (448), Expect = 4e-47 Identities = 96/222 (43%), Positives = 127/222 (57%), Gaps = 2/222 (0%) Frame = -1 Query: 887 KEDGQKVLGFSSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANS 708 ++ G KVL FSS E RL+ ++ L+GKFS P + + + A + + N Sbjct: 6 RDQGMKVLRFSSDEISRLSLPFRYALVGKFSHGYPSMQNLRRWMLAQGFRGDFSVGAINV 65 Query: 707 SHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHM 534 H+ IK +EEDY KLW+ + W + PMRVF WTP F+PR E+PI VW+ P LPI Sbjct: 66 RHVFIKFALEEDYTKLWIKSTWFVEGFPMRVFKWTPTFNPREESPIVPVWVRLPELPIQF 125 Query: 533 FNYHALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAEVRHVQK 354 F+ AL++I +G PL+ D TA R S+AR CVEINLL+ EI L +Q Sbjct: 126 FDREALFSIAHLLGTPLRTDVSTATLVRPSVARVCVEINLLEPLQTEIGLGIGTEVIIQP 185 Query: 353 IIYERVPDYCLHCKHIGHNVDACYMNGNKIRPPPPVRRPAEK 228 +IYER+P YC CKH+GH+ D CY K R P RPA K Sbjct: 186 VIYERLPKYCGACKHLGHDEDECY---EKHRSKAPPVRPANK 224 >ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966659 [Erythranthe guttata] Length = 582 Score = 174 bits (441), Expect = 3e-45 Identities = 103/274 (37%), Positives = 155/274 (56%), Gaps = 18/274 (6%) Frame = -1 Query: 917 KDVVPFGNSRKEDGQKVLGFSSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLK 738 +D+ P G + DG+ VL FS E D++ K TLIGKFS I H K ++ + L+ + Sbjct: 86 EDIAPIGTIKVIDGKNVLYFSKEEVDKMLEPLKYTLIGKFSHGIHHYKVMEKFIYDLKPR 145 Query: 737 RLYLWSFANSSHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVW 558 + N H++I+ + + Y+ L ++ + PMRVF +TP F+ + E IA VW Sbjct: 146 GSFELHKLNYRHVLIQFSVLDYYSLLLRRSICYIDGLPMRVFKYTPGFNLKNETSIAPVW 205 Query: 557 
--IPGLPIHMFNYHALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEI-- 390 +PG+P +M+N A++ + +GNPL+ D TA R +LS+AR CVEI+LLK RVE+I Sbjct: 206 VNVPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKLSVARFCVEIDLLKPRVEQIPV 265 Query: 389 VLEFAEVRHVQ-KIIYERVPDYCLHCKHIGHNVDACYMNGNKIRP--PPPVRR------- 240 + + +V + + YE VP +C C H+GH+V+ CYMNGN +P PPP +R Sbjct: 266 MTGYDDVEMISLPVNYENVPKFCTFCSHLGHSVENCYMNGNAKKPDFPPPPQRIPKPTAL 325 Query: 239 PAEK----KVVNRKDVVANQKGTKVNSKKLNSEN 150 P EK +V RK+VV S ++EN Sbjct: 326 PKEKQVWRRVEKRKNVVVENMDIPKTSGTKSTEN 359 >emb|CDP20930.1| unnamed protein product [Coffea canephora] Length = 497 Score = 149 bits (376), Expect = 1e-36 Identities = 78/208 (37%), Positives = 117/208 (56%), Gaps = 3/208 (1%) Frame = -1 Query: 878 GQKVLGFSSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHI 699 G+ + FS + D+L A ++ L+GKFS P + I ++L LK + H+ Sbjct: 43 GEAAVVFSKADADKLAAPFQWALVGKFSHGRPSLEDIRKFFASLNLKDHVSIGLMDYRHV 102 Query: 698 IIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNY 525 +IK E D+N++WM +W LG PMRVF WT F E+ +A VW+ P LPIH F+ Sbjct: 103 LIKCMAEADFNRIWMRGIWQLGKYPMRVFRWTREFHVLRESSLAPVWVVLPALPIHYFDK 162 Query: 524 HALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEI-VLEFAEVRHVQKII 348 H+L++I VG PL +DS TA TR S+AR CVE+++ K + + V E Q+I+ Sbjct: 163 HSLFSILSPVGRPLFLDSATAAGTRPSLARVCVELDVAKSFTQRVWVAVEGESGFWQRIV 222 Query: 347 YERVPDYCLHCKHIGHNVDACYMNGNKI 264 E +P YC C +GH+ + C N ++ Sbjct: 223 PENMPLYCSSCSRLGHSQEQCKKNVTEV 250 >ref|XP_012841289.1| PREDICTED: uncharacterized protein LOC105961601 [Erythranthe guttata] Length = 449 Score = 141 bits (356), Expect = 4e-34 Identities = 79/206 (38%), Positives = 121/206 (58%), Gaps = 18/206 (8%) Frame = -1 Query: 713 NSSHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPI 540 N H++I+ + +DY+ L ++ + PMRVF +TP F+ + E IA VW+ PG+P Sbjct: 20 NYRHVLIQFSVLDDYSLLLRRSICYIHGLPMRVFKYTPGFNLKNETSIAPVWVNVPGVPP 79 Query: 539 HMFNYHALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEI--VLEFAEVR 366 +M+N A++ + +GNPL+ D TA R ++S+AR CVEI+LLK RVE+I + + ++ Sbjct: 80 
YMYNREAIFFLASSIGNPLEFDDFTADRKKISVARFCVEIDLLKPRVEQIPVMTGYDDIE 139 Query: 365 HVQ-KIIYERVPDYCLHCKHIGHNVDACYMNGNKIRP--PPPVRR-------PAEK---- 228 + YE VP +C C H+GH+V+ CYMNGN +P PPP +R P EK Sbjct: 140 MISLPGNYENVPKFCTFCSHLGHSVENCYMNGNAKKPDFPPPPQRIPKPTAPPKEKQVWR 199 Query: 227 KVVNRKDVVANQKGTKVNSKKLNSEN 150 +V RK+VV + S ++EN Sbjct: 200 RVEKRKNVVVENMDIPITSGTKSTEN 225 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 145 bits (365), Expect = 6e-34 Identities = 81/243 (33%), Positives = 128/243 (52%), Gaps = 9/243 (3%) Frame = -1 Query: 848 ENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDY 669 E L +KL+L+GKFS +P + + + + L Y + + H++I L E+D+ Sbjct: 1792 EIQTLAKPFKLSLVGKFS-RMPKLQDVRAAFKGIGLAGAYEVRWLDYKHVLIHLSNEQDF 1850 Query: 668 NKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEV 495 N++W W + MRVF WTP F+P E+ + VWI P L H+F AL I K V Sbjct: 1851 NRIWTKQNWFIATQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTV 1910 Query: 494 GNPLQVDSPTARRTRLSMARACVEINLLKERVEE--IVLEFAEVRHV-----QKIIYERV 336 G PL VD TA +R S+AR CVE + + +++ IV++ + + Q++ + ++ Sbjct: 1911 GKPLFVDEATANGSRPSVARVCVEFDCRQPPLDQVWIVVQNRKTGEITNGYSQRVEFAQM 1970 Query: 335 PDYCLHCKHIGHNVDACYMNGNKIRPPPPVRRPAEKKVVNRKDVVANQKGTKVNSKKLNS 156 P YC HC H+GH C + GNK RPP ++P + + V + + G K+ N Sbjct: 1971 PAYCDHCCHVGHKETDCILLGNKARPPGITKQPNSRLEDGGRRVGSKEDGEFTTEKRKNI 2030 Query: 155 ENA 147 EN+ Sbjct: 2031 ENS 2033 Score = 131 bits (330), Expect = 2e-29 Identities = 75/221 (33%), Positives = 116/221 (52%), Gaps = 9/221 (4%) Frame = -1 Query: 788 IPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFN 609 +P + I + L Y+ + + HI+I L E+D+N++W W + + MRVF Sbjct: 1 MPKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFK 60 Query: 608 WTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVDSPTARRTRLSMAR 435 W+P F+ E+PI VWI P L H++ AL I K VG PL +D T+ +R S+AR Sbjct: 61 
WSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVAR 120 Query: 434 ACVEINLLKERVEEI-------VLEFAEVRHVQKIIYERVPDYCLHCKHIGHNVDACYMN 276 CVE N VEEI V + QK+ + ++PDYC HC H+GH+V C + Sbjct: 121 VCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLVL 180 Query: 275 GNKIRPPPPVRRPAEKKVVNRKDVVANQKGTKVNSKKLNSE 153 GN+ +R+ V ++ +A +K T+ + K L+S+ Sbjct: 181 GNR---SENLRKEKLSNVHSKS--LAGKKQTENDDKGLDSK 216 >ref|XP_007026454.1| Uncharacterized protein TCM_030494 [Theobroma cacao] gi|508781820|gb|EOY29076.1| Uncharacterized protein TCM_030494 [Theobroma cacao] Length = 876 Score = 144 bits (363), Expect = 7e-34 Identities = 90/253 (35%), Positives = 130/253 (51%), Gaps = 11/253 (4%) Frame = -1 Query: 992 PIPPKSYANVTGSSFS--SHVQLSFNPKDVVPFGNSRKEDGQKVLGFSSLENDRLTADWK 819 P PP S S S + V+L+ P F R +D V F E + L +K Sbjct: 78 PQPPASPRTAKKSFLSVVNAVKLALVPPTRPTF---RYKDKPAVRFFED-EIEALAQPFK 133 Query: 818 LTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLWMGTLWS 639 ++GKFS +P I +L L +Y + N HI+I L E+D+N++W W Sbjct: 134 FAIVGKFS-KMPRLTEIRQSFVSLGLSGVYNIRWMNYKHILIHLSNEQDFNRIWTKQTWF 192 Query: 638 LGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVDSPT 465 + + MRVF WTP F+ E+PI VWI P L H+F AL I K +GNPL +D T Sbjct: 193 ITNQKMRVFKWTPDFETDKESPIVPVWISFPNLKAHLFEKSALLMIAKAIGNPLYIDEAT 252 Query: 464 ARRTRLSMARACVEINLLKERVEEIVL-------EFAEVRHVQKIIYERVPDYCLHCKHI 306 A TR S+AR C+E + LK V+ + + E ++QK+ + +P+YC HC H+ Sbjct: 253 ANGTRPSVARVCIEYDCLKPPVDSVWIVVSKRGSEDMSGGYLQKVEFAPMPEYCNHCCHV 312 Query: 305 GHNVDACYMNGNK 267 GHNV C + G++ Sbjct: 313 GHNVSKCLILGSR 325 >ref|XP_011094921.1| PREDICTED: uncharacterized protein LOC105174492 [Sesamum indicum] Length = 171 Score = 129 bits (324), Expect = 3e-32 Identities = 65/150 (43%), Positives = 88/150 (58%), Gaps = 6/150 (4%) Frame = -1 Query: 623 MRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVDSPTARRTR 450 MRVF WTP F P E+ I VW+ P LP H+F L+ + + PLQ+D T +++ Sbjct: 1 MRVFKWTPTFTPSKESSIVPVWVSFPKLPAHLFRKEVLFTVASMIETPLQIDDATLNQSK 60 Query: 449 
LSMARACVEINLLKERVEEIVLEFAEVRHVQKIIYERVPDYCLHCKHIGHNVDACYMNGN 270 LS ARAC+E++LLK R+E+ ++ VQ+I YE +P YC CKH+GH CY G+ Sbjct: 61 LSKARACIELDLLKPRLEDFQIQICGATIVQRIEYEDIPHYCSLCKHVGHRDSDCYTEGD 120 Query: 269 KIRPPP-PVRRPAEKKV---VNRKDVVANQ 192 +PPP R AEKKV V R VA + Sbjct: 121 APKPPPQKPRNRAEKKVAEEVGRGKAVAKE 150 >ref|XP_007017130.1| Uncharacterized protein TCM_042329 [Theobroma cacao] gi|508787493|gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao] Length = 2606 Score = 139 bits (349), Expect = 7e-32 Identities = 83/245 (33%), Positives = 127/245 (51%), Gaps = 16/245 (6%) Frame = -1 Query: 836 LTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLW 657 L +K +++GKFS +P I + + L +Y + + HI+I L E+D N+LW Sbjct: 101 LAQPFKHSMVGKFS-RMPKLNDIRAAFKGISLVGVYEIRWLDYKHILIHLSNEQDLNRLW 159 Query: 656 MGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPL 483 M W + + MRVF WTP F P E+ + VWI P L H++ AL I K VG PL Sbjct: 160 MRQAWFIANQKMRVFKWTPDFQPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPL 219 Query: 482 QVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAEVR-------HVQKIIYERVPDYC 324 VD TA TR S+AR CVE + + +E+I + + R QK+ + ++P+YC Sbjct: 220 FVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVTRDRRTGDITGGFQQKVDFAKLPNYC 279 Query: 323 LHCKHIGHNVDACYMNGNKIR-------PPPPVRRPAEKKVVNRKDVVANQKGTKVNSKK 165 HC H+GH+ C + G+++ P R+ AE + RK+V G ++SK Sbjct: 280 THCCHVGHSASTCLVMGHRMEKAENSNAQPYTGRKQAENE---RKEVANKPTGDPMSSKG 336 Query: 164 LNSEN 150 + +N Sbjct: 337 TDRKN 341 Score = 132 bits (332), Expect = 1e-29 Identities = 73/203 (35%), Positives = 111/203 (54%), Gaps = 9/203 (4%) Frame = -1 Query: 848 ENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDY 669 E L KL+L+GKFS +P + + S + L Y + + H++I L E+D Sbjct: 1723 EIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLTGAYEVRWLDYKHVLIHLSNEQDC 1781 Query: 668 NKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEV 495 N++W +W + + MRVF WTP F+P E+ + VWI P L H+F AL I K V Sbjct: 1782 NRVWTKQVWFIANQKMRVFKWTPEFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTV 1841 Query: 494 
GNPLQVDSPTARRTRLSMARACVEINLLKERVEE--IVLEFAEVRHV-----QKIIYERV 336 G PL VD TA +R S+AR C+E + + +++ IV++ E V Q++ + ++ Sbjct: 1842 GKPLFVDEATANGSRPSVARVCIEFDCRRPPIDQVWIVVQNRETGTVTSGYPQRVEFSQM 1901 Query: 335 PDYCLHCKHIGHNVDACYMNGNK 267 P YC HC H+GH + C + GNK Sbjct: 1902 PAYCDHCCHVGHKENDCIVLGNK 1924 >ref|XP_007010391.1| Uncharacterized protein TCM_044158 [Theobroma cacao] gi|508727304|gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao] Length = 830 Score = 138 bits (347), Expect = 8e-32 Identities = 89/292 (30%), Positives = 140/292 (47%), Gaps = 13/292 (4%) Frame = -1 Query: 1043 MANSGSSPPEGHVTSDNPIPP----KSYANVTGSSFSSHVQLSFNPKDVVPFGNSRKEDG 876 +A S P H + P+ P KS+ V SS + P D PF + Sbjct: 31 LATENSKPSLSHGHTQAPVSPRTQKKSFLAVAAGEKSSLI-----PLDREPFWYKDRP-- 83 Query: 875 QKVLGFSSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHII 696 F E L +K +++GKFS + + I + L Y + + HI+ Sbjct: 84 --AASFFDDEISTLAQPFKFSMVGKFSRML-RMQEIRVAFKGIGLIGAYEIRWLDYKHIL 140 Query: 695 IKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYH 522 I+L E D N++W+ +W + + MRVF W+P F P E+ + VWI P L H++ Sbjct: 141 IQLSNEHDLNRIWLKQVWFISNQKMRVFKWSPEFQPEKESSMVPVWISFPNLKAHLYEKS 200 Query: 521 ALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAEVR-------H 363 AL AI K VG PL VD TA TR S+AR CVE + + ++++ + + + Sbjct: 201 ALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQVWIVTRNRQSGSVMGGY 260 Query: 362 VQKIIYERVPDYCLHCKHIGHNVDACYMNGNKIRPPPPVRRPAEKKVVNRKD 207 +QK+ + R+ ++C HC H+GH V +C + GN RP + KK + ++D Sbjct: 261 MQKVEFARLSEFCTHCSHVGHGVSSCMVIGN--RPEKNKQPMGGKKQLKKED 310 >ref|XP_011075252.1| PREDICTED: uncharacterized protein LOC105159763 [Sesamum indicum] Length = 476 Score = 135 bits (339), Expect = 1e-31 Identities = 90/287 (31%), Positives = 128/287 (44%), Gaps = 10/287 (3%) Frame = -1 Query: 989 IPPKSYANVTGSSFSSHVQLS----FNPKDVVP--FGNSRKEDGQKVLGFSSLENDRLTA 828 +P KS+A V +S S + P D P FG D L F+ E + L A Sbjct: 67 LPKKSFAEVVAPPQASKTAKSAHHKYFPTDSPPAAFGTVLTGDNGPTLQFTDAETEILAA 126 Query: 827 
DWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLWMGT 648 ++ L+GKFS P S ++ KL + G Sbjct: 127 PFRFALVGKFSHGAP------------------------SYSMLHKLMAGTGIKNRFTGY 162 Query: 647 LWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVD 474 PMRVF WTP F P E+ I W+ P LP ++F L+ + +G PLQ+D Sbjct: 163 -------PMRVFKWTPTFTPSQESSIVPGWVSFPELPAYLFRKEVLFTVASMIGTPLQID 215 Query: 473 SPTARRTRLSMARACVEINLLKERVEEIVLEFAEVRHVQKIIYERVPDYCLHCKHIGHNV 294 T +++LS ARAC+E++LLK R+E ++ VQ+I YE +P YC CK +GH Sbjct: 216 DATLNQSKLSKARACIELDLLKPRLENFQIQICGTTIVQRIEYEDIPHYCSLCKQVGHQD 275 Query: 293 DACYMNGNKIRPPP--PVRRPAEKKVVNRKDVVANQKGTKVNSKKLN 159 CY G+ +PPP P R +K A K T +SK ++ Sbjct: 276 SDCYTKGDAPKPPPRKPSNRAGKKVAEEVGRGKAEAKETGESSKMMD 322 >ref|XP_007023857.1| Uncharacterized protein TCM_028230 [Theobroma cacao] gi|508779223|gb|EOY26479.1| Uncharacterized protein TCM_028230 [Theobroma cacao] Length = 748 Score = 137 bits (344), Expect = 2e-31 Identities = 78/248 (31%), Positives = 125/248 (50%), Gaps = 11/248 (4%) Frame = -1 Query: 779 PKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTP 600 P I + + L Y + + HI I L E+D N++W+ +W + + +RVF WT Sbjct: 90 PTEIRNAFKGIGLAGAYDIRWLDYKHIHIGLSNEQDMNRIWLKQVWFISNQKLRVFKWTK 149 Query: 599 AFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVDSPTARRTRLSMARACV 426 F P E+ + VWI P L H++ A+ I K VG PL VD T TR S+AR C+ Sbjct: 150 DFQPEKESSLVPVWISFPNLRAHLYEKSAVLVIAKTVGRPLFVDEATDNGTRPSLARVCI 209 Query: 425 EINLLKERVEEIVLEFAEVR-------HVQKIIYERVPDYCLHCKHIGHNVDACYMNGNK 267 E + LK ++++ + + R +QK+ +ER+PDYC HC H+GH+V C + GNK Sbjct: 210 EYDCLKPPLDQVWIVMRDRRTGEITGGFMQKVDFERMPDYCTHCCHVGHSVSTCIVMGNK 269 Query: 266 --IRPPPPVRRPAEKKVVNRKDVVANQKGTKVNSKKLNSENAINSGLIKGAPTD*AWVMV 93 ++ P + EK +N +++ +++ + + + +EN S I W V Sbjct: 270 RVMQGPERAKPSDEKNKINTEEIGKDKQPVERRERLVRTENGNESIDINVKKQGMEWREV 329 Query: 92 RKKGARET 69 K G T Sbjct: 330 MKAGKSGT 337 >ref|XP_012065816.1| PREDICTED: uncharacterized protein LOC105628933 [Jatropha curcas] Length = 397 Score = 132 bits (332), Expect = 4e-31 
Identities = 67/203 (33%), Positives = 111/203 (54%), Gaps = 2/203 (0%) Frame = -1 Query: 878 GQKVLGFSSLENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHI 699 G + FS E+ +L ++ L+G F P+ K + + + K + +SSHI Sbjct: 58 GVPSISFSWDESMKLANQFRFALVGIFQSGRPNMKSLRQFMDKIGFKGEFSLGLLDSSHI 117 Query: 698 IIKLQIEEDYNKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNY 525 +IK ++EED+++ W+ +W MR+ WT F P + I WI GLPIH+F Sbjct: 118 LIKFELEEDFHRCWLKQIWYFQGFSMRISKWTRNFRPNTDCSIVPTWILFEGLPIHLFAK 177 Query: 524 HALYAIYKEVGNPLQVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAEVRHVQKIIY 345 AL+ I +G PL+VD+ TA +R S+AR CVE++L K+ ++ ++ ++ Q + Y Sbjct: 178 AALFPIANLIGKPLKVDAATATLSRPSVARVCVELDLSKDLPNKVWIDDGDLGFFQPVNY 237 Query: 344 ERVPDYCLHCKHIGHNVDACYMN 276 E +P +C C IGH + +C +N Sbjct: 238 ESLPLFCTKCCRIGHEILSCPLN 260 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 135 bits (340), Expect = 1e-30 Identities = 78/225 (34%), Positives = 120/225 (53%), Gaps = 17/225 (7%) Frame = -1 Query: 788 IPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLWMGTLWSLGDCPMRVFN 609 +P + + + + L Y + + H++I L E+D+N++W W + MRVF Sbjct: 1 MPKLQDVRAAFKGIALTGAYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKMRVFK 60 Query: 608 WTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVDSPTARRTRLSMAR 435 WTP F+P E+ + VWI P L H+F AL I K VG PL VD TA +R S+AR Sbjct: 61 WTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVAR 120 Query: 434 ACVEINLLKERVEE--IVLEFAEVRHV-----QKIIYERVPDYCLHCKHIGHNVDACYMN 276 CVE + K V++ IV++ + V Q++ + ++P YC HC H+GH C + Sbjct: 121 VCVEYDCRKSPVDQVWIVVQNRKTGEVMNGYSQRVEFAQMPAYCDHCCHVGHKETDCILL 180 Query: 275 GNKIRPP----PPVRR--PAEKKVVNRKDV--VANQKGTKVNSKK 165 GNK RPP PP R E+++ ++D + +++ T NSKK Sbjct: 181 GNKPRPPGTSKPPTSRIEDGERRIGLKEDAEFITDKRKTVANSKK 225 >ref|XP_007031319.1| Uncharacterized protein TCM_016772 [Theobroma cacao] gi|508710348|gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao] Length = 1296 Score = 
135 bits (339), Expect = 1e-30 Identities = 75/199 (37%), Positives = 108/199 (54%), Gaps = 9/199 (4%) Frame = -1 Query: 836 LTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLW 657 L +K ++IGKF+ +P + I + + L Y + + HI+I L E D N++W Sbjct: 101 LALSFKFSMIGKFT-RMPKLQEIRTAFKGIGLVGAYNIRWLDYKHILIHLSNEHDLNRIW 159 Query: 656 MGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPL 483 M W + + MRVF WTP F P E+ + VWI P L H + L I K VG PL Sbjct: 160 MKQNWFIVNKKMRVFKWTPEFHPEKESSLVPVWISFPNLRAHFYEKSTLMMIAKSVGRPL 219 Query: 482 QVDSPTARRTRLSMARACVEINLLKERVEEIVL-----EFAEVR--HVQKIIYERVPDYC 324 VD TA TR ++AR CVE + K +++I + + EV +QK+ + ++PDYC Sbjct: 220 FVDEATANGTRPNVARICVEYDCQKSLLDQIWIVTRSRQTGEVTGGFIQKVEFVKMPDYC 279 Query: 323 LHCKHIGHNVDACYMNGNK 267 HC H+GHN AC + GNK Sbjct: 280 THCCHVGHNASACLVLGNK 298 >ref|XP_007026455.1| Uncharacterized protein TCM_021519 [Theobroma cacao] gi|508715060|gb|EOY06957.1| Uncharacterized protein TCM_021519 [Theobroma cacao] Length = 667 Score = 133 bits (334), Expect = 3e-30 Identities = 72/240 (30%), Positives = 123/240 (51%), Gaps = 9/240 (3%) Frame = -1 Query: 836 LTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLW 657 L + L L+GKF+ +P + + S + L Y + + H++I L ++D+N++W Sbjct: 95 LAKPFSLCLVGKFT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIW 153 Query: 656 MGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPL 483 W + MR+F W+P F+ E+P+ VWI P L H++ AL I K +G PL Sbjct: 154 TRQQWFIVGQKMRIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPL 213 Query: 482 QVDSPTARRTRLSMARACVEINLLKERVEE--IVLEFAEV-----RHVQKIIYERVPDYC 324 VD PTA+ +R S+AR CVE + + +++ IV + E + QK+ + ++PDYC Sbjct: 214 FVDEPTAKGSRPSVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYC 273 Query: 323 LHCKHIGHNVDACYMNGNKIRPPPPVRRPAEKKVVNRKDVVANQKGTKVNSKKLNSENAI 144 HC H+GHN C + GN + ++ + + ++ Q K + +K + I Sbjct: 274 EHCCHVGHNETTCLVLGNNSKSSGSMKAQLKGQTKQTLNMSKTQTREKTDGEKEDKAKGI 333 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| 
Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 133 bits (335), Expect = 4e-30 Identities = 77/206 (37%), Positives = 111/206 (53%), Gaps = 9/206 (4%) Frame = -1 Query: 848 ENDRLTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDY 669 E L KL+L+GKFS +P + + S + L Y + + HI+I L E D Sbjct: 126 EIQTLAQPLKLSLVGKFS-RMPKLQDVRSAFKGIGLAGAYEVRWLDYKHILIHLTNEHDC 184 Query: 668 NKLWMGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEV 495 N++W +W + + MRVF WTP F+P E+ + VWI P L H+F AL I K V Sbjct: 185 NRVWTKQVWFIANQKMRVFKWTPEFEPEKESAMVPVWIAFPNLKAHLFEKSALLLIAKTV 244 Query: 494 GNPLQVDSPTARRTRLSMARACVEINLLKERVEE--IVLEFAEVRHV-----QKIIYERV 336 G PL VD TA +R S+AR C+E + K +++ IV++ E V QK+ + ++ Sbjct: 245 GKPLFVDEATANGSRPSVARVCIEYDCRKPPIDQVWIVVQNRETGTVTSGYPQKVEFSQM 304 Query: 335 PDYCLHCKHIGHNVDACYMNGNKIRP 258 P YC HC H+GH C + GNK +P Sbjct: 305 PAYCDHCCHVGHKEIDCIVLGNKDKP 330 >ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao] gi|508715062|gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao] Length = 1951 Score = 132 bits (331), Expect = 1e-29 Identities = 80/245 (32%), Positives = 125/245 (51%), Gaps = 16/245 (6%) Frame = -1 Query: 836 LTADWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLW 657 L +K +++GKFS +P I + + L +Y + + HI+I L E+D N+LW Sbjct: 101 LAQPFKHSMVGKFS-RMPKLNDIRAAFKGIGLVGVYEIRWLDYKHILIHLSNEQDLNRLW 159 Query: 656 MGTLWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPL 483 M W + + MRVF W+P F P E+ + VWI P L H++ AL I K VG PL Sbjct: 160 MRQAWFIANQKMRVFKWSPDFQPEKESSLVPVWISFPNLRAHLYEKSALLMIAKSVGRPL 219 Query: 482 QVDSPTARRTRLSMARACVEINLLKERVEEIVLEFAEVR-------HVQKIIYERVPDYC 324 VD TA TR S+AR CVE + + +E+I + + R QK+ + ++P+YC Sbjct: 220 FVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVSRDRRTGDITGGFQQKVDFAKLPNYC 279 Query: 323 LHCKHIGHNVDACYMNGNKIR-------PPPPVRRPAEKKVVNRKDVVANQKGTKVNSKK 165 HC H+GH+ C + G+++ P R+ AE + K+V G ++ K Sbjct: 280 THCCHVGHSASTCLVMGHRMEKANNSNAQPYTGRKQAEN---DGKEVANKPTGDLMSCKG 336 Query: 164 LNSEN 
150 + +N Sbjct: 337 TDRKN 341 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 131 bits (330), Expect = 2e-29 Identities = 93/301 (30%), Positives = 139/301 (46%), Gaps = 29/301 (9%) Frame = -1 Query: 1004 TSDNPIPPKSYANVTGSSFSSHVQLSFNPKDVVPFGNSR-KEDGQKVLGFSSLENDRLTA 828 T+ PP S SF S V S VVP + F E L Sbjct: 69 TTQTQTPPPSSPRFQKKSFLSIV--SGEKPSVVPLTRDPFVYKDRPAAAFFEDEIHILAQ 126 Query: 827 DWKLTLIGKFSFAIPHPKGIDSGLSALRLKRLYLWSFANSSHIIIKLQIEEDYNKLWMGT 648 +KL+L+GKFS +P + + S + L Y + + HI+I L E+D+N+ W Sbjct: 127 PFKLSLVGKFS-RMPKLQEVRSAFKGIGLAGSYEIRWLDYKHILIHLSNEQDFNRFWTKQ 185 Query: 647 LWSLGDCPMRVFNWTPAFDPRLEAPIASVWI--PGLPIHMFNYHALYAIYKEVGNPLQVD 474 W + + MRVF WTP F+P E+ + VWI P L H+F AL I K VG PL +D Sbjct: 186 AWFIANQKMRVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFID 245 Query: 473 SPTARRTRLSMARACVEINLLKERVEE--IVLEFAEVRHV-----QKIIYERVPDYCLHC 315 TA +R S+AR C+E + + V++ IV++ V QK+ + ++P YC HC Sbjct: 246 EATANGSRPSVARVCIEYDCREPPVDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYCDHC 305 Query: 314 KHIGHNVDACYMNGN-------------------KIRPPPPVRRPAEKKVVNRKDVVANQ 192 H+GH C + GN K+R ++ P + K+V+ +D +Q Sbjct: 306 CHVGHKEINCIVLGNKNGLQGSGKPQPHSVVDADKLRNLEKIKNPDKGKIVSTEDQAKHQ 365 Query: 191 K 189 + Sbjct: 366 Q 366