BLASTX nr result
ID: Rehmannia25_contig00000244
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00000244
         (3126 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...   198   1e-47
ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...   197   2e-47
ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260...   196   5e-47
ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...   189   5e-45
gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]     182   8e-43
ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...   172   1e-39
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...   164   2e-37
ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250...   164   3e-37
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...   159   7e-36
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...   158   1e-35
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...   157   2e-35
ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594...   148   1e-32
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...   146   6e-32
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   144   3e-31
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...   138   1e-29
ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, part...   137   3e-29
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   137   3e-29
ref|XP_006410238.1| hypothetical protein EUTSA_v10016698mg [Eutr...   137   4e-29
gb|EPS62712.1| hypothetical protein M569_12076, partial [Genlise...   132   9e-28
gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca...   132   9e-28

>ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum tuberosum]
          Length = 420

 Score = 198 bits (504), Expect = 1e-47
 Identities = 161/483 (33%), Positives = 236/483 (48%), Gaps = 39/483 (8%)
 Frame = -1

Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435
            MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P
Sbjct: 1    MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60

Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255
              +DPVKVA  DLSLNPYAHT+++K     L          + N+++ D
Sbjct: 61   FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKG----GHPMVINKELID----------- 105

Query: 1254 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 1081
                               + V   KSK  GVY+R  +GIK I ++NH PSK S  +  +
Sbjct: 106  ------------------DTQVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147

Query: 1080 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 967
            SG+  +L           VASD M +T+ +    G        E  N+I  T +S A
Sbjct: 148  SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207

Query: 966  VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 787
            +   ASD+ L  + V + + D    S   D     +S   D+  C+  +  +  K+S+
Sbjct: 208  INVAASDRSLSVDCVGQNQADLRNTSSVGD----LQSDSHDRGTCKELAGDTGLKISS-- 261

Query: 786  LRQKKDDSQCTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNF 607
                       + D+ ++++ I+   +          I SNT D    GE++    ++
Sbjct: 262  ----------NTGDNNIASEEINNIAK----------ISSNTGDNNITGEEINESCKERS 301

Query: 606  D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 457
            D          ++IE++ E VE  + SKLE+TC+LV+ +KLH V Q + K KSYKKK+R+
Sbjct: 302  DKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLRQ 360

Query: 456  ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 292
              S K +STRK+         L G     +      +  L  +S+ + L    D ES+W
Sbjct: 361  VFSMKKKSTRKE---YEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEW 417

Query: 291  EIL 283
            E+L
Sbjct: 418  ELL 420

>ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED: uncharacterized protein LOC102601397 isoform X2 [Solanum tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED: uncharacterized protein LOC102601397 isoform X3 [Solanum tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED: uncharacterized protein LOC102601397 isoform X4 [Solanum tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED: uncharacterized protein LOC102601397 isoform X5 [Solanum tuberosum] Length = 421 Score = 197 bits (502), Expect = 2e-47 Identities = 161/483 (33%), Positives = 235/483 (48%), Gaps = 39/483 (8%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255 +DPVKVA DLSLNPYAHT+++K L + N+++ D Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISKKLKAKLKG----GHPMVINKELID----------- 105 Query: 1254 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 1081 + V KSK GVY+R +GIK I ++NH PSK S + + Sbjct: 106 ------------------DTQVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147 Query: 1080 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 967 SG+ +L VASD M +T+ + G E N+I T +S A Sbjct: 148 SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207 Query: 966 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 787 + ASD+ L + V + + D S D L D+ C+ + + K+S+ Sbjct: 208 INVAASDRSLSVDCVGQNQADLRNTSSVGD---LQSDSHADRGTCKELAGDTGLKISS-- 262 Query: 786 LRQKKDDSQCTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNF 607 + D+ ++++ I+ + I SNT D GE++ ++ Sbjct: 263 ----------NTGDNNIASEEINNIAK----------ISSNTGDNNITGEEINESCKERS 302 Query: 606 D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 457 D ++IE++ E VE + SKLE+TC+LV+ +KLH V Q + K KSYKKK+R+ Sbjct: 303 
DKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLRQ 361 Query: 456 ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 292 S K +STRK+ L G + + L +S+ + L D ES+W Sbjct: 362 VFSMKKKSTRKE---YEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEW 418 Query: 291 EIL 283 E+L Sbjct: 419 ELL 421 >ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260247 [Solanum lycopersicum] Length = 374 Score = 196 bits (498), Expect = 5e-47 Identities = 163/463 (35%), Positives = 227/463 (49%), Gaps = 19/463 (4%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQ+ VG +VK+F SEVMQD+ P Sbjct: 1 MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQMNTVGTNVKRFCSEVMQDVHPQ 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255 +DPVKVA DLSLNPYAH +++K L K S R Sbjct: 61 CNIDPVKVAAADLSLNPYAHYEIDKKLKANL----------------------KGSAR-- 96 Query: 1254 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSLS 1078 +N L N + V KSK GVYKR +GIK I +++H +K + S Sbjct: 97 GFSNKL----------NDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHLTKKPNAICLAS 146 Query: 1077 GDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKEEK--DD 904 GD +L ++S G LASD + + ++ K D Sbjct: 147 GDALKL---------SSSAEVRGG--------------FELASDHVTLTSALASVKGSDS 183 Query: 903 SECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDS-----QC-----T 754 E AS S++++ D S AS +S ES+ +K+ D+ C T Sbjct: 184 GEVASKVSNHVI---QTNVSTADTSITSEAS-VMMSVESVGKKQTDTCTKELACNTRFKT 239 Query: 753 SAD--HGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 580 S+D + L+ + IDE + + + N++S IES D+E+ Sbjct: 240 SSDVRNNLANEEIDESHE-----EKSDNLLSKYDSIES-------------DLEI----- 276 Query: 579 AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 400 VE + +L +TC+LV+ D++H V QG K KSYKKK+R+A S+K R TRK+ Sbjct: 277 -VEKFDEFQLNETCVLVEEDRIH-VPQGPVKQKSYKKKLRDAFSTKKRLTRKE---YEQL 331 Query: 399 KDLGGQNNGGVTT----IPALEMDSDKRNLPVHDSFESDWEIL 283 L G V + +P L M+S+ + L +D ES+WEIL Sbjct: 332 
GALYGDQQIKVESEDKVMPVLAMNSNTKMLSANDHPESEWEIL 374 >ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum lycopersicum] Length = 421 Score = 189 bits (481), Expect = 5e-45 Identities = 158/483 (32%), Positives = 234/483 (48%), Gaps = 39/483 (8%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255 +DPVKVA DLSLNPYAHT+++K L + RV Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISKKLKAQL---------------------KGGHPRVI 99 Query: 1254 NDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 1081 N + L+++T V KSK GVY+R +G+K I ++NH PSK S + + Sbjct: 100 N----------KELIDDT--QVIKGKSKSGGVYRRQSVGMKEIVRDNHPPSKKSDALCLV 147 Query: 1080 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI----TPISEAC 967 SG+ +L VASD M +T+ + G E N+I P + Sbjct: 148 SGNTIKLSSDSKVRGGFEVASDHMTMTSPLASVKGLKSTETGKEVSNHIIKTEVPAAGIS 207 Query: 966 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 787 + ASD L + V + + D N ++ D R + K+L+ ++ Sbjct: 208 INIAASDTSLSVDCVGQNQADLR-------NTFSVGDLQSDSH----VDRGTRKELAGDT 256 Query: 786 LRQKKDDSQCTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNF 607 + + + D+ +++K ++ + I SNT D GE++ + Sbjct: 257 GLKISSN----TGDNNIASKEVNNIAK----------ISSNTDDNNIAGEEIKESCKARS 302 Query: 606 D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 457 D ++IE++ E VE + KLE+TC+LV+ +KLH V QG+ K KSYKKK+R+ Sbjct: 303 DKSCSPPPDKYDLIESDVEIVERYDEPKLEETCVLVEAEKLH-VPQGSVKRKSYKKKLRQ 361 Query: 456 ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 292 S K +STR + L G + + L +S+ + L D ES+W Sbjct: 362 VFSMKKKSTRTE---YEQLGALYGDQQPNLQPEEKQMQVLSKNSNPKKLSSADDHSESEW 418 Query: 291 EIL 283 E+L Sbjct: 419 ELL 421 >gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis] Length = 443 Score = 182 bits (462), Expect = 8e-43 Identities = 160/485 
(32%), Positives = 229/485 (47%), Gaps = 41/485 (8%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD KGITW GN+YQKFE MCLEVEE+MY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP Sbjct: 1 MDVKGITWVGNVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPP 60 Query: 1434 SCVDPVKV----------APGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDL 1285 S D KV + +S P K KP D + ++ ++ Sbjct: 61 SSQDSEKVSLCGFIGKQDSDDGISKKPNV---AKKEKPAKADDEQLIRTLKVTSDSKDVY 117 Query: 1284 IAEKSSLRVHNDANHLSSPS---PRGLVENTHS-----DVCFTKSKKVGVYKRPIGIKRI 1129 +A S+ V D +++ PS +G N S DV S + V + K I Sbjct: 118 LA--PSIHVRCDVDNMCRPSGECVKGACSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLI 175 Query: 1128 SQN-----NHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACV 964 +SRP++S S + + S D TT +VN Sbjct: 176 PPETSCAITREKHLSRPLSSYSEFVNEIHEISLDQTGTTK-----APSVN---------- 220 Query: 963 ESLASDKILIAESVKEEKDDSECAS------HASDNILLAESVKQDKEDCECASRASDKK 802 E +SD I+ ES E ++ SEC + HAS I+L +SV D + + S Sbjct: 221 EDTSSDSIV--ESCDEIENSSECMADLSSSFHASSEIILVKSVGYDGNEMDVPSGGG--- 275 Query: 801 LSAESLRQKKDDSQCTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVF-- 628 LS Q D + + L++ GG S N ++ + EDVF Sbjct: 276 LS----EQANGDYTSKCSSNSLAS-------TGG--SSQNEEARNDKY----ADEDVFVS 318 Query: 627 -PCYEDNFDMEVIENE-------EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYK 472 P D++++ + E+E E ++ + KLE+TC+LV+ D+LH + Q K + YK Sbjct: 319 LPRKFDDWNLNITESEIATEHGTETIQQRDKVKLEETCVLVNEDELHILPQRGGKWRPYK 378 Query: 471 KKIREALSSKLRSTRKQ--DPCVSHCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFES 298 KKIR+AL S++RS RK+ + V D N + + +++ LP DS ES Sbjct: 379 KKIRDALYSRMRSARKEEYEQLVLQYGDNKKLNQDFGEALAPTLIVKERKKLPHLDSCES 438 Query: 297 DWEIL 283 +WE+L Sbjct: 439 EWELL 443 >ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis] gi|223535579|gb|EEF37247.1| hypothetical protein RCOM_0553590 [Ricinus communis] Length = 490 Score = 172 bits (435), Expect = 1e-39 Identities = 164/524 (31%), Positives = 226/524 (43%), Gaps = 80/524 (15%) Frame = -1 Query: 1614 
MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD KGI+W GNIYQKFE MCLEVEEVMY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP Sbjct: 1 MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEKS--SL 1264 S VD K A D+ L YA + K K + + G+ +E ED +KS L Sbjct: 61 SSVDAAKGAGVDVPLELYADLGIYMKPKVGVKEKQGKVDDRERLTEDPKITTDKKSMDPL 120 Query: 1263 RVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQNNHP-SKISRPMT 1087 H GLVEN ++ G R G + +S ++P ++ + Sbjct: 121 TFHR----------LGLVENRFP---LSQGNSAGGASRQHGKRSLSNKSNPYTRKNSNRE 167 Query: 1086 SLSGDKSRLLVASDDMNVTTSV----------------------RCHPGEA--------- 1000 ++S DK ++ D + + C P + Sbjct: 168 NMSVDKKLEAISCLDKGLIRASFSERSNENLGDSGGGAPKQYGDSCLPKDTSLGTNGNSE 227 Query: 999 ----------------VNNITPISEACVESLASDKILIAESVK----------------E 916 N++T S C S + K + + K E Sbjct: 228 RQNIFLHEKARVVIPLYNDLTRASSICELSNENHKDCVDQQAKITTPGSVEMTGHDSVDE 287 Query: 915 EKDDSECASH----ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSA 748 K + E AS D + ES D C+S S LSAE+ DD A Sbjct: 288 SKYEIENASEQIPDIPDMVNSTESGASKGMDMTCSSHGS---LSAEA--HAADDCMSHGA 342 Query: 747 DHGLSTKPIDEFRQG---GLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 577 D P D F G G S + + VSN+ + DV+ D + E Sbjct: 343 DF-----PADSFVNGNGKGQSSDSDEDFVSNSGS-DDCNTDVY-----KIDFSISHEMEI 391 Query: 576 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCK 397 ++ V+ +KLE++CILV+ D+ H++ Q K KSYKKKIR+ S + RS RK + +S C Sbjct: 392 IQQVDKAKLEESCILVNRDECHYLPQSERKSKSYKKKIRDVFSPRKRSMRKHEQ-LSICP 450 Query: 396 DLGGQNNGGVTTIPALEM------DSDKRNLPVHDSFESDWEIL 283 G +N M D+D+ + P D +S+WE L Sbjct: 451 --GSDSNPNQEECAKNSMPRHTIKDADRYSTP--DCCDSEWEFL 490 >gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508700922|gb|EOX92818.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 397 Score = 164 bits (415), Expect = 2e-37 Identities = 143/461 (31%), Positives = 221/461 (47%), Gaps = 16/461 (3%) Frame = -1 Query: 1617 
TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1450 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1449 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1279 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1278 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 1114 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 1113 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 934 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 933 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 757 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 756 TSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 577 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 576 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQD---PCVS 406 V+ ++ +++++C +V+G +LHF Q KHK+Y++KIR+A+SS++RS RK++ + Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRMRSARKKEYEQLPLW 357 Query: 405 HCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 283 + D+ + ++ AL + +R L HD +S+WE+L Sbjct: 358 YGDDVKSDQDSEGSSTSALTREDTRRTLN-HDDLDSEWELL 397 >ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250516 [Vitis vinifera] gi|302143402|emb|CBI21963.3| unnamed protein product [Vitis vinifera] Length = 451 Score = 164 bits (414), Expect = 3e-37 Identities = 156/493 (31%), Positives = 233/493 (47%), Gaps = 49/493 (9%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDT-------VKYVENQVQKVGVSVKKFYSEV 1456 MDFKGITW GN+YQKFET+CLEVE++MY+DT VKYVE+QV+ VG SVKKF SE+ Sbjct: 1 MDFKGITWVGNMYQKFETICLEVEDIMYQDTVKYFENHVKYVEDQVETVGESVKKFCSEI 60 Query: 1455 
MQDLLPPSCVDPVKVAPGDLSLNPYAHTDLNKSKPTM---------------LDSYGEFK 1321 +QDLL P D ++V +LSL+ + + L K KP + + EF Sbjct: 61 VQDLLLP---DSLEVTDSNLSLDQHDNVKLCK-KPKVGIKEEAKVGFKEEPKVSIKEEFI 116 Query: 1320 KKEI----ENEDISDL---IAEKSSLRVHNDANHL----SSPSPRGLVENTH----SDVC 1186 K +I E+ +I+DL + KSS + N+L S S G + H D Sbjct: 117 KFDIDRLTEHSEIADLNEDVEHKSSFTGLHGVNNLFQSYSGNSVTGACSDLHLVQNDDGV 176 Query: 1185 FTKSKKVGVYKRPIGIKRISQNNHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPG 1006 K+ G+ + P+ ++SQ P ++S + +SGD SRL +N +C+ Sbjct: 177 MCKNLDAGIKRNPV---KVSQ--FPIEVSGVIAPISGDVSRL---PSSLNENCENKCNQM 228 Query: 1005 EAVNNITPISEACVESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCEC 826 + S A VE + + ++ E D S ++ L ESV ++ + Sbjct: 229 AITS-----SPASVEITDCN---LEGAICNEIADVTAISVDLPSVPLVESVGKEGREMVF 280 Query: 825 ASRAS-DKKLSAESLRQKKDDSQCTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIE 649 +SR +L+A ++ D+G+ + G F I N + D+ Sbjct: 281 SSRGGLSSELNAGNI----------PMDNGVGSLI-------GSFRDIQQNETAEKKDLL 323 Query: 648 SIGEDVFPCYEDNFDMEVIENEEAVEP-VETS-------KLEDTCILVDGDKLHFVSQGT 493 S E D ++++ IE + +E +ET+ KLED C++VDGD+LH VS Sbjct: 324 SHSEG-----SDGWNIDAIEINDVIEQGIETTKDLLDKMKLEDACVMVDGDELHVVSHRE 378 Query: 492 EKHKSYKKKIREALSSKLRSTRKQDPCVS---HCKDLGGQNNGGVTTIPALEMDSDKRNL 322 K KKK+R A SK R RK+ ++ D G P+ DSDKR Sbjct: 379 GKVWLVKKKLRNAFYSKRRLARKEYERLAVWHRVIDSESNQPGAEGLTPSPSTDSDKRTS 438 Query: 321 PVHDSFESDWEIL 283 P D +S+WE+L Sbjct: 439 PDDDFCQSEWELL 451 >ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|567908905|ref|XP_006446766.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|557549376|gb|ESR60005.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|557549377|gb|ESR60006.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] Length = 416 Score = 159 bits (402), Expect = 7e-36 Identities = 138/457 (30%), Positives = 224/457 (49%), Gaps = 13/457 (2%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG 
+VKKFYS+V++DLLPP Sbjct: 1 MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60 Query: 1434 SCVDPVKVA-PGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEK---- 1273 VD VK A +L L A + K K + + +++ ++ +K Sbjct: 61 PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMNVNNEQLSESSLATTDLDKGAGG 120 Query: 1272 --SSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRPIGIKRISQNNH--P 1111 S R H + PS ++ SD ++K + G + I +++IS+ ++ P Sbjct: 121 GQSFCRFHIEDTSF-QPSLGDTLKGVFSD-AYSKEYDIRSGHNQSSICMQKISKEDNLPP 178 Query: 1110 SKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 931 S++S + R S C + + ++ + + ++ Sbjct: 179 SEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTPVTTEVASC 227 Query: 930 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 751 +S +E D+ E AS + L + ++ ++ E A +S LSAE + CT+ Sbjct: 228 KSFEEIYDELEKASKGASGALTSSPAAKNCDESENA-HSSCSSLSAEL------NGICTN 280 Query: 750 ADHGLSTKPIDEFRQGGLFSQINPNIVSNTW-DIESIGEDVFPCYEDNFDMEVIENEEAV 574 D +S + S +N ++ + + D E N D+E + E V Sbjct: 281 -DGVVSL----------VGSFVNEDVQPSEFPDPGRSDYSTVDATESNIDVE--QGYETV 327 Query: 573 EPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCKD 394 + V+ ++E+TC+LV+GD+L FV KH+ YKKKI++A+SS++RSTRK + K Sbjct: 328 QRVDNIQVEETCVLVNGDELCFVPCREGKHRPYKKKIQDAISSRMRSTRKHE-----YKQ 382 Query: 393 LGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 283 L N + + +++ + P H E +WE+L Sbjct: 383 LAVWYN---EDEKSKQQNAEMKGKPSHGYCELEWELL 416 >ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED: uncharacterized protein LOC102611541 isoform X2 [Citrus sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED: uncharacterized protein LOC102611541 isoform X3 [Citrus sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED: uncharacterized protein LOC102611541 isoform X4 [Citrus sinensis] Length = 416 Score = 158 bits (400), Expect = 1e-35 Identities = 137/457 (29%), Positives = 224/457 (49%), Gaps = 13/457 (2%) Frame = -1 Query: 1614 
MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP Sbjct: 1 MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60 Query: 1434 SCVDPVKVA-PGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEK---- 1273 VD VK A +L L A + K K + + + +++ ++ +K Sbjct: 61 PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMKVNNEQLSESSLATTDLDKGAGG 120 Query: 1272 --SSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRPIGIKRISQNNH--P 1111 S R H + PS ++ SD + K + G + I +++IS+ ++ P Sbjct: 121 GQSFCRFHIEDTSF-QPSLGNTLKGVFSD-AYPKEYDIRSGHNQSSICMQKISKEDNLPP 178 Query: 1110 SKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 931 S++S + R S C + + ++ + + ++ Sbjct: 179 SEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTSVTTEVASC 227 Query: 930 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 751 +S +E D+ E AS + L + ++ ++ E A +S LSAE + CT+ Sbjct: 228 KSFEEIYDELEKASKGASGALTSSPAAKNCDESESA-HSSCSSLSAEL------NGICTN 280 Query: 750 ADHGLSTKPIDEFRQGGLFSQINPNIVSNTW-DIESIGEDVFPCYEDNFDMEVIENEEAV 574 D +S + S +N ++ + + D E N D+E + E V Sbjct: 281 -DGVVSL----------VGSFVNEDVQPSEFPDPGRSDYSTVDATESNIDVE--QGYETV 327 Query: 573 EPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCKD 394 + V+ ++E+TC+LV+GD+L FV +KH+ KKKI++A+SS++RSTRK + K Sbjct: 328 QRVDNIQVEETCVLVNGDELCFVPCREDKHRPCKKKIQDAISSRMRSTRKHE-----YKQ 382 Query: 393 LGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 283 L N + + +++ + P H E +WE+L Sbjct: 383 LAVWYN---EDEKSKQQNAETKGKPSHGYCELEWELL 416 >ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca subsp. 
vesca] Length = 389 Score = 157 bits (398), Expect = 2e-35 Identities = 142/459 (30%), Positives = 204/459 (44%), Gaps = 14/459 (3%) Frame = -1 Query: 1617 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLP 1438 TMD KGITW G +Y+KFE+MCLEVEE MYEDTVK+VE+QVQ VG SVKKFY++VMQDLL Sbjct: 3 TMDVKGITWVGCVYEKFESMCLEVEENMYEDTVKFVEDQVQTVGESVKKFYADVMQDLLC 62 Query: 1437 PSCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSL-R 1261 S +D V+ G + Y+ D +KSK KKKE + ++ + + Sbjct: 63 DSSLDRDDVSAGGFPVEHYSDVDNSKSKIR--------KKKEHVKAGVEEVKGDSEVISA 114 Query: 1260 VHNDANHLSSPSPRGLVENTHSDVCFTKSK----KVGVYKRPIGI----KRISQNNHPSK 1105 V D +H GL TKS K+ ++ G+ K+I P K Sbjct: 115 VLKDVDH------TGLFHRQRVYDSCTKSSGNCAKLACSRQDHGVRSCNKKIVVRETPIK 168 Query: 1104 ISRP--MTSLSGDKSR--LLVASDDMNVTTSVRC-HPGEAVNNITPISEACVESLASDKI 940 P T++ D SR L S+ N C P E + P +S+ S+ Sbjct: 169 DRLPGANTAVGKDFSRESLSSCSEFSNEDRDTSCDQPDEVITPSKPPEGMRCDSM-SESC 227 Query: 939 LIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQ 760 ++A + + DD +SD I+L D D + + D + Sbjct: 228 VVANASQCTGDDVSVNCQSSDMIVL------DNSDGKRWNELLDSSI------------- 268 Query: 759 CTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 580 GGL +++N ++ + D E N E Sbjct: 269 ------------------GGLSTELNGGSINPSMD----------AIESNIG---THGTE 297 Query: 579 AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 400 ++ + KLE+TC++V G+ LHFV +K YKKKI +A +S+ S RKQ+ Sbjct: 298 IIQQSDKPKLEETCVMVSGEDLHFVHHTVANYKPYKKKIPKAFTSRTSSARKQE-----Y 352 Query: 399 KDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 283 + L + G T LE + + P HD ES+WEIL Sbjct: 353 EQLALWH--GHHTKSILEGGEESKKSPTHDFCESEWEIL 389 >ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594335 isoform X1 [Solanum tuberosum] Length = 260 Score = 148 bits (374), Expect = 1e-32 Identities = 107/291 (36%), Positives = 141/291 (48%), Gaps = 4/291 (1%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQV VG +VK+F SEVMQD+ P Sbjct: 1 
MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQ 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKS-KPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1258 +DPVKVA DLS+NPYAH +++K K + S F K Sbjct: 61 CNIDPVKVAAADLSINPYAHYEIDKKLKANLKGSARRFSNK------------------- 101 Query: 1257 HNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSL 1081 N + V KSK GVYKR +GIK I +++HP+K + Sbjct: 102 ----------------LNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHPAKKPNAICLA 145 Query: 1080 SGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKEEK--D 907 SGD +L ++S G +ASD + + ++ K D Sbjct: 146 SGDALKL---------SSSAEVRGG--------------FEMASDHVTLTSALASVKGSD 182 Query: 906 DSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCT 754 E AS D+ + D S AS +S ES+R+K+ D+ CT Sbjct: 183 SGEAASKVRDHFI---QTNVSAADTSITSEAS-VTMSVESVRKKQTDT-CT 228 >gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 343 Score = 146 bits (368), Expect = 6e-32 Identities = 128/406 (31%), Positives = 193/406 (47%), Gaps = 13/406 (3%) Frame = -1 Query: 1617 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1450 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1449 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1279 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1278 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 1114 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 1113 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 934 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 933 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 757 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 756 
TSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 577 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 576 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKL 439 V+ ++ +++++C +V+G +LHF Q KHK+Y++KIR+A+SS++ Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRM 343 >ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata] Length = 418 Score = 144 bits (362), Expect = 3e-31 Identities = 126/421 (29%), Positives = 200/421 (47%), Gaps = 22/421 (5%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+QDLLP Sbjct: 1 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPD 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHT-DLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1258 VD K P + L+ YA K + +M + K+++ E D A+K Sbjct: 61 DSVDSGKPLPVSM-LHEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKF---- 115 Query: 1257 HNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRPMT 1087 RGL + + D+C + + G Y+R +G K+I + S+++RP Sbjct: 116 ------------RGLDADDY-DICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRPY- 161 Query: 1086 SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 931 + D S L + DD +N ++ H +++ ++ + + + S + I Sbjct: 162 -MQKDSSSLSMVHSARVKDDVGTVNSSSLSMVHSARVKDDVGTVNSSSLTMVHSAR--IK 218 Query: 930 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 751 + V K I E K DK + + L+ + ++ D Sbjct: 219 DDVGTVKSSDSPPGEVEKLIYKKECQKDDK-------TKNQQSLTVVNSVKRNDSEIRID 271 Query: 750 ADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPC-YEDNFDMEVI------ 592 +HGL + S+I P++ ++ + G D C E N D + Sbjct: 272 NEHGLMGDSSQD-------SEIQPSVATSL----AAGSD--DCRKETNVDTKTSSSSVSE 318 Query: 591 ENEEAVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQ 421 + E ++P+ +E++CILVD D+ H V +KHK Y KKIR+A+SS+++ R++ Sbjct: 319 
QKSEILQPLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREK 377 Query: 420 D 418 + Sbjct: 378 E 378 >gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 341 Score = 138 bits (348), Expect = 1e-29 Identities = 127/406 (31%), Positives = 191/406 (47%), Gaps = 13/406 (3%) Frame = -1 Query: 1617 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1450 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1449 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1279 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1278 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 1114 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 1113 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 934 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 933 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 757 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 756 TSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 577 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 576 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKL 439 V+ ++ +++++C +V+G +LHF Q KHK+Y +IR+A+SS++ Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTY--QIRDAISSRM 341 >ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella] gi|482562952|gb|EOA27142.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella] Length = 436 Score = 137 bits (345), Expect = 3e-29 Identities = 123/414 (29%), Positives = 195/414 (47%), Gaps = 15/414 (3%) Frame = -1 Query: 1614 
MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQV VG SVKKF S+V+QDLLP Sbjct: 13 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVHTVGNSVKKFCSDVVQDLLP- 71 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKK-EIENEDISDLIAEKSSLRV 1258 D V G P + LN+ P FKKK E N D+ E+ Sbjct: 72 ---DDDSVGSG----KPLPVSMLNEYAPVC-----SFKKKRESANRKTRDVKQEEEVTEG 119 Query: 1257 HNDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKR-PIGIKRISQNNHPSKISRPMT 1087 D + + RGL + + D+C + + G Y+R +G K+I + S+I+RP Sbjct: 120 KKDG---CAMNLRGLDADDY-DICTSPRQYSYGGPYRRGRVGRKQIFKKEELSQITRPY- 174 Query: 1086 SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 931 + D S L + DD +N ++ H G +++ ++ + + + S + I Sbjct: 175 -IQKDSSNLTMVHSARVKDDVGTVNSSSLSMAHSGRVKDDVGTVNSSSLSMVHSAR--IK 231 Query: 930 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 751 V+ K I E K D+ D + L+ + + KD T Sbjct: 232 ADVETVKSSDSRPGEIERLISKKECQKDDRTD-------NQHGLTMVNSVRSKDSEIRTE 284 Query: 750 ADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEAVE 571 +H L+ + + + ++++ + + E E E + + E ++ Sbjct: 285 IEHSLTVVNSVRSQDSEILPSVATSLLTGSSN-EFRKETKEDSMEASSSSVSEQKSEILQ 343 Query: 570 PVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD 418 + +E++CILVD D+ H V +KHK Y KKIR+A+SS+++ R+++ Sbjct: 344 HLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREKE 396 >ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6 [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2| expressed protein [Arabidopsis thaliana] gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6 [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1| uncharacterized protein AT2G31130 [Arabidopsis thaliana] Length = 419 Score = 137 bits (345), Expect = 3e-29 Identities = 138/466 (29%), Positives = 209/466 (44%), Gaps = 22/466 (4%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+ DLLP Sbjct: 1 
MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPD 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255 VD K P + L+ YA Y KKK+ N D+ E+ Sbjct: 61 ESVDSGKPLPVSM-LHEYAPV------------YSFKKKKDSMNRKTKDVTQEQEVTEGK 107 Query: 1254 NDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRP--- 1093 D + RGL + + D+C + + G Y+R IG K+I + S++ RP Sbjct: 108 KDG---FAKKLRGLDADDY-DICTSPRQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQ 163 Query: 1092 --MTSLSGDKSRLL------VASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKIL 937 +TSLS S + V S +++ S R + N + +S S+ D Sbjct: 164 KDLTSLSMVHSARVKDDLGTVNSSSLSMVHSARVNDDVGTVNSSSLSMVHHASMKDDVGT 223 Query: 936 IAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKK-LSAESLRQKKDDSQ 760 + S + + S K+ C+ +A +++ L+ + + D Sbjct: 224 VKSSDSPPGEVEKLIS---------------KKKCQKDDKAKNQQSLTVVNSVKSNDSEV 268 Query: 759 CTSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 580 +HGLS R L +I P++ ++ ES E + E Sbjct: 269 IVDNEHGLSAD--KSVRSQDL--EIQPSLATSL-PAESDDCRKETNVETSSSSVSEPKSE 323 Query: 579 AVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD--- 418 ++ + +E++CILVD D+ H V +KHK Y KKIR+A+SS+++ R+++ Sbjct: 324 ILQHLSGRSVEESCILVDRDEFHSVFPDKMENDKHKPY-KKIRDAISSRMKQNREKEYKR 382 Query: 417 -PCVSHCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 283 + +D+ G P E S S ES+WE+L Sbjct: 383 LARQWYAEDVENGRECGDNPKPIEENQS---------SEESEWELL 419 >ref|XP_006410238.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] gi|567211021|ref|XP_006410239.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] gi|557111407|gb|ESQ51691.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] gi|557111408|gb|ESQ51692.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] Length = 426 Score = 137 bits (344), Expect = 4e-29 Identities = 122/414 (29%), Positives = 201/414 (48%), Gaps = 15/414 (3%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 M FKGITW GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG S+KKF S+V+ D LP Sbjct: 1 
MAFKGITWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSMKKFCSDVVGDFLPD 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255 V K P + L+ YA K KK+E N D+ E+ Sbjct: 61 ESVGSEKPLPVSM-LHEYAPVCSFK------------KKRESLNRKTRDVKQEQEVSEGK 107 Query: 1254 NDANHLSSPSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRPMTS 1084 D + RGL + + D+C + + G Y+R +G K+I +N +++RP + Sbjct: 108 KDGCEMKF---RGLDADDY-DICTSPRQYSYGGPYRRTRLGRKQIYKNEEVFQVTRP-SY 162 Query: 1083 LSGDKSRLLV-----ASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDK---ILIAE 928 + D S L + ++D+ S P E I+ E C + ++ + + Sbjct: 163 IQKDSSSLSMVHRSRVNNDVGAVKSSDSPPVEVERLIS--KEECQKDDRTENQHGLTVVN 220 Query: 927 SVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSA 748 SV+ + DSE + + + +SV+ +D E ++ S+R +D Sbjct: 221 SVRSQ--DSETRTKKEHGLTMVDSVR--SQDSETRTKNEHGLTMVNSVR-SEDSEIGIEN 275 Query: 747 DHGLSTKPIDEFRQGGLFSQINPNIVSNTWDI-ESIGEDVFPCYEDNFDMEVIENEEAVE 571 +HGL+ + + + ++ + + + D + E+ + + ++E E Sbjct: 276 EHGLTVVNSGRCQDSEIQTSVSTSSPAGSDDCRKETNENSMETSSSSVSEQ--KSEILQE 333 Query: 570 PVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD 418 E LE++CI+VD D+LH V + +KHK Y KKIR+A+SS+++ R+++ Sbjct: 334 LSEGRSLEESCIIVDRDELHCVFPDRKENDKHKPY-KKIRDAISSRMKQNREKE 386 >gb|EPS62712.1| hypothetical protein M569_12076, partial [Genlisea aurea] Length = 147 Score = 132 bits (332), Expect = 9e-28 Identities = 80/165 (48%), Positives = 95/165 (57%), Gaps = 1/165 (0%) Frame = -1 Query: 1614 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1435 MDFKGI W GN+YQKFE MCLEVEEV+YEDTVKY+E Q+QKV SVKKFY+E+M DL P Sbjct: 1 MDFKGIAWVGNVYQKFEAMCLEVEEVVYEDTVKYMEGQMQKVSGSVKKFYTEIMDDLNPS 60 Query: 1434 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1255 S P K + DL +P+ H L K KP + +E E D D A K Sbjct: 61 SGDAPAKYSESDLVWDPFGHVHLMK-KPR------DIVPEEKEVGDAFDFAAGKKD---- 109 Query: 1254 NDANHLSSPSPRGLVENTH-SDVCFTKSKKVGVYKRPIGIKRISQ 1123 P VE+ H TKS K+G +RPIGIKRIS+ Sbjct: 110 ---------PPLVFVEDLHCGSRAATKSPKLGACRRPIGIKRISK 145 >gb|EOX92821.1| Uncharacterized protein 
isoform 5 [Theobroma cacao] gi|508700926|gb|EOX92822.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 334 Score = 132 bits (332), Expect = 9e-28 Identities = 122/395 (30%), Positives = 182/395 (46%), Gaps = 13/395 (3%) Frame = -1 Query: 1617 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1450 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1449 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1279 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1278 E----KSSLRVHNDANHLSSPSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 1114 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 1113 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 934 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 933 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 757 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 756 TSADHGLSTKPIDEFRQGGLFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 577 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 576 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYK 472 V+ ++ +++++C +V+G +LHF Q KHK+Y+ Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQ 332