BLASTX nr result
ID: Rehmannia23_contig00010992
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00010992
         (1402 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...  199   2e-48
ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...  198   4e-48
ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260...  197   9e-48
ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...  189   2e-45
gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]    179   2e-42
ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...  172   2e-40
ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250...  166   2e-38
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...  165   5e-38
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...  160   9e-37
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...  159   3e-36
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...  159   3e-36
ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594...  148   5e-33
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...  147   1e-32
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...  144   1e-31
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...  139   3e-30
ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, part...  139   4e-30
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...  138   5e-30
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...
136 2e-29 ref|XP_006410238.1| hypothetical protein EUTSA_v10016698mg [Eutr... 135 3e-29 gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca... 133 2e-28 >ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum tuberosum] Length = 420 Score = 199 bits (506), Expect = 2e-48 Identities = 162/483 (33%), Positives = 239/483 (49%), Gaps = 39/483 (8%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002 +DPVKVA DLSLNPYAHT+++K L+ Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISK------------------------------KLKAK 90 Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 828 H ++ + L+++T V KSK GVY+R +GIK I ++NH PSK S + + Sbjct: 91 LKGGHPMVIN-KELIDDT--QVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147 Query: 827 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 714 SG+ +L VASD M +T+ + G E N+I T +S A Sbjct: 148 SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207 Query: 713 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 534 + ASD+ L + V + + D S D +S D+ C+ + + K+S+ Sbjct: 208 INVAASDRSLSVDCVGQNQADLRNTSSVGD----LQSDSHDRGTCKELAGDTGLKISS-- 261 Query: 533 LRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNF 354 + D+ ++++ I+ + I SNT D GE++ ++ Sbjct: 262 ----------NTGDNNIASEEINNIAK----------ISSNTGDNNITGEEINESCKERS 301 Query: 353 D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 204 D ++IE++ E VE + SKLE+TC+LV+ +KLH V Q + K KSYKKK+R+ Sbjct: 302 DKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLRQ 360 Query: 203 ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 39 S K +STRK+ L G + + L +S+ + L D ES+W Sbjct: 361 VFSMKKKSTRKE---YEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEW 417 Query: 38 EIL 30 E+L Sbjct: 418 ELL 420 
>ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED: uncharacterized protein LOC102601397 isoform X2 [Solanum tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED: uncharacterized protein LOC102601397 isoform X3 [Solanum tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED: uncharacterized protein LOC102601397 isoform X4 [Solanum tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED: uncharacterized protein LOC102601397 isoform X5 [Solanum tuberosum] Length = 421 Score = 198 bits (504), Expect = 4e-48 Identities = 162/483 (33%), Positives = 238/483 (49%), Gaps = 39/483 (8%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002 +DPVKVA DLSLNPYAHT+++K L+ Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISK------------------------------KLKAK 90 Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 828 H ++ + L+++T V KSK GVY+R +GIK I ++NH PSK S + + Sbjct: 91 LKGGHPMVIN-KELIDDT--QVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDALCLV 147 Query: 827 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI--TPISEA--C 714 SG+ +L VASD M +T+ + G E N+I T +S A Sbjct: 148 SGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAAGIS 207 Query: 713 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 534 + ASD+ L + V + + D S D L D+ C+ + + K+S+ Sbjct: 208 INVAASDRSLSVDCVGQNQADLRNTSSVGD---LQSDSHADRGTCKELAGDTGLKISS-- 262 Query: 533 LRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNF 354 + D+ ++++ I+ + I SNT D GE++ ++ Sbjct: 263 ----------NTGDNNIASEEINNIAK----------ISSNTGDNNITGEEINESCKERS 302 Query: 353 D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 204 D ++IE++ E VE + SKLE+TC+LV+ +KLH V Q + K KSYKKK+R+ Sbjct: 303 
DKSCSPPPEKYDLIESDVEIVEHYDESKLEETCVLVEAEKLH-VPQESVKQKSYKKKLRQ 361 Query: 203 ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 39 S K +STRK+ L G + + L +S+ + L D ES+W Sbjct: 362 VFSMKKKSTRKE---YEQLGALHGDQQPNLEPEEKPMQVLSKNSNMKKLSSADDHSESEW 418 Query: 38 EIL 30 E+L Sbjct: 419 ELL 421 >ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260247 [Solanum lycopersicum] Length = 374 Score = 197 bits (501), Expect = 9e-48 Identities = 164/463 (35%), Positives = 227/463 (49%), Gaps = 19/463 (4%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQ+ VG +VK+F SEVMQD+ P Sbjct: 1 MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQMNTVGTNVKRFCSEVMQDVHPQ 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002 +DPVKVA DLSLNPYAH +++K L K S R Sbjct: 61 CNIDPVKVAAADLSLNPYAHYEIDKKLKANL----------------------KGSAR-- 96 Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSLS 825 +N L N + V KSK GVYKR +GIK I +++H +K + S Sbjct: 97 GFSNKL----------NDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHLTKKPNAICLAS 146 Query: 824 GDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKEEK--DD 651 GD +L ++S G LASD + + ++ K D Sbjct: 147 GDALKL---------SSSAEVRGG--------------FELASDHVTLTSALASVKGSDS 183 Query: 650 SECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDS-----QC-----T 501 E AS S++++ D S AS +S ES+ +K+ D+ C T Sbjct: 184 GEVASKVSNHVI---QTNVSTADTSITSEAS-VMMSVESVGKKQTDTCTKELACNTRFKT 239 Query: 500 SAD--HGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 327 S+D + L+ + IDE S + + N++S IES D+E+ Sbjct: 240 SSDVRNNLANEEIDE-----SHEEKSDNLLSKYDSIES-------------DLEI----- 276 Query: 326 AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 147 VE + +L +TC+LV+ D++H V QG K KSYKKK+R+A S+K R TRK+ Sbjct: 277 -VEKFDEFQLNETCVLVEEDRIH-VPQGPVKQKSYKKKLRDAFSTKKRLTRKE---YEQL 331 Query: 146 KDLGGQNNGGVTT----IPALEMDSDKRNLPVHDSFESDWEIL 30 L G V + +P L M+S+ + L +D ES+WEIL Sbjct: 332 
GALYGDQQIKVESEDKVMPVLAMNSNTKMLSANDHPESEWEIL 374 >ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum lycopersicum] Length = 421 Score = 189 bits (481), Expect = 2e-45 Identities = 158/483 (32%), Positives = 234/483 (48%), Gaps = 39/483 (8%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGI W G+IYQKFE MCLE+E+ MY+DT +YVENQVQ VG SVK+FYS+V+ DL P Sbjct: 1 MDLKGIAWVGDIYQKFEAMCLEMEDAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQ 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002 +DPVKVA DLSLNPYAHT+++K L + RV Sbjct: 61 FNIDPVKVAAADLSLNPYAHTEISKKLKAQL---------------------KGGHPRVI 99 Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKR-PIGIKRISQNNH-PSKISRPMTSL 828 N + L+++T V KSK GVY+R +G+K I ++NH PSK S + + Sbjct: 100 N----------KELIDDT--QVIKGKSKSGGVYRRQSVGMKEIVRDNHPPSKKSDALCLV 147 Query: 827 SGDKSRLL----------VASDDMNVTTSVRCHPG--------EAVNNI----TPISEAC 714 SG+ +L VASD M +T+ + G E N+I P + Sbjct: 148 SGNTIKLSSDSKVRGGFEVASDHMTMTSPLASVKGLKSTETGKEVSNHIIKTEVPAAGIS 207 Query: 713 VESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAES 534 + ASD L + V + + D N ++ D R + K+L+ ++ Sbjct: 208 INIAASDTSLSVDCVGQNQADLR-------NTFSVGDLQSDSH----VDRGTRKELAGDT 256 Query: 533 LRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNF 354 + + + D+ +++K ++ + I SNT D GE++ + Sbjct: 257 GLKISSN----TGDNNIASKEVNNIAK----------ISSNTDDNNIAGEEIKESCKARS 302 Query: 353 D---------MEVIENE-EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIRE 204 D ++IE++ E VE + KLE+TC+LV+ +KLH V QG+ K KSYKKK+R+ Sbjct: 303 DKSCSPPPDKYDLIESDVEIVERYDEPKLEETCVLVEAEKLH-VPQGSVKRKSYKKKLRQ 361 Query: 203 ALSSKLRSTRKQDPCVSHCKDLGGQNNGGV----TTIPALEMDSDKRNL-PVHDSFESDW 39 S K +STR + L G + + L +S+ + L D ES+W Sbjct: 362 VFSMKKKSTRTE---YEQLGALYGDQQPNLQPEEKQMQVLSKNSNPKKLSSADDHSESEW 418 Query: 38 EIL 30 E+L Sbjct: 419 ELL 421 >gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis] Length = 443 Score = 179 bits (455), Expect = 2e-42 Identities = 158/485 
(32%), Positives = 226/485 (46%), Gaps = 41/485 (8%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGITW GN+YQKFE MCLEVEE+MY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP Sbjct: 1 MDVKGITWVGNVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPP 60 Query: 1181 SCVDPVKV----------APGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDL 1032 S D KV + +S P K KP D + ++ ++ Sbjct: 61 SSQDSEKVSLCGFIGKQDSDDGISKKPNV---AKKEKPAKADDEQLIRTLKVTSDSKDVY 117 Query: 1031 IAEKSSLRVHNDANHL---SSLSPRGLVENTHS-----DVCFTKSKKVGVYKRPIGIKRI 876 +A S+ V D +++ S +G N S DV S + V + K I Sbjct: 118 LA--PSIHVRCDVDNMCRPSGECVKGACSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLI 175 Query: 875 SQN-----NHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACV 711 +SRP++S S + + S D TT +VN Sbjct: 176 PPETSCAITREKHLSRPLSSYSEFVNEIHEISLDQTGTTK-----APSVN---------- 220 Query: 710 ESLASDKILIAESVKEEKDDSECAS------HASDNILLAESVKQDKEDCECASRASDKK 549 E +SD I+ ES E ++ SEC + HAS I+L +SV D + + S Sbjct: 221 EDTSSDSIV--ESCDEIENSSECMADLSSSFHASSEIILVKSVGYDGNEMDVPSGGG--- 275 Query: 548 LSAESLRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVF-- 375 LS Q D + + L++ GGS + + EDVF Sbjct: 276 LS----EQANGDYTSKCSSNSLAS-------TGGSSQN------EEARNDKYADEDVFVS 318 Query: 374 -PCYEDNFDMEVIENE-------EAVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYK 219 P D++++ + E+E E ++ + KLE+TC+LV+ D+LH + Q K + YK Sbjct: 319 LPRKFDDWNLNITESEIATEHGTETIQQRDKVKLEETCVLVNEDELHILPQRGGKWRPYK 378 Query: 218 KKIREALSSKLRSTRKQ--DPCVSHCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFES 45 KKIR+AL S++RS RK+ + V D N + + +++ LP DS ES Sbjct: 379 KKIRDALYSRMRSARKEEYEQLVLQYGDNKKLNQDFGEALAPTLIVKERKKLPHLDSCES 438 Query: 44 DWEIL 30 +WE+L Sbjct: 439 EWELL 443 >ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis] gi|223535579|gb|EEF37247.1| hypothetical protein RCOM_0553590 [Ricinus communis] Length = 490 Score = 172 bits (437), Expect = 2e-40 Identities = 164/524 (31%), Positives = 226/524 (43%), Gaps = 80/524 (15%) Frame = -3 Query: 1361 
MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGI+W GNIYQKFE MCLEVEEVMY+DTVKYVENQVQ VG SVK+FYS+VMQDLLPP Sbjct: 1 MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEKS--SL 1011 S VD K A D+ L YA + K K + + G+ +E ED +KS L Sbjct: 61 SSVDAAKGAGVDVPLELYADLGIYMKPKVGVKEKQGKVDDRERLTEDPKITTDKKSMDPL 120 Query: 1010 RVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQNNHP-SKISRPMT 834 H GLVEN ++ G R G + +S ++P ++ + Sbjct: 121 TFHR----------LGLVENRFP---LSQGNSAGGASRQHGKRSLSNKSNPYTRKNSNRE 167 Query: 833 SLSGDKSRLLVASDDMNVTTSV----------------------RCHPGEA--------- 747 ++S DK ++ D + + C P + Sbjct: 168 NMSVDKKLEAISCLDKGLIRASFSERSNENLGDSGGGAPKQYGDSCLPKDTSLGTNGNSE 227 Query: 746 ----------------VNNITPISEACVESLASDKILIAESVK----------------E 663 N++T S C S + K + + K E Sbjct: 228 RQNIFLHEKARVVIPLYNDLTRASSICELSNENHKDCVDQQAKITTPGSVEMTGHDSVDE 287 Query: 662 EKDDSECASH----ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSA 495 K + E AS D + ES D C+S S LSAE+ DD A Sbjct: 288 SKYEIENASEQIPDIPDMVNSTESGASKGMDMTCSSHGS---LSAEA--HAADDCMSHGA 342 Query: 494 DHGLSTKPIDEFRQG---GSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324 D P D F G G S + + VSN+ + DV+ D + E Sbjct: 343 DF-----PADSFVNGNGKGQSSDSDEDFVSNSGS-DDCNTDVY-----KIDFSISHEMEI 391 Query: 323 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCK 144 ++ V+ +KLE++CILV+ D+ H++ Q K KSYKKKIR+ S + RS RK + +S C Sbjct: 392 IQQVDKAKLEESCILVNRDECHYLPQSERKSKSYKKKIRDVFSPRKRSMRKHEQ-LSICP 450 Query: 143 DLGGQNNGGVTTIPALEM------DSDKRNLPVHDSFESDWEIL 30 G +N M D+D+ + P D +S+WE L Sbjct: 451 --GSDSNPNQEECAKNSMPRHTIKDADRYSTP--DCCDSEWEFL 490 >ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250516 [Vitis vinifera] gi|302143402|emb|CBI21963.3| unnamed protein product [Vitis vinifera] Length = 451 Score = 166 bits (420), Expect = 2e-38 Identities = 157/495 (31%), Positives = 236/495 (47%), Gaps = 51/495 (10%) Frame = -3 Query: 1361 
MDFKGITWAGNIYQKFETMCLEVEEVMYEDT-------VKYVENQVQKVGVSVKKFYSEV 1203 MDFKGITW GN+YQKFET+CLEVE++MY+DT VKYVE+QV+ VG SVKKF SE+ Sbjct: 1 MDFKGITWVGNMYQKFETICLEVEDIMYQDTVKYFENHVKYVEDQVETVGESVKKFCSEI 60 Query: 1202 MQDLLPPSCVDPVKVAPGDLSLNPYAHTDLNKSKPTM---------------LDSYGEFK 1068 +QDLL P D ++V +LSL+ + + L K KP + + EF Sbjct: 61 VQDLLLP---DSLEVTDSNLSLDQHDNVKLCK-KPKVGIKEEAKVGFKEEPKVSIKEEFI 116 Query: 1067 KKEI----ENEDISDL---IAEKSSLRVHNDANHL----------SSLSPRGLVENTHSD 939 K +I E+ +I+DL + KSS + N+L + S LV+N Sbjct: 117 KFDIDRLTEHSEIADLNEDVEHKSSFTGLHGVNNLFQSYSGNSVTGACSDLHLVQNDDGV 176 Query: 938 VCFTKSKKVGVYKRPIGIKRISQNNHPSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCH 759 +C K+ G+ + P+ ++SQ P ++S + +SGD SRL +N +C+ Sbjct: 177 MC--KNLDAGIKRNPV---KVSQ--FPIEVSGVIAPISGDVSRL---PSSLNENCENKCN 226 Query: 758 PGEAVNNITPISEACVESLASDKILIAESVKEEKDDSECASHASDNILLAESVKQDKEDC 579 + S A VE + + ++ E D S ++ L ESV ++ + Sbjct: 227 QMAITS-----SPASVEITDCN---LEGAICNEIADVTAISVDLPSVPLVESVGKEGREM 278 Query: 578 ECASRAS-DKKLSAESLRQKKDDSQCTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWD 402 +SR +L+A ++ D+G+ + GSF I N + D Sbjct: 279 VFSSRGGLSSELNAGNI----------PMDNGVGSLI-------GSFRDIQQNETAEKKD 321 Query: 401 IESIGEDVFPCYEDNFDMEVIENEEAVEP-VETS-------KLEDTCILVDGDKLHFVSQ 246 + S E D ++++ IE + +E +ET+ KLED C++VDGD+LH VS Sbjct: 322 LLSHSEG-----SDGWNIDAIEINDVIEQGIETTKDLLDKMKLEDACVMVDGDELHVVSH 376 Query: 245 GTEKHKSYKKKIREALSSKLRSTRKQDPCVS---HCKDLGGQNNGGVTTIPALEMDSDKR 75 K KKK+R A SK R RK+ ++ D G P+ DSDKR Sbjct: 377 REGKVWLVKKKLRNAFYSKRRLARKEYERLAVWHRVIDSESNQPGAEGLTPSPSTDSDKR 436 Query: 74 NLPVHDSFESDWEIL 30 P D +S+WE+L Sbjct: 437 TSPDDDFCQSEWELL 451 >gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508700922|gb|EOX92818.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 397 Score = 165 bits (417), Expect = 5e-38 Identities = 143/461 (31%), Positives = 221/461 (47%), Gaps = 16/461 (3%) Frame = -3 Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ 
VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 860 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 680 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 503 TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 323 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQD---PCVS 153 V+ ++ +++++C +V+G +LHF Q KHK+Y++KIR+A+SS++RS RK++ + Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRMRSARKKEYEQLPLW 357 Query: 152 HCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30 + D+ + ++ AL + +R L HD +S+WE+L Sbjct: 358 YGDDVKSDQDSEGSSTSALTREDTRRTLN-HDDLDSEWELL 397 >ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|567908905|ref|XP_006446766.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|557549376|gb|ESR60005.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] gi|557549377|gb|ESR60006.1| hypothetical protein CICLE_v10015391mg [Citrus clementina] Length = 416 Score = 160 bits (406), Expect = 9e-37 Identities = 141/458 (30%), Positives = 226/458 (49%), Gaps = 14/458 (3%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGITW G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP Sbjct: 1 
MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60 Query: 1181 SCVDPVKVA-PGDLSLNPYAHTDL-NKSKPTMLDSYGEFKKKEIENEDISDLIAEK---- 1020 VD VK A +L L A + K K + + +++ ++ +K Sbjct: 61 PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKEEAMNVNNEQLSESSLATTDLDKGAGG 120 Query: 1019 --SSLRVH-NDANHLSSLSP--RGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQNNH-- 861 S R H D + SL +G+ + +S +S G + I +++IS+ ++ Sbjct: 121 GQSFCRFHIEDTSFQPSLGDTLKGVFSDAYSKEYDIRS---GHNQSSICMQKISKEDNLP 177 Query: 860 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681 PS++S + R S C + + ++ + + ++ Sbjct: 178 PSEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTPVTTEVAS 226 Query: 680 AESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCT 501 +S +E D+ E AS + L + ++ ++ E A +S LSAE + CT Sbjct: 227 CKSFEEIYDELEKASKGASGALTSSPAAKNCDESENA-HSSCSSLSAEL------NGICT 279 Query: 500 SADHGLSTKPIDEFRQGGSFSQINPNIVSNTW-DIESIGEDVFPCYEDNFDMEVIENEEA 324 + D +S GSF +N ++ + + D E N D+E + E Sbjct: 280 N-DGVVSLV--------GSF--VNEDVQPSEFPDPGRSDYSTVDATESNIDVE--QGYET 326 Query: 323 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCK 144 V+ V+ ++E+TC+LV+GD+L FV KH+ YKKKI++A+SS++RSTRK + K Sbjct: 327 VQRVDNIQVEETCVLVNGDELCFVPCREGKHRPYKKKIQDAISSRMRSTRKHE-----YK 381 Query: 143 DLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30 L N + + +++ + P H E +WE+L Sbjct: 382 QLAVWYN---EDEKSKQQNAEMKGKPSHGYCELEWELL 416 >ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED: uncharacterized protein LOC102611541 isoform X2 [Citrus sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED: uncharacterized protein LOC102611541 isoform X3 [Citrus sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED: uncharacterized protein LOC102611541 isoform X4 [Citrus sinensis] Length = 416 Score = 159 bits (402), Expect = 3e-36 Identities = 139/459 (30%), Positives = 223/459 (48%), Gaps = 15/459 (3%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGITW 
G++YQKFE MCLEVEE+MY+DTVKYVENQVQ VG +VKKFYS+V++DLLPP Sbjct: 1 MDLKGITWVGHVYQKFEAMCLEVEEIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPP 60 Query: 1181 SCVDPVKVA-PGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1005 VD VK A +L L A + K + + ++ NE +S+ + L Sbjct: 61 PSVDLVKGAVASNLPLEQNADVGIYKKPKIGIKE----EAMKVNNEQLSESSLATTDLDK 116 Query: 1004 HNDAN------HLSSLSPRGLVENTHSDV---CFTKSKKV--GVYKRPIGIKRISQNNH- 861 H+ S + + NT V + K + G + I +++IS+ ++ Sbjct: 117 GAGGGQSFCRFHIEDTSFQPSLGNTLKGVFSDAYPKEYDIRSGHNQSSICMQKISKEDNL 176 Query: 860 -PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKIL 684 PS++S + R S C + + ++ + + ++ Sbjct: 177 PPSEMSGAGPHMERGLRR-----------ASSSCELLDKIQEVSDDQVVVDPTSVTTEVA 225 Query: 683 IAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504 +S +E D+ E AS + L + ++ ++ E A +S LSAE + C Sbjct: 226 SCKSFEEIYDELEKASKGASGALTSSPAAKNCDESESA-HSSCSSLSAEL------NGIC 278 Query: 503 TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTW-DIESIGEDVFPCYEDNFDMEVIENEE 327 T+ D +S GSF +N ++ + + D E N D+E + E Sbjct: 279 TN-DGVVSLV--------GSF--VNEDVQPSEFPDPGRSDYSTVDATESNIDVE--QGYE 325 Query: 326 AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 147 V+ V+ ++E+TC+LV+GD+L FV +KH+ KKKI++A+SS++RSTRK + Sbjct: 326 TVQRVDNIQVEETCVLVNGDELCFVPCREDKHRPCKKKIQDAISSRMRSTRKHE-----Y 380 Query: 146 KDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30 K L N + + +++ + P H E +WE+L Sbjct: 381 KQLAVWYN---EDEKSKQQNAETKGKPSHGYCELEWELL 416 >ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca subsp. 
vesca] Length = 389 Score = 159 bits (402), Expect = 3e-36 Identities = 147/459 (32%), Positives = 211/459 (45%), Gaps = 14/459 (3%) Frame = -3 Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLP 1185 TMD KGITW G +Y+KFE+MCLEVEE MYEDTVK+VE+QVQ VG SVKKFY++VMQDLL Sbjct: 3 TMDVKGITWVGCVYEKFESMCLEVEENMYEDTVKFVEDQVQTVGESVKKFYADVMQDLLC 62 Query: 1184 PSCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSL-R 1008 S +D V+ G + Y+ D +KSK KKKE + ++ + + Sbjct: 63 DSSLDRDDVSAGGFPVEHYSDVDNSKSKIR--------KKKEHVKAGVEEVKGDSEVISA 114 Query: 1007 VHNDANHLSSLSPRGLVENTHSDVCFTKSK----KVGVYKRPIGI----KRISQNNHPSK 852 V D +H GL TKS K+ ++ G+ K+I P K Sbjct: 115 VLKDVDH------TGLFHRQRVYDSCTKSSGNCAKLACSRQDHGVRSCNKKIVVRETPIK 168 Query: 851 ISRP--MTSLSGDKSR--LLVASDDMNVTTSVRC-HPGEAVNNITPISEACVESLASDKI 687 P T++ D SR L S+ N C P E + P +S+ S+ Sbjct: 169 DRLPGANTAVGKDFSRESLSSCSEFSNEDRDTSCDQPDEVITPSKPPEGMRCDSM-SESC 227 Query: 686 LIAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQ 507 ++A + + DD +SD I+L D D K+ + Sbjct: 228 VVANASQCTGDDVSVNCQSSDMIVL------DNSD------------------GKRWNEL 263 Query: 506 CTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 327 S+ GLST+ GGS INP++ + +I + G ++ Sbjct: 264 LDSSIGGLSTE-----LNGGS---INPSMDAIESNIGTHGTEI----------------- 298 Query: 326 AVEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHC 147 ++ + KLE+TC++V G+ LHFV +K YKKKI +A +S+ S RKQ+ Sbjct: 299 -IQQSDKPKLEETCVMVSGEDLHFVHHTVANYKPYKKKIPKAFTSRTSSARKQE-----Y 352 Query: 146 KDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30 + L + G T LE + + P HD ES+WEIL Sbjct: 353 EQLALWH--GHHTKSILEGGEESKKSPTHDFCESEWEIL 389 >ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594335 isoform X1 [Solanum tuberosum] Length = 260 Score = 148 bits (374), Expect = 5e-33 Identities = 107/291 (36%), Positives = 141/291 (48%), Gaps = 4/291 (1%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD K I+W GNIYQKFETMCLE+EE MY+DTVKYVENQV VG +VK+F SEVMQD+ P Sbjct: 1 
MDLKSISWVGNIYQKFETMCLEMEEAMYQDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQ 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKS-KPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1005 +DPVKVA DLS+NPYAH +++K K + S F K Sbjct: 61 CNIDPVKVAAADLSINPYAHYEIDKKLKANLKGSARRFSNK------------------- 101 Query: 1004 HNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRP-IGIKRISQNNHPSKISRPMTSL 828 N + V KSK GVYKR +GIK I +++HP+K + Sbjct: 102 ----------------LNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHPAKKPNAICLA 145 Query: 827 SGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKEEK--D 654 SGD +L ++S G +ASD + + ++ K D Sbjct: 146 SGDALKL---------SSSAEVRGG--------------FEMASDHVTLTSALASVKGSD 182 Query: 653 DSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCT 501 E AS D+ + D S AS +S ES+R+K+ D+ CT Sbjct: 183 SGEAASKVRDHFI---QTNVSAADTSITSEAS-VTMSVESVRKKQTDT-CT 228 >gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 343 Score = 147 bits (370), Expect = 1e-32 Identities = 128/406 (31%), Positives = 193/406 (47%), Gaps = 13/406 (3%) Frame = -3 Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 860 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 680 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 503 
TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 323 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKL 186 V+ ++ +++++C +V+G +LHF Q KHK+Y++KIR+A+SS++ Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRM 343 >ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata] Length = 418 Score = 144 bits (362), Expect = 1e-31 Identities = 126/421 (29%), Positives = 199/421 (47%), Gaps = 22/421 (5%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+QDLLP Sbjct: 1 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPD 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHT-DLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRV 1005 VD K P + L+ YA K + +M + K+++ E D A+K Sbjct: 61 DSVDSGKPLPVSM-LHEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKF---- 115 Query: 1004 HNDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRPMT 834 RGL + + D+C + + G Y+R +G K+I + S+++RP Sbjct: 116 ------------RGLDADDY-DICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRPY- 161 Query: 833 SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 678 + D S L + DD +N ++ H +++ ++ + + + S + I Sbjct: 162 -MQKDSSSLSMVHSARVKDDVGTVNSSSLSMVHSARVKDDVGTVNSSSLTMVHSAR--IK 218 Query: 677 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 498 + V K I E K DK + + L+ + ++ D Sbjct: 219 DDVGTVKSSDSPPGEVEKLIYKKECQKDDK-------TKNQQSLTVVNSVKRNDSEIRID 271 Query: 497 ADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPC-YEDNFDMEVI------ 339 +HGL S+I P++ ++ + G D C E N D + Sbjct: 272 NEHGL-------MGDSSQDSEIQPSVATSL----AAGSD--DCRKETNVDTKTSSSSVSE 318 Query: 338 ENEEAVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQ 168 + E ++P+ +E++CILVD D+ H V +KHK Y KKIR+A+SS+++ R++ Sbjct: 319 QKSEILQPLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREK 
377 Query: 167 D 165 + Sbjct: 378 E 378 >gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 341 Score = 139 bits (350), Expect = 3e-30 Identities = 127/406 (31%), Positives = 191/406 (47%), Gaps = 13/406 (3%) Frame = -3 Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 860 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 680 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504 ES E K S+ D + L V++++ + C+S + + S L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 503 TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 323 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKL 186 V+ ++ +++++C +V+G +LHF Q KHK+Y +IR+A+SS++ Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTY--QIRDAISSRM 341 >ref|XP_006294244.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella] gi|482562952|gb|EOA27142.1| hypothetical protein CARUB_v10023243mg, partial [Capsella rubella] Length = 436 Score = 139 bits (349), Expect = 4e-30 Identities = 128/418 (30%), Positives = 200/418 (47%), Gaps = 19/418 (4%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQV 
VG SVKKF S+V+QDLLP Sbjct: 13 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVHTVGNSVKKFCSDVVQDLLP- 71 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKK-EIENEDISDLIAEKSSLRV 1005 D V G P + LN+ P FKKK E N D+ E+ Sbjct: 72 ---DDDSVGSG----KPLPVSMLNEYAPVC-----SFKKKRESANRKTRDVKQEEEVTEG 119 Query: 1004 HNDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKR-PIGIKRISQNNHPSKISRPMT 834 D +++ RGL + + D+C + + G Y+R +G K+I + S+I+RP Sbjct: 120 KKDG---CAMNLRGLDADDY-DICTSPRQYSYGGPYRRGRVGRKQIFKKEELSQITRPY- 174 Query: 833 SLSGDKSRLLV-----ASDD---MNVTTSVRCHPGEAVNNITPISEACVESLASDKILIA 678 + D S L + DD +N ++ H G +++ ++ + + + S + I Sbjct: 175 -IQKDSSNLTMVHSARVKDDVGTVNSSSLSMAHSGRVKDDVGTVNSSSLSMVHSAR--IK 231 Query: 677 ESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTS 498 V+ K I E K D+ D + L+ + + KD T Sbjct: 232 ADVETVKSSDSRPGEIERLISKKECQKDDRTD-------NQHGLTMVNSVRSKDSEIRTE 284 Query: 497 ADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVI----ENE 330 +H L+ ++ R S+I P++ ++ S E ED+ + + Sbjct: 285 IEHSLTV--VNSVR--SQDSEILPSVATSLL-TGSSNEFRKETKEDSMEASSSSVSEQKS 339 Query: 329 EAVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD 165 E ++ + +E++CILVD D+ H V +KHK Y KKIR+A+SS+++ R+++ Sbjct: 340 EILQHLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPY-KKIRDAISSRMKQNREKE 396 >ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus] Length = 379 Score = 138 bits (348), Expect = 5e-30 Identities = 129/452 (28%), Positives = 201/452 (44%), Gaps = 8/452 (1%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MD KGI W G +Y+KFETMCLEVE+++ +DTVKYVENQV+ VG SVK+FYS+VMQD LPP Sbjct: 1 MDVKGIAWVGRLYEKFETMCLEVEDIICQDTVKYVENQVEVVGASVKRFYSDVMQDFLPP 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSS---- 1014 S + KVA + +L Y + + K KPTM K E ++ + S + A+ Sbjct: 61 SELSDEKVAVCNSALENYENVVICK-KPTMGMKIERSKFSEEKSNENSKVTADAKRDIAC 119 Query: 1013 --LRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRISQN-NHPSKISR 843 R HN AN+L +S S ++ Y R K+ +N +H + Sbjct: 120 
KLPRGHNHANYLYLVS---------SPYSAANRAQIDGYSR----KKDDENIHHKIDLDG 166 Query: 842 PMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILIAESVKE 663 ++ G KS L + N+ + SEA E + + ++ + Sbjct: 167 RESTTRGCKS--LTETSPTNLEKKYENDASSCCTILNRKSEASSELAGNMETMLVK---- 220 Query: 662 EKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSADHGL 483 D+ C SV Q A++ ++ +++ S + Sbjct: 221 ---DTRC-----------NSVMQS---------ANETEIKTDNILPDTPSSAIVDTE--- 254 Query: 482 STKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEAVEPVETS 303 K G S ++++ S++W ++ I E+ + ++ + + Sbjct: 255 --KETRLLSYGDSSAELDGR--SDSWSLDDI--------------ELEQGTHNIQQADET 296 Query: 302 KL-EDTCILVDGDKLHFVSQGTEKHKSYKKKIREALSSKLRSTRKQDPCVSHCKDLGGQN 126 KL E+ C+LV GD LHF K + Y KKI A S +S RKQ+ K+L ++ Sbjct: 297 KLDEEACVLVKGDDLHFDFNEEVKQRHY-KKIAGAFSFTKKSKRKQE-----YKELAMKH 350 Query: 125 NGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30 G TIP D++ L D E DW++L Sbjct: 351 GYGFGTIP---NQQDEQKLTAEDVLEQDWQLL 379 >ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6 [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2| expressed protein [Arabidopsis thaliana] gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6 [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1| uncharacterized protein AT2G31130 [Arabidopsis thaliana] Length = 419 Score = 136 bits (342), Expect = 2e-29 Identities = 136/466 (29%), Positives = 208/466 (44%), Gaps = 22/466 (4%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 MDFKGI W GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG SVKKF S+V+ DLLP Sbjct: 1 MDFKGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPD 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002 VD K P + L+ YA Y KKK+ N D+ E+ Sbjct: 61 ESVDSGKPLPVSM-LHEYAPV------------YSFKKKKDSMNRKTKDVTQEQEVTEGK 107 Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRP--- 840 D + RGL + + D+C + + G Y+R IG K+I + S++ RP Sbjct: 108 
KDG---FAKKLRGLDADDY-DICTSPRQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQ 163 Query: 839 --MTSLSGDKSRLL------VASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKIL 684 +TSLS S + V S +++ S R + N + +S S+ D Sbjct: 164 KDLTSLSMVHSARVKDDLGTVNSSSLSMVHSARVNDDVGTVNSSSLSMVHHASMKDDVGT 223 Query: 683 IAESVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKK-LSAESLRQKKDDSQ 507 + S + + S K+ C+ +A +++ L+ + + D Sbjct: 224 VKSSDSPPGEVEKLIS---------------KKKCQKDDKAKNQQSLTVVNSVKSNDSEV 268 Query: 506 CTSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEE 327 +HGLS + +I P++ ++ ES E + E Sbjct: 269 IVDNEHGLSADKSVRSQD----LEIQPSLATSL-PAESDDCRKETNVETSSSSVSEPKSE 323 Query: 326 AVEPVETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD--- 165 ++ + +E++CILVD D+ H V +KHK Y KKIR+A+SS+++ R+++ Sbjct: 324 ILQHLSGRSVEESCILVDRDEFHSVFPDKMENDKHKPY-KKIRDAISSRMKQNREKEYKR 382 Query: 164 -PCVSHCKDLGGQNNGGVTTIPALEMDSDKRNLPVHDSFESDWEIL 30 + +D+ G P E S S ES+WE+L Sbjct: 383 LARQWYAEDVENGRECGDNPKPIEENQS---------SEESEWELL 419 >ref|XP_006410238.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] gi|567211021|ref|XP_006410239.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] gi|557111407|gb|ESQ51691.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] gi|557111408|gb|ESQ51692.1| hypothetical protein EUTSA_v10016698mg [Eutrema salsugineum] Length = 426 Score = 135 bits (341), Expect = 3e-29 Identities = 125/414 (30%), Positives = 200/414 (48%), Gaps = 15/414 (3%) Frame = -3 Query: 1361 MDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYSEVMQDLLPP 1182 M FKGITW GN+YQKFE MCLEVEE++ +DT KYVENQVQ VG S+KKF S+V+ D LP Sbjct: 1 MAFKGITWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSMKKFCSDVVGDFLPD 60 Query: 1181 SCVDPVKVAPGDLSLNPYAHTDLNKSKPTMLDSYGEFKKKEIENEDISDLIAEKSSLRVH 1002 V K P + L+ YA K KK+E N D+ E+ Sbjct: 61 ESVGSEKPLPVSM-LHEYAPVCSFK------------KKRESLNRKTRDVKQEQEVSEGK 107 Query: 1001 NDANHLSSLSPRGLVENTHSDVCFTKSKKV--GVYKRP-IGIKRISQNNHPSKISRPMTS 831 D + RGL + + D+C + + G Y+R +G K+I +N +++RP + Sbjct: 108 
KDG---CEMKFRGLDADDY-DICTSPRQYSYGGPYRRTRLGRKQIYKNEEVFQVTRP-SY 162 Query: 830 LSGDKSRLLV-----ASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDK---ILIAE 675 + D S L + ++D+ S P E I+ E C + ++ + + Sbjct: 163 IQKDSSSLSMVHRSRVNNDVGAVKSSDSPPVEVERLIS--KEECQKDDRTENQHGLTVVN 220 Query: 674 SVKEEKDDSECASHASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQCTSA 495 SV+ + DSE + + + +SV+ +D E ++ S+R +D Sbjct: 221 SVRSQ--DSETRTKKEHGLTMVDSVR--SQDSETRTKNEHGLTMVNSVR-SEDSEIGIEN 275 Query: 494 DHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEAVEP 315 +HGL+ ++ R S Q + + S + E E + + E ++ Sbjct: 276 EHGLTV--VNSGRCQDSEIQTSVSTSSPAGSDDCRKETNENSMETSSSSVSEQKSEILQE 333 Query: 314 V-ETSKLEDTCILVDGDKLHFV---SQGTEKHKSYKKKIREALSSKLRSTRKQD 165 + E LE++CI+VD D+LH V + +KHK Y KKIR+A+SS+++ R+++ Sbjct: 334 LSEGRSLEESCIIVDRDELHCVFPDRKENDKHKPY-KKIRDAISSRMKQNREKE 386 >gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508700926|gb|EOX92822.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 334 Score = 133 bits (334), Expect = 2e-28 Identities = 122/395 (30%), Positives = 182/395 (46%), Gaps = 13/395 (3%) Frame = -3 Query: 1364 TMDFKGITWAGNIYQKFETMCLEVEEVMYEDTVKYVENQVQKVGVSVKKFYS----EVMQ 1197 +MD KGITW G++Y+KFE MCLEVEEVMY+DTVKYVEN+VQ VG SVKKFYS +VMQ Sbjct: 3 SMDLKGITWVGHVYEKFEAMCLEVEEVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQ 62 Query: 1196 DLLPPSCVDPVK-VAPGDLSLNPYAHTDLNKSKPTMLDS--YGEFKKKEIENEDISDLIA 1026 DLL PS ++P+K VA DL + YA T L K + + G+ ++ ++E I+D+ Sbjct: 63 DLLLPSSLEPMKAVAASDLPVEIYAET-LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNE 121 Query: 1025 E----KSSLRVHNDANHLSSLSPRGLVENTHSDVCFTKSKKVGVYKRPIGIKRIS-QNNH 861 SS ++H N S S VE SD+ G + + + + ++ Sbjct: 122 NAAHVPSSCQLHMVDNIFESCS-GSFVERASSDLL------SGEHNNRCTLNKTNVEHLL 174 Query: 860 PSKISRPMTSLSGDKSRLLVASDDMNVTTSVRCHPGEAVNNITPISEACVESLASDKILI 681 P++ S + + R+ + N V CH A +TP+S VE D I Sbjct: 175 PAETSSEAGCVENEFGRMSSFCGNANANHEVSCHQIPA--TLTPVS---VEEDDCDS--I 227 Query: 680 AESVKEEKDDSECASH-ASDNILLAESVKQDKEDCECASRASDKKLSAESLRQKKDDSQC 504 ES E K S+ D + L V++++ + C+S + + S 
L KD S Sbjct: 228 EESSNEIKSASDSVPEILPDGLHLVGIVEKNEMEMRCSSSIIESEESNGKLNWTKDAS-- 285 Query: 503 TSADHGLSTKPIDEFRQGGSFSQINPNIVSNTWDIESIGEDVFPCYEDNFDMEVIENEEA 324 G ST E E Sbjct: 286 -----GSSTVGRKEI-------------------------------------------ET 297 Query: 323 VEPVETSKLEDTCILVDGDKLHFVSQGTEKHKSYK 219 V+ ++ +++++C +V+G +LHF Q KHK+Y+ Sbjct: 298 VQQLDKIRVDESCFMVNGAELHFHPQREGKHKTYQ 332