BLASTX nr result
ID: Cinnamomum23_contig00005310
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00005310
         (1514 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_010919192.1| PREDICTED: uncharacterized protein LOC105043...  168   9e-39
ref|XP_011043605.1| PREDICTED: uncharacterized protein LOC105139...  166   6e-38
ref|XP_010919199.1| PREDICTED: uncharacterized protein LOC105043...  165   1e-37
ref|XP_010657010.1| PREDICTED: uncharacterized protein LOC100242...  160   3e-36
emb|CBI21908.3| unnamed protein product [Vitis vinifera]             160   3e-36
ref|XP_006372766.1| hypothetical protein POPTR_0017s04850g [Popu...  160   3e-36
ref|XP_010260341.1| PREDICTED: uncharacterized protein LOC104599...  158   9e-36
ref|XP_007043708.1| Uncharacterized protein isoform 2 [Theobroma...  155   6e-35
ref|XP_012477223.1| PREDICTED: uncharacterized protein LOC105792...  155   8e-35
ref|XP_006852570.2| PREDICTED: uncharacterized protein LOC184422...  154   1e-34
ref|XP_010260342.1| PREDICTED: uncharacterized protein LOC104599...  154   1e-34
ref|XP_002517843.1| conserved hypothetical protein [Ricinus comm...  154   1e-34
emb|CDP15879.1| unnamed protein product [Coffea canephora]           153   4e-34
ref|XP_006592445.1| PREDICTED: uncharacterized protein LOC102660...  150   2e-33
ref|XP_003526252.1| PREDICTED: uncharacterized protein LOC100790...  150   3e-33
ref|XP_010093966.1| hypothetical protein L484_010532 [Morus nota...  150   3e-33
ref|XP_012088204.1| PREDICTED: uncharacterized protein LOC105646...  149   4e-33
ref|XP_004287059.1| PREDICTED: uncharacterized protein LOC101291...  148   1e-32
ref|XP_006469074.1| PREDICTED: lisH domain-containing protein C1... 
146 4e-32 ref|XP_006469075.1| PREDICTED: lisH domain-containing protein C1... 144 1e-31 >ref|XP_010919192.1| PREDICTED: uncharacterized protein LOC105043365 isoform X1 [Elaeis guineensis] Length = 506 Score = 168 bits (426), Expect = 9e-39 Identities = 150/461 (32%), Positives = 204/461 (44%), Gaps = 19/461 (4%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAH----------SXXXXXXXXXXXXXXXXXXPEKTLNPKSH 1250 MD KASAKSKR H+ Q ++ H + + +S Sbjct: 1 MDPKASAKSKRNHSHQGRKNHPTPAATSAQKKKPAPAAVAAAAAGAGEVATPRRAHARSR 60 Query: 1249 LLPSNWDRYDDDEVFGNEGSD-KSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRIN 1073 LPSNWDRYDDD + G +S A+ A G PKSKGADF L+ QA+ + + Sbjct: 61 DLPSNWDRYDDDGDGDDSGDGAESSAGAKRADGEIRPKSKGADFRFLVEQARSQPQDHRD 120 Query: 1072 PDEXXXXXXXXXXXXFY-QGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDL 896 P Y QG+SSMLSV+G SLLS DDNFIVDD TSS E + LS+DL Sbjct: 121 PGTSQSAFSLDELPSDYIQGISSMLSVRGESLLSWCADDNFIVDDDSTSSCEVNLLSMDL 180 Query: 895 HALAAQLEKVDVSERLFIENDLLADELG-EQLKEMSSSRSNYGETSHEGSENINLGYEHG 719 HALAAQL K+ +S+RLFIE DLL +EL ++LK + T E ++++ G HG Sbjct: 181 HALAAQLSKLKLSQRLFIEEDLLPEELHIDELKVNQIFEQSETPTMSEHKDSLSQGRFHG 240 Query: 718 ETSYAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELF 539 + +E++ + S + + V +K T Q+ K + + DS Sbjct: 241 NSE-------LEKSVDGQIDHWNSCNVHGITREAVVEKCQTQSPTGQATKFDLSNDSIPA 293 Query: 538 HQNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAE 359 + REV SQL T D+ K+ +T F+AA AE Sbjct: 294 GLSGRREVQGSVSQLSK----------------HTVADL-------KQNRTSRFEAAAAE 330 Query: 358 AELDMLLDSFGETKLLDS------VDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVR 197 ELD+L SF ET+L S D S ++ F + P Sbjct: 331 EELDVLFSSFSETRLSSSHSDGITNDASTSHNATFNSS---------------VHMSPPS 375 Query: 196 EVPDSFNSPAMTNALDCSIDDLLARTPIVSNQDHGVHHPLE 74 D +S +L +IDDLLA T + N + V P E Sbjct: 376 VGQDLNSSGNAGTSLADAIDDLLAETSLSLNDQNTVCPPNE 416 >ref|XP_011043605.1| PREDICTED: uncharacterized protein LOC105139018 isoform X1 [Populus euphratica] Length = 474 Score = 166 bits (419), Expect = 6e-38 Identities = 158/461 (34%), Positives = 206/461 
(44%), Gaps = 25/461 (5%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHLLPSNWDRYD 1220 MD KA AKSKRAHTLQ + P+ LPSNWDRY Sbjct: 1 MDTKALAKSKRAHTLQHNKGKKPHPNQKPSKTPSTG------NNQKPQKSKLPSNWDRYG 54 Query: 1219 DDEV--FG----NEGSDKSLVDA--RLAQGIAAPKSKGADFSKLIAQAKEDARSRINPDE 1064 DDE FG N D S + G+A PKSKGADF L+ ++A+S+ + + Sbjct: 55 DDEEDEFGVNLENPSGDNSKKPSFKDYGDGLALPKSKGADFRYLL----DEAKSKPHQVD 110 Query: 1063 XXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHALA 884 GV +L+V+G S+LS DDNF+V+D TSS+EASFLSL+LHALA Sbjct: 111 DFPFLEHFLAEESMHGVGPLLAVRGESILSWIGDDNFVVEDETTSSHEASFLSLNLHALA 170 Query: 883 AQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEHGETSYA 704 QL KVDVSERLFIE DLL ELG +SS + + GSE G HG Sbjct: 171 EQLAKVDVSERLFIEADLLPTELGSN----TSSSQEFDQMQTTGSE---AGGNHGPNRKQ 223 Query: 703 VGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQNAA 524 TH+ E T++ + E L S N D + F SG +D F Q Sbjct: 224 T-THDKE-TKTISGE-LTFEDLSEKNKAVNQDAEIF-----VSGLTIGNSDPISFIQG-- 273 Query: 523 REVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAELDM 344 L K L+ + + +PA+ L ++ F+AA AE+ELDM Sbjct: 274 ---LDVKDNLNLNQHGKFNQSAAMESPAQ-----LYACSVAPSSRLPAFEAAAAESELDM 325 Query: 343 LLDSFGETKLLDS-------VDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPD 185 LLDS ETKLLDS + +SE++++ P Q R P Sbjct: 326 LLDSLSETKLLDSSGFGSGTLPVSEKEAA-VPLPQL------------------TRNAPG 366 Query: 184 SFNSPAMTNALDCSIDDLL----------ARTPIVSNQDHG 92 S + LD +DDLL A P+++ HG Sbjct: 367 SAKTTPTAATLDNVLDDLLEESSDLQEAAAPLPLLARNAHG 407 >ref|XP_010919199.1| PREDICTED: uncharacterized protein LOC105043365 isoform X2 [Elaeis guineensis] Length = 497 Score = 165 bits (417), Expect = 1e-37 Identities = 150/460 (32%), Positives = 203/460 (44%), Gaps = 18/460 (3%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAH----------SXXXXXXXXXXXXXXXXXXPEKTLNPKSH 1250 MD KASAKSKR H+ Q ++ H + + +S Sbjct: 1 MDPKASAKSKRNHSHQGRKNHPTPAATSAQKKKPAPAAVAAAAAGAGEVATPRRAHARSR 60 Query: 1249 LLPSNWDRYDDDEVFGNEGSD-KSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRIN 1073 
LPSNWDRYDDD + G +S A+ A G PKSKGADF L+ QA+ + + Sbjct: 61 DLPSNWDRYDDDGDGDDSGDGAESSAGAKRADGEIRPKSKGADFRFLVEQARSQPQDHRD 120 Query: 1072 PDEXXXXXXXXXXXXFY-QGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDL 896 P Y QG+SSMLSV+G SLLS DDNFIVDD TSS E + LS+DL Sbjct: 121 PGTSQSAFSLDELPSDYIQGISSMLSVRGESLLSWCADDNFIVDDDSTSSCEVNLLSMDL 180 Query: 895 HALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEHGE 716 HALAAQL K+ +S+RLFIE DLL +EL + E S + T E ++++ G HG Sbjct: 181 HALAAQLSKLKLSQRLFIEEDLLPEEL---IFEQSET-----PTMSEHKDSLSQGRFHGN 232 Query: 715 TSYAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFH 536 + +E++ + S + + V +K T Q+ K + + DS Sbjct: 233 SE-------LEKSVDGQIDHWNSCNVHGITREAVVEKCQTQSPTGQATKFDLSNDSIPAG 285 Query: 535 QNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEA 356 + REV SQL T D+ K+ +T F+AA AE Sbjct: 286 LSGRREVQGSVSQLSK----------------HTVADL-------KQNRTSRFEAAAAEE 322 Query: 355 ELDMLLDSFGETKLLDS------VDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVRE 194 ELD+L SF ET+L S D S ++ F + P Sbjct: 323 ELDVLFSSFSETRLSSSHSDGITNDASTSHNATFNSS---------------VHMSPPSV 367 Query: 193 VPDSFNSPAMTNALDCSIDDLLARTPIVSNQDHGVHHPLE 74 D +S +L +IDDLLA T + N + V P E Sbjct: 368 GQDLNSSGNAGTSLADAIDDLLAETSLSLNDQNTVCPPNE 407 >ref|XP_010657010.1| PREDICTED: uncharacterized protein LOC100242390 [Vitis vinifera] gi|731408881|ref|XP_010657011.1| PREDICTED: uncharacterized protein LOC100242390 [Vitis vinifera] Length = 429 Score = 160 bits (404), Expect = 3e-36 Identities = 132/362 (36%), Positives = 183/362 (50%), Gaps = 20/362 (5%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ-AKRAHSXXXXXXXXXXXXXXXXXXPE--KTLNPKSHL------ 1247 MDAKA AKSKRAH+ +KR HS + K + K H Sbjct: 1 MDAKALAKSKRAHSQHHSKRPHSNKTSKAPSAGNVGAGNAKKQPGKQIREKPHQSMGLSR 60 Query: 1246 LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 LPSNWDRY+++ G+EG S+ A + PKSKGAD+ +LI++A +RS NP Sbjct: 61 LPSNWDRYEEEFDSGSEGP--SINSTNQANDVIVPKSKGADYGELISEAISQSRS--NPY 116 Query: 1066 EXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHAL 887 F QGV 
S+LSV+G +LS D+NFIV+D T+S+EA FLSL+LH+L Sbjct: 117 FDSFASLDDVVPDFNQGVGSLLSVRGQGILSWIGDNNFIVEDRATTSHEAPFLSLNLHSL 176 Query: 886 AAQLEKVDVSERLFIENDLLADEL----GEQLKEMSSSRSNYGETSHEGSENI--NLGYE 725 A QL KVD+S+RLF+E DLL+ EL E +K S+ +N + + EG++ I Sbjct: 177 AEQLTKVDLSQRLFVEEDLLSPELMSVSSEGVKVSSNQEANQMQRTSEGAKIIVDESAVR 236 Query: 724 HGETSYAVGTHNMEQTRSN----NNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPA 557 + N E S+ N ++ SP+ S + +V DK Q G+ Sbjct: 237 SFPEKDKIVDKNKEVMSSDTTRIRNPVISSPNQSAKSENQVKDKAK------QFGRAAQT 290 Query: 556 TDSELFHQNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINE-KETKTLG 380 D EL A ++ K + A+ EK A AE ELDMLL++ NE + +LG Sbjct: 291 RDLEL-----AAQINKV-----SVADPEKKQSVFEAAAAEAELDMLLDSFNETNKFDSLG 340 Query: 379 FK 374 FK Sbjct: 341 FK 342 >emb|CBI21908.3| unnamed protein product [Vitis vinifera] Length = 453 Score = 160 bits (404), Expect = 3e-36 Identities = 132/362 (36%), Positives = 183/362 (50%), Gaps = 20/362 (5%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ-AKRAHSXXXXXXXXXXXXXXXXXXPE--KTLNPKSHL------ 1247 MDAKA AKSKRAH+ +KR HS + K + K H Sbjct: 25 MDAKALAKSKRAHSQHHSKRPHSNKTSKAPSAGNVGAGNAKKQPGKQIREKPHQSMGLSR 84 Query: 1246 LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 LPSNWDRY+++ G+EG S+ A + PKSKGAD+ +LI++A +RS NP Sbjct: 85 LPSNWDRYEEEFDSGSEGP--SINSTNQANDVIVPKSKGADYGELISEAISQSRS--NPY 140 Query: 1066 EXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHAL 887 F QGV S+LSV+G +LS D+NFIV+D T+S+EA FLSL+LH+L Sbjct: 141 FDSFASLDDVVPDFNQGVGSLLSVRGQGILSWIGDNNFIVEDRATTSHEAPFLSLNLHSL 200 Query: 886 AAQLEKVDVSERLFIENDLLADEL----GEQLKEMSSSRSNYGETSHEGSENI--NLGYE 725 A QL KVD+S+RLF+E DLL+ EL E +K S+ +N + + EG++ I Sbjct: 201 AEQLTKVDLSQRLFVEEDLLSPELMSVSSEGVKVSSNQEANQMQRTSEGAKIIVDESAVR 260 Query: 724 HGETSYAVGTHNMEQTRSN----NNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPA 557 + N E S+ N ++ SP+ S + +V DK Q G+ Sbjct: 261 SFPEKDKIVDKNKEVMSSDTTRIRNPVISSPNQSAKSENQVKDKAK------QFGRAAQT 314 Query: 556 TDSELFHQNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINE-KETKTLG 380 D EL A ++ K + A+ EK A AE 
ELDMLL++ NE + +LG Sbjct: 315 RDLEL-----AAQINKV-----SVADPEKKQSVFEAAAAEAELDMLLDSFNETNKFDSLG 364 Query: 379 FK 374 FK Sbjct: 365 FK 366 >ref|XP_006372766.1| hypothetical protein POPTR_0017s04850g [Populus trichocarpa] gi|550319414|gb|ERP50563.1| hypothetical protein POPTR_0017s04850g [Populus trichocarpa] Length = 474 Score = 160 bits (404), Expect = 3e-36 Identities = 159/501 (31%), Positives = 223/501 (44%), Gaps = 35/501 (6%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHLLPSNWDRYD 1220 MD KA AKSKRAHTLQ + P+ LPSNWDRY+ Sbjct: 1 MDTKALAKSKRAHTLQHNKGKKPHPNQNPSKTPSTG------NNQKPQKSKLPSNWDRYE 54 Query: 1219 DDEV--FG----NEGSDKSLVDA--RLAQGIAAPKSKGADFSKLIAQAKEDARSRINPDE 1064 DDE FG N D S + G+A PKSKGADF L+ ++A+S+ + + Sbjct: 55 DDEEDEFGVNLENPSGDNSKKPSFKDYGDGLALPKSKGADFKYLL----DEAKSKPHQVD 110 Query: 1063 XXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHALA 884 GV +L+V+G S+LS DDNF+V+D TSS+EASFLSL+LHALA Sbjct: 111 DFPFLEGFLAEESMHGVGPLLAVRGESILSWIGDDNFVVEDETTSSHEASFLSLNLHALA 170 Query: 883 AQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSE-NINLGYEHGETSY 707 QL KVDVSERLFIE DLL ELG +SS + + GSE + N G +T++ Sbjct: 171 EQLAKVDVSERLFIEADLLPTELGSN----TSSSQEFDQMQTTGSEASSNHGPNRKQTTH 226 Query: 706 AVGTHNME-----QTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSEL 542 T + + S N+ + + + + + P K N + Sbjct: 227 DKETKTISGELTFEDFSEKNKAVNQDAEIFVSGLTIGNSDPISFIQGLDVKDNLNLNQH- 285 Query: 541 FHQNAAREVLKPKSQLDATAELEKTTLRS-GATPAETELDMLLNTINE-KETKTLGFKAA 368 ++ R ++ +Q A++ + L + A AE+ELDMLL++++E K + GF + Sbjct: 286 -GKSNQRTAMESPAQFYASSVAPNSRLPTFEAAAAESELDMLLDSLSEAKLLDSSGFGSG 344 Query: 367 T---AEAELDMLLDSF-----GETK------LLDSV-DISEEQSSNFPRAQXXXXXXXXX 233 T +E E + L G K LD+V D E++SN A Sbjct: 345 TLPVSEKEAAVPLPQLTRNAPGSAKTTPTAATLDNVLDDLLEETSNLQEAAAPLPLL--- 401 Query: 232 XXXXXXXXKPVREVPDSFNSPAMTNALDCSIDDLLARTPIVSNQDHGVHHPLE----XXX 65 R S + + LD +DDL T +SNQ++ +H P E Sbjct: 402 ----------ARNAHGSLKTTSTAATLDDVLDDLFEETSSLSNQNN-LHQPSEKKADHVI 450 Query: 64 XXXXXXXXXXXXXLDDFDSWL 2 LDDFDSWL 
Sbjct: 451 QSSSSQSVNKSKVLDDFDSWL 471 >ref|XP_010260341.1| PREDICTED: uncharacterized protein LOC104599483 isoform X1 [Nelumbo nucifera] Length = 437 Score = 158 bits (400), Expect = 9e-36 Identities = 149/491 (30%), Positives = 214/491 (43%), Gaps = 25/491 (5%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ---------AKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHL 1247 MDAKA AKSKRAH+ A +A + + S Sbjct: 1 MDAKALAKSKRAHSQHHSKKSHASPASKAPAVAASAGNSKKPSAKQTREKNRQFRGSSTA 60 Query: 1246 LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 LPSNWDRY+++ G+E D SL + PKSKGADF LI++A+ +S + Sbjct: 61 LPSNWDRYEEEYDSGSE--DPSLGGTSRTSDVVVPKSKGADFRYLISEAQSQLQSPSDLS 118 Query: 1066 EXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHAL 887 F QGVS++LS +G ++LS +DNF V+D+ET+S +ASFLS+DLHAL Sbjct: 119 LESFDSFGGFLPGFNQGVSTVLSARGKNILSWIGNDNFAVEDNETAS-QASFLSMDLHAL 177 Query: 886 AAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYGETSHEGSENINLGYEHGETS 710 A QL KVDVS+RLFI+ LL E+ E L++ ++ E +HE + + + + Sbjct: 178 AEQLAKVDVSQRLFIDAYLLPPEMHSEGLQKSKCQDYDHTEATHESEADDH----YLDKM 233 Query: 709 YAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQN 530 G+ N E N ++ P+ ++ VH P DS Q Sbjct: 234 EFHGSANGEDIMGNRPDISPA------TTENVHSVPALLPEGSMLVNLAKGGDSTQVGQT 287 Query: 529 AAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAEL 350 + + Q + + ++++ KE K F+AA AEAEL Sbjct: 288 CPTKFMNSMEQPNRS-----------------------SSVDLKENKPSRFEAAAAEAEL 324 Query: 349 DMLLDSFGETKLL----------DSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPV 200 DMLLDSFGETKL S S++Q S FP+ P Sbjct: 325 DMLLDSFGETKLFYSGFPVVKQEPSHVSSQQQVSGFPQ--------------------PS 364 Query: 199 REVPDSFNSPAMTNALDCSIDDLLARTPIVSNQDHGVHHPLE-----XXXXXXXXXXXXX 35 + PD+ + + LD +ID+ L T +NQ++ + E Sbjct: 365 VQAPDASKNASGAFDLDNAIDE-LRETSNPTNQNNAMRDQQEKAVRRNPPSDSSLASAHK 423 Query: 34 XXXLDDFDSWL 2 LDDFDSWL Sbjct: 424 SSVLDDFDSWL 434 >ref|XP_007043708.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590691174|ref|XP_007043709.1| Uncharacterized protein isoform 2 [Theobroma cacao] 
gi|508707643|gb|EOX99539.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508707644|gb|EOX99540.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 465 Score = 155 bits (393), Expect = 6e-35 Identities = 149/460 (32%), Positives = 204/460 (44%), Gaps = 27/460 (5%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ-AKRAHSXXXXXXXXXXXXXXXXXXPE--KTLNPKSH------L 1247 MDAKA AKSKRAH+ +K+ HS + K + K+H Sbjct: 1 MDAKALAKSKRAHSQHHSKKPHSSQKPKPPLVGGNDAANAKKQTGKQIREKTHQAQRVSA 60 Query: 1246 LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 LPSNWD Y+++ G+E D+S + PKSKGADF LIA+A+ S D Sbjct: 61 LPSNWDHYEEEFDSGSE--DQSGDSTSQVPDVVLPKSKGADFHHLIAEAQSQLESNPYTD 118 Query: 1066 EXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHAL 887 Q V MLSV+G +LS +DNF+V+D T+++ ASFLSL+LHAL Sbjct: 119 SLCSSDDILPGDFN-QFVGIMLSVRGEGILSLIQNDNFVVEDRTTATHAASFLSLNLHAL 177 Query: 886 AAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYGETSHEG-------------- 752 A QLEKV++SERLFIE DLL+ EL E K S+ S+ +T+ EG Sbjct: 178 AEQLEKVNLSERLFIEEDLLSPELHAEGSKANSNQESDQMQTTSEGKAAAQITEELTLND 237 Query: 751 -SENINLGYEHGE-TSYAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQ 578 ++ +N+ ++ E S++ G+ +++ T SN L +D DK Sbjct: 238 STDKVNIAAKNVEHISFSSGSKSVDATLSNEG-LDSVDEVYSDFISSQRDK--------- 287 Query: 577 SGKPNPATDSELFHQNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEK 398 SGK S + N+A K S +A A AE ELDMLLN Sbjct: 288 SGKSRALESSTHDNSNSASVPNKKVSTFEAVA-------------AEAELDMLLN----- 329 Query: 397 ETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXX 218 SF ETKLLDS + ++SSN + Sbjct: 330 ---------------------SFSETKLLDSSGLKTQKSSNDYYTEGSPSLAQL------ 362 Query: 217 XXXKPVREVPDSFNSPAMTN-ALDCSIDDLLARTPIVSNQ 101 R+ DS N A N ++D +DDLL T + NQ Sbjct: 363 -----ARKGDDSSNKSAGVNSSVDDLLDDLLKETSTMVNQ 397 >ref|XP_012477223.1| PREDICTED: uncharacterized protein LOC105792917 [Gossypium raimondii] gi|763759850|gb|KJB27181.1| hypothetical protein B456_004G282700 [Gossypium raimondii] Length = 415 Score = 155 bits (392), Expect = 8e-35 Identities = 136/438 
(31%), Positives = 199/438 (45%), Gaps = 11/438 (2%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ-AKRAHSXXXXXXXXXXXXXXXXXXPE--KTLNPKSH------L 1247 MDAKA AKSKRAH+ +K+ HS + K + K+H Sbjct: 2 MDAKALAKSKRAHSQHHSKKPHSVQKSKPPSPGVNEPSNSKKQTIKQIKEKAHQAQRISA 61 Query: 1246 LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 LPSNW+RY+++ G+E D + PKSKGADF L+++A+ ++ NP Sbjct: 62 LPSNWNRYEEEFDSGSE-------DPTQTPDVIVPKSKGADFRHLLSEAQSQLQA--NPY 112 Query: 1066 EXXXXXXXXXXXXFY-QGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHA 890 + Q V SML+V+G +LS + +DNF+VDDS T++ EASFLSL+L A Sbjct: 113 SNNIPSLDDVFPGDFNQFVGSMLAVRGEGILSWTGNDNFVVDDSTTATPEASFLSLNLQA 172 Query: 889 LAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEHGETS 710 LA QLEKVD+S+RLFIE DLL +L Sbjct: 173 LAEQLEKVDLSKRLFIEEDLLPPDL----------------------------------- 197 Query: 709 YAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQN 530 E+++ N++ D Q D+K T + PN S+ Sbjct: 198 ------RSERSKVKNDQ-------EPDQMQAAPDRKEAAKIT-EGSTPNDLPGSK----- 238 Query: 529 AAREVLKPKSQLDATAELEKTTLRS-GATPAETELDMLLNTINEKETKTLGFKAATAEAE 353 A + + S LD AE++ ++ S + +E+ LN K F+AA AEA+ Sbjct: 239 -AIDAILSNSGLDLMAEVQSVSISSQNSESSESRAPDNLNFTTASNKKVPKFEAAAAEAK 297 Query: 352 LDMLLDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPDSFNS 173 LDMLL+SF ETKLLD+ ++S E+ S+ + R + DS + Sbjct: 298 LDMLLNSFNETKLLDTSNLSSEKPSSIGSLKASNLDSLLDDLLQETSTTVNRGI-DSSKT 356 Query: 172 PAMTNALDCSIDDLLART 119 A+ + + +DDLL T Sbjct: 357 AAVNSTSEDLLDDLLQET 374 >ref|XP_006852570.2| PREDICTED: uncharacterized protein LOC18442285 [Amborella trichopoda] Length = 398 Score = 154 bits (390), Expect = 1e-34 Identities = 129/382 (33%), Positives = 179/382 (46%), Gaps = 11/382 (2%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHLLPSNWDRYD 1220 M+ K+SAKSKRAH+L +R H+ ++TL LPSNWDRYD Sbjct: 1 MNVKSSAKSKRAHSLHGRRTHNPSPKPSPSTKKQS------DQTLTRHDSRLPSNWDRYD 54 Query: 1219 DDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPDEXXXXXXXX 1040 D + G + D + + + PKSKGAD++ L++ AK ++ S ++ 
D Sbjct: 55 DIDFSGAQPEDPNQENVNVG-----PKSKGADYAYLLSLAKSESLSLLSFDSVIPDLI-- 107 Query: 1039 XXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHALAAQLEKVDV 860 QG MLS KG SLLS + DNFIVDD E + EASFLS+DLH LA +L +++ Sbjct: 108 ------QGAGPMLSFKGKSLLSWNSYDNFIVDDEEHLNQEASFLSIDLHKLATKLANINL 161 Query: 859 SERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEH-----GETSYAVGT 695 S+R+FIE DLL +EL T +GS LG EH G+ VG+ Sbjct: 162 SKRIFIEEDLLPEEL--------------CGTERQGS--TTLGIEHVKRALGKDGGNVGS 205 Query: 694 HNMEQTRSNNNELLPSPSTSNDN------SQEVHDKKPFPPFTWQSGKPNPATDSELFHQ 533 M Q N++L + N S++ + P F+ + A D L Sbjct: 206 SVMFQ---GNSDLGTKKHSQNHQDLVTSISEDYLENYSQPTFS----GIDVAVDHFLRGS 258 Query: 532 NAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAE 353 +E K+Q + TP LD K GF+AA AE E Sbjct: 259 ELPQEPKPNKTQEE-------------QTPRGVALD------GTNSGKNKGFEAAAAEVE 299 Query: 352 LDMLLDSFGETKLLDSVDISEE 287 LD LLD+FGET+ LD+ I+E+ Sbjct: 300 LDFLLDTFGETRRLDNFSIAED 321 >ref|XP_010260342.1| PREDICTED: uncharacterized protein LOC104599483 isoform X2 [Nelumbo nucifera] Length = 428 Score = 154 bits (390), Expect = 1e-34 Identities = 135/448 (30%), Positives = 197/448 (43%), Gaps = 20/448 (4%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ---------AKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHL 1247 MDAKA AKSKRAH+ A +A + + S Sbjct: 1 MDAKALAKSKRAHSQHHSKKSHASPASKAPAVAASAGNSKKPSAKQTREKNRQFRGSSTA 60 Query: 1246 LPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 LPSNWDRY+++ G+E D SL + PKSKGADF LI++A+ +S + Sbjct: 61 LPSNWDRYEEEYDSGSE--DPSLGGTSRTSDVVVPKSKGADFRYLISEAQSQLQSPSDLS 118 Query: 1066 EXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHAL 887 F QGVS++LS +G ++LS +DNF V+D+ET+S +ASFLS+DLHAL Sbjct: 119 LESFDSFGGFLPGFNQGVSTVLSARGKNILSWIGNDNFAVEDNETAS-QASFLSMDLHAL 177 Query: 886 AAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYGETSHEGSENINLGYEHGETS 710 A QL KVDVS+RLFI+ LL E+ E L++ ++ E +HE + + + + Sbjct: 178 AEQLAKVDVSQRLFIDAYLLPPEMHSEGLQKSKCQDYDHTEATHESEADDH----YLDKM 233 Query: 709 
YAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQN 530 G+ N E N ++ P+ ++ VH P DS Q Sbjct: 234 EFHGSANGEDIMGNRPDISPA------TTENVHSVPALLPEGSMLVNLAKGGDSTQVGQT 287 Query: 529 AAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAEL 350 + + Q + + ++++ KE K F+AA AEAEL Sbjct: 288 CPTKFMNSMEQPNRS-----------------------SSVDLKENKPSRFEAAAAEAEL 324 Query: 349 DMLLDSFGETKLL----------DSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPV 200 DMLLDSFGETKL S S++Q S FP+ + Sbjct: 325 DMLLDSFGETKLFYSGFPVVKQEPSHVSSQQQVSGFPQPSVQAPDASKNASGAFDLDNAI 384 Query: 199 REVPDSFNSPAMTNALDCSIDDLLARTP 116 E+ ++ N NA+ + + R P Sbjct: 385 DELRETSNPTNQNNAMRDQQEKAVRRNP 412 >ref|XP_002517843.1| conserved hypothetical protein [Ricinus communis] gi|223542825|gb|EEF44361.1| conserved hypothetical protein [Ricinus communis] Length = 434 Score = 154 bits (390), Expect = 1e-34 Identities = 154/481 (32%), Positives = 214/481 (44%), Gaps = 15/481 (3%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ--AKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHL------L 1244 MD+KA AKSKRAH+L K+ HS K + ++ L Sbjct: 1 MDSKALAKSKRAHSLHHSKKQFHSGQKAKVKAPTGGATDAASGNKAVGKQTREKARQSGL 60 Query: 1243 PSNWDRYDDDEVFGNEGSDKSLVDA-RLAQGIAAPKSKGADFSKLIAQAKEDARSRINPD 1067 PSN DRY+++ + GS L D+ A I PKSKGAD+ LIA+A+ +S D Sbjct: 61 PSNCDRYEEEF---DSGSGDPLGDSINNASDIILPKSKGADYRHLIAEAQSQCQSGSYLD 117 Query: 1066 EXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHAL 887 GV MLSV+G +LS + DDNF+V+D S EA FLSL+L AL Sbjct: 118 MFPSLEDILPADFKL-GVGPMLSVRGEGILSWTGDDNFVVEDESAVSPEAHFLSLNLSAL 176 Query: 886 AAQLEKVDVSERLFIENDLLADEL-GEQLKEMSSSRSNYGETSHEGSENINLGYEHGETS 710 A QL KVD+SERLF+E D+L EL G K SS S +TS + + Sbjct: 177 AEQLLKVDISERLFMEADILPPELSGHGAKATSSLESEQKQTSEM------------KVN 224 Query: 709 YAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQN 530 V + + S NE S EV + +G+ +P + ++ F Sbjct: 225 STVSEELILKDLSEKNEFAKQ-------SSEVMSSESI-----LTGQSDPISLNQEFDM- 271 Query: 529 AAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAEL 350 + K + A+ R+ +PAE ++I + + K F+A AEAEL Sbjct: 
272 ----INKTEGDFSASRHSSSCENRAMESPAEISG----SSIADPKKKPYMFEATAAEAEL 323 Query: 349 DMLLDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPDSFNSP 170 DMLLDSF ETK LDS S S+ FP ++ +R P S + Sbjct: 324 DMLLDSFNETKFLDS---SGFTSAAFPLSKKEAPRALPQL---------IRNTPSS-SKT 370 Query: 169 AMTNALDCSIDDLLARTPIVSNQDHG-----VHHPLEXXXXXXXXXXXXXXXXLDDFDSW 5 +++ LD ++DDLL +T +SNQ++ V LDDFDSW Sbjct: 371 SISATLDDALDDLLEQTSNLSNQNNSYQSVKVTATSNEMQSSSSSRSVTKSKVLDDFDSW 430 Query: 4 L 2 L Sbjct: 431 L 431 >emb|CDP15879.1| unnamed protein product [Coffea canephora] Length = 410 Score = 153 bits (386), Expect = 4e-34 Identities = 154/479 (32%), Positives = 209/479 (43%), Gaps = 13/479 (2%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAH--SXXXXXXXXXXXXXXXXXXPEKTLNPK------SHLL 1244 MDAKA AKSKRAH+L + H S K K S L Sbjct: 1 MDAKALAKSKRAHSLHHSKKHHSSPTSKATPSSATSSSGKKPTNKQARDKPYQSQSSKAL 60 Query: 1243 PSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPDE 1064 P+NWDRY+++ +G+ D V A + PKSKGAD++ LI++AK A+S+ N Sbjct: 61 PTNWDRYEEE--YGSGSEDSPQVSTGQASDVVVPKSKGADYAYLISEAK--AQSQANSSS 116 Query: 1063 XXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHALA 884 F QG+ S+LSV+G LLS +D F DD TSS+EASFLSL+LH+LA Sbjct: 117 ESFSLFDDFLDGFNQGLGSLLSVRGEHLLSRISNDVFPFDDKGTSSHEASFLSLNLHSLA 176 Query: 883 AQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEHGETSYA 704 QL K +++ERLFIE DLL E+ +L +++ N E GS E E+ +A Sbjct: 177 EQLSKANLAERLFIEPDLLPPEMCTELD--ANNEKNPDELQATGST------EATESEFA 228 Query: 703 VGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQNAA 524 G + ++ N N LL S+++S+ F+ + + A D + ++ + Sbjct: 229 -GQPSSIISKENRNILLSQEYMSSNSSR-------VSQFSVPTSTDHRADDLKEISRSTS 280 Query: 523 REVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAELDM 344 +K S + + EK + R A AE ELD M Sbjct: 281 ---VKLTSGVSIDSSSEKPS-RFEAAKAEAELD--------------------------M 310 Query: 343 LLDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPDSFNSPAM 164 LLDSFGETK DS + S F VRE PD+ S M Sbjct: 311 LLDSFGETKFFDS------KGSTFQSVSVAAQH--------------VREGPDATYSGRM 350 
Query: 163 TNALDCSIDDLLARTPIVSNQDHGVHHPLE-----XXXXXXXXXXXXXXXXLDDFDSWL 2 ALD S+DD+L T + N PL LDDFDSWL Sbjct: 351 DAALDDSLDDILKDTSHLINTK--AVSPLNEVKAASNEGPTASQPHSKSKILDDFDSWL 407 >ref|XP_006592445.1| PREDICTED: uncharacterized protein LOC102660628 [Glycine max] gi|734309998|gb|KHM99717.1| hypothetical protein glysoja_037281 [Glycine soja] Length = 429 Score = 150 bits (380), Expect = 2e-33 Identities = 148/495 (29%), Positives = 211/495 (42%), Gaps = 30/495 (6%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ--AKRAHSXXXXXXXXXXXXXXXXXXPEKTLNP----------K 1256 MD KA AKSKR HT K HS K NP K Sbjct: 1 MDVKALAKSKRNHTQHHSKKSPHSHKPKAPTSSSSSSVGPNDAAKN-NPLGKQQVSQKKK 59 Query: 1255 SH--LLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARS 1082 SH LPSNWDRY+D+E E D A + PK+KGADF L+A+A+ A + Sbjct: 60 SHRSALPSNWDRYEDEE----EELDSGSGIASKTVDVVLPKTKGADFRHLVAEAQSQAET 115 Query: 1081 RINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSL 902 + E F G+SSML V+G ++S DDNF+VDD T + EASFLSL Sbjct: 116 SL---EGFPAFDDLLPGEFGVGLSSMLVVRGEGIVSWVGDDNFVVDDKTTGNPEASFLSL 172 Query: 901 DLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEH 722 +LHALA KVD+S+RLFIE+DLL EL + +SS+ + + E SE Sbjct: 173 NLHALAESFAKVDLSKRLFIESDLLPTELCVEELAVSSNEEHKELKTKEDSE-------- 224 Query: 721 GETSYAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSEL 542 N + ++L TS+ +S H FP + + Sbjct: 225 --------LANRMSKELDLDDLAADQFTSSSSSSSSHAVSTFP------------LSNNV 264 Query: 541 FHQNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATA 362 FH P + ++A A+ + ++ A ++ L +T + + + F AA Sbjct: 265 FHI--------PVNYVNAEAQQTSCSSKNKAFVPCSDAS-LHSTEDARGKQYSAFGAADV 315 Query: 361 EAELDMLLDSFGETKLLD--------SVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXK 206 E ELDMLLDS ETK+LD S+ +S SS +P+ Sbjct: 316 EKELDMLLDSLSETKILDSSGFKSYTSIPVSLGVSSVYPQVS------------------ 357 Query: 205 PVREVPDSFNSPAMTNALDCSIDDLLARTPIVSN--------QDHGVHHPLEXXXXXXXX 50 ++ P + ++T +LD ++D+LL T + N ++ HH ++ Sbjct: 358 --KKDPVPSKTASITASLDDALDELLEETSTLMNPNVLLRPQEEKPFHHSMQ-----SSS 410 Query: 49 
XXXXXXXXLDDFDSW 5 DDFDSW Sbjct: 411 HSGNKSKVADDFDSW 425 >ref|XP_003526252.1| PREDICTED: uncharacterized protein LOC100790093 [Glycine max] gi|734334005|gb|KHN07784.1| hypothetical protein glysoja_033870 [Glycine soja] Length = 433 Score = 150 bits (379), Expect = 3e-33 Identities = 146/498 (29%), Positives = 208/498 (41%), Gaps = 33/498 (6%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ-AKRAH----------------SXXXXXXXXXXXXXXXXXXPEK 1271 MD KA AKSKR+HT +K +H S EK Sbjct: 1 MDVKALAKSKRSHTQHHSKNSHHSHKPNKAASSSSSSSSVGPNDAAKKNPLGKQQVSEEK 60 Query: 1270 TLNPKSHLLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKED 1091 LPSNWDRY+D+E E D A + PKSKGADF L+A+A+ Sbjct: 61 KKKSHHSALPSNWDRYEDEE----EELDSGSGIASKTVDVVLPKSKGADFRHLVAEAQSL 116 Query: 1090 ARSRINPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASF 911 A + + E F G+SSML V+G ++S + DDNF+V+D + EASF Sbjct: 117 AETSL---EGFPAFNDLLPGEFGVGLSSMLVVRGEGIVSWAGDDNFVVEDKTNGNLEASF 173 Query: 910 LSLDLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLG 731 LSL+LHALA KVD+++RLFIE DLL EL + MSSS + + + SE N Sbjct: 174 LSLNLHALAESFAKVDLAKRLFIEADLLPTELCVEESAMSSSEEHEELKTKDESELANRM 233 Query: 730 YEHGETSYAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATD 551 E + ++L S+ +S H FP Sbjct: 234 SEELDV----------------DDLAADQFISSSSSSSSHAASTFP-------------- 263 Query: 550 SELFHQNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKA 371 + + P + +DA A+ ++ ++ A ++ L +T + + F+A Sbjct: 264 -------LSNDFRIPVNYVDAEAQQTSSSGKNKAFVLSSDAS-LHSTEDTRGKPYSTFEA 315 Query: 370 ATAEAELDMLLDSFGETKLLD--------SVDISEEQSSNFPRAQXXXXXXXXXXXXXXX 215 A AE ELDMLLDSFGET +LD S+ +S +S +P Sbjct: 316 ADAEKELDMLLDSFGETNILDSSGFKSNTSIPVSSGVASVYP------------PHISNK 363 Query: 214 XXKPVREVPDSFNSPAMTNALDCSIDDLLARTPIVSN--------QDHGVHHPLEXXXXX 59 P + P +T +LD +DDLL T ++N ++ VHH ++ Sbjct: 364 DPVPSKTAP-------ITASLDDVLDDLLEGTSTLTNPNVLLRPQEEKPVHHSMQ----- 411 Query: 58 XXXXXXXXXXXLDDFDSW 5 DDFDSW Sbjct: 412 SSSNSGSKSKVADDFDSW 429 >ref|XP_010093966.1| hypothetical protein L484_010532 [Morus notabilis] 
gi|587865403|gb|EXB54953.1| hypothetical protein L484_010532 [Morus notabilis] Length = 423 Score = 150 bits (378), Expect = 3e-33 Identities = 141/447 (31%), Positives = 192/447 (42%), Gaps = 13/447 (2%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAH----------SXXXXXXXXXXXXXXXXXXPEKTLNPKSH 1250 MDAKA AKSKRAH+LQ R H EK L P+ Sbjct: 1 MDAKALAKSKRAHSLQHSRRHHPNQKPKAPSGVAAASETGGAKKPSGKQDKEKPLQPRGK 60 Query: 1249 -LLPSNWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRIN 1073 LPSNWDRY+ + G+E S + + PKSKGAD+ LIA+A+ + + + Sbjct: 61 SALPSNWDRYEQETDSGSEEPSGSGAIQKQNPDVVLPKSKGADYRHLIAEAQSQSHAYL- 119 Query: 1072 PDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLH 893 + F V SMLSV+G +L+ S DDNFIV+D T+ EA+FLSL+LH Sbjct: 120 --DSFPSVDDVLAGEFSLAVGSMLSVRGEGILAWSADDNFIVNDKSTTHPEAAFLSLNLH 177 Query: 892 ALAAQLEKVDVSERLFIENDLLADELGEQLKEMS-SSRSNYGETSHEGSENINLGYEHGE 716 ALA QLEK+D++ RLFIE DLL EL ++ E S + + N +++ L E Sbjct: 178 ALAEQLEKIDLAHRLFIEADLLPPELHVEVSETSRTQKCNQMPATNDVEAVSKLPEEL-- 235 Query: 715 TSYAVGTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFH 536 T N ++ + P PS S S V G N S+ H Sbjct: 236 ------TFNEVSLSASPSGGHPDPSLSIRGSSSV-----------SQGVSNVNRVSQYDH 278 Query: 535 QNAAREVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEA 356 ++ A +S +D A+ K A AE EL Sbjct: 279 KSNAPHFAVAQSSVDTFADPGKKRPEFEAVAAEAEL------------------------ 314 Query: 355 ELDMLLDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPDSFN 176 DMLLDSF E K+ DS +S + +P R+ P N Sbjct: 315 --DMLLDSFSEIKIPDSSGLSSADT------------LPVHEEASAAVFQPPRKDP---N 357 Query: 175 SPAMTNA-LDCSIDDLLARTPIVSNQD 98 S +TNA LD +DDLL T +++Q+ Sbjct: 358 SSVLTNANLDDDLDDLLKETSSLTSQN 384 >ref|XP_012088204.1| PREDICTED: uncharacterized protein LOC105646878 [Jatropha curcas] gi|643709663|gb|KDP24072.1| hypothetical protein JCGZ_25729 [Jatropha curcas] Length = 428 Score = 149 bits (377), Expect = 4e-33 Identities = 145/481 (30%), Positives = 203/481 (42%), Gaps = 15/481 (3%) Frame = -2 Query: 1399 
MDAKASAKSKRAHTLQAKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHL------LPS 1238 MDAKA AKSKRAH+L + S K L ++ LP+ Sbjct: 1 MDAKALAKSKRAHSLHHSKKPSHSSLKSKAPSGGANNAGGGNKALGKQTKEKARQSGLPA 60 Query: 1237 NWDRYDDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPDEXX 1058 NWDRY+++ F ++ S A + PKSKGADF LIA+A+ +S + D Sbjct: 61 NWDRYEEE--FDSDSEVPSGDSISKASDVILPKSKGADFRHLIAEAQSQCQSNVCLDTFP 118 Query: 1057 XXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSLDLHALAAQ 878 GV SMLSV+G +LS + DDNF+V+D T+ EASF SL+L+ALA Q Sbjct: 119 SMDDILPGDFEL-GVRSMLSVRGEGILSWTGDDNFVVEDETTAIPEASFFSLNLNALAEQ 177 Query: 877 LEKVDVSERLFIENDLLADELGEQLKEMS-SSRSNYGETSHEGSENINLGYEHGETSYAV 701 L KVD+S+RL+IE DLL EL + + S S+ +TS + ++ E Sbjct: 178 LAKVDISKRLYIEEDLLPPELTDNRSKASCGPESDQMQTSETEATSM-----VAERLAFR 232 Query: 700 GTHNMEQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQNAAR 521 + + N E++ S ST+N S + + G + DS+ Sbjct: 233 DISGKNKVANKNTEVISSESTANRYSNLISPNQGLDRLNQAKGDQYSSRDSKF------S 286 Query: 520 EVLKPKSQLDATAELEKTTLRSGATPAETELDMLLNTINEKETKTLGFKAATAEAELDML 341 E KSQ D+ +L +T + A AE LDML Sbjct: 287 ESKSQKSQADSKKKL--STFEAAAAEAE----------------------------LDML 316 Query: 340 LDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPDSFNSPAMT 161 LDSF ET LDS + +SS+FP +Q S + A+ Sbjct: 317 LDSFSETNFLDS--SAGLKSSSFPVSQKEAPVVMPQFTRN-------TSSSTSLETTAVA 367 Query: 160 NALDCSIDDLLARTPIVSNQDHG--------VHHPLEXXXXXXXXXXXXXXXXLDDFDSW 5 LD +D LL T +SNQ+ H+ ++ LDDFDSW Sbjct: 368 AKLDDVLDGLLDETTNLSNQNSSCQLGKVTVTHNEIK---SSSSSQPVTKSIVLDDFDSW 424 Query: 4 L 2 L Sbjct: 425 L 425 >ref|XP_004287059.1| PREDICTED: uncharacterized protein LOC101291364 [Fragaria vesca subsp. 
vesca] Length = 381 Score = 148 bits (373), Expect = 1e-32 Identities = 149/470 (31%), Positives = 208/470 (44%), Gaps = 5/470 (1%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQ-AKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHLLPSNWDRY 1223 MD+KA AKSKRAH+ +K+ HS K +P+NWDRY Sbjct: 1 MDSKALAKSKRAHSQHHSKKYHSPNQKAKDGA-----------KPNKASGKQIPTNWDRY 49 Query: 1222 DDDEVFGNEGSDKSLVDARLAQGIAAPKSKGADFSKLIAQAKEDARSRINPDEXXXXXXX 1043 D++ G++ + A I PKSKGAD++ LIA+A+ + S+ + D Sbjct: 50 DEELDSGSQDA---------ASDIVLPKSKGADYTHLIAEAQSQSLSQFDDDVLSVEWN- 99 Query: 1042 XXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSY-EASFLSLDLHALAAQLEKV 866 +G+ SMLS +G S+LS DDNF+VDD +++ E SFLSL+LH+LA QLEKV Sbjct: 100 -------KGIMSMLSARGESILSWIGDDNFVVDDKTAAAHHEVSFLSLNLHSLAEQLEKV 152 Query: 865 DVSERLFIENDLLADELGEQLKEMSSSRSNYGETSHEGSENINLGYEHGETSYAVGTHNM 686 D+SERLFIE DLL EL + E +SS+S A GT Sbjct: 153 DLSERLFIEADLLPPELNLEGLESTSSQS---------------------ADQAQGTFVN 191 Query: 685 EQTRSNNNELLPSPSTSNDNSQEVHDKKPFPPFTWQSGKPNPATDSELFHQNAAREVLKP 506 + R ++P S S + +++ S + DS+ N LK Sbjct: 192 KGAR-----VIPEASISGEFPDKINVADQDIEIMLSS-----SPDSDCLDSNLGSISLK- 240 Query: 505 KSQLDA-TAELEKTTLRSGATP-AETELDMLLNTINEKETKTLGFKAATAEAELDMLLDS 332 Q+D ++L K+T +S P A+ + L F+AATAE ELDMLLDS Sbjct: 241 --QIDVDPSKLGKSTRQSSMKPFADIPIKNLAT-----------FEAATAEEELDMLLDS 287 Query: 331 FGETKLLD-SVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXKPVREVPDSFNSPAMTNA 155 F ETK D S S + ++ P Q R+ DS S + Sbjct: 288 FSETKRNDPSALRSLQDEASVPPLQVP------------------RKGTDS--SILVAAN 327 Query: 154 LDCSIDDLLARTPIVSNQDHGVHHPLEXXXXXXXXXXXXXXXXLDDFDSW 5 LD ++DDL+ I NQ + +DDFDSW Sbjct: 328 LDDALDDLMNEISIPINQGGPSRPQEKMAVHDFQSSQTGSKSKVDDFDSW 377 >ref|XP_006469074.1| PREDICTED: lisH domain-containing protein C1711.05-like isoform X1 [Citrus sinensis] Length = 456 Score = 146 bits (369), Expect = 4e-32 Identities = 149/504 (29%), Positives = 218/504 (43%), Gaps = 38/504 (7%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHL--------- 1247 MDAKA AKSKRAH+ Q K S EK ++ Sbjct: 1 
MDAKALAKSKRAHSQQHKNK-SHPNQKLKAPVVASDNAGGKEKQPGKQAGAGTREARRLS 59 Query: 1246 -LPSNWDRYDDDEVFGNEGSDKSLVDARL-AQGIAAPKSKGADFSKLIAQAKEDARSR-- 1079 LPSNWDRY+D GSD D A PKSKGAD+ LIA+A+ + S+ Sbjct: 60 KLPSNWDRYED-------GSDMDSEDTTSQASDFVVPKSKGADYRHLIAEAQSQSLSQSR 112 Query: 1078 -INPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSL 902 ++ + F G+ MLSV+G +LS DDNF+V+D T+ EASFLSL Sbjct: 113 SLSYSDTFPLLDDVMPGGFAPGMGPMLSVRGEGILSWVGDDNFVVEDKTTAFQEASFLSL 172 Query: 901 DLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRS-NYGETSHEGSENINLG-- 731 +L+ALA L KVD+S+RLF+E DLL ELG + SS++ +T HE ++ + Sbjct: 173 NLNALAEHLAKVDLSQRLFVEADLLPSELGTEGSIASSNQEPGLMQTEHESEADVGISRD 232 Query: 730 -------YEHGETSYAVGTH------NMEQTRSNNN-----ELLPSPSTSNDNSQEVHD- 608 + GE G H N+ + +++ + +++ + STS + V Sbjct: 233 IDIASKDFPEGEEEEESGAHKVKAAANISEDKASTDFREKVKIVDTKSTSVVGHKNVDAI 292 Query: 607 -KKPFPPFTWQSGKPNPATDSELFHQNAAREVLKPKSQL-DATAELEKTTLRSGATPAET 434 Q+ P++ + F Q+ A L+P +Q + + + K AT AE Sbjct: 293 FSNQRSALVNQTKNDVPSSQYDRFGQDKA---LEPPAQFNENSVSVSKNLPTFEATAAEA 349 Query: 433 ELDMLLNTINEKETKTLGFKAATAEAELDMLLDSFGETKLLDSVDISEEQSSNFPRAQXX 254 ELDMLL++ N+ GF S+ + + +S++ SS P Sbjct: 350 ELDMLLDSFND-----TGF--------------SYSSSSKFSNSSVSQQTSSTAPPQLS- 389 Query: 253 XXXXXXXXXXXXXXXKPVREVPDSFNSPAMTNALDCSIDDLLARTPIVSNQDHGVHHPLE 74 R+ PD S ++T + D +DDLL T + N +G+ P E Sbjct: 390 ------------------RKGPDLSKSASVTASFDDVLDDLLEETSNLMN-PNGLSRPHE 430 Query: 73 XXXXXXXXXXXXXXXXLDDFDSWL 2 LDDFDSWL Sbjct: 431 -AQSSSSSQSVKKSKVLDDFDSWL 453 >ref|XP_006469075.1| PREDICTED: lisH domain-containing protein C1711.05-like isoform X2 [Citrus sinensis] Length = 440 Score = 144 bits (364), Expect = 1e-31 Identities = 148/488 (30%), Positives = 210/488 (43%), Gaps = 22/488 (4%) Frame = -2 Query: 1399 MDAKASAKSKRAHTLQAKRAHSXXXXXXXXXXXXXXXXXXPEKTLNPKSHL--------- 1247 MDAKA AKSKRAH+ Q K S EK ++ Sbjct: 1 MDAKALAKSKRAHSQQHKNK-SHPNQKLKAPVVASDNAGGKEKQPGKQAGAGTREARRLS 59 Query: 1246 -LPSNWDRYDDDEVFGNEGSDKSLVDARL-AQGIAAPKSKGADFSKLIAQAKEDARSR-- 1079 LPSNWDRY+D GSD D A 
PKSKGAD+ LIA+A+ + S+ Sbjct: 60 KLPSNWDRYED-------GSDMDSEDTTSQASDFVVPKSKGADYRHLIAEAQSQSLSQSR 112 Query: 1078 -INPDEXXXXXXXXXXXXFYQGVSSMLSVKGNSLLSCSMDDNFIVDDSETSSYEASFLSL 902 ++ + F G+ MLSV+G +LS DDNF+V+D T+ EASFLSL Sbjct: 113 SLSYSDTFPLLDDVMPGGFAPGMGPMLSVRGEGILSWVGDDNFVVEDKTTAFQEASFLSL 172 Query: 901 DLHALAAQLEKVDVSERLFIENDLLADELGEQLKEMSSSRS-NYGETSHEGSENINLGYE 725 +L+ALA L KVD+S+RLF+E DLL ELG + SS++ +T HE + E Sbjct: 173 NLNALAEHLAKVDLSQRLFVEADLLPSELGTEGSIASSNQEPGLMQTEHESEADGEEEEE 232 Query: 724 HGETSYAVGTHNMEQTRSNN----NELLPSPSTSNDNSQEVHD--KKPFPPFTWQSGKPN 563 G + E S + +++ + STS + V Q+ Sbjct: 233 SGAHKVKAAANISEDKASTDFREKVKIVDTKSTSVVGHKNVDAIFSNQRSALVNQTKNDV 292 Query: 562 PATDSELFHQNAAREVLKPKSQL-DATAELEKTTLRSGATPAETELDMLLNTINEKETKT 386 P++ + F Q+ A L+P +Q + + + K AT AE ELDMLL++ N+ Sbjct: 293 PSSQYDRFGQDKA---LEPPAQFNENSVSVSKNLPTFEATAAEAELDMLLDSFND----- 344 Query: 385 LGFKAATAEAELDMLLDSFGETKLLDSVDISEEQSSNFPRAQXXXXXXXXXXXXXXXXXK 206 GF S+ + + +S++ SS P Sbjct: 345 TGF--------------SYSSSSKFSNSSVSQQTSSTAPPQLS----------------- 373 Query: 205 PVREVPDSFNSPAMTNALDCSIDDLLARTPIVSNQDHGVHHPLEXXXXXXXXXXXXXXXX 26 R+ PD S ++T + D +DDLL T + N +G+ P E Sbjct: 374 --RKGPDLSKSASVTASFDDVLDDLLEETSNLMN-PNGLSRPHE-AQSSSSSQSVKKSKV 429 Query: 25 LDDFDSWL 2 LDDFDSWL Sbjct: 430 LDDFDSWL 437