BLASTX nr result
ID: Rauwolfia21_contig00006515
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00006515
         (2102 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006474263.1| PREDICTED: uncharacterized protein LOC102629...   369   3e-99
ref|XP_006453257.1| hypothetical protein CICLE_v10010423mg, part...   368   5e-99
ref|XP_006353007.1| PREDICTED: uncharacterized protein LOC102597...   355   4e-95
ref|XP_006474266.1| PREDICTED: uncharacterized protein LOC102629...   350   2e-93
ref|XP_004233158.1| PREDICTED: uncharacterized protein LOC101258...   342   5e-91
ref|XP_004500707.1| PREDICTED: uncharacterized protein LOC101488...   336   2e-89
ref|XP_003522270.2| PREDICTED: uncharacterized protein LOC100813...   333   1e-88
gb|EOY20032.1| RING/FYVE/PHD zinc finger superfamily protein, pu...   324   8e-86
ref|XP_003527593.1| PREDICTED: uncharacterized protein LOC100800...   323   1e-85
ref|XP_006577888.1| PREDICTED: uncharacterized protein LOC100813...   314   1e-82
gb|EMJ25408.1| hypothetical protein PRUPE_ppa017202mg, partial [...   313   1e-82
ref|XP_002308115.2| hypothetical protein POPTR_0006s07510g [Popu...   309   4e-81
ref|XP_004146372.1| PREDICTED: uncharacterized protein LOC101212...   305   7e-80
gb|EOY20034.1| RING/FYVE/PHD zinc finger superfamily protein, pu...   303   2e-79
ref|XP_004300023.1| PREDICTED: uncharacterized protein LOC101314...   301   7e-79
ref|XP_002324693.1| ELM2 domain-containing family protein [Popul...   298   6e-78
ref|XP_006577890.1| PREDICTED: uncharacterized protein LOC100813...   287   1e-74
gb|ESW09310.1| hypothetical protein PHAVU_009G117100g [Phaseolus...   280   2e-72
gb|EPS72079.1| hypothetical protein M569_02681, partial [Genlise...   222   4e-55
ref|XP_004967630.1| PREDICTED: CHD3-type chromatin-remodeling fa...   222   6e-55

>ref|XP_006474263.1| PREDICTED: uncharacterized protein LOC102629457 isoform X1 [Citrus sinensis]
 gi|568840621|ref|XP_006474264.1| PREDICTED: uncharacterized protein LOC102629457 isoform X2 [Citrus sinensis]
 gi|568840623|ref|XP_006474265.1| PREDICTED: uncharacterized protein LOC102629457 isoform X3 [Citrus sinensis]
          Length = 528

 Score = 369 bits (947), Expect = 3e-99
 Identities = 222/525 (42%), Positives = 296/525 (56%), Gaps = 16/525 (3%)
 Frame = +3

Query: 303  LPDPAESTPF-VPKICSQNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTSYDSS 479
            LP E+ F V QN +CD +P E+W+ CP+ D GK+ S+ +
Sbjct: 7    LPSSIEAAIFHVSDHGQQNSLCDWLPASETWQMCPKYD-------GKTGSSEQNADTLCP 59

Query: 480  LNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEA 659
            LNF SQ ST + MSGS +P VY R+KL S I Q K DD SVVS +A
Sbjct: 60   LNFDRNSQHSTVSTMSGSTAPTLVYQRRKLRGNSVPIFSTQDPVNTKRSDDCLSVVSFDA 119

Query: 660  HSGATKE-HLVSI-ETETEGIRSPCKPPSKCSTEGLVSKSGSYNGCLDREESDEGLRTDT 833
            S +E H VS+ E TE + +P PP +E + +S S ++ SD+ +
Sbjct: 120  VSVPMEEQHAVSLAEVGTEAVGTPILPPIISQSEPRLLRSDSVQ---EQVVSDKLSKNIR 176

Query: 834  GRXXXXXXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICIS 1013
            + K N++L+SAS KT+ D+TGECSSS ++ E +++S +D+CIS
Sbjct: 177  HKMVEIDSINDSCSSSKSNMELLSASKKTEVDETGECSSSSAVMLETTGKDLSAKDLCIS 236

Query: 1014 IIRSHGMVEIASPLQDCVSAYDSAMPSEEIC---CSRTCKVCGCSDPSLKMLICDQCEDS 1184
            I+R+ GM+E P Q SA D + S C CSR+CK+CG S+ +LK+L+CD CE++
Sbjct: 237  ILRNEGMLERFWPTQIRSSAND--VDSGTGCSGICSRSCKICGRSETALKLLLCDDCEEA 294

Query: 1185 FHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSNAPE---------NISNENKLVPI 1337
            FH +C TPR+K +P DEWFC+ CL + +P N S + + PI
Sbjct: 295  FHVTCYTPRIKIVPSDEWFCHLCLKKKHKTLKATARKSPNIISEKGRGRNASAKGEPSPI 354

Query: 1338 ADMLRGTKPFKSSVRVGPEYQVEVPDWAGPVIDEN-AIREPDETAELECLSLQGENSNNS 1514
            ML T P+ +SVRVG +Q ++PDW P ++ A+ EP E EC SL NS N
Sbjct: 355  ELMLTSTVPYTTSVRVGKGFQADIPDWLAPTNNDGYALGEPLELDTSECPSLHDLNSYNL
414 Query: 1515 LQLSSIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCA 1694 LSSIGNWLQC+ KWRRAPLFEVQTDDWECF AV WDPTHADCA Sbjct: 415 SNLSSIGNWLQCKQVLEGTGDGVDGTSCGKWRRAPLFEVQTDDWECFCAVQWDPTHADCA 474 Query: 1695 VPQELPTDEVLKQLKYIQMLRPRLVAKKRKLDQTKTLDSEVPKEV 1829 VPQEL TDEV KQLKY++MLR RL AK+RK D+TK+ ++ +V Sbjct: 475 VPQELETDEVSKQLKYLEMLRLRLDAKRRKFDRTKSRPPQLSAKV 519 >ref|XP_006453257.1| hypothetical protein CICLE_v10010423mg, partial [Citrus clementina] gi|557556483|gb|ESR66497.1| hypothetical protein CICLE_v10010423mg, partial [Citrus clementina] Length = 518 Score = 368 bits (945), Expect = 5e-99 Identities = 221/516 (42%), Positives = 292/516 (56%), Gaps = 16/516 (3%) Frame = +3 Query: 303 LPDPAESTPF-VPKICSQNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTSYDSS 479 LP E+ F V QN +CD +P E+W+ CP+ D GK+ S+ + Sbjct: 7 LPSSIEAAIFHVSDHGQQNSLCDWLPASETWQMCPKYD-------GKTGSSEQNADTLCP 59 Query: 480 LNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEA 659 LNF SQ ST + MSGS +P VY R+KL S I Q K DD SVVS +A Sbjct: 60 LNFDRNSQHSTVSTMSGSTAPTLVYQRRKLRGNSVPIFSTQDPVNTKRSDDCLSVVSFDA 119 Query: 660 HSGATKE-HLVSI-ETETEGIRSPCKPPSKCSTEGLVSKSGSYNGCLDREESDEGLRTDT 833 S +E H VS+ E TE + +P PP +E + +S S ++ SD+ + Sbjct: 120 VSVPMEEQHAVSLAEVGTEAVGTPILPPIISQSEPRLLRSDSVQ---EQVVSDKLSKNIR 176 Query: 834 GRXXXXXXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICIS 1013 + K N++L+SAS KT+ D+TGECSSS ++ E +++S +D+CIS Sbjct: 177 HKMVEIDSINDSCSSSKSNMELLSASKKTEVDETGECSSSSAVMLETTGKDLSAKDLCIS 236 Query: 1014 IIRSHGMVEIASPLQDCVSAYDSAMPSEEIC---CSRTCKVCGCSDPSLKMLICDQCEDS 1184 I+R+ GM+E P Q SA D + S C CSR+CK+CG S+ +LK+L+CD CE++ Sbjct: 237 ILRNEGMLERFWPTQIRSSAND--VDSGTGCSGICSRSCKICGRSETALKLLLCDDCEEA 294 Query: 1185 FHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSNAPE---------NISNENKLVPI 1337 FH +C TPR+K +P DEWFC+ CL + +P N S + + PI Sbjct: 295 FHVTCYTPRIKIVPSDEWFCHLCLKKKHKTLKATARKSPNIISEKGRGRNASAKGEPSPI 354 Query: 1338 ADMLRGTKPFKSSVRVGPEYQVEVPDWAGPVIDEN-AIREPDETAELECLSLQGENSNNS 1514 ML T P+ +SVRVG +Q ++PDW P ++ A+ EP E EC SL NS N 
Sbjct: 355 ELMLTSTVPYTTSVRVGKGFQADIPDWLAPTNNDGYALGEPLELDTSECPSLHDLNSYNL 414 Query: 1515 LQLSSIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCA 1694 LSSIGNWLQC+ KWRRAPLFEVQTDDWECF AV WDPTHADCA Sbjct: 415 SNLSSIGNWLQCKQVLEGTGDGVDGTSCGKWRRAPLFEVQTDDWECFCAVQWDPTHADCA 474 Query: 1695 VPQELPTDEVLKQLKYIQMLRPRLVAKKRKLDQTKT 1802 VPQEL TDEV KQLKY++MLR RL AK+RK D+TK+ Sbjct: 475 VPQELETDEVSKQLKYLEMLRLRLDAKRRKFDRTKS 510 >ref|XP_006353007.1| PREDICTED: uncharacterized protein LOC102597878 [Solanum tuberosum] Length = 512 Score = 355 bits (911), Expect = 4e-95 Identities = 212/493 (43%), Positives = 279/493 (56%), Gaps = 9/493 (1%) Frame = +3 Query: 363 CDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTSYDS---SLNFLSCSQVSTSTIMSGS 533 CD+ G E E P DK V +E S D SLN L + ST + MSGS Sbjct: 29 CDQGSGAELSEMSPEKDKC------PLVVCIESDSKDGDPCSLNSLDGPRPSTFSAMSGS 82 Query: 534 ESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSIETETEG 713 P+ VY RKK + S + + + S + SE HSG KE +V+ E Sbjct: 83 SKPI-VYRRKKFKRNPLPTFFIEPSAEVRPSNGCPSELCSEVHSGTLKEGIVAAEKLATA 141 Query: 714 IRSPCKPPSKCSTEGLVSKSGSYNGCLDREE--SDEGLRTDTGRXXXXXXXXXXXXXXKL 887 +P P++C+ L+SKS S +G + EE S+ R+D R K Sbjct: 142 --TPVLLPAECNRGNLLSKSNSCDGRPEGEEQCSEAASRSDMQRTSNVCINDSHSSS-KC 198 Query: 888 NLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVE--IASPLQD 1061 NLD S+SLKT DD GECSSSG L ERL +NM E+DIC +I+R +G++E + + LQ Sbjct: 199 NLDFGSSSLKTVVDDAGECSSSGALFPERLGDNMPEKDICAAILRGYGLLEKVVVTKLQ- 257 Query: 1062 CVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWF 1241 ++ + S + CC +CK C CS+ ++KMLICD C+D++H SCC P +K P DEWF Sbjct: 258 --ASTEDFYTSSDNCCLISCKTCDCSESTVKMLICDNCDDAYHLSCCKPHIKIAPEDEWF 315 Query: 1242 CYSCLXXXXXXXXXXSSN-APENISNENKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDW 1418 C +CL S N + N +E P A ML+ T +++ VR+ +YQ E+PDW Sbjct: 316 CQTCLIKKQKLLKKSSCNESSSNSPSEGVSGPTALMLKDTG-YRTRVRISKKYQAEIPDW 374 Query: 1419 AGPVIDENAIR-EPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXX 1595 GPV DE EP E E L L+ ++SN L++SSIGNWLQC+ Sbjct: 375 
TGPVTDEAGCSGEPFEIRLSENLCLREQSSNEHLRISSIGNWLQCRQVIEGVGKRVDGSI 434 Query: 1596 XXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQMLRPRLVAK 1775 KWRRAPLFEVQTD WECFR+VLWDP HADCAVPQEL T+EVLKQLKY++ML+PRL K Sbjct: 435 CGKWRRAPLFEVQTDKWECFRSVLWDPAHADCAVPQELETEEVLKQLKYMEMLKPRLAVK 494 Query: 1776 KRKLDQTKTLDSE 1814 +RKL+QT S+ Sbjct: 495 RRKLNQTSGAGSQ 507 >ref|XP_006474266.1| PREDICTED: uncharacterized protein LOC102629457 isoform X4 [Citrus sinensis] Length = 490 Score = 350 bits (897), Expect = 2e-93 Identities = 210/492 (42%), Positives = 279/492 (56%), Gaps = 15/492 (3%) Frame = +3 Query: 399 CPRSDKHNQDGSGKSVSTVEKTSYDSSLNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTR 578 CP+ D GK+ S+ + LNF SQ ST + MSGS +P VY R+KL Sbjct: 2 CPKYD-------GKTGSSEQNADTLCPLNFDRNSQHSTVSTMSGSTAPTLVYQRRKLRGN 54 Query: 579 SPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKE-HLVSI-ETETEGIRSPCKPPSKCST 752 S I Q K DD SVVS +A S +E H VS+ E TE + +P PP + Sbjct: 55 SVPIFSTQDPVNTKRSDDCLSVVSFDAVSVPMEEQHAVSLAEVGTEAVGTPILPPIISQS 114 Query: 753 EGLVSKSGSYNGCLDREESDEGLRTDTGRXXXXXXXXXXXXXXKLNLDLVSASLKTDGDD 932 E + +S S ++ SD+ + + K N++L+SAS KT+ D+ Sbjct: 115 EPRLLRSDSVQ---EQVVSDKLSKNIRHKMVEIDSINDSCSSSKSNMELLSASKKTEVDE 171 Query: 933 TGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASPLQDCVSAYDSAMPSEEIC-- 1106 TGECSSS ++ E +++S +D+CISI+R+ GM+E P Q SA D + S C Sbjct: 172 TGECSSSSAVMLETTGKDLSAKDLCISILRNEGMLERFWPTQIRSSAND--VDSGTGCSG 229 Query: 1107 -CSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXXX 1283 CSR+CK+CG S+ +LK+L+CD CE++FH +C TPR+K +P DEWFC+ CL Sbjct: 230 ICSRSCKICGRSETALKLLLCDDCEEAFHVTCYTPRIKIVPSDEWFCHLCLKKKHKTLKA 289 Query: 1284 XSSNAPE---------NISNENKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDWAGPVID 1436 + +P N S + + PI ML T P+ +SVRVG +Q ++PDW P + Sbjct: 290 TARKSPNIISEKGRGRNASAKGEPSPIELMLTSTVPYTTSVRVGKGFQADIPDWLAPTNN 349 Query: 1437 EN-AIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXXXXKWRR 1613 + A+ EP E EC SL NS N LSSIGNWLQC+ KWRR Sbjct: 350 DGYALGEPLELDTSECPSLHDLNSYNLSNLSSIGNWLQCKQVLEGTGDGVDGTSCGKWRR 409 Query: 1614 
APLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQMLRPRLVAKKRKLDQ 1793 APLFEVQTDDWECF AV WDPTHADCAVPQEL TDEV KQLKY++MLR RL AK+RK D+ Sbjct: 410 APLFEVQTDDWECFCAVQWDPTHADCAVPQELETDEVSKQLKYLEMLRLRLDAKRRKFDR 469 Query: 1794 TKTLDSEVPKEV 1829 TK+ ++ +V Sbjct: 470 TKSRPPQLSAKV 481 >ref|XP_004233158.1| PREDICTED: uncharacterized protein LOC101258431 [Solanum lycopersicum] Length = 473 Score = 342 bits (876), Expect = 5e-91 Identities = 203/469 (43%), Positives = 265/469 (56%), Gaps = 4/469 (0%) Frame = +3 Query: 402 PRSDKHNQDGSGKSVSTVEKTSYDSSLNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTRS 581 P DK+ D +S S K SLN L + ST + M GS P+ VY RKK Sbjct: 3 PEKDKYPIDVCMESDS---KDGDPCSLNSLDGPRPSTFSAMPGSLKPI-VYRRKKFKQNP 58 Query: 582 PDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSIETETEGIRSPCKPPSKCSTEGL 761 + S + + + S + SE HSG KE +V+ E +P P++C+ L Sbjct: 59 RPTFFIEPSAEVRPSNGCPSELCSEVHSGTLKEGIVAAEKLATA--TPVLLPAECNRGNL 116 Query: 762 VSKSGSYNGCLDREE--SDEGLRTDTGRXXXXXXXXXXXXXXKLNLDLVSASLKTDGDDT 935 +SKS S +G + EE S+ R+D R K NLD S+SLKT DD Sbjct: 117 LSKSNSCDGRPEGEEQCSEAASRSDMQRTSNVCINDSHSSS-KCNLDFGSSSLKTLVDDA 175 Query: 936 GECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASPLQDCVSAYDSAMPSEEICCSR 1115 GECSSSG L ERL NM E+DIC +I+R +G++E + S D S+ CC Sbjct: 176 GECSSSGALFPERLGNNMPEKDICTAILRGYGLLENVVVTKLGASTEDFYTSSDN-CCLI 234 Query: 1116 TCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSN 1295 +CK C CS+ ++KMLICD C+D++H SCC P K P DEWFC +CL S N Sbjct: 235 SCKACDCSESTVKMLICDNCDDAYHLSCCKPHKKIAPEDEWFCQTCLIKKQRVLKKSSCN 294 Query: 1296 -APENISNENKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDWAGPVIDENAIR-EPDETA 1469 + N +E + P A ML+ T +K+ VR+ +YQ E+PDW GP D+ EP E Sbjct: 295 ESSSNSPSEGESGPTALMLKDTG-YKTRVRISKKYQAEIPDWTGPATDDAGCSGEPFEIT 353 Query: 1470 ELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWE 1649 E L L ++SN +++SSIGNWLQC+ KWRRAPLFEVQTD+WE Sbjct: 354 PSENLCLPKQSSNEHMRISSIGNWLQCRQVNEGFGKRVDGSICGKWRRAPLFEVQTDNWE 413 Query: 1650 CFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQMLRPRLVAKKRKLDQT 1796 CFR+VLWDP HADCAVPQEL T+EVLKQLKY++ML+ RLV K+RKL+QT Sbjct: 
414 CFRSVLWDPAHADCAVPQELETEEVLKQLKYMEMLKHRLVVKRRKLNQT 462 >ref|XP_004500707.1| PREDICTED: uncharacterized protein LOC101488765 [Cicer arietinum] Length = 498 Score = 336 bits (862), Expect = 2e-89 Identities = 212/502 (42%), Positives = 280/502 (55%), Gaps = 16/502 (3%) Frame = +3 Query: 351 QNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVE-KTSYDS--SLNFL-SCSQVSTST 518 ++F+CDRMP ES + C + +K++ D K+ E + + D S NFL S Q ST++ Sbjct: 23 KDFLCDRMPSGESLQVCLKCNKYHVDWCRKAEPMEEDRRNADDPYSSNFLRSSGQPSTAS 82 Query: 519 IMSGSESP-LYVYHRKKLNTRSPDI-SLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVS 692 IM+ S +P L VY RKKL + L T + +F SV+SS H + ++ Sbjct: 83 IMTESTAPNLMVYRRKKLRKGIASLFKLGPTDVQTSA--NFPSVISSSLHLSSGEDQTAG 140 Query: 693 IETETEGIRSPCKPPSKCSTEGLVSKSGSYNGCLDREESDEGLRTDTGRXXXXXXXXXXX 872 + + +G K PS S LDR + T + Sbjct: 141 FQVKHQG--EMVKDPSMPSV------------FLDRVA-----KYTTKKNLGIDSVNGSC 181 Query: 873 XXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASP 1052 K N+ LVS SL+T+ D+TGECSSS +V + EN++E+D CI+I+RS+G++ Sbjct: 182 SSSKSNMVLVSDSLETEMDETGECSSSSVIVMDITKENLTEKDFCINILRSYGLLR-GDT 240 Query: 1053 LQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPID 1232 L D V + + A+ + CCSR+CK+CG D SL ML+CD CEDS+H SC R+K++PID Sbjct: 241 LTDNVVSVEDAVTTSNNCCSRSCKICGHLDSSLNMLLCDSCEDSYHPSCYNRRLKKLPID 300 Query: 1233 EWFCYSCLXXXXXXXXXXSSNAPENISNENK---------LVPIADMLRGTKPFKSSVRV 1385 EWFC+SC +P S K + PI MLR T+P+ + VRV Sbjct: 301 EWFCHSCHDKRQKILMETFIKSPGINSEMGKCRTTSITAEMNPILLMLRDTEPYMTGVRV 360 Query: 1386 GPEYQVEVPDWAGPV-IDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXX 1562 G +Q EV DW+GPV DE I P E E LQ EN+ N +LSSIGNWLQCQ Sbjct: 361 GKGFQAEVLDWSGPVKSDEYDIPGPLEITPSEFYRLQEENTRNPTRLSSIGNWLQCQEVI 420 Query: 1563 XXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKY 1742 KWRRAPLFEVQTD+WECF AV WDP+HADCAVPQE+ TD+VLKQLKY Sbjct: 421 DRTSRTICG----KWRRAPLFEVQTDEWECFCAVHWDPSHADCAVPQEVETDQVLKQLKY 476 Query: 1743 IQMLRPRLVAKKRKLDQTKTLD 1808 I+MLRPRL AK+RK D T D Sbjct: 477 IEMLRPRLAAKQRKSDCTNNGD 498 >ref|XP_003522270.2| PREDICTED: 
uncharacterized protein LOC100813057 isoform X1 [Glycine max] Length = 488 Score = 333 bits (855), Expect = 1e-88 Identities = 201/506 (39%), Positives = 281/506 (55%), Gaps = 20/506 (3%) Frame = +3 Query: 351 QNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTS-----YDSSLNFLSCSQVSTS 515 ++ +CDRMP E+W+ C + +K+ D K+ E Y SS +S Q ST+ Sbjct: 23 KDVLCDRMPSGETWQVCLKCNKYPLDWCRKAEPVEEDRRNADDPYRSSC-VVSFGQPSTA 81 Query: 516 TIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSI 695 +IM+ + +P VY RKKL S + +L T+ + + SV+SS AH + ++ Sbjct: 82 SIMTENTTPNMVYRRKKLRKDS-NFNLGPTNVQASA--NIPSVISSAAHLSSAEDQPTGF 138 Query: 696 ETE--TEGIRSPCKPPSKCS--TEGLVSKSGSYNGCLDREESDEGLRTDTGRXXXXXXXX 863 + + E ++ P P T+ K+ N D S + Sbjct: 139 QVKHAIEIVKDPTMPSVLFDGVTKDSTHKNLGINSVNDSCSSSK---------------- 182 Query: 864 XXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEI 1043 +++T+ D+TGECSSS +V + E ++E+D C++I+RSHG+++ Sbjct: 183 --------------PNMETEMDETGECSSSSIIVMDCTREEVTEKDFCMNILRSHGLLKE 228 Query: 1044 ASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRI 1223 SP+ + S D+ S CCSR+CK+CG D SL ML+CD CED++H SC PR+K++ Sbjct: 229 NSPVDNVTSGEDAVTTSNN-CCSRSCKICGDLDNSLNMLLCDHCEDAYHLSCYNPRLKKL 287 Query: 1224 PIDEWFCYSCLXXXXXXXXXXSSNAPENISNE----------NKLVPIADMLRGTKPFKS 1373 PIDEWFC+SCL +P +I NE +L PI MLR TKP+ + Sbjct: 288 PIDEWFCHSCLIKRQKILKETVIRSP-SIHNELGKCRTAPVKAELNPILLMLRDTKPYTT 346 Query: 1374 SVRVGPEYQVEVPDWAGPV-IDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQC 1550 VRVG +Q EV DW+GP+ DE+A+ EP E + E L GEN+ N +LSSIGNW++C Sbjct: 347 GVRVGKGFQAEVLDWSGPIKSDEDALPEPLEISPSEFYKLLGENTRNPTKLSSIGNWVKC 406 Query: 1551 QXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLK 1730 Q KWRRAPLFEVQTD WECF A+ WDP+HADCAVPQEL TD+VLK Sbjct: 407 QEIIDRANGTICG----KWRRAPLFEVQTDAWECFCAIHWDPSHADCAVPQELETDQVLK 462 Query: 1731 QLKYIQMLRPRLVAKKRKLDQTKTLD 1808 QLKYI+MLRPRL AK++K D T D Sbjct: 463 QLKYIEMLRPRLAAKRKKSDCTHNSD 488 >gb|EOY20032.1| RING/FYVE/PHD zinc finger superfamily protein, putative isoform 1 [Theobroma cacao] 
gi|508728136|gb|EOY20033.1| RING/FYVE/PHD zinc finger superfamily protein, putative isoform 1 [Theobroma cacao] Length = 539 Score = 324 bits (831), Expect = 8e-86 Identities = 202/520 (38%), Positives = 280/520 (53%), Gaps = 21/520 (4%) Frame = +3 Query: 330 FVPKICSQNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTSYDSSLNFLSCSQVS 509 +V + Q+ + D MPG E+ + P+ D +Q S ++ Y + SQ S Sbjct: 17 YVSEHRKQDSLSDWMPGYETRQMSPKCDVPSQ-------SECKEAEYSCPPLPRTGSQQS 69 Query: 510 TSTIMSGSESPLYVY-HRKKLNTRSPDISLAQTSF------KNKNGDDFHSVVSSEAHSG 668 + ++MS P VY RKK S S A +F +K D SVVSS+A S Sbjct: 70 SVSVMSEGPVPTLVYSRRKKRRGSSSSASAAVANFCAEAPVNSKRSGDCLSVVSSDALSV 129 Query: 669 ATKEHL-VSIETETEGIRSPCKPPSKCSTEGLVSKSGSYNGC--LDREESDEGLRTDTGR 839 A E VS P CS E +SK NG +D SD+ +T + Sbjct: 130 AVMEQNGVSQVGHGNVATGDLLTPLACSREPHISKYEFANGFSGVDNHGSDDVRKTVRQK 189 Query: 840 XXXXXXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISII 1019 K N++L AS+K + D+ GEC SS + +E + E++SE+D C SI+ Sbjct: 190 TIDVDSINDSCSSSKSNMELALASIKGEMDENGECCSSSVIAAEVVREDLSEKDRCFSIL 249 Query: 1020 RSHGMVEIASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSC 1199 R+ G VE P + ++ + S CSR CK+CG S+ + KMLICD CE++FH C Sbjct: 250 RNQGNVEEVGPSRAPLN--EEIGTSGASSCSRVCKICGRSETAQKMLICDNCEEAFHLRC 307 Query: 1200 CTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSNAPENI----------SNENKLVPIADML 1349 C PR+K++P+DEW+C+SC+ ++ +I S+E + PI ML Sbjct: 308 CNPRIKKVPVDEWYCFSCMKKKRIMVKDTTARNSSSITGCMGRCRGVSSEGESSPIELML 367 Query: 1350 RGTKPFKSSVRVGPEYQVEVPDWAGPVIDE-NAIREPDETAELECLSLQGENSNNSLQLS 1526 R +P+++SVR+G +Q +VPDW+GP+ D+ + I EP E LE N N S ++S Sbjct: 368 RDAEPYRTSVRIGKGFQADVPDWSGPIDDDVDTIGEPLEWDLLEFTDFNELNCNKSSKVS 427 Query: 1527 SIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQE 1706 SIGNWLQC+ KWRRAPLFEVQTDDWECF +V WDP+HADC+VPQE Sbjct: 428 SIGNWLQCREFIEGIGGSNGTICG-KWRRAPLFEVQTDDWECFCSVQWDPSHADCSVPQE 486 Query: 1707 LPTDEVLKQLKYIQMLRPRLVAKKRKLDQTKTLDSEVPKE 1826 L TD+VLKQLKYI+MLRPRL AK+RK QT S+ K+ Sbjct: 487 LETDQVLKQLKYIEMLRPRLSAKRRKSHQTMNCTSQDHKD 526 >ref|XP_003527593.1| PREDICTED: 
uncharacterized protein LOC100800660 [Glycine max] Length = 487 Score = 323 bits (829), Expect = 1e-85 Identities = 200/499 (40%), Positives = 279/499 (55%), Gaps = 17/499 (3%) Frame = +3 Query: 351 QNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVE-KTSYDS---SLNFLSCSQVSTST 518 ++ +CDRMP E+W+ + +K+ D K+ E K + D S +S Q ST++ Sbjct: 23 KDVLCDRMPSGETWQVGLKCNKYPLDWCRKAEPVEEDKRNADDPYRSSCLVSFGQPSTAS 82 Query: 519 IMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSIE 698 IM+ + +P VY RKKL + + L T+ + + SV+SS AH + ++ + Sbjct: 83 IMTENTTPNMVYRRKKL-CKDSNFDLGPTNVQASA--NCPSVISSAAHLSSAEDQPTGFQ 139 Query: 699 T--ETEGIRSPCKPPSKCSTEGLVSKSGSYNGCLDREESDEGLRTDTGRXXXXXXXXXXX 872 E E ++ P P + DR D T + Sbjct: 140 VKHEIEMVKDPTMP----------------SVLFDRVAKDS-----THKNLGINSVNDSC 178 Query: 873 XXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASP 1052 K N++ T+ D+TGECSSS +V + E ++E+D CI+I+RSHG+++ SP Sbjct: 179 SSSKPNME-------TEMDETGECSSSI-IVMDCTREEVTEKDFCINILRSHGLLKEDSP 230 Query: 1053 LQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPID 1232 + + S D A+ + CCSR+CK+CG D SL ML+CD CED++H SC PR+K++PID Sbjct: 231 VDNVASGED-AVTTGNNCCSRSCKICGDLDSSLNMLLCDHCEDAYHLSCYNPRLKKLPID 289 Query: 1233 EWFCYSCLXXXXXXXXXXSSNAPENISNE----------NKLVPIADMLRGTKPFKSSVR 1382 EWFC+SCL +P +I NE +L PI MLR TKP+ + VR Sbjct: 290 EWFCHSCLKKRQKILKETVIRSP-SIHNELGKCRTAPVKAELNPILLMLRDTKPYTTGVR 348 Query: 1383 VGPEYQVEVPDWAGPV-IDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXX 1559 VG +Q EV DW+GP+ DE+A+ EP E + E L GEN N +LSSIGNW++CQ Sbjct: 349 VGKGFQAEVLDWSGPMKSDEDALPEPLEISPSEFYKLLGENMRNPTKLSSIGNWIKCQEV 408 Query: 1560 XXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLK 1739 KWRRAPLFEVQTDDW+CF A+ W+P+HADCAVPQEL TD+VLKQLK Sbjct: 409 LDRANETICG----KWRRAPLFEVQTDDWDCFCAIHWNPSHADCAVPQELETDQVLKQLK 464 Query: 1740 YIQMLRPRLVAKKRKLDQT 1796 YI+MLRPRL AK++K D T Sbjct: 465 YIEMLRPRLAAKRKKSDCT 483 >ref|XP_006577888.1| PREDICTED: uncharacterized protein LOC100813057 isoform X2 [Glycine max] gi|571448547|ref|XP_006577889.1| PREDICTED: 
uncharacterized protein LOC100813057 isoform X3 [Glycine max] Length = 497 Score = 314 bits (804), Expect = 1e-82 Identities = 190/488 (38%), Positives = 269/488 (55%), Gaps = 20/488 (4%) Frame = +3 Query: 351 QNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTS-----YDSSLNFLSCSQVSTS 515 ++ +CDRMP E+W+ C + +K+ D K+ E Y SS +S Q ST+ Sbjct: 23 KDVLCDRMPSGETWQVCLKCNKYPLDWCRKAEPVEEDRRNADDPYRSSC-VVSFGQPSTA 81 Query: 516 TIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSI 695 +IM+ + +P VY RKKL S + +L T+ + + SV+SS AH + ++ Sbjct: 82 SIMTENTTPNMVYRRKKLRKDS-NFNLGPTNVQASA--NIPSVISSAAHLSSAEDQPTGF 138 Query: 696 ETE--TEGIRSPCKPPSKCS--TEGLVSKSGSYNGCLDREESDEGLRTDTGRXXXXXXXX 863 + + E ++ P P T+ K+ N D S + Sbjct: 139 QVKHAIEIVKDPTMPSVLFDGVTKDSTHKNLGINSVNDSCSSSK---------------- 182 Query: 864 XXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEI 1043 +++T+ D+TGECSSS +V + E ++E+D C++I+RSHG+++ Sbjct: 183 --------------PNMETEMDETGECSSSSIIVMDCTREEVTEKDFCMNILRSHGLLKE 228 Query: 1044 ASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRI 1223 SP+ + S D+ S CCSR+CK+CG D SL ML+CD CED++H SC PR+K++ Sbjct: 229 NSPVDNVTSGEDAVTTSNN-CCSRSCKICGDLDNSLNMLLCDHCEDAYHLSCYNPRLKKL 287 Query: 1224 PIDEWFCYSCLXXXXXXXXXXSSNAPENISNE----------NKLVPIADMLRGTKPFKS 1373 PIDEWFC+SCL +P +I NE +L PI MLR TKP+ + Sbjct: 288 PIDEWFCHSCLIKRQKILKETVIRSP-SIHNELGKCRTAPVKAELNPILLMLRDTKPYTT 346 Query: 1374 SVRVGPEYQVEVPDWAGPV-IDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQC 1550 VRVG +Q EV DW+GP+ DE+A+ EP E + E L GEN+ N +LSSIGNW++C Sbjct: 347 GVRVGKGFQAEVLDWSGPIKSDEDALPEPLEISPSEFYKLLGENTRNPTKLSSIGNWVKC 406 Query: 1551 QXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLK 1730 Q KWRRAPLFEVQTD WECF A+ WDP+HADCAVPQEL TD+VLK Sbjct: 407 QEIIDRANGTICG----KWRRAPLFEVQTDAWECFCAIHWDPSHADCAVPQELETDQVLK 462 Query: 1731 QLKYIQML 1754 QLKYI+M+ Sbjct: 463 QLKYIEMV 470 >gb|EMJ25408.1| hypothetical protein PRUPE_ppa017202mg, partial [Prunus persica] Length = 442 Score = 313 bits (803), Expect = 1e-82 Identities = 185/453 (40%), Positives 
= 254/453 (56%), Gaps = 16/453 (3%) Frame = +3 Query: 492 SCSQVSTSTIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGD-----DFHSVVSSE 656 S SQ+S +IMS S +P VY R+KL S I ++ + GD +F+ Sbjct: 1 SSSQLSAVSIMSESSAPNLVYKRRKLRGNSVTI-FSEDRGNTRTGDYLSFVNFNVPSEDR 59 Query: 657 AHSGATKEHLVS--IETETEGIRSPCKPPSKCSTEGLVSKSGSYNGCLDREE--SDEGLR 824 ++ K+ LV IE ET+ R+ P C+ E V S S+NGC EE SDE + Sbjct: 60 ENTRTGKKQLVDSHIEHETKATRAS---PHLCNRESHVLNSESFNGCSVGEELISDEAPK 116 Query: 825 TDTGRXXXXXXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDI 1004 + + K N++ +SA ++ GECSSS +V E + + +SE+D+ Sbjct: 117 NNVQKVLEVNSVNDSCSSSKSNMEHLSAVRIMSENENGECSSSSVIVMEAVGD-LSEKDL 175 Query: 1005 CISIIRSHGMVEIASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDS 1184 CISI+RSHG++ + C SA D+ S + C R+CK+C + +LKMLICD CE++ Sbjct: 176 CISILRSHGLLGDVQATRICRSAEDTGTSSSD-SCHRSCKICSRAGTALKMLICDNCEEA 234 Query: 1185 FHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSNAPE-------NISNENKLVPIAD 1343 FH SCC PR+K++P DEWFC+SCL + +P N S++ ++ PI Sbjct: 235 FHMSCCHPRIKKVPFDEWFCHSCLRKKQILKEKVARKSPNITSVMCRNASSKGQVNPILL 294 Query: 1344 MLRGTKPFKSSVRVGPEYQVEVPDWAGPVIDENAIREPDETAELECLSLQGENSNNSLQL 1523 MLR +P+ +SVR G +Q EVPDW+GP+ + EL C N ++ Sbjct: 295 MLRDNEPYATSVRFGKGFQAEVPDWSGPINEY----------ELNC--------NKPSRV 336 Query: 1524 SSIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQ 1703 SSIGNWLQC+ KWRRAPLFEVQTDDWECF ++LWDP+HADC PQ Sbjct: 337 SSIGNWLQCREVVDSANGTICG----KWRRAPLFEVQTDDWECFCSILWDPSHADCNAPQ 392 Query: 1704 ELPTDEVLKQLKYIQMLRPRLVAKKRKLDQTKT 1802 EL TD+VLKQLKYI+ LRPRL AK+ LD TK+ Sbjct: 393 ELGTDQVLKQLKYIETLRPRLSAKRHTLDGTKS 425 >ref|XP_002308115.2| hypothetical protein POPTR_0006s07510g [Populus trichocarpa] gi|550335714|gb|EEE91638.2| hypothetical protein POPTR_0006s07510g [Populus trichocarpa] Length = 718 Score = 309 bits (791), Expect = 4e-81 Identities = 199/484 (41%), Positives = 249/484 (51%), Gaps = 25/484 (5%) Frame = +3 Query: 411 DKHNQDGSGKSVSTVEKTSYDSS---LNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTRS 581 D+H + G + + D S L+ Q+ TS+ MS + +VY R+KL S Sbjct: 226 
DQHERGTGGALMPPPIAYNSDDSQCRLSLQGSPQLPTSSTMSEISARNFVYSRRKLRGNS 285 Query: 582 PDISLAQT-SFKNKNGDDFHSVVSSEAHSGATKE-------HLVSIETETEGIRSPCKPP 737 AQ ++ +D S++SS+ S +E H E T G +PP Sbjct: 286 ATFLSAQVPGITKRSREDCLSIISSDGPSLVVEEARVVSQDHQDQFERGTGGALP--RPP 343 Query: 738 SKCSTEGLVSKSGSYNGCLDREE--SDEGLRTDTGRXXXXXXXXXXXXXXKLNLDLVSAS 911 C E VSKS S +GC E+ SDE + + K N+DLVS S Sbjct: 344 LVCYGEPHVSKSESSSGCSLVEDLVSDEATKKSRPKIIEVDSINDSCSSSKSNMDLVSDS 403 Query: 912 LKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASPLQDCVSAYDSAMP 1091 KT+GDD GECSSS + +E E+ SE D CISI+R G E P + VSA Sbjct: 404 TKTEGDDNGECSSSSIVAAEVTGEDQSENDQCISILRRQGAFEGVWPGKTHVSAKSIGDG 463 Query: 1092 SEE-ICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYSCLXXXX 1268 S SR CK C +KMLICD CEDSFH SCC PRVKRIP+DEW C SC Sbjct: 464 SGSGSSSSRPCKKCFRKGSPVKMLICDNCEDSFHVSCCNPRVKRIPVDEWLCRSCWKKKR 523 Query: 1269 XXXXXXSSNAPENI----------SNENKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDW 1418 S NI S+ + PIA MLR T+P+ VRVG +QV++PDW Sbjct: 524 IIPKETISRKSLNIIGDMGRCRDASSTGESNPIALMLRDTEPYTGGVRVGKGFQVDIPDW 583 Query: 1419 AGPVIDE-NAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXX 1595 +GP+I+ + I +P + L SN S +L SIGNWLQC+ Sbjct: 584 SGPIINVVDIIGKPLVLEPSYFVGLFELKSNKSSKLGSIGNWLQCKQVIDDAAEGGNVTI 643 Query: 1596 XXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQMLRPRLVAK 1775 KWRRAPLFEVQT WECF V WDP HADCA PQEL TDEV+KQ+KYIQMLRPR+ AK Sbjct: 644 CGKWRRAPLFEVQTAVWECFCCVFWDPIHADCAAPQELETDEVMKQIKYIQMLRPRIAAK 703 Query: 1776 KRKL 1787 +KL Sbjct: 704 HQKL 707 >ref|XP_004146372.1| PREDICTED: uncharacterized protein LOC101212408 [Cucumis sativus] Length = 512 Score = 305 bits (780), Expect = 7e-80 Identities = 189/464 (40%), Positives = 253/464 (54%), Gaps = 12/464 (2%) Frame = +3 Query: 399 CPRSDKHNQDGSGKSVSTVEKTSYDSSLNFLSCSQVSTSTIM--SGSESPLYVYHRKKLN 572 CP D+ + DG K+ +E+ L L+ + + IM GS+S + VY RKKL Sbjct: 2 CPHCDEFSHDGCRKA-GRIEEKKNSGGLRCLNFPRTFPTVIMMPEGSKSNV-VYRRKKLR 59 Query: 573 TRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSIET--ETEGIRSPCKPPSKC 746 S LA NG D S++S + + KE + + E E 
+ + P C Sbjct: 60 GSSDSRFLA-------NGTDCISLISCDGNLAEDKEQAAASQHNHEREIVGNAVPPFPVC 112 Query: 747 STEGLVSKSGSYNGCLDREE--SDEGLRTDTGRXXXXXXXXXXXXXXKLNLDLVSASLKT 920 + VS+ S NGC+ E SDE + + K N++LVSASLK Sbjct: 113 DGKTQVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSASLKV 172 Query: 921 DGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASPLQDCVSAYDSAMPSEE 1100 + DDTGECSSS V E++S RD+CISI+RS+G++ + + S + S + Sbjct: 173 EVDDTGECSSSSIQVMGDAIEDISGRDLCISILRSNGLLSSTTHAPEEESDFRS-----D 227 Query: 1101 ICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXX 1280 C R CK CG S+ LKMLICD CED+FH SCC R+KR+ DEW C SCL Sbjct: 228 NNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKRVSNDEWCCNSCLKKNHKILK 287 Query: 1281 XXSSNAPENISNEN-----KLVPIADMLRGTKPFKSSVRVGPEYQVEVPDWAGPVIDE-N 1442 S N S+ N + IA ML+ TKP+ + +R+G +Q EVPDW+GP+ D+ + Sbjct: 288 EAISKKLTNTSSRNGSSKGESNSIALMLKDTKPYTTCIRIGKGFQAEVPDWSGPISDDTD 347 Query: 1443 AIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPL 1622 AI EP E E + +++N +LS+IGNWLQCQ KWRRAPL Sbjct: 348 AIGEPLEMDSSESFRMHEQSTNKPCRLSTIGNWLQCQQVIDGVGGGNGGICG-KWRRAPL 406 Query: 1623 FEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQML 1754 FEVQTDDWECF ++LWDPTHADCAVPQEL T +V KQLKYI+M+ Sbjct: 407 FEVQTDDWECFCSILWDPTHADCAVPQELETGQVSKQLKYIEMV 450 >gb|EOY20034.1| RING/FYVE/PHD zinc finger superfamily protein, putative isoform 3 [Theobroma cacao] Length = 508 Score = 303 bits (776), Expect = 2e-79 Identities = 189/496 (38%), Positives = 265/496 (53%), Gaps = 21/496 (4%) Frame = +3 Query: 330 FVPKICSQNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTVEKTSYDSSLNFLSCSQVS 509 +V + Q+ + D MPG E+ + P+ D +Q S ++ Y + SQ S Sbjct: 17 YVSEHRKQDSLSDWMPGYETRQMSPKCDVPSQ-------SECKEAEYSCPPLPRTGSQQS 69 Query: 510 TSTIMSGSESPLYVY-HRKKLNTRSPDISLAQTSF------KNKNGDDFHSVVSSEAHSG 668 + ++MS P VY RKK S S A +F +K D SVVSS+A S Sbjct: 70 SVSVMSEGPVPTLVYSRRKKRRGSSSSASAAVANFCAEAPVNSKRSGDCLSVVSSDALSV 129 Query: 669 ATKEHL-VSIETETEGIRSPCKPPSKCSTEGLVSKSGSYNGC--LDREESDEGLRTDTGR 839 A E VS P CS E +SK NG +D SD+ +T + Sbjct: 130 
AVMEQNGVSQVGHGNVATGDLLTPLACSREPHISKYEFANGFSGVDNHGSDDVRKTVRQK 189 Query: 840 XXXXXXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISII 1019 K N++L AS+K + D+ GEC SS + +E + E++SE+D C SI+ Sbjct: 190 TIDVDSINDSCSSSKSNMELALASIKGEMDENGECCSSSVIAAEVVREDLSEKDRCFSIL 249 Query: 1020 RSHGMVEIASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSC 1199 R+ G VE P + ++ + S CSR CK+CG S+ + KMLICD CE++FH C Sbjct: 250 RNQGNVEEVGPSRAPLN--EEIGTSGASSCSRVCKICGRSETAQKMLICDNCEEAFHLRC 307 Query: 1200 CTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSNAPENI----------SNENKLVPIADML 1349 C PR+K++P+DEW+C+SC+ ++ +I S+E + PI ML Sbjct: 308 CNPRIKKVPVDEWYCFSCMKKKRIMVKDTTARNSSSITGCMGRCRGVSSEGESSPIELML 367 Query: 1350 RGTKPFKSSVRVGPEYQVEVPDWAGPVIDE-NAIREPDETAELECLSLQGENSNNSLQLS 1526 R +P+++SVR+G +Q +VPDW+GP+ D+ + I EP E LE N N S ++S Sbjct: 368 RDAEPYRTSVRIGKGFQADVPDWSGPIDDDVDTIGEPLEWDLLEFTDFNELNCNKSSKVS 427 Query: 1527 SIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQE 1706 SIGNWLQC+ KWRRAPLFEVQTDDWECF +V WDP+HADC+VPQE Sbjct: 428 SIGNWLQCREFIEGIGGSNGTICG-KWRRAPLFEVQTDDWECFCSVQWDPSHADCSVPQE 486 Query: 1707 LPTDEVLKQLKYIQML 1754 L TD+VLKQLKYI+M+ Sbjct: 487 LETDQVLKQLKYIEMV 502 >ref|XP_004300023.1| PREDICTED: uncharacterized protein LOC101314280 [Fragaria vesca subsp. 
vesca] Length = 615 Score = 301 bits (771), Expect = 7e-79 Identities = 176/444 (39%), Positives = 252/444 (56%), Gaps = 10/444 (2%) Frame = +3 Query: 501 QVSTSTIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKE 680 ++S +IMS S +P +VY R+K+ S I L+ K D S + S+A S A K+ Sbjct: 172 ELSAVSIMSESTAPNFVYKRRKVPENSVTI-LSACDVKYPRTGDCLSFIHSDAPSLAGKD 230 Query: 681 -HLVSIETETEGIRSPCKPPSKCSTEGLVSKSGSYNGCLDREE--SDEGLRTDTGRXXXX 851 H+ S + + G P C+ E V KS S++G EE SD+ + + Sbjct: 231 QHVHSQKGDPNG-------PCLCNRESTVLKSQSFHGFSVNEELVSDKTPKASVPKGLEV 283 Query: 852 XXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHG 1031 K N++ VS S KT+ D+T ECSSS +V E + + +SE+D CISI+R+HG Sbjct: 284 DSVNDSCSSSKSNMENVSTSNKTEVDETAECSSSSAIVMEAVGD-LSEKDFCISILRTHG 342 Query: 1032 MVEIASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPR 1211 ++ + C SA D + C R+CK+C S+ + K+LICD CE++FH SCC PR Sbjct: 343 LLGEVQTTRSCESAEDISTSKS---CQRSCKICNRSETAQKLLICDNCEEAFHMSCCQPR 399 Query: 1212 VKRIPIDEWFCYSCLXXXXXXXXXXSSNAPE-------NISNENKLVPIADMLRGTKPFK 1370 +K++PIDEWFC+SCL P N S+++++ PI MLR T+P++ Sbjct: 400 IKKVPIDEWFCHSCLKEKHTLLNERVRKFPNITSVMSRNASSKDEVNPILLMLRDTEPYR 459 Query: 1371 SSVRVGPEYQVEVPDWAGPVIDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQC 1550 +SVRVG +Q +V DW+GP I+ + I EP E E L +S+ + I NW+QC Sbjct: 460 TSVRVGKGFQADVHDWSGP-INNDGIGEPLELHPSEHARLHELSSSKPSKARPISNWIQC 518 Query: 1551 QXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLK 1730 + KWRRAPLFEVQT++WECF ++LWDP+ ADC VPQEL TDEVLK Sbjct: 519 REVVDPEKGIRCG----KWRRAPLFEVQTNNWECFCSILWDPSQADCNVPQELETDEVLK 574 Query: 1731 QLKYIQMLRPRLVAKKRKLDQTKT 1802 QLKY++ LRPRL A++ L + + Sbjct: 575 QLKYVETLRPRLSAQQHNLGRANS 598 >ref|XP_002324693.1| ELM2 domain-containing family protein [Populus trichocarpa] gi|222866127|gb|EEF03258.1| ELM2 domain-containing family protein [Populus trichocarpa] Length = 714 Score = 298 bits (763), Expect = 6e-78 Identities = 191/457 (41%), Positives = 242/457 (52%), Gaps = 15/457 (3%) Frame = +3 Query: 462 
TSYDS-SLNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTRSPDISLAQT-SFKNKNGDDF 635 T Y S L+ Q+ T + MS + +VY R+K+ S AQ ++ D Sbjct: 251 TVYSSCQLSLQRSPQLPTFSTMSEISASKFVYSRRKMRGNSVTFLSAQVPGITKRSRQDC 310 Query: 636 HSVVSSEAHSGATKEHLVSIETETEGIRSPCKPPSKCSTEGLVSKSGSYNGC--LDREES 809 SVVSS+ S A +E V + + E S C S + E VSKS S +GC ++ + S Sbjct: 311 LSVVSSDGPSLAVEEACVVSQDQHE---SGC---SLQNGEPHVSKSESSSGCSLVEDQVS 364 Query: 810 DEGLRTDTGRXXXXXXXXXXXXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENM 989 DE + + K +++LVSAS KT+G D GECSSS + +E E+ Sbjct: 365 DEASKKSRPKIIEVDGVNDSCSSSKSDVELVSASTKTEGHDNGECSSSTVMAAEFAREDQ 424 Query: 990 SERDICISIIRSHGMVEIASPLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICD 1169 SE+ CISI+ + P + SA S SR+CK C + KMLICD Sbjct: 425 SEKHRCISILGKQRAFDGIWPGKTRASARRIGDGSGS-SSSRSCKKCFLKESPAKMLICD 483 Query: 1170 QCEDSFHTSCCTPRVKRIPIDEWFCYSCLXXXXXXXXXXSSNAPENI----------SNE 1319 CEDSFH SCC P VKRIPIDEW C SC+ S P NI S+ Sbjct: 484 NCEDSFHVSCCNPHVKRIPIDEWLCRSCMKKKRIIPNERISRKPLNIIGDMGRCRDASSI 543 Query: 1320 NKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDWAGPVI-DENAIREPDETAELECLSLQG 1496 + PIA ML T+P+ VRVG +QVEVPDW+GP+I D + I +P +SL Sbjct: 544 GESDPIALMLTDTEPYTGGVRVGKGFQVEVPDWSGPIINDVDTIGKPVVLDTSYFVSLHE 603 Query: 1497 ENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDP 1676 N + SIGNWLQC+ KWRRAPLFEVQTDDWECF V WDP Sbjct: 604 LKYNKPSKFGSIGNWLQCRQVIDDAAEGGNVTICGKWRRAPLFEVQTDDWECFCCVFWDP 663 Query: 1677 THADCAVPQELPTDEVLKQLKYIQMLRPRLVAKKRKL 1787 HADCA PQEL TDEV+KQLKYIQMLRP++ AK++KL Sbjct: 664 IHADCATPQELETDEVMKQLKYIQMLRPQIAAKRQKL 700 Score = 73.2 bits (178), Expect = 4e-10 Identities = 55/161 (34%), Positives = 77/161 (47%), Gaps = 6/161 (3%) Frame = +3 Query: 285 MLRLSLLPDPAESTPF-VPKICSQNFICDRMPGDESWEGCPRSDKHNQDGSGKSVSTV-- 455 ML S LP+ ESTPF V QN C+ PG E W+ P+ +H D ++ S Sbjct: 1 MLIPSSLPNSIESTPFHVSDHGKQNLFCEVTPGTEIWQMGPKCHEHCHDCCKEAASITGE 60 Query: 456 -EKTSYDSSLNFLSCSQVSTSTIMSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDD 632 E +Y L+F T++ MS S +P +VY R+KL + D A T +G+D Sbjct: 61 
KEGNNYSCLLSFPRSPHPPTTSKMSESSAPNFVYSRRKLQGNTIDFLSAIT---EGSGED 117 Query: 633 FHSVVSSEAHSGATKEHLVSIET--ETEGIRSPCKPPSKCS 749 V++S+ S KEH V E ETE +R P C+ Sbjct: 118 CPYVINSDGSSVPVKEHHVGSEDEHETEAVRESLMSPLICN 158 >ref|XP_006577890.1| PREDICTED: uncharacterized protein LOC100813057 isoform X4 [Glycine max] Length = 414 Score = 287 bits (735), Expect = 1e-74 Identities = 170/426 (39%), Positives = 237/426 (55%), Gaps = 15/426 (3%) Frame = +3 Query: 522 MSGSESPLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVSIET 701 M+ + +P VY RKKL S + +L T+ + + SV+SS AH + ++ + Sbjct: 1 MTENTTPNMVYRRKKLRKDS-NFNLGPTNVQASA--NIPSVISSAAHLSSAEDQPTGFQV 57 Query: 702 E--TEGIRSPCKPPSKCS--TEGLVSKSGSYNGCLDREESDEGLRTDTGRXXXXXXXXXX 869 + E ++ P P T+ K+ N D S + Sbjct: 58 KHAIEIVKDPTMPSVLFDGVTKDSTHKNLGINSVNDSCSSSK------------------ 99 Query: 870 XXXXKLNLDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIAS 1049 +++T+ D+TGECSSS +V + E ++E+D C++I+RSHG+++ S Sbjct: 100 ------------PNMETEMDETGECSSSSIIVMDCTREEVTEKDFCMNILRSHGLLKENS 147 Query: 1050 PLQDCVSAYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPI 1229 P+ + S D+ S CCSR+CK+CG D SL ML+CD CED++H SC PR+K++PI Sbjct: 148 PVDNVTSGEDAVTTSNN-CCSRSCKICGDLDNSLNMLLCDHCEDAYHLSCYNPRLKKLPI 206 Query: 1230 DEWFCYSCLXXXXXXXXXXSSNAPENISNE----------NKLVPIADMLRGTKPFKSSV 1379 DEWFC+SCL +P +I NE +L PI MLR TKP+ + V Sbjct: 207 DEWFCHSCLIKRQKILKETVIRSP-SIHNELGKCRTAPVKAELNPILLMLRDTKPYTTGV 265 Query: 1380 RVGPEYQVEVPDWAGPV-IDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQX 1556 RVG +Q EV DW+GP+ DE+A+ EP E + E L GEN+ N +LSSIGNW++CQ Sbjct: 266 RVGKGFQAEVLDWSGPIKSDEDALPEPLEISPSEFYKLLGENTRNPTKLSSIGNWVKCQE 325 Query: 1557 XXXXXXXXXXXXXXXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQL 1736 KWRRAPLFEVQTD WECF A+ WDP+HADCAVPQEL TD+VLKQL Sbjct: 326 IIDRANGTICG----KWRRAPLFEVQTDAWECFCAIHWDPSHADCAVPQELETDQVLKQL 381 Query: 1737 KYIQML 1754 KYI+M+ Sbjct: 382 KYIEMV 387 >gb|ESW09310.1| hypothetical protein PHAVU_009G117100g [Phaseolus vulgaris] gi|561010404|gb|ESW09311.1| hypothetical protein 
PHAVU_009G117100g [Phaseolus vulgaris] gi|561010405|gb|ESW09312.1| hypothetical protein PHAVU_009G117100g [Phaseolus vulgaris] gi|561010406|gb|ESW09313.1| hypothetical protein PHAVU_009G117100g [Phaseolus vulgaris] Length = 402 Score = 280 bits (716), Expect = 2e-72 Identities = 165/431 (38%), Positives = 233/431 (54%), Gaps = 8/431 (1%) Frame = +3 Query: 540 PLYVYHRKKLNTRSPDISLAQTSFKNKNGDDFHSVVSSEAHSGATKEHLVS--IETETEG 713 P VY RKKL R + L T+ + ++ S +SS AH + +E + E E Sbjct: 9 PKMVYRRKKLR-RDSNFKLEPTNMQASA--NYPSAISSAAHLSSAEEQPAGFRVNHEIEV 65 Query: 714 IRSPCKPPSKCSTEGLVSKSGSYN-GCLDREESDEGLRTDTGRXXXXXXXXXXXXXXKLN 890 +++P P +G+ S N G +S + D Sbjct: 66 VKNPTTP--SVLFDGVAKDSTHKNLGISSVNDSCSSSKPD-------------------- 103 Query: 891 LDLVSASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASPLQDCVS 1070 ++T+ D GECSSS +V + E ++E+D CI+ +R+HG++ SP + S Sbjct: 104 -------METEMDGNGECSSSSVIVMDSTREEVTEKDFCINTLRTHGLLREYSPEDNVAS 156 Query: 1071 AYDSAMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYS 1250 D+A+ CCSR+CK+C D SL ML+CD CED++H SC PR K++PIDEWFC+S Sbjct: 157 GEDAAITGNS-CCSRSCKICDRLDSSLNMLLCDHCEDAYHPSCYNPRSKKLPIDEWFCHS 215 Query: 1251 CLXXXXXXXXXXSSNAP----ENISNENKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDW 1418 CL S + S + + PI MLR T P ++VRVG +Q EV DW Sbjct: 216 CLKKRQKNLKEPSIHNELGKCRATSVKGEANPILLMLRDTAPHTTAVRVGKGFQAEVLDW 275 Query: 1419 AGPV-IDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXX 1595 +GP+ +E+A+ EP E + L EN+ + +LS+IGNW+QCQ Sbjct: 276 SGPIKSEEDALPEPFEINPSDFNRLLEENTRSPTKLSAIGNWIQCQAVIDRANGTICG-- 333 Query: 1596 XXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQMLRPRLVAK 1775 KWRRAP FEVQTD WECF A+ WDP+HADCA PQEL T++VLKQLKY+++LRPR+ AK Sbjct: 334 --KWRRAPFFEVQTDVWECFCAIHWDPSHADCAAPQELETEQVLKQLKYVEVLRPRVAAK 391 Query: 1776 KRKLDQTKTLD 1808 ++K D T+ D Sbjct: 392 RKKSDCTQNKD 402 >gb|EPS72079.1| hypothetical protein M569_02681, partial [Genlisea aurea] Length = 277 Score = 222 bits (566), Expect = 4e-55 Identities = 115/276 (41%), Positives = 165/276 (59%), Gaps = 9/276 (3%) Frame = 
+3 Query: 903 SASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMVEIASPLQDCVSAYDS 1082 +ASLK DD GECSSSG S MSE+DICI+I+++ GM++ A + + S+ + Sbjct: 10 AASLKVIVDDGGECSSSGAGKSLE----MSEKDICIAILKNQGMLDEACRVNERASS-GT 64 Query: 1083 AMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYSCLXX 1262 S + C ++CK+C D + MLICD C+D+FH SCCTPR+K +P+ EW C SCL Sbjct: 65 TETSSDYYCWKSCKICDRRDSTKNMLICDTCDDAFHRSCCTPRIKILPVGEWLCNSCLKL 124 Query: 1263 XXXXXXXXSSNAPEN-----ISNENKLVPIADMLRGTKPFKSSVRVGPEYQVEVPDWAGP 1427 ++ EN +++ +L + ML T+P+ S+VR+G +YQ +VP W GP Sbjct: 125 KRMELKHKCTSNGENGVIDPTASDTELSSLRYMLWDTEPYMSNVRIGDKYQADVPVWHGP 184 Query: 1428 VIDENAIREPD----ETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXX 1595 +N + +P + + ++ Q +S ++L+SIGNW+QC Sbjct: 185 ---DNEVYDPPGVPLDIDPSDFINSQETDSFQPVKLNSIGNWIQCHESIDCPGEGSDRVI 241 Query: 1596 XXKWRRAPLFEVQTDDWECFRAVLWDPTHADCAVPQ 1703 KWRRAPLFEVQTD+WECFR +LWDP+HADCAVPQ Sbjct: 242 CGKWRRAPLFEVQTDNWECFRCILWDPSHADCAVPQ 277 >ref|XP_004967630.1| PREDICTED: CHD3-type chromatin-remodeling factor PICKLE-like isoform X6 [Setaria italica] Length = 437 Score = 222 bits (565), Expect = 6e-55 Identities = 120/294 (40%), Positives = 166/294 (56%), Gaps = 6/294 (2%) Frame = +3 Query: 906 ASLKTDGDDTGECSSSGGLVSERLWENMSERDICISIIRSHGMV-EIASPLQDCVSAYDS 1082 +S+ D D ECSSS +E + E+MS RD+CI+I++ G++ E + ++D + D Sbjct: 146 SSMVLDKKDAAECSSSNISPTEPITEHMSPRDLCIAILKKDGLITESRARIKDDFTDSD- 204 Query: 1083 AMPSEEICCSRTCKVCGCSDPSLKMLICDQCEDSFHTSCCTPRVKRIPIDEWFCYSCLXX 1262 A P C CGC + SLKMLICD CE +FH SCC P +K +P DEW+C CL Sbjct: 205 ANPM------LACNTCGCLEHSLKMLICDSCEAAFHLSCCAPPIKELPTDEWYCAPCLCK 258 Query: 1263 XXXXXXXXSSNA---PENISNENK--LVPIADMLRGTKPFKSSVRVGPEYQVEVPDWAGP 1427 S P +N+ + I ML+ +P+ + VR+G ++Q EVP+W+G Sbjct: 259 KPKSVYGKLSEGKVLPSRNTNQRPHGMSHIDYMLKDAEPYVTGVRIGRDFQAEVPEWSGS 318 Query: 1428 VIDENAIREPDETAELECLSLQGENSNNSLQLSSIGNWLQCQXXXXXXXXXXXXXXXXKW 1607 + EP E E + ++N Q SSIGNW+QC+ KW Sbjct: 319 TSSDGYFDEPSEFDPAELTNFNLCKTSNQSQ-SSIGNWIQCRETLNPGDSDKQVVCG-KW 376 Query: 1608 
RRAPLFEVQTDDWECFRAVLWDPTHADCAVPQELPTDEVLKQLKYIQMLRPRLV 1769 RRAPL+ VQTDDWECF +LWDP HADCAVPQEL T EV KQL+++ M++ +LV Sbjct: 377 RRAPLYVVQTDDWECFCCLLWDPAHADCAVPQELKTSEVQKQLRFVNMVKKQLV 430