BLASTX nr result
ID: Akebia24_contig00004828
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00004828 (1180 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value
ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257... 211 6e-52
ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prun... 169 2e-39
ref|XP_006838218.1| hypothetical protein AMTR_s00106p00155270 [A... 166 2e-38
ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citr... 164 9e-38
ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citr... 164 9e-38
ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283... 163 1e-37
ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283... 163 1e-37
ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma... 163 1e-37
ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma... 163 1e-37
ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [... 163 1e-37
ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma... 163 1e-37
ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218... 141 6e-31
ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Popu... 135 3e-29
gb|EXC34985.1| hypothetical protein L484_014712 [Morus notabilis] 130 1e-27
ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6... 130 1e-27
ref|XP_006575186.1| PREDICTED: protein starmaker-like isoform X5... 130 1e-27
ref|XP_006575183.1| PREDICTED: protein starmaker-like isoform X2... 130 1e-27
ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1... 129 2e-27
gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus...
127 1e-26 ref|XP_006589006.1| PREDICTED: arginine/serine-rich coiled-coil ... 118 4e-24 >ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257160 [Vitis vinifera] gi|297739954|emb|CBI30136.3| unnamed protein product [Vitis vinifera] Length = 510 Score = 211 bits (536), Expect = 6e-52 Identities = 161/453 (35%), Positives = 210/453 (46%), Gaps = 64/453 (14%) Frame = +1 Query: 13 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD +L P D ADA+ +FRKP+NDA NRKYRRR H+ + SP F + Sbjct: 1 MDSSLKSPPRDKADAKTAFRKPTNDATNRKYRRRSPTSGSSSSGGSPI-HEHNSSPIFSK 59 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 ++ K+SD +RR+ GREL+ Q Sbjct: 60 EDSEKVSDRRQRRKGDGRELDRDAGRSQYRKTADSYRHSDRQSSRSSRGHYRYDDHVRQE 119 Query: 370 XXA-DGGER-RYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPD---- 528 A D G+R + +SD RQESE+ R RDY + DKY+RDK D Sbjct: 120 KHAADEGDRDHHNLSSRSGRESRVGNYSDHVRQESEHSRTRDYFRGTDKYSRDKHDNAGY 179 Query: 529 -----------------------CDRHGR-RRLINSNLDEVKIGEE-RHNSXXXXXXXXX 633 DR G RR NSN ++ K GE+ +H Sbjct: 180 RSKDKEKETSSLEHQKYKDKDLSSDRAGSGRRHTNSNFEDSKAGEQDKHLRDGDGPDERK 239 Query: 634 XXXXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERR 813 LGDYK+D S +ESRGH DST+ R++G E K+ KE+DGQK+ E++ Sbjct: 240 DYRRGLGDYKSDRSISHEESRGHRNDSTSGRDSGGYRSKEVHKNEPKEVDGQKQPKDEKK 299 Query: 814 KH----------------------------EDNEFLVKKPKLCNADEGTG-GKIISKF-T 903 K+ E+ E KKPKL + ++ T GK +S+F T Sbjct: 300 KYDEWKTDRHKDRYNRESREQFEDKTVVASENQESAAKKPKLVSLEKSTDYGKDVSRFST 359 Query: 904 CAAD-ETPSSSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGG 1080 AD + SSSK Q+I DKV PE A + +E + DLN ELVNRNLVG Sbjct: 360 AVADMKQSSSSKLAQDIADKVTPEHAFLNNSEVAN--DLNAAKIAAMKAAELVNRNLVGV 417 Query: 1081 GYMSTDQKKKILWGNKKTSAAEESVHHWDMPLF 1179 GYMS DQKKK+LWG+KK++ AEES HHWD LF Sbjct: 418 GYMSADQKKKLLWGSKKSTTAEESGHHWDTALF 450 >ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prunus persica] gi|462397492|gb|EMJ03160.1| hypothetical protein PRUPE_ppa004686mg [Prunus persica] Length = 496 
Score = 169 bits (429), Expect = 2e-39 Identities = 135/438 (30%), Positives = 193/438 (44%), Gaps = 61/438 (13%) Frame = +1 Query: 49 DARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPRRR 228 DA+ +FRKP+ DAANRKYRRR H+ + SP+ R++P K+S+ RR Sbjct: 13 DAKTAFRKPATDAANRKYRRRSPVGGSSPSDGSPM-HEHNCSPKNSREDPGKVSEYQTRR 71 Query: 229 ENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGGERRYQXX 408 + GRELE Q AD ++ YQ Sbjct: 72 RDDGRELERDSNRRYYGRSSDSYRHSDRQSSRSLHGYYKHDDCIKHDKHADEEDKNYQKL 131 Query: 409 XXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDK--------PDCDRH-------- 540 S + + R+Y +++DKY+RDK D DR Sbjct: 132 SSRSGR-----ESRGSAYYDHIKSREYSRNLDKYSRDKYDGSGYRNKDKDRESSFPENQK 186 Query: 541 -------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXXSLGDYKNDHISS 681 GRR + + +E++ +RH + GDY ++ I S Sbjct: 187 YKDKDSSSQRVGSGRR---HGHFEEMERERDRHALDRDVQDEKKDYRRNSGDYISERIFS 243 Query: 682 FDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF--------- 834 ++ES+G DS + R+ G++ + E KS KELD + +R+K++D E Sbjct: 244 YEESKGQRSDSISRRDEGKHRMKEGYKSELKELDDDNVSKEQRKKYDDKETSWGNRITRE 303 Query: 835 ------------------LVKKPKLCNADEGTGG-KIISKFTCAAD-ETPSSSKQVQEIV 954 K+PKL ++++G G K +SKFT AD SSSKQVQE Sbjct: 304 TSERSADKHYIKSENQESTAKRPKLFSSEKGIDGRKDVSKFTTTADGRESSSSKQVQE-- 361 Query: 955 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGG---YMSTDQKKKILWGN 1125 D++ E Q AN+A +A D+N ELVNRNL+G G M+ DQKKK+LWGN Sbjct: 362 DEMTTEKTQ--ANDAEAANDINAAKVAALKAAELVNRNLIGAGPVGCMTADQKKKLLWGN 419 Query: 1126 KKTSAAEESVHHWDMPLF 1179 KK++ AEE H WD LF Sbjct: 420 KKSTTAEEVGHRWDSTLF 437 >ref|XP_006838218.1| hypothetical protein AMTR_s00106p00155270 [Amborella trichopoda] gi|548840676|gb|ERN00787.1| hypothetical protein AMTR_s00106p00155270 [Amborella trichopoda] Length = 532 Score = 166 bits (420), Expect = 2e-38 Identities = 142/474 (29%), Positives = 191/474 (40%), Gaps = 85/474 (17%) Frame = +1 Query: 13 MDPNL-SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXX-NHDRSPSPEFH 186 MD L S SPD + +PSFRKPSNDA RKYR+R H S SP Sbjct: 1 
MDSGLVSYSPDPVEPKPSFRKPSNDAFQRKYRKRSPTSGSASPLSSGSPQHSHSYSPNIS 60 Query: 187 RDNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXX 366 + K+++D R R + RE+E Sbjct: 61 MEEAGKVTNDQRTRMDEEREVERDSSHHRSGKGSDSYGKG--SDVYGDNDRHSRGITQGY 118 Query: 367 XXXADGGERRYQXXXXXXXXXXXXTHSDCTRQ-----ESEYERRDYQQHVDKYNRDKPD- 528 D + + Q S TR +EYE+RD + NR PD Sbjct: 119 RRHDDSSKHQSQHRREVEERSSQRYSSRITRDLEGSSHAEYEKRDRDSDNFRDNRRNPDK 178 Query: 529 ------CDRHGRRRL---------------INSNLDEVKIGE-ERHNSXXXXXXXXXXXX 642 D GRR+ N+N++ K+GE ER+ Sbjct: 179 PPRDRKIDDEGRRKERDSATQGRYRDIDKPANTNMEREKMGERERYRDRGEGRDDYRDYR 238 Query: 643 XSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHE 822 SLGD + D +SS++ SRG+ +DS + R++G E +SS++E + + ++RR+ + Sbjct: 239 KSLGDTRRDRVSSYEGSRGYARDSASGRDSGSRHSREIHRSSNRESERHIEDKVQRRRGD 298 Query: 823 DNE-------------------------------------------------FLVKKPKL 855 D + KK KL Sbjct: 299 DESDRYKNKDSYNRESDDHSRGYSRSSSDYRDRSFRNGRSEDKNVHAVDDEASVGKKCKL 358 Query: 856 CNADEGTGGKI-----ISKFTCAADETPSSS-KQVQEIVDKVIPEPAQSSANEAVSACDL 1017 +AD+ +G TC AD+ S S KQ+QE V K EP QSSANEA A DL Sbjct: 359 FDADKSSGDATDRHLPSKSSTCVADDKSSLSLKQLQEPVPKETLEPVQSSANEAKIAQDL 418 Query: 1018 NXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLF 1179 N +VNRNLVGG Y+STD+KKK+LWGNKKTSAAEES WD +F Sbjct: 419 NAAKVAAMKAAGIVNRNLVGGSYLSTDEKKKLLWGNKKTSAAEESGTRWDTAMF 472 >ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|557532607|gb|ESR43790.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] Length = 538 Score = 164 bits (414), Expect = 9e-38 Identities = 133/432 (30%), Positives = 179/432 (41%), Gaps = 49/432 (11%) Frame = +1 Query: 28 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKI 207 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+ + Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP-KRDHNASPIYSRDDPSNV 62 Query: 208 SDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGG 387 + +RR++ REL+ Q + Sbjct: 63 PEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHENDE 
122 Query: 388 ERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDRHGRR----- 549 +R YQ S R+ ++ R +DY ++ +RDK D HG + Sbjct: 123 DRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSRDKYDVIGHGSKDKEKE 171 Query: 550 -----RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDYKN 666 R N + D ++ + H S GD++N Sbjct: 172 SSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRN 231 Query: 667 DHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL--- 837 D ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 232 DRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNRDR 291 Query: 838 -------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDK 960 KK + N D+G AA SSS Q Q+I D Sbjct: 292 DRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIGDT 346 Query: 961 VIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSA 1140 AQS AN+AV A DL+ ELVN+NLVGG YMSTDQKKK+LWGNKK++ Sbjct: 347 --DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKSTP 403 Query: 1141 AEESVHHWDMPL 1176 EES WD L Sbjct: 404 VEESARRWDTAL 415 >ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|567875919|ref|XP_006430549.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|557532605|gb|ESR43788.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|557532606|gb|ESR43789.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] Length = 482 Score = 164 bits (414), Expect = 9e-38 Identities = 133/432 (30%), Positives = 179/432 (41%), Gaps = 49/432 (11%) Frame = +1 Query: 28 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKI 207 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+ + Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP-KRDHNASPIYSRDDPSNV 62 Query: 208 SDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGG 387 + +RR++ REL+ Q + Sbjct: 63 PEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHENDE 122 Query: 388 ERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDRHGRR----- 549 +R YQ S R+ ++ R +DY ++ +RDK D HG + Sbjct: 123 
DRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSRDKYDVIGHGSKDKEKE 171 Query: 550 -----RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDYKN 666 R N + D ++ + H S GD++N Sbjct: 172 SSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRN 231 Query: 667 DHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL--- 837 D ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 232 DRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNRDR 291 Query: 838 -------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDK 960 KK + N D+G AA SSS Q Q+I D Sbjct: 292 DRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIGDT 346 Query: 961 VIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSA 1140 AQS AN+AV A DL+ ELVN+NLVGG YMSTDQKKK+LWGNKK++ Sbjct: 347 --DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKSTP 403 Query: 1141 AEESVHHWDMPL 1176 EES WD L Sbjct: 404 VEESARRWDTAL 415 >ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X2 [Citrus sinensis] Length = 482 Score = 163 bits (413), Expect = 1e-37 Identities = 133/432 (30%), Positives = 179/432 (41%), Gaps = 49/432 (11%) Frame = +1 Query: 28 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKI 207 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+K+ Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP-KCDHNASPIYSRDDPSKV 62 Query: 208 SDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGG 387 + +RR++ REL+ Q + Sbjct: 63 PEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHENDE 122 Query: 388 ERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDRHGRR----- 549 +R YQ S R+ ++ R +DY ++ + DK D HG + Sbjct: 123 DRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSHDKYDVIGHGSKDKEKE 171 Query: 550 -----RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDYKN 666 R N + D ++ + H S GD++N Sbjct: 172 SSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRN 231 Query: 667 DHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL--- 837 D ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 232 
DRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYRDR 291 Query: 838 -------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDK 960 KK + N D+G AA SSS Q Q+I D Sbjct: 292 DRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIGDT 346 Query: 961 VIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSA 1140 AQS AN+AV A DL+ ELVN+NLVGG YMSTDQKKK+LWGNKK++ Sbjct: 347 --DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKSTP 403 Query: 1141 AEESVHHWDMPL 1176 EES WD L Sbjct: 404 VEESARRWDTAL 415 >ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X1 [Citrus sinensis] Length = 538 Score = 163 bits (413), Expect = 1e-37 Identities = 133/432 (30%), Positives = 179/432 (41%), Gaps = 49/432 (11%) Frame = +1 Query: 28 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKI 207 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+K+ Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP-KCDHNASPIYSRDDPSKV 62 Query: 208 SDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGG 387 + +RR++ REL+ Q + Sbjct: 63 PEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHENDE 122 Query: 388 ERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDRHGRR----- 549 +R YQ S R+ ++ R +DY ++ + DK D HG + Sbjct: 123 DRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSHDKYDVIGHGSKDKEKE 171 Query: 550 -----RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDYKN 666 R N + D ++ + H S GD++N Sbjct: 172 SSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRN 231 Query: 667 DHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL--- 837 D ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 232 DRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYRDR 291 Query: 838 -------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDK 960 KK + N D+G AA SSS Q Q+I D Sbjct: 292 DRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIGDT 346 Query: 961 VIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSA 1140 AQS AN+AV A DL+ ELVN+NLVGG YMSTDQKKK+LWGNKK++ Sbjct: 347 
--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKSTP 403 Query: 1141 AEESVHHWDMPL 1176 EES WD L Sbjct: 404 VEESARRWDTAL 415 >ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508716957|gb|EOY08854.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 462 Score = 163 bits (413), Expect = 1e-37 Identities = 138/444 (31%), Positives = 191/444 (43%), Gaps = 55/444 (12%) Frame = +1 Query: 13 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP R Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSPQRDRSVSPILSR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 D+ AK +D R+ GREL+ Q Sbjct: 61 DDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVRHD 118 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPD------ 528 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D Sbjct: 119 KFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGHRI 178 Query: 529 ---------------------CDRHGRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXX 645 DR G R S+ E ++ +R Sbjct: 179 RDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDYHR 237 Query: 646 SLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERRKH 819 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER KH Sbjct: 238 SSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERMKH 296 Query: 820 EDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPSSS 933 ++ E ++K + ++ + K + F+ + ADE SS Sbjct: 297 DEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSL 356 Query: 934 KQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGY--MSTDQKK 1107 +Q +E +V Q+ N+ D+N ELVNRNL+G G+ M+T+QKK Sbjct: 357 EQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKK 414 Query: 1108 KILWGNKKTSAAEESVHHWDMPLF 1179 K+LWG+KK++ AEES H WD LF Sbjct: 415 KLLWGSKKSTPAEESGHRWDTALF 438 >ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508716956|gb|EOY08853.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 464 
Score = 163 bits (413), Expect = 1e-37 Identities = 138/444 (31%), Positives = 191/444 (43%), Gaps = 55/444 (12%) Frame = +1 Query: 13 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP R Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSPQRDRSVSPILSR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 D+ AK +D R+ GREL+ Q Sbjct: 61 DDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVRHD 118 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPD------ 528 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D Sbjct: 119 KFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGHRI 178 Query: 529 ---------------------CDRHGRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXX 645 DR G R S+ E ++ +R Sbjct: 179 RDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDYHR 237 Query: 646 SLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERRKH 819 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER KH Sbjct: 238 SSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERMKH 296 Query: 820 EDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPSSS 933 ++ E ++K + ++ + K + F+ + ADE SS Sbjct: 297 DEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSL 356 Query: 934 KQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGY--MSTDQKK 1107 +Q +E +V Q+ N+ D+N ELVNRNL+G G+ M+T+QKK Sbjct: 357 EQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKK 414 Query: 1108 KILWGNKKTSAAEESVHHWDMPLF 1179 K+LWG+KK++ AEES H WD LF Sbjct: 415 KLLWGSKKSTPAEESGHRWDTALF 438 >ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508716955|gb|EOY08852.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 473 Score = 163 bits (413), Expect = 1e-37 Identities = 138/444 (31%), Positives = 191/444 (43%), Gaps = 55/444 (12%) Frame = +1 Query: 13 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP R Sbjct: 1 
MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSPQRDRSVSPILSR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 D+ AK +D R+ GREL+ Q Sbjct: 61 DDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVRHD 118 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPD------ 528 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D Sbjct: 119 KFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGHRI 178 Query: 529 ---------------------CDRHGRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXX 645 DR G R S+ E ++ +R Sbjct: 179 RDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDYHR 237 Query: 646 SLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERRKH 819 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER KH Sbjct: 238 SSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERMKH 296 Query: 820 EDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPSSS 933 ++ E ++K + ++ + K + F+ + ADE SS Sbjct: 297 DEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSL 356 Query: 934 KQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGY--MSTDQKK 1107 +Q +E +V Q+ N+ D+N ELVNRNL+G G+ M+T+QKK Sbjct: 357 EQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKK 414 Query: 1108 KILWGNKKTSAAEESVHHWDMPLF 1179 K+LWG+KK++ AEES H WD LF Sbjct: 415 KLLWGSKKSTPAEESGHRWDTALF 438 >ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590634353|ref|XP_007028353.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508716954|gb|EOY08851.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508716958|gb|EOY08855.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 504 Score = 163 bits (413), Expect = 1e-37 Identities = 138/444 (31%), Positives = 191/444 (43%), Gaps = 55/444 (12%) Frame = +1 Query: 13 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP R Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSPQRDRSVSPILSR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 
369 D+ AK +D R+ GREL+ Q Sbjct: 61 DDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVRHD 118 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPD------ 528 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D Sbjct: 119 KFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGHRI 178 Query: 529 ---------------------CDRHGRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXX 645 DR G R S+ E ++ +R Sbjct: 179 RDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDYHR 237 Query: 646 SLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERRKH 819 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER KH Sbjct: 238 SSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERMKH 296 Query: 820 EDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPSSS 933 ++ E ++K + ++ + K + F+ + ADE SS Sbjct: 297 DEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSL 356 Query: 934 KQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGY--MSTDQKK 1107 +Q +E +V Q+ N+ D+N ELVNRNL+G G+ M+T+QKK Sbjct: 357 EQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKK 414 Query: 1108 KILWGNKKTSAAEESVHHWDMPLF 1179 K+LWG+KK++ AEES H WD LF Sbjct: 415 KLLWGSKKSTPAEESGHRWDTALF 438 >ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218861 [Cucumis sativus] Length = 472 Score = 141 bits (355), Expect = 6e-31 Identities = 118/416 (28%), Positives = 177/416 (42%), Gaps = 41/416 (9%) Frame = +1 Query: 55 RPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPRRREN 234 + FRKPS++ A RKYRRR DRS SP+ RD+ +K S+ RR+ Sbjct: 7 KAEFRKPSSETAGRKYRRRSSVSGSSSDESP--KRDRSSSPKLLRDDASKHSERKPRRKE 64 Query: 235 GGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGGERRYQXXXX 414 R+L + ER Y+ Sbjct: 65 DERDLNKDSRNHHSRSSDSYRYSDRKSSRSLHGYSRHDDYVRHDKYADE--ERDYERLSS 122 Query: 415 XXXXXXXXT-HSDCTRQESEYER-RDYQQHVDKYNRDKPDC------------DRHGRRR 552 + H D TR+ESE+ R R+Y + V+K +RDK D +RHG Sbjct: 123 RSNRESKGSAHYDHTRRESEHSRSREYFRDVEKGSRDKYDASGHRSRDGDSLSERHGSGS 182 Query: 553 LINSNLDEVKIGEERHNSXXXXXXXXXXXXXSLGDYKNDHISSFDESRGHGKDSTAAREN 732 +++ +E++ + 
R+ GDYKN+ + S D+ RG+ DS R+ Sbjct: 183 RRHASFEEME--KHRNARDRDGQDEKRDNIKHSGDYKNERVLSHDDGRGNRYDSLLGRDE 240 Query: 733 GRNGLTETRKSSSKELDGQKRNVMERRKHEDNE--------------------------- 831 ++ + K+ K+LD +K + E RKH+ E Sbjct: 241 SKHRTKDINKNDRKDLDDEKSS-KEERKHDARETHWDKVQGKESKGKYDGKGVFVDENQG 299 Query: 832 FLVKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDKVIPEPAQSSANEAVSAC 1011 KKPKL ++ GK ++ A + S+SK+ Q+ + Q + ++ A Sbjct: 300 LPAKKPKLFSS-----GKEVNHEEDADENQSSTSKKEQDGKMSL----GQGQSGDSDFAA 350 Query: 1012 DLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLF 1179 D + ELVN+NLVGGGYM+TDQKKK+LWG+KK++A EES H WD LF Sbjct: 351 DFSAAKVAAMKAAELVNKNLVGGGYMTTDQKKKLLWGSKKSTAVEESAHQWDTALF 406 >ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Populus trichocarpa] gi|550335404|gb|EEE91502.2| hypothetical protein POPTR_0006s03830g [Populus trichocarpa] Length = 473 Score = 135 bits (340), Expect = 3e-29 Identities = 124/439 (28%), Positives = 185/439 (42%), Gaps = 48/439 (10%) Frame = +1 Query: 7 GFMDPNLSLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFH 186 G P L + + + +FRKPSND ANRKYRR D+S SP Sbjct: 4 GIQSPQL----ENTETKATFRKPSNDMANRKYRRHSPMNGSSLSDGSP-KRDQSSSPVVQ 58 Query: 187 RDNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXX 366 RD+PAK S +RR+ +EL+ Sbjct: 59 RDDPAKAS---QRRKGEEKELDRDSGRSRYEKNGESYRHSDRYSSRSSHGYSRNDDYSRH 115 Query: 367 XXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDRHG 543 D G+R +Q +HS ++ E R RDY ++ +KY+RD+ D H Sbjct: 116 DRRVDDGDRHHQVV----------SHSGRESKDGERGRSRDYARNSEKYSRDRHDGSGHR 165 Query: 544 R----------RRLINSNLDEVKIGEER--------------HNSXXXXXXXXXXXXXSL 651 ++L + + ++G R H S Sbjct: 166 NMDKERELSEHQKLKDKDFSPDRVGSGRKYTSIVSEEKDRDWHRRDRDGRDEKRDYHRSS 225 Query: 652 GDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHE--- 822 GD+K+D S ++++RG+ DS+ GR+ L E+ K+ KEL+G K E++KH+ Sbjct: 226 GDHKSDRSSYYEDTRGYRNDSS-----GRDRLRESYKNDPKELNGLK----EKKKHDNWE 276 Query: 823 ---DNEFLVKKPKLCNADEGTGG--------KIISKFTCAAD---------ETPSSSKQV 942 D + K P N D+ G K F+ + D + SSS 
Sbjct: 277 TSRDKDRYSKAPGEKNDDKSAFGSEKPESPAKKPKLFSSSKDPDYSGDVNQKQSSSSMLA 336 Query: 943 QEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWG 1122 QE+ +KV Q+ AN + +A DL+ ELVN+NLVG G+MST+QKKK+LWG Sbjct: 337 QEVDNKV--NVGQAHANTSEAANDLDAAKVAAMKAAELVNKNLVGVGFMSTEQKKKLLWG 394 Query: 1123 NKKTSAAEESVHHWDMPLF 1179 +KK++A EE+ WD +F Sbjct: 395 SKKSAAPEETGRRWDTVMF 413 >gb|EXC34985.1| hypothetical protein L484_014712 [Morus notabilis] Length = 491 Score = 130 bits (326), Expect = 1e-27 Identities = 125/446 (28%), Positives = 178/446 (39%), Gaps = 57/446 (12%) Frame = +1 Query: 13 MDPNL-SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD NL S + D D +P+FRKP+ DA NRKYRR +RS SP+ Sbjct: 1 MDSNLQSPNQDNVDVKPAFRKPTTDATNRKYRRHSPVSGSQSDGSP--ERERSASPKLTG 58 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 ++P ++ + RR++ G+E++ Q Sbjct: 59 EDPRRVHESQSRRKDDGKEVDRDSYRSHYGRGSDSYRHSDRQFSRSSHRYSRHDDYSKHD 118 Query: 370 XXADGGERRYQXXXXXXXXXXXX-THSDCTRQESEYERRDYQQHVDKYNRDKPDC----- 531 AD ER ++ TH D ++ RD+ + KY+RD+ D Sbjct: 119 KHADDEERNHRRLSSRSGWESKGGTHIDHSKL------RDHLRDGGKYSRDRYDSYLYNS 172 Query: 532 -DR--------HGRRRLINSNLDEVKIGE----------ERHNSXXXXXXXXXXXXXSLG 654 DR H + +S+ D+ K G+ ER S G Sbjct: 173 KDRERETSSLEHHKYNDRDSSFDKAKSGKRHPHPEDVERERRGMEKDGQDDKRDFRRSSG 232 Query: 655 DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHED--- 825 DY+ D +E +GH D + RN E K+ +KE+DGQ ++K++D Sbjct: 233 DYRGDR----EEVKGHSIDFYS-----RNRAKECYKNEAKEIDGQCLTKEGKKKYDDVET 283 Query: 826 -------------------------NEFLVKKPKLCNADEGTGGKIISKFTCAADETPSS 930 EFL K+ K GK +SKF+ AD SS Sbjct: 284 NRSNDQYIREPAEQSGEKSVIGSENQEFLSKRQKFSLDKYTDAGKKVSKFSTVADVKESS 343 Query: 931 SKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGG---GYMSTDQ 1101 +Q + K+ + N + A DLN E VN+NLVGG G+M+ DQ Sbjct: 344 PQQPPD--HKLTA--GEDQVNVSNFANDLNAAKVAAMKAAESVNKNLVGGVGTGFMTADQ 399 Query: 1102 KKKILWGNKKTSAAEESVHHWDMPLF 1179 KKK+LWGNKKT+ AEES H WD LF Sbjct: 400 KKKLLWGNKKTTIAEESGHRWDSTLF 425 >ref|XP_006575187.1| PREDICTED: 
protein starmaker-like isoform X6 [Glycine max] Length = 438 Score = 130 bits (326), Expect = 1e-27 Identities = 119/429 (27%), Positives = 175/429 (40%), Gaps = 40/429 (9%) Frame = +1 Query: 13 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD N P +D + +FRKPS DAANR YRRR H S SP R Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP-RHGHSSSPNLVR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 +N A++S R+ ++ RE + Q Sbjct: 61 ENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY---- 114 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCD----- 534 E RY+ T D R ES+ ++YQ+ V+KY+ DK D Sbjct: 115 ----ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHRSK 168 Query: 535 --------RHGRRRLINSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLGDY 660 H + + ++S+ D+ ++ E H+ S GDY Sbjct: 169 EKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSGDY 228 Query: 661 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFLV 840 ++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E Sbjct: 229 RSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTETGK 288 Query: 841 KKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVIPE 972 K + C D+ + GK + F ADE+ +SS ++ K Sbjct: 289 GKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKADVR 347 Query: 973 PAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAAEES 1152 A++S + + DL+ ELVNRNLVG G ++TDQKKK+LWG K+++ EES Sbjct: 348 AAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTEES 405 Query: 1153 VHHWDMPLF 1179 H WD +F Sbjct: 406 GHRWDTAMF 414 >ref|XP_006575186.1| PREDICTED: protein starmaker-like isoform X5 [Glycine max] Length = 440 Score = 130 bits (326), Expect = 1e-27 Identities = 119/429 (27%), Positives = 175/429 (40%), Gaps = 40/429 (9%) Frame = +1 Query: 13 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD N P +D + +FRKPS DAANR YRRR H S SP R Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP-RHGHSSSPNLVR 60 Query: 190 
DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 +N A++S R+ ++ RE + Q Sbjct: 61 ENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY---- 114 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCD----- 534 E RY+ T D R ES+ ++YQ+ V+KY+ DK D Sbjct: 115 ----ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHRSK 168 Query: 535 --------RHGRRRLINSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLGDY 660 H + + ++S+ D+ ++ E H+ S GDY Sbjct: 169 EKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSGDY 228 Query: 661 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFLV 840 ++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E Sbjct: 229 RSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTETGK 288 Query: 841 KKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVIPE 972 K + C D+ + GK + F ADE+ +SS ++ K Sbjct: 289 GKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKADVR 347 Query: 973 PAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAAEES 1152 A++S + + DL+ ELVNRNLVG G ++TDQKKK+LWG K+++ EES Sbjct: 348 AAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTEES 405 Query: 1153 VHHWDMPLF 1179 H WD +F Sbjct: 406 GHRWDTAMF 414 >ref|XP_006575183.1| PREDICTED: protein starmaker-like isoform X2 [Glycine max] gi|571440534|ref|XP_006575184.1| PREDICTED: protein starmaker-like isoform X3 [Glycine max] gi|571440536|ref|XP_006575185.1| PREDICTED: protein starmaker-like isoform X4 [Glycine max] Length = 480 Score = 130 bits (326), Expect = 1e-27 Identities = 119/429 (27%), Positives = 175/429 (40%), Gaps = 40/429 (9%) Frame = +1 Query: 13 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD N P +D + +FRKPS DAANR YRRR H S SP R Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP-RHGHSSSPNLVR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 +N A++S R+ ++ RE + Q Sbjct: 61 ENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY---- 114 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCD----- 
534 E RY+ T D R ES+ ++YQ+ V+KY+ DK D Sbjct: 115 ----ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHRSK 168 Query: 535 --------RHGRRRLINSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLGDY 660 H + + ++S+ D+ ++ E H+ S GDY Sbjct: 169 EKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSGDY 228 Query: 661 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFLV 840 ++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E Sbjct: 229 RSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTETGK 288 Query: 841 KKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVIPE 972 K + C D+ + GK + F ADE+ +SS ++ K Sbjct: 289 GKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKADVR 347 Query: 973 PAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAAEES 1152 A++S + + DL+ ELVNRNLVG G ++TDQKKK+LWG K+++ EES Sbjct: 348 AAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTEES 405 Query: 1153 VHHWDMPLF 1179 H WD +F Sbjct: 406 GHRWDTAMF 414 >ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1 [Glycine max] Length = 479 Score = 129 bits (325), Expect = 2e-27 Identities = 120/429 (27%), Positives = 173/429 (40%), Gaps = 40/429 (9%) Frame = +1 Query: 13 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHR 189 MD N P +D + +FRKPS DAANR YRRR H S SP R Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP-RHGHSSSPNLVR 60 Query: 190 DNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXX 369 +N A++S R+ ++ RE + Q Sbjct: 61 ENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY---- 114 Query: 370 XXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCD----- 534 E RY+ T D R ES+ ++YQ+ V+KY+ DK D Sbjct: 115 ----ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHRSK 168 Query: 535 --------RHGRRRLINSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLGDY 660 H + + ++S+ D+ ++ E H+ S GDY Sbjct: 169 EKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSGDY 228 Query: 661 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFLV 840 ++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E 
Sbjct: 229 RSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTETGK 288 Query: 841 KKP-------KLCNA-DEGTGGKIISKFTCAADET--------PSSSKQVQEIVDKVIPE 972 K + C D+ + GK + F D+ SSSK E K Sbjct: 289 GKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDDESKTSSSKLSHE--SKADVR 346 Query: 973 PAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAAEES 1152 A++S + + DL+ ELVNRNLVG G ++TDQKKK+LWG K+++ EES Sbjct: 347 AAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTEES 404 Query: 1153 VHHWDMPLF 1179 H WD +F Sbjct: 405 GHRWDTAMF 413 >gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus guttatus] Length = 406 Score = 127 bits (318), Expect = 1e-26 Identities = 112/382 (29%), Positives = 158/382 (41%), Gaps = 5/382 (1%) Frame = +1 Query: 49 DARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPRRR 228 D++ FRKPSNDAA+RKYRRR + DRS SP + + +++DD R+ Sbjct: 10 DSKAEFRKPSNDAASRKYRRRSPAGGSSSSSDGSLHRDRSSSPLPRKKDSIRVADDNRKT 69 Query: 229 ENG----GRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXXXXXXXADGGERR 396 E+G GR E + D +R Sbjct: 70 EDGRNLSGRSGESYKYTDRHSSKNYPRHDEHSR----------------RDRHVDDYDRG 113 Query: 397 YQXXXXXXXXXXXXTHS-DCTRQESEYERRDYQQHVDKYNRDKPDCDRHGRRRLINSNLD 573 Y + D +R + E+ RDY + +D ++ K D L+N + D Sbjct: 114 YSKSSYRSNRDQRDNGNFDHSRSDKEHRSRDYIKDIDTHSHAKSD-------GLVNRSRD 166 Query: 574 EVKIGEERHNSXXXXXXXXXXXXXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTE 753 + K ER S SLGD DS++ ++ + L E Sbjct: 167 KEKY--ERAGSGRGDQYVKTDRRKSLGDQS---------------DSSSRKDTSGHRLKE 209 Query: 754 TRKSSSKELDGQKRNVMERRKHEDNEFLVKKPKLCNADEGTGGKIISKFTCAADETPSSS 933 T KEL+ +K E+RK DN + K+ A E + K I KFT + P S Sbjct: 210 TSWREGKELNAEKYVNDEKRKF-DNRSIYKEEGNGEAKEHSDDKSI-KFTETVTKKPKFS 267 Query: 934 KQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKI 1113 +D P +S V+ D++ ELVN+NLVG GYMSTDQKKK+ Sbjct: 268 S-----LDSKAPVTDGTSEQPYVTDSDIDAAKIAAMKAAELVNKNLVGTGYMSTDQKKKL 322 Query: 1114 LWGNKKTSAAEESVHHWDMPLF 1179 LWG+KK++A EES H WD F Sbjct: 323 LWGSKKSTATEESAHRWDTITF 344 >ref|XP_006589006.1| PREDICTED: arginine/serine-rich 
coiled-coil protein 2-like isoform X6 [Glycine max] Length = 447 Score = 118 bits (296), Expect = 4e-24 Identities = 120/432 (27%), Positives = 168/432 (38%), Gaps = 40/432 (9%) Frame = +1 Query: 4 LGFMDPNLS-LSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXNHDRSPSPE 180 L MD NL L P +D + SFRKPS DAANR Y+ R H SP+P Sbjct: 19 LSMMDSNLPFLPPSNSDTKNSFRKPSGDAANRNYQHRSPVDRSPSPDASRHGHSSSPNPV 78 Query: 181 FHRDNPAKISDDPRRRENGGRELEMXXXXXXXXXXXXXXXXXXXQPXXXXXXXXXXXXXX 360 R+N A++S R+ ++ RE + Q Sbjct: 79 --RENSARVSHHSRKYDD--REHDQQYGRNHYGRSSDSLRHSDRQSFKSSFGHSRYDKY- 133 Query: 361 XXXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDRH 540 E RY+ + D R+ES+ ++YQ VDKY+ DK D H Sbjct: 134 -------ANEDRYRERLLSRSGHE--SRDDHVREESDSRPKNYQCSVDKYSHDKYDRSDH 184 Query: 541 G---RRRLINSNLDEVK--------------------IGEERHNSXXXXXXXXXXXXXSL 651 +RR S + K + E H+ S Sbjct: 185 RSKEKRRDTYSEHQKYKDMDSSYEKSASSKRHALYDEVEREGHSRDWDGQNERRDSRRSS 244 Query: 652 GDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNE 831 GDY++D +S R++G+ L E KS KE + Q E+RKH+D E Sbjct: 245 GDYRSDQRD----------ESGPQRDSGKFSLKEAYKSEQKESNDQNLPWEEKRKHDDTE 294 Query: 832 FLVKKP--------KLCNADEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKV 963 K + D+ + GK + F ADE+ +SS + K Sbjct: 295 IRKGKDWKTRKAGEQCAIEDKESSGKKLKLFDPDKDDNYRKDADESKTSSSNLSH-KSKE 353 Query: 964 IPEPAQSSANEAVSACDLNXXXXXXXXXXELVNRNLVGGGYMSTDQKKKILWGNKKTSAA 1143 +SS + + DL+ ELVNRNLVG G ++TDQKKK+LWG KK++ Sbjct: 354 DLWAVKSSGFDGDN--DLDAAKIAAMRAAELVNRNLVGPGCLTTDQKKKLLWGGKKSTPT 411 Query: 1144 EESVHHWDMPLF 1179 EES H WD +F Sbjct: 412 EESGHRWDTGMF 423