BLASTX nr result
ID: Akebia23_contig00013901
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00013901
         (1563 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257...  237  1e-59
ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prun...  198  5e-48
ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283...  194  7e-47
ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283...  194  7e-47
ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citr...  193  2e-46
ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citr...  193  2e-46
ref|XP_006838218.1| hypothetical protein AMTR_s00106p00155270 [A...  190  1e-45
ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma...  187  9e-45
ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma...  185  4e-44
ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [...  185  4e-44
ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma...  185  4e-44
ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218...  171  1e-39
ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Popu...  166  3e-38
ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6...  158  7e-36
gb|EXC34985.1| hypothetical protein L484_014712 [Morus notabilis]    157  2e-35
ref|XP_006575186.1| PREDICTED: protein starmaker-like isoform X5...  157  2e-35
ref|XP_006575183.1| PREDICTED: protein starmaker-like isoform X2...  156  2e-35
ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1...  156  3e-35
gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus...
147 2e-32 ref|XP_006589006.1| PREDICTED: arginine/serine-rich coiled-coil ... 144 1e-31 >ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257160 [Vitis vinifera] gi|297739954|emb|CBI30136.3| unnamed protein product [Vitis vinifera] Length = 510 Score = 237 bits (604), Expect = 1e-59 Identities = 171/468 (36%), Positives = 225/468 (48%), Gaps = 64/468 (13%) Frame = +1 Query: 58 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD +L P D ADA+ +FRKP+NDA NRKYRRR H+ + SP F Sbjct: 1 MDSSLKSPPRDKADAKTAFRKPTNDATNRKYRRRSPTSGSSSSGGSPI---HEHNSSPIF 57 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 +++ K+SD +RR+ GREL+ + +RQ Sbjct: 58 SKEDSEKVSDRRQRRKGDGRELDRDAGRSQYRKTADSYRHSDRQSSRSSRGHYRYDDHVR 117 Query: 415 XXXXA-DGGER-RYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCD 585 A D G+R + +SD RQESE+ R RDY + DKY+RDK D Sbjct: 118 QEKHAADEGDRDHHNLSSRSGRESRVGNYSDHVRQESEHSRTRDYFRGTDKYSRDKHDNA 177 Query: 586 GH----------------------------GRRRLINSNLDEVKIGEE-RHNSXXXXXXX 678 G+ RR NSN ++ K GE+ +H Sbjct: 178 GYRSKDKEKETSSLEHQKYKDKDLSSDRAGSGRRHTNSNFEDSKAGEQDKHLRDGDGPDE 237 Query: 679 XXXXXXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVME 858 LGDYK+D S +ESRGH DST+ R++G E K+ KE+DGQK+ E Sbjct: 238 RKDYRRGLGDYKSDRSISHEESRGHRNDSTSGRDSGGYRSKEVHKNEPKEVDGQKQPKDE 297 Query: 859 RRKH----------------------------EDNEFLVKKPKLCNADEGTG-GKIISKF 951 ++K+ E+ E KKPKL + ++ T GK +S+F Sbjct: 298 KKKYDEWKTDRHKDRYNRESREQFEDKTVVASENQESAAKKPKLVSLEKSTDYGKDVSRF 357 Query: 952 -TCAAD-ETPSSSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLV 1125 T AD + SSSK Q+I DKV PE A + +E + DLN ELVN+NLV Sbjct: 358 STAVADMKQSSSSKLAQDIADKVTPEHAFLNNSEVAN--DLNAAKIAAMKAAELVNRNLV 415 Query: 1126 GGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 G GYMS DQKKK+LWG+KK++ AEES HHWD LFSD+ERQEKFNKLM Sbjct: 416 GVGYMSADQKKKLLWGSKKSTTAEESGHHWDTALFSDRERQEKFNKLM 463 >ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prunus persica] gi|462397492|gb|EMJ03160.1| hypothetical protein 
PRUPE_ppa004686mg [Prunus persica] Length = 496 Score = 198 bits (504), Expect = 5e-48 Identities = 146/453 (32%), Positives = 209/453 (46%), Gaps = 61/453 (13%) Frame = +1 Query: 94 DARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPR 273 DA+ +FRKP+ DAANRKYRRR H+ + SP+ R++P K+S+ Sbjct: 13 DAKTAFRKPATDAANRKYRRRSPVGGSSPSDGSPM---HEHNCSPKNSREDPGKVSEYQT 69 Query: 274 RRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXADGGERRYQ 453 RR + GRELE + +RQ AD ++ YQ Sbjct: 70 RRRDDGRELERDSNRRYYGRSSDSYRHSDRQSSRSLHGYYKHDDCIKHDKHADEEDKNYQ 129 Query: 454 XXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGH-------------- 591 S + + R+Y +++DKY+RDK D G+ Sbjct: 130 KLSSRSGR-----ESRGSAYYDHIKSREYSRNLDKYSRDKYDGSGYRNKDKDRESSFPEN 184 Query: 592 ---------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXXXXSLGDYKNDHI 726 GRR + + +E++ +RH + GDY ++ I Sbjct: 185 QKYKDKDSSSQRVGSGRR---HGHFEEMERERDRHALDRDVQDEKKDYRRNSGDYISERI 241 Query: 727 SSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF------- 885 S++ES+G DS + R+ G++ + E KS KELD + +R+K++D E Sbjct: 242 FSYEESKGQRSDSISRRDEGKHRMKEGYKSELKELDDDNVSKEQRKKYDDKETSWGNRIT 301 Query: 886 --------------------LVKKPKLCNADEGTGG-KIISKFTCAAD-ETPSSSKQVQE 999 K+PKL ++++G G K +SKFT AD SSSKQVQE Sbjct: 302 RETSERSADKHYIKSENQESTAKRPKLFSSEKGIDGRKDVSKFTTTADGRESSSSKQVQE 361 Query: 1000 IVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGG---YMSTDQKKKILW 1170 D++ E Q AN+A +A D+N ELVN+NL+G G M+ DQKKK+LW Sbjct: 362 --DEMTTEKTQ--ANDAEAANDINAAKVAALKAAELVNRNLIGAGPVGCMTADQKKKLLW 417 Query: 1171 GNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 GNKK++ AEE H WD LFSD+ERQEKFNKLM Sbjct: 418 GNKKSTTAEEVGHRWDSTLFSDRERQEKFNKLM 450 >ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X2 [Citrus sinensis] Length = 482 Score = 194 bits (494), Expect = 7e-47 Identities = 147/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%) Frame = +1 Query: 73 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+ Sbjct: 4 
SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSPKC---DHNASPIYSRDDPS 60 Query: 253 KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432 K+ + +RR++ REL+ + +RQ + Sbjct: 61 KVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120 Query: 433 GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600 +R YQ S R+ ++ R +DY ++ + DK D GHG + Sbjct: 121 DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSHDKYDVIGHGSKDKE 169 Query: 601 -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711 R N + D ++ + H S GD+ Sbjct: 170 KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229 Query: 712 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888 +ND ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 230 RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYR 289 Query: 889 ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005 KK + N D+G AA SSS Q Q+I Sbjct: 290 DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344 Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185 D AQS AN+AV A DL+ ELVNKNLVGG YMSTDQKKK+LWGNKK+ Sbjct: 345 DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401 Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269 + EES WD L DQ+RQEKFNKLM Sbjct: 402 TPVEESARRWDTALIGDQDRQEKFNKLM 429 >ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X1 [Citrus sinensis] Length = 538 Score = 194 bits (494), Expect = 7e-47 Identities = 147/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%) Frame = +1 Query: 73 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+ Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSPKC---DHNASPIYSRDDPS 60 Query: 253 KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432 K+ + +RR++ REL+ + +RQ + Sbjct: 61 KVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120 Query: 433 GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600 +R YQ S R+ ++ R +DY ++ + 
DK D GHG + Sbjct: 121 DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSHDKYDVIGHGSKDKE 169 Query: 601 -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711 R N + D ++ + H S GD+ Sbjct: 170 KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229 Query: 712 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888 +ND ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 230 RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYR 289 Query: 889 ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005 KK + N D+G AA SSS Q Q+I Sbjct: 290 DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344 Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185 D AQS AN+AV A DL+ ELVNKNLVGG YMSTDQKKK+LWGNKK+ Sbjct: 345 DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401 Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269 + EES WD L DQ+RQEKFNKLM Sbjct: 402 TPVEESARRWDTALIGDQDRQEKFNKLM 429 >ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|557532607|gb|ESR43790.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] Length = 538 Score = 193 bits (490), Expect = 2e-46 Identities = 146/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%) Frame = +1 Query: 73 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+ Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP---KRDHNASPIYSRDDPS 60 Query: 253 KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432 + + +RR++ REL+ + +RQ + Sbjct: 61 NVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120 Query: 433 GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600 +R YQ S R+ ++ R +DY ++ +RDK D GHG + Sbjct: 121 DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSRDKYDVIGHGSKDKE 169 Query: 601 -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711 R N + D ++ + H S GD+ Sbjct: 170 KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229 Query: 712 
KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888 +ND ++DESRGH S++ R+ G L E +S KELDGQK E++KH D+E Sbjct: 230 RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNR 289 Query: 889 ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005 KK + N D+G AA SSS Q Q+I Sbjct: 290 DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344 Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185 D AQS AN+AV A DL+ ELVNKNLVGG YMSTDQKKK+LWGNKK+ Sbjct: 345 DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401 Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269 + EES WD L D++RQEKFNKLM Sbjct: 402 TPVEESARRWDTALIGDRDRQEKFNKLM 429 >ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|567875919|ref|XP_006430549.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|557532605|gb|ESR43788.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] gi|557532606|gb|ESR43789.1| hypothetical protein CICLE_v10011438mg [Citrus clementina] Length = 482 Score = 193 bits (490), Expect = 2e-46 Identities = 146/448 (32%), Positives = 195/448 (43%), Gaps = 49/448 (10%) Frame = +1 Query: 73 SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPA 252 S PD D + SFRKPSNDAANR+YRRR D + SP + RD+P+ Sbjct: 4 SSPPDTPDTKASFRKPSNDAANRRYRRRSPANGSSSSDGSP---KRDHNASPIYSRDDPS 60 Query: 253 KISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXAD 432 + + +RR++ REL+ + +RQ + Sbjct: 61 NVPEHQQRRKDDERELDRDSGRSHHGRGSDSYRHSDRQSSRSSHNYSKHDDYVRHDKHEN 120 Query: 433 GGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRR--- 600 +R YQ S R+ ++ R +DY ++ +RDK D GHG + Sbjct: 121 DEDRNYQRLS-----------SRSGRESRDHSRSKDYLSSEERSSRDKYDVIGHGSKDKE 169 Query: 601 -------RLINSNLDEV----------------KIGEERHNSXXXXXXXXXXXXXSLGDY 711 R N + D ++ + H S GD+ Sbjct: 170 KESSYLERQKNKDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDH 229 Query: 712 KNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEFL- 888 +ND ++DESRGH S++ R+ G L E +S 
KELDGQK E++KH D+E Sbjct: 230 RNDRTVTYDESRGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNR 289 Query: 889 ---------------------VKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIV 1005 KK + N D+G AA SSS Q Q+I Sbjct: 290 DRDRYHRADKPDFASGKQENPTKKQRFSNWDKGA-----DNVKDAAGTMSSSSMQSQDIG 344 Query: 1006 DKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKT 1185 D AQS AN+AV A DL+ ELVNKNLVGG YMSTDQKKK+LWGNKK+ Sbjct: 345 DT--DALAQSHANDAV-ANDLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKLLWGNKKS 401 Query: 1186 SAAEESVHHWDMPLFSDQERQEKFNKLM 1269 + EES WD L D++RQEKFNKLM Sbjct: 402 TPVEESARRWDTALIGDRDRQEKFNKLM 429 >ref|XP_006838218.1| hypothetical protein AMTR_s00106p00155270 [Amborella trichopoda] gi|548840676|gb|ERN00787.1| hypothetical protein AMTR_s00106p00155270 [Amborella trichopoda] Length = 532 Score = 190 bits (483), Expect = 1e-45 Identities = 153/488 (31%), Positives = 206/488 (42%), Gaps = 84/488 (17%) Frame = +1 Query: 58 MDPNL-SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD L S SPD + +PSFRKPSNDA RKYR+R H S SP Sbjct: 1 MDSGLVSYSPDPVEPKPSFRKPSNDAFQRKYRKRSPTSGSASPLSSGSP-QHSHSYSPNI 59 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 + K+++D R R + RE+E + + Sbjct: 60 SMEEAGKVTNDQRTRMDEEREVERDSSHHRSGKGSDSYG--KGSDVYGDNDRHSRGITQG 117 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQ-----ESEYERRDYQQHVDKYNRDKPD 579 D + + Q S TR +EYE+RD + NR PD Sbjct: 118 YRRHDDSSKHQSQHRREVEERSSQRYSSRITRDLEGSSHAEYEKRDRDSDNFRDNRRNPD 177 Query: 580 -------CDGHGRRRL---------------INSNLDEVKIGE-ERHNSXXXXXXXXXXX 690 D GRR+ N+N++ K+GE ER+ Sbjct: 178 KPPRDRKIDDEGRRKERDSATQGRYRDIDKPANTNMEREKMGERERYRDRGEGRDDYRDY 237 Query: 691 XXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKH 870 SLGD + D +SS++ SRG+ +DS + R++G E +SS++E + + ++RR+ Sbjct: 238 RKSLGDTRRDRVSSYEGSRGYARDSASGRDSGSRHSREIHRSSNRESERHIEDKVQRRRG 297 Query: 871 EDNE-------------------------------------------------FLVKKPK 903 +D + KK K Sbjct: 298 DDESDRYKNKDSYNRESDDHSRGYSRSSSDYRDRSFRNGRSEDKNVHAVDDEASVGKKCK 357 Query: 904 
LCNADEGTGGKI-----ISKFTCAADETPSSS-KQVQEIVDKVIPEPAQSSANEAVSACD 1065 L +AD+ +G TC AD+ S S KQ+QE V K EP QSSANEA A D Sbjct: 358 LFDADKSSGDATDRHLPSKSSTCVADDKSSLSLKQLQEPVPKETLEPVQSSANEAKIAQD 417 Query: 1066 LNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLFSDQER 1245 LN +VN+NLVGG Y+STD+KKK+LWGNKKTSAAEES WD +FSD+ER Sbjct: 418 LNAAKVAAMKAAGIVNRNLVGGSYLSTDEKKKLLWGNKKTSAAEESGTRWDTAMFSDRER 477 Query: 1246 QEKFNKLM 1269 QEKFNKLM Sbjct: 478 QEKFNKLM 485 >ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508716957|gb|EOY08854.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 462 Score = 187 bits (476), Expect = 9e-45 Identities = 150/463 (32%), Positives = 209/463 (45%), Gaps = 55/463 (11%) Frame = +1 Query: 58 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 RD+ AK +D R+ GREL+ + +RQ Sbjct: 59 SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D GH Sbjct: 117 HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176 Query: 592 ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690 G R S+ E ++ +R Sbjct: 177 RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235 Query: 691 XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER Sbjct: 236 HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294 Query: 865 KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978 KH++ E ++K + ++ + K + F+ + ADE S Sbjct: 295 KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354 Query: 979 SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152 S +Q +E +V Q+ N+ D+N ELVN+NL+G G+ M+T+Q Sbjct: 355 
SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412 Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLMVSFL 1281 KKK+LWG+KK++ AEES H WD LF D+ERQEKFNKLMV+ + Sbjct: 413 KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLMVALV 455 >ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508716956|gb|EOY08853.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 464 Score = 185 bits (470), Expect = 4e-44 Identities = 149/459 (32%), Positives = 206/459 (44%), Gaps = 55/459 (11%) Frame = +1 Query: 58 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 RD+ AK +D R+ GREL+ + +RQ Sbjct: 59 SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D GH Sbjct: 117 HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176 Query: 592 ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690 G R S+ E ++ +R Sbjct: 177 RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235 Query: 691 XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER Sbjct: 236 HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294 Query: 865 KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978 KH++ E ++K + ++ + K + F+ + ADE S Sbjct: 295 KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354 Query: 979 SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152 S +Q +E +V Q+ N+ D+N ELVN+NL+G G+ M+T+Q Sbjct: 355 SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412 Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 KKK+LWG+KK++ AEES H WD LF D+ERQEKFNKLM Sbjct: 413 KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLM 451 >ref|XP_007028350.1| 
Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508716955|gb|EOY08852.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 473 Score = 185 bits (470), Expect = 4e-44 Identities = 149/459 (32%), Positives = 206/459 (44%), Gaps = 55/459 (11%) Frame = +1 Query: 58 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 RD+ AK +D R+ GREL+ + +RQ Sbjct: 59 SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D GH Sbjct: 117 HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176 Query: 592 ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690 G R S+ E ++ +R Sbjct: 177 RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235 Query: 691 XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER Sbjct: 236 HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294 Query: 865 KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978 KH++ E ++K + ++ + K + F+ + ADE S Sbjct: 295 KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354 Query: 979 SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152 S +Q +E +V Q+ N+ D+N ELVN+NL+G G+ M+T+Q Sbjct: 355 SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412 Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 KKK+LWG+KK++ AEES H WD LF D+ERQEKFNKLM Sbjct: 413 KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLM 451 >ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590634353|ref|XP_007028353.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508716954|gb|EOY08851.1| Uncharacterized protein isoform 1 [Theobroma cacao] 
gi|508716958|gb|EOY08855.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 504 Score = 185 bits (470), Expect = 4e-44 Identities = 149/459 (32%), Positives = 206/459 (44%), Gaps = 55/459 (11%) Frame = +1 Query: 58 MDPNLSLSP-DIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD NL SP D +DA+ +FRK SNDA+NR+YRR DRS SP Sbjct: 1 MDSNLQTSPPDGSDAKAAFRKFSNDASNRQYRRHSPISRSSSSEGNSP--QRDRSVSPIL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 RD+ AK +D R+ GREL+ + +RQ Sbjct: 59 SRDDLAKGADTQPGRD--GRELDRDSSRNKYSRNSDSYRYSDRQSSRSSHGYSRHDNYVR 116 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGH 591 AD G + + THSD RQES+ R +DY ++ DKY+RD+ D GH Sbjct: 117 HDKFADEGSKYDRLSSRSGRESRFSTHSDHPRQESDISRSKDYSRNADKYSRDRYDGSGH 176 Query: 592 ---------------------------GRRRLINSNLDEVKIGEERHNSXXXXXXXXXXX 690 G R S+ E ++ +R Sbjct: 177 RIRDKEKESQSLEHQKYKDKDSALDRAGSGRRQGSSFSE-EMDRDRRRRGRDSRGEKGDY 235 Query: 691 XXSLGDYKNDHISSFDESRGHGKDSTAARE--NGRNGLTETRKSSSKELDGQKRNVMERR 864 S GD K D+ S++ESRGH DS++ RE N + E KS KE+DGQK ER Sbjct: 236 HRSSGDRKGDYTESYEESRGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKP-AKERM 294 Query: 865 KHEDNEFLVKKPK--------------LCNADEGTGGKIISKFTCA--------ADETPS 978 KH++ E ++K + ++ + K + F+ + ADE S Sbjct: 295 KHDEWETNMEKDRYGGVLKEQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRS 354 Query: 979 SSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGY--MSTDQ 1152 S +Q +E +V Q+ N+ D+N ELVN+NL+G G+ M+T+Q Sbjct: 355 SLEQAEETDGRVTM--GQAHGNDVDITNDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQ 412 Query: 1153 KKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 KKK+LWG+KK++ AEES H WD LF D+ERQEKFNKLM Sbjct: 413 KKKLLWGSKKSTPAEESGHRWDTALFGDRERQEKFNKLM 451 >ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218861 [Cucumis sativus] Length = 472 Score = 171 bits (432), Expect = 1e-39 Identities = 137/430 (31%), Positives = 192/430 (44%), Gaps = 40/430 (9%) Frame = +1 Query: 100 RPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPRRR 279 + FRKPS++ A RKYRRR DRS SP+ 
RD+ +K S+ RR Sbjct: 7 KAEFRKPSSETAGRKYRRRSSVSGSSSDESP----KRDRSSSPKLLRDDASKHSERKPRR 62 Query: 280 ENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXADGGERRYQXX 459 + R+L + +R+ AD ER Y+ Sbjct: 63 KEDERDLNKDSRNHHSRSSDSYRYS-DRKSSRSLHGYSRHDDYVRHDKYADE-ERDYERL 120 Query: 460 XXXXXXXXXXT-HSDCTRQESEYER-RDYQQHVDKYNRDKPDCDGHGRRRLINSNLDEVK 633 + H D TR+ESE+ R R+Y + V+K +RDK D GH R R +S + Sbjct: 121 SSRSNRESKGSAHYDHTRRESEHSRSREYFRDVEKGSRDKYDASGH-RSRDGDSLSERHG 179 Query: 634 IGEERHNSXXXXXXXXXXXXXS-----------LGDYKNDHISSFDESRGHGKDSTAARE 780 G RH S GDYKN+ + S D+ RG+ DS R+ Sbjct: 180 SGSRRHASFEEMEKHRNARDRDGQDEKRDNIKHSGDYKNERVLSHDDGRGNRYDSLLGRD 239 Query: 781 NGRNGLTETRKSSSKELDGQKRNVMERRKHEDNE-------------------------- 882 ++ + K+ K+LD +K + E RKH+ E Sbjct: 240 ESKHRTKDINKNDRKDLDDEKSS-KEERKHDARETHWDKVQGKESKGKYDGKGVFVDENQ 298 Query: 883 -FLVKKPKLCNADEGTGGKIISKFTCAADETPSSSKQVQEIVDKVIPEPAQSSANEAVSA 1059 KKPKL ++ GK ++ A + S+SK+ Q+ + Q + ++ A Sbjct: 299 GLPAKKPKLFSS-----GKEVNHEEDADENQSSTSKKEQDGKMSL----GQGQSGDSDFA 349 Query: 1060 CDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAEESVHHWDMPLFSDQ 1239 D + ELVNKNLVGGGYM+TDQKKK+LWG+KK++A EES H WD LF+D+ Sbjct: 350 ADFSAAKVAAMKAAELVNKNLVGGGYMTTDQKKKLLWGSKKSTAVEESAHQWDTALFNDR 409 Query: 1240 ERQEKFNKLM 1269 ERQEKFNKLM Sbjct: 410 ERQEKFNKLM 419 >ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Populus trichocarpa] gi|550335404|gb|EEE91502.2| hypothetical protein POPTR_0006s03830g [Populus trichocarpa] Length = 473 Score = 166 bits (419), Expect = 3e-38 Identities = 138/454 (30%), Positives = 201/454 (44%), Gaps = 48/454 (10%) Frame = +1 Query: 52 GFMDPNLSLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPE 231 G P L + + + +FRKPSND ANRKYRR D+S SP Sbjct: 4 GIQSPQL----ENTETKATFRKPSNDMANRKYRRHSPMNGSSLSDGSP---KRDQSSSPV 56 Query: 232 FHRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXX 411 RD+PAK S +RR+ +EL+ + +R Sbjct: 57 VQRDDPAKAS---QRRKGEEKELDRDSGRSRYEKNGESYRHSDRYSSRSSHGYSRNDDYS 113 Query: 412 
XXXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYER-RDYQQHVDKYNRDKPDCDG 588 D G+R +Q +HS ++ E R RDY ++ +KY+RD+ D G Sbjct: 114 RHDRRVDDGDRHHQVV----------SHSGRESKDGERGRSRDYARNSEKYSRDRHDGSG 163 Query: 589 HGR----------RRLINSNLDEVKIGEER--------------HNSXXXXXXXXXXXXX 696 H ++L + + ++G R H Sbjct: 164 HRNMDKERELSEHQKLKDKDFSPDRVGSGRKYTSIVSEEKDRDWHRRDRDGRDEKRDYHR 223 Query: 697 SLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHE- 873 S GD+K+D S ++++RG+ DS+ GR+ L E+ K+ KEL+G K E++KH+ Sbjct: 224 SSGDHKSDRSSYYEDTRGYRNDSS-----GRDRLRESYKNDPKELNGLK----EKKKHDN 274 Query: 874 -----DNEFLVKKPKLCNADEGTGG--------KIISKFTCAAD---------ETPSSSK 987 D + K P N D+ G K F+ + D + SSS Sbjct: 275 WETSRDKDRYSKAPGEKNDDKSAFGSEKPESPAKKPKLFSSSKDPDYSGDVNQKQSSSSM 334 Query: 988 QVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKIL 1167 QE+ +KV Q+ AN + +A DL+ ELVNKNLVG G+MST+QKKK+L Sbjct: 335 LAQEVDNKV--NVGQAHANTSEAANDLDAAKVAAMKAAELVNKNLVGVGFMSTEQKKKLL 392 Query: 1168 WGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 WG+KK++A EE+ WD +F D+ERQEKFNKLM Sbjct: 393 WGSKKSAAPEETGRRWDTVMFGDRERQEKFNKLM 426 >ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6 [Glycine max] Length = 438 Score = 158 bits (399), Expect = 7e-36 Identities = 132/445 (29%), Positives = 191/445 (42%), Gaps = 40/445 (8%) Frame = +1 Query: 58 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD N P +D + +FRKPS DAANR YRRR H S SP Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 R+N A++S R+ ++ RE + + +RQ Sbjct: 59 VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594 E RY+ T D R ES+ ++YQ+ V+KY+ DK D H Sbjct: 115 ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166 Query: 595 RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705 + ++S+ D+ ++ E H+ S G Sbjct: 167 
SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226 Query: 706 DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885 DY++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E Sbjct: 227 DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286 Query: 886 LVKKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVI 1017 K + C D+ + GK + F ADE+ +SS ++ K Sbjct: 287 GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKAD 345 Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197 A++S + + DL+ ELVN+NLVG G ++TDQKKK+LWG K+++ E Sbjct: 346 VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 403 Query: 1198 ESVHHWDMPLFSDQERQEKFNKLMV 1272 ES H WD +FSD+ERQEKFNKLMV Sbjct: 404 ESGHRWDTAMFSDRERQEKFNKLMV 428 >gb|EXC34985.1| hypothetical protein L484_014712 [Morus notabilis] Length = 491 Score = 157 bits (396), Expect = 2e-35 Identities = 137/461 (29%), Positives = 192/461 (41%), Gaps = 57/461 (12%) Frame = +1 Query: 58 MDPNL-SLSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD NL S + D D +P+FRKP+ DA NRKYRR +RS SP+ Sbjct: 1 MDSNLQSPNQDNVDVKPAFRKPTTDATNRKYRRHSPVSGSQSDGSP----ERERSASPKL 56 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 ++P ++ + RR++ G+E++ + +RQ Sbjct: 57 TGEDPRRVHESQSRRKDDGKEVDRDSYRSHYGRGSDSYRHSDRQFSRSSHRYSRHDDYSK 116 Query: 415 XXXXADGGERRYQXXXXXXXXXXXX-THSDCTRQESEYERRDYQQHVDKYNRDKPDCD-- 585 AD ER ++ TH D ++ RD+ + KY+RD+ D Sbjct: 117 HDKHADDEERNHRRLSSRSGWESKGGTHIDHSKL------RDHLRDGGKYSRDRYDSYLY 170 Query: 586 ------------GHGRRRLINSNLDEVKIGE----------ERHNSXXXXXXXXXXXXXS 699 H + +S+ D+ K G+ ER S Sbjct: 171 NSKDRERETSSLEHHKYNDRDSSFDKAKSGKRHPHPEDVERERRGMEKDGQDDKRDFRRS 230 Query: 700 LGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHED- 876 GDY+ D +E +GH D + RN E K+ +KE+DGQ ++K++D Sbjct: 231 SGDYRGDR----EEVKGHSIDFYS-----RNRAKECYKNEAKEIDGQCLTKEGKKKYDDV 281 Query: 877 ---------------------------NEFLVKKPKLCNADEGTGGKIISKFTCAADETP 975 EFL K+ K GK +SKF+ AD Sbjct: 282 
ETNRSNDQYIREPAEQSGEKSVIGSENQEFLSKRQKFSLDKYTDAGKKVSKFSTVADVKE 341 Query: 976 SSSKQVQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGG---GYMST 1146 SS +Q + K+ + N + A DLN E VNKNLVGG G+M+ Sbjct: 342 SSPQQPPD--HKLTA--GEDQVNVSNFANDLNAAKVAAMKAAESVNKNLVGGVGTGFMTA 397 Query: 1147 DQKKKILWGNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269 DQKKK+LWGNKKT+ AEES H WD LFSD+ERQEKFNKLM Sbjct: 398 DQKKKLLWGNKKTTIAEESGHRWDSTLFSDRERQEKFNKLM 438 >ref|XP_006575186.1| PREDICTED: protein starmaker-like isoform X5 [Glycine max] Length = 440 Score = 157 bits (396), Expect = 2e-35 Identities = 132/451 (29%), Positives = 192/451 (42%), Gaps = 40/451 (8%) Frame = +1 Query: 58 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD N P +D + +FRKPS DAANR YRRR H S SP Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 R+N A++S R+ ++ RE + + +RQ Sbjct: 59 VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594 E RY+ T D R ES+ ++YQ+ V+KY+ DK D H Sbjct: 115 ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166 Query: 595 RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705 + ++S+ D+ ++ E H+ S G Sbjct: 167 SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226 Query: 706 DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885 DY++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E Sbjct: 227 DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286 Query: 886 LVKKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVI 1017 K + C D+ + GK + F ADE+ +SS ++ K Sbjct: 287 GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKAD 345 Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197 A++S + + DL+ ELVN+NLVG G ++TDQKKK+LWG K+++ E Sbjct: 346 VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 403 Query: 1198 ESVHHWDMPLFSDQERQEKFNKLMVSFLCHP 1290 ES H WD 
+FSD+ERQEKFNKLM + P Sbjct: 404 ESGHRWDTAMFSDRERQEKFNKLMSEVVLVP 434 >ref|XP_006575183.1| PREDICTED: protein starmaker-like isoform X2 [Glycine max] gi|571440534|ref|XP_006575184.1| PREDICTED: protein starmaker-like isoform X3 [Glycine max] gi|571440536|ref|XP_006575185.1| PREDICTED: protein starmaker-like isoform X4 [Glycine max] Length = 480 Score = 156 bits (395), Expect = 2e-35 Identities = 131/444 (29%), Positives = 190/444 (42%), Gaps = 40/444 (9%) Frame = +1 Query: 58 MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234 MD N P +D + +FRKPS DAANR YRRR H S SP Sbjct: 2 MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58 Query: 235 HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414 R+N A++S R+ ++ RE + + +RQ Sbjct: 59 VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114 Query: 415 XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594 E RY+ T D R ES+ ++YQ+ V+KY+ DK D H Sbjct: 115 ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166 Query: 595 RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705 + ++S+ D+ ++ E H+ S G Sbjct: 167 SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226 Query: 706 DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885 DY++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E Sbjct: 227 DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286 Query: 886 LVKKP-------KLCNA-DEGTGGKIISKFTCA--------ADETPSSSKQVQEIVDKVI 1017 K + C D+ + GK + F ADE+ +SS ++ K Sbjct: 287 GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSH-ESKAD 345 Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197 A++S + + DL+ ELVN+NLVG G ++TDQKKK+LWG K+++ E Sbjct: 346 VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 403 Query: 1198 ESVHHWDMPLFSDQERQEKFNKLM 1269 ES H WD +FSD+ERQEKFNKLM Sbjct: 404 ESGHRWDTAMFSDRERQEKFNKLM 427 >ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1 [Glycine max] Length = 479 Score = 156 bits (394), 
Expect = 3e-35
 Identities = 132/444 (29%), Positives = 188/444 (42%), Gaps = 40/444 (9%)
 Frame = +1

Query: 58   MDPNLSLSPDI-ADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEF 234
            MD N P +D + +FRKPS DAANR YRRR H S SP
Sbjct: 2    MDSNSPFLPHCNSDTKNAFRKPSGDAANRNYRRRSPVEGSPSPDASP---RHGHSSSPNL 58

Query: 235  HRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXX 414
            R+N A++S R+ ++ RE + + +RQ
Sbjct: 59   VRENSARVSHHSRKYDD--REQDQQYGRNHYGRSSDSLRHSDRQSFKSSYGHSRHDKY-- 114

Query: 415  XXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCDGHG 594
            E RY+ T D R ES+ ++YQ+ V+KY+ DK D H
Sbjct: 115  ------ANEDRYREKLLSRSGHE--TRDDHMRDESDSRSKNYQRSVEKYSHDKYDRSDHR 166

Query: 595  RRRL-------------INSNLDEV----------KIGEERHNSXXXXXXXXXXXXXSLG 705
            + ++S+ D+ ++ E H+ S G
Sbjct: 167  SKEKRRETYLEHQKYKDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSG 226

Query: 706  DYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHEDNEF 885
            DY++D + ESR +S R+ G++ L E KS KE + Q E+RKH+D E
Sbjct: 227  DYRSDQAVCYSESRNQRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTET 286

Query: 886  LVKKP-------KLCNA-DEGTGGKIISKFTCAADET--------PSSSKQVQEIVDKVI 1017
            K + C D+ + GK + F D+ SSSK E K
Sbjct: 287  GKGKDWKTRQASEQCGIEDKESSGKKLKLFDLDKDDNYRKDDESKTSSSKLSHE--SKAD 344

Query: 1018 PEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTSAAE 1197
            A++S + + DL+ ELVN+NLVG G ++TDQKKK+LWG K+++ E
Sbjct: 345  VRAAKTSGFDGDN--DLDAAKVAAMRAAELVNRNLVGAGCLTTDQKKKLLWGGKRSTPTE 402

Query: 1198 ESVHHWDMPLFSDQERQEKFNKLM 1269
            ES H WD +FSD+ERQEKFNKLM
Sbjct: 403  ESGHRWDTAMFSDRERQEKFNKLM 426

>gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus guttatus]
          Length = 406

 Score = 147 bits (370), Expect = 2e-32
 Identities = 124/393 (31%), Positives = 169/393 (43%), Gaps = 1/393 (0%)
 Frame = +1

Query: 94   DARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPSPEFHRDNPAKISDDPR 273
            D++ FRKPSNDAA+RKYRRR + DRS SP + + +++DD R
Sbjct: 10   DSKAEFRKPSNDAASRKYRRRSPAGGSSSSSDGSL--HRDRSSSPLPRKKDSIRVADDNR 67

Query: 274  RRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXXXXXXXXXADGGERRYQ 453
            R+ E+G R L R D +R Y
Sbjct: 68   KTEDG-RNLSGRSGESYKYTDRHSSKNYPRHDEHSRRDRH-----------VDDYDRGYS 115

Query: 454  XXXXXXXXXXXXTHS-DCTRQESEYERRDYQQHVDKYNRDKPDCDGHGRRRLINSNLDEV 630
            + D +R + E+ RDY + +D ++ K D L+N + D+
Sbjct: 116  KSSYRSNRDQRDNGNFDHSRSDKEHRSRDYIKDIDTHSHAKSD-------GLVNRSRDKE 168

Query: 631  KIGEERHNSXXXXXXXXXXXXXSLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETR 810
            K ER S SLGD DS++ ++ + L ET
Sbjct: 169  KY--ERAGSGRGDQYVKTDRRKSLGDQS---------------DSSSRKDTSGHRLKETS 211

Query: 811  KSSSKELDGQKRNVMERRKHEDNEFLVKKPKLCNADEGTGGKIISKFTCAADETPSSSKQ 990
            KEL+ +K E+RK DN + K+ A E + K I KFT + P S
Sbjct: 212  WREGKELNAEKYVNDEKRKF-DNRSIYKEEGNGEAKEHSDDKSI-KFTETVTKKPKFSS- 268

Query: 991  VQEIVDKVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILW 1170
            +D P +S V+ D++ ELVNKNLVG GYMSTDQKKK+LW
Sbjct: 269  ----LDSKAPVTDGTSEQPYVTDSDIDAAKIAAMKAAELVNKNLVGTGYMSTDQKKKLLW 324

Query: 1171 GNKKTSAAEESVHHWDMPLFSDQERQEKFNKLM 1269
            G+KK++A EES H WD F D+ERQEKFNKLM
Sbjct: 325  GSKKSTATEESAHRWDTITFGDRERQEKFNKLM 357

>ref|XP_006589006.1| PREDICTED: arginine/serine-rich coiled-coil protein 2-like isoform X6 [Glycine max]
          Length = 447

 Score = 144 bits (362), Expect = 1e-31
 Identities = 133/448 (29%), Positives = 184/448 (41%), Gaps = 40/448 (8%)
 Frame = +1

Query: 49   LGFMDPNLS-LSPDIADARPSFRKPSNDAANRKYRRRXXXXXXXXXXXXXXXXNHDRSPS 225
            L MD NL L P +D + SFRKPS DAANR Y+ R H S S
Sbjct: 19   LSMMDSNLPFLPPSNSDTKNSFRKPSGDAANRNYQHRSPVDRSPSPDAS----RHGHSSS 74

Query: 226  PEFHRDNPAKISDDPRRRENGGRELEMEXXXXXXXXXXXXXXXXERQPXXXXXXXXXXXX 405
            P R+N A++S R+ ++ RE + + +RQ
Sbjct: 75   PNPVRENSARVSHHSRKYDD--REHDQQYGRNHYGRSSDSLRHSDRQSFKSSFGHSRYDK 132

Query: 406  XXXXXXXADGGERRYQXXXXXXXXXXXXTHSDCTRQESEYERRDYQQHVDKYNRDKPDCD 585
            E RY+ + D R+ES+ ++YQ VDKY+ DK D
Sbjct: 133  Y--------ANEDRYRERLLSRSGHE--SRDDHVREESDSRPKNYQCSVDKYSHDKYDRS 182

Query: 586  GHG---RRRLINSNLDEVK--------------------IGEERHNSXXXXXXXXXXXXX 696
            H +RR S + K + E H+
Sbjct: 183  DHRSKEKRRDTYSEHQKYKDMDSSYEKSASSKRHALYDEVEREGHSRDWDGQNERRDSRR 242

Query: 697  SLGDYKNDHISSFDESRGHGKDSTAARENGRNGLTETRKSSSKELDGQKRNVMERRKHED 876
            S GDY++D +S R++G+ L E KS KE + Q E+RKH+D
Sbjct: 243  SSGDYRSDQRD----------ESGPQRDSGKFSLKEAYKSEQKESNDQNLPWEEKRKHDD 292

Query: 877  NEFLVKKP--------KLCNADEGTGGKIISKFTCA--------ADETPSSSKQVQEIVD 1008
            E K + D+ + GK + F ADE+ +SS +
Sbjct: 293  TEIRKGKDWKTRKAGEQCAIEDKESSGKKLKLFDPDKDDNYRKDADESKTSSSNLSH-KS 351

Query: 1009 KVIPEPAQSSANEAVSACDLNXXXXXXXXXXELVNKNLVGGGYMSTDQKKKILWGNKKTS 1188
            K +SS + + DL+ ELVN+NLVG G ++TDQKKK+LWG KK++
Sbjct: 352  KEDLWAVKSSGFDGDN--DLDAAKIAAMRAAELVNRNLVGPGCLTTDQKKKLLWGGKKST 409

Query: 1189 AAEESVHHWDMPLFSDQERQEKFNKLMV 1272
            EES H WD +FSD+ERQEKFNKLMV
Sbjct: 410  PTEESGHRWDTGMFSDRERQEKFNKLMV 437