BLASTX nr result
ID: Sinomenium21_contig00014562
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00014562
         (2050 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   464   e-128
ref|XP_007022647.1| ARM repeat superfamily protein, putative iso...   442   e-121
ref|XP_007022650.1| ARM repeat superfamily protein, putative iso...   442   e-121
ref|XP_007022651.1| ARM repeat superfamily protein, putative iso...   441   e-121
ref|XP_007022648.1| ARM repeat superfamily protein, putative iso...   441   e-121
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   439   e-120
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   436   e-119
ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     419   e-114
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   419   e-114
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   417   e-114
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   415   e-113
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   415   e-113
ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prun...   414   e-113
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   412   e-112
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       410   e-112
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           409   e-111
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   408   e-111
ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phas...   403   e-109
gb|EYU23502.1| hypothetical protein MIMGU_mgv1a005564mg [Mimulus...
375 e-101 gb|EYU22629.1| hypothetical protein MIMGU_mgv1a025194mg, partial... 371 e-100 >ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera] Length = 494 Score = 464 bits (1194), Expect = e-128 Identities = 257/486 (52%), Positives = 325/486 (66%), Gaps = 4/486 (0%) Frame = -1 Query: 2011 PENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXX 1832 PENI++ L + S+SSTL + LE+L++ S+T GR DL KN +P+VL+LS+SLS Sbjct: 11 PENILQPLFSVSNSSTLDETLELLIEASKTPGGRLDLGSKNILPVVLQLSQSLSYPSGHD 70 Query: 1831 XXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXG-FGSDLDCEIVRIGLQLLGN 1655 LCAGE+ NQN F SD D I+R+GLQLLGN Sbjct: 71 ILLLSLKLLRNLCAGEMTNQNLFIEQNGVKAVSTILLSFVGLDSDSDYGIIRMGLQLLGN 130 Query: 1654 VSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGL 1475 VSLAGE H +AVW FFP GFLE+AR+R E DPLC V++ C + E + E+CG +GL Sbjct: 131 VSLAGERHQRAVWHHFFPAGFLEIARVRTLETSDPLCMVIYTCFDQSHEFITEICGDQGL 190 Query: 1474 QIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGDFKCQD---TF 1304 I+AEIVRTAS VGFEE+WLKLLLS+IC G SG+++ + Sbjct: 191 PILAEIVRTASTVGFEEDWLKLLLSRICLEESHFPMLFSKLCPVG-TSGNYESIEFKVDV 249 Query: 1303 FTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPS 1124 F EQ+FL+ I++E LN+QIN+++VS+D ALC+ GILK+ GV+D S KSG GS + Sbjct: 250 FASEQAFLMDIVAEILNEQINKMTVSSDVALCVLGILKKSAGVLDSVSTCKSGFSAGSNA 309 Query: 1123 IDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKS 944 I+VL YS+ IL+++CA++ +SS GS+DVV LEPP II+K+ Sbjct: 310 INVLKYSLTILKEICARDAQKSSNEHGSVDVVDLLVSSGLLELLLCLLRDLEPPAIIRKA 369 Query: 943 ISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCV 764 I + +NQ + S S K PY+GF+RD+VAVIGNC YRRKHVQ+EIR++NGILLL+QQCV Sbjct: 370 IKQGENQ-DGAASYSPKHYPYRGFRRDLVAVIGNCAYRRKHVQNEIRERNGILLLLQQCV 428 Query: 763 TDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRA 584 TDE N FLREWGIW VRNLLE N ENQR VAE+ELQGSVDVPEI GLGLRVEVDQK RA Sbjct: 429 TDEENQFLREWGIWCVRNLLEGNVENQRVVAELELQGSVDVPEIAGLGLRVEVDQKTGRA 488 Query: 583 KLVNIS 566 KLVN+S Sbjct: 489 KLVNVS 494 >ref|XP_007022647.1| 
ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao] gi|508722275|gb|EOY14172.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao] Length = 531 Score = 442 bits (1138), Expect = e-121 Identities = 241/501 (48%), Positives = 319/501 (63%), Gaps = 2/501 (0%) Frame = -1 Query: 2050 KKMEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVL 1871 K+M P E +++ L++ S+SS+L +ALEIL++ SRT R++LAL+N +P VL Sbjct: 11 KEMVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVL 70 Query: 1870 ELSKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDC 1691 +L +S LCAGE+ NQN+F S+ D Sbjct: 71 KLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDS 130 Query: 1690 EIVRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKD 1511 ++R+ LQ+L NVSLAGE+H +A+W +FFP F +AR+R E DPLC +L+ CC + Sbjct: 131 GVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRP 190 Query: 1510 ERVAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA- 1334 VAELC GL IV I+RT + VGF E+W KLLLS++C + Sbjct: 191 GLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSE 250 Query: 1333 -SGDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSK 1157 SG+ D F EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+ Sbjct: 251 NSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASR 310 Query: 1156 GKSGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXX 977 G S LPTG SIDV+GYS+IILRD+CA+EG K + S+DVV Sbjct: 311 GMSSLPTGCTSIDVMGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLR 369 Query: 976 XLEPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQK 797 L+PP II+K + NQ + + K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQK Sbjct: 370 DLDPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQK 427 Query: 796 NGILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGL 617 NGILLL+QQCVTD+ NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGL Sbjct: 428 NGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGL 487 Query: 616 RVEVDQKNRRAKLVNIS*DEI 554 RVEVDQK RRAK + D++ Sbjct: 488 RVEVDQKTRRAKDFALPPDQV 508 
>ref|XP_007022650.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao] gi|508722278|gb|EOY14175.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao] Length = 500 Score = 442 bits (1137), Expect = e-121 Identities = 240/492 (48%), Positives = 315/492 (64%), Gaps = 2/492 (0%) Frame = -1 Query: 2050 KKMEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVL 1871 K+M P E +++ L++ S+SS+L +ALEIL++ SRT R++LAL+N +P VL Sbjct: 11 KEMVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVL 70 Query: 1870 ELSKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDC 1691 +L +S LCAGE+ NQN+F S+ D Sbjct: 71 KLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDS 130 Query: 1690 EIVRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKD 1511 ++R+ LQ+L NVSLAGE+H +A+W +FFP F +AR+R E DPLC +L+ CC + Sbjct: 131 GVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRP 190 Query: 1510 ERVAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA- 1334 VAELC GL IV I+RT + VGF E+W KLLLS++C + Sbjct: 191 GLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSE 250 Query: 1333 -SGDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSK 1157 SG+ D F EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+ Sbjct: 251 NSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASR 310 Query: 1156 GKSGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXX 977 G S LPTG SIDV+GYS+IILRD+CA+EG K + S+DVV Sbjct: 311 GMSSLPTGCTSIDVMGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLR 369 Query: 976 XLEPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQK 797 L+PP II+K + NQ + + K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQK Sbjct: 370 DLDPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQK 427 Query: 796 NGILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGL 617 NGILLL+QQCVTD+ NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGL Sbjct: 428 NGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGL 487 Query: 616 RVEVDQKNRRAK 581 RVEVDQK RRAK Sbjct: 488 RVEVDQKTRRAK 499 
>ref|XP_007022651.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao] gi|508722279|gb|EOY14176.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao] Length = 519 Score = 441 bits (1134), Expect = e-121 Identities = 238/487 (48%), Positives = 315/487 (64%), Gaps = 2/487 (0%) Frame = -1 Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829 E +++ L++ S+SS+L +ALEIL++ SRT R++LAL+N +P VL+L +S Sbjct: 13 EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72 Query: 1828 XXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVS 1649 LCAGE+ NQN+F S+ D ++R+ LQ+L NVS Sbjct: 73 LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132 Query: 1648 LAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQI 1469 LAGE+H +A+W +FFP F +AR+R E DPLC +L+ CC + VAELC GL I Sbjct: 133 LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192 Query: 1468 VAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA--SGDFKCQDTFFTR 1295 V I+RT + VGF E+W KLLLS++C + SG+ D F Sbjct: 193 VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252 Query: 1294 EQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDV 1115 EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+G S LPTG SIDV Sbjct: 253 EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312 Query: 1114 LGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKSISR 935 +GYS+IILRD+CA+EG K + S+DVV L+PP II+K + Sbjct: 313 MGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 371 Query: 934 TKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCVTDE 755 NQ + + K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQKNGILLL+QQCVTD+ Sbjct: 372 GDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDD 429 Query: 754 CNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRAKLV 575 NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGLRVEVDQK RRAK Sbjct: 430 DNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAKDF 489 Query: 574 NIS*DEI 554 + D++ Sbjct: 490 ALPPDQV 496 
>ref|XP_007022648.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|590613384|ref|XP_007022649.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|590613394|ref|XP_007022652.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722276|gb|EOY14173.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722277|gb|EOY14174.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722280|gb|EOY14177.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] Length = 488 Score = 441 bits (1133), Expect = e-121 Identities = 237/478 (49%), Positives = 311/478 (65%), Gaps = 2/478 (0%) Frame = -1 Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829 E +++ L++ S+SS+L +ALEIL++ SRT R++LAL+N +P VL+L +S Sbjct: 13 EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72 Query: 1828 XXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVS 1649 LCAGE+ NQN+F S+ D ++R+ LQ+L NVS Sbjct: 73 LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132 Query: 1648 LAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQI 1469 LAGE+H +A+W +FFP F +AR+R E DPLC +L+ CC + VAELC GL I Sbjct: 133 LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192 Query: 1468 VAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA--SGDFKCQDTFFTR 1295 V I+RT + VGF E+W KLLLS++C + SG+ D F Sbjct: 193 VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252 Query: 1294 EQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDV 1115 EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+G S LPTG SIDV Sbjct: 253 EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312 Query: 1114 LGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKSISR 935 +GYS+IILRD+CA+EG K + S+DVV L+PP II+K + Sbjct: 313 MGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 371 Query: 934 TKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCVTDE 755 NQ + + 
K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQKNGILLL+QQCVTD+ Sbjct: 372 GDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDD 429 Query: 754 CNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRAK 581 NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGLRVEVDQK RRAK Sbjct: 430 DNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487 >ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis] gi|223548954|gb|EEF50443.1| conserved hypothetical protein [Ricinus communis] Length = 497 Score = 439 bits (1129), Expect = e-120 Identities = 244/487 (50%), Positives = 311/487 (63%), Gaps = 2/487 (0%) Frame = -1 Query: 2020 LCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAX 1841 L PE++++ L S S L +ALEIL++ SR GR++LA K+ +PLVL+L KS+S Sbjct: 3 LFLPEDLLQLLFRASKSYDLKEALEILIETSRIDDGRANLAAKDVLPLVLKLFKSISYPS 62 Query: 1840 XXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLL 1661 LCAGEI NQN F G + D I+R+GLQ+L Sbjct: 63 GDQFLTLSLKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEPDYGIIRLGLQVL 122 Query: 1660 GNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFR 1481 NVSLAGE+H +A+W FFP F+ +A+ R DPLC +++ CC V ELCG R Sbjct: 123 ANVSLAGEKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCGDR 182 Query: 1480 GLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA--SGDFKCQDT 1307 GL +VAEIVRTAS VG+ E+W KLLLS+IC AGD+ S Sbjct: 183 GLAVVAEIVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSSSD 242 Query: 1306 FFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSP 1127 F+ EQ++LLS +SE LN+++ +ISVS DFA ++GI KR VGV+DF S+G SGLPTGS Sbjct: 243 LFSTEQAYLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTGSA 302 Query: 1126 SIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKK 947 ++DVLGYS+ ILRD CA G + S+DVV LEPP +IKK Sbjct: 303 AVDVLGYSLTILRDTCALHG--KGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIKK 360 Query: 946 SISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQC 767 ++ + +N E S S K CPYKGF+RDIVAVIGNC ++R +VQDEIRQK+ I LL+QQC Sbjct: 361 
AMKQNENH-EPASSRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQQC 419 Query: 766 VTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRR 587 VTDE NPFLREWG+W VRNLLE N ENQ+ VAE+ELQG+V VPE++GLGLRVEVD RR Sbjct: 420 VTDEDNPFLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNTRR 479 Query: 586 AKLVNIS 566 A+LVN+S Sbjct: 480 ARLVNVS 486 >ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858312|ref|XP_006421839.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858314|ref|XP_006421840.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858316|ref|XP_006421841.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|568874427|ref|XP_006490317.1| PREDICTED: ataxin-10-like isoform X1 [Citrus sinensis] gi|568874429|ref|XP_006490318.1| PREDICTED: ataxin-10-like isoform X2 [Citrus sinensis] gi|557523711|gb|ESR35078.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523712|gb|ESR35079.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523713|gb|ESR35080.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523714|gb|ESR35081.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] Length = 497 Score = 436 bits (1120), Expect = e-119 Identities = 234/494 (47%), Positives = 315/494 (63%), Gaps = 2/494 (0%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 M+D+ S ++ E++++ L+ S+SS+L DALEIL++ S+T GRSDLA KN +P VL+L Sbjct: 1 MDDASSLDISLSEDVLQPLLTTSNSSSLKDALEILIESSKTTVGRSDLASKNILPEVLQL 60 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 ++S+ ++ LCAGEI NQ SF G D D I Sbjct: 61 TQSIPHSSGCHYLLLSLKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLDKDYGI 120 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 +RI LQ+L NVSLAGE H A+W QFFP F +A +R E DPLC V++ CC Sbjct: 121 IRIALQVLANVSLAGETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGL 180 Query: 1504 
VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASG- 1328 ELCG +GL I+AEIV TA+ VGF+E+W K L+S+ C G + Sbjct: 181 FKELCGDKGLAIMAEIVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNC 240 Query: 1327 -DFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151 D ++ F+ EQ+FLL I+SE +N++I EI V NDFAL + GI + +G++DF+++G Sbjct: 241 EDSNSREGTFSSEQAFLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGT 300 Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971 LPT S +I+VLGYS+ ILR++CA+E S D+V L Sbjct: 301 PSLPTSSSAINVLGYSLSILRNICAREDPAGSSSVNRADLVDSLQSHGLIEMFLSLLRDL 360 Query: 970 EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791 EPP II+K++ + +NQ E + S K CPY GF+RD+VAVIGNC YRRKH+QDEIR+++G Sbjct: 361 EPPAIIRKAMRQGENQ-EGTSAKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERDG 419 Query: 790 ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611 ILLL+QQCVTDE NPF REWGIW VRNLLE N ENQ+ VA++ELQGS++VPE+T LGL+V Sbjct: 420 ILLLLQQCVTDEDNPFSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLKV 479 Query: 610 EVDQKNRRAKLVNI 569 EVD+ RRAKLVN+ Sbjct: 480 EVDKNTRRAKLVNV 493 >ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum] Length = 501 Score = 419 bits (1078), Expect = e-114 Identities = 239/495 (48%), Positives = 301/495 (60%), Gaps = 2/495 (0%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 ++D L PEN+ + L+ S+SS+L ALE L++ ++ GR DL+ KN V VL L Sbjct: 8 VDDQIVAELTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 67 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 +SLS+ LCAGEI+NQN F G D DC I Sbjct: 68 CQSLSSISYRYLLLLSLKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLTPDPDCMI 127 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 +R+GLQLLGN S+ G E VW Q FP FL++AR+R EI DPLC V++ CC D Sbjct: 128 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGL 187 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQIC--XXXXXXXXXXXXXXLAGDAS 1331 + +LC +GL I+ EI+RTAS VG +E WLKLLLS++C + + + Sbjct: 
188 LTDLCSEKGLPILIEILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPSVENN 247 Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151 G F EQS+LLS LSE LN+++ I VS+DFA I+GILK GV DF +GK Sbjct: 248 GVVTHVVDQFVIEQSYLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFSIRGK 307 Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971 S LP GS IDVLGYS+ ILRD+CA + SSK E S DVV L Sbjct: 308 SDLPVGSAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 367 Query: 970 EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791 EPP I+K++ + + + E S S + CPY+GF+RDIVA++GNC YRR+HVQDEIR KNG Sbjct: 368 EPPTTIRKAMKQDQIK-EGTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNG 426 Query: 790 ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611 ILLL+QQCV DE NPFLREWGIW VRNLLE N ENQ + ++ELQG+VDVPE+ LGLRV Sbjct: 427 ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 486 Query: 610 EVDQKNRRAKLVNIS 566 EVD R KLVN S Sbjct: 487 EVDPVTRHTKLVNSS 501 >ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum] gi|460373805|ref|XP_004232704.1| PREDICTED: ataxin-10-like isoform 2 [Solanum lycopersicum] Length = 501 Score = 419 bits (1077), Expect = e-114 Identities = 239/495 (48%), Positives = 300/495 (60%), Gaps = 2/495 (0%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 M+D L PEN+ + L+ S+SS+L AL+ L+Q S+ GR DL+ KN V VL L Sbjct: 8 MDDQIVSELTIPENVAKELLLVSNSSSLETALDKLIQLSKEGGGRLDLSSKNVVTTVLHL 67 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 +SLS+ LCAGEI NQN F G D DC I Sbjct: 68 CQSLSSISYRNLLLLSLKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLSPDPDCMI 127 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 +R+GLQLLGN S+ G E VW Q FP FL++AR+R EI DPLC V++ CC D Sbjct: 128 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGL 187 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQIC--XXXXXXXXXXXXXXLAGDAS 1331 + +LC +GL I+ EI+RTAS VG +E WLKLLLS++C + + + Sbjct: 188 
LTDLCSEQGLPILFEILRTASAVGLKEVWLKLLLSKLCIEGSHISSIFFKLHSYPSVEDN 247 Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151 G F EQ +LLSILSE LN+++ I VS+DFA I+GILK GV+DF +GK Sbjct: 248 GVVTHVADQFVIEQPYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFSIRGK 307 Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971 S LP GS IDVLGYS+ ++RD+CA + SSK E S DVV L Sbjct: 308 SDLPVGSAPIDVLGYSLTLMRDICASDHLSSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 367 Query: 970 EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791 EPP I+ ++ + + PS S + CPY+GF+RDIVA++GNC YRR+HVQDEIR KNG Sbjct: 368 EPPTTIRNAMKPDQIKEGTIPS-SFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNG 426 Query: 790 ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611 ILLL+QQCV DE NPFLREWGIW VRNLLE N ENQ + ++ELQG+VDVPE+ LGLRV Sbjct: 427 ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 486 Query: 610 EVDQKNRRAKLVNIS 566 EVD RR KLVN S Sbjct: 487 EVDPVTRRTKLVNSS 501 >ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa] gi|222861524|gb|EEE99066.1| ataxin-related family protein [Populus trichocarpa] Length = 496 Score = 417 bits (1072), Expect = e-114 Identities = 239/491 (48%), Positives = 304/491 (61%), Gaps = 6/491 (1%) Frame = -1 Query: 2020 LCTPEN-IIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSN- 1847 L P+N +E L S SS L + LEIL+ ++T GR+DLA KN +P+VL+L L N Sbjct: 9 LSFPQNDFLEPLFTASKSSDLKETLEILIAIAKTDDGRADLASKNILPVVLQLITHLLND 68 Query: 1846 AXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGS-DLDCEIVRIGL 1670 LCAGE+ NQ SF S + D I+R+GL Sbjct: 69 PFDHEYLSLSLRLMRNLCAGEVANQKSFIQLNGVGIFLTVLRSKKVASSEPDHGIIRMGL 128 Query: 1669 QLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELC 1490 Q+L NVSLAG+EH +A+W F +A++R DPLC +++ CC E V +LC Sbjct: 129 QVLANVSLAGKEHQQAIWGGLFHDELYMLAKVRSQGTCDPLCMIIYACCDGSPELVLQLC 188 Query: 1489 GFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLA---GDASGDFK 1319 G +GL IV EI+RTAS VGF E WLKLLLS+IC + + Sbjct: 189 
GNQGLPIVVEIIRTASLVGFGEEWLKLLLSRICLEDIYFPQLFSRIYSVCSYCENGEEIS 248 Query: 1318 CQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLP 1139 F EQ++LL+I+SE LN+++ EI++ NDFALCI+GI K+ V +F S+ +S LP Sbjct: 249 LSSNPFFTEQAYLLNIVSEILNERLKEITILNDFALCIFGIFKKSVEAFEFGSRAESRLP 308 Query: 1138 TGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPE 959 TG IDVLGYS+ ILRD+CA G E +DVV LEPP+ Sbjct: 309 TGFAVIDVLGYSLTILRDICANNGGVGK--EDLVDVVDSLLSSGLLDLLLCLLRDLEPPK 366 Query: 958 IIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLL 779 II+K++++ NQ E S KVCPYKGF+RD+VAVIGNC YRRKHVQD+IRQKNG+LL+ Sbjct: 367 IIRKAMNQAGNQ-EATTSYFPKVCPYKGFRRDLVAVIGNCAYRRKHVQDDIRQKNGMLLM 425 Query: 778 MQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQ 599 +QQCVTDE NPFLREWGIWS+RNLLE N ENQ+ VAE+ELQGSVD+PE+ GLGL+VEVDQ Sbjct: 426 LQQCVTDEDNPFLREWGIWSMRNLLEGNSENQQAVAELELQGSVDMPELAGLGLKVEVDQ 485 Query: 598 KNRRAKLVNIS 566 R AKLVNIS Sbjct: 486 NTRSAKLVNIS 496 >ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10 [Medicago truncatula] Length = 491 Score = 415 bits (1067), Expect = e-113 Identities = 231/493 (46%), Positives = 306/493 (62%), Gaps = 2/493 (0%) Frame = -1 Query: 2038 DSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSK 1859 D+P N + + L S+S+TL +LE L++ S++ RS A K +P +L + Sbjct: 6 DAPFSNHPISQQSLNSLFDLSNSTTLQTSLETLIESSKSTSNRSLYACKKILPTILTV-- 63 Query: 1858 SLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGF-GSDLDCEIV 1682 L + LCAGEILNQN F GSD +V Sbjct: 64 -LHSPPSLHILSLCFKLLRNLCAGEILNQNMFLENDGVFIVVSSILRSEVVGSDY--MLV 120 Query: 1681 RIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERV 1502 R GLQ+L NV LAG+EH KAVWD+ FP GFL VARI + E+ DPLC V++ CC D+ Sbjct: 121 RWGLQVLANVCLAGKEHQKAVWDEMFPVGFLSVARIGKKEVNDPLCMVIYTCCDGNDQWF 180 Query: 1501 AELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASG-D 1325 +E+C G ++ EIVRTAS F E+W+KLLLS+IC G D Sbjct: 181 SEVCSDGGWNVLVEIVRTASSASFGEDWIKLLLSRICLEDSQLRVLFSKLRFMDIPDGED 240 Query: 1324 
FKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSG 1145 K +D F+ EQ+FLL I+S+ LN++I ++++S + A +YGI K+ +GV++ +GKSG Sbjct: 241 TKTKDDQFSSEQAFLLQIISDILNERIGDVTISLEVASFVYGIFKKSIGVLEHAVRGKSG 300 Query: 1144 LPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEP 965 LP+G +DVLGYS+ +LRD+CA + R + + +VV LEP Sbjct: 301 LPSGITDVDVLGYSLTMLRDICAHDSVRGNSED--TEVVDMLLSYGLIELVFILLGDLEP 358 Query: 964 PEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGIL 785 P II+K + ++N S S K CPYKGF+RDIVA+IGNC+YRRKHVQDEIR +NGIL Sbjct: 359 PTIIRKGMKHSENP--DGASSSSKPCPYKGFRRDIVALIGNCVYRRKHVQDEIRSRNGIL 416 Query: 784 LLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEV 605 LL+QQCVTDE NP+LREWGIW VRN+LE NEENQ+E++E++LQGS DVPEI+ LGLRVEV Sbjct: 417 LLLQQCVTDEDNPYLREWGIWCVRNMLEGNEENQKEISELQLQGSADVPEISALGLRVEV 476 Query: 604 DQKNRRAKLVNIS 566 DQK RRAKLVN+S Sbjct: 477 DQKTRRAKLVNVS 489 >ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca subsp. vesca] Length = 492 Score = 415 bits (1066), Expect = e-113 Identities = 230/495 (46%), Positives = 310/495 (62%), Gaps = 2/495 (0%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 M+++ P PE++++ L++ S+SS L D+LE L+Q +T GR DL+ KN +P V++L Sbjct: 1 MDNTTLPECSVPEHVLQALLSVSNSSKLVDSLEDLVQVCKTADGREDLSAKNVLPTVIQL 60 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 +SLS LCAGE+ NQNSF + D I Sbjct: 61 VQSLSYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIISNILSSAS-SLEPDFGI 119 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 + +GLQ+L NV+LAGE A+W Q F F+ +AR+R + PLC +++ CC E Sbjct: 120 ICVGLQVLANVALAGERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPEL 179 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAG--DAS 1331 VA+LCG G+ IV EIV+TA+ GF E+W KLLLS+IC G + Sbjct: 180 VAQLCGDCGVTIVKEIVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENG 239 Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151 D + F EQ FLL +SE 
LN+++NEI+V +DFALC++GI K + V+ + ++G+ Sbjct: 240 DDTEGGQESFLEEQEFLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGR 299 Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971 SGLPTGS IDVLGYS+ ILRD+CA+ R ++ ++DVV L Sbjct: 300 SGLPTGSIDIDVLGYSLTILRDICAQGTLRGCTVD-TMDVVDALISYGLIELLLCLLRDL 358 Query: 970 EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791 EPP IIKKS+++ K+Q S S K CPYKGF+RDIV VIGNCLY R+ VQDEIR+K+G Sbjct: 359 EPPAIIKKSVNQAKDQEGSNYSAS-KPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKDG 417 Query: 790 ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611 +LLL+QQCVTD+ NP+LREWGIW VRNLLE N+ENQ+ VAE+ELQGSVDVP++ LGLRV Sbjct: 418 LLLLLQQCVTDDDNPYLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLRV 477 Query: 610 EVDQKNRRAKLVNIS 566 E++ R KLVNIS Sbjct: 478 EMNPATGRPKLVNIS 492 >ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica] gi|462415516|gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica] Length = 492 Score = 414 bits (1064), Expect = e-113 Identities = 228/494 (46%), Positives = 306/494 (61%), Gaps = 1/494 (0%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 M+ + PE++++ L++ S+SSTL D+LE L+Q R GR+DLA K+ +P V++L Sbjct: 1 MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 +SL LCAGE+ NQ SF + D + Sbjct: 61 IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 +R+GLQ+L NVSLAGE H +W Q FP FL +AR++ E DPLC V+ CC E Sbjct: 121 IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDAS-G 1328 +LCG G+ I+ EIVRT + VGF E+W+KLLLS+IC A + Sbjct: 181 FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240 Query: 1327 DFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKS 1148 D + ++ F+ +Q+F L I+S+ LN+++ EI+V DFALC++GI K+ 
VG ++ ++G+S Sbjct: 241 DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300 Query: 1147 GLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLE 968 GLPTG+ IDVLGYS+ ILRDVCA++ R + E D V LE Sbjct: 301 GLPTGTSMIDVLGYSLTILRDVCAQKTLRGFQ-EDLGDAVDVLLSHGLIELILCLLRDLE 359 Query: 967 PPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGI 788 PP II+K+I + + Q + S S K CPYKGF+RDIVAVIGNC Y+RK VQDEIRQ++GI Sbjct: 360 PPAIIRKAIKQGEGQ-DGTNSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDGI 418 Query: 787 LLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVE 608 LLL+QQC DE NPFL+EWGIW VRNLLE NE+N+R V E+ELQGSVD PEI GLG RVE Sbjct: 419 LLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRVE 478 Query: 607 VDQKNRRAKLVNIS 566 V+ + R KLVN+S Sbjct: 479 VNPETGRPKLVNVS 492 >ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. vesca] Length = 490 Score = 412 bits (1059), Expect = e-112 Identities = 227/493 (46%), Positives = 306/493 (62%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 M+++ P PE++I+ L++ S+SS L +++E L+Q +T GR DLA KN +P V++L Sbjct: 1 MDNTALPECSVPEDVIQALLSVSNSSNLVESMEDLIQVCKTADGREDLAAKNVLPTVIQL 60 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 +SL LCAGE+ NQNSF + D I Sbjct: 61 VQSLLYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSA-ISLEPDFWI 119 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 + +GLQ+L N +LAGE A+W Q F F+ +AR+R + PLC ++ CC E Sbjct: 120 ICVGLQVLANAALAGERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPEL 179 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGD 1325 VA+LCG G+ I+ EIV+TA+ V F E+W KLLLS+IC G+ + D Sbjct: 180 VAQLCGDCGVTILKEIVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVGENAED 239 Query: 1324 FKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSG 1145 + F++EQ FLL +SE LN+ ++EI+V NDFALC++GI K + V+ + ++G+SG Sbjct: 240 TEGGRESFSKEQEFLLKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGRSG 299 Query: 1144 
LPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEP 965 LPTGS IDVLGYS+ ILRD CA +G + ++DVV LEP Sbjct: 300 LPTGSIDIDVLGYSLTILRDTCA-QGTLRGSTKDTMDVVDALISYGLIELLLSLLRDLEP 358 Query: 964 PEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGIL 785 P IIKKSI++ +NQ E S +LK CPYKGF+RDIVAVIGNCLY RK VQDEIR+K+G+L Sbjct: 359 PAIIKKSINQAENQ-EGSSSSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGLL 417 Query: 784 LLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEV 605 LL+QQCV D+ NP+ REWGIW RNLL+ N+ENQR VAE+EL+GSVDVP + LGLRVE+ Sbjct: 418 LLLQQCVIDDDNPYSREWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVEM 477 Query: 604 DQKNRRAKLVNIS 566 + R KLVNIS Sbjct: 478 NLATGRPKLVNIS 490 >ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum] Length = 468 Score = 410 bits (1055), Expect = e-112 Identities = 225/462 (48%), Positives = 290/462 (62%), Gaps = 1/462 (0%) Frame = -1 Query: 1951 LEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXXXXXXXXXXXXLCAGEILNQ 1772 LE L+ S++ GRS+LA K +P VL + S + LCAGE NQ Sbjct: 9 LENLIHTSKSDSGRSNLASKRVLPAVLNILNSQTLPLDHNLLSLCFKLLRNLCAGEFENQ 68 Query: 1771 NSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVSLAGEEHGKAVWDQFFPGGF 1592 N F D +VR GLQ+L NV LAG++H KA+W++ FP GF Sbjct: 69 NLFLEFDGVVVVSSILMSEAGSLRPDHMLVRWGLQVLANVCLAGKQHQKAIWEEIFPLGF 128 Query: 1591 LEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQIVAEIVRTASEVGFEENWLK 1412 + +AR+ EI DPLC V++ CC E ELC GL +VAEIV+TAS F E+W+K Sbjct: 129 VSLARLGTKEICDPLCMVIYTCCDGNHECFGELCSDSGLPVVAEIVKTASSASFGEDWIK 188 Query: 1411 LLLSQICXXXXXXXXXXXXXXLAGDASG-DFKCQDTFFTREQSFLLSILSENLNQQINEI 1235 LLLS+IC G D +D F+ EQ+FLL ILSE LN+++ ++ Sbjct: 189 LLLSRICLEESQLPMLFPKLRFMDIPEGEDIDSKDYQFSFEQAFLLQILSEILNERLRDV 248 Query: 1234 SVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDVLGYSVIILRDVCAKEGARSS 1055 VS D AL +YG+ K+ VGV++ +GKSGLP+GS ++D LGYS+ ILRD+CA + R + Sbjct: 249 VVSKDVALFVYGVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSLTILRDICAHDSVRGN 308 Query: 1054 KIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKSISRTKNQVEHKPSDSLKVCPYKG 875 E + DVV LEPP II+K I +++NQ S S K CPYKG Sbjct: 309 
P-EDTNDVVDVLLSQDIIELLLILLGDLEPPAIIRKGIKQSENQ--EGASCSSKPCPYKG 365 Query: 874 FQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCVTDECNPFLREWGIWSVRNLLEAN 695 F+RDIV++IGNC+YRRKH QDEIR +NGILLL+QQCVTDE NPFLREWGIWSVRN+LE N Sbjct: 366 FRRDIVSLIGNCVYRRKHAQDEIRGRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGN 425 Query: 694 EENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRAKLVNI 569 EENQ+ V+E++LQGS DVP+I+ LGLR+EVDQK RRAKLVN+ Sbjct: 426 EENQKVVSELQLQGSADVPQISALGLRIEVDQKTRRAKLVNV 467 >ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max] Length = 498 Score = 409 bits (1050), Expect = e-111 Identities = 230/487 (47%), Positives = 302/487 (62%), Gaps = 7/487 (1%) Frame = -1 Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829 E+ ++ L S+SS + +LEIL+Q++++ GR +LA K +P VL + SL++A Sbjct: 14 EDTLQLLFEASNSSNMEKSLEILIQNAKSDSGRLELASKRILPAVLNIVHSLTHASHHHH 73 Query: 1828 XXXXXXXXXXL------CAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQ 1667 CAGE NQ+SF S D +VR GLQ Sbjct: 74 HQHNHILCLSFKLLRNLCAGEAANQDSFLELDGVAVVCSVLRSEAACSGPDHGLVRWGLQ 133 Query: 1666 LLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCG 1487 +L NVSLAG++H A+W + + GF+ +AR+ E DPLC V++ CC E L Sbjct: 134 VLANVSLAGKQHQCAIWKELYLDGFVSLARLHTKETCDPLCMVIYTCCDGNPEWFKRLSS 193 Query: 1486 FRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGDF-KCQD 1310 G ++AEIVRTAS F E+WLKLLLS+IC A + + +D Sbjct: 194 EDGWFVMAEIVRTASSASFGEDWLKLLLSRICLEESQLPVLFSKLQFADVPKVEVAESKD 253 Query: 1309 TFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGS 1130 F+ EQ+FLL ILSE LN++ +++VS D AL ++GI K +GV++ ++GKSGLP+G Sbjct: 254 DHFSFEQAFLLRILSEILNERHKDVTVSKDVALFVFGIFKNSIGVLEHATRGKSGLPSGF 313 Query: 1129 PSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIK 950 +DVLGYS+ ILRD+CA++G R + E S DVV LEPP II+ Sbjct: 314 VGVDVLGYSLTILRDICAQDGVRGNT-EDSNDVVDALLSYGLIELLLYLLEALEPPAIIR 372 Query: 949 KSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQ 770 K + + +NQ S S K CPYKGF+RDIVA+IGNC+YRRKH QDEIR +NGILLL+QQ Sbjct: 373 
KGLKQCENQ--DGASCSFKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQ 430 Query: 769 CVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNR 590 CVTDE NPFLREWGIWSVRN+LE N+ENQ+ VAE+E+QGS DVPEIT LGLRVEVDQ+ R Sbjct: 431 CVTDEDNPFLREWGIWSVRNMLEGNDENQKVVAELEIQGSADVPEITSLGLRVEVDQRTR 490 Query: 589 RAKLVNI 569 RAKLVNI Sbjct: 491 RAKLVNI 497 >ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum] gi|565401994|ref|XP_006366477.1| PREDICTED: ataxin-10-like isoform X2 [Solanum tuberosum] gi|565401996|ref|XP_006366478.1| PREDICTED: ataxin-10-like isoform X3 [Solanum tuberosum] gi|565401998|ref|XP_006366479.1| PREDICTED: ataxin-10-like isoform X4 [Solanum tuberosum] gi|565402000|ref|XP_006366480.1| PREDICTED: ataxin-10-like isoform X5 [Solanum tuberosum] Length = 504 Score = 408 bits (1049), Expect = e-111 Identities = 237/495 (47%), Positives = 297/495 (60%), Gaps = 2/495 (0%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 ++D + PEN+ + L+ S+SS+L ALE L++ ++ GR DL+ KN V VL L Sbjct: 11 VDDKIVAEVTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 70 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 +SLS+ LCAGEI NQN F G D DC I Sbjct: 71 CQSLSSISYRQLLLSSLKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLTPDPDCMI 130 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 +R+GLQLLGN S+ G E VW Q FP FL++AR+R EI DPLC V++ CC D Sbjct: 131 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGL 190 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQIC--XXXXXXXXXXXXXXLAGDAS 1331 + +LC +GL I+ EI+RTAS V +E WLKLLLS++C + + Sbjct: 191 LTDLCSEQGLPILIEILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNN 250 Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151 G F EQ +LLSILSE +N QI I VS+DFAL I+GILK V+DF +GK Sbjct: 251 GVVTHATDQFVIEQPYLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGK 310 Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971 S LP G IDVLGYS+ ILRD+CA + 
SSK E S DVV L Sbjct: 311 SDLPVGFAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 370 Query: 970 EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791 EPP I+K++ + + E S S + CPY+GF+RDIV++IGNC YRR++VQDEIR KNG Sbjct: 371 EPPTTIRKAMKQDQ-ITEGIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKNG 429 Query: 790 ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611 ILLL+QQCV DE NPFLREWGIW VRNLLE N ENQ + ++ELQG+VDVPE+ LGLRV Sbjct: 430 ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 489 Query: 610 EVDQKNRRAKLVNIS 566 EVD RR KLVN S Sbjct: 490 EVDPVTRRTKLVNAS 504 >ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris] gi|561021998|gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris] Length = 498 Score = 403 bits (1036), Expect = e-109 Identities = 225/485 (46%), Positives = 301/485 (62%), Gaps = 5/485 (1%) Frame = -1 Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829 E+ ++ L S+SS L +LEIL+Q++++ GR +LA K +P VL + +SL+ A Sbjct: 13 EDTLQLLFQASNSSNLEKSLEILIQNAKSDSGRLELASKRILPAVLNIVQSLAQASHHHH 72 Query: 1828 XXXXXXXXXXL----CAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLL 1661 L CAGE NQ SF D +VR GLQ+L Sbjct: 73 HNQTFSLCFKLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGPDHRLVRWGLQVL 132 Query: 1660 GNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFR 1481 NVSL G++H +A+W++ +P GF +AR+ EI DPLC V++ CC E +L Sbjct: 133 ANVSLGGKQHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDGNPEWFKKLSSDD 192 Query: 1480 GLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGD-FKCQDTF 1304 G +VAEIVRTAS F+E+WLKLLLS+I G+ + ++ Sbjct: 193 GWPVVAEIVRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVDVPEGEVIESKNGQ 252 Query: 1303 FTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPS 1124 F+ EQ+FLL ILSE LN+++ +++VS D AL ++GI K+ +GV++ +GKSGLP+G Sbjct: 253 FSFEQAFLLQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAMRGKSGLPSGFTG 312 Query: 1123 IDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKS 944 +DVLGYS+ ILRD+CA++G R + + DVV LEPP II+K Sbjct: 
313 VDVLGYSLTILRDICAQDGMRGN----TKDVVDVLLSYGLIEFLLSLLGALEPPAIIRKG 368 Query: 943 ISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCV 764 + + +NQ S K CPYKGF+RDIVA+IGNC+YRRKH QDEIR +NGILLL+QQCV Sbjct: 369 LKQIENQ--DNASCCSKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRDRNGILLLLQQCV 426 Query: 763 TDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRA 584 TDE NPFLREWGIWSVRN+LE N+ENQ+ VAE+E+QGS DVPEI LGL+VEVDQ+ RR Sbjct: 427 TDEDNPFLREWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINALGLQVEVDQRTRRP 486 Query: 583 KLVNI 569 KLVNI Sbjct: 487 KLVNI 491 >gb|EYU23502.1| hypothetical protein MIMGU_mgv1a005564mg [Mimulus guttatus] Length = 479 Score = 375 bits (964), Expect = e-101 Identities = 215/491 (43%), Positives = 289/491 (58%) Frame = -1 Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865 M+ S NL +N+++ L S SSTL +ALE L++ ++T GR L+ K+ + LEL Sbjct: 1 MDSVKSVNLSIQDNVLQPLFISSGSSTLHEALERLIETAKTSDGRLSLSSKDIIKPALEL 60 Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685 + CAGEI NQ+ F S D EI Sbjct: 61 CQYPLRVPHQELLLAVKLLRNM-CAGEIKNQDLFIEQNGVGILSTLVGSMCSNSGSDNEI 119 Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505 +R+ LQ LGNVSLAGE+H +AVW QFF GF+++AR++ E DPLC V++ C +ER Sbjct: 120 LRMVLQALGNVSLAGEKHQEAVWAQFFSLGFIDIARVQSKETCDPLCMVIYTCSEGTNER 179 Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGD 1325 EL +GL I+ EIVRT + VGF E+WLKLLLS+IC D Sbjct: 180 SGELLSDQGLDIIVEIVRTVTAVGFSEDWLKLLLSKICFDESYFSSIFSKLSENCDEDVP 239 Query: 1324 FKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSG 1145 Q + F +++FLLSILSE LN+++ EI VS+DF+L I+ IL+ V ++DF ++ KS Sbjct: 240 ---QISHFGDQEAFLLSILSEILNERLGEIVVSSDFSLSIFQILRNAVEIVDFSTRAKSS 296 Query: 1144 LPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEP 965 LPTGS DV+GY++ ++RD+ A +G V LEP Sbjct: 297 LPTGSSVTDVMGYALSLIRDITACDGPN----------VDTLLRAGLIKFLIGLLRNLEP 346 Query: 964 PEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGIL 785 P +I++S R + + P S 
CPYKGF+RDIV VIGNC Y R VQDEIR+++GIL Sbjct: 347 PTLIRRSTVRADTEDDTTPRFSKYCCPYKGFRRDIVGVIGNCSYGRISVQDEIREQDGIL 406 Query: 784 LLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEV 605 L++QQCVTD+ NPFLREWGIWS+RN+LE N +N+ V E+E+QGSVD PEI G+GLRVE+ Sbjct: 407 LMLQQCVTDDDNPFLREWGIWSMRNILEGNVKNRELVVELEVQGSVDTPEIAGVGLRVEI 466 Query: 604 DQKNRRAKLVN 572 D RR KLVN Sbjct: 467 DPVTRRPKLVN 477 >gb|EYU22629.1| hypothetical protein MIMGU_mgv1a025194mg, partial [Mimulus guttatus] Length = 467 Score = 371 bits (952), Expect = e-100 Identities = 210/487 (43%), Positives = 289/487 (59%), Gaps = 6/487 (1%) Frame = -1 Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829 +N+++ L S SSTL +ALE L++ ++T GR L+ K+ + LEL + Sbjct: 1 DNVLQPLFISSGSSTLHEALERLIETAKTSDGRLSLSSKDIIKPALELCRYPLRVPHQEL 60 Query: 1828 XXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVS 1649 CAGEI NQ+ F S D EI+R+ LQ LGNVS Sbjct: 61 LLAVKLLRNL-CAGEIKNQDLFIEQNGVGILSTLVGSMCSNSGSDSEILRMVLQTLGNVS 119 Query: 1648 LAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQI 1469 LAGE+H +AVW QFFP GF+++AR++ E DPLC V++ C +ER EL +GL I Sbjct: 120 LAGEKHQEAVWAQFFPLGFIDIARVQSKETCDPLCMVIYTCSEGSNERWVELLSDQGLDI 179 Query: 1468 VAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGDFKCQDTFFTREQ 1289 + +IVRT + VGF E+W+KLL+S+IC D + Q + F E+ Sbjct: 180 IVQIVRTVTAVGFSEDWVKLLISKICFDESYFSSIFSKLSENCDENVP---QISHFGDEE 236 Query: 1288 SFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDVLG 1109 +FLLSILSE LN+++ EI VS +F+L IY IL+ V ++DF ++ K LPTGS D +G Sbjct: 237 AFLLSILSEILNERLGEIVVSTNFSLSIYQILRNAVEIVDFSTRAKLSLPTGSSVTDAMG 296 Query: 1108 YSVIILRDVCAKEG------ARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKK 947 Y++ ++RD+ A +G +R+ I+ ID+ EPP +I++ Sbjct: 297 YALSLIRDITACDGPNVDTLSRAGLIKFLIDLFRNL----------------EPPTLIRR 340 Query: 946 SISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQC 767 S + + P S CPYKGF+RDIV VIGNC Y R VQDEIR+++GILL++QQC Sbjct: 341 STGHADTENDTTPRFSKYCCPYKGFRRDIVGVIGNCSYGRISVQDEIREQDGILLMLQQC 400 Query: 
766 VTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRR 587 VTDE NPFLREWGIWS+RN+LE N +N+ V ++E+QGSVD PEI G+GLRVE+D RR Sbjct: 401 VTDEDNPFLREWGIWSMRNILEGNVKNRELVVDLEVQGSVDTPEIAGVGLRVEIDHVTRR 460 Query: 586 AKLVNIS 566 KLVN S Sbjct: 461 PKLVNAS 467
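Note on the per-hit statistics lines: each hit above reports Score, Expect, Identities, Positives, an optional Gaps field, and Frame. The following minimal Python sketch shows one way to pull those fields out of such a line and recompute the percentages. The names `STATS_RE`, `parse_evalue`, and `parse_hit_stats` are hypothetical helpers introduced here and are not part of BLAST. The sketch assumes that a bare `e-112` means 1e-112 and that percentages are truncated rather than rounded, which is consistent with the hits shown in this report (for example 225/462 is reported as 48%).

```python
import re

# Pattern for the per-hit statistics line of a legacy BLASTX text report.
# The "Gaps" field is optional because some hits in this report omit it.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_evalue(text):
    """Convert an E-value string to float; assumes 'e-112' means 1e-112."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_stats(line):
    """Return a dict of statistics for one hit, or None if nothing matches."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    length = int(m.group("length"))
    ident = int(m.group("ident"))
    pos = int(m.group("pos"))
    gaps = int(m.group("gaps")) if m.group("gaps") else 0
    return {
        "bits": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": parse_evalue(m.group("evalue")),
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        "align_length": length,
        # Integer division truncates, matching the percentages in this report.
        "pct_identity": 100 * ident // length,
        "pct_positive": 100 * pos // length,
        "frame": int(m.group("frame")),
    }


# Statistics line copied from the Cicer arietinum hit (XP_004492673.1).
example = (
    "Score = 410 bits (1055), Expect = e-112 Identities = 225/462 (48%), "
    "Positives = 290/462 (62%), Gaps = 1/462 (0%) Frame = -1"
)
stats = parse_hit_stats(example)
print(stats["pct_identity"], stats["pct_positive"], stats["frame"])
```

A negative frame, as in every hit here, indicates that the match lies on the reverse-complement strand of the query, which is why the query coordinates decrease along each alignment.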