BLASTX nr result
ID: Sinomenium21_contig00014561
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00014561
         (1896 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   488   e-135
ref|XP_007022650.1| ARM repeat superfamily protein, putative iso...   481   e-133
ref|XP_007022647.1| ARM repeat superfamily protein, putative iso...   481   e-133
ref|XP_007022651.1| ARM repeat superfamily protein, putative iso...   479   e-132
ref|XP_007022648.1| ARM repeat superfamily protein, putative iso...   479   e-132
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   461   e-127
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   459   e-126
ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prun...   459   e-126
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   459   e-126
ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     458   e-126
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   449   e-123
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   444   e-122
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   441   e-121
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       439   e-120
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   439   e-120
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           437   e-120
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   432   e-118
ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phas...   431   e-118
ref|XP_006290996.1| hypothetical protein CARUB_v10017108mg [Caps...
404 e-109 ref|NP_567156.1| protein MATERNAL EFFECT EMBRYO ARREST 50 [Arabi... 396 e-107 >ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera] Length = 494 Score = 488 bits (1255), Expect = e-135 Identities = 268/499 (53%), Positives = 343/499 (68%), Gaps = 6/499 (1%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 MED+ L + PENI++ L + SNSSTL + LE+L++ S+ GR DL N++PVVL+L Sbjct: 1 MEDAML-KFSLPENILQPLFSVSNSSTLDETLELLIEASKTPGGRLDLGSKNILPVVLQL 59 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVS-VALSSAGVCSDLDCE 1534 S+SLS S LCAGE+ NQN F+E NGV+ VS + LS G+ SD D Sbjct: 60 SQSLSYPSGHDILLLSLKLLRNLCAGEMTNQNLFIEQNGVKAVSTILLSFVGLDSDSDYG 119 Query: 1533 IVRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDE 1354 I+RMGLQ+LGNVSLAGE+H++ +W FFP GFLE+AR+ E DPLCMV++ C D E Sbjct: 120 IIRMGLQLLGNVSLAGERHQRAVWHHFFPAGFLEIARVRTLETSDPLCMVIYTCFDQSHE 179 Query: 1353 RVTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLA----- 1189 +TE+CG +GLPI+AEI+RTAS VGFE +WLKLLLS+ C Sbjct: 180 FITEICGDQGLPILAEIVRTASTVGFEEDWLKLLLSRICLEESHFPMLFSKLCPVGTSGN 239 Query: 1188 YEAREDFKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFS 1009 YE+ E FK F +EQ+FL+ I++E LN+QIN+++VS++ ALCV GI+K++ GV+D Sbjct: 240 YESIE-FKVD--VFASEQAFLMDIVAEILNEQINKMTVSSDVALCVLGILKKSAGVLDSV 296 Query: 1008 SRGKSGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXX 829 S KSG GS +I+VL YS+ IL+++CA+++ +S+ G +DVV Sbjct: 297 STCKSGFSAGSNAINVLKYSLTILKEICARDAQKSSNEHGSVDVVDLLVSSGLLELLLCL 356 Query: 828 XXXLEPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIR 649 LEPP IIRK+I Q NQ + A S S K PY+GFRRD+VAVIGNC YRRK Q+EIR Sbjct: 357 LRDLEPPAIIRKAIKQGENQ-DGAASYSPKHYPYRGFRRDLVAVIGNCAYRRKHVQNEIR 415 Query: 648 QKNGILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITEL 469 ++NGILLL+QQCVTDE N FLREWGIW VRNLLEGN EN+R VA++ELQGSVDVPEI L Sbjct: 416 ERNGILLLLQQCVTDEENQFLREWGIWCVRNLLEGNVENQRVVAELELQGSVDVPEIAGL 475 Query: 468 GLRVEVDQKNRRAKLVNTS 412 GLRVEVDQK RAKLVN S Sbjct: 
476 GLRVEVDQKTGRAKLVNVS 494 >ref|XP_007022650.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao] gi|508722278|gb|EOY14175.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao] Length = 500 Score = 481 bits (1239), Expect = e-133 Identities = 258/492 (52%), Positives = 327/492 (66%), Gaps = 2/492 (0%) Frame = -1 Query: 1896 KKMEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVL 1717 K+M LPE E +++ LL+ SNSS+L +ALEIL++VSR A R++LA N++P VL Sbjct: 11 KEMVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVL 70 Query: 1716 ELSKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDC 1537 +L +S S+R LCAGE+ NQN+F E NGVEVV L SA + S+ D Sbjct: 71 KLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDS 130 Query: 1536 EIVRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKD 1357 ++R+ LQVL NVSLAGE H++ IW +FFP F +AR+ E DPLCM+L+ CCD + Sbjct: 131 GVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRP 190 Query: 1356 ERVTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEAR 1177 V ELC GLPIV IIRT + VGF +W KLLLS+ C + Sbjct: 191 GLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSE 250 Query: 1176 EDFKCQ--DTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSR 1003 D F +EQ+FLL I+SE LN++I EI VS+EFALCV GI KR++ VVDF+SR Sbjct: 251 NSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASR 310 Query: 1002 GKSGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXX 823 G S LPTG SIDV+GYS+IILRD+CA+E K + +DVV Sbjct: 311 GMSSLPTGCTSIDVMGYSLIILRDICAREGVGDLKNDS-LDVVDMLLSHELIDILLSLLR 369 Query: 822 XLEPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQK 643 L+PP IIRK + + NQ + + K+CPYKGFRRD++AVIGNC YRRK QDEIRQK Sbjct: 370 DLDPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQK 427 Query: 642 NGILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGL 463 NGILLL+QQCVTD+ NP+LREWGIWS+RNLLEG+ EN++ VAD+ELQGSVD+PE++ LGL Sbjct: 428 NGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGL 487 Query: 462 RVEVDQKNRRAK 427 
RVEVDQK RRAK Sbjct: 488 RVEVDQKTRRAK 499 >ref|XP_007022647.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao] gi|508722275|gb|EOY14172.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao] Length = 531 Score = 481 bits (1239), Expect = e-133 Identities = 258/492 (52%), Positives = 327/492 (66%), Gaps = 2/492 (0%) Frame = -1 Query: 1896 KKMEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVL 1717 K+M LPE E +++ LL+ SNSS+L +ALEIL++VSR A R++LA N++P VL Sbjct: 11 KEMVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVL 70 Query: 1716 ELSKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDC 1537 +L +S S+R LCAGE+ NQN+F E NGVEVV L SA + S+ D Sbjct: 71 KLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDS 130 Query: 1536 EIVRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKD 1357 ++R+ LQVL NVSLAGE H++ IW +FFP F +AR+ E DPLCM+L+ CCD + Sbjct: 131 GVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRP 190 Query: 1356 ERVTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEAR 1177 V ELC GLPIV IIRT + VGF +W KLLLS+ C + Sbjct: 191 GLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSE 250 Query: 1176 EDFKCQ--DTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSR 1003 D F +EQ+FLL I+SE LN++I EI VS+EFALCV GI KR++ VVDF+SR Sbjct: 251 NSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASR 310 Query: 1002 GKSGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXX 823 G S LPTG SIDV+GYS+IILRD+CA+E K + +DVV Sbjct: 311 GMSSLPTGCTSIDVMGYSLIILRDICAREGVGDLKNDS-LDVVDMLLSHELIDILLSLLR 369 Query: 822 XLEPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQK 643 L+PP IIRK + + NQ + + K+CPYKGFRRD++AVIGNC YRRK QDEIRQK Sbjct: 370 DLDPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQK 427 Query: 642 NGILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGL 463 NGILLL+QQCVTD+ NP+LREWGIWS+RNLLEG+ EN++ VAD+ELQGSVD+PE++ LGL Sbjct: 428 NGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGL 487 Query: 462 
RVEVDQKNRRAK 427 RVEVDQK RRAK Sbjct: 488 RVEVDQKTRRAK 499 >ref|XP_007022651.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao] gi|508722279|gb|EOY14176.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao] Length = 519 Score = 479 bits (1233), Expect = e-132 Identities = 257/490 (52%), Positives = 325/490 (66%), Gaps = 2/490 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M LPE E +++ LL+ SNSS+L +ALEIL++VSR A R++LA N++P VL+L Sbjct: 1 MVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKL 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +S S+R LCAGE+ NQN+F E NGVEVV L SA + S+ D + Sbjct: 61 VESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGV 120 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +R+ LQVL NVSLAGE H++ IW +FFP F +AR+ E DPLCM+L+ CCD + Sbjct: 121 IRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGL 180 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEARED 1171 V ELC GLPIV IIRT + VGF +W KLLLS+ C + Sbjct: 181 VAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENS 240 Query: 1170 FKCQ--DTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGK 997 D F +EQ+FLL I+SE LN++I EI VS+EFALCV GI KR++ VVDF+SRG Sbjct: 241 GNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGM 300 Query: 996 SGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXL 817 S LPTG SIDV+GYS+IILRD+CA+E K + +DVV L Sbjct: 301 SSLPTGCTSIDVMGYSLIILRDICAREGVGDLKNDS-LDVVDMLLSHELIDILLSLLRDL 359 Query: 816 EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 +PP IIRK + + NQ + + K+CPYKGFRRD++AVIGNC YRRK QDEIRQKNG Sbjct: 360 DPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNG 417 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 ILLL+QQCVTD+ NP+LREWGIWS+RNLLEG+ EN++ VAD+ELQGSVD+PE++ LGLRV Sbjct: 418 ILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRV 477 
Query: 456 EVDQKNRRAK 427 EVDQK RRAK Sbjct: 478 EVDQKTRRAK 487 >ref|XP_007022648.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|590613384|ref|XP_007022649.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|590613394|ref|XP_007022652.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722276|gb|EOY14173.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722277|gb|EOY14174.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] gi|508722280|gb|EOY14177.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao] Length = 488 Score = 479 bits (1233), Expect = e-132 Identities = 257/490 (52%), Positives = 325/490 (66%), Gaps = 2/490 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M LPE E +++ LL+ SNSS+L +ALEIL++VSR A R++LA N++P VL+L Sbjct: 1 MVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKL 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +S S+R LCAGE+ NQN+F E NGVEVV L SA + S+ D + Sbjct: 61 VESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGV 120 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +R+ LQVL NVSLAGE H++ IW +FFP F +AR+ E DPLCM+L+ CCD + Sbjct: 121 IRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGL 180 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEARED 1171 V ELC GLPIV IIRT + VGF +W KLLLS+ C + Sbjct: 181 VAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENS 240 Query: 1170 FKCQ--DTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGK 997 D F +EQ+FLL I+SE LN++I EI VS+EFALCV GI KR++ VVDF+SRG Sbjct: 241 GNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGM 300 Query: 996 SGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXL 817 S LPTG SIDV+GYS+IILRD+CA+E K + +DVV L Sbjct: 301 SSLPTGCTSIDVMGYSLIILRDICAREGVGDLKNDS-LDVVDMLLSHELIDILLSLLRDL 359 Query: 816 
EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 +PP IIRK + + NQ + + K+CPYKGFRRD++AVIGNC YRRK QDEIRQKNG Sbjct: 360 DPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNG 417 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 ILLL+QQCVTD+ NP+LREWGIWS+RNLLEG+ EN++ VAD+ELQGSVD+PE++ LGLRV Sbjct: 418 ILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRV 477 Query: 456 EVDQKNRRAK 427 EVDQK RRAK Sbjct: 478 EVDQKTRRAK 487 >ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum] gi|460373805|ref|XP_004232704.1| PREDICTED: ataxin-10-like isoform 2 [Solanum lycopersicum] Length = 501 Score = 461 bits (1186), Expect = e-127 Identities = 251/497 (50%), Positives = 321/497 (64%), Gaps = 4/497 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M+D + EL PEN+ + LL SNSS+L AL+ L+Q+S+ GR DL+ NVV VL L Sbjct: 8 MDDQIVSELTIPENVAKELLLVSNSSSLETALDKLIQLSKEGGGRLDLSSKNVVTTVLHL 67 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +SLS+ S R LCAGEI NQN F++ GVE+V + S G+ D DC I Sbjct: 68 CQSLSSISYRNLLLLSLKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLSPDPDCMI 127 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +R+GLQ+LGN S+ G + + +W + FP FL++AR+ EI DPLCMV++ CCDG D Sbjct: 128 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGL 187 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEARED 1171 +T+LC +GLPI+ EI+RTAS VG + WLKLLLS+ C +Y + ED Sbjct: 188 LTDLCSEQGLPILFEILRTASAVGLKEVWLKLLLSKLC-IEGSHISSIFFKLHSYPSVED 246 Query: 1170 FKCQDTF---FTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRG 1000 F EQ +LLSILSE LN+++ I VS++FA ++GI+K A GVVDFS RG Sbjct: 247 NGVVTHVADQFVIEQPYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFSIRG 306 Query: 999 KSGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXX 820 KS LP GS IDVLGYS+ ++RD+CA + S+K E DVV Sbjct: 307 KSDLPVGSAPIDVLGYSLTLMRDICASDHLSSSKEESSKDVVDVLVSSGLIEFLLNLLRD 366 Query: 819 
LEPPEIIRKSISQSPNQV-EHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQK 643 LEPP IR ++ P+Q+ E S + CPY+GFRRDIVA++GNC YRR+ QDEIR K Sbjct: 367 LEPPTTIRNAM--KPDQIKEGTIPSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDK 424 Query: 642 NGILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGL 463 NGILLL+QQCV DE NPFLREWGIW VRNLLEGN EN+ + D+ELQG+VDVPE+ LGL Sbjct: 425 NGILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGL 484 Query: 462 RVEVDQKNRRAKLVNTS 412 RVEVD RR KLVN+S Sbjct: 485 RVEVDPVTRRTKLVNSS 501 >ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858312|ref|XP_006421839.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858314|ref|XP_006421840.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|567858316|ref|XP_006421841.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|568874427|ref|XP_006490317.1| PREDICTED: ataxin-10-like isoform X1 [Citrus sinensis] gi|568874429|ref|XP_006490318.1| PREDICTED: ataxin-10-like isoform X2 [Citrus sinensis] gi|557523711|gb|ESR35078.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523712|gb|ESR35079.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523713|gb|ESR35080.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] gi|557523714|gb|ESR35081.1| hypothetical protein CICLE_v10004825mg [Citrus clementina] Length = 497 Score = 459 bits (1181), Expect = e-126 Identities = 246/493 (49%), Positives = 324/493 (65%), Gaps = 2/493 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M+D+ ++ E++++ LL SNSS+L DALEIL++ S+ GRSDLA N++P VL+L Sbjct: 1 MDDASSLDISLSEDVLQPLLTTSNSSSLKDALEILIESSKTTVGRSDLASKNILPEVLQL 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 ++S+ ++S LCAGEI NQ SF+E GV +V L S GV D D I Sbjct: 61 TQSIPHSSGCHYLLLSLKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLDKDYGI 120 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +R+ LQVL NVSLAGE 
H+ IW +FFP F +A + E DPLCMV++ CCDG Sbjct: 121 IRIALQVLANVSLAGETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGL 180 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEAR-- 1177 ELCG +GL I+AEI+ TA+ VGF+ +W K L+S+ C +R Sbjct: 181 FKELCGDKGLAIMAEIVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNC 240 Query: 1176 EDFKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGK 997 ED ++ F++EQ+FLL I+SE +N++I EI V N+FAL V GI ++IG+VDF +RG Sbjct: 241 EDSNSREGTFSSEQAFLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGT 300 Query: 996 SGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXL 817 LPT S +I+VLGYS+ ILR++CA+E + D+V L Sbjct: 301 PSLPTSSSAINVLGYSLSILRNICAREDPAGSSSVNRADLVDSLQSHGLIEMFLSLLRDL 360 Query: 816 EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 EPP IIRK++ Q NQ E + S K CPY GFRRD+VAVIGNC YRRK QDEIR+++G Sbjct: 361 EPPAIIRKAMRQGENQ-EGTSAKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERDG 419 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 ILLL+QQCVTDE NPF REWGIW VRNLLEGN EN++ VAD+ELQGS++VPE+T+LGL+V Sbjct: 420 ILLLLQQCVTDEDNPFSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLKV 479 Query: 456 EVDQKNRRAKLVN 418 EVD+ RRAKLVN Sbjct: 480 EVDKNTRRAKLVN 492 >ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica] gi|462415516|gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica] Length = 492 Score = 459 bits (1181), Expect = e-126 Identities = 248/494 (50%), Positives = 328/494 (66%), Gaps = 1/494 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M+ + L E PE++++ LL+ SNSSTL D+LE L+QV RAA GR+DLA +++P V++L Sbjct: 1 MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +SL S R LCAGE+ NQ SF+E +GV ++S L+SA + + D + Sbjct: 61 IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +RMGLQVL NVSLAGE+H+ IW + 
FP FL +AR+ E DPLCMV+ CCDG E Sbjct: 121 IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAY-EARE 1174 +LCG G+ I+ EI+RT + VGF +W+KLLLS+ C A E E Sbjct: 181 FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240 Query: 1173 DFKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKS 994 D + ++ F+++Q+F L I+S+ LN+++ EI+V +FALCV+GI K+++G ++ +RG+S Sbjct: 241 DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300 Query: 993 GLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLE 814 GLPTG+ IDVLGYS+ ILRDVCA+++ + E D V LE Sbjct: 301 GLPTGTSMIDVLGYSLTILRDVCAQKTLRGFQ-EDLGDAVDVLLSHGLIELILCLLRDLE 359 Query: 813 PPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGI 634 PP IIRK+I Q Q + S S K CPYKGFRRDIVAVIGNC Y+RK QDEIRQ++GI Sbjct: 360 PPAIIRKAIKQGEGQ-DGTNSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDGI 418 Query: 633 LLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRVE 454 LLL+QQC DE NPFL+EWGIW VRNLLEGNE+NKR V ++ELQGSVD PEI LG RVE Sbjct: 419 LLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRVE 478 Query: 453 VDQKNRRAKLVNTS 412 V+ + R KLVN S Sbjct: 479 VNPETGRPKLVNVS 492 >ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis] gi|223548954|gb|EEF50443.1| conserved hypothetical protein [Ricinus communis] Length = 497 Score = 459 bits (1181), Expect = e-126 Identities = 252/488 (51%), Positives = 321/488 (65%), Gaps = 2/488 (0%) Frame = -1 Query: 1869 ELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLELSKSLSNA 1690 EL PE++++ L S S L +ALEIL++ SR GR++LA +V+P+VL+L KS+S Sbjct: 2 ELFLPEDLLQLLFRASKSYDLKEALEILIETSRIDDGRANLAAKDVLPLVLKLFKSISYP 61 Query: 1689 SNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEIVRMGLQV 1510 S LCAGEI NQN FV NG E+VS L SAG+ + D I+R+GLQV Sbjct: 62 SGDQFLTLSLKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEPDYGIIRLGLQV 121 Query: 1509 LGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDERVTELCGF 1330 L NVSLAGEKH++ IW FFP F+ +A+ DPLCM+++ CCDG 
V ELCG Sbjct: 122 LANVSLAGEKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCGD 181 Query: 1329 RGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEAR--EDFKCQD 1156 RGL +VAEI+RTAS VG+ +W KLLLS+ C A ++ E Sbjct: 182 RGLAVVAEIVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSSS 241 Query: 1155 TFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKSGLPTGS 976 F+TEQ++LLS +SE LN+++ +ISVS +FA V+GI KR++GVVDF SRG SGLPTGS Sbjct: 242 DLFSTEQAYLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTGS 301 Query: 975 PSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLEPPEIIR 796 ++DVLGYS+ ILRD CA + +DVV LEPP +I+ Sbjct: 302 AAVDVLGYSLTILRDTCALHG--KGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIK 359 Query: 795 KSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGILLLMQQ 616 K++ Q+ N E A S S K CPYKGFRRDIVAVIGNC ++R QDEIRQK+ I LL+QQ Sbjct: 360 KAMKQNENH-EPASSRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQQ 418 Query: 615 CVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRVEVDQKNR 436 CVTDE NPFLREWG+W VRNLLEGN EN++ VA++ELQG+V VPE++ LGLRVEVD R Sbjct: 419 CVTDEDNPFLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNTR 478 Query: 435 RAKLVNTS 412 RA+LVN S Sbjct: 479 RARLVNVS 486 >ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum] Length = 501 Score = 458 bits (1179), Expect = e-126 Identities = 247/495 (49%), Positives = 316/495 (63%), Gaps = 2/495 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 ++D + EL PEN+ + LL SNSS+L ALE L+++++ GR DL+ NVV VL L Sbjct: 8 VDDQIVAELTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 67 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +SLS+ S R LCAGEI+NQN F++ GVE+V + S G+ D DC I Sbjct: 68 CQSLSSISYRYLLLLSLKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLTPDPDCMI 127 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +R+GLQ+LGN S+ G + + +W + FP FL++AR+ EI DPLCMV++ CCDG D Sbjct: 128 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGL 187 Query: 1350 
VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEARED 1171 +T+LC +GLPI+ EI+RTAS VG + WLKLLLS+ C + Sbjct: 188 LTDLCSEKGLPILIEILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPSVENN 247 Query: 1170 FKCQDTF--FTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGK 997 F EQS+LLS LSE LN+++ I VS++FA ++GI+K A GV DFS RGK Sbjct: 248 GVVTHVVDQFVIEQSYLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFSIRGK 307 Query: 996 SGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXL 817 S LP GS IDVLGYS+ ILRD+CA + S+K E DVV L Sbjct: 308 SDLPVGSAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 367 Query: 816 EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 EPP IRK++ Q + E S S + CPY+GFRRDIVA++GNC YRR+ QDEIR KNG Sbjct: 368 EPPTTIRKAMKQDQIK-EGTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNG 426 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 ILLL+QQCV DE NPFLREWGIW VRNLLEGN EN+ + D+ELQG+VDVPE+ LGLRV Sbjct: 427 ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 486 Query: 456 EVDQKNRRAKLVNTS 412 EVD R KLVN+S Sbjct: 487 EVDPVTRHTKLVNSS 501 >ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum] gi|565401994|ref|XP_006366477.1| PREDICTED: ataxin-10-like isoform X2 [Solanum tuberosum] gi|565401996|ref|XP_006366478.1| PREDICTED: ataxin-10-like isoform X3 [Solanum tuberosum] gi|565401998|ref|XP_006366479.1| PREDICTED: ataxin-10-like isoform X4 [Solanum tuberosum] gi|565402000|ref|XP_006366480.1| PREDICTED: ataxin-10-like isoform X5 [Solanum tuberosum] Length = 504 Score = 449 bits (1156), Expect = e-123 Identities = 247/495 (49%), Positives = 314/495 (63%), Gaps = 2/495 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 ++D + E+ PEN+ + LL SNSS+L ALE L+++++ GR DL+ NVV VL L Sbjct: 11 VDDKIVAEVTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 70 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +SLS+ S R LCAGEI NQN F++ GVE+V ++S G+ D DC I Sbjct: 71 
CQSLSSISYRQLLLSSLKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLTPDPDCMI 130 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 +R+GLQ+LGN S+ G + + +W + FP FL++AR+ EI DPLCMV++ CCDG D Sbjct: 131 IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGL 190 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEARED 1171 +T+LC +GLPI+ EI+RTAS V + WLKLLLS+ C + + Sbjct: 191 LTDLCSEQGLPILIEILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNN 250 Query: 1170 FKCQDTF--FTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGK 997 F EQ +LLSILSE +N QI I VS++FAL ++GI+K A VVDFS RGK Sbjct: 251 GVVTHATDQFVIEQPYLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGK 310 Query: 996 SGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXL 817 S LP G IDVLGYS+ ILRD+CA + S+K E DVV L Sbjct: 311 SDLPVGFAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 370 Query: 816 EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 EPP IRK++ Q E S S + CPY+GFRRDIV++IGNC YRR+ QDEIR KNG Sbjct: 371 EPPTTIRKAMKQD-QITEGIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKNG 429 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 ILLL+QQCV DE NPFLREWGIW VRNLLEGN EN+ + D+ELQG+VDVPE+ LGLRV Sbjct: 430 ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 489 Query: 456 EVDQKNRRAKLVNTS 412 EVD RR KLVN S Sbjct: 490 EVDPVTRRTKLVNAS 504 >ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa] gi|222861524|gb|EEE99066.1| ataxin-related family protein [Populus trichocarpa] Length = 496 Score = 444 bits (1141), Expect = e-122 Identities = 248/499 (49%), Positives = 319/499 (63%), Gaps = 6/499 (1%) Frame = -1 Query: 1890 MEDSPLPELCAPEN-IIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLE 1714 M + L EL P+N +E L S SS L + LEIL+ +++ GR+DLA N++PVVL+ Sbjct: 1 MGGASLTELSFPQNDFLEPLFTASKSSDLKETLEILIAIAKTDDGRADLASKNILPVVLQ 60 Query: 1713 LSKSLSNAS-NRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCS-DLD 1540 L L N + LCAGE+ NQ SF++ NGV + L S V S + D Sbjct: 61 
LITHLLNDPFDHEYLSLSLRLMRNLCAGEVANQKSFIQLNGVGIFLTVLRSKKVASSEPD 120 Query: 1539 CEIVRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGK 1360 I+RMGLQVL NVSLAG++H++ IW F +A++ DPLCM+++ CCDG Sbjct: 121 HGIIRMGLQVLANVSLAGKEHQQAIWGGLFHDELYMLAKVRSQGTCDPLCMIIYACCDGS 180 Query: 1359 DERVTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAY-- 1186 E V +LCG +GLPIV EIIRTAS VGF WLKLLLS+ C Sbjct: 181 PELVLQLCGNQGLPIVVEIIRTASLVGFGEEWLKLLLSRICLEDIYFPQLFSRIYSVCSY 240 Query: 1185 -EAREDFKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFS 1009 E E+ F TEQ++LL+I+SE LN+++ EI++ N+FALC++GI K+++ +F Sbjct: 241 CENGEEISLSSNPFFTEQAYLLNIVSEILNERLKEITILNDFALCIFGIFKKSVEAFEFG 300 Query: 1008 SRGKSGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXX 829 SR +S LPTG IDVLGYS+ ILRD+CA E +DVV Sbjct: 301 SRAESRLPTGFAVIDVLGYSLTILRDICANNGGVGK--EDLVDVVDSLLSSGLLDLLLCL 358 Query: 828 XXXLEPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIR 649 LEPP+IIRK+++Q+ NQ E S KVCPYKGFRRD+VAVIGNC YRRK QD+IR Sbjct: 359 LRDLEPPKIIRKAMNQAGNQ-EATTSYFPKVCPYKGFRRDLVAVIGNCAYRRKHVQDDIR 417 Query: 648 QKNGILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITEL 469 QKNG+LL++QQCVTDE NPFLREWGIWS+RNLLEGN EN++ VA++ELQGSVD+PE+ L Sbjct: 418 QKNGMLLMLQQCVTDEDNPFLREWGIWSMRNLLEGNSENQQAVAELELQGSVDMPELAGL 477 Query: 468 GLRVEVDQKNRRAKLVNTS 412 GL+VEVDQ R AKLVN S Sbjct: 478 GLKVEVDQNTRSAKLVNIS 496 >ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca subsp. 
vesca] Length = 492 Score = 441 bits (1134), Expect = e-121 Identities = 241/495 (48%), Positives = 326/495 (65%), Gaps = 2/495 (0%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M+++ LPE PE++++ LL+ SNSS L D+LE LVQV + A GR DL+ NV+P V++L Sbjct: 1 MDNTTLPECSVPEHVLQALLSVSNSSKLVDSLEDLVQVCKTADGREDLSAKNVLPTVIQL 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +SLS S+ LCAGE+ NQNSFVE NGV ++S LSSA D I Sbjct: 61 VQSLSYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIISNILSSASSLEP-DFGI 119 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 + +GLQVL NV+LAGE+ + IW + F F+ +AR+ + PLCM+++ CCDG E Sbjct: 120 ICVGLQVLANVALAGERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPEL 179 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLA--YEAR 1177 V +LCG G+ IV EI++TA+ GF +W KLLLS+ C E Sbjct: 180 VAQLCGDCGVTIVKEIVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENG 239 Query: 1176 EDFKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGK 997 +D + F EQ FLL +SE LN+++NEI+V ++FALCV+GI K +I V+ +++RG+ Sbjct: 240 DDTEGGQESFLEEQEFLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGR 299 Query: 996 SGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXL 817 SGLPTGS IDVLGYS+ ILRD+CA+ + ++ +DVV L Sbjct: 300 SGLPTGSIDIDVLGYSLTILRDICAQGTLRGCTVD-TMDVVDALISYGLIELLLCLLRDL 358 Query: 816 EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 EPP II+KS++Q+ +Q E + + K CPYKGFRRDIV VIGNCLY R+ QDEIR+K+G Sbjct: 359 EPPAIIKKSVNQAKDQ-EGSNYSASKPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKDG 417 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 +LLL+QQCVTD+ NP+LREWGIW VRNLLE N+EN++ VA++ELQGSVDVP++ LGLRV Sbjct: 418 LLLLLQQCVTDDDNPYLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLRV 477 Query: 456 EVDQKNRRAKLVNTS 412 E++ R KLVN S Sbjct: 478 EMNPATGRPKLVNIS 492 >ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum] Length = 468 Score = 439 bits (1130), Expect = e-120 Identities = 241/465 (51%), Positives = 
305/465 (65%), Gaps = 1/465 (0%) Frame = -1 Query: 1809 LSDALEILVQVSRAARGRSDLAFNNVVPVVLELSKSLSNASNRGNXXXXXXXXXXLCAGE 1630 LSD LE L+ S++ GRS+LA V+P VL + S + + LCAGE Sbjct: 6 LSD-LENLIHTSKSDSGRSNLASKRVLPAVLNILNSQTLPLDHNLLSLCFKLLRNLCAGE 64 Query: 1629 ILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEIVRMGLQVLGNVSLAGEKHRKVIWDRFF 1450 NQN F+E +GV VVS L S D +VR GLQVL NV LAG++H+K IW+ F Sbjct: 65 FENQNLFLEFDGVVVVSSILMSEAGSLRPDHMLVRWGLQVLANVCLAGKQHQKAIWEEIF 124 Query: 1449 PGGFLEVARILRSEIVDPLCMVLHNCCDGKDERVTELCGFRGLPIVAEIIRTASEVGFEG 1270 P GF+ +AR+ EI DPLCMV++ CCDG E ELC GLP+VAEI++TAS F Sbjct: 125 PLGFVSLARLGTKEICDPLCMVIYTCCDGNHECFGELCSDSGLPVVAEIVKTASSASFGE 184 Query: 1269 NWLKLLLSQQCXXXXXXXXXXXXXXLA-YEAREDFKCQDTFFTTEQSFLLSILSENLNQQ 1093 +W+KLLLS+ C ED +D F+ EQ+FLL ILSE LN++ Sbjct: 185 DWIKLLLSRICLEESQLPMLFPKLRFMDIPEGEDIDSKDYQFSFEQAFLLQILSEILNER 244 Query: 1092 INEISVSNEFALCVYGIMKRAIGVVDFSSRGKSGLPTGSPSIDVLGYSVIILRDVCAKES 913 + ++ VS + AL VYG+ K+++GV++ + RGKSGLP+GS ++D LGYS+ ILRD+CA +S Sbjct: 245 LRDVVVSKDVALFVYGVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSLTILRDICAHDS 304 Query: 912 AESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLEPPEIIRKSISQSPNQVEHACSDSLKVC 733 E DVV LEPP IIRK I QS NQ +CS K C Sbjct: 305 VRGNP-EDTNDVVDVLLSQDIIELLLILLGDLEPPAIIRKGIKQSENQEGASCSS--KPC 361 Query: 732 PYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGILLLMQQCVTDESNPFLREWGIWSVRNL 553 PYKGFRRDIV++IGNC+YRRK AQDEIR +NGILLL+QQCVTDE NPFLREWGIWSVRN+ Sbjct: 362 PYKGFRRDIVSLIGNCVYRRKHAQDEIRGRNGILLLLQQCVTDEDNPFLREWGIWSVRNM 421 Query: 552 LEGNEENKREVADMELQGSVDVPEITELGLRVEVDQKNRRAKLVN 418 LEGNEEN++ V++++LQGS DVP+I+ LGLR+EVDQK RRAKLVN Sbjct: 422 LEGNEENQKVVSELQLQGSADVPQISALGLRIEVDQKTRRAKLVN 466 >ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. 
vesca] Length = 490 Score = 439 bits (1130), Expect = e-120 Identities = 241/493 (48%), Positives = 323/493 (65%) Frame = -1 Query: 1890 MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M+++ LPE PE++I+ LL+ SNSS L +++E L+QV + A GR DLA NV+P V++L Sbjct: 1 MDNTALPECSVPEDVIQALLSVSNSSNLVESMEDLIQVCKTADGREDLAAKNVLPTVIQL 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEI 1531 +SL S+ LCAGE+ NQNSFVE NGV +VS LSSA + + D I Sbjct: 61 VQSLLYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSA-ISLEPDFWI 119 Query: 1530 VRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDER 1351 + +GLQVL N +LAGE+ + IW + F F+ +AR+ + PLCM++ CCDG E Sbjct: 120 ICVGLQVLANAALAGERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPEL 179 Query: 1350 VTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEARED 1171 V +LCG G+ I+ EI++TA+ V F +W KLLLS+ C E ED Sbjct: 180 VAQLCGDCGVTILKEIVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVGENAED 239 Query: 1170 FKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKSG 991 + F+ EQ FLL +SE LN+ ++EI+V N+FALCV+GI K +I V+ +++RG+SG Sbjct: 240 TEGGRESFSKEQEFLLKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGRSG 299 Query: 990 LPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLEP 811 LPTGS IDVLGYS+ ILRD CA+ + + + +DVV LEP Sbjct: 300 LPTGSIDIDVLGYSLTILRDTCAQGTLRGST-KDTMDVVDALISYGLIELLLSLLRDLEP 358 Query: 810 PEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGIL 631 P II+KSI+Q+ NQ E + S +LK CPYKGFRRDIVAVIGNCLY RK QDEIR+K+G+L Sbjct: 359 PAIIKKSINQAENQ-EGSSSSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGLL 417 Query: 630 LLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRVEV 451 LL+QQCV D+ NP+ REWGIW RNLL+ N+EN+R VA++EL+GSVDVP + LGLRVE+ Sbjct: 418 LLLQQCVIDDDNPYSREWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVEM 477 Query: 450 DQKNRRAKLVNTS 412 + R KLVN S Sbjct: 478 NLATGRPKLVNIS 490 >ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max] Length = 498 Score = 437 bits (1125), Expect = e-120 Identities = 241/486 (49%), Positives = 313/486 (64%), Gaps = 
7/486 (1%) Frame = -1 Query: 1854 ENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLELSKSLSNASNRGN 1675 E+ ++ L SNSS + +LEIL+Q +++ GR +LA ++P VL + SL++AS+ + Sbjct: 14 EDTLQLLFEASNSSNMEKSLEILIQNAKSDSGRLELASKRILPAVLNIVHSLTHASHHHH 73 Query: 1674 XXXXXXXXXXL------CAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEIVRMGLQ 1513 CAGE NQ+SF+E +GV VV L S CS D +VR GLQ Sbjct: 74 HQHNHILCLSFKLLRNLCAGEAANQDSFLELDGVAVVCSVLRSEAACSGPDHGLVRWGLQ 133 Query: 1512 VLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDERVTELCG 1333 VL NVSLAG++H+ IW + GF+ +AR+ E DPLCMV++ CCDG E L Sbjct: 134 VLANVSLAGKQHQCAIWKELYLDGFVSLARLHTKETCDPLCMVIYTCCDGNPEWFKRLSS 193 Query: 1332 FRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEAR-EDFKCQD 1156 G ++AEI+RTAS F +WLKLLLS+ C A + E + +D Sbjct: 194 EDGWFVMAEIVRTASSASFGEDWLKLLLSRICLEESQLPVLFSKLQFADVPKVEVAESKD 253 Query: 1155 TFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKSGLPTGS 976 F+ EQ+FLL ILSE LN++ +++VS + AL V+GI K +IGV++ ++RGKSGLP+G Sbjct: 254 DHFSFEQAFLLRILSEILNERHKDVTVSKDVALFVFGIFKNSIGVLEHATRGKSGLPSGF 313 Query: 975 PSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLEPPEIIR 796 +DVLGYS+ ILRD+CA++ E DVV LEPP IIR Sbjct: 314 VGVDVLGYSLTILRDICAQDGVRG-NTEDSNDVVDALLSYGLIELLLYLLEALEPPAIIR 372 Query: 795 KSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGILLLMQQ 616 K + Q NQ +CS K CPYKGFRRDIVA+IGNC+YRRK AQDEIR +NGILLL+QQ Sbjct: 373 KGLKQCENQDGASCS--FKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQ 430 Query: 615 CVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRVEVDQKNR 436 CVTDE NPFLREWGIWSVRN+LEGN+EN++ VA++E+QGS DVPEIT LGLRVEVDQ+ R Sbjct: 431 CVTDEDNPFLREWGIWSVRNMLEGNDENQKVVAELEIQGSADVPEITSLGLRVEVDQRTR 490 Query: 435 RAKLVN 418 RAKLVN Sbjct: 491 RAKLVN 496 >ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10 [Medicago truncatula] Length = 491 Score = 432 bits (1112), Expect = e-118 Identities = 235/492 (47%), Positives = 314/492 (63%), Gaps = 1/492 (0%) Frame = -1 Query: 1884 
DSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLELSK 1705 D+P + + L SNS+TL +LE L++ S++ RS A ++P +L + Sbjct: 6 DAPFSNHPISQQSLNSLFDLSNSTTLQTSLETLIESSKSTSNRSLYACKKILPTILTV-- 63 Query: 1704 SLSNASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEIVR 1525 L + + LCAGEILNQN F+E +GV +V ++ + V D +VR Sbjct: 64 -LHSPPSLHILSLCFKLLRNLCAGEILNQNMFLENDGVFIVVSSILRSEVVGS-DYMLVR 121 Query: 1524 MGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDERVT 1345 GLQVL NV LAG++H+K +WD FP GFL VARI + E+ DPLCMV++ CCDG D+ + Sbjct: 122 WGLQVLANVCLAGKEHQKAVWDEMFPVGFLSVARIGKKEVNDPLCMVIYTCCDGNDQWFS 181 Query: 1344 ELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLA-YEAREDF 1168 E+C G ++ EI+RTAS F +W+KLLLS+ C ED Sbjct: 182 EVCSDGGWNVLVEIVRTASSASFGEDWIKLLLSRICLEDSQLRVLFSKLRFMDIPDGEDT 241 Query: 1167 KCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKSGL 988 K +D F++EQ+FLL I+S+ LN++I ++++S E A VYGI K++IGV++ + RGKSGL Sbjct: 242 KTKDDQFSSEQAFLLQIISDILNERIGDVTISLEVASFVYGIFKKSIGVLEHAVRGKSGL 301 Query: 987 PTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLEPP 808 P+G +DVLGYS+ +LRD+CA +S + +VV LEPP Sbjct: 302 PSGITDVDVLGYSLTMLRDICAHDSVRGNSED--TEVVDMLLSYGLIELVFILLGDLEPP 359 Query: 807 EIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGILL 628 IIRK + S N S S K CPYKGFRRDIVA+IGNC+YRRK QDEIR +NGILL Sbjct: 360 TIIRKGMKHSENP--DGASSSSKPCPYKGFRRDIVALIGNCVYRRKHVQDEIRSRNGILL 417 Query: 627 LMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRVEVD 448 L+QQCVTDE NP+LREWGIW VRN+LEGNEEN++E+++++LQGS DVPEI+ LGLRVEVD Sbjct: 418 LLQQCVTDEDNPYLREWGIWCVRNMLEGNEENQKEISELQLQGSADVPEISALGLRVEVD 477 Query: 447 QKNRRAKLVNTS 412 QK RRAKLVN S Sbjct: 478 QKTRRAKLVNVS 489 >ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris] gi|561021998|gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris] Length = 498 Score = 431 bits (1107), Expect = e-118 Identities = 240/496 (48%), Positives = 314/496 (63%), Gaps = 5/496 (1%) Frame = -1 Query: 1890 
MEDSPLPELCAPENIIERLLAWSNSSTLSDALEILVQVSRAARGRSDLAFNNVVPVVLEL 1711 M D+ E E+ ++ L SNSS L +LEIL+Q +++ GR +LA ++P VL + Sbjct: 1 MIDTTFLEHPISEDTLQLLFQASNSSNLEKSLEILIQNAKSDSGRLELASKRILPAVLNI 60 Query: 1710 SKSLSNASNRGNXXXXXXXXXXL----CAGEILNQNSFVEGNGVEVVSVALSSAGVCSDL 1543 +SL+ AS+ + L CAGE NQ SF+E NGV VV L S Sbjct: 61 VQSLAQASHHHHHNQTFSLCFKLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGP 120 Query: 1542 DCEIVRMGLQVLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDG 1363 D +VR GLQVL NVSL G++H++ IW+ +P GF +AR+ EI DPLCMV++ CCDG Sbjct: 121 DHRLVRWGLQVLANVSLGGKQHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDG 180 Query: 1362 KDERVTELCGFRGLPIVAEIIRTASEVGFEGNWLKLLLSQQCXXXXXXXXXXXXXXLA-Y 1186 E +L G P+VAEI+RTAS F+ +WLKLLLS+ Sbjct: 181 NPEWFKKLSSDDGWPVVAEIVRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVDV 240 Query: 1185 EAREDFKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSS 1006 E + ++ F+ EQ+FLL ILSE LN+++ +++VS + AL V+GI K++IGV++ + Sbjct: 241 PEGEVIESKNGQFSFEQAFLLQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAM 300 Query: 1005 RGKSGLPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGPIDVVXXXXXXXXXXXXXXXX 826 RGKSGLP+G +DVLGYS+ ILRD+CA++ DVV Sbjct: 301 RGKSGLPSGFTGVDVLGYSLTILRDICAQDGMRG----NTKDVVDVLLSYGLIEFLLSLL 356 Query: 825 XXLEPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQ 646 LEPP IIRK + Q NQ +C K CPYKGFRRDIVA+IGNC+YRRK AQDEIR Sbjct: 357 GALEPPAIIRKGLKQIENQDNASCCS--KPCPYKGFRRDIVALIGNCVYRRKHAQDEIRD 414 Query: 645 KNGILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELG 466 +NGILLL+QQCVTDE NPFLREWGIWSVRN+LEGN+EN++ VA++E+QGS DVPEI LG Sbjct: 415 RNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINALG 474 Query: 465 LRVEVDQKNRRAKLVN 418 L+VEVDQ+ RR KLVN Sbjct: 475 LQVEVDQRTRRPKLVN 490 >ref|XP_006290996.1| hypothetical protein CARUB_v10017108mg [Capsella rubella] gi|482559703|gb|EOA23894.1| hypothetical protein CARUB_v10017108mg [Capsella rubella] Length = 488 Score = 404 bits (1037), Expect = e-109 Identities = 223/493 (45%), Positives = 305/493 (61%), Gaps = 9/493 (1%) Frame = -1 Query: 1869 
ELCAPENIIERLLAWSNSS-TLSDALEILVQVSRAARGRSDLAFNNVVPVVLELSKSLSN 1693 E P+ +++ LL S S +L D L+ L + S+ GRSDLA ++P +L L + L Sbjct: 2 EASVPDEVLQPLLQASGLSYSLEDCLKFLQESSKTDSGRSDLASKAILPYILRLLQILPY 61 Query: 1692 ASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEIVRMGLQ 1513 S+R LCAGE+ NQNSFV+ +G +VS L SA D E VR GLQ Sbjct: 62 PSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHDGSVIVSDLLDSAIAEKTADFETVRFGLQ 121 Query: 1512 VLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDERVTELCG 1333 L NV L EK ++ +W RFFP F +A+I R E DPLCM+L+ C DG E +E+C Sbjct: 122 ALANVVLFCEKRQRDVWMRFFPERFFSIAKIRRFETCDPLCMILYACFDGSSEIASEVCS 181 Query: 1332 FRGLPIVAEIIRTASEVG-FEGNWLKLLLSQQCXXXXXXXXXXXXXXLA-----YEARED 1171 GL IVAE IRT+S VG E WLKL++S+ C ++ ED Sbjct: 182 SDGLSIVAEAIRTSSSVGSVEDYWLKLMVSRMCVEDHCFPQLFSKLYKVDLVLGHKDDED 241 Query: 1170 FKCQDTFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKSG 991 +TFFT+EQ+FLL ++S+ N++I ++S+ + + G+ K+++ V DF+S +S Sbjct: 242 ----ETFFTSEQAFLLRMVSDIANERIGKVSIPKDTTSSILGLFKQSVAVFDFASSQRSE 297 Query: 990 LPTGSPSIDVLGYSVIILRDVCAKESAESAKIEGP--IDVVXXXXXXXXXXXXXXXXXXL 817 LPTGS +DV+GYS++I+RD CA S E K + +D V L Sbjct: 298 LPTGSTIVDVMGYSLVIIRDACAGGSLEELKNDNKDSVDTVELLLSSGLIHLLLDLLRKL 357 Query: 816 EPPEIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNG 637 +PP I+K+++QSP+ + S SLK CPY+GFRRDIV+VIGNC YRRK QDEIR+++G Sbjct: 358 DPPTTIKKALNQSPS----SSSSSLKPCPYRGFRRDIVSVIGNCAYRRKEVQDEIRERDG 413 Query: 636 ILLLMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRV 457 + L++QQCVTD+ NPFLREWG+W VRNLLEGN EN+ VAD+E+QGSVDVP++ E+GLRV Sbjct: 414 LFLMLQQCVTDDENPFLREWGLWCVRNLLEGNPENQEVVADLEIQGSVDVPQLREIGLRV 473 Query: 456 EVDQKNRRAKLVN 418 E+D K R KLVN Sbjct: 474 EIDPKTSRPKLVN 486 >ref|NP_567156.1| protein MATERNAL EFFECT EMBRYO ARREST 50 [Arabidopsis thaliana] gi|3193319|gb|AAC19301.1| contains similarity to mouse brain protein E46 (GB:X61506) [Arabidopsis thaliana] gi|26451586|dbj|BAC42890.1| unknown protein [Arabidopsis thaliana] gi|28973257|gb|AAO63953.1| unknown protein [Arabidopsis thaliana] 
gi|332656441|gb|AEE81841.1| maternal effect embryo arrest 50 protein [Arabidopsis thaliana] Length = 475 Score = 396 bits (1017), Expect = e-107 Identities = 220/490 (44%), Positives = 309/490 (63%), Gaps = 6/490 (1%) Frame = -1 Query: 1869 ELCAPENIIERLLAWSNSS-TLSDALEILVQVSRAARGRSDLAFNNVVPVVLELSKSLSN 1693 E PE +++ LL S+ S +L D L+ L++ S+ GRSDLA +++P +L L + L Sbjct: 2 EASLPEEVLQPLLHASDLSYSLEDCLKFLLESSKTDSGRSDLASKSILPSILRLLQLLPY 61 Query: 1692 ASNRGNXXXXXXXXXXLCAGEILNQNSFVEGNGVEVVSVALSSAGVCSDLDCEIVRMGLQ 1513 S+R LCAGE+ NQNSFV+ +G +VS L SA D E VR GLQ Sbjct: 62 PSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHDGSAIVSDLLDSAIA----DFETVRFGLQ 117 Query: 1512 VLGNVSLAGEKHRKVIWDRFFPGGFLEVARILRSEIVDPLCMVLHNCCDGKDERVTELCG 1333 VL NV L GEK ++ +W RF+P FL +A+I + E DPLCM+L+ C DG E +ELC Sbjct: 118 VLANVVLFGEKRQRDVWLRFYPERFLSIAKIRKRETFDPLCMILYTCVDGSSEIASELCS 177 Query: 1332 FRGLPIVAEIIRTASEVG-FEGNWLKLLLSQQCXXXXXXXXXXXXXXLAYEAREDFKCQD 1156 +GL I+AE +RT+S VG E WLKLL+S+ C YE E+ Sbjct: 178 CQGLTIIAETLRTSSSVGSVEDYWLKLLVSRICVEDGYFLKLFSKL---YEDAEN----- 229 Query: 1155 TFFTTEQSFLLSILSENLNQQINEISVSNEFALCVYGIMKRAIGVVDFSSRGKSGLPTGS 976 F++EQ+FL+ ++S+ N++I ++S+ + A + G+ ++++ V DF S +S LPTGS Sbjct: 230 EIFSSEQAFLVRMVSDIANERIGKVSIPKDTACSILGLFRQSVDVFDFVSGERSELPTGS 289 Query: 975 PSIDVLGYSVIILRDVCA----KESAESAKIEGPIDVVXXXXXXXXXXXXXXXXXXLEPP 808 +DV+GYS++I+RD CA +E E K G D V L+PP Sbjct: 290 TIVDVMGYSLVIIRDACAGGRLEELKEDNKDSG--DTVELLLSSGLIELLLDLLSKLDPP 347 Query: 807 EIIRKSISQSPNQVEHACSDSLKVCPYKGFRRDIVAVIGNCLYRRKRAQDEIRQKNGILL 628 I+K+++QSP+ + S SLK CPY+GFRRDIV+VIGNC YRRK QDEIR+++G+ L Sbjct: 348 TTIKKALNQSPS----SSSSSLKPCPYRGFRRDIVSVIGNCAYRRKEVQDEIRERDGLFL 403 Query: 627 LMQQCVTDESNPFLREWGIWSVRNLLEGNEENKREVADMELQGSVDVPEITELGLRVEVD 448 ++QQCVTD+ NPFLREWG+W +RNLLEGN EN+ VA++E++GSVDVP++ E+GLRVE+D Sbjct: 404 MLQQCVTDDENPFLREWGLWCIRNLLEGNPENQEVVAELEIKGSVDVPQLREIGLRVEID 463 Query: 447 QKNRRAKLVN 418 K R KLVN Sbjct: 464 PKTARPKLVN 473