BLASTX nr result

ID: Sinomenium21_contig00014562 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00014562
         (2050 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   464   e-128
ref|XP_007022647.1| ARM repeat superfamily protein, putative iso...   442   e-121
ref|XP_007022650.1| ARM repeat superfamily protein, putative iso...   442   e-121
ref|XP_007022651.1| ARM repeat superfamily protein, putative iso...   441   e-121
ref|XP_007022648.1| ARM repeat superfamily protein, putative iso...   441   e-121
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   439   e-120
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   436   e-119
ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     419   e-114
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   419   e-114
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   417   e-114
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   415   e-113
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   415   e-113
ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prun...   414   e-113
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   412   e-112
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       410   e-112
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           409   e-111
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   408   e-111
ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phas...   403   e-109
gb|EYU23502.1| hypothetical protein MIMGU_mgv1a005564mg [Mimulus...   375   e-101
gb|EYU22629.1| hypothetical protein MIMGU_mgv1a025194mg, partial...   371   e-100

>ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera]
          Length = 494

 Score =  464 bits (1194), Expect = e-128
 Identities = 257/486 (52%), Positives = 325/486 (66%), Gaps = 4/486 (0%)
 Frame = -1

Query: 2011 PENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXX 1832
            PENI++ L + S+SSTL + LE+L++ S+T  GR DL  KN +P+VL+LS+SLS      
Sbjct: 11   PENILQPLFSVSNSSTLDETLELLIEASKTPGGRLDLGSKNILPVVLQLSQSLSYPSGHD 70

Query: 1831 XXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXG-FGSDLDCEIVRIGLQLLGN 1655
                       LCAGE+ NQN F                    SD D  I+R+GLQLLGN
Sbjct: 71   ILLLSLKLLRNLCAGEMTNQNLFIEQNGVKAVSTILLSFVGLDSDSDYGIIRMGLQLLGN 130

Query: 1654 VSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGL 1475
            VSLAGE H +AVW  FFP GFLE+AR+R  E  DPLC V++ C  +  E + E+CG +GL
Sbjct: 131  VSLAGERHQRAVWHHFFPAGFLEIARVRTLETSDPLCMVIYTCFDQSHEFITEICGDQGL 190

Query: 1474 QIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGDFKCQD---TF 1304
             I+AEIVRTAS VGFEE+WLKLLLS+IC                G  SG+++  +     
Sbjct: 191  PILAEIVRTASTVGFEEDWLKLLLSRICLEESHFPMLFSKLCPVG-TSGNYESIEFKVDV 249

Query: 1303 FTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPS 1124
            F  EQ+FL+ I++E LN+QIN+++VS+D ALC+ GILK+  GV+D  S  KSG   GS +
Sbjct: 250  FASEQAFLMDIVAEILNEQINKMTVSSDVALCVLGILKKSAGVLDSVSTCKSGFSAGSNA 309

Query: 1123 IDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKS 944
            I+VL YS+ IL+++CA++  +SS   GS+DVV                  LEPP II+K+
Sbjct: 310  INVLKYSLTILKEICARDAQKSSNEHGSVDVVDLLVSSGLLELLLCLLRDLEPPAIIRKA 369

Query: 943  ISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCV 764
            I + +NQ +   S S K  PY+GF+RD+VAVIGNC YRRKHVQ+EIR++NGILLL+QQCV
Sbjct: 370  IKQGENQ-DGAASYSPKHYPYRGFRRDLVAVIGNCAYRRKHVQNEIRERNGILLLLQQCV 428

Query: 763  TDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRA 584
            TDE N FLREWGIW VRNLLE N ENQR VAE+ELQGSVDVPEI GLGLRVEVDQK  RA
Sbjct: 429  TDEENQFLREWGIWCVRNLLEGNVENQRVVAELELQGSVDVPEIAGLGLRVEVDQKTGRA 488

Query: 583  KLVNIS 566
            KLVN+S
Sbjct: 489  KLVNVS 494


>ref|XP_007022647.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508722275|gb|EOY14172.1| ARM repeat superfamily
            protein, putative isoform 1 [Theobroma cacao]
          Length = 531

 Score =  442 bits (1138), Expect = e-121
 Identities = 241/501 (48%), Positives = 319/501 (63%), Gaps = 2/501 (0%)
 Frame = -1

Query: 2050 KKMEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVL 1871
            K+M     P     E +++ L++ S+SS+L +ALEIL++ SRT   R++LAL+N +P VL
Sbjct: 11   KEMVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVL 70

Query: 1870 ELSKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDC 1691
            +L +S                   LCAGE+ NQN+F                   S+ D 
Sbjct: 71   KLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDS 130

Query: 1690 EIVRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKD 1511
             ++R+ LQ+L NVSLAGE+H +A+W +FFP  F  +AR+R  E  DPLC +L+ CC  + 
Sbjct: 131  GVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRP 190

Query: 1510 ERVAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA- 1334
              VAELC   GL IV  I+RT + VGF E+W KLLLS++C                  + 
Sbjct: 191  GLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSE 250

Query: 1333 -SGDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSK 1157
             SG+    D  F  EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+
Sbjct: 251  NSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASR 310

Query: 1156 GKSGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXX 977
            G S LPTG  SIDV+GYS+IILRD+CA+EG    K + S+DVV                 
Sbjct: 311  GMSSLPTGCTSIDVMGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLR 369

Query: 976  XLEPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQK 797
             L+PP II+K +    NQ  +  +   K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQK
Sbjct: 370  DLDPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQK 427

Query: 796  NGILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGL 617
            NGILLL+QQCVTD+ NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGL
Sbjct: 428  NGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGL 487

Query: 616  RVEVDQKNRRAKLVNIS*DEI 554
            RVEVDQK RRAK   +  D++
Sbjct: 488  RVEVDQKTRRAKDFALPPDQV 508


>ref|XP_007022650.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao]
            gi|508722278|gb|EOY14175.1| ARM repeat superfamily
            protein, putative isoform 4 [Theobroma cacao]
          Length = 500

 Score =  442 bits (1137), Expect = e-121
 Identities = 240/492 (48%), Positives = 315/492 (64%), Gaps = 2/492 (0%)
 Frame = -1

Query: 2050 KKMEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVL 1871
            K+M     P     E +++ L++ S+SS+L +ALEIL++ SRT   R++LAL+N +P VL
Sbjct: 11   KEMVGESLPEFNGLEGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVL 70

Query: 1870 ELSKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDC 1691
            +L +S                   LCAGE+ NQN+F                   S+ D 
Sbjct: 71   KLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDS 130

Query: 1690 EIVRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKD 1511
             ++R+ LQ+L NVSLAGE+H +A+W +FFP  F  +AR+R  E  DPLC +L+ CC  + 
Sbjct: 131  GVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRP 190

Query: 1510 ERVAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA- 1334
              VAELC   GL IV  I+RT + VGF E+W KLLLS++C                  + 
Sbjct: 191  GLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSE 250

Query: 1333 -SGDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSK 1157
             SG+    D  F  EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+
Sbjct: 251  NSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASR 310

Query: 1156 GKSGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXX 977
            G S LPTG  SIDV+GYS+IILRD+CA+EG    K + S+DVV                 
Sbjct: 311  GMSSLPTGCTSIDVMGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLR 369

Query: 976  XLEPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQK 797
             L+PP II+K +    NQ  +  +   K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQK
Sbjct: 370  DLDPPAIIRKVLKEGDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQK 427

Query: 796  NGILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGL 617
            NGILLL+QQCVTD+ NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGL
Sbjct: 428  NGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGL 487

Query: 616  RVEVDQKNRRAK 581
            RVEVDQK RRAK
Sbjct: 488  RVEVDQKTRRAK 499


>ref|XP_007022651.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao]
            gi|508722279|gb|EOY14176.1| ARM repeat superfamily
            protein, putative isoform 5 [Theobroma cacao]
          Length = 519

 Score =  441 bits (1134), Expect = e-121
 Identities = 238/487 (48%), Positives = 315/487 (64%), Gaps = 2/487 (0%)
 Frame = -1

Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829
            E +++ L++ S+SS+L +ALEIL++ SRT   R++LAL+N +P VL+L +S         
Sbjct: 13   EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72

Query: 1828 XXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVS 1649
                      LCAGE+ NQN+F                   S+ D  ++R+ LQ+L NVS
Sbjct: 73   LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132

Query: 1648 LAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQI 1469
            LAGE+H +A+W +FFP  F  +AR+R  E  DPLC +L+ CC  +   VAELC   GL I
Sbjct: 133  LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192

Query: 1468 VAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA--SGDFKCQDTFFTR 1295
            V  I+RT + VGF E+W KLLLS++C                  +  SG+    D  F  
Sbjct: 193  VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252

Query: 1294 EQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDV 1115
            EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+G S LPTG  SIDV
Sbjct: 253  EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312

Query: 1114 LGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKSISR 935
            +GYS+IILRD+CA+EG    K + S+DVV                  L+PP II+K +  
Sbjct: 313  MGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 371

Query: 934  TKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCVTDE 755
              NQ  +  +   K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQKNGILLL+QQCVTD+
Sbjct: 372  GDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDD 429

Query: 754  CNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRAKLV 575
             NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGLRVEVDQK RRAK  
Sbjct: 430  DNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAKDF 489

Query: 574  NIS*DEI 554
             +  D++
Sbjct: 490  ALPPDQV 496


>ref|XP_007022648.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao]
            gi|590613384|ref|XP_007022649.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|590613394|ref|XP_007022652.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722276|gb|EOY14173.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722277|gb|EOY14174.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722280|gb|EOY14177.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
          Length = 488

 Score =  441 bits (1133), Expect = e-121
 Identities = 237/478 (49%), Positives = 311/478 (65%), Gaps = 2/478 (0%)
 Frame = -1

Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829
            E +++ L++ S+SS+L +ALEIL++ SRT   R++LAL+N +P VL+L +S         
Sbjct: 13   EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72

Query: 1828 XXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVS 1649
                      LCAGE+ NQN+F                   S+ D  ++R+ LQ+L NVS
Sbjct: 73   LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132

Query: 1648 LAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQI 1469
            LAGE+H +A+W +FFP  F  +AR+R  E  DPLC +L+ CC  +   VAELC   GL I
Sbjct: 133  LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192

Query: 1468 VAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA--SGDFKCQDTFFTR 1295
            V  I+RT + VGF E+W KLLLS++C                  +  SG+    D  F  
Sbjct: 193  VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252

Query: 1294 EQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDV 1115
            EQ+FLL I+SE LN++I EI VS++FALC+ GI KR V V+DF S+G S LPTG  SIDV
Sbjct: 253  EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312

Query: 1114 LGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKSISR 935
            +GYS+IILRD+CA+EG    K + S+DVV                  L+PP II+K +  
Sbjct: 313  MGYSLIILRDICAREGVGDLKND-SLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 371

Query: 934  TKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCVTDE 755
              NQ  +  +   K+CPYKGF+RD++AVIGNC YRRKHVQDEIRQKNGILLL+QQCVTD+
Sbjct: 372  GDNQGLNLSAS--KLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDD 429

Query: 754  CNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRAK 581
             NP+LREWGIWS+RNLLE + ENQ+ VA++ELQGSVD+PE++ LGLRVEVDQK RRAK
Sbjct: 430  DNPYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis]
            gi|223548954|gb|EEF50443.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 497

 Score =  439 bits (1129), Expect = e-120
 Identities = 244/487 (50%), Positives = 311/487 (63%), Gaps = 2/487 (0%)
 Frame = -1

Query: 2020 LCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAX 1841
            L  PE++++ L   S S  L +ALEIL++ SR   GR++LA K+ +PLVL+L KS+S   
Sbjct: 3    LFLPEDLLQLLFRASKSYDLKEALEILIETSRIDDGRANLAAKDVLPLVLKLFKSISYPS 62

Query: 1840 XXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLL 1661
                          LCAGEI NQN F                G   + D  I+R+GLQ+L
Sbjct: 63   GDQFLTLSLKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEPDYGIIRLGLQVL 122

Query: 1660 GNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFR 1481
             NVSLAGE+H +A+W  FFP  F+ +A+ R     DPLC +++ CC      V ELCG R
Sbjct: 123  ANVSLAGEKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCGDR 182

Query: 1480 GLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDA--SGDFKCQDT 1307
            GL +VAEIVRTAS VG+ E+W KLLLS+IC               AGD+  S        
Sbjct: 183  GLAVVAEIVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSSSD 242

Query: 1306 FFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSP 1127
             F+ EQ++LLS +SE LN+++ +ISVS DFA  ++GI KR VGV+DF S+G SGLPTGS 
Sbjct: 243  LFSTEQAYLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTGSA 302

Query: 1126 SIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKK 947
            ++DVLGYS+ ILRD CA  G     +  S+DVV                  LEPP +IKK
Sbjct: 303  AVDVLGYSLTILRDTCALHG--KGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIKK 360

Query: 946  SISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQC 767
            ++ + +N  E   S S K CPYKGF+RDIVAVIGNC ++R +VQDEIRQK+ I LL+QQC
Sbjct: 361  AMKQNENH-EPASSRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQQC 419

Query: 766  VTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRR 587
            VTDE NPFLREWG+W VRNLLE N ENQ+ VAE+ELQG+V VPE++GLGLRVEVD   RR
Sbjct: 420  VTDEDNPFLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNTRR 479

Query: 586  AKLVNIS 566
            A+LVN+S
Sbjct: 480  ARLVNVS 486


>ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina]
            gi|567858312|ref|XP_006421839.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858314|ref|XP_006421840.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858316|ref|XP_006421841.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|568874427|ref|XP_006490317.1| PREDICTED:
            ataxin-10-like isoform X1 [Citrus sinensis]
            gi|568874429|ref|XP_006490318.1| PREDICTED:
            ataxin-10-like isoform X2 [Citrus sinensis]
            gi|557523711|gb|ESR35078.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523712|gb|ESR35079.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523713|gb|ESR35080.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523714|gb|ESR35081.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
          Length = 497

 Score =  436 bits (1120), Expect = e-119
 Identities = 234/494 (47%), Positives = 315/494 (63%), Gaps = 2/494 (0%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            M+D+ S ++   E++++ L+  S+SS+L DALEIL++ S+T  GRSDLA KN +P VL+L
Sbjct: 1    MDDASSLDISLSEDVLQPLLTTSNSSSLKDALEILIESSKTTVGRSDLASKNILPEVLQL 60

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
            ++S+ ++               LCAGEI NQ SF                G   D D  I
Sbjct: 61   TQSIPHSSGCHYLLLSLKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLDKDYGI 120

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            +RI LQ+L NVSLAGE H  A+W QFFP  F  +A +R  E  DPLC V++ CC      
Sbjct: 121  IRIALQVLANVSLAGETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGL 180

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASG- 1328
              ELCG +GL I+AEIV TA+ VGF+E+W K L+S+ C                G +   
Sbjct: 181  FKELCGDKGLAIMAEIVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNC 240

Query: 1327 -DFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151
             D   ++  F+ EQ+FLL I+SE +N++I EI V NDFAL + GI  + +G++DF+++G 
Sbjct: 241  EDSNSREGTFSSEQAFLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGT 300

Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971
              LPT S +I+VLGYS+ ILR++CA+E    S      D+V                  L
Sbjct: 301  PSLPTSSSAINVLGYSLSILRNICAREDPAGSSSVNRADLVDSLQSHGLIEMFLSLLRDL 360

Query: 970  EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791
            EPP II+K++ + +NQ E   + S K CPY GF+RD+VAVIGNC YRRKH+QDEIR+++G
Sbjct: 361  EPPAIIRKAMRQGENQ-EGTSAKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERDG 419

Query: 790  ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611
            ILLL+QQCVTDE NPF REWGIW VRNLLE N ENQ+ VA++ELQGS++VPE+T LGL+V
Sbjct: 420  ILLLLQQCVTDEDNPFSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLKV 479

Query: 610  EVDQKNRRAKLVNI 569
            EVD+  RRAKLVN+
Sbjct: 480  EVDKNTRRAKLVNV 493


>ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]
          Length = 501

 Score =  419 bits (1078), Expect = e-114
 Identities = 239/495 (48%), Positives = 301/495 (60%), Gaps = 2/495 (0%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            ++D     L  PEN+ + L+  S+SS+L  ALE L++ ++   GR DL+ KN V  VL L
Sbjct: 8    VDDQIVAELTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 67

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +SLS+                LCAGEI+NQN F                G   D DC I
Sbjct: 68   CQSLSSISYRYLLLLSLKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLTPDPDCMI 127

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            +R+GLQLLGN S+ G E    VW Q FP  FL++AR+R  EI DPLC V++ CC   D  
Sbjct: 128  IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGL 187

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQIC--XXXXXXXXXXXXXXLAGDAS 1331
            + +LC  +GL I+ EI+RTAS VG +E WLKLLLS++C                 + + +
Sbjct: 188  LTDLCSEKGLPILIEILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPSVENN 247

Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151
            G        F  EQS+LLS LSE LN+++  I VS+DFA  I+GILK   GV DF  +GK
Sbjct: 248  GVVTHVVDQFVIEQSYLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFSIRGK 307

Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971
            S LP GS  IDVLGYS+ ILRD+CA +   SSK E S DVV                  L
Sbjct: 308  SDLPVGSAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 367

Query: 970  EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791
            EPP  I+K++ + + + E   S S + CPY+GF+RDIVA++GNC YRR+HVQDEIR KNG
Sbjct: 368  EPPTTIRKAMKQDQIK-EGTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNG 426

Query: 790  ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611
            ILLL+QQCV DE NPFLREWGIW VRNLLE N ENQ  + ++ELQG+VDVPE+  LGLRV
Sbjct: 427  ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 486

Query: 610  EVDQKNRRAKLVNIS 566
            EVD   R  KLVN S
Sbjct: 487  EVDPVTRHTKLVNSS 501


>ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum]
            gi|460373805|ref|XP_004232704.1| PREDICTED:
            ataxin-10-like isoform 2 [Solanum lycopersicum]
          Length = 501

 Score =  419 bits (1077), Expect = e-114
 Identities = 239/495 (48%), Positives = 300/495 (60%), Gaps = 2/495 (0%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            M+D     L  PEN+ + L+  S+SS+L  AL+ L+Q S+   GR DL+ KN V  VL L
Sbjct: 8    MDDQIVSELTIPENVAKELLLVSNSSSLETALDKLIQLSKEGGGRLDLSSKNVVTTVLHL 67

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +SLS+                LCAGEI NQN F                G   D DC I
Sbjct: 68   CQSLSSISYRNLLLLSLKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLSPDPDCMI 127

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            +R+GLQLLGN S+ G E    VW Q FP  FL++AR+R  EI DPLC V++ CC   D  
Sbjct: 128  IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGL 187

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQIC--XXXXXXXXXXXXXXLAGDAS 1331
            + +LC  +GL I+ EI+RTAS VG +E WLKLLLS++C                 + + +
Sbjct: 188  LTDLCSEQGLPILFEILRTASAVGLKEVWLKLLLSKLCIEGSHISSIFFKLHSYPSVEDN 247

Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151
            G        F  EQ +LLSILSE LN+++  I VS+DFA  I+GILK   GV+DF  +GK
Sbjct: 248  GVVTHVADQFVIEQPYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFSIRGK 307

Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971
            S LP GS  IDVLGYS+ ++RD+CA +   SSK E S DVV                  L
Sbjct: 308  SDLPVGSAPIDVLGYSLTLMRDICASDHLSSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 367

Query: 970  EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791
            EPP  I+ ++   + +    PS S + CPY+GF+RDIVA++GNC YRR+HVQDEIR KNG
Sbjct: 368  EPPTTIRNAMKPDQIKEGTIPS-SFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNG 426

Query: 790  ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611
            ILLL+QQCV DE NPFLREWGIW VRNLLE N ENQ  + ++ELQG+VDVPE+  LGLRV
Sbjct: 427  ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 486

Query: 610  EVDQKNRRAKLVNIS 566
            EVD   RR KLVN S
Sbjct: 487  EVDPVTRRTKLVNSS 501


>ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa]
            gi|222861524|gb|EEE99066.1| ataxin-related family protein
            [Populus trichocarpa]
          Length = 496

 Score =  417 bits (1072), Expect = e-114
 Identities = 239/491 (48%), Positives = 304/491 (61%), Gaps = 6/491 (1%)
 Frame = -1

Query: 2020 LCTPEN-IIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSN- 1847
            L  P+N  +E L   S SS L + LEIL+  ++T  GR+DLA KN +P+VL+L   L N 
Sbjct: 9    LSFPQNDFLEPLFTASKSSDLKETLEILIAIAKTDDGRADLASKNILPVVLQLITHLLND 68

Query: 1846 AXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGS-DLDCEIVRIGL 1670
                            LCAGE+ NQ SF                   S + D  I+R+GL
Sbjct: 69   PFDHEYLSLSLRLMRNLCAGEVANQKSFIQLNGVGIFLTVLRSKKVASSEPDHGIIRMGL 128

Query: 1669 QLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELC 1490
            Q+L NVSLAG+EH +A+W   F      +A++R     DPLC +++ CC    E V +LC
Sbjct: 129  QVLANVSLAGKEHQQAIWGGLFHDELYMLAKVRSQGTCDPLCMIIYACCDGSPELVLQLC 188

Query: 1489 GFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLA---GDASGDFK 1319
            G +GL IV EI+RTAS VGF E WLKLLLS+IC                    +   +  
Sbjct: 189  GNQGLPIVVEIIRTASLVGFGEEWLKLLLSRICLEDIYFPQLFSRIYSVCSYCENGEEIS 248

Query: 1318 CQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLP 1139
                 F  EQ++LL+I+SE LN+++ EI++ NDFALCI+GI K+ V   +F S+ +S LP
Sbjct: 249  LSSNPFFTEQAYLLNIVSEILNERLKEITILNDFALCIFGIFKKSVEAFEFGSRAESRLP 308

Query: 1138 TGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPE 959
            TG   IDVLGYS+ ILRD+CA  G      E  +DVV                  LEPP+
Sbjct: 309  TGFAVIDVLGYSLTILRDICANNGGVGK--EDLVDVVDSLLSSGLLDLLLCLLRDLEPPK 366

Query: 958  IIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLL 779
            II+K++++  NQ E   S   KVCPYKGF+RD+VAVIGNC YRRKHVQD+IRQKNG+LL+
Sbjct: 367  IIRKAMNQAGNQ-EATTSYFPKVCPYKGFRRDLVAVIGNCAYRRKHVQDDIRQKNGMLLM 425

Query: 778  MQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQ 599
            +QQCVTDE NPFLREWGIWS+RNLLE N ENQ+ VAE+ELQGSVD+PE+ GLGL+VEVDQ
Sbjct: 426  LQQCVTDEDNPFLREWGIWSMRNLLEGNSENQQAVAELELQGSVDMPELAGLGLKVEVDQ 485

Query: 598  KNRRAKLVNIS 566
              R AKLVNIS
Sbjct: 486  NTRSAKLVNIS 496


>ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10
            [Medicago truncatula]
          Length = 491

 Score =  415 bits (1067), Expect = e-113
 Identities = 231/493 (46%), Positives = 306/493 (62%), Gaps = 2/493 (0%)
 Frame = -1

Query: 2038 DSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSK 1859
            D+P  N    +  +  L   S+S+TL  +LE L++ S++   RS  A K  +P +L +  
Sbjct: 6    DAPFSNHPISQQSLNSLFDLSNSTTLQTSLETLIESSKSTSNRSLYACKKILPTILTV-- 63

Query: 1858 SLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGF-GSDLDCEIV 1682
             L +                LCAGEILNQN F                   GSD    +V
Sbjct: 64   -LHSPPSLHILSLCFKLLRNLCAGEILNQNMFLENDGVFIVVSSILRSEVVGSDY--MLV 120

Query: 1681 RIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERV 1502
            R GLQ+L NV LAG+EH KAVWD+ FP GFL VARI + E+ DPLC V++ CC   D+  
Sbjct: 121  RWGLQVLANVCLAGKEHQKAVWDEMFPVGFLSVARIGKKEVNDPLCMVIYTCCDGNDQWF 180

Query: 1501 AELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASG-D 1325
            +E+C   G  ++ EIVRTAS   F E+W+KLLLS+IC                    G D
Sbjct: 181  SEVCSDGGWNVLVEIVRTASSASFGEDWIKLLLSRICLEDSQLRVLFSKLRFMDIPDGED 240

Query: 1324 FKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSG 1145
             K +D  F+ EQ+FLL I+S+ LN++I ++++S + A  +YGI K+ +GV++   +GKSG
Sbjct: 241  TKTKDDQFSSEQAFLLQIISDILNERIGDVTISLEVASFVYGIFKKSIGVLEHAVRGKSG 300

Query: 1144 LPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEP 965
            LP+G   +DVLGYS+ +LRD+CA +  R +  +   +VV                  LEP
Sbjct: 301  LPSGITDVDVLGYSLTMLRDICAHDSVRGNSED--TEVVDMLLSYGLIELVFILLGDLEP 358

Query: 964  PEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGIL 785
            P II+K +  ++N      S S K CPYKGF+RDIVA+IGNC+YRRKHVQDEIR +NGIL
Sbjct: 359  PTIIRKGMKHSENP--DGASSSSKPCPYKGFRRDIVALIGNCVYRRKHVQDEIRSRNGIL 416

Query: 784  LLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEV 605
            LL+QQCVTDE NP+LREWGIW VRN+LE NEENQ+E++E++LQGS DVPEI+ LGLRVEV
Sbjct: 417  LLLQQCVTDEDNPYLREWGIWCVRNMLEGNEENQKEISELQLQGSADVPEISALGLRVEV 476

Query: 604  DQKNRRAKLVNIS 566
            DQK RRAKLVN+S
Sbjct: 477  DQKTRRAKLVNVS 489


>ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca
            subsp. vesca]
          Length = 492

 Score =  415 bits (1066), Expect = e-113
 Identities = 230/495 (46%), Positives = 310/495 (62%), Gaps = 2/495 (0%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            M+++  P    PE++++ L++ S+SS L D+LE L+Q  +T  GR DL+ KN +P V++L
Sbjct: 1    MDNTTLPECSVPEHVLQALLSVSNSSKLVDSLEDLVQVCKTADGREDLSAKNVLPTVIQL 60

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +SLS                 LCAGE+ NQNSF                    + D  I
Sbjct: 61   VQSLSYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIISNILSSAS-SLEPDFGI 119

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            + +GLQ+L NV+LAGE    A+W Q F   F+ +AR+R  +   PLC +++ CC    E 
Sbjct: 120  ICVGLQVLANVALAGERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPEL 179

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAG--DAS 1331
            VA+LCG  G+ IV EIV+TA+  GF E+W KLLLS+IC                G  +  
Sbjct: 180  VAQLCGDCGVTIVKEIVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENG 239

Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151
             D +     F  EQ FLL  +SE LN+++NEI+V +DFALC++GI K  + V+ + ++G+
Sbjct: 240  DDTEGGQESFLEEQEFLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGR 299

Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971
            SGLPTGS  IDVLGYS+ ILRD+CA+   R   ++ ++DVV                  L
Sbjct: 300  SGLPTGSIDIDVLGYSLTILRDICAQGTLRGCTVD-TMDVVDALISYGLIELLLCLLRDL 358

Query: 970  EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791
            EPP IIKKS+++ K+Q     S S K CPYKGF+RDIV VIGNCLY R+ VQDEIR+K+G
Sbjct: 359  EPPAIIKKSVNQAKDQEGSNYSAS-KPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKDG 417

Query: 790  ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611
            +LLL+QQCVTD+ NP+LREWGIW VRNLLE N+ENQ+ VAE+ELQGSVDVP++  LGLRV
Sbjct: 418  LLLLLQQCVTDDDNPYLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLRV 477

Query: 610  EVDQKNRRAKLVNIS 566
            E++    R KLVNIS
Sbjct: 478  EMNPATGRPKLVNIS 492


>ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica]
            gi|462415516|gb|EMJ20253.1| hypothetical protein
            PRUPE_ppa004765mg [Prunus persica]
          Length = 492

 Score =  414 bits (1064), Expect = e-113
 Identities = 228/494 (46%), Positives = 306/494 (61%), Gaps = 1/494 (0%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            M+ +       PE++++ L++ S+SSTL D+LE L+Q  R   GR+DLA K+ +P V++L
Sbjct: 1    MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +SL                  LCAGE+ NQ SF                    + D  +
Sbjct: 61   IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            +R+GLQ+L NVSLAGE H   +W Q FP  FL +AR++  E  DPLC V+  CC    E 
Sbjct: 121  IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDAS-G 1328
              +LCG  G+ I+ EIVRT + VGF E+W+KLLLS+IC               A   +  
Sbjct: 181  FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240

Query: 1327 DFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKS 1148
            D + ++  F+ +Q+F L I+S+ LN+++ EI+V  DFALC++GI K+ VG ++  ++G+S
Sbjct: 241  DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300

Query: 1147 GLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLE 968
            GLPTG+  IDVLGYS+ ILRDVCA++  R  + E   D V                  LE
Sbjct: 301  GLPTGTSMIDVLGYSLTILRDVCAQKTLRGFQ-EDLGDAVDVLLSHGLIELILCLLRDLE 359

Query: 967  PPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGI 788
            PP II+K+I + + Q +   S S K CPYKGF+RDIVAVIGNC Y+RK VQDEIRQ++GI
Sbjct: 360  PPAIIRKAIKQGEGQ-DGTNSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDGI 418

Query: 787  LLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVE 608
            LLL+QQC  DE NPFL+EWGIW VRNLLE NE+N+R V E+ELQGSVD PEI GLG RVE
Sbjct: 419  LLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRVE 478

Query: 607  VDQKNRRAKLVNIS 566
            V+ +  R KLVN+S
Sbjct: 479  VNPETGRPKLVNVS 492


>ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. vesca]
          Length = 490

 Score =  412 bits (1059), Expect = e-112
 Identities = 227/493 (46%), Positives = 306/493 (62%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            M+++  P    PE++I+ L++ S+SS L +++E L+Q  +T  GR DLA KN +P V++L
Sbjct: 1    MDNTALPECSVPEDVIQALLSVSNSSNLVESMEDLIQVCKTADGREDLAAKNVLPTVIQL 60

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +SL                  LCAGE+ NQNSF                    + D  I
Sbjct: 61   VQSLLYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSA-ISLEPDFWI 119

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            + +GLQ+L N +LAGE    A+W Q F   F+ +AR+R  +   PLC ++  CC    E 
Sbjct: 120  ICVGLQVLANAALAGERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPEL 179

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGD 1325
            VA+LCG  G+ I+ EIV+TA+ V F E+W KLLLS+IC                G+ + D
Sbjct: 180  VAQLCGDCGVTILKEIVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVGENAED 239

Query: 1324 FKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSG 1145
             +     F++EQ FLL  +SE LN+ ++EI+V NDFALC++GI K  + V+ + ++G+SG
Sbjct: 240  TEGGRESFSKEQEFLLKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGRSG 299

Query: 1144 LPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEP 965
            LPTGS  IDVLGYS+ ILRD CA +G      + ++DVV                  LEP
Sbjct: 300  LPTGSIDIDVLGYSLTILRDTCA-QGTLRGSTKDTMDVVDALISYGLIELLLSLLRDLEP 358

Query: 964  PEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGIL 785
            P IIKKSI++ +NQ E   S +LK CPYKGF+RDIVAVIGNCLY RK VQDEIR+K+G+L
Sbjct: 359  PAIIKKSINQAENQ-EGSSSSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGLL 417

Query: 784  LLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEV 605
            LL+QQCV D+ NP+ REWGIW  RNLL+ N+ENQR VAE+EL+GSVDVP +  LGLRVE+
Sbjct: 418  LLLQQCVIDDDNPYSREWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVEM 477

Query: 604  DQKNRRAKLVNIS 566
            +    R KLVNIS
Sbjct: 478  NLATGRPKLVNIS 490


>ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]
          Length = 468

 Score =  410 bits (1055), Expect = e-112
 Identities = 225/462 (48%), Positives = 290/462 (62%), Gaps = 1/462 (0%)
 Frame = -1

Query: 1951 LEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXXXXXXXXXXXXLCAGEILNQ 1772
            LE L+  S++  GRS+LA K  +P VL +  S +                 LCAGE  NQ
Sbjct: 9    LENLIHTSKSDSGRSNLASKRVLPAVLNILNSQTLPLDHNLLSLCFKLLRNLCAGEFENQ 68

Query: 1771 NSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVSLAGEEHGKAVWDQFFPGGF 1592
            N F                      D  +VR GLQ+L NV LAG++H KA+W++ FP GF
Sbjct: 69   NLFLEFDGVVVVSSILMSEAGSLRPDHMLVRWGLQVLANVCLAGKQHQKAIWEEIFPLGF 128

Query: 1591 LEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQIVAEIVRTASEVGFEENWLK 1412
            + +AR+   EI DPLC V++ CC    E   ELC   GL +VAEIV+TAS   F E+W+K
Sbjct: 129  VSLARLGTKEICDPLCMVIYTCCDGNHECFGELCSDSGLPVVAEIVKTASSASFGEDWIK 188

Query: 1411 LLLSQICXXXXXXXXXXXXXXLAGDASG-DFKCQDTFFTREQSFLLSILSENLNQQINEI 1235
            LLLS+IC                    G D   +D  F+ EQ+FLL ILSE LN+++ ++
Sbjct: 189  LLLSRICLEESQLPMLFPKLRFMDIPEGEDIDSKDYQFSFEQAFLLQILSEILNERLRDV 248

Query: 1234 SVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDVLGYSVIILRDVCAKEGARSS 1055
             VS D AL +YG+ K+ VGV++   +GKSGLP+GS ++D LGYS+ ILRD+CA +  R +
Sbjct: 249  VVSKDVALFVYGVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSLTILRDICAHDSVRGN 308

Query: 1054 KIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKSISRTKNQVEHKPSDSLKVCPYKG 875
              E + DVV                  LEPP II+K I +++NQ     S S K CPYKG
Sbjct: 309  P-EDTNDVVDVLLSQDIIELLLILLGDLEPPAIIRKGIKQSENQ--EGASCSSKPCPYKG 365

Query: 874  FQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCVTDECNPFLREWGIWSVRNLLEAN 695
            F+RDIV++IGNC+YRRKH QDEIR +NGILLL+QQCVTDE NPFLREWGIWSVRN+LE N
Sbjct: 366  FRRDIVSLIGNCVYRRKHAQDEIRGRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGN 425

Query: 694  EENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRAKLVNI 569
            EENQ+ V+E++LQGS DVP+I+ LGLR+EVDQK RRAKLVN+
Sbjct: 426  EENQKVVSELQLQGSADVPQISALGLRIEVDQKTRRAKLVNV 467


>ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]
          Length = 498

 Score =  409 bits (1050), Expect = e-111
 Identities = 230/487 (47%), Positives = 302/487 (62%), Gaps = 7/487 (1%)
 Frame = -1

Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829
            E+ ++ L   S+SS +  +LEIL+Q++++  GR +LA K  +P VL +  SL++A     
Sbjct: 14   EDTLQLLFEASNSSNMEKSLEILIQNAKSDSGRLELASKRILPAVLNIVHSLTHASHHHH 73

Query: 1828 XXXXXXXXXXL------CAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQ 1667
                             CAGE  NQ+SF                   S  D  +VR GLQ
Sbjct: 74   HQHNHILCLSFKLLRNLCAGEAANQDSFLELDGVAVVCSVLRSEAACSGPDHGLVRWGLQ 133

Query: 1666 LLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCG 1487
            +L NVSLAG++H  A+W + +  GF+ +AR+   E  DPLC V++ CC    E    L  
Sbjct: 134  VLANVSLAGKQHQCAIWKELYLDGFVSLARLHTKETCDPLCMVIYTCCDGNPEWFKRLSS 193

Query: 1486 FRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGDF-KCQD 1310
              G  ++AEIVRTAS   F E+WLKLLLS+IC               A     +  + +D
Sbjct: 194  EDGWFVMAEIVRTASSASFGEDWLKLLLSRICLEESQLPVLFSKLQFADVPKVEVAESKD 253

Query: 1309 TFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGS 1130
              F+ EQ+FLL ILSE LN++  +++VS D AL ++GI K  +GV++  ++GKSGLP+G 
Sbjct: 254  DHFSFEQAFLLRILSEILNERHKDVTVSKDVALFVFGIFKNSIGVLEHATRGKSGLPSGF 313

Query: 1129 PSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIK 950
              +DVLGYS+ ILRD+CA++G R +  E S DVV                  LEPP II+
Sbjct: 314  VGVDVLGYSLTILRDICAQDGVRGNT-EDSNDVVDALLSYGLIELLLYLLEALEPPAIIR 372

Query: 949  KSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQ 770
            K + + +NQ     S S K CPYKGF+RDIVA+IGNC+YRRKH QDEIR +NGILLL+QQ
Sbjct: 373  KGLKQCENQ--DGASCSFKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQ 430

Query: 769  CVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNR 590
            CVTDE NPFLREWGIWSVRN+LE N+ENQ+ VAE+E+QGS DVPEIT LGLRVEVDQ+ R
Sbjct: 431  CVTDEDNPFLREWGIWSVRNMLEGNDENQKVVAELEIQGSADVPEITSLGLRVEVDQRTR 490

Query: 589  RAKLVNI 569
            RAKLVNI
Sbjct: 491  RAKLVNI 497


>ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum]
            gi|565401994|ref|XP_006366477.1| PREDICTED:
            ataxin-10-like isoform X2 [Solanum tuberosum]
            gi|565401996|ref|XP_006366478.1| PREDICTED:
            ataxin-10-like isoform X3 [Solanum tuberosum]
            gi|565401998|ref|XP_006366479.1| PREDICTED:
            ataxin-10-like isoform X4 [Solanum tuberosum]
            gi|565402000|ref|XP_006366480.1| PREDICTED:
            ataxin-10-like isoform X5 [Solanum tuberosum]
          Length = 504

 Score =  408 bits (1049), Expect = e-111
 Identities = 237/495 (47%), Positives = 297/495 (60%), Gaps = 2/495 (0%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            ++D     +  PEN+ + L+  S+SS+L  ALE L++ ++   GR DL+ KN V  VL L
Sbjct: 11   VDDKIVAEVTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 70

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +SLS+                LCAGEI NQN F                G   D DC I
Sbjct: 71   CQSLSSISYRQLLLSSLKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLTPDPDCMI 130

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            +R+GLQLLGN S+ G E    VW Q FP  FL++AR+R  EI DPLC V++ CC   D  
Sbjct: 131  IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGL 190

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQIC--XXXXXXXXXXXXXXLAGDAS 1331
            + +LC  +GL I+ EI+RTAS V  +E WLKLLLS++C                 +   +
Sbjct: 191  LTDLCSEQGLPILIEILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNN 250

Query: 1330 GDFKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGK 1151
            G        F  EQ +LLSILSE +N QI  I VS+DFAL I+GILK    V+DF  +GK
Sbjct: 251  GVVTHATDQFVIEQPYLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGK 310

Query: 1150 SGLPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXL 971
            S LP G   IDVLGYS+ ILRD+CA +   SSK E S DVV                  L
Sbjct: 311  SDLPVGFAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 370

Query: 970  EPPEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNG 791
            EPP  I+K++ + +   E   S S + CPY+GF+RDIV++IGNC YRR++VQDEIR KNG
Sbjct: 371  EPPTTIRKAMKQDQ-ITEGIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKNG 429

Query: 790  ILLLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRV 611
            ILLL+QQCV DE NPFLREWGIW VRNLLE N ENQ  + ++ELQG+VDVPE+  LGLRV
Sbjct: 430  ILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRV 489

Query: 610  EVDQKNRRAKLVNIS 566
            EVD   RR KLVN S
Sbjct: 490  EVDPVTRRTKLVNAS 504


>ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris]
            gi|561021998|gb|ESW20728.1| hypothetical protein
            PHAVU_005G009900g [Phaseolus vulgaris]
          Length = 498

 Score =  403 bits (1036), Expect = e-109
 Identities = 225/485 (46%), Positives = 301/485 (62%), Gaps = 5/485 (1%)
 Frame = -1

Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829
            E+ ++ L   S+SS L  +LEIL+Q++++  GR +LA K  +P VL + +SL+ A     
Sbjct: 13   EDTLQLLFQASNSSNLEKSLEILIQNAKSDSGRLELASKRILPAVLNIVQSLAQASHHHH 72

Query: 1828 XXXXXXXXXXL----CAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLL 1661
                      L    CAGE  NQ SF                      D  +VR GLQ+L
Sbjct: 73   HNQTFSLCFKLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGPDHRLVRWGLQVL 132

Query: 1660 GNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFR 1481
             NVSL G++H +A+W++ +P GF  +AR+   EI DPLC V++ CC    E   +L    
Sbjct: 133  ANVSLGGKQHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDGNPEWFKKLSSDD 192

Query: 1480 GLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGD-FKCQDTF 1304
            G  +VAEIVRTAS   F+E+WLKLLLS+I                     G+  + ++  
Sbjct: 193  GWPVVAEIVRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVDVPEGEVIESKNGQ 252

Query: 1303 FTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPS 1124
            F+ EQ+FLL ILSE LN+++ +++VS D AL ++GI K+ +GV++   +GKSGLP+G   
Sbjct: 253  FSFEQAFLLQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAMRGKSGLPSGFTG 312

Query: 1123 IDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKKS 944
            +DVLGYS+ ILRD+CA++G R +    + DVV                  LEPP II+K 
Sbjct: 313  VDVLGYSLTILRDICAQDGMRGN----TKDVVDVLLSYGLIEFLLSLLGALEPPAIIRKG 368

Query: 943  ISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQCV 764
            + + +NQ     S   K CPYKGF+RDIVA+IGNC+YRRKH QDEIR +NGILLL+QQCV
Sbjct: 369  LKQIENQ--DNASCCSKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRDRNGILLLLQQCV 426

Query: 763  TDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRRA 584
            TDE NPFLREWGIWSVRN+LE N+ENQ+ VAE+E+QGS DVPEI  LGL+VEVDQ+ RR 
Sbjct: 427  TDEDNPFLREWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINALGLQVEVDQRTRRP 486

Query: 583  KLVNI 569
            KLVNI
Sbjct: 487  KLVNI 491


>gb|EYU23502.1| hypothetical protein MIMGU_mgv1a005564mg [Mimulus guttatus]
          Length = 479

 Score =  375 bits (964), Expect = e-101
 Identities = 215/491 (43%), Positives = 289/491 (58%)
 Frame = -1

Query: 2044 MEDSPSPNLCTPENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLEL 1865
            M+   S NL   +N+++ L   S SSTL +ALE L++ ++T  GR  L+ K+ +   LEL
Sbjct: 1    MDSVKSVNLSIQDNVLQPLFISSGSSTLHEALERLIETAKTSDGRLSLSSKDIIKPALEL 60

Query: 1864 SKSLSNAXXXXXXXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEI 1685
             +                     CAGEI NQ+ F                   S  D EI
Sbjct: 61   CQYPLRVPHQELLLAVKLLRNM-CAGEIKNQDLFIEQNGVGILSTLVGSMCSNSGSDNEI 119

Query: 1684 VRIGLQLLGNVSLAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDER 1505
            +R+ LQ LGNVSLAGE+H +AVW QFF  GF+++AR++  E  DPLC V++ C    +ER
Sbjct: 120  LRMVLQALGNVSLAGEKHQEAVWAQFFSLGFIDIARVQSKETCDPLCMVIYTCSEGTNER 179

Query: 1504 VAELCGFRGLQIVAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGD 1325
              EL   +GL I+ EIVRT + VGF E+WLKLLLS+IC                 D    
Sbjct: 180  SGELLSDQGLDIIVEIVRTVTAVGFSEDWLKLLLSKICFDESYFSSIFSKLSENCDEDVP 239

Query: 1324 FKCQDTFFTREQSFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSG 1145
               Q + F  +++FLLSILSE LN+++ EI VS+DF+L I+ IL+  V ++DF ++ KS 
Sbjct: 240  ---QISHFGDQEAFLLSILSEILNERLGEIVVSSDFSLSIFQILRNAVEIVDFSTRAKSS 296

Query: 1144 LPTGSPSIDVLGYSVIILRDVCAKEGARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEP 965
            LPTGS   DV+GY++ ++RD+ A +G            V                  LEP
Sbjct: 297  LPTGSSVTDVMGYALSLIRDITACDGPN----------VDTLLRAGLIKFLIGLLRNLEP 346

Query: 964  PEIIKKSISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGIL 785
            P +I++S  R   + +  P  S   CPYKGF+RDIV VIGNC Y R  VQDEIR+++GIL
Sbjct: 347  PTLIRRSTVRADTEDDTTPRFSKYCCPYKGFRRDIVGVIGNCSYGRISVQDEIREQDGIL 406

Query: 784  LLMQQCVTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEV 605
            L++QQCVTD+ NPFLREWGIWS+RN+LE N +N+  V E+E+QGSVD PEI G+GLRVE+
Sbjct: 407  LMLQQCVTDDDNPFLREWGIWSMRNILEGNVKNRELVVELEVQGSVDTPEIAGVGLRVEI 466

Query: 604  DQKNRRAKLVN 572
            D   RR KLVN
Sbjct: 467  DPVTRRPKLVN 477


>gb|EYU22629.1| hypothetical protein MIMGU_mgv1a025194mg, partial [Mimulus guttatus]
          Length = 467

 Score =  371 bits (952), Expect = e-100
 Identities = 210/487 (43%), Positives = 289/487 (59%), Gaps = 6/487 (1%)
 Frame = -1

Query: 2008 ENIIERLIAWSSSSTLTDALEILLQDSRTVQGRSDLALKNAVPLVLELSKSLSNAXXXXX 1829
            +N+++ L   S SSTL +ALE L++ ++T  GR  L+ K+ +   LEL +          
Sbjct: 1    DNVLQPLFISSGSSTLHEALERLIETAKTSDGRLSLSSKDIIKPALELCRYPLRVPHQEL 60

Query: 1828 XXXXXXXXXXLCAGEILNQNSFXXXXXXXXXXXXXXXXGFGSDLDCEIVRIGLQLLGNVS 1649
                       CAGEI NQ+ F                   S  D EI+R+ LQ LGNVS
Sbjct: 61   LLAVKLLRNL-CAGEIKNQDLFIEQNGVGILSTLVGSMCSNSGSDSEILRMVLQTLGNVS 119

Query: 1648 LAGEEHGKAVWDQFFPGGFLEVARIRQSEIVDPLCNVLHNCCYEKDERVAELCGFRGLQI 1469
            LAGE+H +AVW QFFP GF+++AR++  E  DPLC V++ C    +ER  EL   +GL I
Sbjct: 120  LAGEKHQEAVWAQFFPLGFIDIARVQSKETCDPLCMVIYTCSEGSNERWVELLSDQGLDI 179

Query: 1468 VAEIVRTASEVGFEENWLKLLLSQICXXXXXXXXXXXXXXLAGDASGDFKCQDTFFTREQ 1289
            + +IVRT + VGF E+W+KLL+S+IC                 D +     Q + F  E+
Sbjct: 180  IVQIVRTVTAVGFSEDWVKLLISKICFDESYFSSIFSKLSENCDENVP---QISHFGDEE 236

Query: 1288 SFLLSILSENLNQQINEISVSNDFALCIYGILKRVVGVIDFFSKGKSGLPTGSPSIDVLG 1109
            +FLLSILSE LN+++ EI VS +F+L IY IL+  V ++DF ++ K  LPTGS   D +G
Sbjct: 237  AFLLSILSEILNERLGEIVVSTNFSLSIYQILRNAVEIVDFSTRAKLSLPTGSSVTDAMG 296

Query: 1108 YSVIILRDVCAKEG------ARSSKIEGSIDVVXXXXXXXXXXXXXXXXXXLEPPEIIKK 947
            Y++ ++RD+ A +G      +R+  I+  ID+                    EPP +I++
Sbjct: 297  YALSLIRDITACDGPNVDTLSRAGLIKFLIDLFRNL----------------EPPTLIRR 340

Query: 946  SISRTKNQVEHKPSDSLKVCPYKGFQRDIVAVIGNCLYRRKHVQDEIRQKNGILLLMQQC 767
            S      + +  P  S   CPYKGF+RDIV VIGNC Y R  VQDEIR+++GILL++QQC
Sbjct: 341  STGHADTENDTTPRFSKYCCPYKGFRRDIVGVIGNCSYGRISVQDEIREQDGILLMLQQC 400

Query: 766  VTDECNPFLREWGIWSVRNLLEANEENQREVAEMELQGSVDVPEITGLGLRVEVDQKNRR 587
            VTDE NPFLREWGIWS+RN+LE N +N+  V ++E+QGSVD PEI G+GLRVE+D   RR
Sbjct: 401  VTDEDNPFLREWGIWSMRNILEGNVKNRELVVDLEVQGSVDTPEIAGVGLRVEIDHVTRR 460

Query: 586  AKLVNIS 566
             KLVN S
Sbjct: 461  PKLVNAS 467

