BLASTX nr result

ID: Rheum21_contig00008432 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00008432
         (1455 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY14176.1| ARM repeat superfamily protein, putative isoform ...   412   e-112
gb|EOY14175.1| ARM repeat superfamily protein, putative isoform ...   412   e-112
gb|EOY14173.1| ARM repeat superfamily protein, putative isoform ...   412   e-112
gb|EOY14172.1| ARM repeat superfamily protein, putative isoform ...   412   e-112
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   412   e-112
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   407   e-111
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       404   e-110
gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus pe...   398   e-108
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   395   e-107
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           395   e-107
ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   395   e-107
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   389   e-105
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   389   e-105
gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus...   387   e-105
ref|XP_002875041.1| hypothetical protein ARALYDRAFT_490543 [Arab...   385   e-104
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   382   e-103
ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     382   e-103
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   382   e-103
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   374   e-101
ref|XP_006413924.1| hypothetical protein EUTSA_v10025092mg [Eutr...   373   e-100

>gb|EOY14176.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao]
          Length = 519

 Score =  412 bits (1059), Expect = e-112
 Identities = 219/414 (52%), Positives = 279/414 (67%), Gaps = 20/414 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ+ F + NG E+V SV+        + D  ++R  LQVLAN SLAG
Sbjct: 77   LKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAAL-LSNPDSGVIRVSLQVLANVSLAG 135

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            + HQQ IW   FP EF  +AR+R +ET DPL MI+Y CCD   GL +ELC++ GL I+  
Sbjct: 136  EDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVG 195

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFK---------------CGDACFTSEQA 960
            +I T  +VGFGEDW KLLLSR+C+E+ H P +F                 GD  F SEQA
Sbjct: 196  IIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQA 255

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FLL+IISEIL+ERI+EI+V +  AL VL IFK++V  VDF+SR   SLPTG  +IDV+GY
Sbjct: 256  FLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGY 315

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM----- 615
            SL ILRD CA++G +GD  +DS  DV                    +PP+ IRK+     
Sbjct: 316  SLIILRDICAREG-VGDLKNDSL-DVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEGD 373

Query: 614  NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPF 435
            N   + S  K+CPYKGFRRD+++VIGNCA+RRK VQDE+R KNG+LLLLQQCV DD+NP+
Sbjct: 374  NQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPY 433

Query: 434  LREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPK 273
            LREWG+W+  +LL  + EN + VADLE Q SVD+PEL + GLRVE+D +TR+ K
Sbjct: 434  LREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>gb|EOY14175.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao]
          Length = 500

 Score =  412 bits (1059), Expect = e-112
 Identities = 219/414 (52%), Positives = 279/414 (67%), Gaps = 20/414 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ+ F + NG E+V SV+        + D  ++R  LQVLAN SLAG
Sbjct: 89   LKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAAL-LSNPDSGVIRVSLQVLANVSLAG 147

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            + HQQ IW   FP EF  +AR+R +ET DPL MI+Y CCD   GL +ELC++ GL I+  
Sbjct: 148  EDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVG 207

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFK---------------CGDACFTSEQA 960
            +I T  +VGFGEDW KLLLSR+C+E+ H P +F                 GD  F SEQA
Sbjct: 208  IIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQA 267

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FLL+IISEIL+ERI+EI+V +  AL VL IFK++V  VDF+SR   SLPTG  +IDV+GY
Sbjct: 268  FLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGY 327

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM----- 615
            SL ILRD CA++G +GD  +DS  DV                    +PP+ IRK+     
Sbjct: 328  SLIILRDICAREG-VGDLKNDSL-DVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEGD 385

Query: 614  NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPF 435
            N   + S  K+CPYKGFRRD+++VIGNCA+RRK VQDE+R KNG+LLLLQQCV DD+NP+
Sbjct: 386  NQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPY 445

Query: 434  LREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPK 273
            LREWG+W+  +LL  + EN + VADLE Q SVD+PEL + GLRVE+D +TR+ K
Sbjct: 446  LREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499


>gb|EOY14173.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao]
            gi|508722277|gb|EOY14174.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722280|gb|EOY14177.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
          Length = 488

 Score =  412 bits (1059), Expect = e-112
 Identities = 219/414 (52%), Positives = 279/414 (67%), Gaps = 20/414 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ+ F + NG E+V SV+        + D  ++R  LQVLAN SLAG
Sbjct: 77   LKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAAL-LSNPDSGVIRVSLQVLANVSLAG 135

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            + HQQ IW   FP EF  +AR+R +ET DPL MI+Y CCD   GL +ELC++ GL I+  
Sbjct: 136  EDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVG 195

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFK---------------CGDACFTSEQA 960
            +I T  +VGFGEDW KLLLSR+C+E+ H P +F                 GD  F SEQA
Sbjct: 196  IIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQA 255

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FLL+IISEIL+ERI+EI+V +  AL VL IFK++V  VDF+SR   SLPTG  +IDV+GY
Sbjct: 256  FLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGY 315

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM----- 615
            SL ILRD CA++G +GD  +DS  DV                    +PP+ IRK+     
Sbjct: 316  SLIILRDICAREG-VGDLKNDSL-DVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEGD 373

Query: 614  NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPF 435
            N   + S  K+CPYKGFRRD+++VIGNCA+RRK VQDE+R KNG+LLLLQQCV DD+NP+
Sbjct: 374  NQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPY 433

Query: 434  LREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPK 273
            LREWG+W+  +LL  + EN + VADLE Q SVD+PEL + GLRVE+D +TR+ K
Sbjct: 434  LREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>gb|EOY14172.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 531

 Score =  412 bits (1059), Expect = e-112
 Identities = 219/414 (52%), Positives = 279/414 (67%), Gaps = 20/414 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ+ F + NG E+V SV+        + D  ++R  LQVLAN SLAG
Sbjct: 89   LKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAAL-LSNPDSGVIRVSLQVLANVSLAG 147

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            + HQQ IW   FP EF  +AR+R +ET DPL MI+Y CCD   GL +ELC++ GL I+  
Sbjct: 148  EDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVG 207

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFK---------------CGDACFTSEQA 960
            +I T  +VGFGEDW KLLLSR+C+E+ H P +F                 GD  F SEQA
Sbjct: 208  IIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQA 267

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FLL+IISEIL+ERI+EI+V +  AL VL IFK++V  VDF+SR   SLPTG  +IDV+GY
Sbjct: 268  FLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGY 327

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM----- 615
            SL ILRD CA++G +GD  +DS  DV                    +PP+ IRK+     
Sbjct: 328  SLIILRDICAREG-VGDLKNDSL-DVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEGD 385

Query: 614  NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPF 435
            N   + S  K+CPYKGFRRD+++VIGNCA+RRK VQDE+R KNG+LLLLQQCV DD+NP+
Sbjct: 386  NQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPY 445

Query: 434  LREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPK 273
            LREWG+W+  +LL  + EN + VADLE Q SVD+PEL + GLRVE+D +TR+ K
Sbjct: 446  LREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499


>ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina]
            gi|567858312|ref|XP_006421839.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858314|ref|XP_006421840.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858316|ref|XP_006421841.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|568874427|ref|XP_006490317.1| PREDICTED:
            ataxin-10-like isoform X1 [Citrus sinensis]
            gi|568874429|ref|XP_006490318.1| PREDICTED:
            ataxin-10-like isoform X2 [Citrus sinensis]
            gi|557523711|gb|ESR35078.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523712|gb|ESR35079.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523713|gb|ESR35080.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523714|gb|ESR35081.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
          Length = 497

 Score =  412 bits (1058), Expect = e-112
 Identities = 221/419 (52%), Positives = 277/419 (66%), Gaps = 21/419 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ  FI+  G  IV  V+   G + D  D  I+R  LQVLAN SLAG
Sbjct: 77   LKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLD-KDYGIIRIALQVLANVSLAG 135

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            + HQ  IW   FP EF  +A +R +ETCDPL M+IY CCDGS GLF ELC + GLAIMAE
Sbjct: 136  ETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGLFKELCGDKGLAIMAE 195

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF----------KCGDA-----CFTSEQA 960
            ++ TA +VGF EDW K L+SR C+EE H P LF           C D+      F+SEQA
Sbjct: 196  IVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNCEDSNSREGTFSSEQA 255

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FLL+I+SEI++ERI+EI V N  AL VL IF K++G VDF +R  PSLPT S+AI+VLGY
Sbjct: 256  FLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGTPSLPTSSSAINVLGY 315

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM----- 615
            SL+ILR+ CA++   G ++ + + D+                    EPP+ IRK      
Sbjct: 316  SLSILRNICAREDPAGSSSVNRA-DLVDSLQSHGLIEMFLSLLRDLEPPAIIRKAMRQGE 374

Query: 614  -NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENP 438
                 SA   K CPY GFRRD+V+VIGNCA+RRK +QDE+R ++G+LLLLQQCV D++NP
Sbjct: 375  NQEGTSAKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERDGILLLLQQCVTDEDNP 434

Query: 437  FLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
            F REWG+W   +LL  N EN K+VADLE Q S++VPEL   GL+VE+D  TR+ KLVNV
Sbjct: 435  FSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLKVEVDKNTRRAKLVNV 493


>ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis]
            gi|223548954|gb|EEF50443.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 497

 Score =  407 bits (1046), Expect = e-111
 Identities = 224/429 (52%), Positives = 279/429 (65%), Gaps = 21/429 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ+ F+  NG E+VS+++   G  ++  D  I+R GLQVLAN SLAG
Sbjct: 71   LKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEP-DYGIIRLGLQVLANVSLAG 129

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            ++HQQ IW+  FP EFV +A+ R + TCDPL MIIY CCDG+ G   ELC + GLA++AE
Sbjct: 130  EKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCGDRGLAVVAE 189

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFKC----GDA-----------CFTSEQA 960
            ++ TA+ VG+GEDW KLLLSRIC+EE +   LF C    GD+            F++EQA
Sbjct: 190  IVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSSSDLFSTEQA 249

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            +LL  +SEIL+ER+++I V    A +V  IFK++VG VDF SR    LPTGSAA+DVLGY
Sbjct: 250  YLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTGSAAVDVLGY 309

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM----- 615
            SL ILRDTCA   L G      S DV                    EPP  I+K      
Sbjct: 310  SLTILRDTCA---LHGKGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIKKAMKQNE 366

Query: 614  -NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENP 438
             +  AS+   K CPYKGFRRDIV+VIGNCA +R  VQDE+R K+ + LLLQQCV D++NP
Sbjct: 367  NHEPASSRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQQCVTDEDNP 426

Query: 437  FLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            FLREWGLW   +LL  N EN K VA+LE Q +V VPEL   GLRVE+D  TR+ +LVNVS
Sbjct: 427  FLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNTRRARLVNVS 486

Query: 257  E*SSKDQQL 231
                KD  L
Sbjct: 487  STDDKDASL 495


>ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]
          Length = 468

 Score =  404 bits (1038), Expect = e-110
 Identities = 206/416 (49%), Positives = 277/416 (66%), Gaps = 19/416 (4%)
 Frame = -2

Query: 1451 RLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAGD 1272
            +LLRNLCAGE  NQ++F++ +G  +VSS++          D  +VR GLQVLAN  LAG 
Sbjct: 55   KLLRNLCAGEFENQNLFLEFDGVVVVSSILMSEAGSLRP-DHMLVRWGLQVLANVCLAGK 113

Query: 1271 RHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAEL 1092
            +HQ+ IW  +FP+ FV +AR+  +E CDPL M+IY CCDG+H  F ELC + GL ++AE+
Sbjct: 114  QHQKAIWEEIFPLGFVSLARLGTKEICDPLCMVIYTCCDGNHECFGELCSDSGLPVVAEI 173

Query: 1091 ISTATTVGFGEDWLKLLLSRICIEESHLPGLFK--------------CGDACFTSEQAFL 954
            + TA++  FGEDW+KLLLSRIC+EES LP LF                 D  F+ EQAFL
Sbjct: 174  VKTASSASFGEDWIKLLLSRICLEESQLPMLFPKLRFMDIPEGEDIDSKDYQFSFEQAFL 233

Query: 953  LQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSL 774
            LQI+SEIL+ER++++ V   +AL V  +FKK+VG ++ + R +  LP+GS A+D LGYSL
Sbjct: 234  LQILSEILNERLRDVVVSKDVALFVYGVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSL 293

Query: 773  AILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-----MNA 609
             ILRD CA D + G+  D  + DV                    EPP+ IRK      N 
Sbjct: 294  TILRDICAHDSVRGNPED--TNDVVDVLLSQDIIELLLILLGDLEPPAIIRKGIKQSENQ 351

Query: 608  AASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFLR 429
              ++   K CPYKGFRRDIVS+IGNC +RRK  QDE+R +NG+LLLLQQCV D++NPFLR
Sbjct: 352  EGASCSSKPCPYKGFRRDIVSLIGNCVYRRKHAQDEIRGRNGILLLLQQCVTDEDNPFLR 411

Query: 428  EWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
            EWG+W+  ++L  NEEN K+V++L+ Q S DVP++   GLR+E+D +TR+ KLVNV
Sbjct: 412  EWGIWSVRNMLEGNEENQKVVSELQLQGSADVPQISALGLRIEVDQKTRRAKLVNV 467


>gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica]
          Length = 492

 Score =  398 bits (1022), Expect = e-108
 Identities = 210/419 (50%), Positives = 274/419 (65%), Gaps = 20/419 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ  F++ +G  I+S+V+N   N   + D  ++R GLQVLAN SLAG
Sbjct: 77   LKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNS-ANISLEPDSGVIRMGLQVLANVSLAG 135

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            +RHQ  IW  LFP EF+ +AR++ RETCDPL M+I+ CCDGS  LF +LC + G+ IM E
Sbjct: 136  ERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPELFEKLCGDGGITIMKE 195

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF--------------KCGDACFTSEQAF 957
            ++ T   VGFGEDW+KLLLSRIC+E  +   LF              +  +  F+S+QAF
Sbjct: 196  IVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVEDTEFREDLFSSDQAF 255

Query: 956  LLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYS 777
             L+IIS+IL+ER++EI V    AL V  IFKK+VGA++  +R Q  LPTG++ IDVLGYS
Sbjct: 256  FLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQSGLPTGTSMIDVLGYS 315

Query: 776  LAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM------ 615
            L ILRD CAQ  L G   D    D                     EPP+ IRK       
Sbjct: 316  LTILRDVCAQKTLRGFQEDLG--DAVDVLLSHGLIELILCLLRDLEPPAIIRKAIKQGEG 373

Query: 614  NAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPF 435
                ++   K CPYKGFRRDIV+VIGNC ++RK VQDE+R ++G+LLLLQQC  D++NPF
Sbjct: 374  QDGTNSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDGILLLLQQCGLDEDNPF 433

Query: 434  LREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            L+EWG+W   +LL  NE+N ++V +LE Q SVD PE+   G RVE++P+T +PKLVNVS
Sbjct: 434  LKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRVEVNPETGRPKLVNVS 492


>ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa]
            gi|222861524|gb|EEE99066.1| ataxin-related family protein
            [Populus trichocarpa]
          Length = 496

 Score =  395 bits (1016), Expect = e-107
 Identities = 218/421 (51%), Positives = 269/421 (63%), Gaps = 22/421 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            LRL+RNLCAGE  NQ  FI  NG  I  +V+        + D  I+R GLQVLAN SLAG
Sbjct: 79   LRLMRNLCAGEVANQKSFIQLNGVGIFLTVLRSKKVASSEPDHGIIRMGLQVLANVSLAG 138

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
              HQQ IW  LF  E   +A++R + TCDPL MIIY CCDGS  L  +LC   GL I+ E
Sbjct: 139  KEHQQAIWGGLFHDELYMLAKVRSQGTCDPLCMIIYACCDGSPELVLQLCGNQGLPIVVE 198

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF-KCGDAC---------------FTSEQ 963
            +I TA+ VGFGE+WLKLLLSRIC+E+ + P LF +    C               F +EQ
Sbjct: 199  IIRTASLVGFGEEWLKLLLSRICLEDIYFPQLFSRIYSVCSYCENGEEISLSSNPFFTEQ 258

Query: 962  AFLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLG 783
            A+LL I+SEIL+ER++EI + N  AL +  IFKK+V A +F SR +  LPTG A IDVLG
Sbjct: 259  AYLLNIVSEILNERLKEITILNDFALCIFGIFKKSVEAFEFGSRAESRLPTGFAVIDVLG 318

Query: 782  YSLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-MNAA 606
            YSL ILRD CA +G +G    +   DV                    EPP  IRK MN A
Sbjct: 319  YSLTILRDICANNGGVG---KEDLVDVVDSLLSSGLLDLLLCLLRDLEPPKIIRKAMNQA 375

Query: 605  ASASQ-----KKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDEN 441
             +         KVCPYKGFRRD+V+VIGNCA+RRK VQD++R KNG+LL+LQQCV D++N
Sbjct: 376  GNQEATTSYFPKVCPYKGFRRDLVAVIGNCAYRRKHVQDDIRQKNGMLLMLQQCVTDEDN 435

Query: 440  PFLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
            PFLREWG+W+  +LL  N EN + VA+LE Q SVD+PEL   GL+VE+D  TR  KLVN+
Sbjct: 436  PFLREWGIWSMRNLLEGNSENQQAVAELELQGSVDMPELAGLGLKVEVDQNTRSAKLVNI 495

Query: 260  S 258
            S
Sbjct: 496  S 496


>ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]
          Length = 498

 Score =  395 bits (1015), Expect = e-107
 Identities = 209/416 (50%), Positives = 273/416 (65%), Gaps = 19/416 (4%)
 Frame = -2

Query: 1451 RLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAGD 1272
            +LLRNLCAGEA NQD F++ +G  +V SV+          D  +VR GLQVLAN SLAG 
Sbjct: 85   KLLRNLCAGEAANQDSFLELDGVAVVCSVLRSEAA-CSGPDHGLVRWGLQVLANVSLAGK 143

Query: 1271 RHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAEL 1092
            +HQ  IW  L+   FV +AR+  +ETCDPL M+IY CCDG+   F  L  E G  +MAE+
Sbjct: 144  QHQCAIWKELYLDGFVSLARLHTKETCDPLCMVIYTCCDGNPEWFKRLSSEDGWFVMAEI 203

Query: 1091 ISTATTVGFGEDWLKLLLSRICIEESHLPGLF--------------KCGDACFTSEQAFL 954
            + TA++  FGEDWLKLLLSRIC+EES LP LF              +  D  F+ EQAFL
Sbjct: 204  VRTASSASFGEDWLKLLLSRICLEESQLPVLFSKLQFADVPKVEVAESKDDHFSFEQAFL 263

Query: 953  LQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSL 774
            L+I+SEIL+ER +++ V   +AL V  IFK ++G ++ ++R +  LP+G   +DVLGYSL
Sbjct: 264  LRILSEILNERHKDVTVSKDVALFVFGIFKNSIGVLEHATRGKSGLPSGFVGVDVLGYSL 323

Query: 773  AILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-----MNA 609
             ILRD CAQDG+ G+  D  S DV                    EPP+ IRK      N 
Sbjct: 324  TILRDICAQDGVRGNTED--SNDVVDALLSYGLIELLLYLLEALEPPAIIRKGLKQCENQ 381

Query: 608  AASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFLR 429
              ++   K CPYKGFRRDIV++IGNC +RRK  QDE+R++NG+LLLLQQCV D++NPFLR
Sbjct: 382  DGASCSFKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQCVTDEDNPFLR 441

Query: 428  EWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
            EWG+W+  ++L  N+EN K+VA+LE Q S DVPE+   GLRVE+D +TR+ KLVN+
Sbjct: 442  EWGIWSVRNMLEGNDENQKVVAELEIQGSADVPEITSLGLRVEVDQRTRRAKLVNI 497


>ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera]
          Length = 494

 Score =  395 bits (1015), Expect = e-107
 Identities = 214/421 (50%), Positives = 278/421 (66%), Gaps = 22/421 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L+LLRNLCAGE  NQ++FI+ NG + VS+++  F     DSD  I+R GLQ+L N SLAG
Sbjct: 76   LKLLRNLCAGEMTNQNLFIEQNGVKAVSTILLSFVGLDSDSDYGIIRMGLQLLGNVSLAG 135

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            +RHQ+ +W+H FP  F++IAR+R  ET DPL M+IY C D SH   +E+C + GL I+AE
Sbjct: 136  ERHQRAVWHHFFPAGFLEIARVRTLETSDPLCMVIYTCFDQSHEFITEICGDQGLPILAE 195

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFK----CGDA-----------CFTSEQA 960
            ++ TA+TVGF EDWLKLLLSRIC+EESH P LF      G +            F SEQA
Sbjct: 196  IVRTASTVGFEEDWLKLLLSRICLEESHFPMLFSKLCPVGTSGNYESIEFKVDVFASEQA 255

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FL+ I++EIL+E+I ++ V + +AL VL I KK+ G +D  S C+     GS AI+VL Y
Sbjct: 256  FLMDIVAEILNEQINKMTVSSDVALCVLGILKKSAGVLDSVSTCKSGFSAGSNAINVLKY 315

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK------ 618
            SL IL++ CA+D     N +  S DV                    EPP+ IRK      
Sbjct: 316  SLTILKEICARDAQKSSN-EHGSVDVVDLLVSSGLLELLLCLLRDLEPPAIIRKAIKQGE 374

Query: 617  -MNAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDEN 441
              + AAS S K   PY+GFRRD+V+VIGNCA+RRK VQ+E+R +NG+LLLLQQCV D+EN
Sbjct: 375  NQDGAASYSPKHY-PYRGFRRDLVAVIGNCAYRRKHVQNEIRERNGILLLLQQCVTDEEN 433

Query: 440  PFLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
             FLREWG+W   +LL  N EN ++VA+LE Q SVDVPE+   GLRVE+D +T + KLVNV
Sbjct: 434  QFLREWGIWCVRNLLEGNVENQRVVAELELQGSVDVPEIAGLGLRVEVDQKTGRAKLVNV 493

Query: 260  S 258
            S
Sbjct: 494  S 494


>ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum]
            gi|460373805|ref|XP_004232704.1| PREDICTED:
            ataxin-10-like isoform 2 [Solanum lycopersicum]
          Length = 501

 Score =  389 bits (999), Expect = e-105
 Identities = 207/421 (49%), Positives = 268/421 (63%), Gaps = 22/421 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L++LRNLCAGE RNQ+ F+   G EIV  VI   G    D DC I+R GLQ+L N S+ G
Sbjct: 84   LKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLS-PDPDCMIIRVGLQLLGNYSVGG 142

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
               Q  +W  LFP +F+KIAR+R +E CDPL M+IY CCDG+ GL ++LC E GL I+ E
Sbjct: 143  GERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGLLTDLCSEQGLPILFE 202

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF----------------KCGDACFTSEQ 963
            ++ TA+ VG  E WLKLLLS++CIE SH+  +F                   D  F  EQ
Sbjct: 203  ILRTASAVGLKEVWLKLLLSKLCIEGSHISSIFFKLHSYPSVEDNGVVTHVADQ-FVIEQ 261

Query: 962  AFLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLG 783
             +LL I+SEIL+ER++ I V +  A  +  I K A G VDFS R +  LP GSA IDVLG
Sbjct: 262  PYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFSIRGKSDLPVGSAPIDVLG 321

Query: 782  YSLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIR------ 621
            YSL ++RD CA D L   + ++SS+DV                    EPP+ IR      
Sbjct: 322  YSLTLMRDICASDHL-SSSKEESSKDVVDVLVSSGLIEFLLNLLRDLEPPTTIRNAMKPD 380

Query: 620  KMNAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDEN 441
            ++      S  + CPY+GFRRDIV+++GNCA+RR+ VQDE+R+KNG+LLLLQQCV D++N
Sbjct: 381  QIKEGTIPSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNGILLLLQQCVIDEDN 440

Query: 440  PFLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
            PFLREWG+W   +LL  N EN   + DLE Q +VDVPEL++ GLRVE+DP TR+ KLVN 
Sbjct: 441  PFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRVEVDPVTRRTKLVNS 500

Query: 260  S 258
            S
Sbjct: 501  S 501


>ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10
            [Medicago truncatula]
          Length = 491

 Score =  389 bits (999), Expect = e-105
 Identities = 202/417 (48%), Positives = 277/417 (66%), Gaps = 19/417 (4%)
 Frame = -2

Query: 1451 RLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAGD 1272
            +LLRNLCAGE  NQ+MF++++G  IV S  ++  ++   SD  +VR GLQVLAN  LAG 
Sbjct: 78   KLLRNLCAGEILNQNMFLENDGVFIVVS--SILRSEVVGSDYMLVRWGLQVLANVCLAGK 135

Query: 1271 RHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAEL 1092
             HQ+ +W+ +FPV F+ +ARI K+E  DPL M+IY CCDG+   FSE+C + G  ++ E+
Sbjct: 136  EHQKAVWDEMFPVGFLSVARIGKKEVNDPLCMVIYTCCDGNDQWFSEVCSDGGWNVLVEI 195

Query: 1091 ISTATTVGFGEDWLKLLLSRICIEESHLPGLF--------------KCGDACFTSEQAFL 954
            + TA++  FGEDW+KLLLSRIC+E+S L  LF              K  D  F+SEQAFL
Sbjct: 196  VRTASSASFGEDWIKLLLSRICLEDSQLRVLFSKLRFMDIPDGEDTKTKDDQFSSEQAFL 255

Query: 953  LQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSL 774
            LQIIS+IL+ERI ++ +   +A  V  IFKK++G ++ + R +  LP+G   +DVLGYSL
Sbjct: 256  LQIISDILNERIGDVTISLEVASFVYGIFKKSIGVLEHAVRGKSGLPSGITDVDVLGYSL 315

Query: 773  AILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-----MNA 609
             +LRD CA D + G++ D    D+                     PP+ IRK      N 
Sbjct: 316  TMLRDICAHDSVRGNSEDTEVVDMLLSYGLIELVFILLGDLE---PPTIIRKGMKHSENP 372

Query: 608  AASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFLR 429
              ++S  K CPYKGFRRDIV++IGNC +RRK VQDE+R++NG+LLLLQQCV D++NP+LR
Sbjct: 373  DGASSSSKPCPYKGFRRDIVALIGNCVYRRKHVQDEIRSRNGILLLLQQCVTDEDNPYLR 432

Query: 428  EWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            EWG+W   ++L  NEEN K +++L+ Q S DVPE+   GLRVE+D +TR+ KLVNVS
Sbjct: 433  EWGIWCVRNMLEGNEENQKEISELQLQGSADVPEISALGLRVEVDQKTRRAKLVNVS 489


>gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris]
          Length = 498

 Score =  387 bits (995), Expect = e-105
 Identities = 204/416 (49%), Positives = 272/416 (65%), Gaps = 19/416 (4%)
 Frame = -2

Query: 1451 RLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAGD 1272
            +LLRNLCAGEA NQ  FI+ NG  +V SV+          D  +VR GLQVLAN SL G 
Sbjct: 82   KLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGP-DHRLVRWGLQVLANVSLGGK 140

Query: 1271 RHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAEL 1092
            +HQ+ IW  L+P+ F  +AR+  +E CDPL M+IY CCDG+   F +L  + G  ++AE+
Sbjct: 141  QHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDGNPEWFKKLSSDDGWPVVAEI 200

Query: 1091 ISTATTVGFGEDWLKLLLSRICIEESHLPGLF--------------KCGDACFTSEQAFL 954
            + TA++  F EDWLKLLLSRI +EES LP LF              +  +  F+ EQAFL
Sbjct: 201  VRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVDVPEGEVIESKNGQFSFEQAFL 260

Query: 953  LQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSL 774
            LQI+SEIL+ER+ ++ V   +AL V  IFKK++G ++ + R +  LP+G   +DVLGYSL
Sbjct: 261  LQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAMRGKSGLPSGFTGVDVLGYSL 320

Query: 773  AILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-----MNA 609
             ILRD CAQDG+ G+     ++DV                    EPP+ IRK      N 
Sbjct: 321  TILRDICAQDGMRGN-----TKDVVDVLLSYGLIEFLLSLLGALEPPAIIRKGLKQIENQ 375

Query: 608  AASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFLR 429
              ++   K CPYKGFRRDIV++IGNC +RRK  QDE+R++NG+LLLLQQCV D++NPFLR
Sbjct: 376  DNASCCSKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRDRNGILLLLQQCVTDEDNPFLR 435

Query: 428  EWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNV 261
            EWG+W+  ++L  N+EN KLVA+LE Q S DVPE+   GL+VE+D +TR+PKLVN+
Sbjct: 436  EWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINALGLQVEVDQRTRRPKLVNI 491


>ref|XP_002875041.1| hypothetical protein ARALYDRAFT_490543 [Arabidopsis lyrata subsp.
            lyrata] gi|297320878|gb|EFH51300.1| hypothetical protein
            ARALYDRAFT_490543 [Arabidopsis lyrata subsp. lyrata]
          Length = 474

 Score =  385 bits (989), Expect = e-104
 Identities = 205/406 (50%), Positives = 273/406 (67%), Gaps = 9/406 (2%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L++LRNLCAGE  NQ+ F+DH+G+ IVS +++    DF+      VR GLQVLAN  L G
Sbjct: 72   LKVLRNLCAGEVSNQNSFVDHDGSVIVSELLDSAIADFET-----VRFGLQVLANVVLFG 126

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            ++ Q+ +W   FP  F+ IA+IR+RETCDPL MI+Y C DGS  + SELC   GL I+AE
Sbjct: 127  EKRQRDVWLRFFPERFLSIAKIRRRETCDPLCMILYTCFDGSSEIASELCSSEGLTIIAE 186

Query: 1094 LISTATTVGFGED-WLKLLLSRICIEESHLPGLFK-----CGDACFTSEQAFLLQIISEI 933
             + T+++VG  ED WLKLL+SRIC+E+ + P LF        +  FTSEQAFLL+I+S+I
Sbjct: 187  TLRTSSSVGSVEDYWLKLLVSRICVEDDYFPKLFSKLYKVAENEKFTSEQAFLLRIVSDI 246

Query: 932  LHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSLAILRDTC 753
             +ERI ++ +    A  +L +FK++V   DF S  +  LPTGS  +DV+GYSL I+RD C
Sbjct: 247  ANERIGKVAIPKDTASSILGLFKQSVDVFDFVSGERSELPTGSTIVDVMGYSLVIIRDAC 306

Query: 752  AQDGLIGDNADDS-SEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRKM--NAAASASQKKV 582
            A   L   N D+  S D                     +PP+ I+K    +  S+S  K 
Sbjct: 307  AGGSLEELNKDNKDSGDTVELLLSSGLIELLLDLLRKLDPPTTIKKALNQSPTSSSSFKP 366

Query: 581  CPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFLREWGLWTAWS 402
            CPY+GFRRDIVSVIGNCA+RRK VQDE+R ++G++L+LQQCV DDENPFLREWGLW   +
Sbjct: 367  CPYRGFRRDIVSVIGNCAYRRKEVQDEIRERDGLVLMLQQCVTDDENPFLREWGLWCVRN 426

Query: 401  LLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVN 264
            LL  N EN ++VA+LE + SVDVP+L + GLRVE+DP+T +PKLVN
Sbjct: 427  LLEGNPENQEVVAELEIKGSVDVPQLREIGLRVEIDPKTARPKLVN 472


>ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum]
            gi|565401994|ref|XP_006366477.1| PREDICTED:
            ataxin-10-like isoform X2 [Solanum tuberosum]
            gi|565401996|ref|XP_006366478.1| PREDICTED:
            ataxin-10-like isoform X3 [Solanum tuberosum]
            gi|565401998|ref|XP_006366479.1| PREDICTED:
            ataxin-10-like isoform X4 [Solanum tuberosum]
            gi|565402000|ref|XP_006366480.1| PREDICTED:
            ataxin-10-like isoform X5 [Solanum tuberosum]
          Length = 504

 Score =  382 bits (982), Expect = e-103
 Identities = 205/420 (48%), Positives = 265/420 (63%), Gaps = 21/420 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L++LRNLCAGE RNQ+ F+   G EIV  VI   G    D DC I+R GLQ+L N S+ G
Sbjct: 87   LKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLT-PDPDCMIIRVGLQLLGNYSVGG 145

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
               Q  +W  LFP +F+KIAR+R  E CDPL M+IY CCDG+ GL ++LC E GL I+ E
Sbjct: 146  GERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGLLTDLCSEQGLPILIE 205

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF---------------KCGDACFTSEQA 960
            ++ TA+ V   E WLKLLLS++CIE S++  +F                     F  EQ 
Sbjct: 206  ILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNNGVVTHATDQFVIEQP 265

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            +LL I+SEI++++I+ I V +  AL +  I K A   VDFS R +  LP G A IDVLGY
Sbjct: 266  YLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGKSDLPVGFAPIDVLGY 325

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK------ 618
            SL ILRD CA D +     ++SS+DV                    EPP+ IRK      
Sbjct: 326  SLTILRDICASDHMTSSK-EESSKDVVDVLVSSGLIEFLLNLLRDLEPPTTIRKAMKQDQ 384

Query: 617  MNAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENP 438
            +     +S  + CPY+GFRRDIVS+IGNCA+RR++VQDE+R+KNG+LLLLQQCV D++NP
Sbjct: 385  ITEGIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKNGILLLLQQCVIDEDNP 444

Query: 437  FLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            FLREWG+W   +LL  N EN   + DLE Q +VDVPEL++ GLRVE+DP TR+ KLVN S
Sbjct: 445  FLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRVEVDPVTRRTKLVNAS 504


>ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]
          Length = 501

 Score =  382 bits (981), Expect = e-103
 Identities = 204/420 (48%), Positives = 265/420 (63%), Gaps = 21/420 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L++LRNLCAGE  NQ+ F+   G EIV  VI   G    D DC I+R GLQ+L N S+ G
Sbjct: 84   LKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLT-PDPDCMIIRVGLQLLGNYSVGG 142

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
               Q  +W  LFP +F+KIAR+R +E CDPL M+IY CCDG+ GL ++LC E GL I+ E
Sbjct: 143  GERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGLLTDLCSEKGLPILIE 202

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLFKCGDAC---------------FTSEQA 960
            ++ TA+ VG  E WLKLLLS++CIE S++  +F    +                F  EQ+
Sbjct: 203  ILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPSVENNGVVTHVVDQFVIEQS 262

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            +LL  +SEIL+ER++ I V +  A  +  I K A G  DFS R +  LP GSA IDVLGY
Sbjct: 263  YLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFSIRGKSDLPVGSAPIDVLGY 322

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK------ 618
            SL ILRD CA D +     ++SS+DV                    EPP+ IRK      
Sbjct: 323  SLTILRDICASDHMTSSK-EESSKDVVDVLVSSGLIEFLLNLLRDLEPPTTIRKAMKQDQ 381

Query: 617  MNAAASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENP 438
            +     +S  + CPY+GFRRDIV+++GNCA+RR+ VQDE+R+KNG+LLLLQQCV D++NP
Sbjct: 382  IKEGTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNGILLLLQQCVIDEDNP 441

Query: 437  FLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            FLREWG+W   +LL  N EN   + DLE Q +VDVPEL++ GLRVE+DP TR  KLVN S
Sbjct: 442  FLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRVEVDPVTRHTKLVNSS 501


>ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca
            subsp. vesca]
          Length = 492

 Score =  382 bits (981), Expect = e-103
 Identities = 207/420 (49%), Positives = 271/420 (64%), Gaps = 21/420 (5%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            LRLLRNLCAGE  NQ+ F++ NG  I+S++++   +   + D  I+  GLQVLAN +LAG
Sbjct: 77   LRLLRNLCAGEVANQNSFVEQNGVAIISNILSSASSL--EPDFGIICVGLQVLANVALAG 134

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            +R Q  IW  LF   FV +AR+R ++TC PL MIIY CCDG+  L ++LC + G+ I+ E
Sbjct: 135  ERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPELVAQLCGDCGVTIVKE 194

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF---------------KCGDACFTSEQA 960
            ++ TA   GFGEDW KLLLSRIC+EE +   LF               + G   F  EQ 
Sbjct: 195  IVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENGDDTEGGQESFLEEQE 254

Query: 959  FLLQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGY 780
            FLL+ +SEIL+ER+ EI V +  AL V  IFK ++  + +++R +  LPTGS  IDVLGY
Sbjct: 255  FLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGRSGLPTGSIDIDVLGY 314

Query: 779  SLAILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-MNAA- 606
            SL ILRD CAQ  L G   D  + DV                    EPP+ I+K +N A 
Sbjct: 315  SLTILRDICAQGTLRGCTVD--TMDVVDALISYGLIELLLCLLRDLEPPAIIKKSVNQAK 372

Query: 605  ----ASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENP 438
                ++ S  K CPYKGFRRDIV VIGNC + R+ VQDE+R K+G+LLLLQQCV DD+NP
Sbjct: 373  DQEGSNYSASKPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKDGLLLLLQQCVTDDDNP 432

Query: 437  FLREWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            +LREWG+W   +LL  N+EN + VA+LE Q SVDVP+L + GLRVE++P T +PKLVN+S
Sbjct: 433  YLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLRVEMNPATGRPKLVNIS 492


>ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. vesca]
          Length = 490

 Score =  374 bits (961), Expect = e-101
 Identities = 207/418 (49%), Positives = 271/418 (64%), Gaps = 19/418 (4%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            LRLLRNLCAGE  NQ+ F++ NG  IVS++++   +   + D  I+  GLQVLANA+LAG
Sbjct: 77   LRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSAISL--EPDFWIICVGLQVLANAALAG 134

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
            +R Q  IW  LF  +FV +AR+R ++TC PL MII  CCDG+  L ++LC + G+ I+ E
Sbjct: 135  ERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPELVAQLCGDCGVTILKE 194

Query: 1094 LISTATTVGFGEDWLKLLLSRICIEESHLPGLF-------------KCGDACFTSEQAFL 954
            ++ TA  V FGEDW KLLLSRIC+ E +   LF             + G   F+ EQ FL
Sbjct: 195  IVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVGENAEDTEGGRESFSKEQEFL 254

Query: 953  LQIISEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSL 774
            L+ +SEIL+E + EI V N  AL V  IFK ++  + +++R +  LPTGS  IDVLGYSL
Sbjct: 255  LKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGRSGLPTGSIDIDVLGYSL 314

Query: 773  AILRDTCAQDGLIGDNADDSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-MNAA--- 606
             ILRDTCAQ  L G   D  + DV                    EPP+ I+K +N A   
Sbjct: 315  TILRDTCAQGTLRGSTKD--TMDVVDALISYGLIELLLSLLRDLEPPAIIKKSINQAENQ 372

Query: 605  --ASASQKKVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFL 432
              +S+S  K CPYKGFRRDIV+VIGNC + RK VQDE+R K+G+LLLLQQCV DD+NP+ 
Sbjct: 373  EGSSSSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGLLLLLQQCVIDDDNPYS 432

Query: 431  REWGLWTAWSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVNVS 258
            REWG+W   +LL  N+EN + VA+LE + SVDVP L + GLRVE++  T +PKLVN+S
Sbjct: 433  REWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVEMNLATGRPKLVNIS 490


>ref|XP_006413924.1| hypothetical protein EUTSA_v10025092mg [Eutrema salsugineum]
            gi|557115094|gb|ESQ55377.1| hypothetical protein
            EUTSA_v10025092mg [Eutrema salsugineum]
          Length = 476

 Score =  373 bits (957), Expect = e-100
 Identities = 195/408 (47%), Positives = 268/408 (65%), Gaps = 11/408 (2%)
 Frame = -2

Query: 1454 LRLLRNLCAGEARNQDMFIDHNGAEIVSSVINLFGNDFDDSDCAIVRTGLQVLANASLAG 1275
            L++LRNLCAGE  NQD F+DH+G+ +VS ++     DF+      +R GLQVLAN  + G
Sbjct: 72   LKVLRNLCAGETWNQDAFVDHDGSVVVSDLLGSAIEDFET-----LRFGLQVLANVLVLG 126

Query: 1274 DRHQQVIWNHLFPVEFVKIARIRKRETCDPLTMIIYVCCDGSHGLFSELCKEPGLAIMAE 1095
             + Q+ +W   FP  F+ IA++R+RETCDPL MI+Y C DGS  + S+LC   GL I+ E
Sbjct: 127  QKRQRNVWLRFFPERFLAIAKVRRRETCDPLCMILYACFDGSSEIASQLCSNQGLDIVTE 186

Query: 1094 LISTATTVGFGED-WLKLLLSRICIEESHLPGLFKC--------GDACFTSEQAFLLQII 942
             + T+++VG  +D WLK+L+SR+C+E    P LF          G+  FTSE AFLL+++
Sbjct: 187  ALRTSSSVGSVDDYWLKVLVSRLCVEGDCFPDLFSKLYRTDIVQGNETFTSEHAFLLRMV 246

Query: 941  SEILHERIQEIRVGNSLALHVLNIFKKAVGAVDFSSRCQPSLPTGSAAIDVLGYSLAILR 762
            S+I +ER++++ +       ++ +FK+++G  DF    +  LPTGS  IDV+GYSL I+R
Sbjct: 247  SDIANERLKQVTIPKDTTHFIMGLFKQSIGVFDFVLGEKSELPTGSTVIDVMGYSLVIIR 306

Query: 761  DTCAQDGLIGDNAD-DSSEDVXXXXXXXXXXXXXXXXXXXXEPPSQIRK-MNAAASASQK 588
            DTCA   L    +D + S D                     EPP+ I+K +  + S+S  
Sbjct: 307  DTCAGGSLEELKSDTNGSGDTVELLLSSGLIELLLDLLRKLEPPTTIKKALKQSPSSSSS 366

Query: 587  KVCPYKGFRRDIVSVIGNCAHRRKFVQDEVRNKNGVLLLLQQCVHDDENPFLREWGLWTA 408
            K CPY+GFRRDIVSVIGNCA+RRK VQDE+R ++G+ L+LQQCV DDEN FLREWGLW  
Sbjct: 367  KPCPYRGFRRDIVSVIGNCAYRRKDVQDEIRERDGLFLMLQQCVTDDENSFLREWGLWCV 426

Query: 407  WSLLGNNEENSKLVADLEYQESVDVPELLKAGLRVELDPQTRKPKLVN 264
             +LL  N+EN K+VA+LE Q SVDVP+L + GLRVE+DP T +PKLVN
Sbjct: 427  RNLLEGNQENQKVVAELEMQGSVDVPQLREIGLRVEIDPVTARPKLVN 474

