BLASTX nr result

ID: Akebia23_contig00017128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00017128
         (1950 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   492   e-136
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   465   e-128
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   457   e-125
ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prun...   454   e-125
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   452   e-124
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       451   e-124
ref|XP_007022651.1| ARM repeat superfamily protein, putative iso...   449   e-123
ref|XP_007022650.1| ARM repeat superfamily protein, putative iso...   449   e-123
ref|XP_007022648.1| ARM repeat superfamily protein, putative iso...   449   e-123
ref|XP_007022647.1| ARM repeat superfamily protein, putative iso...   449   e-123
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   446   e-122
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           434   e-119
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   431   e-118
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   431   e-118
ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phas...   429   e-117
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   425   e-116
ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     424   e-116
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   417   e-113
gb|EYU23502.1| hypothetical protein MIMGU_mgv1a005564mg [Mimulus...   402   e-109
gb|EYU22629.1| hypothetical protein MIMGU_mgv1a025194mg, partial...   393   e-106

>ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera]
          Length = 494

 Score =  492 bits (1266), Expect = e-136
 Identities = 270/489 (55%), Positives = 333/489 (68%), Gaps = 4/489 (0%)
 Frame = -1

Query: 1893 MDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSN 1714
            +  ++PE+ILQPL + +N              S+T  GR DL S+NILP+VLQL +SLS 
Sbjct: 6    LKFSLPENILQPLFSVSNSSTLDETLELLIEASKTPGGRLDLGSKNILPVVLQLSQSLSY 65

Query: 1713 PXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXS-GIVRIGL 1537
            P               LCAGE+ NQNLFI +NGV+                  GI+R+GL
Sbjct: 66   PSGHDILLLSLKLLRNLCAGEMTNQNLFIEQNGVKAVSTILLSFVGLDSDSDYGIIRMGL 125

Query: 1536 QLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELC 1357
            QLLGNV+LAGE H+ AVW HFFP GF EIA+V  LE  D LCMV+YTC   + E + E+C
Sbjct: 126  QLLGNVSLAGERHQRAVWHHFFPAGFLEIARVRTLETSDPLCMVIYTCFDQSHEFITEIC 185

Query: 1356 GLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRG 1177
            G  GL I++EI+RTASTVGFEEDWLK LLSRIC +ESHF  LFS+L   G   +      
Sbjct: 186  GDQGLPILAEIVRTASTVGFEEDWLKLLLSRICLEESHFPMLFSKLCPVGTSGNYESIEF 245

Query: 1176 EDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPT 997
            + + FA+EQAFL+ I+AE LN+ ++++TVS+D ALC+LGI+K++ GV+D  S  K+    
Sbjct: 246  KVDVFASEQAFLMDIVAEILNEQINKMTVSSDVALCVLGILKKSAGVLDSVSTCKSGFSA 305

Query: 996  GSPAIDVLGYSITILRDICAQGTGSSKTE-DSTNXXXXXXXXXXXXXXXXXLRCLEPPEI 820
            GS AI+VL YS+TIL++ICA+    S  E  S +                 LR LEPP I
Sbjct: 306  GSNAINVLKYSLTILKEICARDAQKSSNEHGSVDVVDLLVSSGLLELLLCLLRDLEPPAI 365

Query: 819  IRKSVSQGENQ--VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQ 646
            IRK++ QGENQ    S S    PY+G+RRD+VAVIGNC YRRKH+Q+EIR++NGILLL+Q
Sbjct: 366  IRKAIKQGENQDGAASYSPKHYPYRGFRRDLVAVIGNCAYRRKHVQNEIRERNGILLLLQ 425

Query: 645  QCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKT 466
            QCVT+EEN FLREWGIW VRNLLEGN ENQR V ELE+QGSVDVPEIAGLGLRVEVDQKT
Sbjct: 426  QCVTDEENQFLREWGIWCVRNLLEGNVENQRVVAELELQGSVDVPEIAGLGLRVEVDQKT 485

Query: 465  RRAKLVNVS 439
             RAKLVNVS
Sbjct: 486  GRAKLVNVS 494


>ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina]
            gi|567858312|ref|XP_006421839.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858314|ref|XP_006421840.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858316|ref|XP_006421841.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|568874427|ref|XP_006490317.1| PREDICTED:
            ataxin-10-like isoform X1 [Citrus sinensis]
            gi|568874429|ref|XP_006490318.1| PREDICTED:
            ataxin-10-like isoform X2 [Citrus sinensis]
            gi|557523711|gb|ESR35078.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523712|gb|ESR35079.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523713|gb|ESR35080.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523714|gb|ESR35081.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
          Length = 497

 Score =  465 bits (1196), Expect = e-128
 Identities = 247/493 (50%), Positives = 325/493 (65%), Gaps = 3/493 (0%)
 Frame = -1

Query: 1911 MEHNPSMDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQL 1732
            M+   S+D+++ E +LQPLLT++N              S+T  GRSDLAS+NILP VLQL
Sbjct: 1    MDDASSLDISLSEDVLQPLLTTSNSSSLKDALEILIESSKTTVGRSDLASKNILPEVLQL 60

Query: 1731 IKSLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGI 1552
             +S+ +                LCAGEI NQ  FI + GV                  GI
Sbjct: 61   TQSIPHSSGCHYLLLSLKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLDKDYGI 120

Query: 1551 VRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDER 1372
            +RI LQ+L NV+LAGE H+ A+W  FFP  F  +A V   E CD LCMV+YTCC G+   
Sbjct: 121  IRIALQVLANVSLAGETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGL 180

Query: 1371 VGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDI 1192
              ELCG  GL I++EI+ TA++VGF+EDW K+L+SR C +E HF  LF +LS  G   + 
Sbjct: 181  FKELCGDKGLAIMAEIVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNC 240

Query: 1191 SEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGK 1012
             +    +  F++EQAFLL I++E +N+ ++EI V NDFAL +LGI  +++G+VD ++RG 
Sbjct: 241  EDSNSREGTFSSEQAFLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGT 300

Query: 1011 TSLPTGSPAIDVLGYSITILRDICA-QGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCL 835
             SLPT S AI+VLGYS++ILR+ICA +    S + +  +                 LR L
Sbjct: 301  PSLPTSSSAINVLGYSLSILRNICAREDPAGSSSVNRADLVDSLQSHGLIEMFLSLLRDL 360

Query: 834  EPPEIIRKSVSQGENQ--VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGI 661
            EPP IIRK++ QGENQ   ++ S   CPY G+RRD+VAVIGNC YRRKHIQDEIR+++GI
Sbjct: 361  EPPAIIRKAMRQGENQEGTSAKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERDGI 420

Query: 660  LLLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVE 481
            LLL+QQCVT+E+NPF REWGIW VRNLLEGN ENQ+ V +LE+QGS++VPE+  LGL+VE
Sbjct: 421  LLLLQQCVTDEDNPFSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLKVE 480

Query: 480  VDQKTRRAKLVNV 442
            VD+ TRRAKLVNV
Sbjct: 481  VDKNTRRAKLVNV 493


>ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa]
            gi|222861524|gb|EEE99066.1| ataxin-related family protein
            [Populus trichocarpa]
          Length = 496

 Score =  457 bits (1175), Expect = e-125
 Identities = 253/490 (51%), Positives = 321/490 (65%), Gaps = 6/490 (1%)
 Frame = -1

Query: 1890 DLTVPEH-ILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSN 1714
            +L+ P++  L+PL T++               ++T  GR+DLAS+NILP+VLQLI  L N
Sbjct: 8    ELSFPQNDFLEPLFTASKSSDLKETLEILIAIAKTDDGRADLASKNILPVVLQLITHLLN 67

Query: 1713 -PXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXS-GIVRIG 1540
             P               LCAGE+ NQ  FI  NGV                   GI+R+G
Sbjct: 68   DPFDHEYLSLSLRLMRNLCAGEVANQKSFIQLNGVGIFLTVLRSKKVASSEPDHGIIRMG 127

Query: 1539 LQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGEL 1360
            LQ+L NV+LAG+EH+ A+WG  F      +AKV     CD LCM++Y CC G+ E V +L
Sbjct: 128  LQVLANVSLAGKEHQQAIWGGLFHDELYMLAKVRSQGTCDPLCMIIYACCDGSPELVLQL 187

Query: 1359 CGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSA-GLVDDISEF 1183
            CG  GL IV EIIRTAS VGF E+WLK LLSRIC ++ +F  LFS + S     ++  E 
Sbjct: 188  CGNQGLPIVVEIIRTASLVGFGEEWLKLLLSRICLEDIYFPQLFSRIYSVCSYCENGEEI 247

Query: 1182 RGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSL 1003
                N F TEQA+LL+I++E LN+ L EIT+ NDFALCI GI K+++   +  SR ++ L
Sbjct: 248  SLSSNPFFTEQAYLLNIVSEILNERLKEITILNDFALCIFGIFKKSVEAFEFGSRAESRL 307

Query: 1002 PTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPE 823
            PTG   IDVLGYS+TILRDICA   G  K ED  +                 LR LEPP+
Sbjct: 308  PTGFAVIDVLGYSLTILRDICANNGGVGK-EDLVDVVDSLLSSGLLDLLLCLLRDLEPPK 366

Query: 822  IIRKSVSQGENQVTSDSL--NVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLM 649
            IIRK+++Q  NQ  + S    VCPYKG+RRD+VAVIGNC YRRKH+QD+IR+KNG+LL++
Sbjct: 367  IIRKAMNQAGNQEATTSYFPKVCPYKGFRRDLVAVIGNCAYRRKHVQDDIRQKNGMLLML 426

Query: 648  QQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQK 469
            QQCVT+E+NPFLREWGIWS+RNLLEGN ENQ+ V ELE+QGSVD+PE+AGLGL+VEVDQ 
Sbjct: 427  QQCVTDEDNPFLREWGIWSMRNLLEGNSENQQAVAELELQGSVDMPELAGLGLKVEVDQN 486

Query: 468  TRRAKLVNVS 439
            TR AKLVN+S
Sbjct: 487  TRSAKLVNIS 496


>ref|XP_007219054.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica]
            gi|462415516|gb|EMJ20253.1| hypothetical protein
            PRUPE_ppa004765mg [Prunus persica]
          Length = 492

 Score =  454 bits (1168), Expect = e-125
 Identities = 240/493 (48%), Positives = 318/493 (64%), Gaps = 2/493 (0%)
 Frame = -1

Query: 1911 MEHNPSMDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQL 1732
            M+     +  VPE +LQ LL+++N               R   GR+DLAS++ILP V+QL
Sbjct: 1    MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60

Query: 1731 IKSLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGI 1552
            I+SL  P               LCAGE+ NQ  F+ ++GV                 SG+
Sbjct: 61   IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120

Query: 1551 VRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDER 1372
            +R+GLQ+L NV+LAGE H+  +W   FP  F  +A+V   E CD LCMV++ CC G+ E 
Sbjct: 121  IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180

Query: 1371 VGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDI 1192
              +LCG GG+ I+ EI+RT + VGF EDW+K LLSRIC +  +F  LFS L  A   +++
Sbjct: 181  FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFA-TSENV 239

Query: 1191 SEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGK 1012
             +    ++ F+++QAF L I+++ LN+ L EITV  DFALC+ GI K+++G ++  +RG+
Sbjct: 240  EDTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQ 299

Query: 1011 TSLPTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLE 832
            + LPTG+  IDVLGYS+TILRD+CAQ T     ED  +                 LR LE
Sbjct: 300  SGLPTGTSMIDVLGYSLTILRDVCAQKTLRGFQEDLGDAVDVLLSHGLIELILCLLRDLE 359

Query: 831  PPEIIRKSVSQGENQ--VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGIL 658
            PP IIRK++ QGE Q    S S   CPYKG+RRDIVAVIGNC Y+RK +QDEIR+++GIL
Sbjct: 360  PPAIIRKAIKQGEGQDGTNSGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDGIL 419

Query: 657  LLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEV 478
            LL+QQC  +E+NPFL+EWGIW VRNLLEGNE+N+R VTELE+QGSVD PEIAGLG RVEV
Sbjct: 420  LLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRVEV 479

Query: 477  DQKTRRAKLVNVS 439
            + +T R KLVNVS
Sbjct: 480  NPETGRPKLVNVS 492


>ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis]
            gi|223548954|gb|EEF50443.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 497

 Score =  452 bits (1164), Expect = e-124
 Identities = 243/487 (49%), Positives = 314/487 (64%), Gaps = 2/487 (0%)
 Frame = -1

Query: 1893 MDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSN 1714
            M+L +PE +LQ L  ++               SR   GR++LA++++LP+VL+L KS+S 
Sbjct: 1    MELFLPEDLLQLLFRASKSYDLKEALEILIETSRIDDGRANLAAKDVLPLVLKLFKSISY 60

Query: 1713 PXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQ 1534
            P               LCAGEI NQN F+  NG E                 GI+R+GLQ
Sbjct: 61   PSGDQFLTLSLKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEPDYGIIRLGLQ 120

Query: 1533 LLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCG 1354
            +L NV+LAGE+H+ A+W  FFP  F  +AK      CD LCM++YTCC GN   V ELCG
Sbjct: 121  VLANVSLAGEKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCG 180

Query: 1353 LGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGE 1174
              GL +V+EI+RTAS VG+ EDW K LLSRIC +E +F  LFS    AG  ++       
Sbjct: 181  DRGLAVVAEIVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSS 240

Query: 1173 DNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTG 994
             + F+TEQA+LLS ++E LN+ L++I+VS DFA  + GI KR++GVVD  SRG + LPTG
Sbjct: 241  SDLFSTEQAYLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTG 300

Query: 993  SPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIR 814
            S A+DVLGYS+TILRD CA   G      S +                 L  LEPP +I+
Sbjct: 301  SAAVDVLGYSLTILRDTCAL-HGKGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIK 359

Query: 813  KSVSQGENQ--VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQC 640
            K++ Q EN    +S S   CPYKG+RRDIVAVIGNC ++R ++QDEIR+K+ I LL+QQC
Sbjct: 360  KAMKQNENHEPASSRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQQC 419

Query: 639  VTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRR 460
            VT+E+NPFLREWG+W VRNLLEGN ENQ+ V ELE+QG+V VPE++GLGLRVEVD  TRR
Sbjct: 420  VTDEDNPFLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNTRR 479

Query: 459  AKLVNVS 439
            A+LVNVS
Sbjct: 480  ARLVNVS 486


>ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]
          Length = 468

 Score =  451 bits (1159), Expect = e-124
 Identities = 234/448 (52%), Positives = 301/448 (67%), Gaps = 1/448 (0%)
 Frame = -1

Query: 1782 GRSDLASQNILPIVLQLIKSLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXX 1603
            GRS+LAS+ +LP VL ++ S + P               LCAGE  NQNLF+  +GV   
Sbjct: 21   GRSNLASKRVLPAVLNILNSQTLPLDHNLLSLCFKLLRNLCAGEFENQNLFLEFDGVVVV 80

Query: 1602 XXXXXXXXXXXXXXSGIVRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEIC 1423
                            +VR GLQ+L NV LAG++H+ A+W   FP+GF  +A++G  EIC
Sbjct: 81   SSILMSEAGSLRPDHMLVRWGLQVLANVCLAGKQHQKAIWEEIFPLGFVSLARLGTKEIC 140

Query: 1422 DTLCMVLYTCCSGNDERVGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESH 1243
            D LCMV+YTCC GN E  GELC   GL +V+EI++TAS+  F EDW+K LLSRIC +ES 
Sbjct: 141  DPLCMVIYTCCDGNHECFGELCSDSGLPVVAEIVKTASSASFGEDWIKLLLSRICLEESQ 200

Query: 1242 FLPLFSELSSAGLVDDISEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCIL 1063
               LF +L    + +   +   +D  F+ EQAFLL IL+E LN+ L ++ VS D AL + 
Sbjct: 201  LPMLFPKLRFMDIPEG-EDIDSKDYQFSFEQAFLLQILSEILNERLRDVVVSKDVALFVY 259

Query: 1062 GIVKRALGVVDLFSRGKTSLPTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXX 883
            G+ K+++GV++   RGK+ LP+GS A+D LGYS+TILRDICA  +     ED+ +     
Sbjct: 260  GVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSLTILRDICAHDSVRGNPEDTNDVVDVL 319

Query: 882  XXXXXXXXXXXXLRCLEPPEIIRKSVSQGENQV-TSDSLNVCPYKGYRRDIVAVIGNCLY 706
                        L  LEPP IIRK + Q ENQ   S S   CPYKG+RRDIV++IGNC+Y
Sbjct: 320  LSQDIIELLLILLGDLEPPAIIRKGIKQSENQEGASCSSKPCPYKGFRRDIVSLIGNCVY 379

Query: 705  RRKHIQDEIRKKNGILLLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQG 526
            RRKH QDEIR +NGILLL+QQCVT+E+NPFLREWGIWSVRN+LEGNEENQ+ V+EL++QG
Sbjct: 380  RRKHAQDEIRGRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGNEENQKVVSELQLQG 439

Query: 525  SVDVPEIAGLGLRVEVDQKTRRAKLVNV 442
            S DVP+I+ LGLR+EVDQKTRRAKLVNV
Sbjct: 440  SADVPQISALGLRIEVDQKTRRAKLVNV 467


>ref|XP_007022651.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao]
            gi|508722279|gb|EOY14176.1| ARM repeat superfamily
            protein, putative isoform 5 [Theobroma cacao]
          Length = 519

 Score =  449 bits (1155), Expect = e-123
 Identities = 239/476 (50%), Positives = 315/476 (66%), Gaps = 2/476 (0%)
 Frame = -1

Query: 1875 EHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNPXXXXX 1696
            E +LQPLL+++N              SRT   R++LA +NILP VL+L++S         
Sbjct: 13   EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72

Query: 1695 XXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQLLGNVA 1516
                      LCAGE+ NQN F  +NGVE                SG++R+ LQ+L NV+
Sbjct: 73   LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132

Query: 1515 LAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGLGGLKI 1336
            LAGE+H+ A+W  FFP  F+ +A+V   E  D LCM+LYTCC      V ELC   GL I
Sbjct: 133  LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192

Query: 1335 VSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGEDNFFAT 1156
            V  IIRT ++VGF EDW K LLSR+C ++ HF  +FS+       ++       D+ F +
Sbjct: 193  VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252

Query: 1155 EQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGSPAIDV 976
            EQAFLL I++E LN+ ++EI VS++FALC+LGI KR++ VVD  SRG +SLPTG  +IDV
Sbjct: 253  EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312

Query: 975  LGYSITILRDICA-QGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIRKSVSQ 799
            +GYS+ ILRDICA +G G  K  DS +                 LR L+PP IIRK + +
Sbjct: 313  MGYSLIILRDICAREGVGDLK-NDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 371

Query: 798  GENQ-VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQCVTEEEN 622
            G+NQ +   +  +CPYKG+RRD++AVIGNC YRRKH+QDEIR+KNGILLL+QQCVT+++N
Sbjct: 372  GDNQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDN 431

Query: 621  PFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRRAK 454
            P+LREWGIWS+RNLLEG+ ENQ+ V +LE+QGSVD+PE++ LGLRVEVDQKTRRAK
Sbjct: 432  PYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>ref|XP_007022650.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao]
            gi|508722278|gb|EOY14175.1| ARM repeat superfamily
            protein, putative isoform 4 [Theobroma cacao]
          Length = 500

 Score =  449 bits (1155), Expect = e-123
 Identities = 239/476 (50%), Positives = 315/476 (66%), Gaps = 2/476 (0%)
 Frame = -1

Query: 1875 EHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNPXXXXX 1696
            E +LQPLL+++N              SRT   R++LA +NILP VL+L++S         
Sbjct: 25   EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 84

Query: 1695 XXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQLLGNVA 1516
                      LCAGE+ NQN F  +NGVE                SG++R+ LQ+L NV+
Sbjct: 85   LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 144

Query: 1515 LAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGLGGLKI 1336
            LAGE+H+ A+W  FFP  F+ +A+V   E  D LCM+LYTCC      V ELC   GL I
Sbjct: 145  LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 204

Query: 1335 VSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGEDNFFAT 1156
            V  IIRT ++VGF EDW K LLSR+C ++ HF  +FS+       ++       D+ F +
Sbjct: 205  VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 264

Query: 1155 EQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGSPAIDV 976
            EQAFLL I++E LN+ ++EI VS++FALC+LGI KR++ VVD  SRG +SLPTG  +IDV
Sbjct: 265  EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 324

Query: 975  LGYSITILRDICA-QGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIRKSVSQ 799
            +GYS+ ILRDICA +G G  K  DS +                 LR L+PP IIRK + +
Sbjct: 325  MGYSLIILRDICAREGVGDLK-NDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 383

Query: 798  GENQ-VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQCVTEEEN 622
            G+NQ +   +  +CPYKG+RRD++AVIGNC YRRKH+QDEIR+KNGILLL+QQCVT+++N
Sbjct: 384  GDNQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDN 443

Query: 621  PFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRRAK 454
            P+LREWGIWS+RNLLEG+ ENQ+ V +LE+QGSVD+PE++ LGLRVEVDQKTRRAK
Sbjct: 444  PYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499


>ref|XP_007022648.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao]
            gi|590613384|ref|XP_007022649.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|590613394|ref|XP_007022652.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722276|gb|EOY14173.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722277|gb|EOY14174.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722280|gb|EOY14177.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
          Length = 488

 Score =  449 bits (1155), Expect = e-123
 Identities = 239/476 (50%), Positives = 315/476 (66%), Gaps = 2/476 (0%)
 Frame = -1

Query: 1875 EHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNPXXXXX 1696
            E +LQPLL+++N              SRT   R++LA +NILP VL+L++S         
Sbjct: 13   EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 72

Query: 1695 XXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQLLGNVA 1516
                      LCAGE+ NQN F  +NGVE                SG++R+ LQ+L NV+
Sbjct: 73   LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 132

Query: 1515 LAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGLGGLKI 1336
            LAGE+H+ A+W  FFP  F+ +A+V   E  D LCM+LYTCC      V ELC   GL I
Sbjct: 133  LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 192

Query: 1335 VSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGEDNFFAT 1156
            V  IIRT ++VGF EDW K LLSR+C ++ HF  +FS+       ++       D+ F +
Sbjct: 193  VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 252

Query: 1155 EQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGSPAIDV 976
            EQAFLL I++E LN+ ++EI VS++FALC+LGI KR++ VVD  SRG +SLPTG  +IDV
Sbjct: 253  EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 312

Query: 975  LGYSITILRDICA-QGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIRKSVSQ 799
            +GYS+ ILRDICA +G G  K  DS +                 LR L+PP IIRK + +
Sbjct: 313  MGYSLIILRDICAREGVGDLK-NDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 371

Query: 798  GENQ-VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQCVTEEEN 622
            G+NQ +   +  +CPYKG+RRD++AVIGNC YRRKH+QDEIR+KNGILLL+QQCVT+++N
Sbjct: 372  GDNQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDN 431

Query: 621  PFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRRAK 454
            P+LREWGIWS+RNLLEG+ ENQ+ V +LE+QGSVD+PE++ LGLRVEVDQKTRRAK
Sbjct: 432  PYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>ref|XP_007022647.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508722275|gb|EOY14172.1| ARM repeat superfamily
            protein, putative isoform 1 [Theobroma cacao]
          Length = 531

 Score =  449 bits (1155), Expect = e-123
 Identities = 239/476 (50%), Positives = 315/476 (66%), Gaps = 2/476 (0%)
 Frame = -1

Query: 1875 EHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNPXXXXX 1696
            E +LQPLL+++N              SRT   R++LA +NILP VL+L++S         
Sbjct: 25   EGVLQPLLSASNSSSLKEALEILIKVSRTAAARAELALRNILPTVLKLVESFHQTSSREY 84

Query: 1695 XXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQLLGNVA 1516
                      LCAGE+ NQN F  +NGVE                SG++R+ LQ+L NV+
Sbjct: 85   LVNSLKLLRNLCAGEVANQNAFFEQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVS 144

Query: 1515 LAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGLGGLKI 1336
            LAGE+H+ A+W  FFP  F+ +A+V   E  D LCM+LYTCC      V ELC   GL I
Sbjct: 145  LAGEDHQQAIWLKFFPNEFSVLARVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPI 204

Query: 1335 VSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGEDNFFAT 1156
            V  IIRT ++VGF EDW K LLSR+C ++ HF  +FS+       ++       D+ F +
Sbjct: 205  VVGIIRTVASVGFGEDWFKLLLSRLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLS 264

Query: 1155 EQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGSPAIDV 976
            EQAFLL I++E LN+ ++EI VS++FALC+LGI KR++ VVD  SRG +SLPTG  +IDV
Sbjct: 265  EQAFLLRIISEILNERIEEIQVSSEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDV 324

Query: 975  LGYSITILRDICA-QGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIRKSVSQ 799
            +GYS+ ILRDICA +G G  K  DS +                 LR L+PP IIRK + +
Sbjct: 325  MGYSLIILRDICAREGVGDLK-NDSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKE 383

Query: 798  GENQ-VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQCVTEEEN 622
            G+NQ +   +  +CPYKG+RRD++AVIGNC YRRKH+QDEIR+KNGILLL+QQCVT+++N
Sbjct: 384  GDNQGLNLSASKLCPYKGFRRDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDN 443

Query: 621  PFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRRAK 454
            P+LREWGIWS+RNLLEG+ ENQ+ V +LE+QGSVD+PE++ LGLRVEVDQKTRRAK
Sbjct: 444  PYLREWGIWSLRNLLEGHAENQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499


>ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca
            subsp. vesca]
          Length = 492

 Score =  446 bits (1146), Expect = e-122
 Identities = 237/493 (48%), Positives = 315/493 (63%), Gaps = 2/493 (0%)
 Frame = -1

Query: 1911 MEHNPSMDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQL 1732
            M++    + +VPEH+LQ LL+ +N               +T  GR DL+++N+LP V+QL
Sbjct: 1    MDNTTLPECSVPEHVLQALLSVSNSSKLVDSLEDLVQVCKTADGREDLSAKNVLPTVIQL 60

Query: 1731 IKSLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGI 1552
            ++SLS P               LCAGE+ NQN F+ +NGV                  GI
Sbjct: 61   VQSLSYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIISNILSSASSLEPDF-GI 119

Query: 1551 VRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDER 1372
            + +GLQ+L NVALAGE  + A+W   F   F  +A+V   + C  LCM++Y CC G  E 
Sbjct: 120  ICVGLQVLANVALAGERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPEL 179

Query: 1371 VGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDI 1192
            V +LCG  G+ IV EI++TA+  GF EDW K LLSRIC +E +F PLF  L   G  ++ 
Sbjct: 180  VAQLCGDCGVTIVKEIVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENG 239

Query: 1191 SEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGK 1012
             +  G    F  EQ FLL  ++E LN+ L+EITV +DFALC+ GI K ++ V+   +RG+
Sbjct: 240  DDTEGGQESFLEEQEFLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGR 299

Query: 1011 TSLPTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLE 832
            + LPTGS  IDVLGYS+TILRDICAQGT    T D+ +                 LR LE
Sbjct: 300  SGLPTGSIDIDVLGYSLTILRDICAQGTLRGCTVDTMDVVDALISYGLIELLLCLLRDLE 359

Query: 831  PPEIIRKSVSQGENQVTSD--SLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGIL 658
            PP II+KSV+Q ++Q  S+  +   CPYKG+RRDIV VIGNCLY R+ +QDEIR+K+G+L
Sbjct: 360  PPAIIKKSVNQAKDQEGSNYSASKPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKDGLL 419

Query: 657  LLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEV 478
            LL+QQCVT+++NP+LREWGIW VRNLLE N+ENQ+ V ELE+QGSVDVP++A LGLRVE+
Sbjct: 420  LLLQQCVTDDDNPYLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLRVEM 479

Query: 477  DQKTRRAKLVNVS 439
            +  T R KLVN+S
Sbjct: 480  NPATGRPKLVNIS 492


>ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]
          Length = 498

 Score =  434 bits (1116), Expect = e-119
 Identities = 237/487 (48%), Positives = 304/487 (62%), Gaps = 7/487 (1%)
 Frame = -1

Query: 1881 VPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNPXXX 1702
            + E  LQ L  ++N              +++  GR +LAS+ ILP VL ++ SL++    
Sbjct: 12   ISEDTLQLLFEASNSSNMEKSLEILIQNAKSDSGRLELASKRILPAVLNIVHSLTHASHH 71

Query: 1701 XXXXXXXXXXXXL------CAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIG 1540
                               CAGE  NQ+ F+  +GV                  G+VR G
Sbjct: 72   HHHQHNHILCLSFKLLRNLCAGEAANQDSFLELDGVAVVCSVLRSEAACSGPDHGLVRWG 131

Query: 1539 LQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGEL 1360
            LQ+L NV+LAG++H+ A+W   +  GF  +A++   E CD LCMV+YTCC GN E    L
Sbjct: 132  LQVLANVSLAGKQHQCAIWKELYLDGFVSLARLHTKETCDPLCMVIYTCCDGNPEWFKRL 191

Query: 1359 CGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFR 1180
                G  +++EI+RTAS+  F EDWLK LLSRIC +ES    LFS+L  A  V  +    
Sbjct: 192  SSEDGWFVMAEIVRTASSASFGEDWLKLLLSRICLEESQLPVLFSKLQFAD-VPKVEVAE 250

Query: 1179 GEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLP 1000
             +D+ F+ EQAFLL IL+E LN+   ++TVS D AL + GI K ++GV++  +RGK+ LP
Sbjct: 251  SKDDHFSFEQAFLLRILSEILNERHKDVTVSKDVALFVFGIFKNSIGVLEHATRGKSGLP 310

Query: 999  TGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEI 820
            +G   +DVLGYS+TILRDICAQ      TEDS +                 L  LEPP I
Sbjct: 311  SGFVGVDVLGYSLTILRDICAQDGVRGNTEDSNDVVDALLSYGLIELLLYLLEALEPPAI 370

Query: 819  IRKSVSQGENQV-TSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQ 643
            IRK + Q ENQ   S S   CPYKG+RRDIVA+IGNC+YRRKH QDEIR +NGILLL+QQ
Sbjct: 371  IRKGLKQCENQDGASCSFKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQ 430

Query: 642  CVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTR 463
            CVT+E+NPFLREWGIWSVRN+LEGN+ENQ+ V ELE+QGS DVPEI  LGLRVEVDQ+TR
Sbjct: 431  CVTDEDNPFLREWGIWSVRNMLEGNDENQKVVAELEIQGSADVPEITSLGLRVEVDQRTR 490

Query: 462  RAKLVNV 442
            RAKLVN+
Sbjct: 491  RAKLVNI 497


>ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. vesca]
          Length = 490

 Score =  431 bits (1109), Expect = e-118
 Identities = 233/493 (47%), Positives = 307/493 (62%), Gaps = 2/493 (0%)
 Frame = -1

Query: 1911 MEHNPSMDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQL 1732
            M++    + +VPE ++Q LL+ +N               +T  GR DLA++N+LP V+QL
Sbjct: 1    MDNTALPECSVPEDVIQALLSVSNSSNLVESMEDLIQVCKTADGREDLAAKNVLPTVIQL 60

Query: 1731 IKSLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGI 1552
            ++SL  P               LCAGE+ NQN F+ +NGV                   I
Sbjct: 61   VQSLLYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSAISLEPDFW-I 119

Query: 1551 VRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDER 1372
            + +GLQ+L N ALAGE  + A+W   F   F  +A+V   + C  LCM++ TCC G  E 
Sbjct: 120  ICVGLQVLANAALAGERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPEL 179

Query: 1371 VGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDI 1192
            V +LCG  G+ I+ EI++TA+ V F EDW K LLSRIC  E +F PLF  L   G  ++ 
Sbjct: 180  VAQLCGDCGVTILKEIVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVG--ENA 237

Query: 1191 SEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGK 1012
             +  G    F+ EQ FLL  ++E LN+ L EITV NDFALC+ GI K ++ V+   +RG+
Sbjct: 238  EDTEGGRESFSKEQEFLLKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGR 297

Query: 1011 TSLPTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLE 832
            + LPTGS  IDVLGYS+TILRD CAQGT    T+D+ +                 LR LE
Sbjct: 298  SGLPTGSIDIDVLGYSLTILRDTCAQGTLRGSTKDTMDVVDALISYGLIELLLSLLRDLE 357

Query: 831  PPEIIRKSVSQGENQVTSDS--LNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGIL 658
            PP II+KS++Q ENQ  S S  L  CPYKG+RRDIVAVIGNCLY RK +QDEIR+K+G+L
Sbjct: 358  PPAIIKKSINQAENQEGSSSSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGLL 417

Query: 657  LLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEV 478
            LL+QQCV +++NP+ REWGIW  RNLL+ N+ENQR V ELE++GSVDVP +A LGLRVE+
Sbjct: 418  LLLQQCVIDDDNPYSREWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVEM 477

Query: 477  DQKTRRAKLVNVS 439
            +  T R KLVN+S
Sbjct: 478  NLATGRPKLVNIS 490


>ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10
            [Medicago truncatula]
          Length = 491

 Score =  431 bits (1107), Expect = e-118
 Identities = 226/455 (49%), Positives = 303/455 (66%), Gaps = 1/455 (0%)
 Frame = -1

Query: 1794 RTVHGRSDLASQNILPIVLQLIKSLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNG 1615
            ++   RS  A + ILP +L ++ S   P               LCAGEILNQN+F+  +G
Sbjct: 43   KSTSNRSLYACKKILPTILTVLHS---PPSLHILSLCFKLLRNLCAGEILNQNMFLENDG 99

Query: 1614 VEXXXXXXXXXXXXXXXXSGIVRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGR 1435
            V                   +VR GLQ+L NV LAG+EH+ AVW   FP+GF  +A++G+
Sbjct: 100  VFIVVSSILRSEVVGSDYM-LVRWGLQVLANVCLAGKEHQKAVWDEMFPVGFLSVARIGK 158

Query: 1434 LEICDTLCMVLYTCCSGNDERVGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICF 1255
             E+ D LCMV+YTCC GND+   E+C  GG  ++ EI+RTAS+  F EDW+K LLSRIC 
Sbjct: 159  KEVNDPLCMVIYTCCDGNDQWFSEVCSDGGWNVLVEIVRTASSASFGEDWIKLLLSRICL 218

Query: 1254 KESHFLPLFSELSSAGLVDDISEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFA 1075
            ++S    LFS+L    + D   + + +D+ F++EQAFLL I+++ LN+ + ++T+S + A
Sbjct: 219  EDSQLRVLFSKLRFMDIPDG-EDTKTKDDQFSSEQAFLLQIISDILNERIGDVTISLEVA 277

Query: 1074 LCILGIVKRALGVVDLFSRGKTSLPTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNX 895
              + GI K+++GV++   RGK+ LP+G   +DVLGYS+T+LRDICA  +    +ED T  
Sbjct: 278  SFVYGIFKKSIGVLEHAVRGKSGLPSGITDVDVLGYSLTMLRDICAHDSVRGNSED-TEV 336

Query: 894  XXXXXXXXXXXXXXXXLRCLEPPEIIRKSVSQGEN-QVTSDSLNVCPYKGYRRDIVAVIG 718
                            L  LEPP IIRK +   EN    S S   CPYKG+RRDIVA+IG
Sbjct: 337  VDMLLSYGLIELVFILLGDLEPPTIIRKGMKHSENPDGASSSSKPCPYKGFRRDIVALIG 396

Query: 717  NCLYRRKHIQDEIRKKNGILLLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTEL 538
            NC+YRRKH+QDEIR +NGILLL+QQCVT+E+NP+LREWGIW VRN+LEGNEENQ++++EL
Sbjct: 397  NCVYRRKHVQDEIRSRNGILLLLQQCVTDEDNPYLREWGIWCVRNMLEGNEENQKEISEL 456

Query: 537  EVQGSVDVPEIAGLGLRVEVDQKTRRAKLVNVS*N 433
            ++QGS DVPEI+ LGLRVEVDQKTRRAKLVNVS N
Sbjct: 457  QLQGSADVPEISALGLRVEVDQKTRRAKLVNVSGN 491


>ref|XP_007148734.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris]
            gi|561021998|gb|ESW20728.1| hypothetical protein
            PHAVU_005G009900g [Phaseolus vulgaris]
          Length = 498

 Score =  429 bits (1103), Expect = e-117
 Identities = 236/485 (48%), Positives = 304/485 (62%), Gaps = 5/485 (1%)
 Frame = -1

Query: 1881 VPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNPXXX 1702
            + E  LQ L  ++N              +++  GR +LAS+ ILP VL +++SL+     
Sbjct: 11   ISEDTLQLLFQASNSSNLEKSLEILIQNAKSDSGRLELASKRILPAVLNIVQSLAQASHH 70

Query: 1701 XXXXXXXXXXXXL----CAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQ 1534
                        L    CAGE  NQ  FI  NGV                   +VR GLQ
Sbjct: 71   HHHNQTFSLCFKLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGPDHRLVRWGLQ 130

Query: 1533 LLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCG 1354
            +L NV+L G++H+ A+W   +PIGF  +A+VG  EICD LCMV+YTCC GN E   +L  
Sbjct: 131  VLANVSLGGKQHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDGNPEWFKKLSS 190

Query: 1353 LGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGE 1174
              G  +V+EI+RTAS+  F+EDWLK LLSRI  +ES    LFS+L S   V +      +
Sbjct: 191  DDGWPVVAEIVRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVD-VPEGEVIESK 249

Query: 1173 DNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTG 994
            +  F+ EQAFLL IL+E LN+ L ++TVS D AL + GI K+++GV++   RGK+ LP+G
Sbjct: 250  NGQFSFEQAFLLQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAMRGKSGLPSG 309

Query: 993  SPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIR 814
               +DVLGYS+TILRDICAQ      T+D  +                    LEPP IIR
Sbjct: 310  FTGVDVLGYSLTILRDICAQDGMRGNTKDVVDVLLSYGLIEFLLSLLG---ALEPPAIIR 366

Query: 813  KSVSQGENQVTSDSLNV-CPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQCV 637
            K + Q ENQ  +   +  CPYKG+RRDIVA+IGNC+YRRKH QDEIR +NGILLL+QQCV
Sbjct: 367  KGLKQIENQDNASCCSKPCPYKGFRRDIVALIGNCVYRRKHAQDEIRDRNGILLLLQQCV 426

Query: 636  TEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRRA 457
            T+E+NPFLREWGIWSVRN+LEGN+ENQ+ V ELE+QGS DVPEI  LGL+VEVDQ+TRR 
Sbjct: 427  TDEDNPFLREWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINALGLQVEVDQRTRRP 486

Query: 456  KLVNV 442
            KLVN+
Sbjct: 487  KLVNI 491


>ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum]
            gi|460373805|ref|XP_004232704.1| PREDICTED:
            ataxin-10-like isoform 2 [Solanum lycopersicum]
          Length = 501

 Score =  425 bits (1092), Expect = e-116
 Identities = 236/487 (48%), Positives = 300/487 (61%), Gaps = 3/487 (0%)
 Frame = -1

Query: 1890 DLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNP 1711
            +LT+PE++ + LL  +N              S+   GR DL+S+N++  VL L +SLS+ 
Sbjct: 15   ELTIPENVAKELLLVSNSSSLETALDKLIQLSKEGGGRLDLSSKNVVTTVLHLCQSLSSI 74

Query: 1710 XXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQL 1531
                           LCAGEI NQN F+ + GVE                  I+R+GLQL
Sbjct: 75   SYRNLLLLSLKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLSPDPDCMIIRVGLQL 134

Query: 1530 LGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGL 1351
            LGN ++ G E +  VW   FP  F +IA+V   EICD LCMV+YTCC G D  + +LC  
Sbjct: 135  LGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGLLTDLCSE 194

Query: 1350 GGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGED 1171
             GL I+ EI+RTAS VG +E WLK LLS++C + SH   +F +L S   V+D        
Sbjct: 195  QGLPILFEILRTASAVGLKEVWLKLLLSKLCIEGSHISSIFFKLHSYPSVEDNGVVTHVA 254

Query: 1170 NFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGS 991
            + F  EQ +LLSIL+E LN+ ++ I VS+DFA  I GI+K A GVVD   RGK+ LP GS
Sbjct: 255  DQFVIEQPYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFSIRGKSDLPVGS 314

Query: 990  PAIDVLGYSITILRDICAQG-TGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIR 814
              IDVLGYS+T++RDICA     SSK E S +                 LR LEPP  IR
Sbjct: 315  APIDVLGYSLTLMRDICASDHLSSSKEESSKDVVDVLVSSGLIEFLLNLLRDLEPPTTIR 374

Query: 813  KSV--SQGENQVTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQC 640
             ++   Q +      S   CPY+G+RRDIVA++GNC YRR+H+QDEIR KNGILLL+QQC
Sbjct: 375  NAMKPDQIKEGTIPSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNGILLLLQQC 434

Query: 639  VTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRR 460
            V +E+NPFLREWGIW VRNLLEGN ENQ  +T+LE+QG+VDVPE+  LGLRVEVD  TRR
Sbjct: 435  VIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRVEVDPVTRR 494

Query: 459  AKLVNVS 439
             KLVN S
Sbjct: 495  TKLVNSS 501


>ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]
          Length = 501

 Score =  424 bits (1090), Expect = e-116
 Identities = 234/487 (48%), Positives = 302/487 (62%), Gaps = 3/487 (0%)
 Frame = -1

Query: 1890 DLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNP 1711
            +LT+PE++ + LL  +N              ++   GR DL+S+N++  VL L +SLS+ 
Sbjct: 15   ELTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHLCQSLSSI 74

Query: 1710 XXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQL 1531
                           LCAGEI+NQN F+ + GVE                  I+R+GLQL
Sbjct: 75   SYRYLLLLSLKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLTPDPDCMIIRVGLQL 134

Query: 1530 LGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGL 1351
            LGN ++ G E +  VW   FP  F +IA+V   EICD LCMV+YTCC G D  + +LC  
Sbjct: 135  LGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDGTDGLLTDLCSE 194

Query: 1350 GGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGED 1171
             GL I+ EI+RTAS VG +E WLK LLS++C + S+   +F +L S   V++        
Sbjct: 195  KGLPILIEILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPSVENNGVVTHVV 254

Query: 1170 NFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGS 991
            + F  EQ++LLS L+E LN+ ++ I VS+DFA  I GI+K A GV D   RGK+ LP GS
Sbjct: 255  DQFVIEQSYLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFSIRGKSDLPVGS 314

Query: 990  PAIDVLGYSITILRDICAQG-TGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIR 814
              IDVLGYS+TILRDICA     SSK E S +                 LR LEPP  IR
Sbjct: 315  APIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDLEPPTTIR 374

Query: 813  KSVSQGENQ--VTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQC 640
            K++ Q + +    S S   CPY+G+RRDIVA++GNC YRR+H+QDEIR KNGILLL+QQC
Sbjct: 375  KAMKQDQIKEGTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEIRDKNGILLLLQQC 434

Query: 639  VTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRR 460
            V +E+NPFLREWGIW VRNLLEGN ENQ  +T+LE+QG+VDVPE+  LGLRVEVD  TR 
Sbjct: 435  VIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRVEVDPVTRH 494

Query: 459  AKLVNVS 439
             KLVN S
Sbjct: 495  TKLVNSS 501


>ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum]
            gi|565401994|ref|XP_006366477.1| PREDICTED:
            ataxin-10-like isoform X2 [Solanum tuberosum]
            gi|565401996|ref|XP_006366478.1| PREDICTED:
            ataxin-10-like isoform X3 [Solanum tuberosum]
            gi|565401998|ref|XP_006366479.1| PREDICTED:
            ataxin-10-like isoform X4 [Solanum tuberosum]
            gi|565402000|ref|XP_006366480.1| PREDICTED:
            ataxin-10-like isoform X5 [Solanum tuberosum]
          Length = 504

 Score =  417 bits (1071), Expect = e-113
 Identities = 231/487 (47%), Positives = 299/487 (61%), Gaps = 3/487 (0%)
 Frame = -1

Query: 1890 DLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIKSLSNP 1711
            ++T+PE++ + LL  +N              ++   GR DL+S+N++  VL L +SLS+ 
Sbjct: 18   EVTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHLCQSLSSI 77

Query: 1710 XXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQL 1531
                           LCAGEI NQN F+ + GVE                  I+R+GLQL
Sbjct: 78   SYRQLLLSSLKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLTPDPDCMIIRVGLQL 137

Query: 1530 LGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGL 1351
            LGN ++ G E +  VW   FP  F +IA+V   EICD LCMV+YTCC G D  + +LC  
Sbjct: 138  LGNYSVGGGERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGLLTDLCSE 197

Query: 1350 GGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGED 1171
             GL I+ EI+RTAS V  +E WLK LLS++C + S+   +F +L S   + +        
Sbjct: 198  QGLPILIEILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNNGVVTHAT 257

Query: 1170 NFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGS 991
            + F  EQ +LLSIL+E +N  ++ I VS+DFAL I GI+K A  VVD   RGK+ LP G 
Sbjct: 258  DQFVIEQPYLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGKSDLPVGF 317

Query: 990  PAIDVLGYSITILRDICAQG-TGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIR 814
              IDVLGYS+TILRDICA     SSK E S +                 LR LEPP  IR
Sbjct: 318  APIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDLEPPTTIR 377

Query: 813  KSVSQGE--NQVTSDSLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQC 640
            K++ Q +    + S S   CPY+G+RRDIV++IGNC YRR+++QDEIR KNGILLL+QQC
Sbjct: 378  KAMKQDQITEGIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKNGILLLLQQC 437

Query: 639  VTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRR 460
            V +E+NPFLREWGIW VRNLLEGN ENQ  +T+LE+QG+VDVPE+  LGLRVEVD  TRR
Sbjct: 438  VIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLRVEVDPVTRR 497

Query: 459  AKLVNVS 439
             KLVN S
Sbjct: 498  TKLVNAS 504


>gb|EYU23502.1| hypothetical protein MIMGU_mgv1a005564mg [Mimulus guttatus]
          Length = 479

 Score =  402 bits (1033), Expect = e-109
 Identities = 227/493 (46%), Positives = 309/493 (62%), Gaps = 4/493 (0%)
 Frame = -1

Query: 1911 MEHNPSMDLTVPEHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQL 1732
            M+   S++L++ +++LQPL  S+               ++T  GR  L+S++I+   L+L
Sbjct: 1    MDSVKSVNLSIQDNVLQPLFISSGSSTLHEALERLIETAKTSDGRLSLSSKDIIKPALEL 60

Query: 1731 IK-SLSNPXXXXXXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSG 1555
             +  L  P                CAGEI NQ+LFI +NGV                 + 
Sbjct: 61   CQYPLRVPHQELLLAVKLLRNM--CAGEIKNQDLFIEQNGVGILSTLVGSMCSNSGSDNE 118

Query: 1554 IVRIGLQLLGNVALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDE 1375
            I+R+ LQ LGNV+LAGE+H+ AVW  FF +GF +IA+V   E CD LCMV+YTC  G +E
Sbjct: 119  ILRMVLQALGNVSLAGEKHQEAVWAQFFSLGFIDIARVQSKETCDPLCMVIYTCSEGTNE 178

Query: 1374 RVGELCGLGGLKIVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDD 1195
            R GEL    GL I+ EI+RT + VGF EDWLK LLS+ICF ES+F  +FS+LS     ++
Sbjct: 179  RSGELLSDQGLDIIVEIVRTVTAVGFSEDWLKLLLSKICFDESYFSSIFSKLS-----EN 233

Query: 1194 ISEFRGEDNFFATEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRG 1015
              E   + + F  ++AFLLSIL+E LN+ L EI VS+DF+L I  I++ A+ +VD  +R 
Sbjct: 234  CDEDVPQISHFGDQEAFLLSILSEILNERLGEIVVSSDFSLSIFQILRNAVEIVDFSTRA 293

Query: 1014 KTSLPTGSPAIDVLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCL 835
            K+SLPTGS   DV+GY+++++RDI A    +  T                      LR L
Sbjct: 294  KSSLPTGSSVTDVMGYALSLIRDITACDGPNVDT---------LLRAGLIKFLIGLLRNL 344

Query: 834  EPPEIIRKSVSQGENQVTSD---SLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNG 664
            EPP +IR+S  + + +  +    S   CPYKG+RRDIV VIGNC Y R  +QDEIR+++G
Sbjct: 345  EPPTLIRRSTVRADTEDDTTPRFSKYCCPYKGFRRDIVGVIGNCSYGRISVQDEIREQDG 404

Query: 663  ILLLMQQCVTEEENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRV 484
            ILL++QQCVT+++NPFLREWGIWS+RN+LEGN +N+  V ELEVQGSVD PEIAG+GLRV
Sbjct: 405  ILLMLQQCVTDDDNPFLREWGIWSMRNILEGNVKNRELVVELEVQGSVDTPEIAGVGLRV 464

Query: 483  EVDQKTRRAKLVN 445
            E+D  TRR KLVN
Sbjct: 465  EIDPVTRRPKLVN 477


>gb|EYU22629.1| hypothetical protein MIMGU_mgv1a025194mg, partial [Mimulus guttatus]
          Length = 467

 Score =  393 bits (1009), Expect = e-106
 Identities = 224/483 (46%), Positives = 298/483 (61%), Gaps = 4/483 (0%)
 Frame = -1

Query: 1875 EHILQPLLTSANXXXXXXXXXXXXXXSRTVHGRSDLASQNILPIVLQLIK-SLSNPXXXX 1699
            +++LQPL  S+               ++T  GR  L+S++I+   L+L +  L  P    
Sbjct: 1    DNVLQPLFISSGSSTLHEALERLIETAKTSDGRLSLSSKDIIKPALELCRYPLRVPHQEL 60

Query: 1698 XXXXXXXXXXXLCAGEILNQNLFITKNGVEXXXXXXXXXXXXXXXXSGIVRIGLQLLGNV 1519
                        CAGEI NQ+LFI +NGV                 S I+R+ LQ LGNV
Sbjct: 61   LLAVKLLRNL--CAGEIKNQDLFIEQNGVGILSTLVGSMCSNSGSDSEILRMVLQTLGNV 118

Query: 1518 ALAGEEHKTAVWGHFFPIGFTEIAKVGRLEICDTLCMVLYTCCSGNDERVGELCGLGGLK 1339
            +LAGE+H+ AVW  FFP+GF +IA+V   E CD LCMV+YTC  G++ER  EL    GL 
Sbjct: 119  SLAGEKHQEAVWAQFFPLGFIDIARVQSKETCDPLCMVIYTCSEGSNERWVELLSDQGLD 178

Query: 1338 IVSEIIRTASTVGFEEDWLKWLLSRICFKESHFLPLFSELSSAGLVDDISEFRGEDNFFA 1159
            I+ +I+RT + VGF EDW+K L+S+ICF ES+F  +FS+LS     ++  E   + + F 
Sbjct: 179  IIVQIVRTVTAVGFSEDWVKLLISKICFDESYFSSIFSKLS-----ENCDENVPQISHFG 233

Query: 1158 TEQAFLLSILAESLNQNLDEITVSNDFALCILGIVKRALGVVDLFSRGKTSLPTGSPAID 979
             E+AFLLSIL+E LN+ L EI VS +F+L I  I++ A+ +VD  +R K SLPTGS   D
Sbjct: 234  DEEAFLLSILSEILNERLGEIVVSTNFSLSIYQILRNAVEIVDFSTRAKLSLPTGSSVTD 293

Query: 978  VLGYSITILRDICAQGTGSSKTEDSTNXXXXXXXXXXXXXXXXXLRCLEPPEIIRKSVSQ 799
             +GY+++++RDI A    +  T                       R LEPP +IR+S   
Sbjct: 294  AMGYALSLIRDITACDGPNVDTLSRAGLIKFLIDLF---------RNLEPPTLIRRSTGH 344

Query: 798  G--ENQVTSD-SLNVCPYKGYRRDIVAVIGNCLYRRKHIQDEIRKKNGILLLMQQCVTEE 628
               EN  T   S   CPYKG+RRDIV VIGNC Y R  +QDEIR+++GILL++QQCVT+E
Sbjct: 345  ADTENDTTPRFSKYCCPYKGFRRDIVGVIGNCSYGRISVQDEIREQDGILLMLQQCVTDE 404

Query: 627  ENPFLREWGIWSVRNLLEGNEENQRQVTELEVQGSVDVPEIAGLGLRVEVDQKTRRAKLV 448
            +NPFLREWGIWS+RN+LEGN +N+  V +LEVQGSVD PEIAG+GLRVE+D  TRR KLV
Sbjct: 405  DNPFLREWGIWSMRNILEGNVKNRELVVDLEVQGSVDTPEIAGVGLRVEIDHVTRRPKLV 464

Query: 447  NVS 439
            N S
Sbjct: 465  NAS 467


Top