BLASTX nr result

ID: Catharanthus23_contig00005082 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005082
         (1760 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]     479   e-132
ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanu...   475   e-131
ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum...   473   e-131
gb|EOY14176.1| ARM repeat superfamily protein, putative isoform ...   432   e-118
gb|EOY14175.1| ARM repeat superfamily protein, putative isoform ...   432   e-118
gb|EOY14173.1| ARM repeat superfamily protein, putative isoform ...   432   e-118
gb|EOY14172.1| ARM repeat superfamily protein, putative isoform ...   432   e-118
ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264...   432   e-118
gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus pe...   429   e-117
ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citr...   428   e-117
ref|XP_002320751.1| ataxin-related family protein [Populus trich...   427   e-117
ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]       420   e-114
ref|XP_002511774.1| conserved hypothetical protein [Ricinus comm...   419   e-114
ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]           417   e-114
gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus...   410   e-111
ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297...   407   e-111
ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828...   394   e-107
ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca su...   386   e-104
ref|XP_002875041.1| hypothetical protein ARALYDRAFT_490543 [Arab...   385   e-104
ref|NP_567156.1| protein MATERNAL EFFECT EMBRYO ARREST 50 [Arabi...   384   e-104

>ref|XP_006348129.1| PREDICTED: ataxin-10-like [Solanum tuberosum]
          Length = 501

 Score =  479 bits (1233), Expect = e-132
 Identities = 253/498 (50%), Positives = 327/498 (65%), Gaps = 6/498 (1%)
 Frame = -3

Query: 1539 KCEKMDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTV 1360
            K   +D+    EL +PE + +                 LI +AK + GR DL+ KN+VT 
Sbjct: 4    KVVTVDDQIVAELTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTT 63

Query: 1359 TLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDL 1180
             L LC++            S K+LRNLCAGE++NQN FL+Q GV IV ++I S+    D 
Sbjct: 64   VLHLCQSLSSISYRYLLLLSLKVLRNLCAGEIINQNEFLQQRGVEIVVDVIMSVGLTPDP 123

Query: 1179 DNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEG 1000
            D  +IR GLQLLGN+++GGGE +  VW +LFP+K +KIA +   EI DPLCMVIYTC +G
Sbjct: 124  DCMIIRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDG 183

Query: 999  TDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSS 820
            TDGL+  L +++G  I++E+++T S VG KE W+KLLLSK+C+EGS+ +SIF KL+   S
Sbjct: 184  TDGLLTDLCSEKGLPILIEILRTASAVGLKEVWLKLLLSKLCIEGSYISSIFFKLHSYPS 243

Query: 819  NANCG------DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSV 658
              N G      D F   Q++LLS LSEILNER+E I V   F   +  IL++A+   D  
Sbjct: 244  VENNGVVTHVVDQFVIEQSYLLSTLSEILNERVEHIVVSHDFARSIFGILKSASGVADFS 303

Query: 657  PRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXX 478
             R +S LP G A IDVLGYSL ILRDICA DH     +E+S D +               
Sbjct: 304  IRGKSDLPVGSAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNL 363

Query: 477  XXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKI 298
                EPP+TI+K MKQ + ++     S+S + CPY+GFRRDIV I+GNCAYRR+ VQD+I
Sbjct: 364  LRDLEPPTTIRKAMKQDQIKE--GTISSSFRCCPYQGFRRDIVAILGNCAYRRRHVQDEI 421

Query: 297  REENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTG 118
            R++NGILLLLQQCV DEDNP+LREWGIW +RNLLEGNAENQ  ++DLE+Q TVDVPEL  
Sbjct: 422  RDKNGILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVR 481

Query: 117  LGLKVEIDPQTRRAKLVN 64
            LGL+VE+DP TR  KLVN
Sbjct: 482  LGLRVEVDPVTRHTKLVN 499


>ref|XP_006366476.1| PREDICTED: ataxin-10-like isoform X1 [Solanum tuberosum]
            gi|565401994|ref|XP_006366477.1| PREDICTED:
            ataxin-10-like isoform X2 [Solanum tuberosum]
            gi|565401996|ref|XP_006366478.1| PREDICTED:
            ataxin-10-like isoform X3 [Solanum tuberosum]
            gi|565401998|ref|XP_006366479.1| PREDICTED:
            ataxin-10-like isoform X4 [Solanum tuberosum]
            gi|565402000|ref|XP_006366480.1| PREDICTED:
            ataxin-10-like isoform X5 [Solanum tuberosum]
          Length = 504

 Score =  475 bits (1223), Expect = e-131
 Identities = 252/494 (51%), Positives = 325/494 (65%), Gaps = 6/494 (1%)
 Frame = -3

Query: 1527 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQL 1348
            +D+    E+ +PE + +                 LI +AK + GR DL+ KN+VT  L L
Sbjct: 11   VDDKIVAEVTIPENVAKELLLVSNSSSLETALEKLIELAKEEGGRLDLSSKNVVTTVLHL 70

Query: 1347 CRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 1168
            C++            S K+LRNLCAGE+ NQN FL+Q GV IV ++I S+    D D  +
Sbjct: 71   CQSLSSISYRQLLLSSLKVLRNLCAGEIRNQNEFLQQRGVEIVVDVITSVGLTPDPDCMI 130

Query: 1167 IRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGL 988
            IR GLQLLGN+++GGGE +  VW +LFP+K +KIA +   EI DPLCMVIYTC +GTDGL
Sbjct: 131  IRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRSWEICDPLCMVIYTCCDGTDGL 190

Query: 987  VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSSNANC 808
            +  L ++QG  I++E+++T S V  KE W+KLLLSK+C+EGS+ +SIF KL+   S  N 
Sbjct: 191  LTDLCSEQGLPILIEILRTASAVDRKEVWLKLLLSKLCIEGSYISSIFFKLHSFPSIQNN 250

Query: 807  G------DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRE 646
            G      D F   Q +LLSILSEI+N++IE I V   F L +  IL++A   +D   R +
Sbjct: 251  GVVTHATDQFVIEQPYLLSILSEIVNDQIEHIVVSHDFALSIFGILKSAFVVVDFSIRGK 310

Query: 645  SGLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXXXXX 466
            S LP G A IDVLGYSL ILRDICA DH     +E+S D +                   
Sbjct: 311  SDLPVGFAPIDVLGYSLTILRDICASDHMTSSKEESSKDVVDVLVSSGLIEFLLNLLRDL 370

Query: 465  EPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREEN 286
            EPP+TI+K MKQ +  +     S+S + CPY+GFRRDIV I+GNCAYRR+ VQD+IR++N
Sbjct: 371  EPPTTIRKAMKQDQITE--GIISSSFRCCPYQGFRRDIVSIIGNCAYRRRYVQDEIRDKN 428

Query: 285  GILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLK 106
            GILLLLQQCV DEDNP+LREWGIW +RNLLEGNAENQ  ++DLE+Q TVDVPEL  LGL+
Sbjct: 429  GILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVRLGLR 488

Query: 105  VEIDPQTRRAKLVN 64
            VE+DP TRR KLVN
Sbjct: 489  VEVDPVTRRTKLVN 502


>ref|XP_004232703.1| PREDICTED: ataxin-10-like isoform 1 [Solanum lycopersicum]
            gi|460373805|ref|XP_004232704.1| PREDICTED:
            ataxin-10-like isoform 2 [Solanum lycopersicum]
          Length = 501

 Score =  473 bits (1218), Expect = e-131
 Identities = 254/498 (51%), Positives = 326/498 (65%), Gaps = 6/498 (1%)
 Frame = -3

Query: 1539 KCEKMDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTV 1360
            K   MD+   +EL +PE + +                 LI ++K   GR DL+ KN+VT 
Sbjct: 4    KVVTMDDQIVSELTIPENVAKELLLVSNSSSLETALDKLIQLSKEGGGRLDLSSKNVVTT 63

Query: 1359 TLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDL 1180
             L LC++            S K+LRNLCAGE+ NQN FL+Q GV IV ++I S+    D 
Sbjct: 64   VLHLCQSLSSISYRNLLLLSLKVLRNLCAGEIRNQNGFLQQRGVEIVLDVIMSVGLSPDP 123

Query: 1179 DNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEG 1000
            D  +IR GLQLLGN+++GGGE +  VW +LFP+K +KIA +   EI DPLCMVIYTC +G
Sbjct: 124  DCMIIRVGLQLLGNYSVGGGERQCDVWYQLFPHKFLKIARVRNQEICDPLCMVIYTCCDG 183

Query: 999  TDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSS 820
            TDGL+  L ++QG  I+ E+++T S VG KE W+KLLLSK+C+EGSH +SIF KL+   S
Sbjct: 184  TDGLLTDLCSEQGLPILFEILRTASAVGLKEVWLKLLLSKLCIEGSHISSIFFKLHSYPS 243

Query: 819  NANCG------DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSV 658
              + G      D F   Q +LLSILSEILNER+E I V   F   +  IL++A+  +D  
Sbjct: 244  VEDNGVVTHVADQFVIEQPYLLSILSEILNERVEHIVVSHDFARSIFGILKSASGVVDFS 303

Query: 657  PRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXX 478
             R +S LP G A IDVLGYSL ++RDICA DH     +E+S D +               
Sbjct: 304  IRGKSDLPVGSAPIDVLGYSLTLMRDICASDHLSSSKEESSKDVVDVLVSSGLIEFLLNL 363

Query: 477  XXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKI 298
                EPP+TI+  MK  + + EG+  S S + CPY+GFRRDIV I+GNCAYRR+ VQD+I
Sbjct: 364  LRDLEPPTTIRNAMKPDQIK-EGTIPS-SFRCCPYQGFRRDIVAILGNCAYRRRHVQDEI 421

Query: 297  REENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTG 118
            R++NGILLLLQQCV DEDNP+LREWGIW +RNLLEGNAENQ  ++DLE+Q TVDVPEL  
Sbjct: 422  RDKNGILLLLQQCVIDEDNPFLREWGIWCVRNLLEGNAENQGAITDLELQGTVDVPELVR 481

Query: 117  LGLKVEIDPQTRRAKLVN 64
            LGL+VE+DP TRR KLVN
Sbjct: 482  LGLRVEVDPVTRRTKLVN 499


>gb|EOY14176.1| ARM repeat superfamily protein, putative isoform 5 [Theobroma cacao]
          Length = 519

 Score =  432 bits (1110), Expect = e-118
 Identities = 226/456 (49%), Positives = 307/456 (67%), Gaps = 6/456 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFL 1243
            I +++T   RA+LA +NI+   L+L  +            S KLLRNLCAGE+ NQN F 
Sbjct: 36   IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 95

Query: 1242 EQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 1063
            EQ GV +V +++ S   LS+ D+ VIR  LQ+L N +L G +H+ A+W + FPN+   +A
Sbjct: 96   EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 155

Query: 1062 AIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 883
             +   E  DPLCM++YTC +   GLVA L  D G  I++ +I+T+++VGF E W KLLLS
Sbjct: 156  RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 215

Query: 882  KICLEGSHFASIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 721
            ++CLE  HF  +FSK    SS+ N G      D F   QAFLL I+SEILNERIE+I V 
Sbjct: 216  RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 275

Query: 720  SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQE 541
            S+F L VL I + + + +D   R  S LPTG  +IDV+GYSLIILRDICA + G+  ++ 
Sbjct: 276  SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 334

Query: 540  NSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 361
            +S+D +                   +PP+ I+K +K+    D      ++SK CPYKGFR
Sbjct: 335  DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 391

Query: 360  RDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 181
            RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE
Sbjct: 392  RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 451

Query: 180  NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 73
            NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK
Sbjct: 452  NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>gb|EOY14175.1| ARM repeat superfamily protein, putative isoform 4 [Theobroma cacao]
          Length = 500

 Score =  432 bits (1110), Expect = e-118
 Identities = 226/456 (49%), Positives = 307/456 (67%), Gaps = 6/456 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFL 1243
            I +++T   RA+LA +NI+   L+L  +            S KLLRNLCAGE+ NQN F 
Sbjct: 48   IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 107

Query: 1242 EQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 1063
            EQ GV +V +++ S   LS+ D+ VIR  LQ+L N +L G +H+ A+W + FPN+   +A
Sbjct: 108  EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 167

Query: 1062 AIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 883
             +   E  DPLCM++YTC +   GLVA L  D G  I++ +I+T+++VGF E W KLLLS
Sbjct: 168  RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 227

Query: 882  KICLEGSHFASIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 721
            ++CLE  HF  +FSK    SS+ N G      D F   QAFLL I+SEILNERIE+I V 
Sbjct: 228  RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 287

Query: 720  SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQE 541
            S+F L VL I + + + +D   R  S LPTG  +IDV+GYSLIILRDICA + G+  ++ 
Sbjct: 288  SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 346

Query: 540  NSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 361
            +S+D +                   +PP+ I+K +K+    D      ++SK CPYKGFR
Sbjct: 347  DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 403

Query: 360  RDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 181
            RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE
Sbjct: 404  RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 463

Query: 180  NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 73
            NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK
Sbjct: 464  NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499


>gb|EOY14173.1| ARM repeat superfamily protein, putative isoform 2 [Theobroma cacao]
            gi|508722277|gb|EOY14174.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
            gi|508722280|gb|EOY14177.1| ARM repeat superfamily
            protein, putative isoform 2 [Theobroma cacao]
          Length = 488

 Score =  432 bits (1110), Expect = e-118
 Identities = 226/456 (49%), Positives = 307/456 (67%), Gaps = 6/456 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFL 1243
            I +++T   RA+LA +NI+   L+L  +            S KLLRNLCAGE+ NQN F 
Sbjct: 36   IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 95

Query: 1242 EQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 1063
            EQ GV +V +++ S   LS+ D+ VIR  LQ+L N +L G +H+ A+W + FPN+   +A
Sbjct: 96   EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 155

Query: 1062 AIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 883
             +   E  DPLCM++YTC +   GLVA L  D G  I++ +I+T+++VGF E W KLLLS
Sbjct: 156  RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 215

Query: 882  KICLEGSHFASIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 721
            ++CLE  HF  +FSK    SS+ N G      D F   QAFLL I+SEILNERIE+I V 
Sbjct: 216  RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 275

Query: 720  SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQE 541
            S+F L VL I + + + +D   R  S LPTG  +IDV+GYSLIILRDICA + G+  ++ 
Sbjct: 276  SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 334

Query: 540  NSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 361
            +S+D +                   +PP+ I+K +K+    D      ++SK CPYKGFR
Sbjct: 335  DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 391

Query: 360  RDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 181
            RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE
Sbjct: 392  RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 451

Query: 180  NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 73
            NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK
Sbjct: 452  NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 487


>gb|EOY14172.1| ARM repeat superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 531

 Score =  432 bits (1110), Expect = e-118
 Identities = 226/456 (49%), Positives = 307/456 (67%), Gaps = 6/456 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFL 1243
            I +++T   RA+LA +NI+   L+L  +            S KLLRNLCAGE+ NQN F 
Sbjct: 48   IKVSRTAAARAELALRNILPTVLKLVESFHQTSSREYLVNSLKLLRNLCAGEVANQNAFF 107

Query: 1242 EQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 1063
            EQ GV +V +++ S   LS+ D+ VIR  LQ+L N +L G +H+ A+W + FPN+   +A
Sbjct: 108  EQNGVEVVLSVLRSAALLSNPDSGVIRVSLQVLANVSLAGEDHQQAIWLKFFPNEFSVLA 167

Query: 1062 AIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 883
             +   E  DPLCM++YTC +   GLVA L  D G  I++ +I+T+++VGF E W KLLLS
Sbjct: 168  RVRSQETNDPLCMILYTCCDRRPGLVAELCRDMGLPIVVGIIRTVASVGFGEDWFKLLLS 227

Query: 882  KICLEGSHFASIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAVD 721
            ++CLE  HF  +FSK    SS+ N G      D F   QAFLL I+SEILNERIE+I V 
Sbjct: 228  RLCLEDIHFPLVFSKSCEGSSSENSGNTDSGDDLFLSEQAFLLRIISEILNERIEEIQVS 287

Query: 720  SKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQE 541
            S+F L VL I + + + +D   R  S LPTG  +IDV+GYSLIILRDICA + G+  ++ 
Sbjct: 288  SEFALCVLGIFKRSVRVVDFASRGMSSLPTGCTSIDVMGYSLIILRDICARE-GVGDLKN 346

Query: 540  NSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFR 361
            +S+D +                   +PP+ I+K +K+    D      ++SK CPYKGFR
Sbjct: 347  DSLDVVDMLLSHELIDILLSLLRDLDPPAIIRKVLKEG---DNQGLNLSASKLCPYKGFR 403

Query: 360  RDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAE 181
            RD++ ++GNCAYRRK VQD+IR++NGILLLLQQCV+D+DNPYLREWGIWS+RNLLEG+AE
Sbjct: 404  RDMIAVIGNCAYRRKHVQDEIRQKNGILLLLQQCVTDDDNPYLREWGIWSLRNLLEGHAE 463

Query: 180  NQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAK 73
            NQ+ V+DLE+Q +VD+PEL+ LGL+VE+D +TRRAK
Sbjct: 464  NQQAVADLELQGSVDMPELSRLGLRVEVDQKTRRAK 499


>ref|XP_002274705.1| PREDICTED: uncharacterized protein LOC100264428 [Vitis vinifera]
          Length = 494

 Score =  432 bits (1110), Expect = e-118
 Identities = 233/461 (50%), Positives = 300/461 (65%), Gaps = 7/461 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFL 1243
            I  +KT  GR DL  KNI+ V LQL ++            S KLLRNLCAGE+ NQN F+
Sbjct: 35   IEASKTPGGRLDLGSKNILPVVLQLSQSLSYPSGHDILLLSLKLLRNLCAGEMTNQNLFI 94

Query: 1242 EQGGVGIVSNIIASIKCL-SDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKI 1066
            EQ GV  VS I+ S   L SD D  +IR GLQLLGN +L G  H+ AVW   FP   ++I
Sbjct: 95   EQNGVKAVSTILLSFVGLDSDSDYGIIRMGLQLLGNVSLAGERHQRAVWHHFFPAGFLEI 154

Query: 1065 AAIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLL 886
            A +  +E  DPLCMVIYTC + +   +  +  DQG  I+ E+++T STVGF+E W+KLLL
Sbjct: 155  ARVRTLETSDPLCMVIYTCFDQSHEFITEICGDQGLPILAEIVRTASTVGFEEDWLKLLL 214

Query: 885  SKICLEGSHFASIFSKLYPVSSNANCG------DYFADPQAFLLSILSEILNERIEDIAV 724
            S+ICLE SHF  +FSKL PV ++ N        D FA  QAFL+ I++EILNE+I  + V
Sbjct: 215  SRICLEESHFPMLFSKLCPVGTSGNYESIEFKVDVFASEQAFLMDIVAEILNEQINKMTV 274

Query: 723  DSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQ 544
             S   L VL IL+ +A  +DSV   +SG   G   I+VL YSL IL++ICA D      +
Sbjct: 275  SSDVALCVLGILKKSAGVLDSVSTCKSGFSAGSNAINVLKYSLTILKEICARDAQKSSNE 334

Query: 543  ENSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGF 364
              SVD +                   EPP+ I+K +KQ + +D     S S K  PY+GF
Sbjct: 335  HGSVDVVDLLVSSGLLELLLCLLRDLEPPAIIRKAIKQGENQD--GAASYSPKHYPYRGF 392

Query: 363  RRDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNA 184
            RRD+V ++GNCAYRRK VQ++IRE NGILLLLQQCV+DE+N +LREWGIW +RNLLEGN 
Sbjct: 393  RRDLVAVIGNCAYRRKHVQNEIRERNGILLLLQQCVTDEENQFLREWGIWCVRNLLEGNV 452

Query: 183  ENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 61
            ENQR+V++LE+Q +VDVPE+ GLGL+VE+D +T RAKLVN+
Sbjct: 453  ENQRVVAELELQGSVDVPEIAGLGLRVEVDQKTGRAKLVNV 493


>gb|EMJ20253.1| hypothetical protein PRUPE_ppa004765mg [Prunus persica]
          Length = 492

 Score =  429 bits (1103), Expect = e-117
 Identities = 230/494 (46%), Positives = 314/494 (63%), Gaps = 5/494 (1%)
 Frame = -3

Query: 1527 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQL 1348
            MD+T   E  VPE + Q                 LI + +  DGRADLA K+I+   +QL
Sbjct: 1    MDKTALQEFFVPEDVLQILLSASNSSTLIDSLETLIQVCRAADGRADLASKSILPSVVQL 60

Query: 1347 CRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 1168
             ++            S KLLRNLCAGE+ NQ +FLEQ GV I+SN++ S     + D+ V
Sbjct: 61   IQSLPYPSGRHLLTLSLKLLRNLCAGEVSNQKSFLEQSGVAIISNVLNSANISLEPDSGV 120

Query: 1167 IRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGL 988
            IR GLQ+L N +L G  H++ +W++LFP + + +A +   E  DPLCMVI+ C +G+  L
Sbjct: 121  IRMGLQVLANVSLAGERHQHEIWQQLFPKEFLALARVQSRETCDPLCMVIFACCDGSPEL 180

Query: 987  VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKL-YPVSSNAN 811
               L  D G  I+ E+++T + VGF E W+KLLLS+ICLEG +F+S+FS L +  S N  
Sbjct: 181  FEKLCGDGGITIMKEIVRTTAAVGFGEDWVKLLLSRICLEGPYFSSLFSNLGFATSENVE 240

Query: 810  ----CGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRES 643
                  D F+  QAF L I+S+ILNER+ +I V   F L V  I + +  A++ V R +S
Sbjct: 241  DTEFREDLFSSDQAFFLRIISDILNERLREITVPRDFALCVFGIFKKSVGALNCVTRGQS 300

Query: 642  GLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXXXXXE 463
            GLPTG + IDVLGYSL ILRD+CA    L+  QE+  DA+                   E
Sbjct: 301  GLPTGTSMIDVLGYSLTILRDVCA-QKTLRGFQEDLGDAVDVLLSHGLIELILCLLRDLE 359

Query: 462  PPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREENG 283
            PP+ I+K +KQ + +D  +  S SSK CPYKGFRRDIV ++GNC Y+RK VQD+IR+ +G
Sbjct: 360  PPAIIRKAIKQGEGQDGTN--SGSSKPCPYKGFRRDIVAVIGNCTYQRKPVQDEIRQRDG 417

Query: 282  ILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKV 103
            ILLLLQQC  DEDNP+L+EWGIW +RNLLEGN +N+R+V++LE+Q +VD PE+ GLG +V
Sbjct: 418  ILLLLQQCGLDEDNPFLKEWGIWCVRNLLEGNEDNKRVVTELELQGSVDAPEIAGLGFRV 477

Query: 102  EIDPQTRRAKLVNM 61
            E++P+T R KLVN+
Sbjct: 478  EVNPETGRPKLVNV 491


>ref|XP_006421838.1| hypothetical protein CICLE_v10004825mg [Citrus clementina]
            gi|567858312|ref|XP_006421839.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858314|ref|XP_006421840.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|567858316|ref|XP_006421841.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|568874427|ref|XP_006490317.1| PREDICTED:
            ataxin-10-like isoform X1 [Citrus sinensis]
            gi|568874429|ref|XP_006490318.1| PREDICTED:
            ataxin-10-like isoform X2 [Citrus sinensis]
            gi|557523711|gb|ESR35078.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523712|gb|ESR35079.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523713|gb|ESR35080.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
            gi|557523714|gb|ESR35081.1| hypothetical protein
            CICLE_v10004825mg [Citrus clementina]
          Length = 497

 Score =  428 bits (1101), Expect = e-117
 Identities = 229/495 (46%), Positives = 307/495 (62%), Gaps = 6/495 (1%)
 Frame = -3

Query: 1527 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQL 1348
            MD+    ++ + E + Q                 LI  +KT  GR+DLA KNI+   LQL
Sbjct: 1    MDDASSLDISLSEDVLQPLLTTSNSSSLKDALEILIESSKTTVGRSDLASKNILPEVLQL 60

Query: 1347 CRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 1168
             ++            S KLLRNLCAGE+ NQ +F+EQ GVGIV  ++ S     D D  +
Sbjct: 61   TQSIPHSSGCHYLLLSLKLLRNLCAGEITNQKSFIEQTGVGIVLRVLRSPGVNLDKDYGI 120

Query: 1167 IRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGL 988
            IR  LQ+L N +L G  H++A+W + FP++   +A + C E  DPLCMVIYTC +G+ GL
Sbjct: 121  IRIALQVLANVSLAGETHQHAIWCQFFPDEFATLAGVRCQETCDPLCMVIYTCCDGSSGL 180

Query: 987  VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSSNANC 808
               L  D+G  I+ E++ T ++VGFKE W K L+S+ C+E  HF  +F KL  V ++ NC
Sbjct: 181  FKELCGDKGLAIMAEIVCTAASVGFKEDWFKFLVSRTCVEEIHFPQLFFKLSQVGASRNC 240

Query: 807  GD------YFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRE 646
             D       F+  QAFLL I+SEI+NERIE+I V + F L VL I   +   +D   R  
Sbjct: 241  EDSNSREGTFSSEQAFLLEIVSEIVNERIEEIIVPNDFALSVLGIFTKSIGLVDFYARGT 300

Query: 645  SGLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXXXXX 466
              LPT  + I+VLGYSL ILR+ICA +        N  D +                   
Sbjct: 301  PSLPTSSSAINVLGYSLSILRNICAREDPAGSSSVNRADLVDSLQSHGLIEMFLSLLRDL 360

Query: 465  EPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREEN 286
            EPP+ I+K M+Q + ++  S  + S+K CPY GFRRD+V ++GNCAYRRK +QD+IRE +
Sbjct: 361  EPPAIIRKAMRQGENQEGTS--AKSAKTCPYIGFRRDLVAVIGNCAYRRKHIQDEIRERD 418

Query: 285  GILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLK 106
            GILLLLQQCV+DEDNP+ REWGIW +RNLLEGNAENQ++V+DLE+Q +++VPELT LGLK
Sbjct: 419  GILLLLQQCVTDEDNPFSREWGIWCVRNLLEGNAENQKVVADLELQGSINVPELTDLGLK 478

Query: 105  VEIDPQTRRAKLVNM 61
            VE+D  TRRAKLVN+
Sbjct: 479  VEVDKNTRRAKLVNV 493


>ref|XP_002320751.1| ataxin-related family protein [Populus trichocarpa]
            gi|222861524|gb|EEE99066.1| ataxin-related family protein
            [Populus trichocarpa]
          Length = 496

 Score =  427 bits (1099), Expect = e-117
 Identities = 231/463 (49%), Positives = 307/463 (66%), Gaps = 9/463 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLC-RAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTF 1246
            I IAKTDDGRADLA KNI+ V LQL                S +L+RNLCAGE+ NQ +F
Sbjct: 37   IAIAKTDDGRADLASKNILPVVLQLITHLLNDPFDHEYLSLSLRLMRNLCAGEVANQKSF 96

Query: 1245 LEQGGVGIVSNIIASIKCLS-DLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIK 1069
            ++  GVGI   ++ S K  S + D+ +IR GLQ+L N +L G EH+ A+W  LF ++L  
Sbjct: 97   IQLNGVGIFLTVLRSKKVASSEPDHGIIRMGLQVLANVSLAGKEHQQAIWGGLFHDELYM 156

Query: 1068 IAAIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLL 889
            +A +      DPLCM+IY C +G+  LV  L  +QG  I++E+I+T S VGF E W+KLL
Sbjct: 157  LAKVRSQGTCDPLCMIIYACCDGSPELVLQLCGNQGLPIVVEIIRTASLVGFGEEWLKLL 216

Query: 888  LSKICLEGSHFASIFSKLYPVSSNANCGDY-------FADPQAFLLSILSEILNERIEDI 730
            LS+ICLE  +F  +FS++Y V S    G+        F   QA+LL+I+SEILNER+++I
Sbjct: 217  LSRICLEDIYFPQLFSRIYSVCSYCENGEEISLSSNPFFTEQAYLLNIVSEILNERLKEI 276

Query: 729  AVDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKK 550
             + + F L +  I + + +A +   R ES LPTG A IDVLGYSL ILRDICA + G+ K
Sbjct: 277  TILNDFALCIFGIFKKSVEAFEFGSRAESRLPTGFAVIDVLGYSLTILRDICANNGGVGK 336

Query: 549  VQENSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYK 370
              E+ VD +                   EPP  I+K M Q+  ++  +  S   K CPYK
Sbjct: 337  --EDLVDVVDSLLSSGLLDLLLCLLRDLEPPKIIRKAMNQAGNQE--ATTSYFPKVCPYK 392

Query: 369  GFRRDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEG 190
            GFRRD+V ++GNCAYRRK VQD IR++NG+LL+LQQCV+DEDNP+LREWGIWS+RNLLEG
Sbjct: 393  GFRRDLVAVIGNCAYRRKHVQDDIRQKNGMLLMLQQCVTDEDNPFLREWGIWSMRNLLEG 452

Query: 189  NAENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 61
            N+ENQ+ V++LE+Q +VD+PEL GLGLKVE+D  TR AKLVN+
Sbjct: 453  NSENQQAVAELELQGSVDMPELAGLGLKVEVDQNTRSAKLVNI 495


>ref|XP_004492673.1| PREDICTED: ataxin-10-like [Cicer arietinum]
          Length = 468

 Score =  420 bits (1079), Expect = e-114
 Identities = 218/459 (47%), Positives = 296/459 (64%), Gaps = 5/459 (1%)
 Frame = -3

Query: 1422 IGIAKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFL 1243
            I  +K+D GR++LA K ++   L +  +             FKLLRNLCAGE  NQN FL
Sbjct: 13   IHTSKSDSGRSNLASKRVLPAVLNILNSQTLPLDHNLLSLCFKLLRNLCAGEFENQNLFL 72

Query: 1242 EQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIA 1063
            E  GV +VS+I+ S       D+ ++R+GLQ+L N  L G +H+ A+W+E+FP   + +A
Sbjct: 73   EFDGVVVVSSILMSEAGSLRPDHMLVRWGLQVLANVCLAGKQHQKAIWEEIFPLGFVSLA 132

Query: 1062 AIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLS 883
             +   EI DPLCMVIYTC +G       L +D G  ++ E++KT S+  F E WIKLLLS
Sbjct: 133  RLGTKEICDPLCMVIYTCCDGNHECFGELCSDSGLPVVAEIVKTASSASFGEDWIKLLLS 192

Query: 882  KICLEGSHFASIFSKL----YPVSSNANCGDY-FADPQAFLLSILSEILNERIEDIAVDS 718
            +ICLE S    +F KL     P   + +  DY F+  QAFLL ILSEILNER+ D+ V  
Sbjct: 193  RICLEESQLPMLFPKLRFMDIPEGEDIDSKDYQFSFEQAFLLQILSEILNERLRDVVVSK 252

Query: 717  KFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQEN 538
               L V  + + +   ++   R +SGLP+G   +D LGYSL ILRDICA D  ++   E+
Sbjct: 253  DVALFVYGVFKKSVGVLEHAVRGKSGLPSGSVAVDALGYSLTILRDICAHD-SVRGNPED 311

Query: 537  SVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRR 358
            + D +                   EPP+ I+KG+KQS+ ++  SC   SSK CPYKGFRR
Sbjct: 312  TNDVVDVLLSQDIIELLLILLGDLEPPAIIRKGIKQSENQEGASC---SSKPCPYKGFRR 368

Query: 357  DIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAEN 178
            DIV ++GNC YRRK  QD+IR  NGILLLLQQCV+DEDNP+LREWGIWS+RN+LEGN EN
Sbjct: 369  DIVSLIGNCVYRRKHAQDEIRGRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGNEEN 428

Query: 177  QRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 61
            Q++VS+L++Q + DVP+++ LGL++E+D +TRRAKLVN+
Sbjct: 429  QKVVSELQLQGSADVPQISALGLRIEVDQKTRRAKLVNV 467


>ref|XP_002511774.1| conserved hypothetical protein [Ricinus communis]
            gi|223548954|gb|EEF50443.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 497

 Score =  419 bits (1078), Expect = e-114
 Identities = 226/491 (46%), Positives = 306/491 (62%), Gaps = 6/491 (1%)
 Frame = -3

Query: 1506 ELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQLCRAXXXX 1327
            EL +PE + Q                 LI  ++ DDGRA+LA K+++ + L+L ++    
Sbjct: 2    ELFLPEDLLQLLFRASKSYDLKEALEILIETSRIDDGRANLAAKDVLPLVLKLFKSISYP 61

Query: 1326 XXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQVIRFGLQL 1147
                    S KLLRNLCAGE+ NQN F+   G  +VS ++ S   + + D  +IR GLQ+
Sbjct: 62   SGDQFLTLSLKLLRNLCAGEITNQNCFVALNGPEMVSTLLRSAGLVYEPDYGIIRLGLQV 121

Query: 1146 LGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTD 967
            L N +L G +H+ A+W   FP++ + +A        DPLCM+IYTC +G  G V  L  D
Sbjct: 122  LANVSLAGEKHQQAIWHWFFPDEFVVLAKNRSQSTCDPLCMIIYTCCDGNPGFVLELCGD 181

Query: 966  QGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSSNANC------G 805
            +G  ++ E+++T S VG+ E W KLLLS+ICLE  +F  +FS  Y    + N        
Sbjct: 182  RGLAVVAEIVRTASVVGYGEDWFKLLLSRICLEEEYFYKLFSCFYCAGDSENSEGISSSS 241

Query: 804  DYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGH 625
            D F+  QA+LLS +SEILNER+EDI+V   F  +V  I + +   +D V R  SGLPTG 
Sbjct: 242  DLFSTEQAYLLSTVSEILNERLEDISVSIDFAFYVFGIFKRSVGVVDFVSRGNSGLPTGS 301

Query: 624  ATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIK 445
            A +DVLGYSL ILRD CA  HG K    +SVD +                   EPP  IK
Sbjct: 302  AAVDVLGYSLTILRDTCAL-HG-KGGLYHSVDVVDTLLSNGLLELLLFVLHDLEPPPMIK 359

Query: 444  KGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREENGILLLLQ 265
            K MKQ++  +  S  S S K CPYKGFRRDIV ++GNCA++R +VQD+IR+++ I LLLQ
Sbjct: 360  KAMKQNENHEPAS--SRSYKPCPYKGFRRDIVAVIGNCAFQRNNVQDEIRQKDMIPLLLQ 417

Query: 264  QCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKVEIDPQT 85
            QCV+DEDNP+LREWG+W +RNLLEGN ENQ+ V++LE+Q TV VPEL+GLGL+VE+D  T
Sbjct: 418  QCVTDEDNPFLREWGLWCVRNLLEGNVENQKAVAELELQGTVQVPELSGLGLRVEVDSNT 477

Query: 84   RRAKLVNM*SS 52
            RRA+LVN+ S+
Sbjct: 478  RRARLVNVSST 488


>ref|XP_003551615.1| PREDICTED: ataxin-10-like [Glycine max]
          Length = 498

 Score =  417 bits (1073), Expect = e-114
 Identities = 220/462 (47%), Positives = 298/462 (64%), Gaps = 11/462 (2%)
 Frame = -3

Query: 1413 AKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXS------FKLLRNLCAGELVNQN 1252
            AK+D GR +LA K I+   L +  +            +      FKLLRNLCAGE  NQ+
Sbjct: 40   AKSDSGRLELASKRILPAVLNIVHSLTHASHHHHHQHNHILCLSFKLLRNLCAGEAANQD 99

Query: 1251 TFLEQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLI 1072
            +FLE  GV +V +++ S    S  D+ ++R+GLQ+L N +L G +H+ A+WKEL+ +  +
Sbjct: 100  SFLELDGVAVVCSVLRSEAACSGPDHGLVRWGLQVLANVSLAGKQHQCAIWKELYLDGFV 159

Query: 1071 KIAAIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKL 892
             +A +H  E  DPLCMVIYTC +G       L ++ G+ ++ E+++T S+  F E W+KL
Sbjct: 160  SLARLHTKETCDPLCMVIYTCCDGNPEWFKRLSSEDGWFVMAEIVRTASSASFGEDWLKL 219

Query: 891  LLSKICLEGSHFASIFSKLY-----PVSSNANCGDYFADPQAFLLSILSEILNERIEDIA 727
            LLS+ICLE S    +FSKL       V    +  D+F+  QAFLL ILSEILNER +D+ 
Sbjct: 220  LLSRICLEESQLPVLFSKLQFADVPKVEVAESKDDHFSFEQAFLLRILSEILNERHKDVT 279

Query: 726  VDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDHGLKKV 547
            V     L V  I + +   ++   R +SGLP+G   +DVLGYSL ILRDICA D G++  
Sbjct: 280  VSKDVALFVFGIFKNSIGVLEHATRGKSGLPSGFVGVDVLGYSLTILRDICAQD-GVRGN 338

Query: 546  QENSVDAMXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKG 367
             E+S D +                   EPP+ I+KG+KQ + +D  SC   S K CPYKG
Sbjct: 339  TEDSNDVVDALLSYGLIELLLYLLEALEPPAIIRKGLKQCENQDGASC---SFKPCPYKG 395

Query: 366  FRRDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGN 187
            FRRDIV ++GNC YRRK  QD+IR  NGILLLLQQCV+DEDNP+LREWGIWS+RN+LEGN
Sbjct: 396  FRRDIVALIGNCVYRRKHAQDEIRHRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGN 455

Query: 186  AENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 61
             ENQ++V++LE+Q + DVPE+T LGL+VE+D +TRRAKLVN+
Sbjct: 456  DENQKVVAELEIQGSADVPEITSLGLRVEVDQRTRRAKLVNI 497


>gb|ESW20728.1| hypothetical protein PHAVU_005G009900g [Phaseolus vulgaris]
          Length = 498

 Score =  410 bits (1053), Expect = e-111
 Identities = 227/498 (45%), Positives = 303/498 (60%), Gaps = 9/498 (1%)
 Frame = -3

Query: 1527 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQ- 1351
            +D TF  E P+ E   Q                 LI  AK+D GR +LA K I+   L  
Sbjct: 2    IDTTF-LEHPISEDTLQLLFQASNSSNLEKSLEILIQNAKSDSGRLELASKRILPAVLNI 60

Query: 1350 ---LCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDL 1180
               L +A             FKLLRNLCAGE  NQ +F+E  GV +V +++ S       
Sbjct: 61   VQSLAQASHHHHHNQTFSLCFKLLRNLCAGEAANQVSFIELNGVAVVWSVLRSEAGSLGP 120

Query: 1179 DNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEG 1000
            D++++R+GLQ+L N +LGG +H+ A+W+EL+P     +A +   EI DPLCMVIYTC +G
Sbjct: 121  DHRLVRWGLQVLANVSLGGKQHQRAIWEELYPIGFASLARVGTKEICDPLCMVIYTCCDG 180

Query: 999  TDGLVAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSS 820
                   L +D G+ ++ E+++T S+  F E W+KLLLS+I LE S    +FSKL  V  
Sbjct: 181  NPEWFKKLSSDDGWPVVAEIVRTASSASFDEDWLKLLLSRIFLEESQLPVLFSKLQSVDV 240

Query: 819  NA-----NCGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVP 655
                   +    F+  QAFLL ILSEILNER+ D+ V     L V  I + +   ++   
Sbjct: 241  PEGEVIESKNGQFSFEQAFLLQILSEILNERLGDVTVSEDVALFVFGIFKKSIGVLEHAM 300

Query: 654  RRESGLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXX 475
            R +SGLP+G   +DVLGYSL ILRDICA D     ++ N+ D +                
Sbjct: 301  RGKSGLPSGFTGVDVLGYSLTILRDICAQDG----MRGNTKDVVDVLLSYGLIEFLLSLL 356

Query: 474  XXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIR 295
               EPP+ I+KG+KQ + +D  SCCS   K CPYKGFRRDIV ++GNC YRRK  QD+IR
Sbjct: 357  GALEPPAIIRKGLKQIENQDNASCCS---KPCPYKGFRRDIVALIGNCVYRRKHAQDEIR 413

Query: 294  EENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGL 115
            + NGILLLLQQCV+DEDNP+LREWGIWS+RN+LEGN ENQ+LV++LE+Q + DVPE+  L
Sbjct: 414  DRNGILLLLQQCVTDEDNPFLREWGIWSVRNMLEGNDENQKLVAELEIQGSADVPEINAL 473

Query: 114  GLKVEIDPQTRRAKLVNM 61
            GL+VE+D +TRR KLVN+
Sbjct: 474  GLQVEVDQRTRRPKLVNI 491


>ref|XP_004308721.1| PREDICTED: uncharacterized protein LOC101297970 [Fragaria vesca
            subsp. vesca]
          Length = 492

 Score =  407 bits (1047), Expect = e-111
 Identities = 228/495 (46%), Positives = 301/495 (60%), Gaps = 6/495 (1%)
 Frame = -3

Query: 1527 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQL 1348
            MD T   E  VPE + Q                 L+ + KT DGR DL+ KN++   +QL
Sbjct: 1    MDNTTLPECSVPEHVLQALLSVSNSSKLVDSLEDLVQVCKTADGREDLSAKNVLPTVIQL 60

Query: 1347 CRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 1168
             ++            S +LLRNLCAGE+ NQN+F+EQ GV I+SNI++S   L   D  +
Sbjct: 61   VQSLSYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIISNILSSASSLEP-DFGI 119

Query: 1167 IRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGL 988
            I  GLQ+L N AL G   ++A+W++LF    + +A +   +   PLCM+IY C +GT  L
Sbjct: 120  ICVGLQVLANVALAGERQQHAIWQQLFLENFVALARVRSQKTCGPLCMIIYACCDGTPEL 179

Query: 987  VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSSNANC 808
            VA L  D G  I+ E++KT +  GF E W KLLLS+ICLE  +F  +F  L  V  N N 
Sbjct: 180  VAQLCGDCGVTIVKEIVKTAAADGFGEDWYKLLLSRICLEEPYFRPLFFSLQHVGGNENG 239

Query: 807  GDY------FADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRE 646
             D       F + Q FLL  +SEILNER+ +I V   F L V  I + + K +    R  
Sbjct: 240  DDTEGGQESFLEEQEFLLKNVSEILNERLNEITVPDDFALCVFGIFKNSIKVLSYATRGR 299

Query: 645  SGLPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXXXXX 466
            SGLPTG   IDVLGYSL ILRDICA    L+    +++D +                   
Sbjct: 300  SGLPTGSIDIDVLGYSLTILRDICA-QGTLRGCTVDTMDVVDALISYGLIELLLCLLRDL 358

Query: 465  EPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREEN 286
            EPP+ IKK + Q+K + EGS  S +SK CPYKGFRRDIVG++GNC Y R+ VQD+IR ++
Sbjct: 359  EPPAIIKKSVNQAKDQ-EGSNYS-ASKPCPYKGFRRDIVGVIGNCLYGRQIVQDEIRRKD 416

Query: 285  GILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLK 106
            G+LLLLQQCV+D+DNPYLREWGIW +RNLLE N ENQ+ V++LE+Q +VDVP+L  LGL+
Sbjct: 417  GLLLLLQQCVTDDDNPYLREWGIWCVRNLLERNQENQQAVAELELQGSVDVPDLARLGLR 476

Query: 105  VEIDPQTRRAKLVNM 61
            VE++P T R KLVN+
Sbjct: 477  VEMNPATGRPKLVNI 491


>ref|XP_003623813.1| Ataxin-10 [Medicago truncatula] gi|355498828|gb|AES80031.1| Ataxin-10
            [Medicago truncatula]
          Length = 491

 Score =  394 bits (1012), Expect = e-107
 Identities = 211/420 (50%), Positives = 278/420 (66%), Gaps = 7/420 (1%)
 Frame = -3

Query: 1299 FKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGG 1120
            FKLLRNLCAGE++NQN FLE  GV IV + I   + +   D  ++R+GLQ+L N  L G 
Sbjct: 77   FKLLRNLCAGEILNQNMFLENDGVFIVVSSILRSEVVGS-DYMLVRWGLQVLANVCLAGK 135

Query: 1119 EHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEV 940
            EH+ AVW E+FP   + +A I   E+ DPLCMVIYTC +G D   + + +D G+ +++E+
Sbjct: 136  EHQKAVWDEMFPVGFLSVARIGKKEVNDPLCMVIYTCCDGNDQWFSEVCSDGGWNVLVEI 195

Query: 939  IKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKL----YPVSSNANC-GDYFADPQAFL 775
            ++T S+  F E WIKLLLS+ICLE S    +FSKL     P   +     D F+  QAFL
Sbjct: 196  VRTASSASFGEDWIKLLLSRICLEDSQLRVLFSKLRFMDIPDGEDTKTKDDQFSSEQAFL 255

Query: 774  LSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRESGLPTGHATIDVLGYSL 595
            L I+S+ILNERI D+ +  +    V  I + +   ++   R +SGLP+G   +DVLGYSL
Sbjct: 256  LQIISDILNERIGDVTISLEVASFVYGIFKKSIGVLEHAVRGKSGLPSGITDVDVLGYSL 315

Query: 594  IILRDICACDHGLKKVQENSVDA--MXXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKT 421
             +LRDICA D     V+ NS D   +                   EPP+ I+KGMK S+ 
Sbjct: 316  TMLRDICAHD----SVRGNSEDTEVVDMLLSYGLIELVFILLGDLEPPTIIRKGMKHSEN 371

Query: 420  RDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREENGILLLLQQCVSDEDN 241
             D  S   +SSK CPYKGFRRDIV ++GNC YRRK VQD+IR  NGILLLLQQCV+DEDN
Sbjct: 372  PDGAS---SSSKPCPYKGFRRDIVALIGNCVYRRKHVQDEIRSRNGILLLLQQCVTDEDN 428

Query: 240  PYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKVEIDPQTRRAKLVNM 61
            PYLREWGIW +RN+LEGN ENQ+ +S+L++Q + DVPE++ LGL+VE+D +TRRAKLVN+
Sbjct: 429  PYLREWGIWCVRNMLEGNEENQKEISELQLQGSADVPEISALGLRVEVDQKTRRAKLVNV 488


>ref|XP_004306868.1| PREDICTED: ataxin-10-like [Fragaria vesca subsp. vesca]
          Length = 490

 Score =  386 bits (991), Expect = e-104
 Identities = 221/493 (44%), Positives = 294/493 (59%), Gaps = 4/493 (0%)
 Frame = -3

Query: 1527 MDETFPTELPVPETITQWXXXXXXXXXXXXXXXXLIGIAKTDDGRADLAGKNIVTVTLQL 1348
            MD T   E  VPE + Q                 LI + KT DGR DLA KN++   +QL
Sbjct: 1    MDNTALPECSVPEDVIQALLSVSNSSNLVESMEDLIQVCKTADGREDLAAKNVLPTVIQL 60

Query: 1347 CRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQGGVGIVSNIIASIKCLSDLDNQV 1168
             ++            S +LLRNLCAGE+ NQN+F+EQ GV IVSNI++S   L   D  +
Sbjct: 61   VQSLLYPSDHYLLTLSLRLLRNLCAGEVANQNSFVEQNGVAIVSNILSSAISLEP-DFWI 119

Query: 1167 IRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIHCMEIIDPLCMVIYTCTEGTDGL 988
            I  GLQ+L N AL G   ++A+W++LF  K + +A +   +   PLCM+I TC +GT  L
Sbjct: 120  ICVGLQVLANAALAGERQQHAIWQQLFSEKFVALARVRSKKTCGPLCMIISTCCDGTPEL 179

Query: 987  VAGLLTDQGFVIILEVIKTMSTVGFKEHWIKLLLSKICLEGSHFASIFSKLYPVSSNANC 808
            VA L  D G  I+ E++KT + V F E W KLLLS+ICL   +F  +F  L  V  NA  
Sbjct: 180  VAQLCGDCGVTILKEIVKTAAAVDFGEDWYKLLLSRICLVEPYFRPLFFSLEHVGENAED 239

Query: 807  GD----YFADPQAFLLSILSEILNERIEDIAVDSKFPLHVLEILRTAAKAIDSVPRRESG 640
             +     F+  Q FLL  +SEILNE + +I V + F L V  I + + K +    R  SG
Sbjct: 240  TEGGRESFSKEQEFLLKNVSEILNECLSEITVPNDFALCVFGIFKNSIKVLSYATRGRSG 299

Query: 639  LPTGHATIDVLGYSLIILRDICACDHGLKKVQENSVDAMXXXXXXXXXXXXXXXXXXXEP 460
            LPTG   IDVLGYSL ILRD CA    L+   ++++D +                   EP
Sbjct: 300  LPTGSIDIDVLGYSLTILRDTCA-QGTLRGSTKDTMDVVDALISYGLIELLLSLLRDLEP 358

Query: 459  PSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGIVGNCAYRRKSVQDKIREENGI 280
            P+ IKK + Q++ ++  S  S++ K CPYKGFRRDIV ++GNC Y RK VQD+IR ++G+
Sbjct: 359  PAIIKKSINQAENQEGSS--SSTLKPCPYKGFRRDIVAVIGNCLYGRKIVQDEIRRKDGL 416

Query: 279  LLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVSDLEMQETVDVPELTGLGLKVE 100
            LLLLQQCV D+DNPY REWGIW  RNLL+ N ENQR V++LE++ +VDVP L  LGL+VE
Sbjct: 417  LLLLQQCVIDDDNPYSREWGIWCQRNLLDRNQENQRAVAELELKGSVDVPALARLGLRVE 476

Query: 99   IDPQTRRAKLVNM 61
            ++  T R KLVN+
Sbjct: 477  MNLATGRPKLVNI 489


>ref|XP_002875041.1| hypothetical protein ARALYDRAFT_490543 [Arabidopsis lyrata subsp.
            lyrata] gi|297320878|gb|EFH51300.1| hypothetical protein
            ARALYDRAFT_490543 [Arabidopsis lyrata subsp. lyrata]
          Length = 474

 Score =  385 bits (989), Expect = e-104
 Identities = 203/453 (44%), Positives = 288/453 (63%), Gaps = 3/453 (0%)
 Frame = -3

Query: 1413 AKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQG 1234
            +KTD GR+DLA K I+   L+L +             S K+LRNLCAGE+ NQN+F++  
Sbjct: 34   SKTDSGRSDLASKCILPSILRLLQLLPYPSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHD 93

Query: 1233 GVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIH 1054
            G  IVS ++ S    +  D + +RFGLQ+L N  L G + +  VW   FP + + IA I 
Sbjct: 94   GSVIVSELLDS----AIADFETVRFGLQVLANVVLFGEKRQRDVWLRFFPERFLSIAKIR 149

Query: 1053 CMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVG-FKEHWIKLLLSKI 877
              E  DPLCM++YTC +G+  + + L + +G  II E ++T S+VG  +++W+KLL+S+I
Sbjct: 150  RRETCDPLCMILYTCFDGSSEIASELCSSEGLTIIAETLRTSSSVGSVEDYWLKLLVSRI 209

Query: 876  CLEGSHFASIFSKLYPVSSNANCGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVL 697
            C+E  +F  +FSKLY V+ N    + F   QAFLL I+S+I NERI  +A+       +L
Sbjct: 210  CVEDDYFPKLFSKLYKVAEN----EKFTSEQAFLLRIVSDIANERIGKVAIPKDTASSIL 265

Query: 696  EILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACD--HGLKKVQENSVDAM 523
             + + +    D V    S LPTG   +DV+GYSL+I+RD CA      L K  ++S D +
Sbjct: 266  GLFKQSVDVFDFVSGERSELPTGSTIVDVMGYSLVIIRDACAGGSLEELNKDNKDSGDTV 325

Query: 522  XXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGI 343
                               +PP+TIKK + QS T       S+S K CPY+GFRRDIV +
Sbjct: 326  ELLLSSGLIELLLDLLRKLDPPTTIKKALNQSPTS------SSSFKPCPYRGFRRDIVSV 379

Query: 342  VGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVS 163
            +GNCAYRRK VQD+IRE +G++L+LQQCV+D++NP+LREWG+W +RNLLEGN ENQ +V+
Sbjct: 380  IGNCAYRRKEVQDEIRERDGLVLMLQQCVTDDENPFLREWGLWCVRNLLEGNPENQEVVA 439

Query: 162  DLEMQETVDVPELTGLGLKVEIDPQTRRAKLVN 64
            +LE++ +VDVP+L  +GL+VEIDP+T R KLVN
Sbjct: 440  ELEIKGSVDVPQLREIGLRVEIDPKTARPKLVN 472


>ref|NP_567156.1| protein MATERNAL EFFECT EMBRYO ARREST 50 [Arabidopsis thaliana]
            gi|3193319|gb|AAC19301.1| contains similarity to mouse
            brain protein E46 (GB:X61506) [Arabidopsis thaliana]
            gi|26451586|dbj|BAC42890.1| unknown protein [Arabidopsis
            thaliana] gi|28973257|gb|AAO63953.1| unknown protein
            [Arabidopsis thaliana] gi|332656441|gb|AEE81841.1|
            maternal effect embryo arrest 50 protein [Arabidopsis
            thaliana]
          Length = 475

 Score =  384 bits (987), Expect = e-104
 Identities = 201/453 (44%), Positives = 290/453 (64%), Gaps = 3/453 (0%)
 Frame = -3

Query: 1413 AKTDDGRADLAGKNIVTVTLQLCRAXXXXXXXXXXXXSFKLLRNLCAGELVNQNTFLEQG 1234
            +KTD GR+DLA K+I+   L+L +             S K+LRNLCAGE+ NQN+F++  
Sbjct: 34   SKTDSGRSDLASKSILPSILRLLQLLPYPSSRHYLNLSLKVLRNLCAGEVSNQNSFVDHD 93

Query: 1233 GVGIVSNIIASIKCLSDLDNQVIRFGLQLLGNFALGGGEHRYAVWKELFPNKLIKIAAIH 1054
            G  IVS+++ S    +  D + +RFGLQ+L N  L G + +  VW   +P + + IA I 
Sbjct: 94   GSAIVSDLLDS----AIADFETVRFGLQVLANVVLFGEKRQRDVWLRFYPERFLSIAKIR 149

Query: 1053 CMEIIDPLCMVIYTCTEGTDGLVAGLLTDQGFVIILEVIKTMSTVG-FKEHWIKLLLSKI 877
              E  DPLCM++YTC +G+  + + L + QG  II E ++T S+VG  +++W+KLL+S+I
Sbjct: 150  KRETFDPLCMILYTCVDGSSEIASELCSCQGLTIIAETLRTSSSVGSVEDYWLKLLVSRI 209

Query: 876  CLEGSHFASIFSKLYPVSSNANCGDYFADPQAFLLSILSEILNERIEDIAVDSKFPLHVL 697
            C+E  +F  +FSKLY  + N    + F+  QAFL+ ++S+I NERI  +++       +L
Sbjct: 210  CVEDGYFLKLFSKLYEDAEN----EIFSSEQAFLVRMVSDIANERIGKVSIPKDTACSIL 265

Query: 696  EILRTAAKAIDSVPRRESGLPTGHATIDVLGYSLIILRDICACDH--GLKKVQENSVDAM 523
             + R +    D V    S LPTG   +DV+GYSL+I+RD CA      LK+  ++S D +
Sbjct: 266  GLFRQSVDVFDFVSGERSELPTGSTIVDVMGYSLVIIRDACAGGRLEELKEDNKDSGDTV 325

Query: 522  XXXXXXXXXXXXXXXXXXXEPPSTIKKGMKQSKTRDEGSCCSNSSKQCPYKGFRRDIVGI 343
                               +PP+TIKK + QS      S  S+S K CPY+GFRRDIV +
Sbjct: 326  ELLLSSGLIELLLDLLSKLDPPTTIKKALNQSP-----SSSSSSLKPCPYRGFRRDIVSV 380

Query: 342  VGNCAYRRKSVQDKIREENGILLLLQQCVSDEDNPYLREWGIWSIRNLLEGNAENQRLVS 163
            +GNCAYRRK VQD+IRE +G+ L+LQQCV+D++NP+LREWG+W IRNLLEGN ENQ +V+
Sbjct: 381  IGNCAYRRKEVQDEIRERDGLFLMLQQCVTDDENPFLREWGLWCIRNLLEGNPENQEVVA 440

Query: 162  DLEMQETVDVPELTGLGLKVEIDPQTRRAKLVN 64
            +LE++ +VDVP+L  +GL+VEIDP+T R KLVN
Sbjct: 441  ELEIKGSVDVPQLREIGLRVEIDPKTARPKLVN 473

