BLASTX nr result

ID: Rauwolfia21_contig00013636 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00013636
         (1785 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI27399.3| unnamed protein product [Vitis vinifera]              203   2e-49
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   192   5e-46
ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   192   5e-46
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   181   7e-43
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   177   1e-41
ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254...   177   2e-41
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   157   2e-35
gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, ...   155   4e-35
gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, ...   155   4e-35
ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247...   139   4e-30
emb|CAN67725.1| hypothetical protein VITISV_027041 [Vitis vinifera]   138   9e-30
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   135   7e-29
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   135   7e-29
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   134   1e-28
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   134   1e-28
gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thali...   131   8e-28
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...   130   2e-27
gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus pe...   130   2e-27
ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217...   119   4e-24
ref|XP_006573175.1| PREDICTED: uncharacterized protein MAL13P1.3...   113   2e-22

>emb|CBI27399.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score =  203 bits (516), Expect = 2e-49
 Identities = 167/479 (34%), Positives = 233/479 (48%), Gaps = 25/479 (5%)
 Frame = -1

Query: 1632 STVVSKSSA---------LDNSTKSNGNDPGRSPMVGDQIPCHLYGFEKDAELLVSDVKH 1480
            ST+V KS +         LD +   +GN+  +       I C L G E+DA+ L  + + 
Sbjct: 12   STLVHKSDSKPFEYNDYSLDTAVPKSGNEIVKENQ--KVISCDLKGHERDADPLDGEDRF 69

Query: 1479 ENETQITQEAVRSMIDGQLANRNGKESKDSETSTTV----LQAFESDLNPFTDKNVLESE 1312
             N    T E   S+    +AN  G E ++S  +  V    L++FE D +  TDK+V + E
Sbjct: 70   WN----TSERDCSINVDDIANACGNEVRNSVATCVVSSEKLESFEKDGDMCTDKSVTKHE 125

Query: 1311 LPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQPSDVDRYFGTTKD 1132
            LP   VC +E+    VKDIC+DEG+ S  KIL E+ K++H G     P D D+    TK+
Sbjct: 126  LP---VCCEESTYHAVKDICIDEGMLSPEKILVENGKEEHEGFCPFLPPDTDKNVDPTKE 182

Query: 1131 P-DVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXXXXVTED-FI 958
              D E  +P    AS+     +D   +L  E+                     V ED F+
Sbjct: 183  TADKELPLPDGQKASA----ENDCGKDLMQEE---ENYDARDKIISDTSEEKIVPEDIFL 235

Query: 957  IRE-----ALLESSKCDEGKVADQPDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNS 793
            I E     ++ ESS+ +  ++  Q  + P+ EA+LE+ A+   +E+S K+   + L YNS
Sbjct: 236  IPELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESDKNSFPNELSYNS 295

Query: 792  KVEGGTITFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSPAQLAND 613
            K+E GTITFDF SS ++  S  E S ++ G E      P  E +NL+             
Sbjct: 296  KLESGTITFDFGSSTTSMDSGREVSPQNDGCE------PPLESQNLSK------------ 337

Query: 612  EEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLITYSGPI 433
                 E+ S++ P                  SG++QRG GESSFS +GP S LI+YSG I
Sbjct: 338  ----LEDGSESLPF-----------------SGQIQRGLGESSFSAAGPSSALISYSGQI 376

Query: 432  AYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
             +                SFAFPVLQTEWNSSPVRM KA     RKHR WR G+ CCRF
Sbjct: 377  THSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRGILCCRF 435


>ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum
            lycopersicum]
          Length = 554

 Score =  192 bits (487), Expect = 5e-46
 Identities = 171/582 (29%), Positives = 246/582 (42%), Gaps = 78/582 (13%)
 Frame = -1

Query: 1782 NQIGIVCGSNRYEREADRLSFSASDTLHANCSGGNPNSSAFGIEDGHDFWSTVVSKSSAL 1603
            NQ GI+  SN Y +EAD L F  +D  + N      +  A   +DG+ FW         L
Sbjct: 4    NQNGILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWEV-----PEL 57

Query: 1602 DNSTKSNGNDPGRSPMVGDQIPCHL---YGFEKDAELLVSDVKHENETQITQEAVRSMID 1432
            D+S   + ND  ++  V D     L    G  +       D+      +I   +V     
Sbjct: 58   DDSIFFDNNDEIKASNVRDNHNVDLSTINGDNRGGNPFACDIPSSETNEIVAASVTDDQT 117

Query: 1431 GQLAN-----------------RN-----------------GKESKDSETS-TTVLQAFE 1357
            G L+N                 RN                 G E+ DS++  T+  + FE
Sbjct: 118  GSLSNIIHTKRGGNPFECDTKDRNQPWNIPEYESLDFLDDKGNETIDSDSPFTSHSELFE 177

Query: 1356 SDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFV 1177
            ++ + ++DK V + EL E  VCY+EN+  IVKDIC+DEGVP+ +K+LTES KDD   + V
Sbjct: 178  NNKHFYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSV 237

Query: 1176 SQPSDVDRYFGTTKDPDVESFVPGELNASSLENN-----SHDSAYELRTEKIXXXXXXXX 1012
            S  +D +    T K  D+ S +      SS E+      +H +  E     I        
Sbjct: 238  SVDADEEHQSNTKKSVDMGSSIATVSQDSSCEDAKNIAVTHGAEIEPTGAPIPNDFNPSL 297

Query: 1011 XXXXXXXXXXXXVTEDFI----------------------------IREALLESSKCDEG 916
                          ED +                            + E+ +++S  D+ 
Sbjct: 298  ENKANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIKTSDGDQS 357

Query: 915  KVADQPDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDFNSSKSACT 736
             +  QPD++P  + +    A++   E +   G       NSK   GT  FDFN +K   T
Sbjct: 358  TL--QPDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFDFNLTKPEST 408

Query: 735  SSMEGSGEDRGAELTPEKAPEAEDENLNDHL-ASSPAQLANDEEKIKENASDTKPLRQEA 559
            ++ EG  E+   +    KA        +D++ ASS    AN        A +      E+
Sbjct: 409  TTTEGGVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFAN-------TADNAHQQHLES 461

Query: 558  PNDENGYTGNLSVSGRLQRGEGESSFSTS-GPVSGLITYSGPIAYXXXXXXXXXXXXXXX 382
             N  NG  G+ +        +GE+SFS + GP+SG ITYSGPI+Y               
Sbjct: 462  QNMANG-QGHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTST 512

Query: 381  XSFAFPVLQTEWNSSPVRMQKAR-----KHRGWRHGLFCCRF 271
             SFAFPVLQ EWNSSPVRM KA      K +GW+ GL CCRF
Sbjct: 513  RSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLLCCRF 554


>ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum
            lycopersicum]
          Length = 586

 Score =  192 bits (487), Expect = 5e-46
 Identities = 171/582 (29%), Positives = 246/582 (42%), Gaps = 78/582 (13%)
 Frame = -1

Query: 1782 NQIGIVCGSNRYEREADRLSFSASDTLHANCSGGNPNSSAFGIEDGHDFWSTVVSKSSAL 1603
            NQ GI+  SN Y +EAD L F  +D  + N      +  A   +DG+ FW         L
Sbjct: 36   NQNGILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWEV-----PEL 89

Query: 1602 DNSTKSNGNDPGRSPMVGDQIPCHL---YGFEKDAELLVSDVKHENETQITQEAVRSMID 1432
            D+S   + ND  ++  V D     L    G  +       D+      +I   +V     
Sbjct: 90   DDSIFFDNNDEIKASNVRDNHNVDLSTINGDNRGGNPFACDIPSSETNEIVAASVTDDQT 149

Query: 1431 GQLAN-----------------RN-----------------GKESKDSETS-TTVLQAFE 1357
            G L+N                 RN                 G E+ DS++  T+  + FE
Sbjct: 150  GSLSNIIHTKRGGNPFECDTKDRNQPWNIPEYESLDFLDDKGNETIDSDSPFTSHSELFE 209

Query: 1356 SDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFV 1177
            ++ + ++DK V + EL E  VCY+EN+  IVKDIC+DEGVP+ +K+LTES KDD   + V
Sbjct: 210  NNKHFYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSV 269

Query: 1176 SQPSDVDRYFGTTKDPDVESFVPGELNASSLENN-----SHDSAYELRTEKIXXXXXXXX 1012
            S  +D +    T K  D+ S +      SS E+      +H +  E     I        
Sbjct: 270  SVDADEEHQSNTKKSVDMGSSIATVSQDSSCEDAKNIAVTHGAEIEPTGAPIPNDFNPSL 329

Query: 1011 XXXXXXXXXXXXVTEDFI----------------------------IREALLESSKCDEG 916
                          ED +                            + E+ +++S  D+ 
Sbjct: 330  ENKANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIKTSDGDQS 389

Query: 915  KVADQPDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDFNSSKSACT 736
             +  QPD++P  + +    A++   E +   G       NSK   GT  FDFN +K   T
Sbjct: 390  TL--QPDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFDFNLTKPEST 440

Query: 735  SSMEGSGEDRGAELTPEKAPEAEDENLNDHL-ASSPAQLANDEEKIKENASDTKPLRQEA 559
            ++ EG  E+   +    KA        +D++ ASS    AN        A +      E+
Sbjct: 441  TTTEGGVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFAN-------TADNAHQQHLES 493

Query: 558  PNDENGYTGNLSVSGRLQRGEGESSFSTS-GPVSGLITYSGPIAYXXXXXXXXXXXXXXX 382
             N  NG  G+ +        +GE+SFS + GP+SG ITYSGPI+Y               
Sbjct: 494  QNMANG-QGHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTST 544

Query: 381  XSFAFPVLQTEWNSSPVRMQKAR-----KHRGWRHGLFCCRF 271
             SFAFPVLQ EWNSSPVRM KA      K +GW+ GL CCRF
Sbjct: 545  RSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLLCCRF 586


>ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 586

 Score =  181 bits (460), Expect = 7e-43
 Identities = 161/577 (27%), Positives = 244/577 (42%), Gaps = 73/577 (12%)
 Frame = -1

Query: 1782 NQIGIVCGSNRYEREADRLSFSASDTLHANCSGGNPNSSAFGIEDGHDFWSTVVSKSSAL 1603
            NQ GI+  SN Y +EAD L    +D  + N      +  A   +DG++FW         L
Sbjct: 42   NQNGILSHSNGY-KEADSLGIPVNDFGNTNVHDNKEDPLACDRKDGNEFWEV-----PEL 95

Query: 1602 DNSTKSNGNDPGRSPMVGDQIPCHL---YGFEKDAELLVSDVKHENETQITQEAVRSMID 1432
            D+S   + N+  ++  V D     L    G  +       D+      +I   +V    +
Sbjct: 96   DDSIFFDNNNEIKASNVRDDHNVDLSKINGDNRGGNPFACDIPSSETNEIVAASVTDDQN 155

Query: 1431 GQLAN-----RNGK------------------------ESKDSET------STTVLQAFE 1357
            G L+N     R G                         + K++ET       T+  + F+
Sbjct: 156  GGLSNIIHSKRGGNPFECDTKDRDQPWNIPEYESLGFLDDKENETIDSDSPFTSHSELFD 215

Query: 1356 SDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFV 1177
            S+ + ++DK V + ELPE  VCY+EN+  +VKDIC+DEGVP+ +K+L ES KD    + V
Sbjct: 216  SNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLIESWKDGQPSTSV 275

Query: 1176 SQPSDVDRYFGTTKDPDVESFVPGELNASSLENN-----SHDSAYELRTEKIXXXXXXXX 1012
            S  +D ++   T K  D+ S +      SS ++      +HD+  E     +        
Sbjct: 276  SVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAVTHDTEIEATGAPVPNGFNPSL 335

Query: 1011 XXXXXXXXXXXXVTEDFI-----------------------IREALLESSKCDEGKVADQ 901
                          ED +                       + E+ +++S  D+  +  Q
Sbjct: 336  ENNANKDADKDSYLEDLLMIFGSKCTTNASEKPSSLNTVVRVEESNIKTSDGDQSTL--Q 393

Query: 900  PDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEG 721
            PD++PS + +    AV+ + + + K         N K   GT  FD N +K   T + EG
Sbjct: 394  PDQVPSEQTLKSQTAVSASGQTNNKG--------NIKEGVGTSIFDVNLTKPESTKTTEG 445

Query: 720  S-GEDRGAELTPEKAPEAEDENLNDHLASSPAQLANDEEKIKENASDTKPLRQEAPNDEN 544
              G        P+     ++ N +++ ASS    AN        A +      E+ N  N
Sbjct: 446  GVGNLPEDSHMPKAVSVHKNGNSDNNSASSQVPFAN-------TADNAHQQHLESQNMAN 498

Query: 543  GYTGNLSVSGRLQRGEGESSFSTS-GPVSGLITYSGPIAYXXXXXXXXXXXXXXXXSFAF 367
            G +            +GE+SFS + GP+SG ITYSGPI+Y                SFAF
Sbjct: 499  GQS---------HFADGEASFSAARGPISGSITYSGPISYSGSVSLRSESSTTSTRSFAF 549

Query: 366  PVLQTEWNSSPVRMQKAR-----KHRGWRHGLFCCRF 271
            PVLQ EWNSSPVRM KA      K +GW+ G+ CCRF
Sbjct: 550  PVLQNEWNSSPVRMAKAERRRLSKQKGWKQGILCCRF 586


>ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum
            tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Solanum
            tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X3 [Solanum
            tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X4 [Solanum
            tuberosum]
          Length = 532

 Score =  177 bits (449), Expect = 1e-41
 Identities = 146/523 (27%), Positives = 233/523 (44%), Gaps = 32/523 (6%)
 Frame = -1

Query: 1743 READRLSFSASDTLHANCSGGNPNSSAFGIEDGHDFWSTV-VSKSSALDNSTKSNGNDPG 1567
            +++  L     D L +N   G  +S A   ++ ++FW+   +  S   ++ ++SN ++  
Sbjct: 17   KDSKSLVLPTKDLLDSNGRDGTKDSLACE-KERNEFWNVQELDDSEFFEDISRSNKHEIR 75

Query: 1566 RSPMVGDQIPC--HLYGFEKDAELLVSDVKHENETQITQEAVRSMIDGQLANRNGKESKD 1393
             SP+  D I    +L   +++      D    +      +    MI     ++  +    
Sbjct: 76   ASPLKDDPIEALSNLTSCKRNGNPFACDTADRDHPWSIPKFEDPMIVNFFDDKEKETVVS 135

Query: 1392 SETSTTVLQAFESDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILT 1213
            S   T++ + F ++ + +TDK VLE +LPE  +CY EN+  I+KDIC+DEGVP  +KI+T
Sbjct: 136  SAQFTSLSELFGTNTHLYTDKGVLEFKLPELTICYNENNYNIMKDICMDEGVPLMDKIVT 195

Query: 1212 ESDKDDHAGSFVSQPSDVDRYFGTTKDPDVESFVPGELNASSLEN------NSHDSAYEL 1051
            ES K     S +S   D  +   T +  D E    GE   SS+EN      + H +  + 
Sbjct: 196  ESRKYHQPDSSISLAVDEHQPRNTREGVDSELVSSGESKDSSVENAVKISVDHHTTKEDE 255

Query: 1050 RTEKIXXXXXXXXXXXXXXXXXXXXVTEDFI---------------IRE-----ALLESS 931
             T+ +                     + D +               I E       L+ S
Sbjct: 256  DTKSLGPNGINPFLEDNMSKYADKDSSLDVMKIFGSKDTTTAKATNISENESDIQNLKES 315

Query: 930  KCDEGKVADQPDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDFNSS 751
              D  + A Q ++IP+  A   S      ++ +  +GP SN   NSK E G IT DFN +
Sbjct: 316  NSDAEQSALQANQIPTFVAAFNSQNTVSAADGTNNNGPGSNFSNNSKSESGAITCDFNLT 375

Query: 750  KSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHL-ASSPAQLANDEEKIKENASDTKP 574
            + A +SS+  S +    +    +A  ++ +  +D   A++    AN  +    +     P
Sbjct: 376  ELALSSSVAKSDKHLPEQSHKLEAVSSQKDGSSDSFSAATQVHFANSVDSCNSSIHADPP 435

Query: 573  LRQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXX 394
                  N E   +G++ +        GE+SF   GP SGLI+YSG I +           
Sbjct: 436  ---NVANLEEKNSGSIPLGVHGHFANGEASF---GPASGLISYSGHITHSGNISLRSDSS 489

Query: 393  XXXXXSFAFPVLQTEWNSSPVRMQKA--RKHRGWRHGLFCCRF 271
                 SFAFPVLQ+EWNSSPVRM KA  R ++GWR  L CC+F
Sbjct: 490  TTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLLCCKF 532


>ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254294 [Solanum
            lycopersicum]
          Length = 532

 Score =  177 bits (448), Expect = 2e-41
 Identities = 147/536 (27%), Positives = 225/536 (41%), Gaps = 45/536 (8%)
 Frame = -1

Query: 1743 READRLSFSASDTLHANCSGGNPNSSAFGIEDGHDFWSTV----------VSKSSALDNS 1594
            +++  L     D L +N      +S A   E  ++FW+            +S+S+ L+N 
Sbjct: 16   KDSKSLVLPTKDLLDSNGRDSTKDSLACEKEK-NEFWNVQELDDSVFIEDISRSNKLENR 74

Query: 1593 TKSNGNDPGRSPMVGDQIPCHLYGFEKDAELLVSDVKHENETQITQEAVRSMIDGQLANR 1414
                 +DP       D+ P HL   +++      D    +      +    +I     ++
Sbjct: 75   ASPLKDDP-------DEAPSHLTSCKRNGNPFACDTADRDHPWSIPKFEDPIIVNFFDDK 127

Query: 1413 NGKESKDSETSTTVLQAFESDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVP 1234
              +    S   T++ + F +D + +TDK VLE ELPE  +CYKEND  I+KDIC+DEGVP
Sbjct: 128  EKETVVSSTQFTSLSELFGADTHLYTDKGVLEFELPESTICYKENDYNIMKDICMDEGVP 187

Query: 1233 SENKILTESDKDDHAGSFVSQPSDVDR--------------------------------Y 1150
              +KI+TES K D   S +S  +D  +                                +
Sbjct: 188  LMDKIVTESRKYDQPDSSISLAADEHQPRITREGVDSELVSSGESKASSVESAVKISVDH 247

Query: 1149 FGTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXXXXVT 970
              T +D   +S VP  +N    +N S D+      EK                       
Sbjct: 248  HTTKEDEGNKSLVPNGINPFLEDNMSKDA------EKDPYLDVMKIFGSKDTTMAKPTNI 301

Query: 969  EDFIIREALLESSKCDEGKVADQPDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNSK 790
             +        + S  D  + A Q +++P+      S      ++ +   GP SN   NSK
Sbjct: 302  SEKESDSQNFKESNSDADQSAQQANQMPTSVEAFNSQYTVSPADGTNNYGPGSNFSNNSK 361

Query: 789  VEGGTITFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHL-ASSPAQLAND 613
             + G IT DFN ++ A +SS+  S +    +    +A   + +  +D   A++    AN 
Sbjct: 362  SKSGAITCDFNLTELALSSSVTKSDKHLPEQSHKLEAVSGQKDGSSDSFSAATQVHFANS 421

Query: 612  EEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLITYSGPI 433
             +    +     P       ++N  +  L V G      GE+SF   GP SGLI+YSG I
Sbjct: 422  VDSSNSSTIHADPPNVANLEEKNSSSIPLGVHGHF--ANGEASF---GPASGLISYSGHI 476

Query: 432  AYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKA--RKHRGWRHGLFCCRF 271
            A+                SFAFPVLQ+EWNSSPVRM KA  R ++GWR  L CC+F
Sbjct: 477  AHSGNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLLCCKF 532


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  157 bits (396), Expect = 2e-35
 Identities = 125/412 (30%), Positives = 188/412 (45%), Gaps = 33/412 (8%)
 Frame = -1

Query: 1407 KESKDSETSTTVLQAFESDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSE 1228
            KE +    ++  +++F+ D   + DKNV+E ELPE V+CYKEN   +VKDICVDEGVPS+
Sbjct: 91   KEEEVRNFTSLKIESFDKDSVFYIDKNVMEPELPELVLCYKENTYHVVKDICVDEGVPSQ 150

Query: 1227 NKILTESDKD------------DHAGSFVSQPSDVD-------RYFGTTKDPDVESFVPG 1105
               L ++  D            D       +  D+D       +   + K    ES    
Sbjct: 151  ENFLFDTSVDQEKLCPYLIPEKDIKSEIQKERVDLDMSTQYLSKNDNSFKCDSKESMAIA 210

Query: 1104 ELNASSLENNSHDSAYELRT--EKIXXXXXXXXXXXXXXXXXXXXVTEDFIIRE-----A 946
            E+   ++E  ++ ++ E  +  E +                      E   I+       
Sbjct: 211  EIEDDAMEEIANYTSKETFSLGELLLMPEVVAELSHSKSLLNSTDEAEQLSIQRPSENIV 270

Query: 945  LLESSKCDEGKVADQPDKI--PSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTI 772
            L  +S C+E K A +   +  P+++ ++E        E   ++     L  +S  +    
Sbjct: 271  LATASACEESKYATEQFLLVTPAVDPLVE--------ESGHEEAKLGTLTSDSSPKASDH 322

Query: 771  TFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSPAQLANDEEKIKEN 592
              D     S   S      E+ GA+    K+P    ++++D  +S+P   +  EE  +  
Sbjct: 323  GHDEVILASLAPSYATEEPEN-GAKAA--KSPSHTLDSVSDLNSSAPTA-SGGEEGSQVG 378

Query: 591  ASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXX 412
             S+    R  + +++   T   S  G+LQ   GESSFS +GP+SGLI+YSGPIAY     
Sbjct: 379  GSEHLESRNSSRHEDTSITEPFS--GQLQYSHGESSFSAAGPLSGLISYSGPIAYSGSLS 436

Query: 411  XXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
                       SFAFP+LQ+EWNSSPVRM KA     RKHR WR GL CCRF
Sbjct: 437  LRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLLCCRF 488


>gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|508709686|gb|EOY01583.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
          Length = 470

 Score =  155 bits (393), Expect = 4e-35
 Identities = 141/456 (30%), Positives = 200/456 (43%), Gaps = 72/456 (15%)
 Frame = -1

Query: 1422 ANRNGKESKDSETSTTV-LQAFESDLNP--FTDKNVLESELPEFVVCYKENDIQIVKDIC 1252
            AN N KE +D  TS +  L+  +S  N   + DK+V+E ELPE VVCYKE+   +VKDIC
Sbjct: 39   ANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTYHVVKDIC 98

Query: 1251 VDEGVPSENKILTESDKDDHAG-SFVSQPSDVDRYFGTTK-DPDV----ESFVPGELNAS 1090
            +DEGVP+++K L E+  D+    +F+    + D    T K + D+     S  PGE  + 
Sbjct: 99   IDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSG 158

Query: 1089 SLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXXXXVTEDFIIREALLESSKCDEGKV 910
               +N   S  ++ T+                        +D  +    LE ++ ++G  
Sbjct: 159  KDIDNECGSNKKVDTD---------------------TCMQDVSLS---LEKNESNKGIP 194

Query: 909  ADQPDKIPSIEAILESLAVAFTSEDSRKD----GPASNLCYNSKVEGGTITFD------- 763
                 K   +  +++  A+   ++D  K+    G   ++   SKV    ++ D       
Sbjct: 195  NQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIE 254

Query: 762  ---FNSSK-----------SACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSPAQ 625
               F SS            SA   S + + E   +      A E  D    + +  SPAQ
Sbjct: 255  QQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQ 314

Query: 624  LANDEEKIK-------------ENASDTKPLRQEAPNDE--------------NGYTGNL 526
            ++  EE                E  S T  L   AP                  G T  L
Sbjct: 315  VSTSEESTSSSLVNEVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKL 374

Query: 525  ------SVSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXXXXXXXSFAFP 364
                  S+S  LQ+G GESSFS +G V+GLI+YSGP+AY                SFAFP
Sbjct: 375  EVAADQSISNNLQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 434

Query: 363  VLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
            +LQ+EWN SPVRM KA     RKH+GWRHGL CCRF
Sbjct: 435  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  155 bits (393), Expect = 4e-35
 Identities = 141/456 (30%), Positives = 200/456 (43%), Gaps = 72/456 (15%)
 Frame = -1

Query: 1422 ANRNGKESKDSETSTTV-LQAFESDLNP--FTDKNVLESELPEFVVCYKENDIQIVKDIC 1252
            AN N KE +D  TS +  L+  +S  N   + DK+V+E ELPE VVCYKE+   +VKDIC
Sbjct: 96   ANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTYHVVKDIC 155

Query: 1251 VDEGVPSENKILTESDKDDHAG-SFVSQPSDVDRYFGTTK-DPDV----ESFVPGELNAS 1090
            +DEGVP+++K L E+  D+    +F+    + D    T K + D+     S  PGE  + 
Sbjct: 156  IDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMSPGENQSG 215

Query: 1089 SLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXXXXVTEDFIIREALLESSKCDEGKV 910
               +N   S  ++ T+                        +D  +    LE ++ ++G  
Sbjct: 216  KDIDNECGSNKKVDTD---------------------TCMQDVSLS---LEKNESNKGIP 251

Query: 909  ADQPDKIPSIEAILESLAVAFTSEDSRKD----GPASNLCYNSKVEGGTITFD------- 763
                 K   +  +++  A+   ++D  K+    G   ++   SKV    ++ D       
Sbjct: 252  NQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIE 311

Query: 762  ---FNSSK-----------SACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSPAQ 625
               F SS            SA   S + + E   +      A E  D    + +  SPAQ
Sbjct: 312  QQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQ 371

Query: 624  LANDEEKIK-------------ENASDTKPLRQEAPNDE--------------NGYTGNL 526
            ++  EE                E  S T  L   AP                  G T  L
Sbjct: 372  VSTSEESTSSSLVNEVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKL 431

Query: 525  ------SVSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXXXXXXXSFAFP 364
                  S+S  LQ+G GESSFS +G V+GLI+YSGP+AY                SFAFP
Sbjct: 432  EVAADQSISNNLQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 491

Query: 363  VLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
            +LQ+EWN SPVRM KA     RKH+GWRHGL CCRF
Sbjct: 492  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527


>ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247891 [Vitis vinifera]
          Length = 229

 Score =  139 bits (350), Expect = 4e-30
 Identities = 91/228 (39%), Positives = 120/228 (52%), Gaps = 5/228 (2%)
 Frame = -1

Query: 939 ESSKCDEGKVADQPDKIPSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDF 760
           ESS+ +  ++  Q  + P+ EA+LE+ A+   +E+S K+   + L YNSK+E GTITFDF
Sbjct: 41  ESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESDKNSFPNELSYNSKLESGTITFDF 100

Query: 759 NSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSPAQLANDEEKIKENASDT 580
            SS ++  S  E S ++ G E      P  E +NL+                  E+ S++
Sbjct: 101 GSSTTSMDSGREVSPQNDGCE------PPLESQNLSK----------------LEDGSES 138

Query: 579 KPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXX 400
            P                  SG++QRG GESSFS +GP S LI+YSG I +         
Sbjct: 139 LPF-----------------SGQIQRGLGESSFSAAGPSSALISYSGQITHSGNISLRSD 181

Query: 399 XXXXXXXSFAFPVLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
                  SFAFPVLQTEWNSSPVRM KA     RKHR WR G+ CCRF
Sbjct: 182 SSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRGILCCRF 229


>emb|CAN67725.1| hypothetical protein VITISV_027041 [Vitis vinifera]
          Length = 745

 Score =  138 bits (347), Expect = 9e-30
 Identities = 143/490 (29%), Positives = 213/490 (43%), Gaps = 56/490 (11%)
 Frame = -1

Query: 1662 FGIEDGHDFW--STVVSKSSA---------LDNSTKSNGNDPGRSPMVGDQIPCHLYGFE 1516
            +GI D    +  ST+V KS +         LD +   +GN+  +       I C L G E
Sbjct: 306  YGIPDSEPVFCHSTLVXKSDSKPFEYNDYSLDTAVPKSGNEIVKENQ--KVISCDLKGHE 363

Query: 1515 KDAELLVSDVKHENETQITQEAVRSMIDGQLANRNGKESKDSETSTTV----LQAFESDL 1348
            +DA+ L  + +  N    T E   S+    + N  G E ++S  +  V    L++FE D 
Sbjct: 364  RDADPLDGEDRFWN----TSEXDCSINVDDIVNACGNEVRNSVATCAVSSEKLESFEKDG 419

Query: 1347 NPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQP 1168
            +  TDK+V + ELP   VC +E+   +VKDIC+DEG+ S  KIL E+ K++H G     P
Sbjct: 420  DMCTDKSVTKHELP---VCCEESTYHVVKDICIDEGMFSPEKILVENGKEEHEGFCTFLP 476

Query: 1167 SDVDRYFGTTKD-PDVESFVPGELNASS-----LENNSHDSAYELRTEKIXXXXXXXXXX 1006
             D D+    TK+  D E  +P    AS+      ++N+  S  +L  E+           
Sbjct: 477  PDTDKNVDPTKETADKELPLPDGQKASAENDCGKDDNNLCSHKDLMQEE--ENYDARDKI 534

Query: 1005 XXXXXXXXXXVTEDFIIRE-----ALLESSKCDEGKVADQPDKIPSIE------------ 877
                        + F+I E     ++ ESS+ +  ++  Q  + PS E            
Sbjct: 535  ISETSEEKIVPEDIFLIPELSKENSMPESSEFNGMEIEHQCIQNPSGEAFHLIGCFSLLS 594

Query: 876  ------------------AILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDFNSS 751
                              A+LE+ A+   +E+S K+   + L YNSK+E GTITFDF SS
Sbjct: 595  HFEYDVDVMVLVQNPNGEAVLENPALVSEAEESDKNSFPNELSYNSKLESGTITFDFGSS 654

Query: 750  KSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSPAQLANDEEKIKENASDTKPL 571
             ++  S  E S ++ G E      P  E +NL+                  E+ S++ P 
Sbjct: 655  TTSMDSGREVSPQNDGCE------PPLESQNLSK----------------LEDGSESLPF 692

Query: 570  RQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXXX 391
                             SG++QRG GESSFS +GP S LI+YSG I +            
Sbjct: 693  -----------------SGQIQRGLGESSFSAAGPSSALISYSGQITHSGNISLRSDSST 735

Query: 390  XXXXSFAFPV 361
                SFAFPV
Sbjct: 736  TSTRSFAFPV 745


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  135 bits (339), Expect = 7e-29
 Identities = 127/434 (29%), Positives = 179/434 (41%), Gaps = 19/434 (4%)
 Frame = -1

Query: 1515 KDAELLVSDVKHENETQITQEAVRSMIDGQLANRNGKESKDS--ETSTTVLQAFESDLNP 1342
            +DAEL V +   +N   + +    +    +  N  GK+ +D+  +    V    + D   
Sbjct: 31   EDAELKVPE-NGKNNNNVCELFYDTRSGEEWENEAGKKVRDTSHDCDANVDSPEKKDPVF 89

Query: 1341 FTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQPSD 1162
            + DKNV   +LPE VVCYKEN   IVKDICVDEGVP + K L   +KD    S       
Sbjct: 90   YMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVPVQEKFLF-GEKDSVKSSSTEDLMK 148

Query: 1161 VDRYFGTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXX 982
             D+   T  +P         ++        +D   +   E+                   
Sbjct: 149  ADK---TNVNPSETKSAEDSISKVDDSEFCNDHKTDRDVEE-----SSGEDFADAEGTSS 200

Query: 981  XXVTEDFIIREALLESSKCDEGKVADQPDKIPSIEAILE---------SLAVAFTSEDSR 829
                E  I+ E +  S          +PD+    E  +          +L    + ED +
Sbjct: 201  NYNQEHLIVTEEVKASPTHGLSPSEIEPDENSKDEVAISQDNDSKECLTLGDILSREDEQ 260

Query: 828  KDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLND 649
            K     N+  +S  E           +S  T+++E        EL   + P+  +E L+ 
Sbjct: 261  KSLNQDNISSDSHEEQSPSQLQDKEKRSLETTAIE-------TELEKTEEPKQGEEKLSS 313

Query: 648  ---HLASSPAQLANDEEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFS 478
                 +  P +  N+ EK      +T+   Q+    EN Y  +   S R     GE+SFS
Sbjct: 314  VSTTTSQEPNKTCNEPEK-----PETENHHQQNCLVENSYEDDKFSSSRF----GETSFS 364

Query: 477  TSGPVS--GLITYSGPIAYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKARKHR 304
             +  VS  G ITYSGPIAY                SFAFP+LQ+EWNSSPVRM KA K R
Sbjct: 365  AADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRR 424

Query: 303  ---GWRHGLFCCRF 271
               GWRH L CCRF
Sbjct: 425  QKGGWRHTLLCCRF 438


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score =  135 bits (339), Expect = 7e-29
 Identities = 127/434 (29%), Positives = 179/434 (41%), Gaps = 19/434 (4%)
 Frame = -1

Query: 1515 KDAELLVSDVKHENETQITQEAVRSMIDGQLANRNGKESKDS--ETSTTVLQAFESDLNP 1342
            +DAEL V +   +N   + +    +    +  N  GK+ +D+  +    V    + D   
Sbjct: 31   EDAELKVPE-NGKNNNNVCELFYDTRSGEEWENEAGKKVRDTSHDCDANVDSPEKKDPVF 89

Query: 1341 FTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQPSD 1162
            + DKNV   +LPE VVCYKEN   IVKDICVDEGVP + K L   +KD    S       
Sbjct: 90   YMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVPVQEKFLF-GEKDSVKSSSTEDLMK 148

Query: 1161 VDRYFGTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXX 982
             D+   T  +P         ++        +D   +   E+                   
Sbjct: 149  ADK---TNVNPSETKSAEDSISKVDDSEFCNDHKTDRDVEE-----SSGEDFADAEGTSS 200

Query: 981  XXVTEDFIIREALLESSKCDEGKVADQPDKIPSIEAILE---------SLAVAFTSEDSR 829
                E  I+ E +  S          +PD+    E  +          +L    + ED +
Sbjct: 201  NYNQEHLIVTEEVKASPTHGLSPSEIEPDENSKDEVAISQDNDSKECLTLGDILSREDEQ 260

Query: 828  KDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLND 649
            K     N+  +S  E           +S  T+++E        EL   + P+  +E L+ 
Sbjct: 261  KSLNQDNISSDSHEEQSPSQLQDKEKRSLETTAIE-------TELEKTEEPKQGEEKLSS 313

Query: 648  ---HLASSPAQLANDEEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFS 478
                 +  P +  N+ EK      +T+   Q+    EN Y  +   S R     GE+SFS
Sbjct: 314  VSTTTSQEPNKTCNEPEK-----PETENHHQQNCLVENSYEDDKFSSSRF----GETSFS 364

Query: 477  TSGPVS--GLITYSGPIAYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKARKHR 304
             +  VS  G ITYSGPIAY                SFAFP+LQ+EWNSSPVRM KA K R
Sbjct: 365  AADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRR 424

Query: 303  ---GWRHGLFCCRF 271
               GWRH L CCRF
Sbjct: 425  QKGGWRHTLLCCRF 438


>ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella]
            gi|482559818|gb|EOA24009.1| hypothetical protein
            CARUB_v10017222mg [Capsella rubella]
          Length = 455

 Score =  134 bits (337), Expect = 1e-28
 Identities = 115/409 (28%), Positives = 165/409 (40%), Gaps = 25/409 (6%)
 Frame = -1

Query: 1422 ANRNGKESKDSETSTTVLQAFESDLNP--FTDKNVLESELPEFVVCYKENDIQIVKDICV 1249
            A+  GK+++D+             +NP  + DKNV   +LPE VVCYKEN   +VKDICV
Sbjct: 77   ADEAGKKTRDTSHDFVAKGDSPEKVNPVFYMDKNVTACDLPEIVVCYKENSYHVVKDICV 136

Query: 1248 DEGVPSENKILTESDKDDHAGSFVSQPSDVDRYFGTTKDPDVESFVPGELNASSLENNSH 1069
            DEGVP + K L        + +  +    VD         D     P E  +    N+  
Sbjct: 137  DEGVPVQEKFLFGEKDSVKSTTNSNHCGSVD-----LMKVDKTDVKPSETKSLEDSNSKV 191

Query: 1068 DSAYELRTEKIXXXXXXXXXXXXXXXXXXXXVTEDFIIREALLESSKCDEGKVADQPDKI 889
            D + E+  +K                       ++ +I  +   + K  E  +  + ++I
Sbjct: 192  DDSSEVCNDKTVQDVEESSREAFADAEGSSNYDQEHLIVTSPTLALKPSEISLEVESEEI 251

Query: 888  PSIEAILESLAVAFTSEDSRKDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGED 709
               E ++ S    F SE                    ++T     S+     S++    +
Sbjct: 252  SKDEVVISS--EDFLSE--------------------SLTLGDILSREDKQKSLKNDNGN 289

Query: 708  RGAELTPEKAPEAEDENLNDHLASSPAQLANDEEKIKENASDTKPLRQEAPNDE------ 547
            R  EL+P +  E E  +L      +  +   + +  +EN S       + PN        
Sbjct: 290  RPEELSPPQHQEKEKRSLETTGLDTKLEKVEEPKTAEENLSSASTTTVQEPNKSCNDLEK 349

Query: 546  ------------NGYTGNLSVSGRLQRGEGESSFST--SGPVSGLITYSGPIAYXXXXXX 409
                        N Y  +   S R     GE+SFS   S  +SG ITYSGPIAY      
Sbjct: 350  PETENHQQNRLVNSYEDDKLSSSRF----GETSFSAAESVSISGHITYSGPIAYSGSLSV 405

Query: 408  XXXXXXXXXXSFAFPVLQTEWNSSPVRMQKARKHR---GWRHGLFCCRF 271
                      SFAFP+LQ+EWNSSPVRM KA K R   GWRH L CC+F
Sbjct: 406  RSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLLCCKF 454


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  134 bits (337), Expect = 1e-28
 Identities = 130/431 (30%), Positives = 176/431 (40%), Gaps = 18/431 (4%)
 Frame = -1

Query: 1515 KDAELLVSDVKHENETQITQEAVRSMIDGQLANRNGKESKD--SETSTTVLQAFESDLNP 1342
            +DAEL V +   +N   + +    +    +  N  GK+ +D   +    V    + D   
Sbjct: 31   EDAELKVPE-NGKNNNNVCELFYDTRSGDEWDNEAGKKVRDISHDCDANVDSPDKKDPVF 89

Query: 1341 FTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQPSD 1162
            + DKNV   +LPE VVCYKEN   +VKDICVDEGVP + K L   +KD    S     + 
Sbjct: 90   YMDKNVTACDLPEIVVCYKENTYHVVKDICVDEGVPVQEKFLF-GEKDSVKSSSTEDLTK 148

Query: 1161 VDRYFGTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXX 982
             D+   T  +P          +A        DS +    +                    
Sbjct: 149  ADK---TNVNPSESK------SAEDSNTKVDDSEFCNNCKTDRDVEESSREDFADAEGSS 199

Query: 981  XXVTEDFIIREALLESSKCDEGKVADQPDKIPSIEAILES---------LAVAFTSEDSR 829
                E  I+ E    S          +PD+  + E  + S         L    + ED +
Sbjct: 200  AYNQEHLIVTEEAKASPSHGLNPSEIEPDENSNDEVAISSETDSKESLTLGDILSREDEQ 259

Query: 828  KDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGEDRGAELTP--EKAPEAEDENL 655
            K     N+  +S  E           +S  T+++E   E +  E  P  EK P A    L
Sbjct: 260  KSLNHGNISSDSHEEQSPSQLQDKEKRSLETAAIETELE-KTEEPKPVEEKLPSASTTTL 318

Query: 654  NDHLASSPAQLANDEEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFST 475
             +     P +  ND EK      +T+   Q+    EN Y  +   S R     GE+SFS 
Sbjct: 319  QE-----PNKTCNDPEK-----PETENHHQQNSLVENSYEDDKLSSSRF----GETSFSA 364

Query: 474  --SGPVSGLITYSGPIAYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKARKHR- 304
              S  +SG ITYSGPIAY                SFAFP+LQ+EWNSSPVRM KA K R 
Sbjct: 365  AESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRRQ 424

Query: 303  --GWRHGLFCC 277
              GWRH L CC
Sbjct: 425  KGGWRHTLLCC 435


>gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thaliana]
            gi|41059759|gb|AAR99354.1| hypothetical protein At2g03810
            [Arabidopsis thaliana]
          Length = 439

 Score =  131 bits (330), Expect = 8e-28
 Identities = 125/434 (28%), Positives = 177/434 (40%), Gaps = 19/434 (4%)
 Frame = -1

Query: 1515 KDAELLVSDVKHENETQITQEAVRSMIDGQLANRNGKESKDS--ETSTTVLQAFESDLNP 1342
            +DAEL V +   +N   + +    +    +  N  GK+ +D+  +    V    + D   
Sbjct: 31   EDAELKVPE-NGKNNNNVCELFYDTRSGEEWENEAGKKVRDTSHDCDANVDSPEKKDPVF 89

Query: 1341 FTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQPSD 1162
            + DKNV   +LPE V CYKEN   IVKDICVDE VP + K L   +KD    S       
Sbjct: 90   YMDKNVTACDLPEIVACYKENTYHIVKDICVDESVPVQEKFLF-GEKDSVKSSSTEDLMK 148

Query: 1161 VDRYFGTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXXXXXXXX 982
             D+   T  +P         ++        +D   +   E+                   
Sbjct: 149  ADK---TNVNPSETKSAEDSISKVDDSEFCNDHKTDRDVEE-----SSGEDFADAEGTSS 200

Query: 981  XXVTEDFIIREALLESSKCDEGKVADQPDKIPSIEAILE---------SLAVAFTSEDSR 829
                E  I+ E +  S          +PD+    E  +          +L    + ED +
Sbjct: 201  NYNQEHLIVTEEVXASPTHGLSPSEIEPDENSKDEVAISQDNDSKECLTLGDILSREDEQ 260

Query: 828  KDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLND 649
            K     N+  +S  E           +S  T+++E        EL   + P+  +E L+ 
Sbjct: 261  KSLNQDNISSDSHEEQSPSQLQDKEKRSLETTAIE-------TELEKTEEPKQGEEKLSS 313

Query: 648  ---HLASSPAQLANDEEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFS 478
                 +  P +  N+ EK      +T+   Q+    EN Y  +   S R     GE+SFS
Sbjct: 314  VSTTTSQEPNKTCNEPEK-----PETENHHQQNCLVENSYEDDKFSSSRF----GETSFS 364

Query: 477  TSGPVS--GLITYSGPIAYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKARKHR 304
             +  VS  G ITYSGPIAY                SFAFP+LQ+EWNSSPVRM KA K R
Sbjct: 365  AADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRR 424

Query: 303  ---GWRHGLFCCRF 271
               GWRH L CCRF
Sbjct: 425  QKGGWRHTLLCCRF 438


>ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa]
            gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly
            protein gar2 [Populus trichocarpa]
          Length = 486

 Score =  130 bits (327), Expect = 2e-27
 Identities = 125/464 (26%), Positives = 191/464 (41%), Gaps = 70/464 (15%)
 Frame = -1

Query: 1452 AVRSMIDGQLANRNGKESK----------DSETSTTVLQAFESDLNPFTDKNVLESELPE 1303
            +V+ + +G+ ++ NG E             S  S+  ++ FE  +  + DK+V+  E+PE
Sbjct: 28   SVKEIENGKFSDLNGMEGDADRLPNVAPVPSPHSSLKMEPFEESVF-YMDKSVMVREVPE 86

Query: 1302 FVVCYKENDIQIVKDICVDEGVPSENKILTESD------------KDDHAGSFVSQPSDV 1159
             +VCYKEN    VKDICVDEGVP ++K L ++D            + D     V + SD+
Sbjct: 87   LIVCYKENTYH-VKDICVDEGVPLQDKFLFDTDAHKKNMCEFLPSERDMNNEMVKEKSDL 145

Query: 1158 DRYF-------GTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXXXXXXXXXXXX 1000
            D             ++ D+   VP  L +S  + + HD + +   + +            
Sbjct: 146  DMLIPEMLKSSSEKQNVDLHLPVPDVLISSEEKGSKHDLSLDCDPKHLMPTEEVMDYGTK 205

Query: 999  XXXXXXXXVTEDFIIREALLES---SKCDEG--------KVADQPDKIPSIEAILESLAV 853
                      E   +R+ L  S   +KC           KV  Q    P   AILE+ + 
Sbjct: 206  KVTDNASK--EILSLRDLLSMSELGAKCTPANASYHNMDKVEQQSLLCPRENAILETDSA 263

Query: 852  AFTSEDSRKDG-------------PASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGE 712
            +  SE   ++              P  +  Y     G T     + + ++     + S E
Sbjct: 264  SEESEHCGEETISDNGLESATLAIPTQDPAYQEGDHGHTEAVLVSPTLTSAAEESD-SKE 322

Query: 711  DRGAELTPEKAPEAEDENLNDHLASSPAQLANDEEKIKENASDTKPLRQEAPNDENGYTG 532
             + A    +   E     + D L  +            ++++     R+   N E+   G
Sbjct: 323  TKLASHALDSFSEGSTSRIEDELPYNSKTETRSISFDNDSSAPAASARESPQNGESQRLG 382

Query: 531  NLSVS------------GRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXXXX 388
               VS            G+LQ  +GESSFS+SGP+ GL ++SGPIAY             
Sbjct: 383  TRIVSRFEDPNAERLSGGQLQYADGESSFSSSGPLFGLTSHSGPIAYSGSVSLRSDSSTT 442

Query: 387  XXXSFAFPVLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
               SFAFP+LQ+EWNSSP RM KA     +K R W  GL CCRF
Sbjct: 443  STRSFAFPILQSEWNSSPARMAKADRRHFQKPRKWMQGLLCCRF 486


>gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica]
          Length = 499

 Score =  130 bits (326), Expect = 2e-27
 Identities = 127/463 (27%), Positives = 189/463 (40%), Gaps = 92/463 (19%)
 Frame = -1

Query: 1383 STTVLQAFESDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESD 1204
            S+  L+A E + + + DK+V+E ELPE +VCYKE+    +KDIC+DEGVPS++K   E+ 
Sbjct: 42   SSEKLEALEKESDYYMDKSVMECELPELIVCYKESSCNTIKDICIDEGVPSQDKNRFETG 101

Query: 1203 KDD-HAGSFVSQPSDVDRYFGTTKDPDVESFVPGELNASSLENNSH-------------- 1069
             D+    +F+S   D ++     +  D+   +P    +S+ ++                 
Sbjct: 102  VDEKECCTFLSPDEDQNKQL-LEEQMDIVVTLPDGFKSSAHDDLEKGFVIPCDSKGLTQI 160

Query: 1068 -DSAYELRTEKIXXXXXXXXXXXXXXXXXXXXVTEDFIIREALLESSKCDEGKVADQPDK 892
             D+ Y  + +                             + +  ES++  +  V    +K
Sbjct: 161  GDAIYYTQEKTEIEVSKEIFFPANVLPMQELGAGNAHSSKSSNEESTEAVQDTVQSSGEK 220

Query: 891  IPSIEAILESLAVAFTSEDSRKDGPA------------SNLCYNSKVEGGTITFDFN--- 757
            +  I     +  V+ T E S  +  A              L  NSKVE G+ T   +   
Sbjct: 221  VSEIAQTGSTAVVSVTEESSHSEKKALVSAAEESNFHVDELSNNSKVENGSTTSGLSDTS 280

Query: 756  ----SSKSAC------------TSSMEGSGEDRG-----AELTPEK-----AP------E 673
                +++ AC            T      G+D       AE+ P +     AP      E
Sbjct: 281  VHVSTTRDACPDNDVHKHFETQTMPAGDDGDDNDDNMPDAEIVPSQVQPCSAPVVTGREE 340

Query: 672  AEDENLNDHLASSPAQLANDE--------EKIKENASDTKPLRQEAPNDENGY-----TG 532
              +  +   L +S     +DE         +++  ++     R+E P  ENG      T 
Sbjct: 341  CPENGVCQPLDTSSTSKVDDEIPHSVIVSSQVQHYSAPVTISREERP--ENGVWQCPETS 398

Query: 531  NLSVSG-----------RLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXXXXX 385
            N  + G            +QRG GESSFS +G  S L+  SGP  Y              
Sbjct: 399  NAFMVGDVNSDTQYASFHVQRGFGESSFSAAGHFSSLMNTSGP--YSGNVSLRSESSTTS 456

Query: 384  XXSFAFPVLQTEWNSSPVRMQKA-----RKHRGWRHGLFCCRF 271
              SFAFPVLQ+EWNSSPVRM KA     RKHRGW H L CCRF
Sbjct: 457  TRSFAFPVLQSEWNSSPVRMAKADRRHLRKHRGWGHSLLCCRF 499


>ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217989 [Cucumis sativus]
            gi|449523672|ref|XP_004168847.1| PREDICTED:
            uncharacterized protein LOC101224727 [Cucumis sativus]
          Length = 431

 Score =  119 bits (298), Expect = 4e-24
 Identities = 108/425 (25%), Positives = 187/425 (44%), Gaps = 23/425 (5%)
 Frame = -1

Query: 1476 NETQITQEAVRSMIDGQ--LANRNGKESKD--SETSTTVLQAF--ESDLNP---FTDKNV 1324
            N+  + Q +  S  DG   +  +  + S D   + +   + AF   S++ P   + DK+V
Sbjct: 51   NQENVVQSS-HSSCDGNSCMITKINRSSTDVFDDNNAEGISAFGASSNMKPSFSYVDKSV 109

Query: 1323 LESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFVSQPSDVDRYFG 1144
            +E ++ + +VC +E ++  VKDIC+D+GV S      +S  +         P + DR  G
Sbjct: 110  MECQMSKTIVCDQEVNVNDVKDICIDDGVASLENFFFKSTAEKSISKI--SPLEEDRNEG 167

Query: 1143 TTKDPDVES----FVPGELNAS-----SLENNSHDSAYELRTEKIXXXXXXXXXXXXXXX 991
            + K+ +  S    F+  +   S     +++  +H+ A +L T+                 
Sbjct: 168  SIKEKETSSEVSKFIADDRKVSLEDHFAMDWTTHNDAKDL-TQIEEEKLNLSEPELLMQK 226

Query: 990  XXXXXVTEDFIIREALLESSKCDEGKVADQPDKIPSIEAILESLAVAFTSEDSRKDGPAS 811
                  + + + +  L  S   ++  + D      S+++  ++ A+   +E  + + PA 
Sbjct: 227  LVKRSYSSESLDKIGLQISG--EKTNLEDPSSASKSVDSCNDTPALDSAAEPPKDNIPAH 284

Query: 810  NLCYNSKVEGGTITFDFNSSKSACTSSMEGSGEDRGAELTPEKAPEAEDENLNDHLASSP 631
               YN + E G+I   FNS      S +   GE+R      ++     D  +   + ++ 
Sbjct: 285  PSGYNDEFENGSIALTFNS-----ISPVANGGEER------QECCGRSDSVIGTQVLTN- 332

Query: 630  AQLANDEEKIKENASDTKPLRQEAPNDENGYTGNLSVSGRLQRGEGESSFSTSGPVSGLI 451
                     ++   SD++ L  +  +D                  GESSFS   P++ L+
Sbjct: 333  ---------LEYRTSDSRLLSSQNMHDI-----------------GESSFSAVDPLASLV 366

Query: 450  TYSGPIAYXXXXXXXXXXXXXXXXSFAFPVLQTEWNSSPVRMQKA-----RKHRGWRHGL 286
            TYSGP+AY                SFAFP+LQ+EWNSSPV+M KA     RK+RGWR GL
Sbjct: 367  TYSGPVAYSGSISLRSESSTTSTRSFAFPILQSEWNSSPVKMVKAERRHYRKYRGWREGL 426

Query: 285  FCCRF 271
             CC+F
Sbjct: 427  LCCKF 431


>ref|XP_006573175.1| PREDICTED: uncharacterized protein MAL13P1.304-like isoform X5
            [Glycine max]
          Length = 484

 Score =  113 bits (283), Expect = 2e-22
 Identities = 103/390 (26%), Positives = 163/390 (41%), Gaps = 28/390 (7%)
 Frame = -1

Query: 1356 SDLNPFTDKNVLESELPEFVVCYKENDIQIVKDICVDEGVPSENKILTESDKDDHAGSFV 1177
            + ++ + DK V E E P   VCYKE++  +VKDICVDEGV +++K++  +  D+ A +F 
Sbjct: 129  NSVDGYMDKTVTECE-PHLEVCYKESNYHVVKDICVDEGVLNKDKVMFVNTVDEKAHNFF 187

Query: 1176 SQPS-----------DVDRYFGTTKDPDVESFVPGELNASSLENNSHDSAYELRTEKIXX 1030
               S            ++    T  +    +F P E      +N S +      TE+   
Sbjct: 188  HSESYENKEKQKDNISINVLSLTPTEEKAHNFFPSESKEKQKDNTSINVLSLTPTEESDE 247

Query: 1029 XXXXXXXXXXXXXXXXXXVTEDFIIREALLESSKCDEGKVADQPDKIPSIEAILESLAVA 850
                                          ++++   G V  +   +P  + +L+ L   
Sbjct: 248  VHANHDQPKGLMHKDG--------------DATEKISGNVNKEMKPLPEDKVLLQDLL-- 291

Query: 849  FTSEDSRKDGPASNLCYNSKVEGGTITFDFNSSKSACTSSMEGSGED-RGAELTPEKAPE 673
              +EDS           +S  +G  I     S++    S  EGS      A L       
Sbjct: 292  --TEDS----------VSSDDKGEQI-----SNEPELHSQSEGSKNTVEEAILESPSLAL 334

Query: 672  AEDENLNDHLASSPAQLANDEEKIKENASDTKPLRQ----------EAPNDENGYTGNLS 523
            A+DE+ ND++ S      +  +  + +    +   Q          +      G + + +
Sbjct: 335  ADDESNNDNMLSEKESSTHQLDPSRPSDCGKEECHQAGVCKCDEIQQTMKPVEGKSDDQA 394

Query: 522  VSGRLQRGEGESSFSTSGPVSGLITYSGPIAYXXXXXXXXXXXXXXXXSFAFPVLQTEWN 343
            V+G +    GE+SFS+ GP+SG I+YSGP+ Y                SFAFP++Q+EWN
Sbjct: 395  VTGHIHHSLGEASFSSIGPMSGRISYSGPVPYSGSISLRSDSSTTSTRSFAFPIIQSEWN 454

Query: 342  SSPVRMQKA-RKHRG-----WRHGLFCCRF 271
            SSPVRM KA RKH       WR G  CC+F
Sbjct: 455  SSPVRMAKADRKHFRKQRWCWRDGFLCCKF 484


Top