BLASTX nr result

ID: Catharanthus22_contig00008042 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00008042
         (1915 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   197   2e-47
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   176   4e-41
emb|CBI27399.3| unnamed protein product [Vitis vinifera]              173   3e-40
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   172   4e-40
ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254...   172   4e-40
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   167   1e-38
gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, ...   149   4e-33
gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, ...   149   4e-33
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   142   5e-31
ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217...   140   3e-30
gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus pe...   134   1e-28
ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like i...   127   2e-26
ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like i...   127   2e-26
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...   125   8e-26
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   124   1e-25
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   122   5e-25
gb|ESW25465.1| hypothetical protein PHAVU_003G038300g [Phaseolus...   119   5e-24
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   119   6e-24
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   119   6e-24
ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247...   118   1e-23

>ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum
            lycopersicum]
          Length = 586

 Score =  197 bits (500), Expect = 2e-47
 Identities = 183/611 (29%), Positives = 267/611 (43%), Gaps = 88/611 (14%)
 Frame = +2

Query: 110  MFASQLLRILQTIPTDVDHETCDFNTDKPKLSMMEDKIGIVCDSNAYGKETDRLSFSASD 289
            MFASQLLR+L+T+P D   E+  F+T + K +M  ++ GI+  SN Y KE D L F  +D
Sbjct: 1    MFASQLLRLLETLPADTSSESHVFDTAEEKPTMNGNQNGILGHSNGY-KEADALGFPVND 59

Query: 290  PLHANGHGGNSNSLVFGIQDEHDFW-------------NSAVFKSSLLDD-----STRSN 415
              + N H    + L    +D + FW             N  +  S++ D+     ST + 
Sbjct: 60   FGNTNVHDNREDPLACDRKDGNKFWEVPELDDSIFFDNNDEIKASNVRDNHNVDLSTING 119

Query: 416  DNEPGGSP----VDHLNGFEIDAES---------------------FLFDTRDKNGAQIT 520
            DN  GG+P    +      EI A S                     F  DT+D+N     
Sbjct: 120  DNR-GGNPFACDIPSSETNEIVAASVTDDQTGSLSNIIHTKRGGNPFECDTKDRNQPWNI 178

Query: 521  EEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESN 700
             E  +  +D     G E    DS  ++    FE++   ++DK V + EL +  VCY+E+N
Sbjct: 179  PE--YESLDFLDDKGNETIDSDSPFTSHSELFENNKHFYSDKGVTDHELSELTVCYRENN 236

Query: 701  TPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFK 880
               VKDIC+DEGV   +K+L ES KD+   + VS  +DE+    T    D+   +    +
Sbjct: 237  FNIVKDICMDEGVPAVDKVLTESWKDDQLSTSVSVDADEEHQSNTKKSVDMGSSIATVSQ 296

Query: 881  VSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXA---CVSEELILQKALLECSKC 1051
             SS E  ++  A   GAE I+                 A      +  +    ++  SKC
Sbjct: 297  DSSCEDAKN-IAVTHGAE-IEPTGAPIPNDFNPSLENKANKDADKDSYLEDLLMIFGSKC 354

Query: 1052 ------------------------------DEDKVSPQPDEVPCLESVLESLAVAFTSEQ 1141
                                          D D+ + QPD+VP  +++    A++   E 
Sbjct: 355  TTNGKTTNASEKPSSPNTVVRVEESNIKTSDGDQSTLQPDQVPFDQTLKSQTAISAADES 414

Query: 1142 SKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTG-----KAPETEDEKP 1306
            +   G       NSK   GT  FDFN +KP  + + +   E         KA        
Sbjct: 415  NNNKG-------NSKEGAGTNIFDFNLTKPESTTTTEGGVENLPEDSHKPKAVSVHKNGN 467

Query: 1307 SDHF-ASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFS 1483
            SD+  ASS     N      +N   + L  ++ +N +    G+F+        DGE+SFS
Sbjct: 468  SDNISASSQVPFANTA----DNAHQQHLESQNMANGQ----GHFA--------DGEASFS 511

Query: 1484 TS-GPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKAR---- 1648
             + GP+SG ITYSGPI+YSG++             FAFP+LQNEWNSSPVRM KA     
Sbjct: 512  AARGPISGSITYSGPISYSGSLSLRSESSTTSTRSFAFPVLQNEWNSSPVRMAKAERRRL 571

Query: 1649 -KHRGWRHGLL 1678
             K +GW+ GLL
Sbjct: 572  SKQKGWKQGLL 582


>ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 586

 Score =  176 bits (445), Expect = 4e-41
 Identities = 166/584 (28%), Positives = 252/584 (43%), Gaps = 84/584 (14%)
 Frame = +2

Query: 179  FNTDKPKLSMMEDKIGIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHD 358
            F+T + K +M  ++ GI+  SN Y KE D L    +D  + N H    + L    +D ++
Sbjct: 30   FDTAEEKPTMNGNQNGILSHSNGY-KEADSLGIPVNDFGNTNVHDNKEDPLACDRKDGNE 88

Query: 359  FW-------------NSAVFKSSLLDDS----TRSNDNEPGGSP---------------- 439
            FW             N+ +  S++ DD     ++ N +  GG+P                
Sbjct: 89   FWEVPELDDSIFFDNNNEIKASNVRDDHNVDLSKINGDNRGGNPFACDIPSSETNEIVAA 148

Query: 440  --VDHLNG-------FEIDAESFLFDTRDKNGA-QITEEAAHSVMDGQTANGIEEESKDS 589
               D  NG        +     F  DT+D++    I E  +   +D +     E E+ DS
Sbjct: 149  SVTDDQNGGLSNIIHSKRGGNPFECDTKDRDQPWNIPEYESLGFLDDK-----ENETIDS 203

Query: 590  ETSTVPHT--FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILI 763
            ++    H+  F+S+   ++DK V + ELP+  VCY+E+N   VKDIC+DEGV   +K+LI
Sbjct: 204  DSPFTSHSELFDSNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLI 263

Query: 764  ESSKDEHAGSVVSQPSDEDRYGGT---------------------------TNDPDIEFF 862
            ES KD    + VS  +DE++   T                           T+D +IE  
Sbjct: 264  ESWKDGQPSTSVSVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAVTHDTEIEAT 323

Query: 863  ---VPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKAL 1033
               VP+GF  S   +   D   +   E  D                 + ++  + ++++ 
Sbjct: 324  GAPVPNGFNPSLENNANKDADKDSYLE--DLLMIFGSKCTTNASEKPSSLNTVVRVEESN 381

Query: 1034 LECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1213
            ++ S  D D+ + QPD+VP  E  L+S      S Q+   G       N K   GT  FD
Sbjct: 382  IKTS--DGDQSTLQPDQVPS-EQTLKSQTAVSASGQTNNKG-------NIKEGVGTSIFD 431

Query: 1214 FNSSKPNVSNSIDASAELTTGKAPETEDEKPSDHFASSPAQLVNNEGKIK---ENPSDRK 1384
             N +KP  + + +       G  PE +   P            NN    +    N +D  
Sbjct: 432  VNLTKPESTKTTEGG----VGNLPE-DSHMPKAVSVHKNGNSDNNSASSQVPFANTADNA 486

Query: 1385 LLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXXXXX 1561
              +   S +      +F+        DGE+SFS + GP+SG ITYSGPI+YSG+V     
Sbjct: 487  HQQHLESQNMANGQSHFA--------DGEASFSAARGPISGSITYSGPISYSGSVSLRSE 538

Query: 1562 XXXXXXXXFAFPILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1678
                    FAFP+LQNEWNSSPVRM KA      K +GW+ G+L
Sbjct: 539  SSTTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGIL 582


>emb|CBI27399.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score =  173 bits (438), Expect = 3e-40
 Identities = 142/422 (33%), Positives = 197/422 (46%), Gaps = 12/422 (2%)
 Frame = +2

Query: 449  LNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVP-----HT 613
            L G E DA+    + R  N    T E   S+     AN    E ++S  + V       +
Sbjct: 53   LKGHERDADPLDGEDRFWN----TSERDCSINVDDIANACGNEVRNSVATCVVSSEKLES 108

Query: 614  FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGS 793
            FE D    TDK+V + ELP   VC +ES    VKDIC+DEG+    KIL+E+ K+EH G 
Sbjct: 109  FEKDGDMCTDKSVTKHELP---VCCEESTYHAVKDICIDEGMLSPEKILVENGKEEHEGF 165

Query: 794  VVSQPSDEDRYGGTTNDP-DIEFFVPDGFKVSSPEHNRHDTASEFGAEKID-KXXXXXXX 967
                P D D+    T +  D E  +PDG K S+      D   E   E  D +       
Sbjct: 166  CPFLPPDTDKNVDPTKETADKELPLPDGQKASAENDCGKDLMQE--EENYDARDKIISDT 223

Query: 968  XXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSK 1147
                       +  EL    ++ E S+ +  ++  Q  + P  E+VLE+ A+   +E+S 
Sbjct: 224  SEEKIVPEDIFLIPELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESD 283

Query: 1148 TDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDHFASS 1327
             +   +   YNSK+E GTITFDF SS    + S+D+  E++    P+ +  +P       
Sbjct: 284  KNSFPNELSYNSKLESGTITFDFGSS----TTSMDSGREVS----PQNDGCEP------- 328

Query: 1328 PAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGL 1507
                          P + + L K     E     +   S  ++RG GESSFS +GP S L
Sbjct: 329  --------------PLESQNLSKLEDGSE-----SLPFSGQIQRGLGESSFSAAGPSSAL 369

Query: 1508 ITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHG 1672
            I+YSG I +SGN+             FAFP+LQ EWNSSPVRM KA     RKHR WR G
Sbjct: 370  ISYSGQITHSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRG 429

Query: 1673 LL 1678
            +L
Sbjct: 430  IL 431


>ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum
            tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Solanum
            tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X3 [Solanum
            tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X4 [Solanum
            tuberosum]
          Length = 532

 Score =  172 bits (437), Expect = 4e-40
 Identities = 147/522 (28%), Positives = 228/522 (43%), Gaps = 47/522 (9%)
 Frame = +2

Query: 254  KETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFWN-SAVFKSSLLDDSTRSNDNEPG 430
            K++  L     D L +NG  G  +SL    ++ ++FWN   +  S   +D +RSN +E  
Sbjct: 17   KDSKSLVLPTKDLLDSNGRDGTKDSLACE-KERNEFWNVQELDDSEFFEDISRSNKHEIR 75

Query: 431  GSPV--------DHLNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKD 586
             SP+         +L   + +   F  DT D++       +     D    N  +++ K+
Sbjct: 76   ASPLKDDPIEALSNLTSCKRNGNPFACDTADRDHPW----SIPKFEDPMIVNFFDDKEKE 131

Query: 587  SETSTVPHTFESDLKS-----FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEEN 751
            +  S+   T  S+L       +TDK V+E +LP+  +CY E+N   +KDIC+DEGV   +
Sbjct: 132  TVVSSAQFTSLSELFGTNTHLYTDKGVLEFKLPELTICYNENNYNIMKDICMDEGVPLMD 191

Query: 752  KILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFF---------VPDGFKVSSPEHNR 904
            KI+ ES K     S +S   DE +   T    D E           V +  K+S   H  
Sbjct: 192  KIVTESRKYHQPDSSISLAVDEHQPRNTREGVDSELVSSGESKDSSVENAVKISVDHHTT 251

Query: 905  ------------------HDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSE-ELILQK 1027
                               D  S++  +                      +SE E  +Q 
Sbjct: 252  KEDEDTKSLGPNGINPFLEDNMSKYADKDSSLDVMKIFGSKDTTTAKATNISENESDIQN 311

Query: 1028 ALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTIT 1207
              L+ S  D ++ + Q +++P   +   S      ++ +  +GP SN   NSK E G IT
Sbjct: 312  --LKESNSDAEQSALQANQIPTFVAAFNSQNTVSAADGTNNNGPGSNFSNNSKSESGAIT 369

Query: 1208 FDFNSSKPNVSNSI---DASAELTTGKAPETEDEKPSDHFASSPAQLVNNEGKIKENPSD 1378
             DFN ++  +S+S+   D      + K      +K     + S A  V+    +    S 
Sbjct: 370  CDFNLTELALSSSVAKSDKHLPEQSHKLEAVSSQKDGSSDSFSAATQVHFANSVDSCNSS 429

Query: 1379 RKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVXXXX 1558
                  + +N E  NSG+  +  +    +GE+SF   GP SGLI+YSG I +SGN+    
Sbjct: 430  IHADPPNVANLEEKNSGSIPLGVHGHFANGEASF---GPASGLISYSGHITHSGNISLRS 486

Query: 1559 XXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRGWRHGLL 1678
                     FAFP+LQ+EWNSSPVRM KA  R ++GWR  LL
Sbjct: 487  DSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLL 528


>ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254294 [Solanum
            lycopersicum]
          Length = 532

 Score =  172 bits (437), Expect = 4e-40
 Identities = 153/531 (28%), Positives = 235/531 (44%), Gaps = 56/531 (10%)
 Frame = +2

Query: 254  KETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFWNSAVFKSSL-LDDSTRSNDNEPG 430
            K++  L     D L +NG     +SL    +++++FWN      S+ ++D +RSN  E  
Sbjct: 16   KDSKSLVLPTKDLLDSNGRDSTKDSLACE-KEKNEFWNVQELDDSVFIEDISRSNKLENR 74

Query: 431  GSPV--------DHLNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKD 586
             SP+         HL   + +   F  DT D++      +    ++     N  +++ K+
Sbjct: 75   ASPLKDDPDEAPSHLTSCKRNGNPFACDTADRDHPWSIPKFEDPII----VNFFDDKEKE 130

Query: 587  SETSTVPHT-----FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEEN 751
            +  S+   T     F +D   +TDK V+E ELP+  +CYKE++   +KDIC+DEGV   +
Sbjct: 131  TVVSSTQFTSLSELFGADTHLYTDKGVLEFELPESTICYKENDYNIMKDICMDEGVPLMD 190

Query: 752  KILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEH------NRHDT 913
            KI+ ES K +   S +S  +DE +   T    D E       K SS E       + H T
Sbjct: 191  KIVTESRKYDQPDSSISLAADEHQPRITREGVDSELVSSGESKASSVESAVKISVDHHTT 250

Query: 914  ASEFG--------------------AEKIDKXXXXXXXXXXXXXXXXACVSEELILQKAL 1033
              + G                    AEK                        E       
Sbjct: 251  KEDEGNKSLVPNGINPFLEDNMSKDAEKDPYLDVMKIFGSKDTTMAKPTNISEKESDSQN 310

Query: 1034 LECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1213
             + S  D D+ + Q +++P       S      ++ +   GP SN   NSK + G IT D
Sbjct: 311  FKESNSDADQSAQQANQMPTSVEAFNSQYTVSPADGTNNYGPGSNFSNNSKSKSGAITCD 370

Query: 1214 FNSSKPNVSNSIDAS----------AELTTGKAPETEDEKPSD---HFASSPAQLVNNEG 1354
            FN ++  +S+S+  S           E  +G+   + D   +    HFA+S     +N  
Sbjct: 371  FNLTELALSSSVTKSDKHLPEQSHKLEAVSGQKDGSSDSFSAATQVHFANSVDS--SNSS 428

Query: 1355 KIKENPSD-RKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIA 1531
             I  +P +   L  K++S+  +G  G+F+        +GE+SF   GP SGLI+YSG IA
Sbjct: 429  TIHADPPNVANLEEKNSSSIPLGVHGHFA--------NGEASF---GPASGLISYSGHIA 477

Query: 1532 YSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRGWRHGLL 1678
            +SGN+             FAFP+LQ+EWNSSPVRM KA  R ++GWR  LL
Sbjct: 478  HSGNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLL 528


>ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum
            lycopersicum]
          Length = 554

 Score =  167 bits (423), Expect = 1e-38
 Identities = 166/573 (28%), Positives = 241/573 (42%), Gaps = 88/573 (15%)
 Frame = +2

Query: 224  GIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFW------------- 364
            GI+  SN Y KE D L F  +D  + N H    + L    +D + FW             
Sbjct: 7    GILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWEVPELDDSIFFDN 65

Query: 365  NSAVFKSSLLDD-----STRSNDNEPGGSP----VDHLNGFEIDAES------------- 478
            N  +  S++ D+     ST + DN  GG+P    +      EI A S             
Sbjct: 66   NDEIKASNVRDNHNVDLSTINGDNR-GGNPFACDIPSSETNEIVAASVTDDQTGSLSNII 124

Query: 479  --------FLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKS 634
                    F  DT+D+N      E  +  +D     G E    DS  ++    FE++   
Sbjct: 125  HTKRGGNPFECDTKDRNQPWNIPE--YESLDFLDDKGNETIDSDSPFTSHSELFENNKHF 182

Query: 635  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSD 814
            ++DK V + EL +  VCY+E+N   VKDIC+DEGV   +K+L ES KD+   + VS  +D
Sbjct: 183  YSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSVSVDAD 242

Query: 815  EDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXX 994
            E+    T    D+   +    + SS E  ++  A   GAE I+                 
Sbjct: 243  EEHQSNTKKSVDMGSSIATVSQDSSCEDAKN-IAVTHGAE-IEPTGAPIPNDFNPSLENK 300

Query: 995  A---CVSEELILQKALLECSKC------------------------------DEDKVSPQ 1075
            A      +  +    ++  SKC                              D D+ + Q
Sbjct: 301  ANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIKTSDGDQSTLQ 360

Query: 1076 PDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDA 1255
            PD+VP  +++    A++   E +   G       NSK   GT  FDFN +KP  + + + 
Sbjct: 361  PDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFDFNLTKPESTTTTEG 413

Query: 1256 SAELTTG-----KAPETEDEKPSDHF-ASSPAQLVNNEGKIKENPSDRKLLRKDASNDEI 1417
              E         KA        SD+  ASS     N      +N   + L  ++ +N + 
Sbjct: 414  GVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFANTA----DNAHQQHLESQNMANGQ- 468

Query: 1418 GNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAF 1594
               G+F+        DGE+SFS + GP+SG ITYSGPI+YSG++             FAF
Sbjct: 469  ---GHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSFAF 517

Query: 1595 PILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1678
            P+LQNEWNSSPVRM KA      K +GW+ GLL
Sbjct: 518  PVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLL 550


>gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|508709686|gb|EOY01583.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
          Length = 470

 Score =  149 bits (376), Expect = 4e-33
 Identities = 134/438 (30%), Positives = 189/438 (43%), Gaps = 57/438 (13%)
 Frame = +2

Query: 536  SVMDGQTANGIEEESKDSETSTVPHTFESDLKS----FTDKNVVECELPDFVVCYKESNT 703
            S+     ANG E+E +D  TS  P     D       + DK+V+ECELP+ VVCYKES  
Sbjct: 32   SISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTY 91

Query: 704  PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKV 883
              VKDIC+DEGV  ++K L E+  DE           E      T   + +  + D    
Sbjct: 92   HVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMS 151

Query: 884  SSPEHNRHDTASEFGAEK-------IDKXXXXXXXXXXXXXXXXACVSEELILQKALL-E 1039
                 +  D  +E G+ K       +                   C S++L+L + +  +
Sbjct: 152  PGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGD 211

Query: 1040 CSKCDEDKVSPQPDEVPCLESVLESLAV--AFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1213
              K   D VS +   +  L S+ E   V     S   K+DG    +  +S  +   +   
Sbjct: 212  AMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPP 271

Query: 1214 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDHFASSPAQLVNNE-----GKIK 1363
              S+   V  S D++ E          A E  D    +    SPAQ+  +E       + 
Sbjct: 272  LVSA---VEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVN 328

Query: 1364 ENPSDRKL---------------LRKD-------------ASNDEIGNSGNFSVSSYLER 1459
            E   D KL                 KD              S  ++  + + S+S+ L++
Sbjct: 329  EVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQ 388

Query: 1460 GDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRME 1639
            G GESSFS +G V+GLI+YSGP+AYSG++             FAFPILQ+EWN SPVRM 
Sbjct: 389  GIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMA 448

Query: 1640 KA-----RKHRGWRHGLL 1678
            KA     RKH+GWRHGLL
Sbjct: 449  KADRRHYRKHKGWRHGLL 466


>gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  149 bits (376), Expect = 4e-33
 Identities = 134/438 (30%), Positives = 189/438 (43%), Gaps = 57/438 (13%)
 Frame = +2

Query: 536  SVMDGQTANGIEEESKDSETSTVPHTFESDLKS----FTDKNVVECELPDFVVCYKESNT 703
            S+     ANG E+E +D  TS  P     D       + DK+V+ECELP+ VVCYKES  
Sbjct: 89   SISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTY 148

Query: 704  PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKV 883
              VKDIC+DEGV  ++K L E+  DE           E      T   + +  + D    
Sbjct: 149  HVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMS 208

Query: 884  SSPEHNRHDTASEFGAEK-------IDKXXXXXXXXXXXXXXXXACVSEELILQKALL-E 1039
                 +  D  +E G+ K       +                   C S++L+L + +  +
Sbjct: 209  PGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGD 268

Query: 1040 CSKCDEDKVSPQPDEVPCLESVLESLAV--AFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1213
              K   D VS +   +  L S+ E   V     S   K+DG    +  +S  +   +   
Sbjct: 269  AMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPP 328

Query: 1214 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDHFASSPAQLVNNE-----GKIK 1363
              S+   V  S D++ E          A E  D    +    SPAQ+  +E       + 
Sbjct: 329  LVSA---VEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVN 385

Query: 1364 ENPSDRKL---------------LRKD-------------ASNDEIGNSGNFSVSSYLER 1459
            E   D KL                 KD              S  ++  + + S+S+ L++
Sbjct: 386  EVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQ 445

Query: 1460 GDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRME 1639
            G GESSFS +G V+GLI+YSGP+AYSG++             FAFPILQ+EWN SPVRM 
Sbjct: 446  GIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMA 505

Query: 1640 KA-----RKHRGWRHGLL 1678
            KA     RKH+GWRHGLL
Sbjct: 506  KADRRHYRKHKGWRHGLL 523


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  142 bits (358), Expect = 5e-31
 Identities = 123/413 (29%), Positives = 178/413 (43%), Gaps = 32/413 (7%)
 Frame = +2

Query: 536  SVMDGQTANGIEEESKDSE----TSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNT 703
            S +D  T   +    K+ E    TS    +F+ D   + DKNV+E ELP+ V+CYKE+  
Sbjct: 76   SKLDSCTGVNVSIHDKEEEVRNFTSLKIESFDKDSVFYIDKNVMEPELPELVLCYKENTY 135

Query: 704  PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVP----- 868
              VKDICVDEGV  +   L ++S D+        P  + +        D++         
Sbjct: 136  HVVKDICVDEGVPSQENFLFDTSVDQEKLCPYLIPEKDIKSEIQKERVDLDMSTQYLSKN 195

Query: 869  -DGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALLECS 1045
             + FK  S E        +   E+I                    V+E L   K+LL  +
Sbjct: 196  DNSFKCDSKESMAIAEIEDDAMEEIANYTSKETFSLGELLLMPEVVAE-LSHSKSLLNST 254

Query: 1046 KCDEDKVSPQPDEVPCLESVLESLAVAFTSEQ----SKTDGPVSNTCYNSKVEGGTITFD 1213
               E     +P E   L +        + +EQ    +    P+     + + + GT+T D
Sbjct: 255  DEAEQLSIQRPSENIVLATASACEESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSD 314

Query: 1214 FNSSKPNVSNSIDASAELT-------------TGKAPETEDEKPSDHFASSPAQLVNNEG 1354
             +    +  +     A L                K+P    +  SD  +S+P      EG
Sbjct: 315  SSPKASDHGHDEVILASLAPSYATEEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEG 374

Query: 1355 KIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAY 1534
              +   S+    R  + +++   +  FS    L+   GESSFS +GP+SGLI+YSGPIAY
Sbjct: 375  S-QVGGSEHLESRNSSRHEDTSITEPFS--GQLQYSHGESSFSAAGPLSGLISYSGPIAY 431

Query: 1535 SGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1678
            SG++             FAFPILQ+EWNSSPVRM KA     RKHR WR GLL
Sbjct: 432  SGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLL 484


>ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217989 [Cucumis sativus]
            gi|449523672|ref|XP_004168847.1| PREDICTED:
            uncharacterized protein LOC101224727 [Cucumis sativus]
          Length = 431

 Score =  140 bits (352), Expect = 3e-30
 Identities = 121/418 (28%), Positives = 185/418 (44%), Gaps = 24/418 (5%)
 Frame = +2

Query: 497  DKNGAQITE--EAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELP 670
            D N   IT+   ++  V D   A GI   S    +S +  +F     S+ DK+V+EC++ 
Sbjct: 64   DGNSCMITKINRSSTDVFDDNNAEGI---SAFGASSNMKPSF-----SYVDKSVMECQMS 115

Query: 671  DFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTND-- 844
              +VC +E N   VKDIC+D+GV        +S+ ++    +   P +EDR  G+  +  
Sbjct: 116  KTIVCDQEVNVNDVKDICIDDGVASLENFFFKSTAEKSISKI--SPLEEDRNEGSIKEKE 173

Query: 845  --PDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI 1018
               ++  F+ D  KVS  +H   D  +   A+ + +                  +SE  +
Sbjct: 174  TSSEVSKFIADDRKVSLEDHFAMDWTTHNDAKDLTQIEEEKLN-----------LSEPEL 222

Query: 1019 LQKALLECSKCDE--DKVSPQ-----------PDEVPCLESVLESLAVAFTSEQSKTDGP 1159
            L + L++ S   E  DK+  Q                 ++S  ++ A+   +E  K + P
Sbjct: 223  LMQKLVKRSYSSESLDKIGLQISGEKTNLEDPSSASKSVDSCNDTPALDSAAEPPKDNIP 282

Query: 1160 VSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDHFASSPAQL 1339
               + YN + E G+I   FNS  P           +  G     E    SD    +  Q+
Sbjct: 283  AHPSGYNDEFENGSIALTFNSISP-----------VANGGEERQECCGRSDSVIGT--QV 329

Query: 1340 VNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYS 1519
            + N   ++   SD +LL     +D                  GESSFS   P++ L+TYS
Sbjct: 330  LTN---LEYRTSDSRLLSSQNMHD-----------------IGESSFSAVDPLASLVTYS 369

Query: 1520 GPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1678
            GP+AYSG++             FAFPILQ+EWNSSPV+M KA     RK+RGWR GLL
Sbjct: 370  GPVAYSGSISLRSESSTTSTRSFAFPILQSEWNSSPVKMVKAERRHYRKYRGWREGLL 427


>gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica]
          Length = 499

 Score =  134 bits (337), Expect = 1e-28
 Identities = 134/474 (28%), Positives = 187/474 (39%), Gaps = 104/474 (21%)
 Frame = +2

Query: 569  EEESKD-----SETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDE 733
            E+E KD     + +S      E +   + DK+V+ECELP+ +VCYKES+   +KDIC+DE
Sbjct: 29   EDEVKDFVPPYTLSSEKLEALEKESDYYMDKSVMECELPELIVCYKESSCNTIKDICIDE 88

Query: 734  GVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDT 913
            GV  ++K   E+  DE        P ++          DI   +PDGFK S+     HD 
Sbjct: 89   GVPSQDKNRFETGVDEKECCTFLSPDEDQNKQLLEEQMDIVVTLPDGFKSSA-----HDD 143

Query: 914  ASEFGAEKIDKXXXXXXXXXXXXXXXXA--CVSEELILQKALL----------------- 1036
              +      D                     VS+E+     +L                 
Sbjct: 144  LEKGFVIPCDSKGLTQIGDAIYYTQEKTEIEVSKEIFFPANVLPMQELGAGNAHSSKSSN 203

Query: 1037 -ECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGP------------VSNTCY 1177
             E ++  +D V    ++V  +     +  V+ T E S ++              V     
Sbjct: 204  EESTEAVQDTVQSSGEKVSEIAQTGSTAVVSVTEESSHSEKKALVSAAEESNFHVDELSN 263

Query: 1178 NSKVEGGTITFDFNSSKPNVSNSIDA---------------------------------- 1255
            NSKVE G+ T   + +  +VS + DA                                  
Sbjct: 264  NSKVENGSTTSGLSDTSVHVSTTRDACPDNDVHKHFETQTMPAGDDGDDNDDNMPDAEIV 323

Query: 1256 -------SAELTTGK--APE--------------TEDEKPSDHFASSPAQLVNNEGKI-- 1360
                   SA + TG+   PE               +DE P     SS  Q  +    I  
Sbjct: 324  PSQVQPCSAPVVTGREECPENGVCQPLDTSSTSKVDDEIPHSVIVSSQVQHYSAPVTISR 383

Query: 1361 KENPSDRKLLRKDASND-EIG--NSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIA 1531
            +E P +      + SN   +G  NS     S +++RG GESSFS +G  S L+  SGP  
Sbjct: 384  EERPENGVWQCPETSNAFMVGDVNSDTQYASFHVQRGFGESSFSAAGHFSSLMNTSGP-- 441

Query: 1532 YSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1678
            YSGNV             FAFP+LQ+EWNSSPVRM KA     RKHRGW H LL
Sbjct: 442  YSGNVSLRSESSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHLRKHRGWGHSLL 495


>ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Citrus
            sinensis]
          Length = 483

 Score =  127 bits (318), Expect = 2e-26
 Identities = 132/436 (30%), Positives = 172/436 (39%), Gaps = 88/436 (20%)
 Frame = +2

Query: 635  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVH-----------------------E 745
            + DK+V ECELP+ +VCYKE NT HVKDIC+DEGVH                       +
Sbjct: 85   YMDKSVTECELPELIVCYKE-NTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKED 143

Query: 746  ENKILIESSK------------------DEH----AGSVVSQPSDED----------RYG 829
             N  L+E SK                  DEH     GS     SDED          R  
Sbjct: 144  RNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPA 203

Query: 830  GTTNDPDIE----------FFVPDGFKV----SSPEHNRHDTASEFGAEKIDKXXXXXXX 967
            G   D   E          F + D   +    +    ++    +E  AEK          
Sbjct: 204  GDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKA 263

Query: 968  XXXXXXXXXACVSEELILQKALLECSK-----CDEDKVSPQPDEVPCLESVLE-----SL 1117
                        +EE++     +  S+     C E  +S  P  V   E   +     SL
Sbjct: 264  ALANPEEANGGTAEEILTGADFVSASEESQNGCGEG-ISGNPTLVSASEKAHDKSEEASL 322

Query: 1118 A----VAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAP 1285
            A    V+  SE +K       + YNS VE G+ITFDF++S P  S               
Sbjct: 323  ASPDGVSALSESTKIS-TAEKSSYNSMVETGSITFDFDASAPGASGK------------- 368

Query: 1286 ETEDEKPSDHFASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGD 1465
                          P Q+   + +  E P   +L  +DA           SVSS    G 
Sbjct: 369  ------------EEPLQI--GDSQRIETPGMSRL--EDAPRQ--------SVSSQFHSGL 404

Query: 1466 GESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA 1645
            GESSFS +G +  LI+YSGP+AYSG++             FAFPILQ EW+ SPVRM KA
Sbjct: 405  GESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKA 464

Query: 1646 -----RKHRGWRHGLL 1678
                 RKH+ W+ GLL
Sbjct: 465  DRRHYRKHK-WKQGLL 479


>ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Citrus
            sinensis] gi|568861537|ref|XP_006484257.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Citrus
            sinensis]
          Length = 496

 Score =  127 bits (318), Expect = 2e-26
 Identities = 132/436 (30%), Positives = 172/436 (39%), Gaps = 88/436 (20%)
 Frame = +2

Query: 635  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVH-----------------------E 745
            + DK+V ECELP+ +VCYKE NT HVKDIC+DEGVH                       +
Sbjct: 98   YMDKSVTECELPELIVCYKE-NTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKED 156

Query: 746  ENKILIESSK------------------DEH----AGSVVSQPSDED----------RYG 829
             N  L+E SK                  DEH     GS     SDED          R  
Sbjct: 157  RNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPA 216

Query: 830  GTTNDPDIE----------FFVPDGFKV----SSPEHNRHDTASEFGAEKIDKXXXXXXX 967
            G   D   E          F + D   +    +    ++    +E  AEK          
Sbjct: 217  GDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKA 276

Query: 968  XXXXXXXXXACVSEELILQKALLECSK-----CDEDKVSPQPDEVPCLESVLE-----SL 1117
                        +EE++     +  S+     C E  +S  P  V   E   +     SL
Sbjct: 277  ALANPEEANGGTAEEILTGADFVSASEESQNGCGEG-ISGNPTLVSASEKAHDKSEEASL 335

Query: 1118 A----VAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAP 1285
            A    V+  SE +K       + YNS VE G+ITFDF++S P  S               
Sbjct: 336  ASPDGVSALSESTKIS-TAEKSSYNSMVETGSITFDFDASAPGASGK------------- 381

Query: 1286 ETEDEKPSDHFASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGD 1465
                          P Q+   + +  E P   +L  +DA           SVSS    G 
Sbjct: 382  ------------EEPLQI--GDSQRIETPGMSRL--EDAPRQ--------SVSSQFHSGL 417

Query: 1466 GESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA 1645
            GESSFS +G +  LI+YSGP+AYSG++             FAFPILQ EW+ SPVRM KA
Sbjct: 418  GESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKA 477

Query: 1646 -----RKHRGWRHGLL 1678
                 RKH+ W+ GLL
Sbjct: 478  DRRHYRKHK-WKQGLL 492


>ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa]
            gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly
            protein gar2 [Populus trichocarpa]
          Length = 486

 Score =  125 bits (313), Expect = 8e-26
 Identities = 122/421 (28%), Positives = 184/421 (43%), Gaps = 73/421 (17%)
 Frame = +2

Query: 635  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIES------------SKD 778
            + DK+V+  E+P+ +VCYKE NT HVKDICVDEGV  ++K L ++            S+ 
Sbjct: 74   YMDKSVMVREVPELIVCYKE-NTYHVKDICVDEGVPLQDKFLFDTDAHKKNMCEFLPSER 132

Query: 779  EHAGSVVSQPSDED-------RYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTA------- 916
            +    +V + SD D       +      + D+   VPD    S  + ++HD +       
Sbjct: 133  DMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDLHLPVPDVLISSEEKGSKHDLSLDCDPKH 192

Query: 917  -------SEFGAEKI----DKXXXXXXXXXXXXXXXXAC------------VSEELIL-- 1021
                    ++G +K+     K                 C            V ++ +L  
Sbjct: 193  LMPTEEVMDYGTKKVTDNASKEILSLRDLLSMSELGAKCTPANASYHNMDKVEQQSLLCP 252

Query: 1022 -QKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAF-TSEQSKTDGPVSNTCYNSKVEG 1195
             + A+LE     E+  S    E    ++ LES  +A  T + +  +G   +T   + +  
Sbjct: 253  RENAILETDSASEE--SEHCGEETISDNGLESATLAIPTQDPAYQEGDHGHT--EAVLVS 308

Query: 1196 GTITFDFNSSKPN----VSNSIDASAELTTGKAPETEDEKPS-----------DHFASSP 1330
             T+T     S        S+++D+ +E   G     EDE P            D+ +S+P
Sbjct: 309  PTLTSAAEESDSKETKLASHALDSFSE---GSTSRIEDELPYNSKTETRSISFDNDSSAP 365

Query: 1331 AQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLI 1510
            A          +N   ++L  +  S  E  N+   S    L+  DGESSFS+SGP+ GL 
Sbjct: 366  AASARES---PQNGESQRLGTRIVSRFEDPNAERLS-GGQLQYADGESSFSSSGPLFGLT 421

Query: 1511 TYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGL 1675
            ++SGPIAYSG+V             FAFPILQ+EWNSSP RM KA     +K R W  GL
Sbjct: 422  SHSGPIAYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPARMAKADRRHFQKPRKWMQGL 481

Query: 1676 L 1678
            L
Sbjct: 482  L 482


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  124 bits (311), Expect = 1e-25
 Identities = 114/393 (29%), Positives = 167/393 (42%), Gaps = 15/393 (3%)
 Frame = +2

Query: 545  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 724
            D +    + + S D + + V    + D   + DKNV  C+LP+ VVCYKE+    VKDIC
Sbjct: 61   DNEAGKKVRDISHDCDAN-VDSPDKKDPVFYMDKNVTACDLPEIVVCYKENTYHVVKDIC 119

Query: 725  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNR 904
            VDEGV  + K L    KD    SV S  +++      TN        P   K +   + +
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLTKADKTN------VNPSESKSAEDSNTK 168

Query: 905  HDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQ--- 1075
             D +      K D+                +  ++E ++          +E K SP    
Sbjct: 169  VDDSEFCNNCKTDRDVEESSREDFADAEGSSAYNQEHLIVT--------EEAKASPSHGL 220

Query: 1076 -PDEVPCLESVLESLAVAFT--SEQSKTDGPV-SNTCYNSKVEGGTITFDFNSSKPNVSN 1243
             P E+   E+  + +A++    S++S T G + S       +  G I+ D +  +     
Sbjct: 221  NPSEIEPDENSNDEVAISSETDSKESLTLGDILSREDEQKSLNHGNISSDSHEEQSPSQL 280

Query: 1244 SIDASAELTTG----KAPETEDEKP-SDHFASSPAQLVNNEGKIKENPSDRKLLRKDASN 1408
                   L T     +  +TE+ KP  +   S+    +    K   +P   +       N
Sbjct: 281  QDKEKRSLETAAIETELEKTEEPKPVEEKLPSASTTTLQEPNKTCNDPEKPETENHHQQN 340

Query: 1409 DEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXF 1588
              + NS      S    G+   S + S  +SG ITYSGPIAYSG++             F
Sbjct: 341  SLVENSYEDDKLSSSRFGETSFSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSF 400

Query: 1589 AFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1678
            AFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 401  AFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 433


>ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella]
            gi|482559818|gb|EOA24009.1| hypothetical protein
            CARUB_v10017222mg [Capsella rubella]
          Length = 455

 Score =  122 bits (306), Expect = 5e-25
 Identities = 118/419 (28%), Positives = 178/419 (42%), Gaps = 21/419 (5%)
 Frame = +2

Query: 485  FDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKS------FTDK 646
            +DTR  +G +  +E   ++++  +    +E  K +  ++     + D         + DK
Sbjct: 52   YDTR--SGDEWDKENDGNILEPHSCGDADEAGKKTRDTSHDFVAKGDSPEKVNPVFYMDK 109

Query: 647  NVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILI-------ESSKDEHAGSVVSQ 805
            NV  C+LP+ VVCYKE++   VKDICVDEGV  + K L         ++   H GSV   
Sbjct: 110  NVTACDLPEIVVCYKENSYHVVKDICVDEGVPVQEKFLFGEKDSVKSTTNSNHCGSVDLM 169

Query: 806  PSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXX 985
              D+          D++   P   K     +++ D +SE   +K  +             
Sbjct: 170  KVDK---------TDVK---PSETKSLEDSNSKVDDSSEVCNDKTVQDVEESSREAFADA 217

Query: 986  XXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVS 1165
               +   +E ++  +     K  E  +  + +E+   E V+ S    F SE       +S
Sbjct: 218  EGSSNYDQEHLIVTSPTLALKPSEISLEVESEEISKDEVVISS--EDFLSESLTLGDILS 275

Query: 1166 NTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTG---KAPETEDEKPSDHFASSPAQ 1336
                   ++          S P        S E TTG   K  + E+ K ++   SS + 
Sbjct: 276  REDKQKSLKNDNGNRPEELSPPQHQEKEKRSLE-TTGLDTKLEKVEEPKTAEENLSSAST 334

Query: 1337 LVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFST--SGPVSGLI 1510
                E     N  ++        N  + +  +  +SS      GE+SFS   S  +SG I
Sbjct: 335  TTVQEPNKSCNDLEKPETENHQQNRLVNSYEDDKLSS---SRFGETSFSAAESVSISGHI 391

Query: 1511 TYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1678
            TYSGPIAYSG++             FAFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 392  TYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 450


>gb|ESW25465.1| hypothetical protein PHAVU_003G038300g [Phaseolus vulgaris]
          Length = 430

 Score =  119 bits (298), Expect = 5e-24
 Identities = 122/424 (28%), Positives = 182/424 (42%), Gaps = 19/424 (4%)
 Frame = +2

Query: 464  IDAESFLFDTRDKNGAQITEEAAHSVMDGQT----ANGIEEESKDSETSTVP------HT 613
            +D E+  ++T+ ++   + E  +HS  D ++     NGIE   K S TS +        +
Sbjct: 70   VDCETNEYETKVRD---LVEPLSHSSKDIESFMKFPNGIESV-KRSPTSPISSPREGVES 125

Query: 614  FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGS 793
              + +  F  K V ECE P   VCY ESN   VKDIC+DEGV +++ I++ +  DE A  
Sbjct: 126  LRNSVDVFMVKTVTECE-PHPEVCYNESNYHVVKDICIDEGVLKKDNIMVVNPVDEKAHD 184

Query: 794  VVSQPSDE--DRYGGTTNDPDIEFFVPDGFKVSSPEHNRH-DTASEFGAEKIDKXXXXXX 964
                 S E  ++    T+   +     +G       HN+H D      +  ++K      
Sbjct: 185  FFPFESYETKEKQKDNTSINVLSLTPTEGSDKVFANHNQHKDLMLTEVSGDVNKQTPSP- 243

Query: 965  XXXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQS 1144
                          ++++LQ  L E S   +DK   Q    P L S+ E  ++A   ++S
Sbjct: 244  -------------GDKVLLQDLLTEDSASSDDK-GEQISIEPGLHSISEDPSMAAGEDES 289

Query: 1145 KTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDHFAS 1324
            K D                       SK   +  +D SA    GK    E+ + S    S
Sbjct: 290  KND-----------------------SKAPENAKVDPSAPADCGK----EECRQS---GS 319

Query: 1325 SPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSG 1504
                 + +  +  E  SD +                 + +S++    GESSFS  GPVSG
Sbjct: 320  CKCDEIQHTSRPMEWKSDDQ-----------------AATSHIRHSLGESSFSAMGPVSG 362

Query: 1505 LITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRG----WR 1666
             I+YSGP+ +SG++             FAFPI+Q+EWNSSPVRM KA  R HR     WR
Sbjct: 363  RISYSGPVPFSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRRHHRKQRCCWR 422

Query: 1667 HGLL 1678
             G L
Sbjct: 423  GGFL 426


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  119 bits (297), Expect = 6e-24
 Identities = 120/394 (30%), Positives = 170/394 (43%), Gaps = 16/394 (4%)
 Frame = +2

Query: 545  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 724
            + +    + + S D + + V    + D   + DKNV  C+LP+ VVCYKE+    VKDIC
Sbjct: 61   ENEAGKKVRDTSHDCDAN-VDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 725  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTN-DPDIEFFVPDGF-KVSSPEH 898
            VDEGV  + K L    KD    SV S  +++      TN +P       D   KV   E 
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEF 174

Query: 899  -NRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI------LQKALLECSKCDE 1057
             N H T  +   E   +                  V+EE+       L  + +E  +  +
Sbjct: 175  CNDHKTDRDV-EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSK 233

Query: 1058 DKV--SPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKP 1231
            D+V  S   D   CL     +L    + E  +      N   +S  E           + 
Sbjct: 234  DEVAISQDNDSKECL-----TLGDILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEKRS 288

Query: 1232 NVSNSIDASAELTTGKAPETEDEKPSDHFASSPAQLVNNEGKIKENPSDRKLLRKDASND 1411
              + +I+   E T  + P+  +EK S   +++ +Q  N      E P      +++   +
Sbjct: 289  LETTAIETELEKT--EEPKQGEEKLSS-VSTTTSQEPNKTCNEPEKPETENHHQQNCLVE 345

Query: 1412 EIGNSGNFSVSSYLERGDGESSFSTSGPVS--GLITYSGPIAYSGNVXXXXXXXXXXXXX 1585
                   FS S +     GE+SFS +  VS  G ITYSGPIAYSG++             
Sbjct: 346  NSYEDDKFSSSRF-----GETSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRS 400

Query: 1586 FAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1678
            FAFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 401  FAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 434


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score =  119 bits (297), Expect = 6e-24
 Identities = 120/394 (30%), Positives = 170/394 (43%), Gaps = 16/394 (4%)
 Frame = +2

Query: 545  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 724
            + +    + + S D + + V    + D   + DKNV  C+LP+ VVCYKE+    VKDIC
Sbjct: 61   ENEAGKKVRDTSHDCDAN-VDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 725  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTN-DPDIEFFVPDGF-KVSSPEH 898
            VDEGV  + K L    KD    SV S  +++      TN +P       D   KV   E 
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEF 174

Query: 899  -NRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI------LQKALLECSKCDE 1057
             N H T  +   E   +                  V+EE+       L  + +E  +  +
Sbjct: 175  CNDHKTDRDV-EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSK 233

Query: 1058 DKV--SPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKP 1231
            D+V  S   D   CL     +L    + E  +      N   +S  E           + 
Sbjct: 234  DEVAISQDNDSKECL-----TLGDILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEKRS 288

Query: 1232 NVSNSIDASAELTTGKAPETEDEKPSDHFASSPAQLVNNEGKIKENPSDRKLLRKDASND 1411
              + +I+   E T  + P+  +EK S   +++ +Q  N      E P      +++   +
Sbjct: 289  LETTAIETELEKT--EEPKQGEEKLSS-VSTTTSQEPNKTCNEPEKPETENHHQQNCLVE 345

Query: 1412 EIGNSGNFSVSSYLERGDGESSFSTSGPVS--GLITYSGPIAYSGNVXXXXXXXXXXXXX 1585
                   FS S +     GE+SFS +  VS  G ITYSGPIAYSG++             
Sbjct: 346  NSYEDDKFSSSRF-----GETSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRS 400

Query: 1586 FAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1678
            FAFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 401  FAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 434


>ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247891 [Vitis vinifera]
          Length = 229

 Score =  118 bits (295), Expect = 1e-23
 Identities = 82/228 (35%), Positives = 116/228 (50%), Gaps = 5/228 (2%)
 Frame = +2

Query: 1010 ELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKV 1189
            EL    ++ E S+ +  ++  Q  + P  E+VLE+ A+   +E+S  +   +   YNSK+
Sbjct: 32   ELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESDKNSFPNELSYNSKL 91

Query: 1190 EGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDHFASSPAQLVNNEGKIKEN 1369
            E GTITFDF SS    + S+D+  E++    P+ +  +P                     
Sbjct: 92   ESGTITFDFGSS----TTSMDSGREVS----PQNDGCEP--------------------- 122

Query: 1370 PSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVX 1549
            P + + L K     E     +   S  ++RG GESSFS +GP S LI+YSG I +SGN+ 
Sbjct: 123  PLESQNLSKLEDGSE-----SLPFSGQIQRGLGESSFSAAGPSSALISYSGQITHSGNIS 177

Query: 1550 XXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1678
                        FAFP+LQ EWNSSPVRM KA     RKHR WR G+L
Sbjct: 178  LRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRGIL 225

