BLASTX nr result

ID: Catharanthus23_contig00004536 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00004536
         (2121 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   171   8e-43
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   175   6e-41
emb|CBI27399.3| unnamed protein product [Vitis vinifera]              173   3e-40
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   172   4e-40
ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254...   171   1e-39
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   167   2e-38
gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, ...   149   4e-33
gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, ...   149   4e-33
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   142   4e-31
ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217...   140   3e-30
gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus pe...   134   2e-28
ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like i...   127   3e-26
ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like i...   127   3e-26
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   125   7e-26
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...   124   1e-25
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   122   6e-25
gb|ESW25465.1| hypothetical protein PHAVU_003G038300g [Phaseolus...   119   5e-24
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   119   7e-24
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   119   7e-24
ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247...   118   1e-23

>ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum
            lycopersicum]
          Length = 586

 Score =  171 bits (434), Expect(2) = 8e-43
 Identities = 170/587 (28%), Positives = 248/587 (42%), Gaps = 88/587 (14%)
 Frame = +1

Query: 337  DTDKPKLSMMEDKIGIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDF 516
            DT + K +M  ++ GI+  SN Y KE D L F  +D  + N H    + L    +D + F
Sbjct: 25   DTAEEKPTMNGNQNGILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKF 83

Query: 517  W-------------NSAVFKSSLLDD-----STRSNDNEPGGSP----VDHLNGFEIDAE 630
            W             N  +  S++ D+     ST + DN  GG+P    +      EI A 
Sbjct: 84   WEVPELDDSIFFDNNDEIKASNVRDNHNVDLSTINGDNR-GGNPFACDIPSSETNEIVAA 142

Query: 631  S---------------------FLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSE 747
            S                     F  DT+D+N      E  +  +D     G E    DS 
Sbjct: 143  SVTDDQTGSLSNIIHTKRGGNPFECDTKDRNQPWNIPE--YESLDFLDDKGNETIDSDSP 200

Query: 748  TSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESS 927
             ++    FE++   ++DK V + EL +  VCY+E+N   VKDIC+DEGV   +K+L ES 
Sbjct: 201  FTSHSELFENNKHFYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESW 260

Query: 928  KDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXX 1107
            KD+   + VS  +DE+    T    D+   +    + SS E  ++  A   GAE I+   
Sbjct: 261  KDDQLSTSVSVDADEEHQSNTKKSVDMGSSIATVSQDSSCEDAKN-IAVTHGAE-IEPTG 318

Query: 1108 XXXXXXXXXXXXXXA---CVSEELILQKALLECSKC------------------------ 1206
                          A      +  +    ++  SKC                        
Sbjct: 319  APIPNDFNPSLENKANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEE 378

Query: 1207 ------DEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368
                  D D+ + QPD+VP  +++    A++   E +   G       NSK   GT  FD
Sbjct: 379  SNIKTSDGDQSTLQPDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFD 431

Query: 1369 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDRF-ASSPAQLVNNEGKIKENPS 1530
            FN +KP  + + +   E         KA        SD   ASS     N      +N  
Sbjct: 432  FNLTKPESTTTTEGGVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFANTA----DNAH 487

Query: 1531 DRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXX 1707
             + L  ++ +N +    G+F+        DGE+SFS + GP+SG ITYSGPI+YSG++  
Sbjct: 488  QQHLESQNMANGQ----GHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSL 535

Query: 1708 XXXXXXXXXXXFAFPILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1833
                       FAFP+LQNEWNSSPVRM KA      K +GW+ GLL
Sbjct: 536  RSESSTTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLL 582



 Score = 31.6 bits (70), Expect(2) = 8e-43
 Identities = 13/21 (61%), Positives = 17/21 (80%)
 Frame = +3

Query: 273 MFASQLLRILQTIPTDVDHET 335
           MFASQLLR+L+T+P D   E+
Sbjct: 1   MFASQLLRLLETLPADTSSES 21


>ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 586

 Score =  175 bits (444), Expect = 6e-41
 Identities = 166/583 (28%), Positives = 251/583 (43%), Gaps = 84/583 (14%)
 Frame = +1

Query: 337  DTDKPKLSMMEDKIGIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDF 516
            DT + K +M  ++ GI+  SN Y KE D L    +D  + N H    + L    +D ++F
Sbjct: 31   DTAEEKPTMNGNQNGILSHSNGY-KEADSLGIPVNDFGNTNVHDNKEDPLACDRKDGNEF 89

Query: 517  W-------------NSAVFKSSLLDDS----TRSNDNEPGGSP----------------- 594
            W             N+ +  S++ DD     ++ N +  GG+P                 
Sbjct: 90   WEVPELDDSIFFDNNNEIKASNVRDDHNVDLSKINGDNRGGNPFACDIPSSETNEIVAAS 149

Query: 595  -VDHLNG-------FEIDAESFLFDTRDKNGA-QITEEAAHSVMDGQTANGIEEESKDSE 747
              D  NG        +     F  DT+D++    I E  +   +D +     E E+ DS+
Sbjct: 150  VTDDQNGGLSNIIHSKRGGNPFECDTKDRDQPWNIPEYESLGFLDDK-----ENETIDSD 204

Query: 748  TSTVPHT--FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIE 921
            +    H+  F+S+   ++DK V + ELP+  VCY+E+N   VKDIC+DEGV   +K+LIE
Sbjct: 205  SPFTSHSELFDSNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLIE 264

Query: 922  SSKDEHAGSVVSQPSDEDRYGGT---------------------------TNDPDIEFF- 1017
            S KD    + VS  +DE++   T                           T+D +IE   
Sbjct: 265  SWKDGQPSTSVSVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAVTHDTEIEATG 324

Query: 1018 --VPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALL 1191
              VP+GF  S   +   D   +   E  D                 + ++  + ++++ +
Sbjct: 325  APVPNGFNPSLENNANKDADKDSYLE--DLLMIFGSKCTTNASEKPSSLNTVVRVEESNI 382

Query: 1192 ECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDF 1371
            + S  D D+ + QPD+VP  E  L+S      S Q+   G       N K   GT  FD 
Sbjct: 383  KTS--DGDQSTLQPDQVPS-EQTLKSQTAVSASGQTNNKG-------NIKEGVGTSIFDV 432

Query: 1372 NSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIK---ENPSDRKL 1542
            N +KP  + + +       G  PE +   P            NN    +    N +D   
Sbjct: 433  NLTKPESTKTTEGG----VGNLPE-DSHMPKAVSVHKNGNSDNNSASSQVPFANTADNAH 487

Query: 1543 LRKDASNDEIGNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXXXXXX 1719
             +   S +      +F+        DGE+SFS + GP+SG ITYSGPI+YSG+V      
Sbjct: 488  QQHLESQNMANGQSHFA--------DGEASFSAARGPISGSITYSGPISYSGSVSLRSES 539

Query: 1720 XXXXXXXFAFPILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1833
                   FAFP+LQNEWNSSPVRM KA      K +GW+ G+L
Sbjct: 540  STTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGIL 582


>emb|CBI27399.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score =  173 bits (438), Expect = 3e-40
 Identities = 142/422 (33%), Positives = 197/422 (46%), Gaps = 12/422 (2%)
 Frame = +1

Query: 604  LNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVP-----HT 768
            L G E DA+    + R  N    T E   S+     AN    E ++S  + V       +
Sbjct: 53   LKGHERDADPLDGEDRFWN----TSERDCSINVDDIANACGNEVRNSVATCVVSSEKLES 108

Query: 769  FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGS 948
            FE D    TDK+V + ELP   VC +ES    VKDIC+DEG+    KIL+E+ K+EH G 
Sbjct: 109  FEKDGDMCTDKSVTKHELP---VCCEESTYHAVKDICIDEGMLSPEKILVENGKEEHEGF 165

Query: 949  VVSQPSDEDRYGGTTNDP-DIEFFVPDGFKVSSPEHNRHDTASEFGAEKID-KXXXXXXX 1122
                P D D+    T +  D E  +PDG K S+      D   E   E  D +       
Sbjct: 166  CPFLPPDTDKNVDPTKETADKELPLPDGQKASAENDCGKDLMQE--EENYDARDKIISDT 223

Query: 1123 XXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSK 1302
                       +  EL    ++ E S+ +  ++  Q  + P  E+VLE+ A+   +E+S 
Sbjct: 224  SEEKIVPEDIFLIPELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESD 283

Query: 1303 TDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASS 1482
             +   +   YNSK+E GTITFDF SS    + S+D+  E++    P+ +  +P       
Sbjct: 284  KNSFPNELSYNSKLESGTITFDFGSS----TTSMDSGREVS----PQNDGCEP------- 328

Query: 1483 PAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGL 1662
                          P + + L K     E     +   S  ++RG GESSFS +GP S L
Sbjct: 329  --------------PLESQNLSKLEDGSE-----SLPFSGQIQRGLGESSFSAAGPSSAL 369

Query: 1663 ITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHG 1827
            I+YSG I +SGN+             FAFP+LQ EWNSSPVRM KA     RKHR WR G
Sbjct: 370  ISYSGQITHSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRG 429

Query: 1828 LL 1833
            +L
Sbjct: 430  IL 431


>ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum
            tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Solanum
            tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X3 [Solanum
            tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X4 [Solanum
            tuberosum]
          Length = 532

 Score =  172 bits (437), Expect = 4e-40
 Identities = 147/522 (28%), Positives = 228/522 (43%), Gaps = 47/522 (9%)
 Frame = +1

Query: 409  KETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFWN-SAVFKSSLLDDSTRSNDNEPG 585
            K++  L     D L +NG  G  +SL    ++ ++FWN   +  S   +D +RSN +E  
Sbjct: 17   KDSKSLVLPTKDLLDSNGRDGTKDSLACE-KERNEFWNVQELDDSEFFEDISRSNKHEIR 75

Query: 586  GSPV--------DHLNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKD 741
             SP+         +L   + +   F  DT D++       +     D    N  +++ K+
Sbjct: 76   ASPLKDDPIEALSNLTSCKRNGNPFACDTADRDHPW----SIPKFEDPMIVNFFDDKEKE 131

Query: 742  SETSTVPHTFESDLKS-----FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEEN 906
            +  S+   T  S+L       +TDK V+E +LP+  +CY E+N   +KDIC+DEGV   +
Sbjct: 132  TVVSSAQFTSLSELFGTNTHLYTDKGVLEFKLPELTICYNENNYNIMKDICMDEGVPLMD 191

Query: 907  KILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFF---------VPDGFKVSSPEHNR 1059
            KI+ ES K     S +S   DE +   T    D E           V +  K+S   H  
Sbjct: 192  KIVTESRKYHQPDSSISLAVDEHQPRNTREGVDSELVSSGESKDSSVENAVKISVDHHTT 251

Query: 1060 ------------------HDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSE-ELILQK 1182
                               D  S++  +                      +SE E  +Q 
Sbjct: 252  KEDEDTKSLGPNGINPFLEDNMSKYADKDSSLDVMKIFGSKDTTTAKATNISENESDIQN 311

Query: 1183 ALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTIT 1362
              L+ S  D ++ + Q +++P   +   S      ++ +  +GP SN   NSK E G IT
Sbjct: 312  --LKESNSDAEQSALQANQIPTFVAAFNSQNTVSAADGTNNNGPGSNFSNNSKSESGAIT 369

Query: 1363 FDFNSSKPNVSNSI---DASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKENPSD 1533
             DFN ++  +S+S+   D      + K      +K     + S A  V+    +    S 
Sbjct: 370  CDFNLTELALSSSVAKSDKHLPEQSHKLEAVSSQKDGSSDSFSAATQVHFANSVDSCNSS 429

Query: 1534 RKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVXXXX 1713
                  + +N E  NSG+  +  +    +GE+SF   GP SGLI+YSG I +SGN+    
Sbjct: 430  IHADPPNVANLEEKNSGSIPLGVHGHFANGEASF---GPASGLISYSGHITHSGNISLRS 486

Query: 1714 XXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRGWRHGLL 1833
                     FAFP+LQ+EWNSSPVRM KA  R ++GWR  LL
Sbjct: 487  DSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLL 528


>ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254294 [Solanum
            lycopersicum]
          Length = 532

 Score =  171 bits (433), Expect = 1e-39
 Identities = 152/529 (28%), Positives = 235/529 (44%), Gaps = 54/529 (10%)
 Frame = +1

Query: 409  KETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFWNSAVFKSSL-LDDSTRSNDNEPG 585
            K++  L     D L +NG     +SL    +++++FWN      S+ ++D +RSN  E  
Sbjct: 16   KDSKSLVLPTKDLLDSNGRDSTKDSLACE-KEKNEFWNVQELDDSVFIEDISRSNKLENR 74

Query: 586  GSPV--------DHLNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKD 741
             SP+         HL   + +   F  DT D++      +    ++     N  +++ K+
Sbjct: 75   ASPLKDDPDEAPSHLTSCKRNGNPFACDTADRDHPWSIPKFEDPII----VNFFDDKEKE 130

Query: 742  SETSTVPHT-----FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEEN 906
            +  S+   T     F +D   +TDK V+E ELP+  +CYKE++   +KDIC+DEGV   +
Sbjct: 131  TVVSSTQFTSLSELFGADTHLYTDKGVLEFELPESTICYKENDYNIMKDICMDEGVPLMD 190

Query: 907  KILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEH------NRHDT 1068
            KI+ ES K +   S +S  +DE +   T    D E       K SS E       + H T
Sbjct: 191  KIVTESRKYDQPDSSISLAADEHQPRITREGVDSELVSSGESKASSVESAVKISVDHHTT 250

Query: 1069 ASEFG--------------------AEKIDKXXXXXXXXXXXXXXXXACVSEELILQKAL 1188
              + G                    AEK                        E       
Sbjct: 251  KEDEGNKSLVPNGINPFLEDNMSKDAEKDPYLDVMKIFGSKDTTMAKPTNISEKESDSQN 310

Query: 1189 LECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368
             + S  D D+ + Q +++P       S      ++ +   GP SN   NSK + G IT D
Sbjct: 311  FKESNSDADQSAQQANQMPTSVEAFNSQYTVSPADGTNNYGPGSNFSNNSKSKSGAITCD 370

Query: 1369 FNSSKPNVSNSIDASAELTTGKAPETE-----DEKPSDRF-ASSPAQLVN-----NEGKI 1515
            FN ++  +S+S+  S +    ++ + E      +  SD F A++     N     N   I
Sbjct: 371  FNLTELALSSSVTKSDKHLPEQSHKLEAVSGQKDGSSDSFSAATQVHFANSVDSSNSSTI 430

Query: 1516 KENPSD-RKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYS 1692
              +P +   L  K++S+  +G  G+F+        +GE+SF   GP SGLI+YSG IA+S
Sbjct: 431  HADPPNVANLEEKNSSSIPLGVHGHFA--------NGEASF---GPASGLISYSGHIAHS 479

Query: 1693 GNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRGWRHGLL 1833
            GN+             FAFP+LQ+EWNSSPVRM KA  R ++GWR  LL
Sbjct: 480  GNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLL 528


>ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum
            lycopersicum]
          Length = 554

 Score =  167 bits (422), Expect = 2e-38
 Identities = 166/573 (28%), Positives = 240/573 (41%), Gaps = 88/573 (15%)
 Frame = +1

Query: 379  GIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFW------------- 519
            GI+  SN Y KE D L F  +D  + N H    + L    +D + FW             
Sbjct: 7    GILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWEVPELDDSIFFDN 65

Query: 520  NSAVFKSSLLDD-----STRSNDNEPGGSP----VDHLNGFEIDAES------------- 633
            N  +  S++ D+     ST + DN  GG+P    +      EI A S             
Sbjct: 66   NDEIKASNVRDNHNVDLSTINGDNR-GGNPFACDIPSSETNEIVAASVTDDQTGSLSNII 124

Query: 634  --------FLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKS 789
                    F  DT+D+N      E  +  +D     G E    DS  ++    FE++   
Sbjct: 125  HTKRGGNPFECDTKDRNQPWNIPE--YESLDFLDDKGNETIDSDSPFTSHSELFENNKHF 182

Query: 790  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSD 969
            ++DK V + EL +  VCY+E+N   VKDIC+DEGV   +K+L ES KD+   + VS  +D
Sbjct: 183  YSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSVSVDAD 242

Query: 970  EDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXX 1149
            E+    T    D+   +    + SS E  ++  A   GAE I+                 
Sbjct: 243  EEHQSNTKKSVDMGSSIATVSQDSSCEDAKN-IAVTHGAE-IEPTGAPIPNDFNPSLENK 300

Query: 1150 A---CVSEELILQKALLECSKC------------------------------DEDKVSPQ 1230
            A      +  +    ++  SKC                              D D+ + Q
Sbjct: 301  ANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIKTSDGDQSTLQ 360

Query: 1231 PDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDA 1410
            PD+VP  +++    A++   E +   G       NSK   GT  FDFN +KP  + + + 
Sbjct: 361  PDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFDFNLTKPESTTTTEG 413

Query: 1411 SAELTTG-----KAPETEDEKPSDRF-ASSPAQLVNNEGKIKENPSDRKLLRKDASNDEI 1572
              E         KA        SD   ASS     N      +N   + L  ++ +N + 
Sbjct: 414  GVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFANTA----DNAHQQHLESQNMANGQ- 468

Query: 1573 GNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAF 1749
               G+F+        DGE+SFS + GP+SG ITYSGPI+YSG++             FAF
Sbjct: 469  ---GHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSFAF 517

Query: 1750 PILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1833
            P+LQNEWNSSPVRM KA      K +GW+ GLL
Sbjct: 518  PVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLL 550


>gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|508709686|gb|EOY01583.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
          Length = 470

 Score =  149 bits (377), Expect = 4e-33
 Identities = 134/438 (30%), Positives = 189/438 (43%), Gaps = 57/438 (13%)
 Frame = +1

Query: 691  SVMDGQTANGIEEESKDSETSTVPHTFESDLKS----FTDKNVVECELPDFVVCYKESNT 858
            S+     ANG E+E +D  TS  P     D       + DK+V+ECELP+ VVCYKES  
Sbjct: 32   SISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTY 91

Query: 859  PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKV 1038
              VKDIC+DEGV  ++K L E+  DE           E      T   + +  + D    
Sbjct: 92   HVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMS 151

Query: 1039 SSPEHNRHDTASEFGAEK-------IDKXXXXXXXXXXXXXXXXACVSEELILQKALL-E 1194
                 +  D  +E G+ K       +                   C S++L+L + +  +
Sbjct: 152  PGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGD 211

Query: 1195 CSKCDEDKVSPQPDEVPCLESVLESLAV--AFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368
              K   D VS +   +  L S+ E   V     S   K+DG    +  +S  +   +   
Sbjct: 212  AMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPP 271

Query: 1369 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDRFASSPAQLVNNE-----GKIK 1518
              S+   V  S D++ E          A E  D    +    SPAQ+  +E       + 
Sbjct: 272  LVSA---VEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVN 328

Query: 1519 ENPSDRKL---------------LRKD-------------ASNDEIGNSGNFSVSSYLER 1614
            E   D KL                 KD              S  ++  + + S+S+ L++
Sbjct: 329  EVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQ 388

Query: 1615 GDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRME 1794
            G GESSFS +G V+GLI+YSGP+AYSG++             FAFPILQ+EWN SPVRM 
Sbjct: 389  GIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMA 448

Query: 1795 KA-----RKHRGWRHGLL 1833
            KA     RKH+GWRHGLL
Sbjct: 449  KADRRHYRKHKGWRHGLL 466


>gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  149 bits (377), Expect = 4e-33
 Identities = 134/438 (30%), Positives = 189/438 (43%), Gaps = 57/438 (13%)
 Frame = +1

Query: 691  SVMDGQTANGIEEESKDSETSTVPHTFESDLKS----FTDKNVVECELPDFVVCYKESNT 858
            S+     ANG E+E +D  TS  P     D       + DK+V+ECELP+ VVCYKES  
Sbjct: 89   SISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTY 148

Query: 859  PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKV 1038
              VKDIC+DEGV  ++K L E+  DE           E      T   + +  + D    
Sbjct: 149  HVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMS 208

Query: 1039 SSPEHNRHDTASEFGAEK-------IDKXXXXXXXXXXXXXXXXACVSEELILQKALL-E 1194
                 +  D  +E G+ K       +                   C S++L+L + +  +
Sbjct: 209  PGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGD 268

Query: 1195 CSKCDEDKVSPQPDEVPCLESVLESLAV--AFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368
              K   D VS +   +  L S+ E   V     S   K+DG    +  +S  +   +   
Sbjct: 269  AMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPP 328

Query: 1369 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDRFASSPAQLVNNE-----GKIK 1518
              S+   V  S D++ E          A E  D    +    SPAQ+  +E       + 
Sbjct: 329  LVSA---VEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVN 385

Query: 1519 ENPSDRKL---------------LRKD-------------ASNDEIGNSGNFSVSSYLER 1614
            E   D KL                 KD              S  ++  + + S+S+ L++
Sbjct: 386  EVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQ 445

Query: 1615 GDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRME 1794
            G GESSFS +G V+GLI+YSGP+AYSG++             FAFPILQ+EWN SPVRM 
Sbjct: 446  GIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMA 505

Query: 1795 KA-----RKHRGWRHGLL 1833
            KA     RKH+GWRHGLL
Sbjct: 506  KADRRHYRKHKGWRHGLL 523


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  142 bits (359), Expect = 4e-31
 Identities = 123/413 (29%), Positives = 178/413 (43%), Gaps = 32/413 (7%)
 Frame = +1

Query: 691  SVMDGQTANGIEEESKDSE----TSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNT 858
            S +D  T   +    K+ E    TS    +F+ D   + DKNV+E ELP+ V+CYKE+  
Sbjct: 76   SKLDSCTGVNVSIHDKEEEVRNFTSLKIESFDKDSVFYIDKNVMEPELPELVLCYKENTY 135

Query: 859  PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVP----- 1023
              VKDICVDEGV  +   L ++S D+        P  + +        D++         
Sbjct: 136  HVVKDICVDEGVPSQENFLFDTSVDQEKLCPYLIPEKDIKSEIQKERVDLDMSTQYLSKN 195

Query: 1024 -DGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALLECS 1200
             + FK  S E        +   E+I                    V+E L   K+LL  +
Sbjct: 196  DNSFKCDSKESMAIAEIEDDAMEEIANYTSKETFSLGELLLMPEVVAE-LSHSKSLLNST 254

Query: 1201 KCDEDKVSPQPDEVPCLESVLESLAVAFTSEQ----SKTDGPVSNTCYNSKVEGGTITFD 1368
               E     +P E   L +        + +EQ    +    P+     + + + GT+T D
Sbjct: 255  DEAEQLSIQRPSENIVLATASACEESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSD 314

Query: 1369 FNSSKPNVSNSIDASAELT-------------TGKAPETEDEKPSDRFASSPAQLVNNEG 1509
             +    +  +     A L                K+P    +  SD  +S+P      EG
Sbjct: 315  SSPKASDHGHDEVILASLAPSYATEEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEG 374

Query: 1510 KIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAY 1689
              +   S+    R  + +++   +  FS    L+   GESSFS +GP+SGLI+YSGPIAY
Sbjct: 375  S-QVGGSEHLESRNSSRHEDTSITEPFS--GQLQYSHGESSFSAAGPLSGLISYSGPIAY 431

Query: 1690 SGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833
            SG++             FAFPILQ+EWNSSPVRM KA     RKHR WR GLL
Sbjct: 432  SGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLL 484


>ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217989 [Cucumis sativus]
            gi|449523672|ref|XP_004168847.1| PREDICTED:
            uncharacterized protein LOC101224727 [Cucumis sativus]
          Length = 431

 Score =  140 bits (352), Expect = 3e-30
 Identities = 121/418 (28%), Positives = 185/418 (44%), Gaps = 24/418 (5%)
 Frame = +1

Query: 652  DKNGAQITE--EAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELP 825
            D N   IT+   ++  V D   A GI   S    +S +  +F     S+ DK+V+EC++ 
Sbjct: 64   DGNSCMITKINRSSTDVFDDNNAEGI---SAFGASSNMKPSF-----SYVDKSVMECQMS 115

Query: 826  DFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTND-- 999
              +VC +E N   VKDIC+D+GV        +S+ ++    +   P +EDR  G+  +  
Sbjct: 116  KTIVCDQEVNVNDVKDICIDDGVASLENFFFKSTAEKSISKI--SPLEEDRNEGSIKEKE 173

Query: 1000 --PDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI 1173
               ++  F+ D  KVS  +H   D  +   A+ + +                  +SE  +
Sbjct: 174  TSSEVSKFIADDRKVSLEDHFAMDWTTHNDAKDLTQIEEEKLN-----------LSEPEL 222

Query: 1174 LQKALLECSKCDE--DKVSPQ-----------PDEVPCLESVLESLAVAFTSEQSKTDGP 1314
            L + L++ S   E  DK+  Q                 ++S  ++ A+   +E  K + P
Sbjct: 223  LMQKLVKRSYSSESLDKIGLQISGEKTNLEDPSSASKSVDSCNDTPALDSAAEPPKDNIP 282

Query: 1315 VSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQL 1494
               + YN + E G+I   FNS  P           +  G     E    SD    +  Q+
Sbjct: 283  AHPSGYNDEFENGSIALTFNSISP-----------VANGGEERQECCGRSDSVIGT--QV 329

Query: 1495 VNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYS 1674
            + N   ++   SD +LL     +D                  GESSFS   P++ L+TYS
Sbjct: 330  LTN---LEYRTSDSRLLSSQNMHD-----------------IGESSFSAVDPLASLVTYS 369

Query: 1675 GPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833
            GP+AYSG++             FAFPILQ+EWNSSPV+M KA     RK+RGWR GLL
Sbjct: 370  GPVAYSGSISLRSESSTTSTRSFAFPILQSEWNSSPVKMVKAERRHYRKYRGWREGLL 427


>gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica]
          Length = 499

 Score =  134 bits (337), Expect = 2e-28
 Identities = 134/474 (28%), Positives = 187/474 (39%), Gaps = 104/474 (21%)
 Frame = +1

Query: 724  EEESKD-----SETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDE 888
            E+E KD     + +S      E +   + DK+V+ECELP+ +VCYKES+   +KDIC+DE
Sbjct: 29   EDEVKDFVPPYTLSSEKLEALEKESDYYMDKSVMECELPELIVCYKESSCNTIKDICIDE 88

Query: 889  GVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDT 1068
            GV  ++K   E+  DE        P ++          DI   +PDGFK S+     HD 
Sbjct: 89   GVPSQDKNRFETGVDEKECCTFLSPDEDQNKQLLEEQMDIVVTLPDGFKSSA-----HDD 143

Query: 1069 ASEFGAEKIDKXXXXXXXXXXXXXXXXA--CVSEELILQKALL----------------- 1191
              +      D                     VS+E+     +L                 
Sbjct: 144  LEKGFVIPCDSKGLTQIGDAIYYTQEKTEIEVSKEIFFPANVLPMQELGAGNAHSSKSSN 203

Query: 1192 -ECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGP------------VSNTCY 1332
             E ++  +D V    ++V  +     +  V+ T E S ++              V     
Sbjct: 204  EESTEAVQDTVQSSGEKVSEIAQTGSTAVVSVTEESSHSEKKALVSAAEESNFHVDELSN 263

Query: 1333 NSKVEGGTITFDFNSSKPNVSNSIDA---------------------------------- 1410
            NSKVE G+ T   + +  +VS + DA                                  
Sbjct: 264  NSKVENGSTTSGLSDTSVHVSTTRDACPDNDVHKHFETQTMPAGDDGDDNDDNMPDAEIV 323

Query: 1411 -------SAELTTGK--APE--------------TEDEKPSDRFASSPAQLVNNEGKI-- 1515
                   SA + TG+   PE               +DE P     SS  Q  +    I  
Sbjct: 324  PSQVQPCSAPVVTGREECPENGVCQPLDTSSTSKVDDEIPHSVIVSSQVQHYSAPVTISR 383

Query: 1516 KENPSDRKLLRKDASND-EIG--NSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIA 1686
            +E P +      + SN   +G  NS     S +++RG GESSFS +G  S L+  SGP  
Sbjct: 384  EERPENGVWQCPETSNAFMVGDVNSDTQYASFHVQRGFGESSFSAAGHFSSLMNTSGP-- 441

Query: 1687 YSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833
            YSGNV             FAFP+LQ+EWNSSPVRM KA     RKHRGW H LL
Sbjct: 442  YSGNVSLRSESSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHLRKHRGWGHSLL 495


>ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Citrus
            sinensis]
          Length = 483

 Score =  127 bits (318), Expect = 3e-26
 Identities = 132/436 (30%), Positives = 170/436 (38%), Gaps = 88/436 (20%)
 Frame = +1

Query: 790  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVH-----------------------E 900
            + DK+V ECELP+ +VCYKE NT HVKDIC+DEGVH                       +
Sbjct: 85   YMDKSVTECELPELIVCYKE-NTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKED 143

Query: 901  ENKILIESSK------------------DEH----AGSVVSQPSDED----------RYG 984
             N  L+E SK                  DEH     GS     SDED          R  
Sbjct: 144  RNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPA 203

Query: 985  GTTNDPDIE----------FFVPDGFKV----SSPEHNRHDTASEFGAEKIDKXXXXXXX 1122
            G   D   E          F + D   +    +    ++    +E  AEK          
Sbjct: 204  GDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKA 263

Query: 1123 XXXXXXXXXACVSEELILQKALLECSK-----CDEDKVSPQPDEVPCLESVLE-----SL 1272
                        +EE++     +  S+     C E  +S  P  V   E   +     SL
Sbjct: 264  ALANPEEANGGTAEEILTGADFVSASEESQNGCGEG-ISGNPTLVSASEKAHDKSEEASL 322

Query: 1273 A----VAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAP 1440
            A    V+  SE +K       + YNS VE G+ITFDF++S P  S   +    L  G + 
Sbjct: 323  ASPDGVSALSESTKIS-TAEKSSYNSMVETGSITFDFDASAPGASGKEEP---LQIGDSQ 378

Query: 1441 ETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGD 1620
              E    S R   +P Q                                 SVSS    G 
Sbjct: 379  RIETPGMS-RLEDAPRQ---------------------------------SVSSQFHSGL 404

Query: 1621 GESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA 1800
            GESSFS +G +  LI+YSGP+AYSG++             FAFPILQ EW+ SPVRM KA
Sbjct: 405  GESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKA 464

Query: 1801 -----RKHRGWRHGLL 1833
                 RKH+ W+ GLL
Sbjct: 465  DRRHYRKHK-WKQGLL 479


>ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Citrus
            sinensis] gi|568861537|ref|XP_006484257.1| PREDICTED:
            dentin sialophosphoprotein-like isoform X2 [Citrus
            sinensis]
          Length = 496

 Score =  127 bits (318), Expect = 3e-26
 Identities = 132/436 (30%), Positives = 170/436 (38%), Gaps = 88/436 (20%)
 Frame = +1

Query: 790  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVH-----------------------E 900
            + DK+V ECELP+ +VCYKE NT HVKDIC+DEGVH                       +
Sbjct: 98   YMDKSVTECELPELIVCYKE-NTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKED 156

Query: 901  ENKILIESSK------------------DEH----AGSVVSQPSDED----------RYG 984
             N  L+E SK                  DEH     GS     SDED          R  
Sbjct: 157  RNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPA 216

Query: 985  GTTNDPDIE----------FFVPDGFKV----SSPEHNRHDTASEFGAEKIDKXXXXXXX 1122
            G   D   E          F + D   +    +    ++    +E  AEK          
Sbjct: 217  GDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKA 276

Query: 1123 XXXXXXXXXACVSEELILQKALLECSK-----CDEDKVSPQPDEVPCLESVLE-----SL 1272
                        +EE++     +  S+     C E  +S  P  V   E   +     SL
Sbjct: 277  ALANPEEANGGTAEEILTGADFVSASEESQNGCGEG-ISGNPTLVSASEKAHDKSEEASL 335

Query: 1273 A----VAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAP 1440
            A    V+  SE +K       + YNS VE G+ITFDF++S P  S   +    L  G + 
Sbjct: 336  ASPDGVSALSESTKIS-TAEKSSYNSMVETGSITFDFDASAPGASGKEEP---LQIGDSQ 391

Query: 1441 ETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGD 1620
              E    S R   +P Q                                 SVSS    G 
Sbjct: 392  RIETPGMS-RLEDAPRQ---------------------------------SVSSQFHSGL 417

Query: 1621 GESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA 1800
            GESSFS +G +  LI+YSGP+AYSG++             FAFPILQ EW+ SPVRM KA
Sbjct: 418  GESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKA 477

Query: 1801 -----RKHRGWRHGLL 1833
                 RKH+ W+ GLL
Sbjct: 478  DRRHYRKHK-WKQGLL 492


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  125 bits (314), Expect = 7e-26
 Identities = 114/393 (29%), Positives = 168/393 (42%), Gaps = 15/393 (3%)
 Frame = +1

Query: 700  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 879
            D +    + + S D + + V    + D   + DKNV  C+LP+ VVCYKE+    VKDIC
Sbjct: 61   DNEAGKKVRDISHDCDAN-VDSPDKKDPVFYMDKNVTACDLPEIVVCYKENTYHVVKDIC 119

Query: 880  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNR 1059
            VDEGV  + K L    KD    SV S  +++      TN        P   K +   + +
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLTKADKTN------VNPSESKSAEDSNTK 168

Query: 1060 HDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQ--- 1230
             D +      K D+                +  ++E ++          +E K SP    
Sbjct: 169  VDDSEFCNNCKTDRDVEESSREDFADAEGSSAYNQEHLIVT--------EEAKASPSHGL 220

Query: 1231 -PDEVPCLESVLESLAVAFT--SEQSKTDGPV-SNTCYNSKVEGGTITFDFNSSKPNVSN 1398
             P E+   E+  + +A++    S++S T G + S       +  G I+ D +  +     
Sbjct: 221  NPSEIEPDENSNDEVAISSETDSKESLTLGDILSREDEQKSLNHGNISSDSHEEQSPSQL 280

Query: 1399 SIDASAELTTG----KAPETEDEKP-SDRFASSPAQLVNNEGKIKENPSDRKLLRKDASN 1563
                   L T     +  +TE+ KP  ++  S+    +    K   +P   +       N
Sbjct: 281  QDKEKRSLETAAIETELEKTEEPKPVEEKLPSASTTTLQEPNKTCNDPEKPETENHHQQN 340

Query: 1564 DEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXF 1743
              + NS      S    G+   S + S  +SG ITYSGPIAYSG++             F
Sbjct: 341  SLVENSYEDDKLSSSRFGETSFSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSF 400

Query: 1744 AFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833
            AFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 401  AFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 433


>ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa]
            gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly
            protein gar2 [Populus trichocarpa]
          Length = 486

 Score =  124 bits (312), Expect = 1e-25
 Identities = 122/421 (28%), Positives = 183/421 (43%), Gaps = 73/421 (17%)
 Frame = +1

Query: 790  FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIES------------SKD 933
            + DK+V+  E+P+ +VCYKE NT HVKDICVDEGV  ++K L ++            S+ 
Sbjct: 74   YMDKSVMVREVPELIVCYKE-NTYHVKDICVDEGVPLQDKFLFDTDAHKKNMCEFLPSER 132

Query: 934  EHAGSVVSQPSDED-------RYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTA------- 1071
            +    +V + SD D       +      + D+   VPD    S  + ++HD +       
Sbjct: 133  DMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDLHLPVPDVLISSEEKGSKHDLSLDCDPKH 192

Query: 1072 -------SEFGAEKI----DKXXXXXXXXXXXXXXXXAC------------VSEELIL-- 1176
                    ++G +K+     K                 C            V ++ +L  
Sbjct: 193  LMPTEEVMDYGTKKVTDNASKEILSLRDLLSMSELGAKCTPANASYHNMDKVEQQSLLCP 252

Query: 1177 -QKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAF-TSEQSKTDGPVSNTCYNSKVEG 1350
             + A+LE     E+  S    E    ++ LES  +A  T + +  +G   +T   + +  
Sbjct: 253  RENAILETDSASEE--SEHCGEETISDNGLESATLAIPTQDPAYQEGDHGHT--EAVLVS 308

Query: 1351 GTITFDFNSSKPN----VSNSIDASAELTTGKAPETEDEKPS-----------DRFASSP 1485
             T+T     S        S+++D+ +E   G     EDE P            D  +S+P
Sbjct: 309  PTLTSAAEESDSKETKLASHALDSFSE---GSTSRIEDELPYNSKTETRSISFDNDSSAP 365

Query: 1486 AQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLI 1665
            A          +N   ++L  +  S  E  N+   S    L+  DGESSFS+SGP+ GL 
Sbjct: 366  AASARES---PQNGESQRLGTRIVSRFEDPNAERLS-GGQLQYADGESSFSSSGPLFGLT 421

Query: 1666 TYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGL 1830
            ++SGPIAYSG+V             FAFPILQ+EWNSSP RM KA     +K R W  GL
Sbjct: 422  SHSGPIAYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPARMAKADRRHFQKPRKWMQGL 481

Query: 1831 L 1833
            L
Sbjct: 482  L 482


>ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella]
            gi|482559818|gb|EOA24009.1| hypothetical protein
            CARUB_v10017222mg [Capsella rubella]
          Length = 455

 Score =  122 bits (306), Expect = 6e-25
 Identities = 118/419 (28%), Positives = 178/419 (42%), Gaps = 21/419 (5%)
 Frame = +1

Query: 640  FDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKS------FTDK 801
            +DTR  +G +  +E   ++++  +    +E  K +  ++     + D         + DK
Sbjct: 52   YDTR--SGDEWDKENDGNILEPHSCGDADEAGKKTRDTSHDFVAKGDSPEKVNPVFYMDK 109

Query: 802  NVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILI-------ESSKDEHAGSVVSQ 960
            NV  C+LP+ VVCYKE++   VKDICVDEGV  + K L         ++   H GSV   
Sbjct: 110  NVTACDLPEIVVCYKENSYHVVKDICVDEGVPVQEKFLFGEKDSVKSTTNSNHCGSVDLM 169

Query: 961  PSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXX 1140
              D+          D++   P   K     +++ D +SE   +K  +             
Sbjct: 170  KVDK---------TDVK---PSETKSLEDSNSKVDDSSEVCNDKTVQDVEESSREAFADA 217

Query: 1141 XXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVS 1320
               +   +E ++  +     K  E  +  + +E+   E V+ S    F SE       +S
Sbjct: 218  EGSSNYDQEHLIVTSPTLALKPSEISLEVESEEISKDEVVISS--EDFLSESLTLGDILS 275

Query: 1321 NTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTG---KAPETEDEKPSDRFASSPAQ 1491
                   ++          S P        S E TTG   K  + E+ K ++   SS + 
Sbjct: 276  REDKQKSLKNDNGNRPEELSPPQHQEKEKRSLE-TTGLDTKLEKVEEPKTAEENLSSAST 334

Query: 1492 LVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFST--SGPVSGLI 1665
                E     N  ++        N  + +  +  +SS      GE+SFS   S  +SG I
Sbjct: 335  TTVQEPNKSCNDLEKPETENHQQNRLVNSYEDDKLSS---SRFGETSFSAAESVSISGHI 391

Query: 1666 TYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833
            TYSGPIAYSG++             FAFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 392  TYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 450


>gb|ESW25465.1| hypothetical protein PHAVU_003G038300g [Phaseolus vulgaris]
          Length = 430

 Score =  119 bits (298), Expect = 5e-24
 Identities = 122/424 (28%), Positives = 182/424 (42%), Gaps = 19/424 (4%)
 Frame = +1

Query: 619  IDAESFLFDTRDKNGAQITEEAAHSVMDGQT----ANGIEEESKDSETSTVP------HT 768
            +D E+  ++T+ ++   + E  +HS  D ++     NGIE   K S TS +        +
Sbjct: 70   VDCETNEYETKVRD---LVEPLSHSSKDIESFMKFPNGIESV-KRSPTSPISSPREGVES 125

Query: 769  FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGS 948
              + +  F  K V ECE P   VCY ESN   VKDIC+DEGV +++ I++ +  DE A  
Sbjct: 126  LRNSVDVFMVKTVTECE-PHPEVCYNESNYHVVKDICIDEGVLKKDNIMVVNPVDEKAHD 184

Query: 949  VVSQPSDE--DRYGGTTNDPDIEFFVPDGFKVSSPEHNRH-DTASEFGAEKIDKXXXXXX 1119
                 S E  ++    T+   +     +G       HN+H D      +  ++K      
Sbjct: 185  FFPFESYETKEKQKDNTSINVLSLTPTEGSDKVFANHNQHKDLMLTEVSGDVNKQTPSP- 243

Query: 1120 XXXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQS 1299
                          ++++LQ  L E S   +DK   Q    P L S+ E  ++A   ++S
Sbjct: 244  -------------GDKVLLQDLLTEDSASSDDK-GEQISIEPGLHSISEDPSMAAGEDES 289

Query: 1300 KTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFAS 1479
            K D                       SK   +  +D SA    GK    E+ + S    S
Sbjct: 290  KND-----------------------SKAPENAKVDPSAPADCGK----EECRQS---GS 319

Query: 1480 SPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSG 1659
                 + +  +  E  SD +                 + +S++    GESSFS  GPVSG
Sbjct: 320  CKCDEIQHTSRPMEWKSDDQ-----------------AATSHIRHSLGESSFSAMGPVSG 362

Query: 1660 LITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRG----WR 1821
             I+YSGP+ +SG++             FAFPI+Q+EWNSSPVRM KA  R HR     WR
Sbjct: 363  RISYSGPVPFSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRRHHRKQRCCWR 422

Query: 1822 HGLL 1833
             G L
Sbjct: 423  GGFL 426


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  119 bits (297), Expect = 7e-24
 Identities = 120/394 (30%), Positives = 170/394 (43%), Gaps = 16/394 (4%)
 Frame = +1

Query: 700  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 879
            + +    + + S D + + V    + D   + DKNV  C+LP+ VVCYKE+    VKDIC
Sbjct: 61   ENEAGKKVRDTSHDCDAN-VDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 880  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTN-DPDIEFFVPDGF-KVSSPEH 1053
            VDEGV  + K L    KD    SV S  +++      TN +P       D   KV   E 
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEF 174

Query: 1054 -NRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI------LQKALLECSKCDE 1212
             N H T  +   E   +                  V+EE+       L  + +E  +  +
Sbjct: 175  CNDHKTDRDV-EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSK 233

Query: 1213 DKV--SPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKP 1386
            D+V  S   D   CL     +L    + E  +      N   +S  E           + 
Sbjct: 234  DEVAISQDNDSKECL-----TLGDILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEKRS 288

Query: 1387 NVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASND 1566
              + +I+   E T  + P+  +EK S   +++ +Q  N      E P      +++   +
Sbjct: 289  LETTAIETELEKT--EEPKQGEEKLSS-VSTTTSQEPNKTCNEPEKPETENHHQQNCLVE 345

Query: 1567 EIGNSGNFSVSSYLERGDGESSFSTSGPVS--GLITYSGPIAYSGNVXXXXXXXXXXXXX 1740
                   FS S +     GE+SFS +  VS  G ITYSGPIAYSG++             
Sbjct: 346  NSYEDDKFSSSRF-----GETSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRS 400

Query: 1741 FAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833
            FAFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 401  FAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 434


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score =  119 bits (297), Expect = 7e-24
 Identities = 120/394 (30%), Positives = 170/394 (43%), Gaps = 16/394 (4%)
 Frame = +1

Query: 700  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 879
            + +    + + S D + + V    + D   + DKNV  C+LP+ VVCYKE+    VKDIC
Sbjct: 61   ENEAGKKVRDTSHDCDAN-VDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 880  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTN-DPDIEFFVPDGF-KVSSPEH 1053
            VDEGV  + K L    KD    SV S  +++      TN +P       D   KV   E 
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEF 174

Query: 1054 -NRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI------LQKALLECSKCDE 1212
             N H T  +   E   +                  V+EE+       L  + +E  +  +
Sbjct: 175  CNDHKTDRDV-EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSK 233

Query: 1213 DKV--SPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKP 1386
            D+V  S   D   CL     +L    + E  +      N   +S  E           + 
Sbjct: 234  DEVAISQDNDSKECL-----TLGDILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEKRS 288

Query: 1387 NVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASND 1566
              + +I+   E T  + P+  +EK S   +++ +Q  N      E P      +++   +
Sbjct: 289  LETTAIETELEKT--EEPKQGEEKLSS-VSTTTSQEPNKTCNEPEKPETENHHQQNCLVE 345

Query: 1567 EIGNSGNFSVSSYLERGDGESSFSTSGPVS--GLITYSGPIAYSGNVXXXXXXXXXXXXX 1740
                   FS S +     GE+SFS +  VS  G ITYSGPIAYSG++             
Sbjct: 346  NSYEDDKFSSSRF-----GETSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRS 400

Query: 1741 FAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833
            FAFPILQ+EWNSSPVRM KA K R   GWRH LL
Sbjct: 401  FAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 434


>ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247891 [Vitis vinifera]
          Length = 229

 Score =  118 bits (295), Expect = 1e-23
 Identities = 82/228 (35%), Positives = 116/228 (50%), Gaps = 5/228 (2%)
 Frame = +1

Query: 1165 ELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKV 1344
            EL    ++ E S+ +  ++  Q  + P  E+VLE+ A+   +E+S  +   +   YNSK+
Sbjct: 32   ELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESDKNSFPNELSYNSKL 91

Query: 1345 EGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKEN 1524
            E GTITFDF SS    + S+D+  E++    P+ +  +P                     
Sbjct: 92   ESGTITFDFGSS----TTSMDSGREVS----PQNDGCEP--------------------- 122

Query: 1525 PSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVX 1704
            P + + L K     E     +   S  ++RG GESSFS +GP S LI+YSG I +SGN+ 
Sbjct: 123  PLESQNLSKLEDGSE-----SLPFSGQIQRGLGESSFSAAGPSSALISYSGQITHSGNIS 177

Query: 1705 XXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833
                        FAFP+LQ EWNSSPVRM KA     RKHR WR G+L
Sbjct: 178  LRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRGIL 225


Top