BLASTX nr result

ID: Ephedra25_contig00007846 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00007846
         (3032 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006850162.1| hypothetical protein AMTR_s00022p00240520 [A...   142   9e-31
ref|XP_006843937.1| hypothetical protein AMTR_s00006p00038830 [A...   139   7e-30
ref|XP_002971847.1| hypothetical protein SELMODRAFT_412517 [Sela...   127   4e-26
ref|XP_001755726.1| predicted protein [Physcomitrella patens] gi...   125   9e-26
emb|CBI24675.3| unnamed protein product [Vitis vinifera]              117   3e-23
gb|EMJ23791.1| hypothetical protein PRUPE_ppa004165mg [Prunus pe...   116   5e-23
ref|XP_002274006.2| PREDICTED: uncharacterized protein LOC100255...   112   8e-22
gb|EXB88331.1| hypothetical protein L484_020399 [Morus notabilis]     110   4e-21
gb|EMJ03152.1| hypothetical protein PRUPE_ppa004590m1g [Prunus p...   108   2e-20
gb|EXB37622.1| hypothetical protein L484_021828 [Morus notabilis]     107   2e-20
ref|XP_006380661.1| hypothetical protein POPTR_0007s10270g [Popu...   106   5e-20
ref|XP_002332233.1| predicted protein [Populus trichocarpa]           106   5e-20
ref|XP_006494052.1| PREDICTED: uncharacterized protein LOC102622...   106   7e-20
ref|XP_004299979.1| PREDICTED: uncharacterized protein LOC101298...   106   7e-20
ref|XP_006442780.1| hypothetical protein CICLE_v10019728mg [Citr...   104   3e-19
gb|EOY04666.1| Uncharacterized protein TCM_019866 [Theobroma cacao]   103   3e-19
gb|EOX91598.1| Uncharacterized protein isoform 2 [Theobroma cacao]    102   1e-18
gb|EOX91597.1| Uncharacterized protein isoform 1 [Theobroma cacao]    102   1e-18
ref|XP_002876518.1| hypothetical protein ARALYDRAFT_486432 [Arab...   101   2e-18
ref|XP_004306989.1| PREDICTED: uncharacterized protein LOC101291...   100   4e-18

>ref|XP_006850162.1| hypothetical protein AMTR_s00022p00240520 [Amborella trichopoda]
            gi|548853760|gb|ERN11743.1| hypothetical protein
            AMTR_s00022p00240520 [Amborella trichopoda]
          Length = 517

 Score =  142 bits (358), Expect = 9e-31
 Identities = 100/287 (34%), Positives = 147/287 (51%), Gaps = 12/287 (4%)
 Frame = +3

Query: 834  CAQNMVSVEVPEESKF--NDQIMNTSESNFGGYDNMVEIL-NSNEQENRVDSPEEASECS 1004
            C +N+   +   E+    ND       +   G D +++I  N +  + R    EE +ECS
Sbjct: 40   CEENLECTKQSRETNLSINDTSDQDEMNKDMGCDTIIDIEGNEDNGDVRWSLAEETTECS 99

Query: 1005 SSFGDT----DVDMDSFGSDLKVDAA-EAESGLWNEDIDSY--AAQKRKGALRPEWKDFR 1163
            SSFGDT    D D     SD +V++    E+G  N  ID +    + RK  L  +WK +R
Sbjct: 100  SSFGDTLSSLDDDCKRIASDQEVESRFHGENGFANV-IDEFNGGLRLRKKKLTADWKKYR 158

Query: 1164 RHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQE-GQFDIDASSTRSSSIKKQNNS 1340
              + WRCHWL+LR  +L S+ASKYDR+L  I+ RK+   GQ   D+SSTR       N  
Sbjct: 159  NPVMWRCHWLELRIKELQSQASKYDRLLLEIKRRKRLTFGQVAQDSSSTRVLPFSAPNPR 218

Query: 1341 HPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLR-MASDPSS 1517
             PV       K+E+T D+  Y+  HP++S Y K++K +AD    +DD  S+  +  +  +
Sbjct: 219  LPVKKRQRRKKVEDTIDMQYYMALHPLFSYYEKRKKPEADAVSVDDDCNSVPVLTEEQRN 278

Query: 1518 IRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLNK 1658
            IR D   +     A E     +S E L  ++E LQSR+ K+   LNK
Sbjct: 279  IRVDGFGASHVWVASEMGSMETSPEQLLWRLEGLQSRILKLKGQLNK 325


>ref|XP_006843937.1| hypothetical protein AMTR_s00006p00038830 [Amborella trichopoda]
            gi|548846336|gb|ERN05612.1| hypothetical protein
            AMTR_s00006p00038830 [Amborella trichopoda]
          Length = 515

 Score =  139 bits (350), Expect = 7e-30
 Identities = 101/284 (35%), Positives = 144/284 (50%), Gaps = 15/284 (5%)
 Frame = +3

Query: 852  SVEVPEESKFNDQIMNTSESNFG-----GYDNMVEIL-NSNEQENRVDSPEEASECSSSF 1013
            S+E   +S+  D  +N +          G D +++I  N +  + R     E +ECSSSF
Sbjct: 43   SLECTNQSRETDLTINDTSDQDKMNKDIGCDTIIDIEGNVDNGDVRWSHAVETTECSSSF 102

Query: 1014 GDT----DVDMDSFGSDLKVDAA-EAESGLWNEDIDSY--AAQKRKGALRPEWKDFRRHI 1172
            GDT    D D     SD +V++    E+G  N  ID +    + RK  L  EWK +R  I
Sbjct: 103  GDTLSSLDDDCKRIVSDQEVESQFHGENGFANV-IDDFNGGLRLRKKKLTAEWKKYRNPI 161

Query: 1173 EWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQE-GQFDIDASSTRSSSIKKQNNSHPV 1349
             WRCHWL+LR  +L S+ASKYDR+L  I+ RK    GQ   D+SS R       N   PV
Sbjct: 162  MWRCHWLELRIKELQSQASKYDRLLLEIKRRKPLTFGQLAQDSSSARFLPFSAPNPRLPV 221

Query: 1350 LXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLRMAS-DPSSIRF 1526
                    +E+T D+ SY+  HP++S Y K+ K +AD    +DD  S+ + + +  +IR 
Sbjct: 222  KKRKRRKNVEDTIDMQSYMALHPLFSYYEKRRKPEADAVSVDDDYNSVPVLNEEQKNIRV 281

Query: 1527 DDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLNK 1658
            DD  +     A E     +S E L   +E LQSR+ K+   LNK
Sbjct: 282  DDFGASHAWVASEMGSMETSPEQLLWSLEGLQSRILKLKGQLNK 325


>ref|XP_002971847.1| hypothetical protein SELMODRAFT_412517 [Selaginella moellendorffii]
            gi|300160146|gb|EFJ26764.1| hypothetical protein
            SELMODRAFT_412517 [Selaginella moellendorffii]
          Length = 496

 Score =  127 bits (318), Expect = 4e-26
 Identities = 103/368 (27%), Positives = 176/368 (47%), Gaps = 23/368 (6%)
 Frame = +3

Query: 927  DNMVEILNSNEQEN-RVDSPEEASECSSSFGDTDVDMDSFGSDLKVD-AAEAESGLWNED 1100
            D  VEIL   E E  R D  E+ASECSSSFG +D    + G +   + A+E +S     D
Sbjct: 102  DVPVEILGLPEAEGVRTDRDEQASECSSSFGFSDA---TSGDEANTNNASEVDSAA--RD 156

Query: 1101 IDSYAAQKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQEG 1280
             +      R+  L  +WK++RR IEWRC WL +R  +L  +A+KYD +L+GIQ  K  +G
Sbjct: 157  GNGALETDRRRPLNMDWKNYRRGIEWRCRWLDIRLMELQRQATKYDEVLSGIQKAKPWDG 216

Query: 1281 QFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDAD 1460
            + + D ++ R++ ++++  + P+L      K+E T D++ Y+ +HP++  Y K +K + +
Sbjct: 217  RTEPDGAA-RAAPVREERPAQPLLHRQRRRKVEETVDVADYMSKHPLFQRYEK-KKREYE 274

Query: 1461 GALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKM 1640
            G  H++      +A++ + +R       DP  A E  E   S+E +  ++E LQ+RV ++
Sbjct: 275  GEQHKE---PNDLAAEGAMVRL------DPPPADEANE--DSVENIMLKVETLQARVVRL 323

Query: 1641 NAHLNKHFS---------------GIGRACESSKGILHRFNSNETGTSKVQDPTLEDSLK 1775
               L    S               G  RA  +    + R          V    ++    
Sbjct: 324  RNQLRGSMSHKLPGQVARAGGGGGGAARAPAAKAAAVSRLTKAAPSGHPVSPAAVKGGQA 383

Query: 1776 SQRVSEDPFPSHIQENITHCVLPSNEVPAYLEAVHH-SIENDL-----PLDQREREDSSD 1937
            ++R + D        +I + V+P +    Y++ V H +IE          +  E ++S D
Sbjct: 384  ARRRASD-------YDINNVVMPVSVGAKYVQHVRHANIETPQWRLAEKSEMCEEDNSDD 436

Query: 1938 EVTIAKPY 1961
            E T  + Y
Sbjct: 437  EDTDDETY 444


>ref|XP_001755726.1| predicted protein [Physcomitrella patens] gi|162693045|gb|EDQ79399.1|
            predicted protein [Physcomitrella patens]
          Length = 1134

 Score =  125 bits (315), Expect = 9e-26
 Identities = 160/655 (24%), Positives = 254/655 (38%), Gaps = 127/655 (19%)
 Frame = +3

Query: 360  LDENRIFSKTSAGSTSSDYSNHQRSKIFFSVMMEKAYSNEYRTWREF------------- 500
            LD    F     G ++S++S    S + F++M EK  +++Y TWR F             
Sbjct: 308  LDSKGDFMGPLDGYSASNHSKVIESPMCFAMMQEKIRTHKYSTWRMFVAIHLDMLEVLVK 367

Query: 501  -----------------KEDFEEICNSFIHNENRDSEIWNAAHKLLQQGREYLEGFADRS 629
                              EDFE IC + +    + S IW AA  LL++G++ LE +    
Sbjct: 368  TRNGSCERGLVWVYVPVNEDFERICCNALKCNQKRSIIWAAADDLLRRGKKRLEQYEGYG 427

Query: 630  QKL-----FDGAIKVK---DSDSIRDFDKNTNDSCSDPSNKKCIMSAQNYIIQSSNDRV- 782
            + L      D  I V     S++++        S +    +  I  A     +S +  + 
Sbjct: 428  ESLVAWSQIDSKIGVSVKAKSENVKAMVCQPEKSLTSKGVRMTISGAATRSAESKSPSMC 487

Query: 783  -------------FLPADESVQTICDSTFRCAQNMVSVEVPEESKFNDQIM--------- 896
                          L  ++ V    +ST       V     E S   D I          
Sbjct: 488  RNTPVGKGSCLIQHLKTEKRVDQTTESTEPFPGPAVGGVFEETSMQKDGIKADVSRVVRN 547

Query: 897  ------------NTSESNFGGYDNMVEILNSNE---QENRVDSPEEASECSSSFGDTDVD 1031
                            S  G  D  V++    +    E R DS  EA+E SSS+G +   
Sbjct: 548  MESSGRVSEGEPGNGSSAVGQTDQDVDVEGGRDVDSAERRSDSVGEATESSSSYGSSGSS 607

Query: 1032 MDSFGS-DLKVDAAEAESGLWN--------EDIDSYAAQKRKGALRPEWKDFRRHIEWRC 1184
            ++  GS D  +   EAES L +        ED      ++ K AL  EWK  RR IEWRC
Sbjct: 608  LEGEGSPDRALWIFEAESSLRDGNGAVGLMEDDGVITGERGKKALDAEWKQTRRGIEWRC 667

Query: 1185 HWLQLRYSDLLSRASKYDRILAGIQSRKQQEGQFDI-DASSTRSSSIKKQNNSHPVLXXX 1361
            HWL+L+   + ++ ++Y+ +L   QS K  +   ++ + S +R+  +K   N HP++   
Sbjct: 668  HWLELKTLAIQAKLAQYENVLKKAQSEKVWKWDGEVGEGSCSRTVPVKILRN-HPIVHRK 726

Query: 1362 XXXKLENTADLSSYLLRHPVYSLYGKQ--EKLDADGA----LHED-----DIYSLRMASD 1508
               + E   D+   L +HPV+S Y K+  ++   DG+    +H D     D       S+
Sbjct: 727  HRRRAEGGQDVD--LGKHPVFSRYEKRKTQRKSDDGSGQKMVHRDLAQQTDFEVCCTLSE 784

Query: 1509 PSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLNKHFSGIGRACE 1688
            P S  ++ +D E   +A     G  S+E L  Q+E LQ RV+K    L K  +    +  
Sbjct: 785  PES-TYEREDVELQPAAENAFVGHDSMEQLLWQVEALQVRVKKQKHLLRKEVAARKNSAV 843

Query: 1689 SSKG-------------ILHRFNSN------ETGTSKVQDPTLEDSLKSQRVSEDPFPSH 1811
              KG                + NS+        G ++V  P+       +  S       
Sbjct: 844  DCKGGGASSLSLRVPPPATQKGNSSAQGLPGPRGVTRVGPPSSSSGSLGRSGSGAGLARR 903

Query: 1812 --IQENITHCVLPSNEVPAYLEAVHH---------SIENDLPLDQREREDSSDEV 1943
                 +I++ V+P N    + E V H          +E+  P  QR  + SSDEV
Sbjct: 904  KAADYDISNMVMPVNVGATFAEQVRHVDIEIPLWRVVEDANPAVQRLEDSSSDEV 958


>emb|CBI24675.3| unnamed protein product [Vitis vinifera]
          Length = 465

 Score =  117 bits (293), Expect = 3e-23
 Identities = 70/228 (30%), Positives = 121/228 (53%), Gaps = 5/228 (2%)
 Frame = +3

Query: 987  EASECSSSFGDTDVDMDSFG--SDLKVDAAEAESGLWNEDIDSYAA--QKRKGALRPEWK 1154
            +A+E SSSFGDT+   + F   S+ +V++   +    +   DS+    Q RK  L   WK
Sbjct: 40   DATEYSSSFGDTESGNEKFSGLSEAEVESEYRDHNSLDSPFDSFGLMFQTRKKKLTSHWK 99

Query: 1155 DFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQEG-QFDIDASSTRSSSIKKQ 1331
             +   + WRC W +L+  +  S+A+KY ++LA    RKQ E  QF  +   ++S     Q
Sbjct: 100  KYIHPLMWRCKWAELKIREFKSQAAKYSKLLAAYDQRKQLESDQFTSEGFDSKSLPFSNQ 159

Query: 1332 NNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLRMASDP 1511
            N+    +      ++E T D+ SY+L H ++  Y + ++ DAD +   DD +   + ++ 
Sbjct: 160  NHRMKAMKRRKRKRIEETTDVPSYMLNHNLFG-YFENKRSDADSSSMVDD-FGNPVVTEQ 217

Query: 1512 SSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLN 1655
            ++   D+    D  S ++ ++   SLE + C+IE+ QSRVQK+ A L+
Sbjct: 218  NANGDDNFGISDDSSLLKFRDDDDSLEQILCKIEMAQSRVQKLKAQLD 265


>gb|EMJ23791.1| hypothetical protein PRUPE_ppa004165mg [Prunus persica]
          Length = 525

 Score =  116 bits (291), Expect = 5e-23
 Identities = 114/455 (25%), Positives = 204/455 (44%), Gaps = 44/455 (9%)
 Frame = +3

Query: 930  NMVEILNSNEQENRVDSPE-EASECSSSFGDTDVDMDSFGSDLKVDAAE---AESGLWNE 1097
            +++E  N++    + ++ + +A+E SSSF DT  D ++F    + +      A++GL + 
Sbjct: 69   DIIECRNNHNSSLQAETEDPDATEYSSSFADTMSDTENFSGFSEGEVQSQFFADNGLAS- 127

Query: 1098 DIDSYAA--QKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQ 1271
            D D++++  Q RK  L   W+DF R I WRC W++LR  ++ S+A KY R LA    RK 
Sbjct: 128  DFDAFSSPFQMRKKKLTNHWRDFIRPIMWRCKWMELRIKEIESQALKYSRELAIADQRKH 187

Query: 1272 QE-GQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEK 1448
                QF ++   ++S     Q      +      ++E T D++SY+  H V+S Y + ++
Sbjct: 188  SGFDQFTLEEFGSKSVPFSSQCRRKKAMKRRKRKRVEETTDIASYMSHHNVFS-YLENKR 246

Query: 1449 LDADGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSR 1628
             D D     D+  +  + ++ S+   D   + D  S  E ++G  SLE++  +IE + SR
Sbjct: 247  SDPDSTSVADEFSNAVIITEQSADYNDKLSTGDDWSFFEFRDGDKSLEHVLWKIETVHSR 306

Query: 1629 VQKMNAHLNKHFSGIGRACE-SSKGILHRFNSNETGTSKVQDPTLE----DSLKSQRVSE 1793
            V KM   ++   S    AC  SS   L      +  TS    P       D++ +  +  
Sbjct: 307  VHKMKNQIDVVMS--KNACRFSSSENLSLLVPCDAPTSSAHSPAFSAGNGDTISAGAIYT 364

Query: 1794 DPFPSHIQE-NITHCVLPSNEVPAYLEA--VHHSIEN--------DLPLDQREREDSSDE 1940
                 HI E N+   V+P + V ++ EA  V   IE+        D+   Q +  DSS++
Sbjct: 365  S--TQHISEYNLGDMVMPESAVSSFGEAIVVPDIIESTVGLLSAIDVTFHQPQFGDSSED 422

Query: 1941 VTIAKPYMEPVFVNSKEQFNENQLFSVYGSE-TVNYKRDASTLESEENIPN--------- 2090
            +      ++ V + ++    E   F +   +    +++    ++ E   P          
Sbjct: 423  I------VDNVLIPNEAAEGEKHTFELISDQPKETHEQSDKGIQEEGPFPTPSSEPDPLV 476

Query: 2091 -----------NSYLPFNIHIPRNKRRKINARPSS 2162
                        S L  +++ PRNKR++   R  S
Sbjct: 477  DASVPQEQSTLESCLASDVNFPRNKRKRGERRAGS 511


>ref|XP_002274006.2| PREDICTED: uncharacterized protein LOC100255929 [Vitis vinifera]
          Length = 483

 Score =  112 bits (281), Expect = 8e-22
 Identities = 84/297 (28%), Positives = 143/297 (48%), Gaps = 8/297 (2%)
 Frame = +3

Query: 789  PADESVQTICDSTFRCAQNMVSVEVPEESKFNDQIMNTSESNFGGYDNMVEILN-SNEQE 965
            P  +S   +     +C  N  +     E+  + Q    S++  G  D  V++   +N  +
Sbjct: 12   PDSKSPTKVVPDKVKCTSNWEATISEMEAMLDGQ----SKAPGGTEDVEVDVTGCANIID 67

Query: 966  NRVDSPEE--ASECSSSFGDTDVDMDSFG--SDLKVDAAEAESGLWNEDIDSYAA--QKR 1127
             ++   E+  A+E SSSFGDT+   + F   S+ +V++   +    +   DS+    Q R
Sbjct: 68   TKLAEAEDPDATEYSSSFGDTESGNEKFSGLSEAEVESEYRDHNSLDSPFDSFGLMFQTR 127

Query: 1128 KGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQEG-QFDIDASS 1304
            K  L   WK +   + WRC W +L+  +  S+A+KY ++LA    RKQ E  QF  +   
Sbjct: 128  KKKLTSHWKKYIHPLMWRCKWAELKIREFKSQAAKYSKLLAAYDQRKQLESDQFTSEGFD 187

Query: 1305 TRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDI 1484
            ++S     QN+    +      ++E T D+ SY+L H ++  +   E+ +A+G    DD 
Sbjct: 188  SKSLPFSNQNHRMKAMKRRKRKRIEETTDVPSYMLNHNLFGYFVVTEQ-NANG----DDN 242

Query: 1485 YSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLN 1655
            + +  + D S ++F DDD               SLE + C+IE+ QSRVQK+ A L+
Sbjct: 243  FGI--SDDSSLLKFRDDD--------------DSLEQILCKIEMAQSRVQKLKAQLD 283


>gb|EXB88331.1| hypothetical protein L484_020399 [Morus notabilis]
          Length = 525

 Score =  110 bits (275), Expect = 4e-21
 Identities = 126/499 (25%), Positives = 209/499 (41%), Gaps = 52/499 (10%)
 Frame = +3

Query: 801  SVQTICDSTFRCAQNMVSVEVPEESKFNDQ--IMNTSESNFGGYDNMVEIL-------NS 953
            SV  +      C+ N        E  F+D+  I    + +  G  N   I        N+
Sbjct: 24   SVANLAMKGLSCSNNCEDTTTGMEDWFDDKSKIPKDVDVDITGSRNPSNIALAETEDPNA 83

Query: 954  NEQENRVDSPEEASECSSSFGDTDVDMDSFGSDLKVDAAEAESGLWNEDIDSYAAQKRKG 1133
             E  +  D   + +E  S F + +V+   FG +    + +A  GL+         Q RK 
Sbjct: 84   TEYSSSFDGTADDNENCSGFSEGEVESQFFGDNGFGSSFDAFGGLF---------QIRKK 134

Query: 1134 ALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQEGQFDIDASSTRS 1313
             L   W++F R + WRC W++LR  ++ S+A KY R +      KQ   QF  D   ++S
Sbjct: 135  KLTNHWRNFIRPVMWRCKWMELRIKEIDSQALKYSREMEAYDQAKQGFHQFTPDGFCSKS 194

Query: 1314 SSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSL 1493
                 Q N    +      ++E+T DL+SY+ +H ++S Y + ++ D D  L  DD +  
Sbjct: 195  WPFLSQYNGKKAMKRRKRNRVEDTPDLTSYMSQHNLFS-YLENKRPDPDSTLLADD-FGN 252

Query: 1494 RMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLNKHFSGI 1673
             +A++  +   +D    D  ++ E ++G SS+E++  +IE L SRV+K+   ++   S  
Sbjct: 253  AVATEKIA-DCNDKHGTDDCTSFEFRDGDSSMEHVLWKIETLHSRVKKLRGQIDMVISKN 311

Query: 1674 GRACESSKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPFPSHIQE----NITHCVL 1841
                 SS+  L      +  TS    P          +S     +  Q     NI   VL
Sbjct: 312  AAKFSSSEN-LSLLAPLDVQTSSAHSPAFSAG-NGDEISPGAMYATSQHLTDYNIGDLVL 369

Query: 1842 PSNEVPAYLEA--VHHSIEN--------DLPLDQREREDSS----DEVTIAKPYMEPVFV 1979
            P + V ++ EA  V   IE+        D+ L Q +  DSS    D+V I +   E V  
Sbjct: 370  PESVVSSFEEAFSVPDIIESTVGLLSATDVTLHQPQFGDSSEDIVDDVLIHEEAAEGVGH 429

Query: 1980 NSKEQFNENQLFS----VYGSETVNYKRDASTLESEENIPN------------------- 2090
            +  ++   +Q F     V G E +N     S++ S E+ P+                   
Sbjct: 430  SCLQRVVSDQPFEEPDIVCGQEGMN----PSSIPSSESQPDPATSAVVAATGGVVPQEQS 485

Query: 2091 --NSYLPFNIHIPRNKRRK 2141
               S L  ++HIP  KR++
Sbjct: 486  SLKSCLASDVHIPIIKRKR 504


>gb|EMJ03152.1| hypothetical protein PRUPE_ppa004590m1g [Prunus persica]
          Length = 501

 Score =  108 bits (269), Expect = 2e-20
 Identities = 129/488 (26%), Positives = 211/488 (43%), Gaps = 26/488 (5%)
 Frame = +3

Query: 861  VPEESKFNDQIMNTSESNFGGYDNMVEILNSNEQENRVD--SPEEAS----EC------S 1004
            +P  S + D I+   E+  G     V     N + NR    SP++      EC      S
Sbjct: 36   MPCVSNYKDNILEMEETLVG--QTTVSNKRENAELNRTGGTSPDDVQILEGECGDLTENS 93

Query: 1005 SSFGDTDVDMDSFGSDLKVDAAEAESGLWNEDIDSY-----AAQKRKGALRPEWKDFRRH 1169
            SSFGDT +     GS L  D A+++ G  NE    Y     A Q RK  L PEW++F R 
Sbjct: 94   SSFGDT-ISGTEDGSTLDGDEADSQLGE-NEHASVYDGYFGAFQTRKKKLTPEWRNFIRP 151

Query: 1170 IEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQ--EG--QFDIDASSTRSSSIKKQNN 1337
              WR  WL+L+  +LLS+  KYD  LA     K    EG      DA  T     +K+  
Sbjct: 152  EMWRLKWLELQIKELLSQTQKYDSELAKYDKEKLSAFEGFTSEGFDAMPTPKLMKRKKRK 211

Query: 1338 SHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLRMASDPSS 1517
                       ++E+T D++SY+  H ++S   + +K  A+G   ++D   L      +S
Sbjct: 212  -----------RVEDTTDIASYMSHHNLFSYVPESKKTAANGVCMQEDWGDL---GGKTS 257

Query: 1518 IRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMNAHLNKHFSGIGRACESSK 1697
               ++ ++ D  S++E ++G SSLE +  +IE++ S+V ++   ++K          ++ 
Sbjct: 258  YGHNEFETNDIWSSLEFRDGNSSLEDILWKIEVVHSQVWQLKTRIDKVVQENPGNFSANH 317

Query: 1698 GILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPFPSHIQENITHCVLPSNEVPAY--LE 1871
             ++    SN +  +    P   ++L  + +S      H++ NI +  LP + V ++  L 
Sbjct: 318  FLVPCDTSNGSAQNPASPPENGNTLLVETLS--TASQHVKFNIGNLFLPQSAVSSHEELT 375

Query: 1872 AVHHSIEN-DLPLDQREREDSSDEVTIAKPYM--EPVFVNSKEQFNENQLFSVYGSETVN 2042
             +   I N D P      E+  D   I    +  EP     K+Q  +    S+   ET N
Sbjct: 376  PLPGMIGNTDQPWLGNVLENVEDGCLIPNAAVKDEPYNFEVKDQLIQKPHISLEEQET-N 434

Query: 2043 YKRDASTLESEENIPNNSYLPFNIHIPRNKRRKINARPSSSQFPELVPLIESSNVSLQPE 2222
            +    S  E          LP N+ +P ++     A P+SS  P        S+ + +  
Sbjct: 435  FPVPVSETE----------LPTNLPVPVSE----TALPTSSSVPNAT---HESDSTTRTN 477

Query: 2223 FTVNARNK 2246
            F  N RN+
Sbjct: 478  FRWNTRNR 485


>gb|EXB37622.1| hypothetical protein L484_021828 [Morus notabilis]
          Length = 589

 Score =  107 bits (268), Expect = 2e-20
 Identities = 88/328 (26%), Positives = 153/328 (46%), Gaps = 8/328 (2%)
 Frame = +3

Query: 711  SDPSNKKCIMSAQNYIIQSSNDRVFLPADESVQTICDSTFRCAQNMVSVEVPEESKFNDQ 890
            S+ S K  +      + Q   D  FLP    V    D TF    ++V  +        ++
Sbjct: 121  SEASMKASVQKENTSLHQDPRDE-FLPC---VSNYKDETFDVEASLVVKQTTPPDMTENE 176

Query: 891  IMNTSESNFGGYDNMVEILNSNEQENRVDSPEEASECSSSFGDTDVDMDSFGSDLKVDAA 1070
             +N +++      ++V+    +E EN           SSSFGDT   +     D  ++  
Sbjct: 177  ELNITDTTHSPNIDVVQTDYIDEIEN-----------SSSFGDT---VSGAEEDSVLNGD 222

Query: 1071 EAESGLWNE--DIDSYAA-----QKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRAS 1229
            EA+S L  +   + +Y A     + RK  L   W+ F R + WRC W++L+  +L S+A 
Sbjct: 223  EAQSRLHGDHASVSTYDAYFGEFRMRKKKLTAHWRKFVRPLMWRCKWMELQIKELQSQAM 282

Query: 1230 KYDRILAGIQSRKQQE-GQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYL 1406
            KYD+ LA    RK+ E G+F ++    +S     Q+  + V+      ++E   D+ SY+
Sbjct: 283  KYDKELAEFSERKEFEFGRFALEGLDAKSLPFLSQSGRNKVMKRKKRKRVEEVVDVVSYM 342

Query: 1407 LRHPVYSLYGKQEKLDADGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASS 1586
             +H ++S Y   + +    ++H DD  +L   + P     +D+   +   + E ++G SS
Sbjct: 343  SQHNLFSYYENNKSVPDIASVH-DDFGNLGKTTYP-----NDELGTNDGFSFEFQDGDSS 396

Query: 1587 LEYLFCQIEILQSRVQKMNAHLNKHFSG 1670
             E LF +IE +QSRV+++ +   K  SG
Sbjct: 397  FEDLFQKIETIQSRVRELISRAEKVVSG 424


>ref|XP_006380661.1| hypothetical protein POPTR_0007s10270g [Populus trichocarpa]
            gi|550334551|gb|ERP58458.1| hypothetical protein
            POPTR_0007s10270g [Populus trichocarpa]
          Length = 475

 Score =  106 bits (265), Expect = 5e-20
 Identities = 95/367 (25%), Positives = 168/367 (45%), Gaps = 18/367 (4%)
 Frame = +3

Query: 840  QNMVSVEVPEESKFNDQIMNTSESNF---------GGYD---NMVEILNSNEQENRVDSP 983
            +N+   + PE++K      N  +  F         G  D   N+++  N+++ E  V   
Sbjct: 22   ENISVRQDPEDNKVLQCASNCQDKGFHDDQDKAAVGSADVEVNIIDCTNASDNEQIVARY 81

Query: 984  EEASECSSSFGDTDVDMDSFGSDLKVDAAEAESGLWNEDIDSY--AAQKRKGALRPEWKD 1157
            E+++E  SSFGD + +  S  SD +V++     G      D Y  A Q R+  L   W+ 
Sbjct: 82   EDSTESMSSFGDLESETKSVSSDTEVESQLFVGGGSISIFDGYGGAFQMRRKRLTDHWRR 141

Query: 1158 FRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQEGQ-FDIDASSTRSSSIKKQN 1334
            F R + WRC W++L+  +  S+A KYDR +A  + RK  + + F  +    +S       
Sbjct: 142  FIRPLMWRCKWVELQIKEFQSQALKYDREIAEHERRKLFDHETFMEEGFPVKSLPFSTCM 201

Query: 1335 NSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLRMASDPS 1514
                 +      + E TAD++SY+L+H ++S Y +  K   DGA  +D    L +  D S
Sbjct: 202  ERKKAMKRKKRKRFEETADVASYMLQHNLFSYY-ENRKSAIDGASIDDG--CLNLGGDFS 258

Query: 1515 SIRFDDDDSEDPQSAMETKEGASSL-EYLFCQIEILQSRVQKMNAHLNKHFSGIGRACES 1691
            +   + +D    Q  + + + + ++ E++  QIE+L+S+V K+ A ++K  +       S
Sbjct: 259  AKTINGNDEFGFQDGLASLQSSDNISEHILRQIEVLKSQVHKLKARVDK-VASENPVKFS 317

Query: 1692 SKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPFPSHIQENITHC--VLPSNEVPAY 1865
            S   L     ++  TS   +P       S     D  PS +   +++   V+P   V ++
Sbjct: 318  SVNALSLLAPSDALTSSDCNPA------SVAKRGDSTPSRLPHAVSNMGNVMPETAVSSH 371

Query: 1866 LEAVHHS 1886
             EA   S
Sbjct: 372  REATSRS 378


>ref|XP_002332233.1| predicted protein [Populus trichocarpa]
          Length = 475

 Score =  106 bits (265), Expect = 5e-20
 Identities = 95/367 (25%), Positives = 168/367 (45%), Gaps = 18/367 (4%)
 Frame = +3

Query: 840  QNMVSVEVPEESKFNDQIMNTSESNF---------GGYD---NMVEILNSNEQENRVDSP 983
            +N+   + PE++K      N  +  F         G  D   N+++  N+++ E  V   
Sbjct: 22   ENISVRQDPEDNKVLQCASNCQDKGFHDDQDKAAVGSADVEVNIIDCTNASDNEQIVARY 81

Query: 984  EEASECSSSFGDTDVDMDSFGSDLKVDAAEAESGLWNEDIDSY--AAQKRKGALRPEWKD 1157
            E+++E  SSFGD + +  S  SD +V++     G      D Y  A Q R+  L   W+ 
Sbjct: 82   EDSTESMSSFGDLESETKSVSSDTEVESQLFVGGGSISIFDGYGGAFQMRRKRLTDHWRR 141

Query: 1158 FRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQEGQ-FDIDASSTRSSSIKKQN 1334
            F R + WRC W++L+  +  S+A KYDR +A  + RK  + + F  +    +S       
Sbjct: 142  FIRPLMWRCKWVELQIKEFQSQALKYDREIAEHERRKLFDHETFMEEGFPVKSLPFSTCM 201

Query: 1335 NSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLRMASDPS 1514
                 +      + E TAD++SY+L+H ++S Y +  K   DGA  +D    L +  D S
Sbjct: 202  ERKKAMKRKKRKRFEETADVASYMLQHNLFSYY-ENRKSAIDGASIDDG--CLNLGGDFS 258

Query: 1515 SIRFDDDDSEDPQSAMETKEGASSL-EYLFCQIEILQSRVQKMNAHLNKHFSGIGRACES 1691
            +   + +D    Q  + + + + ++ E++  QIE+L+S+V K+ A ++K  +       S
Sbjct: 259  AKTINGNDEFGFQDGLASLQSSDNISEHILRQIEVLKSQVHKLKARVDK-VASENPVKFS 317

Query: 1692 SKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPFPSHIQENITHC--VLPSNEVPAY 1865
            S   L     ++  TS   +P       S     D  PS +   +++   V+P   V ++
Sbjct: 318  SVNALSLLAPSDALTSSDCNPA------SVAKRGDSTPSRLPHAVSNMGNVMPETAVSSH 371

Query: 1866 LEAVHHS 1886
             EA   S
Sbjct: 372  REATSRS 378


>ref|XP_006494052.1| PREDICTED: uncharacterized protein LOC102622342 isoform X1 [Citrus
            sinensis] gi|568882474|ref|XP_006494053.1| PREDICTED:
            uncharacterized protein LOC102622342 isoform X2 [Citrus
            sinensis]
          Length = 520

 Score =  106 bits (264), Expect = 7e-20
 Identities = 125/514 (24%), Positives = 225/514 (43%), Gaps = 54/514 (10%)
 Frame = +3

Query: 786  LPADESVQTIC---DSTFRCA---QNMVSVEVPEESKFNDQIMNTSE--SNFGGYDNMVE 941
            +P  E  +TI    D   +C        SV+   + K +  +++  +  S  G  D  V+
Sbjct: 4    IPGGEPRKTIKVEPDEEVKCVLAKSEETSVKCLNDHKESKDVLSNGQTMSPKGPEDVEVD 63

Query: 942  ILN-SNEQENRVDSPEE--ASECSSSFGDTDVDMDSFG--SDLKVDAAEAESGLWNEDID 1106
            I+  + + E R+   E+  A+E SSSFG+T+ D +     S+++V++   +      + D
Sbjct: 64   IVKFTTDGEIRLTEAEDPDATEYSSSFGNTEPDTERCSGLSEVEVESQYFDVNGLRTNCD 123

Query: 1107 SYAA--QKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGI-QSRKQQE 1277
            S+    Q RK  L   W+ F R + WRC W +LR +++ S+A KY R LA   Q++  + 
Sbjct: 124  SFGRLFQMRKKKLTNHWRSFIRPLMWRCKWAELRINEIQSQALKYARELAAYDQNKLSRV 183

Query: 1278 GQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDA 1457
             Q  ++   ++S +   Q      +      + E+T DL+SY+  H ++S Y + ++ + 
Sbjct: 184  NQSTVEEFGSKSLAFSSQWYRKKAMKRRKRKRAEDTTDLASYMSHHSLFS-YLENKRSNP 242

Query: 1458 DGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQK 1637
            DG    DD  +  +   P+    D   S +     E K+  SSLE +  +IE + SRV++
Sbjct: 243  DGNSTADDFGNTVIMDQPADCN-DKFGSNEDALFFELKDDDSSLEQVLLKIETVHSRVRQ 301

Query: 1638 MNAHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLE-DSLKSQRVSEDPFPS-H 1811
            + + L+   +       SS+  L      +  TS    PT    +  +  +     P+ H
Sbjct: 302  LKSQLDIVMAKNASRFSSSEN-LSLLAPCDGQTSSAPSPTFSAGNADTTSIGAIYNPTQH 360

Query: 1812 IQE-NITHCVLPSNEVPAYLEAVH--HSIEN--------DLPLDQREREDSSDEVTIAKP 1958
            I E +I   VLP + + +Y E +H    IE+        D+   Q +  DS +++     
Sbjct: 361  ISEYDIGDLVLPESAISSYAETIHVPDIIESTVGLLSAADVTFHQPQIGDSCEDI----- 415

Query: 1959 YMEPVFVNSKEQFNENQLFSVYGSETVNYKRDASTLESEEN-----IPNN---------- 2093
             ++ + + +     E   F    ++++    D    E  E+     IP +          
Sbjct: 416  -LDNILIENDGAEGEQHTFLGTSNQSIEKHNDPEKGEEGESTNPSPIPTSEPDPVAKSEV 474

Query: 2094 --------SYLPFNIHIPRNKRRK--INARPSSS 2165
                    S L  +I+ PRNKR++    A P SS
Sbjct: 475  DQDQSTLKSCLASDINFPRNKRKRGERKAGPGSS 508


>ref|XP_004299979.1| PREDICTED: uncharacterized protein LOC101298606 [Fragaria vesca
            subsp. vesca]
          Length = 506

 Score =  106 bits (264), Expect = 7e-20
 Identities = 110/435 (25%), Positives = 187/435 (42%), Gaps = 31/435 (7%)
 Frame = +3

Query: 930  NMVEILNSNEQENRVDSPE-EASECSSSFGDTDVDMDSFGSDLKVDAAEAESGLWNED-- 1100
            ++VE  N++      ++ + +A+E SSSF DT+ D ++F        AE ES  + E+  
Sbjct: 73   DIVECRNNHNHSLLAETEDPDATEYSSSFADTESDTENFSG---FSEAEVESQFFPENGL 129

Query: 1101 -----IDSYAAQKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSR 1265
                   S A Q RK  L   W+ F R + WRC W++LR  ++ S+   Y R LA    R
Sbjct: 130  TSAFNAFSNAFQMRKKKLTNHWRTFIRPVMWRCKWMELRIKEIESQELNYSRALAAYDQR 189

Query: 1266 KQQE-GQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQ 1442
            K     QF ++ S ++S     Q             ++E T D++SY+ RH  +S Y + 
Sbjct: 190  KHSAFDQFILEESGSKSLPFSSQCLQIKATKRRKRKRIEETTDIASYISRHNAFS-YFEN 248

Query: 1443 EKLDADGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQ 1622
            ++ D D     D+ +S  + ++ S+   D   + D  S +E ++G  SLE++  +IE L 
Sbjct: 249  KRSDPDSTSLADE-FSNAVITEHSADHNDKLSTADEWSLLEFRDGDKSLEHVLWKIETLH 307

Query: 1623 SRVQKMNAHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLE----DSLKSQRVS 1790
            S V K+   ++           SS+  L      +   S    P       D + +    
Sbjct: 308  SWVHKLKNQIDVVMPKTAAIFSSSEN-LSVLVPCDGQASSAHSPAFSAGNGDGIYATTQQ 366

Query: 1791 EDPFPSHIQENITHCVLPSNEVPAYLEAVHHS--IENDLPLDQREREDSSDEVTIAKPYM 1964
               F      NI   V+P + V ++ EA+  S  IE+ + L       S+ +V +     
Sbjct: 367  TSEF------NIGDLVMPESVVSSFGEAMPASDIIESSVGL------LSATDVIL----H 410

Query: 1965 EPVFVNSKEQFNENQLF--SVYGSETVNYKRDASTLESEENIPN--------------NS 2096
            +P F  S E   +N L   +  G + +  +    T E+ +  P                S
Sbjct: 411  QPQFSESSEDIVDNVLIYNAAEGEKLMVRRMGDETKENHQQQPEEAKQGPCATPSSTVES 470

Query: 2097 YLPFNIHIPRNKRRK 2141
             L  +++ PRNKR++
Sbjct: 471  CLGLDVNFPRNKRKR 485


>ref|XP_006442780.1| hypothetical protein CICLE_v10019728mg [Citrus clementina]
            gi|557545042|gb|ESR56020.1| hypothetical protein
            CICLE_v10019728mg [Citrus clementina]
          Length = 520

 Score =  104 bits (259), Expect = 3e-19
 Identities = 125/514 (24%), Positives = 223/514 (43%), Gaps = 54/514 (10%)
 Frame = +3

Query: 786  LPADESVQTIC---DSTFRCA---QNMVSVEVPEESKFNDQIMNTSE--SNFGGYDNMVE 941
            +P  E  +TI    D   +C        SV+   + K +  + +  +  S  G  D  V+
Sbjct: 4    IPGGEPRKTIKVEPDEEVKCVLAKSEETSVKCLNDHKESKDVFSNGQTMSPKGPEDVEVD 63

Query: 942  ILN-SNEQENRVDSPEE--ASECSSSFGDTDVDMDSFG--SDLKVDAAEAESGLWNEDID 1106
            I+  + + E R+   E+  A+E SSSFG+T+ D +     S+++V++   +      + D
Sbjct: 64   IVKFTTDGEIRLTEAEDPDATEYSSSFGNTEPDAERCSGLSEVEVESQYFDVNGLRTNCD 123

Query: 1107 SYAA--QKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGI-QSRKQQE 1277
            S+    Q RK  L   W+ F R + WRC W +LR +++ S+A KY R LA   Q++  + 
Sbjct: 124  SFGRLFQMRKKKLTNHWRSFIRPLMWRCKWAELRINEIQSQALKYARELAAYDQNKLSRV 183

Query: 1278 GQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDA 1457
             Q  ++   ++S +   Q      +      + E+T DL+SY+  H ++S Y + ++ + 
Sbjct: 184  NQSTVEEFGSKSLAFSSQWYRKKAMKRRKRKRAEDTTDLASYMSHHSLFS-YLENKRSNP 242

Query: 1458 DGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQK 1637
            DG    DD  +  +   P+    D   S +     E K+  SSLE +  +IE + SRV +
Sbjct: 243  DGNSTADDFGNTVIMDQPADCN-DKFGSNEDALFFELKDDDSSLEQVLLKIETVHSRVHQ 301

Query: 1638 MNAHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLE-DSLKSQRVSEDPFPS-H 1811
            + + L+   +       SS+  L      +  TS    PT    +  +  +     P+ H
Sbjct: 302  LKSQLDIVMAKNASRFSSSEN-LSLLAPCDGQTSSAPSPTFSAGNADTTSIGAIYNPTQH 360

Query: 1812 IQE-NITHCVLPSNEVPAYLEAVH--HSIEN--------DLPLDQREREDSSDEVTIAKP 1958
            I E +I   VLP + + +Y E +H    IE+        D+   Q +  DS +++     
Sbjct: 361  ISEYDIGDLVLPESAISSYAETIHVPDIIESTVGLLSAADVTFHQPQIGDSCEDI----- 415

Query: 1959 YMEPVFVNSKEQFNENQLFSVYGSETVNYKRDASTLESEEN-----IPNN---------- 2093
             ++ + + +     E   F    ++++    D    E  E+     IP +          
Sbjct: 416  -LDNILIQNDGAEGEQHTFLGTSNQSIEKHNDPEKGEEGESTNPSPIPASEPDPVAKSEV 474

Query: 2094 --------SYLPFNIHIPRNKRRK--INARPSSS 2165
                    S L  +I+ PRNKR++    A P SS
Sbjct: 475  AQDQSTLKSCLASDINFPRNKRKRGERKAGPGSS 508


>gb|EOY04666.1| Uncharacterized protein TCM_019866 [Theobroma cacao]
          Length = 529

 Score =  103 bits (258), Expect = 3e-19
 Identities = 109/450 (24%), Positives = 190/450 (42%), Gaps = 45/450 (10%)
 Frame = +3

Query: 927  DNMVEILN-SNEQENRVDSPEE--ASECSSSFGDTDVDMDSFG--SDLKVDAAEAESGLW 1091
            D  V+I+  +N+ + R    E+  A+ECSSSF DT  D +     +D +V++       +
Sbjct: 69   DVEVDIIGCTNDGDTRTVKTEDPDATECSSSFADTTSDTEKCSGLNDAEVESQFIGDAAF 128

Query: 1092 NEDIDSYAAQK--RKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSR 1265
                D+Y +    RK  L   W+ F R + WRC W +LR  ++ S+A KY   LA    R
Sbjct: 129  ASTYDAYNSMFHIRKKRLTSHWRSFIRPLMWRCKWAELRIKEIESQALKYGSELAAYDER 188

Query: 1266 K-QQEGQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQ 1442
            K  +  Q  ++   ++S            +      ++E T D++SY+  H ++S Y + 
Sbjct: 189  KLSRIDQSTVEGFGSKSLPFSSPCYRKKAIKRRRRKRIEETTDITSYMSCHNLFS-YLEN 247

Query: 1443 EKLDADGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQ 1622
            +K   DG    DD+ +       +        + D Q  +E ++G +SLE +  +IEI+ 
Sbjct: 248  KKTIPDGTYIADDLANTANMDQQTDCSGKFGINND-QLLLEFRDGNNSLEQVLWKIEIVH 306

Query: 1623 SRVQKMNAHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPF 1802
            SRVQK+ + L+   S       SS+  L    + +  TS    PT         +S  P 
Sbjct: 307  SRVQKIRSQLDLVMSKNASKFSSSEN-LSLLAACDAQTSSAPSPTFSAG-NGDTISVGPA 364

Query: 1803 PSHIQE----NITHCVLPSNEVPAYLEAVH--HSIEN--------DLPLDQREREDSSDE 1940
             +  Q+    ++   V+P++ +  Y E  H    IE+        D+   Q +  DS ++
Sbjct: 365  YTTTQQISEYDVGDLVMPASSISTYGETFHVPDIIESTVGLLSSADVTCHQPQIGDSCED 424

Query: 1941 VTIAKPYMEPVFVNSKEQFNENQLFSVYGSETVNYKRDASTLESEENI---------PN- 2090
            +      +E V + ++    + Q+     S+ +        +E  E+          PN 
Sbjct: 425  I------VENVLIQNEGNAGDRQVLMRTNSQPIEQHHQPEKVEEGESTNPSPIPTSEPNR 478

Query: 2091 -------------NSYLPFNIHIPRNKRRK 2141
                          S L  +I  PRNKR++
Sbjct: 479  ATKSIVSQDQSTLRSCLASDICFPRNKRKR 508


>gb|EOX91598.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 487

 Score =  102 bits (254), Expect = 1e-18
 Identities = 92/377 (24%), Positives = 165/377 (43%), Gaps = 5/377 (1%)
 Frame = +3

Query: 756  IIQSSNDRVFLPADESVQTICDSTFRCAQNMVSVEVPEESKFNDQIMNTSESNFGGYDNM 935
            ++ ++N+   LP D       D    C  N       EE+ + +      E +     N+
Sbjct: 17   VVLNNNENGSLPHDSK-----DKLMHCVSNCEDHIFAEETLYGEGQAKIPEGDEYMEINI 71

Query: 936  VEILNSNEQENRVDSPEEASECSSSFGDT--DVDMDSFGSDLKVDAAEAESGLWNEDIDS 1109
             E  NS      V   ++ +E SSSFG T   V+ DS  SD +V++A   +       D 
Sbjct: 72   TECTNSGGDRLAVAECQDDTENSSSFGGTASGVENDSAISDAEVESALCGASPLGSVFDG 131

Query: 1110 -YAAQKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQE-GQ 1283
             +  +KRK  L   W+ F R + WRC WL+L+  +  S+A  YDR LA    RK+ E  +
Sbjct: 132  LFPMRKRK--LTDHWRRFIRPLMWRCKWLELQLKEFKSQALTYDRELAEYDQRKKFEYEK 189

Query: 1284 FDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADG 1463
            F  +    +S    +Q     V+      ++E TADL+SY+  H ++S Y  ++ + A  
Sbjct: 190  FTFEGLDVKSQPFPRQIQRKKVMKRRKRKRVEETADLASYMSFHNLFSYYESKKSVVATA 249

Query: 1464 ALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMN 1643
             L  D+        + +     D    D  + +E ++G +    +  QI+++QS+V+++ 
Sbjct: 250  TLDNDNGNLENKTGNGNG----DVWLNDGLACLEFRDGDTWSGQILRQIDLVQSQVRRLK 305

Query: 1644 AHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPFPSHIQE- 1820
              ++K  +   R   S   +     S+   +S+ +    E   +    S+     H+ E 
Sbjct: 306  TRVDKVVNESPRKFSSINMLSSLVPSDALNSSRNRPSPRESGERIPHRSQYASSQHLSEC 365

Query: 1821 NITHCVLPSNEVPAYLE 1871
            N+    +P + V ++ E
Sbjct: 366  NMGDLFMPGSAVSSHGE 382


>gb|EOX91597.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 497

 Score =  102 bits (254), Expect = 1e-18
 Identities = 92/377 (24%), Positives = 165/377 (43%), Gaps = 5/377 (1%)
 Frame = +3

Query: 756  IIQSSNDRVFLPADESVQTICDSTFRCAQNMVSVEVPEESKFNDQIMNTSESNFGGYDNM 935
            ++ ++N+   LP D       D    C  N       EE+ + +      E +     N+
Sbjct: 17   VVLNNNENGSLPHDSK-----DKLMHCVSNCEDHIFAEETLYGEGQAKIPEGDEYMEINI 71

Query: 936  VEILNSNEQENRVDSPEEASECSSSFGDT--DVDMDSFGSDLKVDAAEAESGLWNEDIDS 1109
             E  NS      V   ++ +E SSSFG T   V+ DS  SD +V++A   +       D 
Sbjct: 72   TECTNSGGDRLAVAECQDDTENSSSFGGTASGVENDSAISDAEVESALCGASPLGSVFDG 131

Query: 1110 -YAAQKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRKQQE-GQ 1283
             +  +KRK  L   W+ F R + WRC WL+L+  +  S+A  YDR LA    RK+ E  +
Sbjct: 132  LFPMRKRK--LTDHWRRFIRPLMWRCKWLELQLKEFKSQALTYDRELAEYDQRKKFEYEK 189

Query: 1284 FDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQEKLDADG 1463
            F  +    +S    +Q     V+      ++E TADL+SY+  H ++S Y  ++ + A  
Sbjct: 190  FTFEGLDVKSQPFPRQIQRKKVMKRRKRKRVEETADLASYMSFHNLFSYYESKKSVVATA 249

Query: 1464 ALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQSRVQKMN 1643
             L  D+        + +     D    D  + +E ++G +    +  QI+++QS+V+++ 
Sbjct: 250  TLDNDNGNLENKTGNGNG----DVWLNDGLACLEFRDGDTWSGQILRQIDLVQSQVRRLK 305

Query: 1644 AHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSEDPFPSHIQE- 1820
              ++K  +   R   S   +     S+   +S+ +    E   +    S+     H+ E 
Sbjct: 306  TRVDKVVNESPRKFSSINMLSSLVPSDALNSSRNRPSPRESGERIPHRSQYASSQHLSEC 365

Query: 1821 NITHCVLPSNEVPAYLE 1871
            N+    +P + V ++ E
Sbjct: 366  NMGDLFMPGSAVSSHGE 382


>ref|XP_002876518.1| hypothetical protein ARALYDRAFT_486432 [Arabidopsis lyrata subsp.
            lyrata] gi|297322356|gb|EFH52777.1| hypothetical protein
            ARALYDRAFT_486432 [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score =  101 bits (252), Expect = 2e-18
 Identities = 126/475 (26%), Positives = 203/475 (42%), Gaps = 52/475 (10%)
 Frame = +3

Query: 897  NTSESNF----GGYDNMVEILNSNEQE-NRVDSPEEASECSSSFGDTDVDMDSFGSDLKV 1061
            NTSE       GG +  V+I+ S+E   +  D    A+E SSSF DT  +      D   
Sbjct: 43   NTSEETVTSVSGGEELDVDIVESDENNASTTDEDPNATEYSSSFSDTASENADMLLDGLT 102

Query: 1062 DAAEAESGLWNED-----IDSYAA--QKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLS 1220
              AE ES  W+E       DS+++    RK  L   W+ F R + WR  W++LR  +L S
Sbjct: 103  GEAEVESHYWDETDLGPAYDSFSSIFHFRKKRLTNHWRRFIRPLMWRSKWVELRIRELES 162

Query: 1221 RASKYDRILAGIQSRKQQEGQFDIDASSTRS--SSIKKQNNSHPVLXXXXXX------KL 1376
            RA +Y + L   +S  Q++ + +ID S   S    IK    S+P              K+
Sbjct: 163  RALEYPKEL---ESYDQEKLEANIDPSVLESCGEGIKSLPFSNPCYKKRAAKRRRKRKKV 219

Query: 1377 ENTADLSSYLLRHPVYSLYGKQEKLDADGALHEDDIYSLRMASDPSSIRFDDDDSEDPQS 1556
            E+T D++SY+  H ++S Y + ++L +DG    DD      A DP S   +  D +D  S
Sbjct: 220  ESTDDITSYMACHNLFS-YIETKRLSSDGMGLADDFGD---AKDPQSDSKEPVDLDDDDS 275

Query: 1557 AMETKEGASSLEYLFCQIEILQSRVQKMNAHLNKHFSGIGRACESSKGILHRFNSNETGT 1736
                +EG + LE +  +IE++ S+V ++   ++   S       SS+      N +    
Sbjct: 276  LFHHREGDNVLEEVLWKIELVHSQVHRLKTQVDVVMSKNAARFSSSE------NLSLLAA 329

Query: 1737 SKVQDPTLEDSLKSQRVSEDPF---PSHIQENITHCVLPSNEV-PAYLEAVH--HSIEN- 1895
            S    PT+        +S         H+ + +   V  S  V  +Y +A H    IE+ 
Sbjct: 330  SSAPSPTVSAGGNGDVISIGAIYNASQHMADVLGDLVFSSQGVISSYGDAFHIPDIIEST 389

Query: 1896 -------DLPLDQREREDSSDEVT------------IAKPYMEPVFVNSKEQFNENQLFS 2018
                   D+ L+  +  DS +++             +    ME    +  E+  E +  S
Sbjct: 390  VGLFADADVTLNHPQIGDSCEDILDNILIRNGVAEEMNSDLMETSCHDEAEKAEEGEGTS 449

Query: 2019 V----YGSETVNYKRDASTL--ESEENIPNNSYLPFNIHIPRNKRRKINARPSSS 2165
            V       ET  Y ++  +L  +  E+    S L   + +PRNKR +   R +SS
Sbjct: 450  VPPLQQTEETEQYSQEEKSLVLQGREDSVLRSCLASEMLVPRNKRTRGGERKASS 504


>ref|XP_004306989.1| PREDICTED: uncharacterized protein LOC101291176 [Fragaria vesca
            subsp. vesca]
          Length = 476

 Score =  100 bits (249), Expect = 4e-18
 Identities = 116/432 (26%), Positives = 195/432 (45%), Gaps = 21/432 (4%)
 Frame = +3

Query: 930  NMVEILNSNEQENRVDSP-EEASECSSSFGDTDVDMDSFGSDLKVDAAEAES------GL 1088
            N+ E    NE    V++  ++ +E SSSFGDT V     GS L  D  E++        L
Sbjct: 68   NITECSGPNENVQIVENECQDLTESSSSFGDT-VSGTENGSMLDGDEVESQYCENPSVSL 126

Query: 1089 WNEDIDSYAAQKRKGALRPEWKDFRRHIEWRCHWLQLRYSDLLSRASKYDRILAGIQSRK 1268
            ++   D++  +K+K  L   W+ F R + WRC WL+L+  +L S+  KYD  LA     K
Sbjct: 127  YDGYSDAFHTRKKK--LTAHWRKFIRPLMWRCKWLELQIKELQSQTLKYDAELAEYDKLK 184

Query: 1269 QQE-GQFDIDASSTRSSSIKKQNNSHPVLXXXXXXKLENTADLSSYLLRHPVYSLYGKQE 1445
            Q E G +  +    +S           ++      ++E+T D+ S+   H ++S Y + +
Sbjct: 185  QYEFGGYTAEGFDGKSIPFSSHIQRSKLMKRKKRKRVEDTTDIVSFTSNHTLFSYY-ENK 243

Query: 1446 KLDADGALHEDDIYSLRMASDPSSIRFDDDDSEDPQSAMETKEGASSLEYLFCQIEILQS 1625
            K  A+G   E+D  +L   S      F+          +E ++G S+LE +  +IE++ S
Sbjct: 244  KSGANGGSMEEDCGNLVGRSTHGINEFE----------IEFRDGDSTLEDILRKIEVVHS 293

Query: 1626 RVQKMNAHLNKHFSGIGRACESSKGILHRFNSNETGTSKVQDPTLEDSLKSQRVSED-PF 1802
            +V ++   ++K         + +  +        +  S  Q P LE    ++ ++E  P+
Sbjct: 294  QVCRLKTRIDK-------VVKENPALF------GSALSPAQSPALESG--NELLAETLPY 338

Query: 1803 PS-HIQE-NITHCVLPSNEVPAY--LEAVHHSIEN-DLPLDQREREDSSDEVTI--AKPY 1961
             S HI E N    +LPS+ V +   L  +   + N D PL   E+E++ D   +  A   
Sbjct: 339  ASQHISECNTGPLLLPSSAVSSLEGLNLLPAKMGNTDQPLIGTEQENAVDRYLVPNASAK 398

Query: 1962 MEP-VFVNSKEQFNENQLFSVYGSETVNYKRDAS-TLESEENIPNNSYLP---FNIHIPR 2126
             EP  F N+ +Q  E  L S    +T N+   AS T+  + + PN +  P        P 
Sbjct: 399  KEPHDFENNNDQVIEKPLLSSAQQKT-NFSVPASKTVPKKSSKPNATEQPGSTTRTEFPW 457

Query: 2127 NKRRKINARPSS 2162
            N R +   +P S
Sbjct: 458  NTRNRGKRKPGS 469

