BLASTX nr result

ID: Rehmannia22_contig00024891 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00024891
         (2376 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr...   333   2e-88
ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr...   327   1e-86
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...   315   6e-83
ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyp...   310   2e-81
ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267...   305   6e-80
gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus pe...   305   8e-80
ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyp...   303   2e-79
gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]     302   4e-79
ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309...   302   4e-79
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...   298   6e-78
gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao]    297   1e-77
gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao]    294   1e-76
gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus...   290   2e-75
ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp...   287   1e-74
ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyp...   283   3e-73
gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma caca...   280   2e-72
ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu...   278   6e-72
emb|CBI40233.3| unnamed protein product [Vitis vinifera]              278   8e-72
ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like i...   272   4e-70
ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251...   266   3e-68

>ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|568878417|ref|XP_006492190.1| PREDICTED:
            uncharacterized protein LOC102610545 [Citrus sinensis]
            gi|557538863|gb|ESR49907.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 732

 Score =  333 bits (854), Expect = 2e-88
 Identities = 273/739 (36%), Positives = 362/739 (48%), Gaps = 108/739 (14%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            E ++QR  + M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQ
Sbjct: 7    EMQDQRTNSGMEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            RKKAEKATA+VLAILEN+GIS++S+ FDS SDQ E+P + +  N            K R+
Sbjct: 67   RKKAEKATADVLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRR 125

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RR 1700
            N                 R LSW   + ++ +LEK  Y DS +RRR+SF S   S+   R
Sbjct: 126  NASVEHSGSGNDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNR 183

Query: 1699 VGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTL 1523
            VGKSCR+IR R+++S  +     TE          G     V  + E   G    E   L
Sbjct: 184  VGKSCRQIRRRESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYL 241

Query: 1522 RSNS-----ETQKM---DGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFR 1367
               S     E +K+    G  F+    D DME AL+ Q QLIG+Y           E+FR
Sbjct: 242  GEGSDSGCFENEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFR 301

Query: 1366 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE 1202
            ENNS T DSCDPGN SDVTEER E K  ++ R AGT NS  QE K E     Q+    S 
Sbjct: 302  ENNSSTPDSCDPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSN 360

Query: 1201 ---KPETSKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRS-QQFPPMV 1034
                P++  +   +    + E  A +F+F MS EK NQE LG  H    + S  +  P  
Sbjct: 361  GFLPPQSGDQKCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHG 418

Query: 1033 QTTTQSSTKISPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLN 869
                QSS  +S     +T  S+  ++  S   + A+VP         VLEALK+A+ SL 
Sbjct: 419  SPENQSSQTVS----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLR 474

Query: 868  QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------N 716
            QK+++ P T  R+ G V +PS + +   D  +IPV   GLFR+PTDY  E         +
Sbjct: 475  QKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSD 534

Query: 715  ARPGFANFPPENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTP 587
            +RP  AN+ P + +G    +        D+RS F++       DLFLT P       ++ 
Sbjct: 535  SRPSLANYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSA 594

Query: 586  ERPFSQPRLSEGPSSNRMNR--LDSYTNPVLPSVK-------DSYP-------------- 476
            E      + S+  S   M R   DS  +  LPS +        SYP              
Sbjct: 595  ENRLLTRQYSDTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLST 654

Query: 475  FL---------------------------------PDVTLRVPLNEGGASRNFPSSERGL 395
            FL                                 PD+  ++P +E G S   PS   G+
Sbjct: 655  FLPGRSVEMSVEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGM 713

Query: 394  PPVMRLSSYDEHVRPDMYR 338
            PP   L  +++H RP MYR
Sbjct: 714  PPANHLPFHNDHTRPYMYR 732


>ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|557538862|gb|ESR49906.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 716

 Score =  327 bits (839), Expect = 1e-86
 Identities = 270/729 (37%), Positives = 356/729 (48%), Gaps = 108/729 (14%)
 Frame = -3

Query: 2200 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2021
            M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQRKKAEKATA+
Sbjct: 1    MEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQRKKAEKATAD 60

Query: 2020 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1841
            VLAILEN+GIS++S+ FDS SDQ E+P + +  N            K R+N         
Sbjct: 61   VLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRRNASVEHSGSG 119

Query: 1840 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RRVGKSCRRIRH 1670
                    R LSW   + ++ +LEK  Y DS +RRR+SF S   S+   RVGKSCR+IR 
Sbjct: 120  NDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNRVGKSCRQIRR 177

Query: 1669 RDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTLRSNS-----E 1508
            R+++S  +     TE          G     V  + E   G    E   L   S     E
Sbjct: 178  RESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYLGEGSDSGCFE 235

Query: 1507 TQKM---DGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1337
             +K+    G  F+    D DME AL+ Q QLIG+Y           E+FRENNS T DSC
Sbjct: 236  NEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFRENNSSTPDSC 295

Query: 1336 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE---KPETSKR 1181
            DPGN SDVTEER E K  ++ R AGT NS  QE K E     Q+    S     P++  +
Sbjct: 296  DPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSNGFLPPQSGDQ 354

Query: 1180 SLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRS-QQFPPMVQTTTQSSTKI 1004
               +    + E  A +F+F MS EK NQE LG  H    + S  +  P      QSS  +
Sbjct: 355  KCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSSQTV 412

Query: 1003 SPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLNQKLNNSPPTA 839
            S     +T  S+  ++  S   + A+VP         VLEALK+A+ SL QK+++ P T 
Sbjct: 413  S----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLRQKMSSLPSTE 468

Query: 838  GRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------NARPGFANFPP 686
             R+ G V +PS + +   D  +IPV   GLFR+PTDY  E         ++RP  AN+ P
Sbjct: 469  SRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSDSRPSLANYNP 528

Query: 685  ENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTPERPFSQPRLS 557
             + +G    +        D+RS F++       DLFLT P       ++ E      + S
Sbjct: 529  TSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSAENRLLTRQYS 588

Query: 556  EGPSSNRMNR--LDSYTNPVLPSVK-------DSYP--------------FL-------- 470
            +  S   M R   DS  +  LPS +        SYP              FL        
Sbjct: 589  DTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLSTFLPGRSVEMS 648

Query: 469  -------------------------PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYD 365
                                     PD+  ++P +E G S   PS   G+PP   L  ++
Sbjct: 649  VEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGMPPANHLPFHN 707

Query: 364  EHVRPDMYR 338
            +H RP MYR
Sbjct: 708  DHTRPYMYR 716


>ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis]
            gi|223526443|gb|EEF28720.1| hypothetical protein
            RCOM_0152200 [Ricinus communis]
          Length = 665

 Score =  315 bits (807), Expect = 6e-83
 Identities = 250/649 (38%), Positives = 337/649 (51%), Gaps = 50/649 (7%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            E ++QR  + M++S AMTIEFLRARLLSERSVS+TARQRADELA +VAELEEQL+ VSLQ
Sbjct: 7    EKQDQRTNSGMEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQLRIVSLQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            R KAEKATA++LAILE +GISD+SE FDSCSD+ ++P + K  N            K+R 
Sbjct: 67   RMKAEKATADILAILEGNGISDISETFDSCSDR-DTPCESKVGN-RSSKEENSINSKVRN 124

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS-NSLSARRVG 1694
            N                GRSLSW+  K+S  +LEK     S+RRR+SF S  S   +R G
Sbjct: 125  NDSEELSGSDFDFSSVPGRSLSWKGRKNSPRSLEKSK-DSSMRRRSSFSSVGSSPKQRPG 183

Query: 1693 KSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE------S 1532
            KSCR+IR +++R           K  C  D    +     +  ++E  + +++       
Sbjct: 184  KSCRQIRRKESRF---EYKASPVKRDCPEDEVAATSANFPSCSDFEPKRGEVKPLLEDSH 240

Query: 1531 STLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSG 1352
            S    N      +G  ++V+  D DME AL+HQ QLIGQY           EKFRENNS 
Sbjct: 241  SDCLGNERNASDNGLDYNVYRGDRDMEKALEHQAQLIGQYEAMEKVQREWEEKFRENNSS 300

Query: 1351 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQ 1172
            T DSCD GN SD+TEERYE++ P  ++   T+N+   E     V+   + +P     S  
Sbjct: 301  TPDSCDHGNRSDITEERYEIREP--AKGPATTNAIQTEGLLSVVEGVSNTQPHGFLPSSH 358

Query: 1171 NENIISCESSAS-----EFS-----FPMSREKNNQEFLG--------IQHNASQYRSQQF 1046
             + +   E  +S     EFS     FPM++ K NQ+  G        I H+ S     Q+
Sbjct: 359  VDAVCLEERKSSIAPVPEFSTQDSAFPMAKAKQNQKNPGNNDHSPLLIAHHDSASFGSQY 418

Query: 1045 PPMVQTTTQ--SSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRAKSSL 872
                Q+     S+T  S  + K+T+ S   + +L    A      LG VLEAL+ A+ SL
Sbjct: 419  SSGSQSVLSFPSNTGSSFNKGKATSGSENERCALVPHKA---SGGLGGVLEALEEARQSL 475

Query: 871  NQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFAN 695
             Q++N  P  A     SV + S + T   D  QIPV   GLFRLPTD+  E N R    +
Sbjct: 476  QQRINRLPSVATTVRKSV-ESSVSTTISRDEVQIPVGCVGLFRLPTDFSVEGNTRANLLS 534

Query: 694  FPPENSLG--------------RFLSEPF-DSRSAFSS-DLFLTDPY-----RPFTPERP 578
               + SLG              +F++ P+   RS+ S+ D FL+  Y     R  TP +P
Sbjct: 535  SSAQLSLGNHYSDRGVPAAASNQFVASPYLQGRSSSSTEDQFLSSQYVGGGSRIPTP-KP 593

Query: 577  FSQPRLSEG-PSSNRMNRLDSYTNPVLPSVKDSYPFLPDVTLRVPLNEG 434
            +  P L  G PSS+R      YT P  P +  SY   PD+  R+P  EG
Sbjct: 594  YFDPYLDTGLPSSSR------YTYPNYP-INTSY---PDLMPRIPSREG 632


>ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Solanum tuberosum]
          Length = 643

 Score =  310 bits (793), Expect = 2e-81
 Identities = 254/678 (37%), Positives = 330/678 (48%), Gaps = 47/678 (6%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            +D++QRK   M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ
Sbjct: 7    QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1877
            RKKAEKATA VL+ILEN GISD SEEFDS SDQE    + K  +                
Sbjct: 66   RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125

Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1703
             +               STGRSLSW+S K S  + E+  Y DS  RR  SF S  S S +
Sbjct: 126  ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185

Query: 1702 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNG-SDGEHVALREYENGKNQLESST 1526
            R GKSCRRIR   T++  D          C  +     ++  H +L +   G N ++   
Sbjct: 186  RAGKSCRRIRRNTTKTATDE---------CPPEHLPSFANNGHQSLMD-SAGNNDVKD-- 233

Query: 1525 LRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1346
             + +  T +M        E D+ ME ALQH+ QLIGQY           EK+RENN+  Q
Sbjct: 234  -QRHLPTSEMSENQRKSDESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQ 292

Query: 1345 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE---- 1193
            DSCDPGN+SDVTEER +MK+ E   +A   N  N   K ++VD           P     
Sbjct: 293  DSCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHI 352

Query: 1192 -TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQ 1019
             TS R  QN   II+ ES ASEF+      K+N            Y   Q P        
Sbjct: 353  GTSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP-------- 400

Query: 1018 SSTKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSP 848
             S   SP      ++S+    SL    A+V +   DN+GS+L AL++AK S++Q++N SP
Sbjct: 401  -SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSP 459

Query: 847  PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLG 671
               G +S     P    T + D   I    PGLFRLPTD+Q E      +  FP   S  
Sbjct: 460  IAEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSA 515

Query: 670  RFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPV-LPS 494
                EP         D F T PY   +P    +    + G   + +N    + +P    S
Sbjct: 516  NHFHEP-------GYDQFSTTPYME-SPSNAITGLPYTTG--FDYLNPPSGFGHPFSSKS 565

Query: 493  VKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGLP 392
               +YPF P+ T  V        PL E   +                  R+ P +E G P
Sbjct: 566  TYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGKP 625

Query: 391  PVMRLSSYDEHVRPDMYR 338
            P   +S YD H+RP+MYR
Sbjct: 626  PSFPVSHYDAHLRPNMYR 643


>ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267607 [Solanum
            lycopersicum]
          Length = 617

 Score =  305 bits (781), Expect = 6e-80
 Identities = 250/682 (36%), Positives = 328/682 (48%), Gaps = 51/682 (7%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            +D++QRKT  M E+++MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ
Sbjct: 7    KDQDQRKTVGM-ENSSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1877
            RKKAEKATA VL+ILEN GI+D SEEFDS SDQE    + K  +                
Sbjct: 66   RKKAEKATAAVLSILENEGITDASEEFDSGSDQEAIFSNSKGADSTDNRNEYKPDPSNVK 125

Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSNSLSA-R 1703
             +               STGRSLSW+S K S  + E+  Y DS  RR  SF S   S+ +
Sbjct: 126  ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGTSSPK 185

Query: 1702 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1523
            R GKSCRRIR  +T +  +  ND                              QL   T 
Sbjct: 186  RAGKSCRRIRRSNTNAGNNDVND------------------------------QLHLPTS 215

Query: 1522 RSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1343
             ++   +K D       E D+ ME ALQH+  LIG+Y           EK+RENN   QD
Sbjct: 216  ETSENQRKAD-------ESDEGMERALQHKALLIGKYEAEEKAQREWEEKYRENNYA-QD 267

Query: 1342 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFS--------EKPETS 1187
            SCDPGN+SDVTEER +MK+ E   +A   N  N   K ++VD   +          P  S
Sbjct: 268  SCDPGNYSDVTEERDDMKAFEQPYSAEMINLQNHANKFQEVDIPSTNGVTDNVPSNPHIS 327

Query: 1186 KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016
                +++N   II+ ES ASEF+ P    K+N            Y   Q P         
Sbjct: 328  TSCRKDQNCSRIINSESPASEFALP----KSNGSCPENDGPTPAYCHHQLP--------- 374

Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 845
            S+  SP +    ++S+    SL    A+V     DN+GS+L AL++AK S++Q++N S P
Sbjct: 375  SSNGSPIQPLENSISSSGGSSLQAGQALVSGDASDNIGSILGALEQAKFSISQQINVS-P 433

Query: 844  TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 668
              GR+S    + S       D   IP   PGLFRLPTD+Q E      +  FP   S   
Sbjct: 434  VEGRSS---IEHSIPTAKIEDRLDIPPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 490

Query: 667  FLSEPFDSRSAFSSDLFLTDPYR-----PFTPERPFSQPRLSEG-PSSNRMNRLDSYTNP 506
               EP    + FS+  ++  P       P+T    +  P  S G P S++          
Sbjct: 491  HFHEP--GYNQFSATPYMESPSNAITGLPYTTGFDYLNPPSSFGHPFSSK---------- 538

Query: 505  VLPSVKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSE 404
               S   +YPF P+ T  V        PL E   +                  R+ P +E
Sbjct: 539  ---STYPTYPFRPNTTTTVSQSQASWSPLYESSLTKSSPVVVPNLSSGEDVFLRSLPRNE 595

Query: 403  RGLPPVMRLSSYDEHVRPDMYR 338
             G PP   +S YD H+RP+MYR
Sbjct: 596  TGKPPSFPVSHYDAHMRPNMYR 617


>gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica]
          Length = 690

 Score =  305 bits (780), Expect = 8e-80
 Identities = 259/700 (37%), Positives = 336/700 (48%), Gaps = 69/700 (9%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            + ++QR    M++S AMTIEFLRARLL+ERSVS++ARQR DEL + V ELEEQLK VSLQ
Sbjct: 7    DTQDQRSNLGMEDSTAMTIEFLRARLLAERSVSRSARQRVDELERMVEELEEQLKIVSLQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVS-EEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLR 1874
            RK AEKAT +VLAILE+ GISD+S EEFDS SDQ E+    K  N            K+R
Sbjct: 67   RKMAEKATEDVLAILESQGISDISEEEFDSSSDQ-ETHQGSKVGNSLANEEESFVISKVR 125

Query: 1873 KNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-- 1700
            +                 GRSLSW+   DS  + EK   + SVRRR+SF S   S+ R  
Sbjct: 126  RKEQEEHSGSDADSSLIPGRSLSWKGRIDSPRSREKCKDL-SVRRRSSFSSIGFSSPRHH 184

Query: 1699 VGKSCRRIRHRDTRSME-DSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESS 1529
            +GKSCR+I+H++TRS + DS  +G    A S    N S+G    LRE      +  L + 
Sbjct: 185  LGKSCRQIKHKETRSDKFDSHENGV--GASSEGLPNFSNGGPEKLREGSEFPEEKVLSND 242

Query: 1528 TLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGT 1349
            +L    E Q+     F+ H RD DME AL+HQ +LI +            EKFRENN+ T
Sbjct: 243  SLSRTKENQRDSDLDFNGHGRDKDMEKALEHQAKLICENEEMEKAQREWEEKFRENNTST 302

Query: 1348 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC------------FS 1205
             DSCDPGNHSD+TEER E+K+ +   +AG   +  QETK E+ D C            F 
Sbjct: 303  PDSCDPGNHSDITEERDEIKA-QTPCSAGVVVAQAQETKSEEGDVCLPKETFKIQQNGFL 361

Query: 1204 EKPETSKRSLQNE--NIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQ 1031
                     LQ++        S   EF+FP    K N E L        + S   P +  
Sbjct: 362  PASHVDMGGLQDQLNKSTVAPSQVEEFAFPTENGKQNHESLENFARHPSHGSHPNPLVHG 421

Query: 1030 TTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEALKRAKSSLNQKL 860
            +    S+  S     S         S     A+VP   QD LG VL+ALK+AK SL Q +
Sbjct: 422  SAHNRSSDASSSVAGSGFHKGNASGSRSDLYALVPHDSQDRLGGVLDALKQAKLSLQQNM 481

Query: 859  NNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA----------- 713
               P   G +     +PS       D  +IPV   GLFRLPTD+  E A           
Sbjct: 482  TRLPLVDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFAVEEAATQSSFLGSSW 541

Query: 712  -----------------RPGFANFPPENSLGRFLSEPF-DSRSAFS---SDLFLTDPY-- 602
                             RP F+     N+  R++  P+ ++R  FS   +D F+ + Y  
Sbjct: 542  SGRYCPETLVTSSFVETRPTFS----MNAADRYVPSPYIETRQTFSTNATDRFIPNAYVE 597

Query: 601  -RPFTP---ERPF----SQPRLSEGPSSNRM----NRLDSYTNPVLPSVKDSYPFLPDVT 458
             RP  P     PF    S    S  P+ NR          Y  P  P    +YP +PD T
Sbjct: 598  SRPNFPANAAEPFVTSPSVDTRSNFPADNRFLSGPYSESGYAQPPYP----NYPSVPDRT 653

Query: 457  LRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
              +  +E   +R  P    G  P  R S YD+  RP+MYR
Sbjct: 654  PWITSDE-ALTRALPRKPVG-APTDRFSFYDQ-FRPNMYR 690


>ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2
            [Solanum tuberosum]
          Length = 618

 Score =  303 bits (776), Expect = 2e-79
 Identities = 252/677 (37%), Positives = 322/677 (47%), Gaps = 46/677 (6%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            +D++QRK   M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ
Sbjct: 7    QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1877
            RKKAEKATA VL+ILEN GISD SEEFDS SDQE    + K  +                
Sbjct: 66   RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125

Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1703
             +               STGRSLSW+S K S  + E+  Y DS  RR  SF S  S S +
Sbjct: 126  ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185

Query: 1702 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1523
            R GKSCRRIR   T +  +   D                            +  L +S +
Sbjct: 186  RAGKSCRRIRRNTTNAGNNDVKD----------------------------QRHLPTSEM 217

Query: 1522 RSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1343
              N   +K D       E D+ ME ALQH+ QLIGQY           EK+RENN+  QD
Sbjct: 218  SENQ--RKSD-------ESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQD 268

Query: 1342 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE----- 1193
            SCDPGN+SDVTEER +MK+ E   +A   N  N   K ++VD           P      
Sbjct: 269  SCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHIG 328

Query: 1192 TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016
            TS R  QN   II+ ES ASEF+      K+N            Y   Q P         
Sbjct: 329  TSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP--------- 375

Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSPP 845
            S   SP      ++S+    SL    A+V +   DN+GS+L AL++AK S++Q++N SP 
Sbjct: 376  SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSPI 435

Query: 844  TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 668
              G +S     P    T + D   I    PGLFRLPTD+Q E      +  FP   S   
Sbjct: 436  AEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 491

Query: 667  FLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPV-LPSV 491
               EP         D F T PY   +P    +    + G   + +N    + +P    S 
Sbjct: 492  HFHEP-------GYDQFSTTPYME-SPSNAITGLPYTTG--FDYLNPPSGFGHPFSSKST 541

Query: 490  KDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGLPP 389
              +YPF P+ T  V        PL E   +                  R+ P +E G PP
Sbjct: 542  YPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGKPP 601

Query: 388  VMRLSSYDEHVRPDMYR 338
               +S YD H+RP+MYR
Sbjct: 602  SFPVSHYDAHLRPNMYR 618


>gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]
          Length = 654

 Score =  302 bits (774), Expect = 4e-79
 Identities = 257/689 (37%), Positives = 343/689 (49%), Gaps = 58/689 (8%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESN--AMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2057
            E ++QR ++SM++S   AMTIEFLRARLLSERSVS++ARQRADEL K+V ELEEQL+ VS
Sbjct: 7    EKQDQRSSSSMEDSQSTAMTIEFLRARLLSERSVSRSARQRADELEKRVEELEEQLRIVS 66

Query: 2056 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1877
            LQRK AEKAT +VL+ILENHGISD SE +DS SDQE         NG             
Sbjct: 67   LQRKMAEKATVDVLSILENHGISDASETYDSGSDQETHQVANNYANGEERSVVSK----- 121

Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-----ASFGSNSL 1712
            R++                GRSLSW+   DS  + EK  Y DS  RR     +SFGS+S 
Sbjct: 122  RRSVLEELSGSDLDSSPINGRSLSWKGRSDSSRSREK--YKDSSVRRQNALSSSFGSSS- 178

Query: 1711 SARRVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLES 1532
                VGKSCR+IR R+TR++ +                     E +     ENG      
Sbjct: 179  PKHYVGKSCRQIRCRETRTVVEDHKT-----------------EPLKFDSQENGAATPPE 221

Query: 1531 STLRSNSETQKMDGRYFDV--HERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENN 1358
             +++++         + DV  H ++ DM+ AL+H+ QLIGQY           EK+RENN
Sbjct: 222  GSVKNDRRIP----NHLDVNGHGQEKDMKKALEHRAQLIGQYEEMEKAQREWEEKYRENN 277

Query: 1357 SGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-ACFSEKPETS-- 1187
            + T DS DPGNHSDVTE+R E+K+  L    G   +   + K  +VD +  S KP+++  
Sbjct: 278  TSTPDSYDPGNHSDVTEDRDEVKAQTLYN-VGIDIAQAVDAKSNKVDLSKESSKPQSNGF 336

Query: 1186 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQ 1049
                           ++  N + ++    A EF+FP ++EK  QE L        +R  +
Sbjct: 337  LHPTRTRAAMGDLKVQASSNIDPVASRFQAQEFAFPTAKEKEAQESL----ENRDFRPSE 392

Query: 1048 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLEL--------AVVPQDN---LGSVL 902
             P   Q   +S     P++    ALS     S   +         A+VP +    LG VL
Sbjct: 393  SPHHGQLLHRSLPN-QPFDR--GALSDAGSSSHKRDFSGSQNDLYALVPHNPPVVLGGVL 449

Query: 901  EALKRAKSSLNQKLNNSP----PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPT 734
            +ALK+AK SL QK+N  P     T   A     +P+   T   D  +IPV   GLFRLPT
Sbjct: 450  DALKQAKLSLQQKINRLPLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVGCTGLFRLPT 509

Query: 733  DYQPENARPGFANFPPENSLGRFLSEPF--DSRSAFSS-DLFLTDPY----RPFTPERPF 575
            D+    A    ANF    S  R   EP+  D++ A ++ D FLT PY      F P+  F
Sbjct: 510  DFATVEASTQ-ANFLSSGS--RLSLEPYYPDNKVALTAPDRFLTSPYIESRSEFPPDVRF 566

Query: 574  --SQPRLSEGPSSNRMNRLDSYTNPVLPSVK--------DSYPFLPDVTLRVPLNEGGAS 425
              S   +S   +S   +R DS+ +    SV          SYP  PD   R+P +E G  
Sbjct: 567  LTSSSVVSGSRASTLNSRFDSHFDTGPSSVNRYSNYPPHPSYPPFPDSMPRIPSDE-GLR 625

Query: 424  RNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
            R F SS     P  R S YD+H RP+MYR
Sbjct: 626  RPFRSSRSFGLPEDRFSFYDDHGRPNMYR 654


>ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca
            subsp. vesca]
          Length = 807

 Score =  302 bits (774), Expect = 4e-79
 Identities = 220/577 (38%), Positives = 297/577 (51%), Gaps = 34/577 (5%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            + ++ R  + M +S  +TIEFLRARLLSERSVS++ARQRADEL K V ELEEQLK VSLQ
Sbjct: 7    DTQDLRINSGMDDSPGITIEFLRARLLSERSVSRSARQRADELEKMVEELEEQLKIVSLQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQE---ESPHDFKARNGXXXXXXXXXXXK 1880
            RK AEKATA+VLAILEN G SD+SEEFDS SD E   ES    K+R              
Sbjct: 67   RKMAEKATADVLAILENQGASDISEEFDSSSDHETFQESKMGNKSRKEEENFLISE---- 122

Query: 1879 LRKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSA 1706
             R+N                GR+LSW+   DS  + EK     S+RRR++F +  +S S 
Sbjct: 123  -RRNEHEEYSGSDLDSSSIPGRNLSWKGRIDSPRSREKYK-EPSIRRRSTFSAVGSSSSR 180

Query: 1705 RRVGKSCRRIRHRDTRSM-----------EDSQNDGTEKAACSGDAFNGSDGEHVALREY 1559
              +GKSCR+I+HR+TRS+           +DS+ +G   ++     F+  D E +     
Sbjct: 181  HNLGKSCRQIKHRETRSVVERSKDEPAKFDDSEENGVAASSEGLSNFSYCDPERLRDGPE 240

Query: 1558 ENGKNQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXX 1379
               +  L    L  + E Q+     F+ H R+ DME AL+HQ QLIGQ            
Sbjct: 241  SQKEKFLSKDALTRSKEHQRNGDPNFNGHGRNKDMERALEHQAQLIGQNEEMEMAQREWE 300

Query: 1378 EKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC-FSE 1202
            EKFRENN+ T DSCDPGNHSD+TEER EMK+P     A  + S+ QE K E  D+C F E
Sbjct: 301  EKFRENNTSTPDSCDPGNHSDITEERDEMKTP---FPAEINASEAQEAKSEARDSCLFEE 357

Query: 1201 KPET--------------SKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQ 1064
            K +T                +   N + ++  S   EF+FP + E+  QE L    +   
Sbjct: 358  KMKTQLNGYLPPSDVEMGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLENNAHQPS 417

Query: 1063 YRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEAL 893
              S   P +++++   S+ +S     S   ++  +  L    A+VP   Q+ LG VL+AL
Sbjct: 418  PGSHHDPLLLESSHNRSSVVSSDGGSSFHNASGSRNDL---YALVPHDSQERLGGVLDAL 474

Query: 892  KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA 713
            K+AK SL QK+   P     +     +P        +   IPV   GLFRLPTD+  E A
Sbjct: 475  KQAKLSLQQKIIRLPLVDDTSVQESIEPPIPAVTTGNRLDIPVGCAGLFRLPTDFAVEEA 534

Query: 712  RPGFANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPY 602
                +     +SL      P    +A S+D F+T  Y
Sbjct: 535  ATKHSYLGLGSSLPSARYCPDKGLAASSTDQFVTSTY 571


>ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus]
          Length = 671

 Score =  298 bits (764), Expect = 6e-78
 Identities = 248/697 (35%), Positives = 335/697 (48%), Gaps = 66/697 (9%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            + ++ R    ++++ AMTIEFLRARLLSERSVSK+ARQRADELAK+VAELEEQLK VSLQ
Sbjct: 7    DQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSLQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            RK AEKATA+VLAILE++G SD+SE  DS SD E  P   K  +G           + R+
Sbjct: 67   RKMAEKATADVLAILEDNGASDISETLDSNSDHETEP---KVEDGLAREDVSSGTVR-RR 122

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSARRV 1697
            N                G SLSW+   DS H  EK     S+R R+SF S  +S    ++
Sbjct: 123  NEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYK-KHSIRSRSSFTSIGSSSPKHQL 181

Query: 1696 GKSCRRIRHRDTRSMEDSQN-------DGTEKAACSG--DAFNGSDGEHVALRE-YENGK 1547
            G+SCR+I+ RDTR ++  Q        D +E+   +   D+ N S   H  LR+ YE  +
Sbjct: 182  GRSCRQIKRRDTRPLDGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDGYEVRE 241

Query: 1546 NQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFR 1367
                SS+   NS          D +E+ DDME AL+ Q QLI QY           EKFR
Sbjct: 242  KTRSSSSGVHNSVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFR 301

Query: 1366 ENNSGTQDSCDPGNHSDVTEERYEMK--SPELSRAAGTS-------NSDNQETKQEQVDA 1214
            ENN+ T DSCDPGNHSD+TEER EM+  +P LS             + D ++  Q Q + 
Sbjct: 302  ENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNPANEAKPQVAFDCDTRDLSQAQTNG 361

Query: 1213 CFSEKPETSKRSL--QNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQ---YRSQQ 1049
                        L  QN N IS   S  EF+FPM+  K  QE    Q N++Q     S  
Sbjct: 362  LGPSMCAVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQE---SQENSAQEPSCTSHL 418

Query: 1048 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRAKSSLN 869
               + +    S   I+ Y++++   +      +P E        L  VLEALK+AK SL 
Sbjct: 419  NHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHE-----PPALDGVLEALKQAKLSLT 473

Query: 868  QKLNNSPPTAG------RASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NAR 710
            +K+   P   G      ++ G +  P        D  +IPV   GLFRLPTD+  E +++
Sbjct: 474  KKIIKLPSVDGESESIDKSIGPLSIPKMG-----DRLEIPVGCAGLFRLPTDFAAEASSQ 528

Query: 709  PGF----------ANFPPENSL----------------------GRFLSEPFDSRSAFSS 626
              F           ++P E +                        R  S  + + S F+ 
Sbjct: 529  ANFLASSSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSGYRAGSGFTR 588

Query: 625  DLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPVLP-SVKDSYPFLPDVTLRV 449
            D FLTD      PE  +  P         + +  D Y + V P S   +YP  P V+  +
Sbjct: 589  DGFLTD----HIPENRWKNP--------GQKHHFDQYFDAVQPSSYVHNYPPRP-VSSNI 635

Query: 448  PLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
              N+    R FP     +PP  + S YD+  RP+MYR
Sbjct: 636  HPND-TFLRTFPGRSTEMPPTNQYSFYDDQFRPNMYR 671


>gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 709

 Score =  297 bits (761), Expect = 1e-77
 Identities = 238/717 (33%), Positives = 353/717 (49%), Gaps = 84/717 (11%)
 Frame = -3

Query: 2236 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2057
            S + ++ ++TT   E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS
Sbjct: 4    SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63

Query: 2056 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1877
            +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P +    NG           K+
Sbjct: 64   VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122

Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1703
            R+               ++GRSLSW+  K + H+ E+  Y D  VR R SF S S S+R 
Sbjct: 123  RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180

Query: 1702 -RVGKSCRRIRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYE 1556
             R GKSCR+IR R++RS          M D Q  G E ++   +A + + G H+     E
Sbjct: 181  HRQGKSCRQIRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSE 239

Query: 1555 NGKNQLESSTLRSNSETQKMDGRYFDV----HERDDDMESALQHQVQLIGQYXXXXXXXX 1388
              +N+     L S++   + +   FD+    +E + DME AL+HQ QLI  Y        
Sbjct: 240  IHENKSTVDNLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQR 299

Query: 1387 XXXEKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACF 1208
               EKFRE NS + DSCDPGNHSDVTEER E+K+ +    +GT+ S  Q  ++E + +  
Sbjct: 300  EWEEKFREKNSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFS 357

Query: 1207 SEKPETS--------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFL 1088
            +E P+                       RSL  E+ ++  S   + +F M++E ++Q   
Sbjct: 358  AELPKIHSNDLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ 416

Query: 1087 GIQHNASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG- 911
               +N+    S  F     +    + +    +  S +    P+    L  A+VP +  G 
Sbjct: 417  --SNNSPSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGR 473

Query: 910  --SVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLP 737
               VL++LK+A+ SL QK++      G + G   + S +     +  +IP+   GLFR+P
Sbjct: 474  FTGVLDSLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVP 533

Query: 736  TDYQPENARPGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPE 584
            TD   E  +  F         AN  P+  +    S    + S  ++    +  Y+P + +
Sbjct: 534  TDISVEAPKANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSD 593

Query: 583  R----PFSQPRLSEGP-----------------------SSNRMN----RLDSYTNPVLP 497
            R    P+  PR S  P                       + +R++      D    PVLP
Sbjct: 594  RFFSGPYMYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLP 653

Query: 496  SVK----DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
            S       ++P  PD+  ++   EG  + +   S    P     S YD H RPD++R
Sbjct: 654  SSSLQNYPTFPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 708


>gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 749

 Score =  294 bits (752), Expect = 1e-76
 Identities = 237/708 (33%), Positives = 347/708 (49%), Gaps = 84/708 (11%)
 Frame = -3

Query: 2209 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2030
            TT   E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS+QR++AEKA
Sbjct: 53   TTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSVQRRRAEKA 112

Query: 2029 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1850
            TA+VLAILEN+G+SD+SEE DS SDQ ++P +    NG           K+R+       
Sbjct: 113  TADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKVRQKESEELS 171

Query: 1849 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR--RVGKSCRR 1679
                    ++GRSLSW+  K + H+ E+  Y D  VR R SF S S S+R  R GKSCR+
Sbjct: 172  GSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRKHRQGKSCRQ 229

Query: 1678 IRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1529
            IR R++RS          M D Q  G E ++   +A + + G H+     E  +N+    
Sbjct: 230  IRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSEIHENKSTVD 288

Query: 1528 TLRSNSETQKMDGRYFDV----HERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFREN 1361
             L S++   + +   FD+    +E + DME AL+HQ QLI  Y           EKFRE 
Sbjct: 289  NLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREK 348

Query: 1360 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS-- 1187
            NS + DSCDPGNHSDVTEER E+K+ +    +GT+ S  Q  ++E + +  +E P+    
Sbjct: 349  NSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSN 406

Query: 1186 ------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQY 1061
                               RSL  E+ ++  S   + +F M++E ++Q      +N+   
Sbjct: 407  DLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSN 463

Query: 1060 RSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALK 890
             S  F     +    + +    +  S +    P+    L  A+VP +  G    VL++LK
Sbjct: 464  SSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLK 522

Query: 889  RAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENAR 710
            +A+ SL QK++      G + G   + S +     +  +IP+   GLFR+PTD   E  +
Sbjct: 523  QARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPK 582

Query: 709  PGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQ 569
              F         AN  P+  +    S    + S  ++    +  Y+P + +R    P+  
Sbjct: 583  ANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMY 642

Query: 568  PRLSEGP-----------------------SSNRMN----RLDSYTNPVLPSVK----DS 482
            PR S  P                       + +R++      D    PVLPS       +
Sbjct: 643  PRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPT 702

Query: 481  YPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
            +P  PD+  ++   EG  + +   S    P     S YD H RPD++R
Sbjct: 703  FPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 748


>gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris]
          Length = 652

 Score =  290 bits (743), Expect = 2e-75
 Identities = 238/666 (35%), Positives = 337/666 (50%), Gaps = 40/666 (6%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            + ++QR  +S ++S AMTIEFLRARLLSERS+SK+ARQRADELA+KV ELEEQL+ V LQ
Sbjct: 7    DPQDQRIASSTEDSTAMTIEFLRARLLSERSISKSARQRADELAEKVMELEEQLRMVILQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            RK AEKATA+VLAILE+ GIS VS+EFDS SD  E+P D    N            K R+
Sbjct: 67   RKMAEKATADVLAILESQGISGVSDEFDSGSDL-ENPFDSSMSNECAKEDEGPMKSKGRQ 125

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEK-KNYMDSVRRRASFGSNSLSAR-RV 1697
            +               + +SLSW+   D  H+LEK K    +VRR++SF S S S + R+
Sbjct: 126  HGSDEMSGSNEDSSLVSSKSLSWKGRHDLSHSLEKYKTKSTNVRRQSSFSSFSSSPKHRL 185

Query: 1696 GKSCRRIRHRDTRS-MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLR 1520
            GKSCR+IRHR  RS ME+S+           +  + S+G       + +G     S+ L+
Sbjct: 186  GKSCRKIRHRQPRSVMEESRGKFVHVNCQVNELVSSSEG----FPNFRDG----GSNILK 237

Query: 1519 SNSETQKMDG---------RYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFR 1367
              S+ Q+ DG          + D + R+++ME AL+HQ +LI QY           EKFR
Sbjct: 238  IESKIQEEDGSEANLLSKNHHIDGYGRENEMEKALEHQAELIDQYEAMEKAQREWEEKFR 297

Query: 1366 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS 1187
            ENNS T DSCDPGNHSD+TE++ E K  ++  AA    S  +E+K E    C SE+    
Sbjct: 298  ENNSTTPDSCDPGNHSDMTEDKDEGK-VQIPYAAKVVTSKAEESKGEPGGVCLSEE---- 352

Query: 1186 KRSLQNENIISCESSASE-FSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQSST 1010
            K   +   I+  +   ++ +    S   +  +FLG +++ S  +  Q   +V   +QSS 
Sbjct: 353  KLKAEGREIMPKKHDDTDVYRNQKSTTFSTSDFLGQENSHSPLKGNQNEILVNGHSQSSD 412

Query: 1009 KISPYEEKSTALST----------PPKISLPLELAVVPQDN--LGSVLEALKRAKSSLNQ 866
                 + + ++  T            K    L   V  + +     VLE+LK+A+ SL Q
Sbjct: 413  MNHLDQGRHSSFPTDIHGVQHQHDASKNQKDLYALVTREQSHQFDGVLESLKQARISLQQ 472

Query: 865  KLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPP 686
            +LN  P   G   G   +P  + +   D F+IP    GLFRLPTD+  E A P F    P
Sbjct: 473  ELNRLPVVEG---GYTAKPLPSVSKNEDRFEIPFGFSGLFRLPTDFSDE-ATPRFNVRDP 528

Query: 685  ENSLGRFLSEPFDSRSAFSSDLFLTDP-------YRPFTPERPFSQPRLSEGPS-SNRMN 530
                G        + S  S   F T+P         P   ++  +   L  G   S+  +
Sbjct: 529  TTGFGSNY-HLNGTMSRTSVGQFFTNPPHSGKMLMSPSANDQALATRYLENGSRFSSSQS 587

Query: 529  RLDSYTN-PVLPSVKDSYPFLP------DVTLRVPLNEGGASRNFPSSERGLPPVMRLSS 371
              D ++N   L S K SYP  P      + T ++P  +   SR + +S  G+P   R S 
Sbjct: 588  PFDPFSNGGPLSSSKYSYPTFPINPSYQNATPQMPFGD-EVSRPYSNSTVGVPLANRFSF 646

Query: 370  YDEHVR 353
             D+H+R
Sbjct: 647  NDDHLR 652


>ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4
            [Glycine max]
          Length = 641

 Score =  287 bits (735), Expect = 1e-74
 Identities = 232/662 (35%), Positives = 323/662 (48%), Gaps = 36/662 (5%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            + ++QR T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQ
Sbjct: 7    DPQDQRVTSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            RK AEKATA+VLAILE+ GISDVSEEFDS SD  E+P D    N            K R+
Sbjct: 67   RKMAEKATADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQ 125

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVG 1694
            +               + +SLSW+   DS H+LEK     ++RR++SF S S S + R G
Sbjct: 126  HGSDKMPGSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQG 184

Query: 1693 KSCRRIRHRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESST 1526
            KSCR+IRHR  R  +E+S+N   +  ++ A     F    G    + + E+   +   S 
Sbjct: 185  KSCRKIRHRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSG 244

Query: 1525 LRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1346
                ++   +DG     + R+ DME AL+HQ QLI QY           EKFRENNS T 
Sbjct: 245  ANPLNKNHHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTP 299

Query: 1345 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNE 1166
            DSCDPGN+SD+TE++ E K   +  AA    SD QE+K E    C SE+    K   +  
Sbjct: 300  DSCDPGNYSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEAR 354

Query: 1165 NII-SCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQSSTKISPYEE 989
            +I+         +S   +   +  + LG Q++    +  Q    V    Q S        
Sbjct: 355  DIMPKTHDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPG 414

Query: 988  KSTALSTPPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLN 869
            +     + P  S P ++  V   N                       VLE+LK+A+ SL 
Sbjct: 415  RHGYHDSKPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQ 474

Query: 868  QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPG 704
            Q+L   P      SG   +PS + +   D F++PV   GLFR+PTD+        N +  
Sbjct: 475  QELKRLPLV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDP 531

Query: 703  FANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSN 539
             A F     L R +S   D +       F + PY       P +   L+      GP+  
Sbjct: 532  TAGFGSNFHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGG 585

Query: 538  RMNRLDSYTNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEH 359
             ++    YT P  P +  SY    + T ++P      SR + SS  G+P   R S   +H
Sbjct: 586  SLSS-SKYTYPTFP-INPSY---QNATPQMPFG-NEVSRPYSSSTVGVPLANRFSFNSDH 639

Query: 358  VR 353
            +R
Sbjct: 640  LR 641


>ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Glycine max] gi|571568788|ref|XP_006606285.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X2
            [Glycine max] gi|571568792|ref|XP_006606286.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X3
            [Glycine max]
          Length = 664

 Score =  283 bits (723), Expect = 3e-73
 Identities = 230/655 (35%), Positives = 318/655 (48%), Gaps = 36/655 (5%)
 Frame = -3

Query: 2209 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2030
            T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQRK AEKA
Sbjct: 37   TSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQRKMAEKA 96

Query: 2029 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1850
            TA+VLAILE+ GISDVSEEFDS SD  E+P D    N            K R++      
Sbjct: 97   TADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQHGSDKMP 155

Query: 1849 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVGKSCRRIR 1673
                     + +SLSW+   DS H+LEK     ++RR++SF S S S + R GKSCR+IR
Sbjct: 156  GSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQGKSCRKIR 214

Query: 1672 HRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRSNSET 1505
            HR  R  +E+S+N   +  ++ A     F    G    + + E+   +   S     ++ 
Sbjct: 215  HRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSGANPLNKN 274

Query: 1504 QKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSCDPGN 1325
              +DG     + R+ DME AL+HQ QLI QY           EKFRENNS T DSCDPGN
Sbjct: 275  HHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTPDSCDPGN 329

Query: 1324 HSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENII-SCE 1148
            +SD+TE++ E K   +  AA    SD QE+K E    C SE+    K   +  +I+    
Sbjct: 330  YSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEARDIMPKTH 384

Query: 1147 SSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALST 968
                 +S   +   +  + LG Q++    +  Q    V    Q S        +     +
Sbjct: 385  DDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDS 444

Query: 967  PPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLNQKLNNSP 848
             P  S P ++  V   N                       VLE+LK+A+ SL Q+L   P
Sbjct: 445  KPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLP 504

Query: 847  PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPGFANFPPE 683
                  SG   +PS + +   D F++PV   GLFR+PTD+        N +   A F   
Sbjct: 505  LV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDPTAGFGSN 561

Query: 682  NSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSNRMNRLDS 518
              L R +S   D +       F + PY       P +   L+      GP+   ++    
Sbjct: 562  FHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGGSLSS-SK 614

Query: 517  YTNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVR 353
            YT P  P +  SY    + T ++P      SR + SS  G+P   R S   +H+R
Sbjct: 615  YTYPTFP-INPSY---QNATPQMPFG-NEVSRPYSSSTVGVPLANRFSFNSDHLR 664


>gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508727307|gb|EOY19204.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 665

 Score =  280 bits (717), Expect = 2e-72
 Identities = 230/704 (32%), Positives = 338/704 (48%), Gaps = 71/704 (10%)
 Frame = -3

Query: 2236 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2057
            S + ++ ++TT   E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS
Sbjct: 4    SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63

Query: 2056 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1877
            +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P +    NG           K+
Sbjct: 64   VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122

Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1703
            R+               ++GRSLSW+  K + H+ E+  Y D  VR R SF S S S+R 
Sbjct: 123  RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180

Query: 1702 -RVGKSCRRIRHRDTRSM-EDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1529
             R GKSCR+IR R++RS+ E+ ++D                     +      K    SS
Sbjct: 181  HRQGKSCRQIRRRESRSVAEELKSDN--------------------IMVDPQVKGLENSS 220

Query: 1528 TLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGT 1349
             + +N  T             + DME AL+HQ QLI  Y           EKFRE NS +
Sbjct: 221  EVNANHST------------GEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREKNSSS 268

Query: 1348 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS------ 1187
             DSCDPGNHSDVTEER E+K+ +    +GT+ S  Q  ++E + +  +E P+        
Sbjct: 269  PDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSNDLVP 326

Query: 1186 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQ 1049
                           RSL  E+ ++  S   + +F M++E ++Q      +N+    S  
Sbjct: 327  PSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSNSSHH 383

Query: 1048 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALKRAKS 878
            F     +    + +    +  S +    P+    L  A+VP +  G    VL++LK+A+ 
Sbjct: 384  FAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLKQARL 442

Query: 877  SLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGF- 701
            SL QK++      G + G   + S +     +  +IP+   GLFR+PTD   E  +  F 
Sbjct: 443  SLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFL 502

Query: 700  --------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQPRLS 557
                    AN  P+  +    S    + S  ++    +  Y+P + +R    P+  PR S
Sbjct: 503  GSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMYPRTS 562

Query: 556  EGP-----------------------SSNRMN----RLDSYTNPVLPSVK----DSYPFL 470
              P                       + +R++      D    PVLPS       ++P  
Sbjct: 563  SSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPTFPSY 622

Query: 469  PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
            PD+  ++   EG  + +   S    P     S YD H RPD++R
Sbjct: 623  PDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 664


>ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa]
            gi|222850857|gb|EEE88404.1| hypothetical protein
            POPTR_0008s02540g [Populus trichocarpa]
          Length = 684

 Score =  278 bits (712), Expect = 6e-72
 Identities = 239/691 (34%), Positives = 339/691 (49%), Gaps = 60/691 (8%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            E ++QR  +SM++S A+TIEFLRARLL+ERSVS+TARQRADELA++VAELEEQL+ VSLQ
Sbjct: 7    EKQDQRTRSSMEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQLRIVSLQ 66

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            R KAEKAT +VLAILE++GISD SE F S SDQ+ +P + K               K+ K
Sbjct: 67   RMKAEKATVDVLAILESNGISDDSEIFGSSSDQD-TPCESKVGK-KTKQEESSVISKVTK 124

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-VG 1694
                           S GR+LSW+  K S  +LEK     S+RRR+SF S S S +   G
Sbjct: 125  YKLEEHSGSGHDFSSSQGRNLSWKGRKHSPRSLEKCKD-PSLRRRSSFASTSSSPKHHQG 183

Query: 1693 KSCRRIRHRDTR-------SMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE 1535
            KSCR++R++++R       +  D  +      A + + F       V     ENG+ +  
Sbjct: 184  KSCRQVRNKESRLTIGAFRTNPDKVDSPENGVATTSEVFPNCSEPEVG--RIENGEEKTL 241

Query: 1534 SSTLRSNSETQKMDGRYFD--VHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFREN 1361
                      Q+ D    +  V+  D DME AL+HQ QLI +Y           EKFREN
Sbjct: 242  PPISVGLENGQRADSNELEDNVYGSDRDMEKALEHQAQLIDRYKAMEKVQREWEEKFREN 301

Query: 1360 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKR 1181
            N  T DS D GN SDVTEE YE+K+ ++ +  GT  + +   K E V+   + +P    R
Sbjct: 302  NGSTPDSYDAGNRSDVTEEGYEIKA-QVQQHTGTVAAQSNRAKSE-VEKASNIQPNGILR 359

Query: 1180 ----------SLQNENIISCESSASEFSFPMSREK--NNQEFLGIQHNASQYRSQQFPPM 1037
                        ++ +  + ES A +F+F   ++K   N+E LG  ++ S + S   P  
Sbjct: 360  PSHVNIGQLQEWKSSSAPTSESPAQDFAFRAEKQKQNENEESLGNNYHPSPHSSHDHP-- 417

Query: 1036 VQTTTQSSTKISPYEEKSTALSTPPKISLPL--------EL-AVVP---QDNLGSVLEAL 893
                   S+  SP  + +T+  +                EL A+VP    + LG VL+AL
Sbjct: 418  ----QSHSSHDSPGSQSATSFPSNTDSGFSKGQFSGRQNELYALVPHRASNELGGVLDAL 473

Query: 892  KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-- 719
            K A+ SL QK++  P   G +  +   PS       D   IP+ + GLFRLP D+  E  
Sbjct: 474  KLARQSLQQKISTLPLIEGGSIRNSVDPSLPPPIPGDKVDIPLGNAGLFRLPFDFLAEGS 533

Query: 718  --------NARPGFANFPPEN-----SLGRFLSE-PFDSRSAF-SSDLFLTDPYRPFTPE 584
                    NA     N+ P+      ++ RF+S  P  + S F ++D FL       T  
Sbjct: 534  TRKNLDSTNAGLSLRNYYPDTGVPAAAINRFVSRFPTATGSRFPTADQFLASQSYSATGS 593

Query: 583  RPFSQPRLSEGPSSNRMNRLDS---YTNPVL-----PSVKDSYPFLPDVTLRVPLNEGGA 428
            R  ++ +          +R+ S   +  P L     PS + SYP  P     +P      
Sbjct: 594  RFPTEDQFLASQDVEAGSRISSQRPFFYPYLDTVSPPSARYSYPTNPSYPGPMPQLPSRE 653

Query: 427  SRNF-PSSERGLPPVMRLSSYDEHVRPDMYR 338
              +F PS+  G+PP    S  D H+RP+MYR
Sbjct: 654  PPSFLPSTTAGVPPADHFSFPDYHIRPNMYR 684


>emb|CBI40233.3| unnamed protein product [Vitis vinifera]
          Length = 682

 Score =  278 bits (711), Expect = 8e-72
 Identities = 187/418 (44%), Positives = 237/418 (56%), Gaps = 32/418 (7%)
 Frame = -3

Query: 2200 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2021
            M++S AMTIEFLRARLLSERSVS+TARQRADELA++V +LEEQLK VS+QR KAEKATA+
Sbjct: 1    MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60

Query: 2020 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1841
            VLAILENH ISDVS EFDS SDQE +  D     G                         
Sbjct: 61   VLAILENHAISDVSWEFDSSSDQEVALCDSHVGGG------------------------- 95

Query: 1840 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR--VGKSCRRIRHR 1667
                    R LSW+SSKDS H++EK+    S+RRR SF S+  S+ +  +GKSCR+IR R
Sbjct: 96   --------RRLSWKSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRR 147

Query: 1666 DTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESSTL 1523
            +TRS          M DSQN+G    + S    NG D     LRE    + +  L    +
Sbjct: 148  ETRSAVDELKVGRVMVDSQNNGI--ISSSEGLPNGFDSGQEILREGSENQEEEALMDGQV 205

Query: 1522 RSNSETQKM---DGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSG 1352
              + E+Q+       + + + RD DME AL+HQ QLIGQY           EKFRENNS 
Sbjct: 206  SDSLESQRDATGSNHHLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSS 265

Query: 1351 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS----- 1187
            T DSC+PGNHSDVTEER E+K P+   AAG   S +Q TK +  D  F+E+   +     
Sbjct: 266  TPDSCEPGNHSDVTEERDEVK-PQAPSAAGILTSQDQGTKLDDEDVHFNEESSQTLPTIS 324

Query: 1186 -------KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFP 1043
                      LQ +N   +++ ES A +F FPM++E  +QEFL  Q     + S  +P
Sbjct: 325  TTHLHGDMECLQEQNRCSMLAYESLAPDFVFPMAKENLHQEFLENQSYPLSHSSHHYP 382



 Score =  103 bits (257), Expect = 3e-19
 Identities = 77/220 (35%), Positives = 109/220 (49%), Gaps = 20/220 (9%)
 Frame = -3

Query: 937  AVVPQDN---LGSVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIP 767
            A+VP++    LG VLEAL++A+ SL  KLN  P   G + G   +PS   T   +  +IP
Sbjct: 472  ALVPRETSNELGGVLEALQQARLSLQHKLNRLPLIEGGSIGRAIEPSFPSTRAWERVEIP 531

Query: 766  VISPGLFRLPTDYQ----------PENARPGFANFPPE-----NSLGRFLSEPF--DSRS 638
            V   GLFR+P DYQ            +++    N+ P+     N   RFL+ P+     S
Sbjct: 532  VGCAGLFRVPADYQLGTATEANFLGSDSQSSLKNYYPDTGFVANPGDRFLTSPYLKTGSS 591

Query: 637  AFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPVLPSVKDSYPFLPDVT 458
              + D FLT PYR      P  +P   +  S   ++    YT+P       +Y   PD+ 
Sbjct: 592  VPTDDSFLTSPYRETGSRIPPLRPSF-DYYSDAGLSASTRYTHP-------TYSSHPDLL 643

Query: 457  LRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338
             R+P NEG A R   +SE G+P     S YD+H+RP+MYR
Sbjct: 644  YRMPFNEGFA-RPPRNSEVGIPSTDHFSFYDDHIRPNMYR 682


>ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like isoform X1 [Solanum
            tuberosum] gi|565389467|ref|XP_006360477.1| PREDICTED:
            flocculation protein FLO11-like isoform X2 [Solanum
            tuberosum] gi|565389469|ref|XP_006360478.1| PREDICTED:
            flocculation protein FLO11-like isoform X3 [Solanum
            tuberosum] gi|565389471|ref|XP_006360479.1| PREDICTED:
            flocculation protein FLO11-like isoform X4 [Solanum
            tuberosum] gi|565389473|ref|XP_006360480.1| PREDICTED:
            flocculation protein FLO11-like isoform X5 [Solanum
            tuberosum]
          Length = 678

 Score =  272 bits (696), Expect = 4e-70
 Identities = 225/656 (34%), Positives = 320/656 (48%), Gaps = 29/656 (4%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            ED++Q K   +++S   TIEFLR RLL+ERS S+TA+QRADELA++V+ELEEQLK VSLQ
Sbjct: 7    EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQRVSELEEQLKAVSLQ 65

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            RKKAE+ATA VL+ILENH I DVSEEF S SD+E    D K               + ++
Sbjct: 66   RKKAERATAAVLSILENHSIDDVSEEFSSGSDKEAILSDQKDAENKTGGDISSSVKE-KE 124

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1697
            +              ST RSLSW+S K S H+L+++ Y DS RRR ++F S  +S+ +RV
Sbjct: 125  DDVDTLSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSSTDISSPKRV 183

Query: 1696 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1517
            G SCRRIR RDTRS  D   + +  A C+ +    S            G N +      S
Sbjct: 184  GNSCRRIRRRDTRSASDKLQNSS--AECASEPLPSSANNEPHPLTAGAGINDVNDQVHVS 241

Query: 1516 NSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1337
              +   + G   +  + D+D + AL  Q QLIGQY           EK+RE+N  T DSC
Sbjct: 242  AID---VSGNGKEADKSDEDSQRALHQQAQLIGQYEAEEKAQREWEEKYRESNICTPDSC 298

Query: 1336 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEK----------PETS 1187
            D  N+SDVTEER ++K+ +    AG ++  N   +    D   +E+          P  +
Sbjct: 299  DRENYSDVTEERDDLKASQEPCLAGNTSMQNHANQSGAADVSRTEQNGNIDNSPSTPHVN 358

Query: 1186 KRSLQNE---NIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016
               L+++     +  +S ASE + PMS    N  +L      S Y  QQ  P+ +     
Sbjct: 359  MSCLEDKKGSRTVESDSPASELARPMS----NGNYLENHGQTSAYSHQQSLPVTR----- 409

Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 845
                SP   +S++L          ELA+V     +++ SVL  L++AK SL +++N+S P
Sbjct: 410  ----SPMHPRSSSLQAGQAPQTGYELALVSHNTSNSVNSVLGELEQAKLSLTKQINSSLP 465

Query: 844  TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPPENSLGRF 665
            TA          S N++++                P+ Y+                    
Sbjct: 466  TASYPGMPSRFSSVNQSSE----------------PSTYETS------------------ 491

Query: 664  LSEPFDSRSAF-SSDLFLTDPYRPFTPE--------RPFSQPRLSEG-PSSNRMNRLDSY 515
            LS   +SRS + +    +T P++   PE        RP S+     G PSS R N   S 
Sbjct: 492  LSPYMESRSKYVTQGNRVTYPFQRAFPEVSSSAPSYRPISETNFDAGQPSSMRFNPNSSS 551

Query: 514  TNPVLPS-VKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRP 350
              P+       SYP  PD+  ++P NE   SRN+P +E  LPP    S++   V P
Sbjct: 552  RLPLSSKFTYPSYPKFPDMVPKLPPNE-VFSRNYPRNETDLPPSFSFSTWSPEVVP 606


>ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251943 [Solanum
            lycopersicum]
          Length = 729

 Score =  266 bits (680), Expect = 3e-68
 Identities = 231/673 (34%), Positives = 334/673 (49%), Gaps = 46/673 (6%)
 Frame = -3

Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051
            ED++Q K   +++S   TIEFLR RLL+ERS S+TA+QRADELA+ V+ELEEQLK VSLQ
Sbjct: 7    EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQMVSELEEQLKVVSLQ 65

Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871
            RK+AEKATA VL+ILE+H I DVSEEF S SD+E    D K   G           K ++
Sbjct: 66   RKRAEKATAAVLSILEDHSIDDVSEEFSSGSDKETILSDQKDA-GNKTGGDISSSAKEKE 124

Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1697
            +              ST RSLSW+S K S H+L+++ Y DS RRR ++F    +S+ +RV
Sbjct: 125  DDVDILSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSYTDISSPKRV 183

Query: 1696 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1517
            G SCR+IR RDTRS  D   + +  A C+ +  + S            G + +       
Sbjct: 184  GNSCRQIRRRDTRSASDKLRNSS--AECASEPLSSSANNEPHSLTAGAGISDVNDQV--- 238

Query: 1516 NSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1337
            +     + G   +  + D+D + AL  QVQ IGQY           EK+RE+NS T DSC
Sbjct: 239  HVPALDVPGNGREADKSDEDSQRALHQQVQPIGQYEAEEKAQREWEEKYRESNSCTPDSC 298

Query: 1336 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENI- 1160
            D  N+SDVTEER ++K+ +    AG ++  N   +    D   +++      S    N+ 
Sbjct: 299  DRENYSDVTEERDDLKASQEPCLAGRTSMQNHANQCGAADVSRTKQNGNIDNSPSTPNVN 358

Query: 1159 ISC------------ESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016
            +SC            +SSASE + PMS       +L      S +  QQ  P+ +     
Sbjct: 359  MSCLEDKKGSRTVGSDSSASELARPMS----TGNYLENHGQTSAFSHQQSFPVTR----- 409

Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 845
                S    +S++L     +    ELA+V     + + SVL  L++AK SL +++N+S P
Sbjct: 410  ----SSMHPRSSSLQAGQALQTGYELALVSHNTSNGVDSVLGKLEQAKLSLTKQINSSLP 465

Query: 844  TAGRASGSVFQPSNNETNKTDSFQIPVISPGL----------FRLPTDYQ---PE--NAR 710
            TA          S N + +  +++I +  P +           R+   +Q   PE  ++ 
Sbjct: 466  TASYPGTPSRFSSLNHSPELSTYEISLTPPYVESRSKYVTQSNRVTYPFQRAFPEVSSSA 525

Query: 709  PGFANFPPEN-SLGRFLSEPF-DSRSAF-SSDLFLTDPY-RPFT-------PERPFSQPR 563
            P +      N   G+  S P+ +SRS + +    +T P+ R FT         RP S+  
Sbjct: 526  PSYRPISETNFEAGQPSSTPYVESRSKYVTQSNRVTYPFQRAFTEVSSSAPSYRPISETN 585

Query: 562  LSEG-PSSNRMNRLDSYTNPVLPSVK-DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPP 389
               G PSS R N   S   P    +   SYP  PD+  ++P NE   SRNFP++E  LPP
Sbjct: 586  FDAGQPSSVRFNPNSSSRLPFSSKLTYPSYPKFPDMVPKLPPNE-VFSRNFPTNETDLPP 644

Query: 388  VMRLSSYDEHVRP 350
                S+  + V P
Sbjct: 645  SFSFSTLSQEVVP 657

