BLASTX nr result

ID: Rehmannia24_contig00012103 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00012103
         (2379 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr...   334   1e-88
ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr...   328   7e-87
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...   317   1e-83
ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyp...   311   6e-82
ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267...   308   9e-81
gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]     305   5e-80
ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyp...   305   6e-80
gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus pe...   305   8e-80
ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309...   303   2e-79
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...   300   3e-78
gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao]    297   1e-77
gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus...   295   5e-77
gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao]    294   1e-76
ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp...   288   6e-75
ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyp...   284   1e-73
ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu...   281   9e-73
gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma caca...   280   2e-72
emb|CBI40233.3| unnamed protein product [Vitis vinifera]              279   3e-72
ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like i...   276   4e-71
ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251...   266   4e-68

>ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|568878417|ref|XP_006492190.1| PREDICTED:
            uncharacterized protein LOC102610545 [Citrus sinensis]
            gi|557538863|gb|ESR49907.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 732

 Score =  334 bits (856), Expect = 1e-88
 Identities = 273/739 (36%), Positives = 363/739 (49%), Gaps = 107/739 (14%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            E ++QR  + M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQ
Sbjct: 7    EMQDQRTNSGMEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            RKKAEKATA+VLAILEN+GIS++S+ FDS SDQ E+P + +  N            K R+
Sbjct: 67   RKKAEKATADVLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRR 125

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RR 1722
            N                 R LSW   + ++ +LEK  Y DS +RRR+SF S   S+   R
Sbjct: 126  NASVEHSGSGNDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNR 183

Query: 1721 VGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTL 1545
            VGKSCR+IR R+++S  +     TE          G     V  + E   G    E   L
Sbjct: 184  VGKSCRQIRRRESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYL 241

Query: 1544 RSNS-----ETQKM---DGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFR 1389
               S     E +K+    G  F+    D DME AL+ QAQLIG+Y           E+FR
Sbjct: 242  GEGSDSGCFENEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFR 301

Query: 1388 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE 1224
            ENNS T DSCDPGN SDVTEER E K  ++ R AGT NS  QE K E     Q+    S 
Sbjct: 302  ENNSSTPDSCDPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSN 360

Query: 1223 ---KPETSKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRS-QQFPPMV 1056
                P++  +   +    + E  A +F+F MS EK NQE LG  H    + S  +  P  
Sbjct: 361  GFLPPQSGDQKCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHG 418

Query: 1055 QTTTQSSTKISPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLN 891
                QSS  +S     +T  S+  ++  S   + A+VP         VLEALK+A+ SL 
Sbjct: 419  SPENQSSQTVS----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLR 474

Query: 890  QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------N 738
            QK+++ P T  R+ G V +PS + +   D  +IPV   GLFR+PTDY  E         +
Sbjct: 475  QKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSD 534

Query: 737  ARPGFANFPPENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTP 609
            +RP  AN+ P + +G    +        D+RS F++       DLFLT P       ++ 
Sbjct: 535  SRPSLANYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSA 594

Query: 608  ERPFSQPRLSEGPSSSNRMN-RLDSYTNPVLPSVK-------DSYP-------------- 495
            E      + S+  S  + M    DS  +  LPS +        SYP              
Sbjct: 595  ENRLLTRQYSDTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLST 654

Query: 494  FL---------------------------------PDVTLRVPLNEGGASRNFPSSERGL 414
            FL                                 PD+  ++P +E G S   PS   G+
Sbjct: 655  FLPGRSVEMSVEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGM 713

Query: 413  PPVMRLSSYDEHVRPDMYR 357
            PP   L  +++H RP MYR
Sbjct: 714  PPANHLPFHNDHTRPYMYR 732


>ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|557538862|gb|ESR49906.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 716

 Score =  328 bits (841), Expect = 7e-87
 Identities = 270/729 (37%), Positives = 357/729 (48%), Gaps = 107/729 (14%)
 Frame = -2

Query: 2222 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2043
            M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQRKKAEKATA+
Sbjct: 1    MEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQRKKAEKATAD 60

Query: 2042 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1863
            VLAILEN+GIS++S+ FDS SDQ E+P + +  N            K R+N         
Sbjct: 61   VLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRRNASVEHSGSG 119

Query: 1862 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RRVGKSCRRIRH 1692
                    R LSW   + ++ +LEK  Y DS +RRR+SF S   S+   RVGKSCR+IR 
Sbjct: 120  NDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNRVGKSCRQIRR 177

Query: 1691 RDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTLRSNS-----E 1530
            R+++S  +     TE          G     V  + E   G    E   L   S     E
Sbjct: 178  RESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYLGEGSDSGCFE 235

Query: 1529 TQKM---DGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1359
             +K+    G  F+    D DME AL+ QAQLIG+Y           E+FRENNS T DSC
Sbjct: 236  NEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFRENNSSTPDSC 295

Query: 1358 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE---KPETSKR 1203
            DPGN SDVTEER E K  ++ R AGT NS  QE K E     Q+    S     P++  +
Sbjct: 296  DPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSNGFLPPQSGDQ 354

Query: 1202 SLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRS-QQFPPMVQTTTQSSTKI 1026
               +    + E  A +F+F MS EK NQE LG  H    + S  +  P      QSS  +
Sbjct: 355  KCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSSQTV 412

Query: 1025 SPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLNQKLNNSPPTA 861
            S     +T  S+  ++  S   + A+VP         VLEALK+A+ SL QK+++ P T 
Sbjct: 413  S----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLRQKMSSLPSTE 468

Query: 860  GRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------NARPGFANFPP 708
             R+ G V +PS + +   D  +IPV   GLFR+PTDY  E         ++RP  AN+ P
Sbjct: 469  SRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSDSRPSLANYNP 528

Query: 707  ENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTPERPFSQPRLS 579
             + +G    +        D+RS F++       DLFLT P       ++ E      + S
Sbjct: 529  TSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSAENRLLTRQYS 588

Query: 578  EGPSSSNRMN-RLDSYTNPVLPSVK-------DSYP--------------FL-------- 489
            +  S  + M    DS  +  LPS +        SYP              FL        
Sbjct: 589  DTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLSTFLPGRSVEMS 648

Query: 488  -------------------------PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYD 384
                                     PD+  ++P +E G S   PS   G+PP   L  ++
Sbjct: 649  VEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGMPPANHLPFHN 707

Query: 383  EHVRPDMYR 357
            +H RP MYR
Sbjct: 708  DHTRPYMYR 716


>ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis]
            gi|223526443|gb|EEF28720.1| hypothetical protein
            RCOM_0152200 [Ricinus communis]
          Length = 665

 Score =  317 bits (813), Expect = 1e-83
 Identities = 249/649 (38%), Positives = 337/649 (51%), Gaps = 49/649 (7%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            E ++QR  + M++S AMTIEFLRARLLSERSVS+TARQRADELA +VAELEEQL+ VSLQ
Sbjct: 7    EKQDQRTNSGMEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQLRIVSLQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            R KAEKATA++LAILE +GISD+SE FDSCSD+ ++P + K  N            K+R 
Sbjct: 67   RMKAEKATADILAILEGNGISDISETFDSCSDR-DTPCESKVGN-RSSKEENSINSKVRN 124

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS-NSLSARRVG 1716
            N                GRSLSW+  K+S  +LEK     S+RRR+SF S  S   +R G
Sbjct: 125  NDSEELSGSDFDFSSVPGRSLSWKGRKNSPRSLEKSK-DSSMRRRSSFSSVGSSPKQRPG 183

Query: 1715 KSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE------S 1554
            KSCR+IR +++R           K  C  D    +     +  ++E  + +++       
Sbjct: 184  KSCRQIRRKESRF---EYKASPVKRDCPEDEVAATSANFPSCSDFEPKRGEVKPLLEDSH 240

Query: 1553 STLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSG 1374
            S    N      +G  ++V+  D DME AL+HQAQLIGQY           EKFRENNS 
Sbjct: 241  SDCLGNERNASDNGLDYNVYRGDRDMEKALEHQAQLIGQYEAMEKVQREWEEKFRENNSS 300

Query: 1373 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQ 1194
            T DSCD GN SD+TEERYE++ P  ++   T+N+   E     V+   + +P     S  
Sbjct: 301  TPDSCDHGNRSDITEERYEIREP--AKGPATTNAIQTEGLLSVVEGVSNTQPHGFLPSSH 358

Query: 1193 NENIISCESSAS-----EFS-----FPMSREKNNQE---------FLGIQHDASQYRSQQ 1071
             + +   E  +S     EFS     FPM++ K NQ+          L   HD++ + SQ 
Sbjct: 359  VDAVCLEERKSSIAPVPEFSTQDSAFPMAKAKQNQKNPGNNDHSPLLIAHHDSASFGSQY 418

Query: 1070 FPPMVQTTT-QSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRAKSSL 894
                    +  S+T  S  + K+T+ S   + +L    A      LG VLEAL+ A+ SL
Sbjct: 419  SSGSQSVLSFPSNTGSSFNKGKATSGSENERCALVPHKA---SGGLGGVLEALEEARQSL 475

Query: 893  NQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFAN 717
             Q++N  P  A     SV + S + T   D  QIPV   GLFRLPTD+  E N R    +
Sbjct: 476  QQRINRLPSVATTVRKSV-ESSVSTTISRDEVQIPVGCVGLFRLPTDFSVEGNTRANLLS 534

Query: 716  FPPENSLG--------------RFLSEPF-DSRSAFSS-DLFLTDPY-----RPFTPERP 600
               + SLG              +F++ P+   RS+ S+ D FL+  Y     R  TP +P
Sbjct: 535  SSAQLSLGNHYSDRGVPAAASNQFVASPYLQGRSSSSTEDQFLSSQYVGGGSRIPTP-KP 593

Query: 599  FSQPRLSEGPSSSNRMNRLDSYTNPVLPSVKDSYPFLPDVTLRVPLNEG 453
            +  P L  G  SS+R      YT P  P +  SY   PD+  R+P  EG
Sbjct: 594  YFDPYLDTGLPSSSR------YTYPNYP-INTSY---PDLMPRIPSREG 632


>ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Solanum tuberosum]
          Length = 643

 Score =  311 bits (798), Expect = 6e-82
 Identities = 255/679 (37%), Positives = 330/679 (48%), Gaps = 47/679 (6%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            +D++QRK   M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ
Sbjct: 7    QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1899
            RKKAEKATA VL+ILEN GISD SEEFDS SDQE    + K  +                
Sbjct: 66   RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125

Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1725
             +               STGRSLSW+S K S  + E+  Y DS  RR  SF S  S S +
Sbjct: 126  ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185

Query: 1724 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNG-SDGEHVALREYENGKNQLESST 1548
            R GKSCRRIR   T++  D          C  +     ++  H +L +   G N ++   
Sbjct: 186  RAGKSCRRIRRNTTKTATDE---------CPPEHLPSFANNGHQSLMD-SAGNNDVKD-- 233

Query: 1547 LRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1368
             + +  T +M        E D+ ME ALQH+AQLIGQY           EK+RENN+  Q
Sbjct: 234  -QRHLPTSEMSENQRKSDESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQ 292

Query: 1367 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE---- 1215
            DSCDPGN+SDVTEER +MK+ E   +A   N  N   K ++VD           P     
Sbjct: 293  DSCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHI 352

Query: 1214 -TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQ 1041
             TS R  QN   II+ ES ASEF+      K+N            Y   Q P        
Sbjct: 353  GTSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP-------- 400

Query: 1040 SSTKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSP 870
             S   SP      ++S+    SL    A+V +   DN+GS+L AL++AK S++Q++N SP
Sbjct: 401  -SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSP 459

Query: 869  PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLG 693
               G +S     P    T + D   I    PGLFRLPTD+Q E      +  FP   S  
Sbjct: 460  IAEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSA 515

Query: 692  RFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSSNRMNRLDSYTNPV-LP 516
                EP         D F T PY     E P +        +  + +N    + +P    
Sbjct: 516  NHFHEP-------GYDQFSTTPYM----ESPSNAITGLPYTTGFDYLNPPSGFGHPFSSK 564

Query: 515  SVKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGL 414
            S   +YPF P+ T  V        PL E   +                  R+ P +E G 
Sbjct: 565  STYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGK 624

Query: 413  PPVMRLSSYDEHVRPDMYR 357
            PP   +S YD H+RP+MYR
Sbjct: 625  PPSFPVSHYDAHLRPNMYR 643


>ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267607 [Solanum
            lycopersicum]
          Length = 617

 Score =  308 bits (788), Expect = 9e-81
 Identities = 250/682 (36%), Positives = 328/682 (48%), Gaps = 50/682 (7%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            +D++QRKT  M E+++MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ
Sbjct: 7    KDQDQRKTVGM-ENSSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1899
            RKKAEKATA VL+ILEN GI+D SEEFDS SDQE    + K  +                
Sbjct: 66   RKKAEKATAAVLSILENEGITDASEEFDSGSDQEAIFSNSKGADSTDNRNEYKPDPSNVK 125

Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSNSLSA-R 1725
             +               STGRSLSW+S K S  + E+  Y DS  RR  SF S   S+ +
Sbjct: 126  ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGTSSPK 185

Query: 1724 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1545
            R GKSCRRIR  +T +  +  ND                              QL   T 
Sbjct: 186  RAGKSCRRIRRSNTNAGNNDVND------------------------------QLHLPTS 215

Query: 1544 RSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1365
             ++   +K D       E D+ ME ALQH+A LIG+Y           EK+RENN   QD
Sbjct: 216  ETSENQRKAD-------ESDEGMERALQHKALLIGKYEAEEKAQREWEEKYRENNYA-QD 267

Query: 1364 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFS--------EKPETS 1209
            SCDPGN+SDVTEER +MK+ E   +A   N  N   K ++VD   +          P  S
Sbjct: 268  SCDPGNYSDVTEERDDMKAFEQPYSAEMINLQNHANKFQEVDIPSTNGVTDNVPSNPHIS 327

Query: 1208 KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038
                +++N   II+ ES ASEF+ P    K+N            Y   Q P         
Sbjct: 328  TSCRKDQNCSRIINSESPASEFALP----KSNGSCPENDGPTPAYCHHQLP--------- 374

Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 867
            S+  SP +    ++S+    SL    A+V     DN+GS+L AL++AK S++Q++N S P
Sbjct: 375  SSNGSPIQPLENSISSSGGSSLQAGQALVSGDASDNIGSILGALEQAKFSISQQINVS-P 433

Query: 866  TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 690
              GR+S    + S       D   IP   PGLFRLPTD+Q E      +  FP   S   
Sbjct: 434  VEGRSS---IEHSIPTAKIEDRLDIPPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 490

Query: 689  FLSEPFDSRSAFSSDLFLTDPYR-----PFTPERPFSQPRLSEGPSSSNRMNRLDSYTNP 525
               EP    + FS+  ++  P       P+T    +  P  S G   S++          
Sbjct: 491  HFHEP--GYNQFSATPYMESPSNAITGLPYTTGFDYLNPPSSFGHPFSSK---------- 538

Query: 524  VLPSVKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSE 423
               S   +YPF P+ T  V        PL E   +                  R+ P +E
Sbjct: 539  ---STYPTYPFRPNTTTTVSQSQASWSPLYESSLTKSSPVVVPNLSSGEDVFLRSLPRNE 595

Query: 422  RGLPPVMRLSSYDEHVRPDMYR 357
             G PP   +S YD H+RP+MYR
Sbjct: 596  TGKPPSFPVSHYDAHMRPNMYR 617


>gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]
          Length = 654

 Score =  305 bits (782), Expect = 5e-80
 Identities = 257/689 (37%), Positives = 345/689 (50%), Gaps = 57/689 (8%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESN--AMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2079
            E ++QR ++SM++S   AMTIEFLRARLLSERSVS++ARQRADEL K+V ELEEQL+ VS
Sbjct: 7    EKQDQRSSSSMEDSQSTAMTIEFLRARLLSERSVSRSARQRADELEKRVEELEEQLRIVS 66

Query: 2078 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1899
            LQRK AEKAT +VL+ILENHGISD SE +DS SDQE         NG             
Sbjct: 67   LQRKMAEKATVDVLSILENHGISDASETYDSGSDQETHQVANNYANGEERSVVSK----- 121

Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-----ASFGSNSL 1734
            R++                GRSLSW+   DS  + EK  Y DS  RR     +SFGS+S 
Sbjct: 122  RRSVLEELSGSDLDSSPINGRSLSWKGRSDSSRSREK--YKDSSVRRQNALSSSFGSSS- 178

Query: 1733 SARRVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLES 1554
                VGKSCR+IR R+TR++ +                     E +     ENG      
Sbjct: 179  PKHYVGKSCRQIRCRETRTVVEDHKT-----------------EPLKFDSQENGAATPPE 221

Query: 1553 STLRSNSETQKMDGRYFDV--HERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENN 1380
             +++++         + DV  H ++ DM+ AL+H+AQLIGQY           EK+RENN
Sbjct: 222  GSVKNDRRIP----NHLDVNGHGQEKDMKKALEHRAQLIGQYEEMEKAQREWEEKYRENN 277

Query: 1379 SGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-ACFSEKPETS-- 1209
            + T DS DPGNHSDVTE+R E+K+  L    G   +   + K  +VD +  S KP+++  
Sbjct: 278  TSTPDSYDPGNHSDVTEDRDEVKAQTLYN-VGIDIAQAVDAKSNKVDLSKESSKPQSNGF 336

Query: 1208 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQ 1071
                           ++  N + ++    A EF+FP ++EK  QE L    +   +R  +
Sbjct: 337  LHPTRTRAAMGDLKVQASSNIDPVASRFQAQEFAFPTAKEKEAQESL----ENRDFRPSE 392

Query: 1070 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLEL--------AVVPQDN---LGSVL 924
             P   Q   +S     P++    ALS     S   +         A+VP +    LG VL
Sbjct: 393  SPHHGQLLHRSLPN-QPFDR--GALSDAGSSSHKRDFSGSQNDLYALVPHNPPVVLGGVL 449

Query: 923  EALKRAKSSLNQKLNNSP----PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPT 756
            +ALK+AK SL QK+N  P     T   A     +P+   T   D  +IPV   GLFRLPT
Sbjct: 450  DALKQAKLSLQQKINRLPLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVGCTGLFRLPT 509

Query: 755  DYQPENARPGFANFPPENSLGRFLSEPF--DSRSAFSS-DLFLTDPY----RPFTPE-RP 600
            D+    A    ANF    S  R   EP+  D++ A ++ D FLT PY      F P+ R 
Sbjct: 510  DFATVEASTQ-ANFLSSGS--RLSLEPYYPDNKVALTAPDRFLTSPYIESRSEFPPDVRF 566

Query: 599  FSQPRLSEGPSSSNRMNRLDSYTNPVLPSVK--------DSYPFLPDVTLRVPLNEGGAS 444
             +   +  G  +S   +R DS+ +    SV          SYP  PD   R+P +E G  
Sbjct: 567  LTSSSVVSGSRASTLNSRFDSHFDTGPSSVNRYSNYPPHPSYPPFPDSMPRIPSDE-GLR 625

Query: 443  RNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
            R F SS     P  R S YD+H RP+MYR
Sbjct: 626  RPFRSSRSFGLPEDRFSFYDDHGRPNMYR 654


>ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2
            [Solanum tuberosum]
          Length = 618

 Score =  305 bits (781), Expect = 6e-80
 Identities = 253/678 (37%), Positives = 322/678 (47%), Gaps = 46/678 (6%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            +D++QRK   M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ
Sbjct: 7    QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1899
            RKKAEKATA VL+ILEN GISD SEEFDS SDQE    + K  +                
Sbjct: 66   RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125

Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1725
             +               STGRSLSW+S K S  + E+  Y DS  RR  SF S  S S +
Sbjct: 126  ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185

Query: 1724 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1545
            R GKSCRRIR   T +  +   D                            +  L +S +
Sbjct: 186  RAGKSCRRIRRNTTNAGNNDVKD----------------------------QRHLPTSEM 217

Query: 1544 RSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1365
              N   +K D       E D+ ME ALQH+AQLIGQY           EK+RENN+  QD
Sbjct: 218  SENQ--RKSD-------ESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQD 268

Query: 1364 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE----- 1215
            SCDPGN+SDVTEER +MK+ E   +A   N  N   K ++VD           P      
Sbjct: 269  SCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHIG 328

Query: 1214 TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038
            TS R  QN   II+ ES ASEF+      K+N            Y   Q P         
Sbjct: 329  TSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP--------- 375

Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSPP 867
            S   SP      ++S+    SL    A+V +   DN+GS+L AL++AK S++Q++N SP 
Sbjct: 376  SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSPI 435

Query: 866  TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 690
              G +S     P    T + D   I    PGLFRLPTD+Q E      +  FP   S   
Sbjct: 436  AEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 491

Query: 689  FLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSSNRMNRLDSYTNPV-LPS 513
               EP         D F T PY     E P +        +  + +N    + +P    S
Sbjct: 492  HFHEP-------GYDQFSTTPYM----ESPSNAITGLPYTTGFDYLNPPSGFGHPFSSKS 540

Query: 512  VKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGLP 411
               +YPF P+ T  V        PL E   +                  R+ P +E G P
Sbjct: 541  TYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGKP 600

Query: 410  PVMRLSSYDEHVRPDMYR 357
            P   +S YD H+RP+MYR
Sbjct: 601  PSFPVSHYDAHLRPNMYR 618


>gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica]
          Length = 690

 Score =  305 bits (780), Expect = 8e-80
 Identities = 256/700 (36%), Positives = 333/700 (47%), Gaps = 68/700 (9%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            + ++QR    M++S AMTIEFLRARLL+ERSVS++ARQR DEL + V ELEEQLK VSLQ
Sbjct: 7    DTQDQRSNLGMEDSTAMTIEFLRARLLAERSVSRSARQRVDELERMVEELEEQLKIVSLQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVS-EEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLR 1896
            RK AEKAT +VLAILE+ GISD+S EEFDS SDQ E+    K  N            K+R
Sbjct: 67   RKMAEKATEDVLAILESQGISDISEEEFDSSSDQ-ETHQGSKVGNSLANEEESFVISKVR 125

Query: 1895 KNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-- 1722
            +                 GRSLSW+   DS  + EK   + SVRRR+SF S   S+ R  
Sbjct: 126  RKEQEEHSGSDADSSLIPGRSLSWKGRIDSPRSREKCKDL-SVRRRSSFSSIGFSSPRHH 184

Query: 1721 VGKSCRRIRHRDTRSME-DSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESS 1551
            +GKSCR+I+H++TRS + DS  +G    A S    N S+G    LRE      +  L + 
Sbjct: 185  LGKSCRQIKHKETRSDKFDSHENGV--GASSEGLPNFSNGGPEKLREGSEFPEEKVLSND 242

Query: 1550 TLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGT 1371
            +L    E Q+     F+ H RD DME AL+HQA+LI +            EKFRENN+ T
Sbjct: 243  SLSRTKENQRDSDLDFNGHGRDKDMEKALEHQAKLICENEEMEKAQREWEEKFRENNTST 302

Query: 1370 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC------------FS 1227
             DSCDPGNHSD+TEER E+K+ +   +AG   +  QETK E+ D C            F 
Sbjct: 303  PDSCDPGNHSDITEERDEIKA-QTPCSAGVVVAQAQETKSEEGDVCLPKETFKIQQNGFL 361

Query: 1226 EKPETSKRSLQNE--NIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQ 1053
                     LQ++        S   EF+FP    K N E L        + S   P +  
Sbjct: 362  PASHVDMGGLQDQLNKSTVAPSQVEEFAFPTENGKQNHESLENFARHPSHGSHPNPLVHG 421

Query: 1052 TTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEALKRAKSSLNQKL 882
            +    S+  S     S         S     A+VP   QD LG VL+ALK+AK SL Q +
Sbjct: 422  SAHNRSSDASSSVAGSGFHKGNASGSRSDLYALVPHDSQDRLGGVLDALKQAKLSLQQNM 481

Query: 881  NNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA----------- 735
               P   G +     +PS       D  +IPV   GLFRLPTD+  E A           
Sbjct: 482  TRLPLVDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFAVEEAATQSSFLGSSW 541

Query: 734  -----------------RPGFANFPPENSLGRFLSEPF-DSRSAFS---SDLFLTDPYRP 618
                             RP F+     N+  R++  P+ ++R  FS   +D F+ + Y  
Sbjct: 542  SGRYCPETLVTSSFVETRPTFS----MNAADRYVPSPYIETRQTFSTNATDRFIPNAYVE 597

Query: 617  FTPERPFSQPR-LSEGPSSSNRMN------------RLDSYTNPVLPSVKDSYPFLPDVT 477
              P  P +        PS   R N                Y  P  P    +YP +PD T
Sbjct: 598  SRPNFPANAAEPFVTSPSVDTRSNFPADNRFLSGPYSESGYAQPPYP----NYPSVPDRT 653

Query: 476  LRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
              +  +E   +R  P    G  P  R S YD+  RP+MYR
Sbjct: 654  PWITSDE-ALTRALPRKPVG-APTDRFSFYDQ-FRPNMYR 690


>ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca
            subsp. vesca]
          Length = 807

 Score =  303 bits (776), Expect = 2e-79
 Identities = 221/577 (38%), Positives = 297/577 (51%), Gaps = 34/577 (5%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            + ++ R  + M +S  +TIEFLRARLLSERSVS++ARQRADEL K V ELEEQLK VSLQ
Sbjct: 7    DTQDLRINSGMDDSPGITIEFLRARLLSERSVSRSARQRADELEKMVEELEEQLKIVSLQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQE---ESPHDFKARNGXXXXXXXXXXXK 1902
            RK AEKATA+VLAILEN G SD+SEEFDS SD E   ES    K+R              
Sbjct: 67   RKMAEKATADVLAILENQGASDISEEFDSSSDHETFQESKMGNKSRKEEENFLISE---- 122

Query: 1901 LRKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSA 1728
             R+N                GR+LSW+   DS  + EK     S+RRR++F +  +S S 
Sbjct: 123  -RRNEHEEYSGSDLDSSSIPGRNLSWKGRIDSPRSREKYK-EPSIRRRSTFSAVGSSSSR 180

Query: 1727 RRVGKSCRRIRHRDTRSM-----------EDSQNDGTEKAACSGDAFNGSDGEHVALREY 1581
              +GKSCR+I+HR+TRS+           +DS+ +G   ++     F+  D E +     
Sbjct: 181  HNLGKSCRQIKHRETRSVVERSKDEPAKFDDSEENGVAASSEGLSNFSYCDPERLRDGPE 240

Query: 1580 ENGKNQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXX 1401
               +  L    L  + E Q+     F+ H R+ DME AL+HQAQLIGQ            
Sbjct: 241  SQKEKFLSKDALTRSKEHQRNGDPNFNGHGRNKDMERALEHQAQLIGQNEEMEMAQREWE 300

Query: 1400 EKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC-FSE 1224
            EKFRENN+ T DSCDPGNHSD+TEER EMK+P     A  + S+ QE K E  D+C F E
Sbjct: 301  EKFRENNTSTPDSCDPGNHSDITEERDEMKTP---FPAEINASEAQEAKSEARDSCLFEE 357

Query: 1223 KPET--------------SKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQ 1086
            K +T                +   N + ++  S   EF+FP + E+  QE L        
Sbjct: 358  KMKTQLNGYLPPSDVEMGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLENNAHQPS 417

Query: 1085 YRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEAL 915
              S   P +++++   S+ +S     S   ++  +  L    A+VP   Q+ LG VL+AL
Sbjct: 418  PGSHHDPLLLESSHNRSSVVSSDGGSSFHNASGSRNDL---YALVPHDSQERLGGVLDAL 474

Query: 914  KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA 735
            K+AK SL QK+   P     +     +P        +   IPV   GLFRLPTD+  E A
Sbjct: 475  KQAKLSLQQKIIRLPLVDDTSVQESIEPPIPAVTTGNRLDIPVGCAGLFRLPTDFAVEEA 534

Query: 734  RPGFANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPY 624
                +     +SL      P    +A S+D F+T  Y
Sbjct: 535  ATKHSYLGLGSSLPSARYCPDKGLAASSTDQFVTSTY 571


>ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus]
          Length = 671

 Score =  300 bits (767), Expect = 3e-78
 Identities = 250/703 (35%), Positives = 335/703 (47%), Gaps = 71/703 (10%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            + ++ R    ++++ AMTIEFLRARLLSERSVSK+ARQRADELAK+VAELEEQLK VSLQ
Sbjct: 7    DQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSLQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            RK AEKATA+VLAILE++G SD+SE  DS SD E  P   K  +G           + R+
Sbjct: 67   RKMAEKATADVLAILEDNGASDISETLDSNSDHETEP---KVEDGLAREDVSSGTVR-RR 122

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSARRV 1719
            N                G SLSW+   DS H  EK     S+R R+SF S  +S    ++
Sbjct: 123  NEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYK-KHSIRSRSSFTSIGSSSPKHQL 181

Query: 1718 GKSCRRIRHRDTRSMEDSQN-------DGTEKAACSG--DAFNGSDGEHVALRE-YENGK 1569
            G+SCR+I+ RDTR ++  Q        D +E+   +   D+ N S   H  LR+ YE  +
Sbjct: 182  GRSCRQIKRRDTRPLDGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDGYEVRE 241

Query: 1568 NQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFR 1389
                SS+   NS          D +E+ DDME AL+ QAQLI QY           EKFR
Sbjct: 242  KTRSSSSGVHNSVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFR 301

Query: 1388 ENNSGTQDSCDPGNHSDVTEERYEMK--SPELSRAAGTS-------NSDNQETKQEQVDA 1236
            ENN+ T DSCDPGNHSD+TEER EM+  +P LS             + D ++  Q Q + 
Sbjct: 302  ENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNPANEAKPQVAFDCDTRDLSQAQTNG 361

Query: 1235 CFSEKPETSKRSL--QNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPP 1062
                        L  QN N IS   S  EF+FPM+  K  QE        SQ  S Q P 
Sbjct: 362  LGPSMCAVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQE--------SQENSAQEPS 413

Query: 1061 --------MVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRA 906
                    + +    S   I+ Y++++   +      +P E        L  VLEALK+A
Sbjct: 414  CTSHLNHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHE-----PPALDGVLEALKQA 468

Query: 905  KSSLNQKLNNSPPTAG------RASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQP 744
            K SL +K+   P   G      ++ G +  P        D  +IPV   GLFRLPTD+  
Sbjct: 469  KLSLTKKIIKLPSVDGESESIDKSIGPLSIPKMG-----DRLEIPVGCAGLFRLPTDFAA 523

Query: 743  E-NARPGF----------ANFPPENSL----------------------GRFLSEPFDSR 663
            E +++  F           ++P E +                        R  S  + + 
Sbjct: 524  EASSQANFLASSSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSGYRAG 583

Query: 662  SAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSSNRMNRLDSYTNPVLP-SVKDSYPFLP 486
            S F+ D FLTD      PE  +  P          + +  D Y + V P S   +YP  P
Sbjct: 584  SGFTRDGFLTD----HIPENRWKNP---------GQKHHFDQYFDAVQPSSYVHNYPPRP 630

Query: 485  DVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
             V+  +  N+    R FP     +PP  + S YD+  RP+MYR
Sbjct: 631  -VSSNIHPND-TFLRTFPGRSTEMPPTNQYSFYDDQFRPNMYR 671


>gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 709

 Score =  297 bits (761), Expect = 1e-77
 Identities = 238/717 (33%), Positives = 354/717 (49%), Gaps = 83/717 (11%)
 Frame = -2

Query: 2258 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2079
            S + ++ ++TT   E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS
Sbjct: 4    SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63

Query: 2078 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1899
            +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P +    NG           K+
Sbjct: 64   VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122

Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1725
            R+               ++GRSLSW+  K + H+ E+  Y D  VR R SF S S S+R 
Sbjct: 123  RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180

Query: 1724 -RVGKSCRRIRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYE 1578
             R GKSCR+IR R++RS          M D Q  G E ++   +A + + G H+     E
Sbjct: 181  HRQGKSCRQIRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSE 239

Query: 1577 NGKNQLESSTLRSNSETQKMDGRYFDV----HERDDDMESALQHQAQLIGQYXXXXXXXX 1410
              +N+     L S++   + +   FD+    +E + DME AL+HQAQLI  Y        
Sbjct: 240  IHENKSTVDNLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQR 299

Query: 1409 XXXEKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACF 1230
               EKFRE NS + DSCDPGNHSDVTEER E+K+ +    +GT+ S  Q  ++E + +  
Sbjct: 300  EWEEKFREKNSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFS 357

Query: 1229 SEKPETS--------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFL 1110
            +E P+                       RSL  E+ ++  S   + +F M++E ++Q   
Sbjct: 358  AELPKIHSNDLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ 416

Query: 1109 GIQHDASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG- 933
               +++    S  F     +    + +    +  S +    P+    L  A+VP +  G 
Sbjct: 417  --SNNSPSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGR 473

Query: 932  --SVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLP 759
               VL++LK+A+ SL QK++      G + G   + S +     +  +IP+   GLFR+P
Sbjct: 474  FTGVLDSLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVP 533

Query: 758  TDYQPENARPGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPE 606
            TD   E  +  F         AN  P+  +    S    + S  ++    +  Y+P + +
Sbjct: 534  TDISVEAPKANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSD 593

Query: 605  R----PFSQPRLSEGP----------------------SSSNRMN----RLDSYTNPVLP 516
            R    P+  PR S  P                       + +R++      D    PVLP
Sbjct: 594  RFFSGPYMYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLP 653

Query: 515  SVK----DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
            S       ++P  PD+  ++   EG  + +   S    P     S YD H RPD++R
Sbjct: 654  SSSLQNYPTFPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 708


>gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris]
          Length = 652

 Score =  295 bits (756), Expect = 5e-77
 Identities = 239/666 (35%), Positives = 337/666 (50%), Gaps = 39/666 (5%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            + ++QR  +S ++S AMTIEFLRARLLSERS+SK+ARQRADELA+KV ELEEQL+ V LQ
Sbjct: 7    DPQDQRIASSTEDSTAMTIEFLRARLLSERSISKSARQRADELAEKVMELEEQLRMVILQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            RK AEKATA+VLAILE+ GIS VS+EFDS SD  E+P D    N            K R+
Sbjct: 67   RKMAEKATADVLAILESQGISGVSDEFDSGSDL-ENPFDSSMSNECAKEDEGPMKSKGRQ 125

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEK-KNYMDSVRRRASFGSNSLSAR-RV 1719
            +               + +SLSW+   D  H+LEK K    +VRR++SF S S S + R+
Sbjct: 126  HGSDEMSGSNEDSSLVSSKSLSWKGRHDLSHSLEKYKTKSTNVRRQSSFSSFSSSPKHRL 185

Query: 1718 GKSCRRIRHRDTRS-MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLR 1542
            GKSCR+IRHR  RS ME+S+           +  + S+G       + +G     S+ L+
Sbjct: 186  GKSCRKIRHRQPRSVMEESRGKFVHVNCQVNELVSSSEG----FPNFRDG----GSNILK 237

Query: 1541 SNSETQKMDG---------RYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFR 1389
              S+ Q+ DG          + D + R+++ME AL+HQA+LI QY           EKFR
Sbjct: 238  IESKIQEEDGSEANLLSKNHHIDGYGRENEMEKALEHQAELIDQYEAMEKAQREWEEKFR 297

Query: 1388 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS 1209
            ENNS T DSCDPGNHSD+TE++ E K  ++  AA    S  +E+K E    C SE+    
Sbjct: 298  ENNSTTPDSCDPGNHSDMTEDKDEGK-VQIPYAAKVVTSKAEESKGEPGGVCLSEE---- 352

Query: 1208 KRSLQNENIISCESSASE-FSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQSST 1032
            K   +   I+  +   ++ +    S   +  +FLG ++  S  +  Q   +V   +QSS 
Sbjct: 353  KLKAEGREIMPKKHDDTDVYRNQKSTTFSTSDFLGQENSHSPLKGNQNEILVNGHSQSSD 412

Query: 1031 KISPYEEKSTALST----------PPKISLPLELAVVPQDN--LGSVLEALKRAKSSLNQ 888
                 + + ++  T            K    L   V  + +     VLE+LK+A+ SL Q
Sbjct: 413  MNHLDQGRHSSFPTDIHGVQHQHDASKNQKDLYALVTREQSHQFDGVLESLKQARISLQQ 472

Query: 887  KLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPP 708
            +LN  P   G   G   +P  + +   D F+IP    GLFRLPTD+  E A P F    P
Sbjct: 473  ELNRLPVVEG---GYTAKPLPSVSKNEDRFEIPFGFSGLFRLPTDFSDE-ATPRFNVRDP 528

Query: 707  ENSLGRFLSEPFDSRSAFSSDLFLTDP-------YRPFTPERPFSQPRLSEGPSSSNRMN 549
                G        + S  S   F T+P         P   ++  +   L  G   S+  +
Sbjct: 529  TTGFGSNY-HLNGTMSRTSVGQFFTNPPHSGKMLMSPSANDQALATRYLENGSRFSSSQS 587

Query: 548  RLDSYTN-PVLPSVKDSYPFLP------DVTLRVPLNEGGASRNFPSSERGLPPVMRLSS 390
              D ++N   L S K SYP  P      + T ++P  +   SR + +S  G+P   R S 
Sbjct: 588  PFDPFSNGGPLSSSKYSYPTFPINPSYQNATPQMPFGD-EVSRPYSNSTVGVPLANRFSF 646

Query: 389  YDEHVR 372
             D+H+R
Sbjct: 647  NDDHLR 652


>gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 749

 Score =  294 bits (752), Expect = 1e-76
 Identities = 237/708 (33%), Positives = 348/708 (49%), Gaps = 83/708 (11%)
 Frame = -2

Query: 2231 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2052
            TT   E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS+QR++AEKA
Sbjct: 53   TTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSVQRRRAEKA 112

Query: 2051 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1872
            TA+VLAILEN+G+SD+SEE DS SDQ ++P +    NG           K+R+       
Sbjct: 113  TADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKVRQKESEELS 171

Query: 1871 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR--RVGKSCRR 1701
                    ++GRSLSW+  K + H+ E+  Y D  VR R SF S S S+R  R GKSCR+
Sbjct: 172  GSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRKHRQGKSCRQ 229

Query: 1700 IRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1551
            IR R++RS          M D Q  G E ++   +A + + G H+     E  +N+    
Sbjct: 230  IRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSEIHENKSTVD 288

Query: 1550 TLRSNSETQKMDGRYFDV----HERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFREN 1383
             L S++   + +   FD+    +E + DME AL+HQAQLI  Y           EKFRE 
Sbjct: 289  NLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREK 348

Query: 1382 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS-- 1209
            NS + DSCDPGNHSDVTEER E+K+ +    +GT+ S  Q  ++E + +  +E P+    
Sbjct: 349  NSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSN 406

Query: 1208 ------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQY 1083
                               RSL  E+ ++  S   + +F M++E ++Q      +++   
Sbjct: 407  DLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSN 463

Query: 1082 RSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALK 912
             S  F     +    + +    +  S +    P+    L  A+VP +  G    VL++LK
Sbjct: 464  SSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLK 522

Query: 911  RAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENAR 732
            +A+ SL QK++      G + G   + S +     +  +IP+   GLFR+PTD   E  +
Sbjct: 523  QARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPK 582

Query: 731  PGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQ 591
              F         AN  P+  +    S    + S  ++    +  Y+P + +R    P+  
Sbjct: 583  ANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMY 642

Query: 590  PRLSEGP----------------------SSSNRMN----RLDSYTNPVLPSVK----DS 501
            PR S  P                       + +R++      D    PVLPS       +
Sbjct: 643  PRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPT 702

Query: 500  YPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
            +P  PD+  ++   EG  + +   S    P     S YD H RPD++R
Sbjct: 703  FPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 748


>ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4
            [Glycine max]
          Length = 641

 Score =  288 bits (738), Expect = 6e-75
 Identities = 234/664 (35%), Positives = 326/664 (49%), Gaps = 37/664 (5%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            + ++QR T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQ
Sbjct: 7    DPQDQRVTSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            RK AEKATA+VLAILE+ GISDVSEEFDS SD  E+P D    N            K R+
Sbjct: 67   RKMAEKATADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQ 125

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVG 1716
            +               + +SLSW+   DS H+LEK     ++RR++SF S S S + R G
Sbjct: 126  HGSDKMPGSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQG 184

Query: 1715 KSCRRIRHRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESST 1548
            KSCR+IRHR  R  +E+S+N   +  ++ A     F    G    + + E+   +   S 
Sbjct: 185  KSCRKIRHRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSG 244

Query: 1547 LRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1368
                ++   +DG     + R+ DME AL+HQAQLI QY           EKFRENNS T 
Sbjct: 245  ANPLNKNHHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTP 299

Query: 1367 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNE 1188
            DSCDPGN+SD+TE++ E K   +  AA    SD QE+K E    C SE+    K   +  
Sbjct: 300  DSCDPGNYSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEAR 354

Query: 1187 NII-SCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQSSTKISPYEE 1011
            +I+         +S   +   +  + LG Q+     +  Q    V    Q S        
Sbjct: 355  DIMPKTHDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPG 414

Query: 1010 KSTALSTPPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLN 891
            +     + P  S P ++  V   N                       VLE+LK+A+ SL 
Sbjct: 415  RHGYHDSKPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQ 474

Query: 890  QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPG 726
            Q+L   P      SG   +PS + +   D F++PV   GLFR+PTD+        N +  
Sbjct: 475  QELKRLPLV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDP 531

Query: 725  FANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSS 561
             A F     L R +S   D +       F + PY       P +   L+      GP+  
Sbjct: 532  TAGFGSNFHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGG 585

Query: 560  NRMNRLDSY-TNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYD 384
            +  +   +Y T P+ PS +++ P +P        NE   SR + SS  G+P   R S   
Sbjct: 586  SLSSSKYTYPTFPINPSYQNATPQMPFG------NE--VSRPYSSSTVGVPLANRFSFNS 637

Query: 383  EHVR 372
            +H+R
Sbjct: 638  DHLR 641


>ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Glycine max] gi|571568788|ref|XP_006606285.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X2
            [Glycine max] gi|571568792|ref|XP_006606286.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X3
            [Glycine max]
          Length = 664

 Score =  284 bits (726), Expect = 1e-73
 Identities = 232/657 (35%), Positives = 321/657 (48%), Gaps = 37/657 (5%)
 Frame = -2

Query: 2231 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2052
            T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQRK AEKA
Sbjct: 37   TSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQRKMAEKA 96

Query: 2051 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1872
            TA+VLAILE+ GISDVSEEFDS SD  E+P D    N            K R++      
Sbjct: 97   TADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQHGSDKMP 155

Query: 1871 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVGKSCRRIR 1695
                     + +SLSW+   DS H+LEK     ++RR++SF S S S + R GKSCR+IR
Sbjct: 156  GSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQGKSCRKIR 214

Query: 1694 HRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRSNSET 1527
            HR  R  +E+S+N   +  ++ A     F    G    + + E+   +   S     ++ 
Sbjct: 215  HRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSGANPLNKN 274

Query: 1526 QKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSCDPGN 1347
              +DG     + R+ DME AL+HQAQLI QY           EKFRENNS T DSCDPGN
Sbjct: 275  HHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTPDSCDPGN 329

Query: 1346 HSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENII-SCE 1170
            +SD+TE++ E K   +  AA    SD QE+K E    C SE+    K   +  +I+    
Sbjct: 330  YSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEARDIMPKTH 384

Query: 1169 SSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALST 990
                 +S   +   +  + LG Q+     +  Q    V    Q S        +     +
Sbjct: 385  DDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDS 444

Query: 989  PPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLNQKLNNSP 870
             P  S P ++  V   N                       VLE+LK+A+ SL Q+L   P
Sbjct: 445  KPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLP 504

Query: 869  PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPGFANFPPE 705
                  SG   +PS + +   D F++PV   GLFR+PTD+        N +   A F   
Sbjct: 505  LV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDPTAGFGSN 561

Query: 704  NSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSSNRMNRLD 540
              L R +S   D +       F + PY       P +   L+      GP+  +  +   
Sbjct: 562  FHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGGSLSSSKY 615

Query: 539  SY-TNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVR 372
            +Y T P+ PS +++ P +P        NE   SR + SS  G+P   R S   +H+R
Sbjct: 616  TYPTFPINPSYQNATPQMPFG------NE--VSRPYSSSTVGVPLANRFSFNSDHLR 664


>ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa]
            gi|222850857|gb|EEE88404.1| hypothetical protein
            POPTR_0008s02540g [Populus trichocarpa]
          Length = 684

 Score =  281 bits (719), Expect = 9e-73
 Identities = 240/691 (34%), Positives = 342/691 (49%), Gaps = 59/691 (8%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            E ++QR  +SM++S A+TIEFLRARLL+ERSVS+TARQRADELA++VAELEEQL+ VSLQ
Sbjct: 7    EKQDQRTRSSMEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQLRIVSLQ 66

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            R KAEKAT +VLAILE++GISD SE F S SDQ+ +P + K               K+ K
Sbjct: 67   RMKAEKATVDVLAILESNGISDDSEIFGSSSDQD-TPCESKVGK-KTKQEESSVISKVTK 124

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-VG 1716
                           S GR+LSW+  K S  +LEK     S+RRR+SF S S S +   G
Sbjct: 125  YKLEEHSGSGHDFSSSQGRNLSWKGRKHSPRSLEKCKD-PSLRRRSSFASTSSSPKHHQG 183

Query: 1715 KSCRRIRHRDTR-------SMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE 1557
            KSCR++R++++R       +  D  +      A + + F       V     ENG+ +  
Sbjct: 184  KSCRQVRNKESRLTIGAFRTNPDKVDSPENGVATTSEVFPNCSEPEVG--RIENGEEKTL 241

Query: 1556 SSTLRSNSETQKMDGRYFD--VHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFREN 1383
                      Q+ D    +  V+  D DME AL+HQAQLI +Y           EKFREN
Sbjct: 242  PPISVGLENGQRADSNELEDNVYGSDRDMEKALEHQAQLIDRYKAMEKVQREWEEKFREN 301

Query: 1382 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKR 1203
            N  T DS D GN SDVTEE YE+K+ ++ +  GT  + +   K E V+   + +P    R
Sbjct: 302  NGSTPDSYDAGNRSDVTEEGYEIKA-QVQQHTGTVAAQSNRAKSE-VEKASNIQPNGILR 359

Query: 1202 ----------SLQNENIISCESSASEFSFPMSREK--NNQEFLGIQHDASQYRSQQFPPM 1059
                        ++ +  + ES A +F+F   ++K   N+E LG  +  S + S   P  
Sbjct: 360  PSHVNIGQLQEWKSSSAPTSESPAQDFAFRAEKQKQNENEESLGNNYHPSPHSSHDHP-- 417

Query: 1058 VQTTTQSSTKISPYEEKSTALSTPPKISLPL--------EL-AVVP---QDNLGSVLEAL 915
                   S+  SP  + +T+  +                EL A+VP    + LG VL+AL
Sbjct: 418  ----QSHSSHDSPGSQSATSFPSNTDSGFSKGQFSGRQNELYALVPHRASNELGGVLDAL 473

Query: 914  KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-- 741
            K A+ SL QK++  P   G +  +   PS       D   IP+ + GLFRLP D+  E  
Sbjct: 474  KLARQSLQQKISTLPLIEGGSIRNSVDPSLPPPIPGDKVDIPLGNAGLFRLPFDFLAEGS 533

Query: 740  --------NARPGFANFPPEN-----SLGRFLSE-PFDSRSAF-SSDLFLTDPYRPFTPE 606
                    NA     N+ P+      ++ RF+S  P  + S F ++D FL       T  
Sbjct: 534  TRKNLDSTNAGLSLRNYYPDTGVPAAAINRFVSRFPTATGSRFPTADQFLASQSYSATGS 593

Query: 605  RPFSQPRL--SEGPSSSNRMNRLDSYTNPVL-----PSVKDSYPFLPDVTLRVPLNEGGA 447
            R  ++ +   S+   + +R++    +  P L     PS + SYP  P     +P      
Sbjct: 594  RFPTEDQFLASQDVEAGSRISSQRPFFYPYLDTVSPPSARYSYPTNPSYPGPMPQLPSRE 653

Query: 446  SRNF-PSSERGLPPVMRLSSYDEHVRPDMYR 357
              +F PS+  G+PP    S  D H+RP+MYR
Sbjct: 654  PPSFLPSTTAGVPPADHFSFPDYHIRPNMYR 684


>gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508727307|gb|EOY19204.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 665

 Score =  280 bits (717), Expect = 2e-72
 Identities = 230/704 (32%), Positives = 339/704 (48%), Gaps = 70/704 (9%)
 Frame = -2

Query: 2258 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2079
            S + ++ ++TT   E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS
Sbjct: 4    SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63

Query: 2078 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1899
            +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P +    NG           K+
Sbjct: 64   VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122

Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1725
            R+               ++GRSLSW+  K + H+ E+  Y D  VR R SF S S S+R 
Sbjct: 123  RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180

Query: 1724 -RVGKSCRRIRHRDTRSM-EDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1551
             R GKSCR+IR R++RS+ E+ ++D                     +      K    SS
Sbjct: 181  HRQGKSCRQIRRRESRSVAEELKSDN--------------------IMVDPQVKGLENSS 220

Query: 1550 TLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGT 1371
             + +N  T             + DME AL+HQAQLI  Y           EKFRE NS +
Sbjct: 221  EVNANHST------------GEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREKNSSS 268

Query: 1370 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS------ 1209
             DSCDPGNHSDVTEER E+K+ +    +GT+ S  Q  ++E + +  +E P+        
Sbjct: 269  PDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSNDLVP 326

Query: 1208 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQ 1071
                           RSL  E+ ++  S   + +F M++E ++Q      +++    S  
Sbjct: 327  PSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSNSSHH 383

Query: 1070 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALKRAKS 900
            F     +    + +    +  S +    P+    L  A+VP +  G    VL++LK+A+ 
Sbjct: 384  FAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLKQARL 442

Query: 899  SLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGF- 723
            SL QK++      G + G   + S +     +  +IP+   GLFR+PTD   E  +  F 
Sbjct: 443  SLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFL 502

Query: 722  --------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQPRLS 579
                    AN  P+  +    S    + S  ++    +  Y+P + +R    P+  PR S
Sbjct: 503  GSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMYPRTS 562

Query: 578  EGP----------------------SSSNRMN----RLDSYTNPVLPSVK----DSYPFL 489
              P                       + +R++      D    PVLPS       ++P  
Sbjct: 563  SSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPTFPSY 622

Query: 488  PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
            PD+  ++   EG  + +   S    P     S YD H RPD++R
Sbjct: 623  PDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 664


>emb|CBI40233.3| unnamed protein product [Vitis vinifera]
          Length = 682

 Score =  279 bits (714), Expect = 3e-72
 Identities = 188/418 (44%), Positives = 238/418 (56%), Gaps = 32/418 (7%)
 Frame = -2

Query: 2222 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2043
            M++S AMTIEFLRARLLSERSVS+TARQRADELA++V +LEEQLK VS+QR KAEKATA+
Sbjct: 1    MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60

Query: 2042 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1863
            VLAILENH ISDVS EFDS SDQE +  D     G                         
Sbjct: 61   VLAILENHAISDVSWEFDSSSDQEVALCDSHVGGG------------------------- 95

Query: 1862 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR--VGKSCRRIRHR 1689
                    R LSW+SSKDS H++EK+    S+RRR SF S+  S+ +  +GKSCR+IR R
Sbjct: 96   --------RRLSWKSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRR 147

Query: 1688 DTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESSTL 1545
            +TRS          M DSQN+G    + S    NG D     LRE    + +  L    +
Sbjct: 148  ETRSAVDELKVGRVMVDSQNNGI--ISSSEGLPNGFDSGQEILREGSENQEEEALMDGQV 205

Query: 1544 RSNSETQKM---DGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSG 1374
              + E+Q+       + + + RD DME AL+HQAQLIGQY           EKFRENNS 
Sbjct: 206  SDSLESQRDATGSNHHLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSS 265

Query: 1373 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS----- 1209
            T DSC+PGNHSDVTEER E+K P+   AAG   S +Q TK +  D  F+E+   +     
Sbjct: 266  TPDSCEPGNHSDVTEERDEVK-PQAPSAAGILTSQDQGTKLDDEDVHFNEESSQTLPTIS 324

Query: 1208 -------KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFP 1065
                      LQ +N   +++ ES A +F FPM++E  +QEFL  Q     + S  +P
Sbjct: 325  TTHLHGDMECLQEQNRCSMLAYESLAPDFVFPMAKENLHQEFLENQSYPLSHSSHHYP 382



 Score =  105 bits (263), Expect = 7e-20
 Identities = 81/225 (36%), Positives = 110/225 (48%), Gaps = 24/225 (10%)
 Frame = -2

Query: 959  AVVPQDN---LGSVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIP 789
            A+VP++    LG VLEAL++A+ SL  KLN  P   G + G   +PS   T   +  +IP
Sbjct: 472  ALVPRETSNELGGVLEALQQARLSLQHKLNRLPLIEGGSIGRAIEPSFPSTRAWERVEIP 531

Query: 788  VISPGLFRLPTDYQ----------PENARPGFANFPPE-----NSLGRFLSEPF--DSRS 660
            V   GLFR+P DYQ            +++    N+ P+     N   RFL+ P+     S
Sbjct: 532  VGCAGLFRVPADYQLGTATEANFLGSDSQSSLKNYYPDTGFVANPGDRFLTSPYLKTGSS 591

Query: 659  AFSSDLFLTDPYRP----FTPERPFSQPRLSEGPSSSNRMNRLDSYTNPVLPSVKDSYPF 492
              + D FLT PYR       P RP        G S+S R      YT+P       +Y  
Sbjct: 592  VPTDDSFLTSPYRETGSRIPPLRPSFDYYSDAGLSASTR------YTHP-------TYSS 638

Query: 491  LPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357
             PD+  R+P NEG A R   +SE G+P     S YD+H+RP+MYR
Sbjct: 639  HPDLLYRMPFNEGFA-RPPRNSEVGIPSTDHFSFYDDHIRPNMYR 682


>ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like isoform X1 [Solanum
            tuberosum] gi|565389467|ref|XP_006360477.1| PREDICTED:
            flocculation protein FLO11-like isoform X2 [Solanum
            tuberosum] gi|565389469|ref|XP_006360478.1| PREDICTED:
            flocculation protein FLO11-like isoform X3 [Solanum
            tuberosum] gi|565389471|ref|XP_006360479.1| PREDICTED:
            flocculation protein FLO11-like isoform X4 [Solanum
            tuberosum] gi|565389473|ref|XP_006360480.1| PREDICTED:
            flocculation protein FLO11-like isoform X5 [Solanum
            tuberosum]
          Length = 678

 Score =  276 bits (705), Expect = 4e-71
 Identities = 224/649 (34%), Positives = 326/649 (50%), Gaps = 21/649 (3%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            ED++Q K   +++S   TIEFLR RLL+ERS S+TA+QRADELA++V+ELEEQLK VSLQ
Sbjct: 7    EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQRVSELEEQLKAVSLQ 65

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            RKKAE+ATA VL+ILENH I DVSEEF S SD+E    D K               + ++
Sbjct: 66   RKKAERATAAVLSILENHSIDDVSEEFSSGSDKEAILSDQKDAENKTGGDISSSVKE-KE 124

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1719
            +              ST RSLSW+S K S H+L+++ Y DS RRR ++F S  +S+ +RV
Sbjct: 125  DDVDTLSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSSTDISSPKRV 183

Query: 1718 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1539
            G SCRRIR RDTRS  D   + +  A C+ +    S            G N +      S
Sbjct: 184  GNSCRRIRRRDTRSASDKLQNSS--AECASEPLPSSANNEPHPLTAGAGINDVNDQVHVS 241

Query: 1538 NSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1359
              +   + G   +  + D+D + AL  QAQLIGQY           EK+RE+N  T DSC
Sbjct: 242  AID---VSGNGKEADKSDEDSQRALHQQAQLIGQYEAEEKAQREWEEKYRESNICTPDSC 298

Query: 1358 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEK----------PETS 1209
            D  N+SDVTEER ++K+ +    AG ++  N   +    D   +E+          P  +
Sbjct: 299  DRENYSDVTEERDDLKASQEPCLAGNTSMQNHANQSGAADVSRTEQNGNIDNSPSTPHVN 358

Query: 1208 KRSLQNE---NIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038
               L+++     +  +S ASE + PMS    N  +L      S Y  QQ  P+ +     
Sbjct: 359  MSCLEDKKGSRTVESDSPASELARPMS----NGNYLENHGQTSAYSHQQSLPVTR----- 409

Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 867
                SP   +S++L          ELA+V     +++ SVL  L++AK SL +++N+S P
Sbjct: 410  ----SPMHPRSSSLQAGQAPQTGYELALVSHNTSNSVNSVLGELEQAKLSLTKQINSSLP 465

Query: 866  TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPPENSLGRF 687
            TA          S N++++  +++   +SP +           +R  +       + G  
Sbjct: 466  TASYPGMPSRFSSVNQSSEPSTYETS-LSPYM----------ESRSKYV------TQGNR 508

Query: 686  LSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSE---GPSSSNRMNRLDSYTNPVLP 516
            ++ PF  + AF         YRP + E  F   + S     P+SS+R+     +T P   
Sbjct: 509  VTYPF--QRAFPEVSSSAPSYRPIS-ETNFDAGQPSSMRFNPNSSSRLPLSSKFTYP--- 562

Query: 515  SVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRP 369
                SYP  PD+  ++P NE   SRN+P +E  LPP    S++   V P
Sbjct: 563  ----SYPKFPDMVPKLPPNE-VFSRNYPRNETDLPPSFSFSTWSPEVVP 606


>ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251943 [Solanum
            lycopersicum]
          Length = 729

 Score =  266 bits (679), Expect = 4e-68
 Identities = 229/673 (34%), Positives = 332/673 (49%), Gaps = 45/673 (6%)
 Frame = -2

Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073
            ED++Q K   +++S   TIEFLR RLL+ERS S+TA+QRADELA+ V+ELEEQLK VSLQ
Sbjct: 7    EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQMVSELEEQLKVVSLQ 65

Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893
            RK+AEKATA VL+ILE+H I DVSEEF S SD+E    D K   G           K ++
Sbjct: 66   RKRAEKATAAVLSILEDHSIDDVSEEFSSGSDKETILSDQKDA-GNKTGGDISSSAKEKE 124

Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1719
            +              ST RSLSW+S K S H+L+++ Y DS RRR ++F    +S+ +RV
Sbjct: 125  DDVDILSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSYTDISSPKRV 183

Query: 1718 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1539
            G SCR+IR RDTRS  D   + +  A C+ +  + S            G + +       
Sbjct: 184  GNSCRQIRRRDTRSASDKLRNSS--AECASEPLSSSANNEPHSLTAGAGISDVNDQV--- 238

Query: 1538 NSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1359
            +     + G   +  + D+D + AL  Q Q IGQY           EK+RE+NS T DSC
Sbjct: 239  HVPALDVPGNGREADKSDEDSQRALHQQVQPIGQYEAEEKAQREWEEKYRESNSCTPDSC 298

Query: 1358 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENI- 1182
            D  N+SDVTEER ++K+ +    AG ++  N   +    D   +++      S    N+ 
Sbjct: 299  DRENYSDVTEERDDLKASQEPCLAGRTSMQNHANQCGAADVSRTKQNGNIDNSPSTPNVN 358

Query: 1181 ISC------------ESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038
            +SC            +SSASE + PMS       +L      S +  QQ  P+ +     
Sbjct: 359  MSCLEDKKGSRTVGSDSSASELARPMS----TGNYLENHGQTSAFSHQQSFPVTR----- 409

Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 867
                S    +S++L     +    ELA+V     + + SVL  L++AK SL +++N+S P
Sbjct: 410  ----SSMHPRSSSLQAGQALQTGYELALVSHNTSNGVDSVLGKLEQAKLSLTKQINSSLP 465

Query: 866  TAGRASGSVFQPSNNETNKTDSFQIPVISPGL----------FRLPTDYQ---PE--NAR 732
            TA          S N + +  +++I +  P +           R+   +Q   PE  ++ 
Sbjct: 466  TASYPGTPSRFSSLNHSPELSTYEISLTPPYVESRSKYVTQSNRVTYPFQRAFPEVSSSA 525

Query: 731  PGFANFPPEN-SLGRFLSEPF-DSRSAF-SSDLFLTDPY-RPFT-------PERPFSQPR 585
            P +      N   G+  S P+ +SRS + +    +T P+ R FT         RP S+  
Sbjct: 526  PSYRPISETNFEAGQPSSTPYVESRSKYVTQSNRVTYPFQRAFTEVSSSAPSYRPISETN 585

Query: 584  LSEGPSSSNRMNRLDSYTNPVLPSVK-DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPP 408
               G  SS R N   S   P    +   SYP  PD+  ++P NE   SRNFP++E  LPP
Sbjct: 586  FDAGQPSSVRFNPNSSSRLPFSSKLTYPSYPKFPDMVPKLPPNE-VFSRNFPTNETDLPP 644

Query: 407  VMRLSSYDEHVRP 369
                S+  + V P
Sbjct: 645  SFSFSTLSQEVVP 657

