BLASTX nr result

ID: Paeonia25_contig00013892 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00013892
         (2001 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007218938.1| hypothetical protein PRUPE_ppa002306mg [Prun...   427   e-116
ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309...   420   e-114
ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr...   414   e-113
ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr...   411   e-112
ref|XP_007010395.1| Uncharacterized protein isoform 4 [Theobroma...   388   e-105
ref|XP_007010392.1| Uncharacterized protein isoform 1 [Theobroma...   385   e-104
gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]     376   e-101
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...   374   e-100
ref|XP_007010393.1| Uncharacterized protein isoform 2 [Theobroma...   362   5e-97
ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu...   361   6e-97
ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp...   348   7e-93
ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyp...   347   2e-92
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...   345   3e-92
emb|CBI40233.3| unnamed protein product [Vitis vinifera]              338   4e-90
ref|XP_004496183.1| PREDICTED: uncharacterized protein LOC101514...   334   1e-88
ref|XP_004496182.1| PREDICTED: uncharacterized protein LOC101514...   333   1e-88
ref|XP_004496186.1| PREDICTED: uncharacterized protein LOC101514...   330   1e-87
ref|XP_007143822.1| hypothetical protein PHAVU_007G104500g [Phas...   325   5e-86
gb|EYU19796.1| hypothetical protein MIMGU_mgv1a003492mg [Mimulus...   308   5e-81
ref|XP_006606288.1| PREDICTED: micronuclear linker histone polyp...   292   4e-76

>ref|XP_007218938.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica]
            gi|462415400|gb|EMJ20137.1| hypothetical protein
            PRUPE_ppa002306mg [Prunus persica]
          Length = 690

 Score =  427 bits (1097), Expect = e-116
 Identities = 289/649 (44%), Positives = 372/649 (57%), Gaps = 67/649 (10%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSL 415
            Q   + R    MEDS AMT+E+LRARLL+ERSVSR+ARQR +EL + V ELE QLK VSL
Sbjct: 6    QDTQDQRSNLGMEDSTAMTIEFLRARLLAERSVSRSARQRVDELERMVEELEEQLKIVSL 65

Query: 416  QRKKAEKATADVLAILENHGISDLS-EAFDSSSDQEVTPPGSRVSNSM-NEEETSVNSKV 589
            QRK AEKAT DVLAILE+ GISD+S E FDSSSDQE T  GS+V NS+ NEEE+ V SKV
Sbjct: 66   QRKMAEKATEDVLAILESQGISDISEEEFDSSSDQE-THQGSKVGNSLANEEESFVISKV 124

Query: 590  RRDDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQ 766
            RR + EE SGS+ + S  P RSLSWK   DSP S E                      + 
Sbjct: 125  RRKEQEEHSGSDADSSLIPGRSLSWKGRIDSPRSRE-KCKDLSVRRRSSFSSIGFSSPRH 183

Query: 767  RLGKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIG 946
             LGKSCRQI+ +E RS   DS           NGV   S+GLPN S  G E  RE SE  
Sbjct: 184  HLGKSCRQIKHKETRSDKFDS---------HENGVGASSEGLPNFSNGGPEKLREGSEFP 234

Query: 947  EEK-----AVLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXX 1111
            EEK     ++ +  ENQR   ++    NGHGRDKDME+ALE QA+LI ++          
Sbjct: 235  EEKVLSNDSLSRTKENQR---DSDLDFNGHGRDKDMEKALEHQAKLICENEEMEKAQREW 291

Query: 1112 XXXFRDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKE 1291
               FR+NN STPDSC+PGN SD+TEERDEIK Q    A  + +Q QE KSE  DV + KE
Sbjct: 292  EEKFRENNTSTPDSCDPGNHSDITEERDEIKAQTPCSAGVVVAQAQETKSEEGDVCLPKE 351

Query: 1292 SRNVYTSGF-------QSPMQDQKNKNTLAYELPATDFAFPSAKGTQN-EQLEKYSYHPP 1447
            +  +  +GF          +QDQ NK+T+A      +FAFP+  G QN E LE ++ HP 
Sbjct: 352  TFKIQQNGFLPASHVDMGGLQDQLNKSTVA-PSQVEEFAFPTENGKQNHESLENFARHPS 410

Query: 1448 --SH---------------------------GETSGSRNENQALILHEAASNGLGGVLEA 1540
              SH                           G  SGSR++  AL+ H+ + + LGGVL+A
Sbjct: 411  HGSHPNPLVHGSAHNRSSDASSSVAGSGFHKGNASGSRSDLYALVPHD-SQDRLGGVLDA 469

Query: 1541 LQQAKLSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKF-E 1717
            L+QAKLS+QQ + +LPL++  SV K  EPS   ++ GD+++IPVGCAGLF++PTDF   E
Sbjct: 470  LKQAKLSLQQNMTRLPLVDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFAVEE 529

Query: 1718 PTPRANFLGSGSQVHPDTRVAVTGSVLS-------SSSNQFVPNSYMDTRLNVS-NGDHR 1873
               +++FLGS           VT S +        ++++++VP+ Y++TR   S N   R
Sbjct: 530  AATQSSFLGSSWSGRYCPETLVTSSFVETRPTFSMNAADRYVPSPYIETRQTFSTNATDR 589

Query: 1874 FLTNPLVET-------------GSRVLDTRLNVSNGDHRFLTNPLVETG 1981
            F+ N  VE+              S  +DTR N    D+RFL+ P  E+G
Sbjct: 590  FIPNAYVESRPNFPANAAEPFVTSPSVDTRSNFP-ADNRFLSGPYSESG 637


>ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca
            subsp. vesca]
          Length = 807

 Score =  420 bits (1079), Expect = e-114
 Identities = 277/625 (44%), Positives = 361/625 (57%), Gaps = 44/625 (7%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSL 415
            Q   ++R  + M+DS  +T+E+LRARLLSERSVSR+ARQRA+EL K V ELE QLK VSL
Sbjct: 6    QDTQDLRINSGMDDSPGITIEFLRARLLSERSVSRSARQRADELEKMVEELEEQLKIVSL 65

Query: 416  QRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKVRR 595
            QRK AEKATADVLAILEN G SD+SE FDSSSD E T   S++ N   +EE +     RR
Sbjct: 66   QRKMAEKATADVLAILENQGASDISEEFDSSSDHE-TFQESKMGNKSRKEEENFLISERR 124

Query: 596  DDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRL 772
            ++ EE+SGS+L+ S  P R+LSWK   DSP S E                     ++  L
Sbjct: 125  NEHEEYSGSDLDSSSIPGRNLSWKGRIDSPRSRE-KYKEPSIRRRSTFSAVGSSSSRHNL 183

Query: 773  GKSCRQIRRREARSTAEDSKNGSLTLDPS-SNGVATCSDGLPNGSETGSEIRREDSEIGE 949
            GKSCRQI+ RE RS  E SK+     D S  NGVA  S+GL N S    E  R+  E  +
Sbjct: 184  GKSCRQIKHRETRSVVERSKDEPAKFDDSEENGVAASSEGLSNFSYCDPERLRDGPESQK 243

Query: 950  EK-----AVLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXX 1114
            EK     A+ +  E+QR   N     NGHGR+KDMERALE QAQLIGQ+           
Sbjct: 244  EKFLSKDALTRSKEHQR---NGDPNFNGHGRNKDMERALEHQAQLIGQNEEMEMAQREWE 300

Query: 1115 XXFRDNNNSTPDSCEPGNQSDVTEERDEIK-PQAASLAVTITSQDQEPKSEAEDVGIRKE 1291
              FR+NN STPDSC+PGN SD+TEERDE+K P  A +     S+ QE KSEA D  + +E
Sbjct: 301  EKFRENNTSTPDSCDPGNHSDITEERDEMKTPFPAEIN---ASEAQEAKSEARDSCLFEE 357

Query: 1292 SRNVYTSGFQSP-------MQDQKNKNTLAYELPATDFAFPSAKGTQNEQLEKYSYHPPS 1450
                  +G+  P       MQDQ N++++A   P  +FAFP+A   Q ++  + + H PS
Sbjct: 358  KMKTQLNGYLPPSDVEMGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLENNAHQPS 417

Query: 1451 HG---------------------------ETSGSRNENQALILHEAASNGLGGVLEALQQ 1549
             G                             SGSRN+  AL+ H++    LGGVL+AL+Q
Sbjct: 418  PGSHHDPLLLESSHNRSSVVSSDGGSSFHNASGSRNDLYALVPHDSQER-LGGVLDALKQ 476

Query: 1550 AKLSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKF-EPTP 1726
            AKLS+QQK+ +LPL++  SV +  EP   A+  G++LDIPVGCAGLF++PTDF   E   
Sbjct: 477  AKLSLQQKIIRLPLVDDTSVQESIEPPIPAVTTGNRLDIPVGCAGLFRLPTDFAVEEAAT 536

Query: 1727 RANFLGSGSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDHRFLTNPLVETGS 1906
            + ++LG GS + P  R      + +SS++QFV ++Y++TR     GD RF+ +P VE   
Sbjct: 537  KHSYLGLGSSL-PSARYCPDKGLAASSTDQFVTSTYVETRPPYHVGD-RFVASPYVE--- 591

Query: 1907 RVLDTRLNVSNG-DHRFLTNPLVET 1978
                 R  VS G     + NP  ET
Sbjct: 592  ----NRRTVSTGAGDLVVANPYAET 612


>ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|568878417|ref|XP_006492190.1| PREDICTED:
            uncharacterized protein LOC102610545 [Citrus sinensis]
            gi|557538863|gb|ESR49907.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 732

 Score =  414 bits (1065), Expect = e-113
 Identities = 281/628 (44%), Positives = 363/628 (57%), Gaps = 42/628 (6%)
 Frame = +2

Query: 233  GQARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVS 412
            GQ   + R  + MEDSN MT+E+LRARLLSERSVS++ARQRA+ELA+RV ELE QLK VS
Sbjct: 5    GQEMQDQRTNSGMEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVS 64

Query: 413  LQRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMN-EEETSVNSKV 589
            LQRKKAEKATADVLAILEN+GIS++S++FDS SDQE TP  S V N+ N EEE SV+SK 
Sbjct: 65   LQRKKAEKATADVLAILENNGISEISDSFDSGSDQE-TPCESEVGNNFNKEEENSVDSKF 123

Query: 590  RRDDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQ 766
            RR+ S E SGS  + SP P R LSW     +  SLE                      K 
Sbjct: 124  RRNASVEHSGSGNDFSPVPHRGLSWNGRRGTKQSLE-KYKDSYLRRRSSFASTGSSSPKN 182

Query: 767  RLGKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIG 946
            R+GKSCRQIRRRE++S  E+ K   + +D   NG  T  +      +   E+ R  SE  
Sbjct: 183  RVGKSCRQIRRRESKSAVEELKTEPVKVDSQENGGGTSLE-----VDRKPEVLR-GSEAQ 236

Query: 947  EEKAVLKG-----LENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXX 1111
            EE+ + +G      EN++ VT      NG G DKDME+ALE QAQLIG++          
Sbjct: 237  EEQYLGEGSDSGCFENEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREW 296

Query: 1112 XXXFRDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKE 1291
               FR+NN+STPDSC+PGNQSDVTEER+E K Q   +A T+ SQ QE K+E   V +  +
Sbjct: 297  EERFRENNSSTPDSCDPGNQSDVTEEREESKVQVQRVAGTVNSQVQEAKTE---VHLSNQ 353

Query: 1292 SRNVYTSGFQSPMQ-DQKNKNTLAYELPATDFAFPSAKGTQNEQLEKYSYHPPSHG---- 1456
              N  ++GF  P   DQK  +T A E  A DFAF  +   QN++    +++ PSH     
Sbjct: 354  LSNTKSNGFLPPQSGDQKCSSTPASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHR 413

Query: 1457 -------------------------ETSGSRNENQALILHEAASNGLGGVLEALQQAKLS 1561
                                     E SGS++E  AL+ H+  S+G   VLEAL+QA+LS
Sbjct: 414  LHPHGSPENQSSQTVSSNTGSSSRREVSGSQSEQYALVPHQ-TSSGFNEVLEALKQARLS 472

Query: 1562 IQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANFL 1741
            ++QK++ LP  ES SVGK  EPS +A    D+++IPVGC+GLF+VPTD+  E T +ANFL
Sbjct: 473  LRQKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVE-TSKANFL 531

Query: 1742 GSGSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDHR-----FLTNPLVETGS 1906
             S S+         +G  L  S +Q V NS MDTR   +  + R     FLT P  +T S
Sbjct: 532  VSDSRPSLANYNPTSGIGL-VSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRS 590

Query: 1907 RVLDTRLNVSNGDHRFLTNPLVETGSRV 1990
                      + ++R LT    +T SRV
Sbjct: 591  SY--------SAENRLLTRQYSDTRSRV 610


>ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|557538862|gb|ESR49906.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 716

 Score =  411 bits (1056), Expect = e-112
 Identities = 278/616 (45%), Positives = 358/616 (58%), Gaps = 42/616 (6%)
 Frame = +2

Query: 269  MEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKATAD 448
            MEDSN MT+E+LRARLLSERSVS++ARQRA+ELA+RV ELE QLK VSLQRKKAEKATAD
Sbjct: 1    MEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQRKKAEKATAD 60

Query: 449  VLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMN-EEETSVNSKVRRDDSEEFSGSE 625
            VLAILEN+GIS++S++FDS SDQE TP  S V N+ N EEE SV+SK RR+ S E SGS 
Sbjct: 61   VLAILENNGISEISDSFDSGSDQE-TPCESEVGNNFNKEEENSVDSKFRRNASVEHSGSG 119

Query: 626  LECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQIRRR 802
             + SP P R LSW     +  SLE                      K R+GKSCRQIRRR
Sbjct: 120  NDFSPVPHRGLSWNGRRGTKQSLE-KYKDSYLRRRSSFASTGSSSPKNRVGKSCRQIRRR 178

Query: 803  EARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLKG---- 970
            E++S  E+ K   + +D   NG  T  +      +   E+ R  SE  EE+ + +G    
Sbjct: 179  ESKSAVEELKTEPVKVDSQENGGGTSLE-----VDRKPEVLR-GSEAQEEQYLGEGSDSG 232

Query: 971  -LENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNSTP 1147
              EN++ VT      NG G DKDME+ALE QAQLIG++             FR+NN+STP
Sbjct: 233  CFENEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFRENNSSTP 292

Query: 1148 DSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNVYTSGFQSP 1327
            DSC+PGNQSDVTEER+E K Q   +A T+ SQ QE K+E   V +  +  N  ++GF  P
Sbjct: 293  DSCDPGNQSDVTEEREESKVQVQRVAGTVNSQVQEAKTE---VHLSNQLSNTKSNGFLPP 349

Query: 1328 MQ-DQKNKNTLAYELPATDFAFPSAKGTQNEQLEKYSYHPPSHG---------------- 1456
               DQK  +T A E  A DFAF  +   QN++    +++ PSH                 
Sbjct: 350  QSGDQKCSSTPASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSS 409

Query: 1457 -------------ETSGSRNENQALILHEAASNGLGGVLEALQQAKLSIQQKLNKLPLLE 1597
                         E SGS++E  AL+ H+  S+G   VLEAL+QA+LS++QK++ LP  E
Sbjct: 410  QTVSSNTGSSSRREVSGSQSEQYALVPHQ-TSSGFNEVLEALKQARLSLRQKMSSLPSTE 468

Query: 1598 SGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANFLGSGSQVHPDTRV 1777
            S SVGK  EPS +A    D+++IPVGC+GLF+VPTD+  E T +ANFL S S+       
Sbjct: 469  SRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVE-TSKANFLVSDSRPSLANYN 527

Query: 1778 AVTGSVLSSSSNQFVPNSYMDTRLNVSNGDHR-----FLTNPLVETGSRVLDTRLNVSNG 1942
              +G  L  S +Q V NS MDTR   +  + R     FLT P  +T S          + 
Sbjct: 528  PTSGIGL-VSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSY--------SA 578

Query: 1943 DHRFLTNPLVETGSRV 1990
            ++R LT    +T SRV
Sbjct: 579  ENRLLTRQYSDTRSRV 594


>ref|XP_007010395.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508727308|gb|EOY19205.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 709

 Score =  388 bits (996), Expect = e-105
 Identities = 269/645 (41%), Positives = 362/645 (56%), Gaps = 60/645 (9%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSL 415
            Q + + R T  +EDS  MT+E+LRARLLSERSVS++ARQR +ELAKRVAELE QLK VS+
Sbjct: 6    QVKQDQRTTCNVEDST-MTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSV 64

Query: 416  QRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVR 592
            QR++AEKATADVLAILEN+G+SD+SE  DSSSDQ+  P  S ++N S  EEE+SV SKVR
Sbjct: 65   QRRRAEKATADVLAILENNGVSDISEELDSSSDQD-APFESNINNGSTKEEESSVTSKVR 123

Query: 593  RDDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQR 769
            + +SEE SGSE +CS    RSLSWK    + HS E                      K R
Sbjct: 124  QKESEELSGSEFDCSSASGRSLSWKGRKSASHSPE-RYKDKLVRSRNSFASISFSSRKHR 182

Query: 770  LGKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGE 949
             GKSCRQIRRRE+RS AE+ K+ ++ +DP   G+   S+   N S  G  I    SEI E
Sbjct: 183  QGKSCRQIRRRESRSVAEELKSDNIMVDPQVKGLENSSEVNANHSTGGPHILPMGSEIHE 242

Query: 950  EKAVL-----KGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXX 1114
             K+ +       L+N+R VT      +G+  +KDME+ALE QAQLI  +           
Sbjct: 243  NKSTVDNLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWE 302

Query: 1115 XXFRDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKES 1294
              FR+ N+S+PDSC+PGN SDVTEERDEIK QA  ++ T TSQ Q   +E E +    E 
Sbjct: 303  EKFREKNSSSPDSCDPGNHSDVTEERDEIKAQAQYVSGTATSQVQ--GAEEEHISFSAEL 360

Query: 1295 RNVYTSGFQSP-------MQDQKNKNTLAYE-----LPATDFAFPSAKGTQNEQLEK--- 1429
              ++++    P       +QD +   +L+ E      P     F  AK   ++ ++    
Sbjct: 361  PKIHSNDLVPPSQADMDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKENHHQSMQSNNS 420

Query: 1430 ------YSYHP----------------PSHG--ETSGSRNENQALILHEAASNGLGGVLE 1537
                  +  HP                 SH   E   ++NE  AL+ HE  S    GVL+
Sbjct: 421  PSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNELYALVPHE-TSGRFTGVLD 479

Query: 1538 ALQQAKLSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFE 1717
            +L+QA+LS+QQK++ L L+E  SVGK  E S +  + G++++IP+GC+GLF+VPTD   E
Sbjct: 480  SLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVE 539

Query: 1718 PTPRANFLGSGSQV-----HPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSN-----GD 1867
              P+ANFLGS SQ+     +PD  VA T      +SN  +  SYM+T+ + S+       
Sbjct: 540  -APKANFLGSSSQLSLANHYPDRGVAPT------ASNHLLTTSYMNTQSSSSSNYQPVSS 592

Query: 1868 HRFLTNPLV--ETGSRVLDTRLNVSN--GDHRFLTNPLVETGSRV 1990
             RF + P +   T S    T    S    D + LT    ETGSR+
Sbjct: 593  DRFFSGPYMYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRL 637


>ref|XP_007010392.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508727305|gb|EOY19202.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 749

 Score =  385 bits (989), Expect = e-104
 Identities = 265/637 (41%), Positives = 357/637 (56%), Gaps = 60/637 (9%)
 Frame = +2

Query: 260  TTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKA 439
            TT   + + MT+E+LRARLLSERSVS++ARQR +ELAKRVAELE QLK VS+QR++AEKA
Sbjct: 53   TTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSVQRRRAEKA 112

Query: 440  TADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVRRDDSEEFS 616
            TADVLAILEN+G+SD+SE  DSSSDQ+  P  S ++N S  EEE+SV SKVR+ +SEE S
Sbjct: 113  TADVLAILENNGVSDISEELDSSSDQD-APFESNINNGSTKEEESSVTSKVRQKESEELS 171

Query: 617  GSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQI 793
            GSE +CS    RSLSWK    + HS E                      K R GKSCRQI
Sbjct: 172  GSEFDCSSASGRSLSWKGRKSASHSPE-RYKDKLVRSRNSFASISFSSRKHRQGKSCRQI 230

Query: 794  RRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVL--- 964
            RRRE+RS AE+ K+ ++ +DP   G+   S+   N S  G  I    SEI E K+ +   
Sbjct: 231  RRRESRSVAEELKSDNIMVDPQVKGLENSSEVNANHSTGGPHILPMGSEIHENKSTVDNL 290

Query: 965  --KGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNN 1138
                L+N+R VT      +G+  +KDME+ALE QAQLI  +             FR+ N+
Sbjct: 291  HSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREKNS 350

Query: 1139 STPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNVYTSGF 1318
            S+PDSC+PGN SDVTEERDEIK QA  ++ T TSQ Q   +E E +    E   ++++  
Sbjct: 351  SSPDSCDPGNHSDVTEERDEIKAQAQYVSGTATSQVQ--GAEEEHISFSAELPKIHSNDL 408

Query: 1319 QSP-------MQDQKNKNTLAYE-----LPATDFAFPSAKGTQNEQLEK---------YS 1435
              P       +QD +   +L+ E      P     F  AK   ++ ++          + 
Sbjct: 409  VPPSQADMDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKENHHQSMQSNNSPSNSSHHF 468

Query: 1436 YHP----------------PSHG--ETSGSRNENQALILHEAASNGLGGVLEALQQAKLS 1561
             HP                 SH   E   ++NE  AL+ HE  S    GVL++L+QA+LS
Sbjct: 469  AHPHDSPGNQAVQHISSDLGSHSCRELPRNKNELYALVPHE-TSGRFTGVLDSLKQARLS 527

Query: 1562 IQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANFL 1741
            +QQK++ L L+E  SVGK  E S +  + G++++IP+GC+GLF+VPTD   E  P+ANFL
Sbjct: 528  LQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVE-APKANFL 586

Query: 1742 GSGSQV-----HPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSN-----GDHRFLTNPL 1891
            GS SQ+     +PD  VA T      +SN  +  SYM+T+ + S+        RF + P 
Sbjct: 587  GSSSQLSLANHYPDRGVAPT------ASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPY 640

Query: 1892 V--ETGSRVLDTRLNVSN--GDHRFLTNPLVETGSRV 1990
            +   T S    T    S    D + LT    ETGSR+
Sbjct: 641  MYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRL 677


>gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]
          Length = 654

 Score =  376 bits (965), Expect = e-101
 Identities = 263/632 (41%), Positives = 345/632 (54%), Gaps = 55/632 (8%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSN--AMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAV 409
            Q + + R ++ MEDS   AMT+E+LRARLLSERSVSR+ARQRA+EL KRV ELE QL+ V
Sbjct: 6    QEKQDQRSSSSMEDSQSTAMTIEFLRARLLSERSVSRSARQRADELEKRVEELEEQLRIV 65

Query: 410  SLQRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKV 589
            SLQRK AEKAT DVL+ILENHGISD SE +DS SDQE        +N  N EE SV SK 
Sbjct: 66   SLQRKMAEKATVDVLSILENHGISDASETYDSGSDQET---HQVANNYANGEERSVVSK- 121

Query: 590  RRDDSEEFSGSELECSPF-PRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQ 766
            RR   EE SGS+L+ SP   RSLSWK  +DS  S E                      K 
Sbjct: 122  RRSVLEELSGSDLDSSPINGRSLSWKGRSDSSRSREKYKDSSVRRQNALSSSFGSSSPKH 181

Query: 767  RLGKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIG 946
             +GKSCRQIR RE R+  ED K   L  D   NG AT  +G                   
Sbjct: 182  YVGKSCRQIRCRETRTVVEDHKTEPLKFDSQENGAATPPEG------------------- 222

Query: 947  EEKAVLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFR 1126
                    ++N R++ N +   NGHG++KDM++ALE +AQLIGQ+             +R
Sbjct: 223  -------SVKNDRRIPNHLDV-NGHGQEKDMKKALEHRAQLIGQYEEMEKAQREWEEKYR 274

Query: 1127 DNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNVY 1306
            +NN STPDS +PGN SDVTE+RDE+K Q         +Q  + KS   D  + KES    
Sbjct: 275  ENNTSTPDSYDPGNHSDVTEDRDEVKAQTLYNVGIDIAQAVDAKSNKVD--LSKESSKPQ 332

Query: 1307 TSGFQSP-----------MQDQKNKNTLAYELPATDFAFPSAKGTQ-NEQLEKYSYHP-- 1444
            ++GF  P           +Q   N + +A    A +FAFP+AK  +  E LE   + P  
Sbjct: 333  SNGFLHPTRTRAAMGDLKVQASSNIDPVASRFQAQEFAFPTAKEKEAQESLENRDFRPSE 392

Query: 1445 -PSHGET---------------------------SGSRNENQALILHEAASNGLGGVLEA 1540
             P HG+                            SGS+N+  AL+ H      LGGVL+A
Sbjct: 393  SPHHGQLLHRSLPNQPFDRGALSDAGSSSHKRDFSGSQNDLYALVPHNPPV-VLGGVLDA 451

Query: 1541 LQQAKLSIQQKLNKLPL----LESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDF 1708
            L+QAKLS+QQK+N+LPL     ++ +V +  EP+    R GD+L+IPVGC GLF++PTDF
Sbjct: 452  LKQAKLSLQQKINRLPLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVGCTGLFRLPTDF 511

Query: 1709 -KFEPTPRANFLGSGSQV-----HPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDH 1870
               E + +ANFL SGS++     +PD +VA+T      + ++F+ + Y+++R      D 
Sbjct: 512  ATVEASTQANFLSSGSRLSLEPYYPDNKVALT------APDRFLTSPYIESRSEFP-PDV 564

Query: 1871 RFLTNPLVETGSRVLDTRLNVSNGDHRFLTNP 1966
            RFLT+  V +GSR   + LN S  D  F T P
Sbjct: 565  RFLTSSSVVSGSRA--STLN-SRFDSHFDTGP 593


>ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis]
            gi|223526443|gb|EEF28720.1| hypothetical protein
            RCOM_0152200 [Ricinus communis]
          Length = 665

 Score =  374 bits (959), Expect = e-100
 Identities = 253/598 (42%), Positives = 344/598 (57%), Gaps = 41/598 (6%)
 Frame = +2

Query: 242  RSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQR 421
            + + R  + MEDS AMT+E+LRARLLSERSVSRTARQRA+ELA RVAELE QL+ VSLQR
Sbjct: 8    KQDQRTNSGMEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQLRIVSLQR 67

Query: 422  KKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKVRRDD 601
             KAEKATAD+LAILE +GISD+SE FDS SD++ TP  S+V N  ++EE S+NSKVR +D
Sbjct: 68   MKAEKATADILAILEGNGISDISETFDSCSDRD-TPCESKVGNRSSKEENSINSKVRNND 126

Query: 602  SEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGK 778
            SEE SGS+ + S  P RSLSWK   +SP SLE                      KQR GK
Sbjct: 127  SEELSGSDFDFSSVPGRSLSWKGRKNSPRSLE--KSKDSSMRRRSSFSSVGSSPKQRPGK 184

Query: 779  SCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKA 958
            SCRQIRR+E+R    + K   +  D   + VA  S   P+ S+        + + GE K 
Sbjct: 185  SCRQIRRKESRF---EYKASPVKRDCPEDEVAATSANFPSCSDF-------EPKRGEVKP 234

Query: 959  VLKG-----LENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXF 1123
            +L+      L N+R  ++    +N +  D+DME+ALE QAQLIGQ+             F
Sbjct: 235  LLEDSHSDCLGNERNASDNGLDYNVYRGDRDMEKALEHQAQLIGQYEAMEKVQREWEEKF 294

Query: 1124 RDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNV 1303
            R+NN+STPDSC+ GN+SD+TEER EI+  A   A T   Q +   S  E V   +    +
Sbjct: 295  RENNSSTPDSCDHGNRSDITEERYEIREPAKGPATTNAIQTEGLLSVVEGVSNTQPHGFL 354

Query: 1304 YTSGFQSP-MQDQKNKNTLAYELPATDFAFPSAKGTQNEQLEKYSYHPP---SHGE---- 1459
             +S   +  ++++K+      E    D AFP AK  QN++    + H P   +H +    
Sbjct: 355  PSSHVDAVCLEERKSSIAPVPEFSTQDSAFPMAKAKQNQKNPGNNDHSPLLIAHHDSASF 414

Query: 1460 ---------------------------TSGSRNENQALILHEAASNGLGGVLEALQQAKL 1558
                                       TSGS NE  AL+ H+ AS GLGGVLEAL++A+ 
Sbjct: 415  GSQYSSGSQSVLSFPSNTGSSFNKGKATSGSENERCALVPHK-ASGGLGGVLEALEEARQ 473

Query: 1559 SIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANF 1738
            S+QQ++N+LP + + +V K  E S +   + D++ IPVGC GLF++PTDF  E   RAN 
Sbjct: 474  SLQQRINRLPSVAT-TVRKSVESSVSTTISRDEVQIPVGCVGLFRLPTDFSVEGNTRANL 532

Query: 1739 LGSGSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDHRFLTNPLVETGSRV 1912
            L S +Q+      +  G V +++SNQFV + Y+  R + S  D +FL++  V  GSR+
Sbjct: 533  LSSSAQLSLGNHYSDRG-VPAAASNQFVASPYLQGRSSSSTED-QFLSSQYVGGGSRI 588


>ref|XP_007010393.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590567007|ref|XP_007010394.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508727306|gb|EOY19203.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508727307|gb|EOY19204.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 665

 Score =  362 bits (928), Expect = 5e-97
 Identities = 261/640 (40%), Positives = 348/640 (54%), Gaps = 55/640 (8%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSL 415
            Q + + R T  +EDS  MT+E+LRARLLSERSVS++ARQR +ELAKRVAELE QLK VS+
Sbjct: 6    QVKQDQRTTCNVEDST-MTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSV 64

Query: 416  QRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVR 592
            QR++AEKATADVLAILEN+G+SD+SE  DSSSDQ+  P  S ++N S  EEE+SV SKVR
Sbjct: 65   QRRRAEKATADVLAILENNGVSDISEELDSSSDQD-APFESNINNGSTKEEESSVTSKVR 123

Query: 593  RDDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQR 769
            + +SEE SGSE +CS    RSLSWK    + HS E                      K R
Sbjct: 124  QKESEELSGSEFDCSSASGRSLSWKGRKSASHSPE-RYKDKLVRSRNSFASISFSSRKHR 182

Query: 770  LGKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGE 949
             GKSCRQIRRRE+RS AE+ K+ ++ +DP                               
Sbjct: 183  QGKSCRQIRRRESRSVAEELKSDNIMVDPQ------------------------------ 212

Query: 950  EKAVLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRD 1129
                +KGLEN  +V NA    N    +KDME+ALE QAQLI  +             FR+
Sbjct: 213  ----VKGLENSSEV-NA----NHSTGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFRE 263

Query: 1130 NNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNVYT 1309
             N+S+PDSC+PGN SDVTEERDEIK QA  ++ T TSQ Q   +E E +    E   +++
Sbjct: 264  KNSSSPDSCDPGNHSDVTEERDEIKAQAQYVSGTATSQVQ--GAEEEHISFSAELPKIHS 321

Query: 1310 SGFQSP-------MQDQKNKNTLAYE-----LPATDFAFPSAKGTQNEQLEK-------- 1429
            +    P       +QD +   +L+ E      P     F  AK   ++ ++         
Sbjct: 322  NDLVPPSQADMDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKENHHQSMQSNNSPSNSS 381

Query: 1430 -YSYHP----------------PSHG--ETSGSRNENQALILHEAASNGLGGVLEALQQA 1552
             +  HP                 SH   E   ++NE  AL+ HE  S    GVL++L+QA
Sbjct: 382  HHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNELYALVPHE-TSGRFTGVLDSLKQA 440

Query: 1553 KLSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRA 1732
            +LS+QQK++ L L+E  SVGK  E S +  + G++++IP+GC+GLF+VPTD   E  P+A
Sbjct: 441  RLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVE-APKA 499

Query: 1733 NFLGSGSQV-----HPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSN-----GDHRFLT 1882
            NFLGS SQ+     +PD  VA T      +SN  +  SYM+T+ + S+        RF +
Sbjct: 500  NFLGSSSQLSLANHYPDRGVAPT------ASNHLLTTSYMNTQSSSSSNYQPVSSDRFFS 553

Query: 1883 NPLV--ETGSRVLDTRLNVSN--GDHRFLTNPLVETGSRV 1990
             P +   T S    T    S    D + LT    ETGSR+
Sbjct: 554  GPYMYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRL 593


>ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa]
            gi|222850857|gb|EEE88404.1| hypothetical protein
            POPTR_0008s02540g [Populus trichocarpa]
          Length = 684

 Score =  361 bits (927), Expect = 6e-97
 Identities = 256/627 (40%), Positives = 333/627 (53%), Gaps = 64/627 (10%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSL 415
            Q + + R  + MEDS A+T+E+LRARLL+ERSVSRTARQRA+ELA+RVAELE QL+ VSL
Sbjct: 6    QEKQDQRTRSSMEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQLRIVSL 65

Query: 416  QRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKVRR 595
            QR KAEKAT DVLAILE++GISD SE F SSSDQ+ TP  S+V     +EE+SV SKV +
Sbjct: 66   QRMKAEKATVDVLAILESNGISDDSEIFGSSSDQD-TPCESKVGKKTKQEESSVISKVTK 124

Query: 596  DDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRL 772
               EE SGS  + S    R+LSWK    SP SLE                      K   
Sbjct: 125  YKLEEHSGSGHDFSSSQGRNLSWKGRKHSPRSLEKCKDPSLRRRSSFASTSSSP--KHHQ 182

Query: 773  GKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEE 952
            GKSCRQ+R +E+R T    +     +D   NGVAT S+  PN SE   E+ R ++  GEE
Sbjct: 183  GKSCRQVRNKESRLTIGAFRTNPDKVDSPENGVATTSEVFPNCSEP--EVGRIEN--GEE 238

Query: 953  KA---VLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXF 1123
            K    +  GLEN ++  +     N +G D+DME+ALE QAQLI ++             F
Sbjct: 239  KTLPPISVGLENGQRADSNELEDNVYGSDRDMEKALEHQAQLIDRYKAMEKVQREWEEKF 298

Query: 1124 RDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNV 1303
            R+NN STPDS + GN+SDVTEE  EIK Q      T+ +Q    KSE E      ++ N+
Sbjct: 299  RENNGSTPDSYDAGNRSDVTEEGYEIKAQVQQHTGTVAAQSNRAKSEVE------KASNI 352

Query: 1304 YTSGFQSP-------MQDQKNKNTLAYELPATDFAFPSAKGTQNEQLEKY--SYHPPSH- 1453
              +G   P       +Q+ K+ +    E PA DFAF + K  QNE  E    +YHP  H 
Sbjct: 353  QPNGILRPSHVNIGQLQEWKSSSAPTSESPAQDFAFRAEKQKQNENEESLGNNYHPSPHS 412

Query: 1454 --------------------------------GETSGSRNENQALILHEAASNGLGGVLE 1537
                                            G+ SG +NE  AL+ H A SN LGGVL+
Sbjct: 413  SHDHPQSHSSHDSPGSQSATSFPSNTDSGFSKGQFSGRQNELYALVPHRA-SNELGGVLD 471

Query: 1538 ALQQAKLSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFE 1717
            AL+ A+ S+QQK++ LPL+E GS+    +PS      GDK+DIP+G AGLF++P DF  E
Sbjct: 472  ALKLARQSLQQKISTLPLIEGGSIRNSVDPSLPPPIPGDKVDIPLGNAGLFRLPFDFLAE 531

Query: 1718 PTPRANFLGSGS-----QVHPDTRV-------------AVTGSVLSSSSNQFVPNSYMDT 1843
             + R N   + +       +PDT V               TGS   ++       SY  T
Sbjct: 532  GSTRKNLDSTNAGLSLRNYYPDTGVPAAAINRFVSRFPTATGSRFPTADQFLASQSYSAT 591

Query: 1844 RLNVSNGDHRFLTNPLVETGSRVLDTR 1924
                   D +FL +  VE GSR+   R
Sbjct: 592  GSRFPTED-QFLASQDVEAGSRISSQR 617


>ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4
            [Glycine max]
          Length = 641

 Score =  348 bits (892), Expect = 7e-93
 Identities = 239/581 (41%), Positives = 320/581 (55%), Gaps = 43/581 (7%)
 Frame = +2

Query: 254  RKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAE 433
            R T+ MEDS AMT+E+LRARLLSERS+SR+A+QRA+ELAK+V +LE QLK V LQRK AE
Sbjct: 12   RVTSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQRKMAE 71

Query: 434  KATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEE-ETSVNSKVRRDDSEE 610
            KATADVLAILE+ GISD+SE FDS SD E  P  S VSN   +E E  ++SK R+  S++
Sbjct: 72   KATADVLAILESEGISDVSEEFDSGSDLE-NPCDSSVSNECAKEGEEPMSSKGRQHGSDK 130

Query: 611  FSGSELECSPF-PRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCR 787
              GS ++ SP   +SLSWK  +DS HSLE                      K R GKSCR
Sbjct: 131  MPGSNVDSSPVSSKSLSWKGRHDSSHSLE--KYKTSNLRRQSSFSSISSSPKHRQGKSCR 188

Query: 788  QIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLK 967
            +IR R+ R   E+S+N     +     +A+ S G PN S  GS I + +SEI EE     
Sbjct: 189  KIRHRQIRLVVEESRNKFANHEKE---LASLSKGFPNFSGGGSNIPKIESEIQEEGG--- 242

Query: 968  GLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNSTP 1147
               +     N   + +G+GR+KDME+ALE QAQLI Q+             FR+NN++TP
Sbjct: 243  ---SGANPLNKNHHVDGYGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTP 299

Query: 1148 DSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGI-----RKESRNVY-- 1306
            DSC+PGN SD+TE++DE K      A  +TS  QE K E   V +     + E+R++   
Sbjct: 300  DSCDPGNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGEPRGVCLSEEKFKAEARDIMPK 359

Query: 1307 TSGFQSPMQDQKNKNTLAYELPATDFAFPSAKGTQNE----------------------- 1417
            T        DQKN      +L     + P  KG QNE                       
Sbjct: 360  THDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYH 419

Query: 1418 -QLEKYSYHPPSHG---ETSGSRNENQ--ALILHEAASNGLGGVLEALQQAKLSIQQKLN 1579
                 YS+    HG   +   SRN+    AL+ HE   +   GVLE+L+QA++S+QQ+L 
Sbjct: 420  DSKPTYSFPTDIHGVQHQNDASRNKTDLFALVTHE-QPHKFNGVLESLKQARISLQQELK 478

Query: 1580 KLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANF----LGS 1747
            +LPL+ESG   K   PSA+  ++ D+ ++PVGC+GLF++PTDF    T R N      G 
Sbjct: 479  RLPLVESGYTAK---PSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDPTAGF 535

Query: 1748 GSQVHPDTRVAVTGSVLSSSSNQFVPN-SYMDTRLNVSNGD 1867
            GS  H +  ++ T      S  QF P+  Y DT+L++   D
Sbjct: 536  GSNFHLNRAMSRT------SDGQFFPSLPYPDTQLSLPAND 570


>ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Glycine max] gi|571568788|ref|XP_006606285.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X2
            [Glycine max] gi|571568792|ref|XP_006606286.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X3
            [Glycine max]
          Length = 664

 Score =  347 bits (889), Expect = 2e-92
 Identities = 238/579 (41%), Positives = 319/579 (55%), Gaps = 43/579 (7%)
 Frame = +2

Query: 260  TTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKA 439
            T+ MEDS AMT+E+LRARLLSERS+SR+A+QRA+ELAK+V +LE QLK V LQRK AEKA
Sbjct: 37   TSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQRKMAEKA 96

Query: 440  TADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEE-ETSVNSKVRRDDSEEFS 616
            TADVLAILE+ GISD+SE FDS SD E  P  S VSN   +E E  ++SK R+  S++  
Sbjct: 97   TADVLAILESEGISDVSEEFDSGSDLE-NPCDSSVSNECAKEGEEPMSSKGRQHGSDKMP 155

Query: 617  GSELECSPF-PRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQI 793
            GS ++ SP   +SLSWK  +DS HSLE                      K R GKSCR+I
Sbjct: 156  GSNVDSSPVSSKSLSWKGRHDSSHSLE--KYKTSNLRRQSSFSSISSSPKHRQGKSCRKI 213

Query: 794  RRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLKGL 973
            R R+ R   E+S+N     +     +A+ S G PN S  GS I + +SEI EE       
Sbjct: 214  RHRQIRLVVEESRNKFANHEKE---LASLSKGFPNFSGGGSNIPKIESEIQEEGG----- 265

Query: 974  ENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNSTPDS 1153
             +     N   + +G+GR+KDME+ALE QAQLI Q+             FR+NN++TPDS
Sbjct: 266  -SGANPLNKNHHVDGYGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTPDS 324

Query: 1154 CEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGI-----RKESRNVY--TS 1312
            C+PGN SD+TE++DE K      A  +TS  QE K E   V +     + E+R++   T 
Sbjct: 325  CDPGNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGEPRGVCLSEEKFKAEARDIMPKTH 384

Query: 1313 GFQSPMQDQKNKNTLAYELPATDFAFPSAKGTQNE------------------------Q 1420
                   DQKN      +L     + P  KG QNE                         
Sbjct: 385  DDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDS 444

Query: 1421 LEKYSYHPPSHG---ETSGSRNENQ--ALILHEAASNGLGGVLEALQQAKLSIQQKLNKL 1585
               YS+    HG   +   SRN+    AL+ HE   +   GVLE+L+QA++S+QQ+L +L
Sbjct: 445  KPTYSFPTDIHGVQHQNDASRNKTDLFALVTHE-QPHKFNGVLESLKQARISLQQELKRL 503

Query: 1586 PLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANF----LGSGS 1753
            PL+ESG   K   PSA+  ++ D+ ++PVGC+GLF++PTDF    T R N      G GS
Sbjct: 504  PLVESGYTAK---PSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDPTAGFGS 560

Query: 1754 QVHPDTRVAVTGSVLSSSSNQFVPN-SYMDTRLNVSNGD 1867
              H +  ++ T      S  QF P+  Y DT+L++   D
Sbjct: 561  NFHLNRAMSRT------SDGQFFPSLPYPDTQLSLPAND 593


>ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus]
          Length = 671

 Score =  345 bits (886), Expect = 3e-92
 Identities = 240/579 (41%), Positives = 329/579 (56%), Gaps = 42/579 (7%)
 Frame = +2

Query: 236  QARSEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSL 415
            Q + + R    +ED+ AMT+E+LRARLLSERSVS++ARQRA+ELAKRVAELE QLK VSL
Sbjct: 6    QDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSL 65

Query: 416  QRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKVRR 595
            QRK AEKATADVLAILE++G SD+SE  DS+SD E  P   +V + +  E+ S  +  RR
Sbjct: 66   QRKMAEKATADVLAILEDNGASDISETLDSNSDHETEP---KVEDGLAREDVSSGTVRRR 122

Query: 596  DDSEEFSGSELECSP-FPRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRL 772
            ++ EE+SGS ++ SP    SLSWK  NDSPH+ E                      K +L
Sbjct: 123  NEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTRE-KYKKHSIRSRSSFTSIGSSSPKHQL 181

Query: 773  GKSCRQIRRREARS-TAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGE 949
            G+SCRQI+RR+ R    E        +D S    +T  +   N S  G  I R+  E+ E
Sbjct: 182  GRSCRQIKRRDTRPLDGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDGYEVRE 241

Query: 950  E-KAVLKGLENQRKVTNAVQYHNGHGRDK--DMERALESQAQLIGQHXXXXXXXXXXXXX 1120
            + ++   G+ N   V N+ Q ++  G +K  DME+AL+ QAQLI Q+             
Sbjct: 242  KTRSSSSGVHN--SVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEK 299

Query: 1121 FRDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAE-DVGIRKESR 1297
            FR+NNNSTPDSC+PGN SD+TEERDE++ QA +L+         P +EA+  V    ++R
Sbjct: 300  FRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLS-------NNPANEAKPQVAFDCDTR 352

Query: 1298 NV---YTSGFQSPM--------QDQKNKNTLAYELPATDFAFPSAKGTQNEQLEKYSYHP 1444
            ++    T+G    M        QDQ N N+++      +F FP A   Q ++ ++ S   
Sbjct: 353  DLSQAQTNGLGPSMCAVDVEDLQDQ-NTNSISTSKSLEEFTFPMANVKQCQESQENSAQE 411

Query: 1445 PS------HG-----------------ETSGSRNENQALILHEAASNGLGGVLEALQQAK 1555
            PS      HG                 ET  S N+  AL+ HE  +  L GVLEAL+QAK
Sbjct: 412  PSCTSHLNHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHEPPA--LDGVLEALKQAK 469

Query: 1556 LSIQQKLNKLPLL--ESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPR 1729
            LS+ +K+ KLP +  ES S+ K   P +   + GD+L+IPVGCAGLF++PTDF  E + +
Sbjct: 470  LSLTKKIIKLPSVDGESESIDKSIGPLSIP-KMGDRLEIPVGCAGLFRLPTDFAAEASSQ 528

Query: 1730 ANFLGSGSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTR 1846
            ANFL S SQ+   T     G+ L S+++Q  P   M+ R
Sbjct: 529  ANFLASSSQLRSPTHYPGEGAAL-SANHQIFPGHEMEDR 566


>emb|CBI40233.3| unnamed protein product [Vitis vinifera]
          Length = 682

 Score =  338 bits (868), Expect = 4e-90
 Identities = 201/407 (49%), Positives = 243/407 (59%), Gaps = 12/407 (2%)
 Frame = +2

Query: 269  MEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKATAD 448
            MEDS AMT+E+LRARLLSERSVSRTARQRA+ELA+RV +LE QLK VS+QR KAEKATAD
Sbjct: 1    MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60

Query: 449  VLAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKVRRDDSEEFSGSEL 628
            VLAILENH ISD+S  FDSSSDQEV    S V                            
Sbjct: 61   VLAILENHAISDVSWEFDSSSDQEVALCDSHVGGG------------------------- 95

Query: 629  ECSPFPRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQIRRREA 808
                  R LSWKSS DS HS+E                      K  LGKSCRQIRRRE 
Sbjct: 96   ------RRLSWKSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRRET 149

Query: 809  RSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLKG-----L 973
            RS  ++ K G + +D  +NG+ + S+GLPNG ++G EI RE SE  EE+A++ G     L
Sbjct: 150  RSAVDELKVGRVMVDSQNNGIISSSEGLPNGFDSGQEILREGSENQEEEALMDGQVSDSL 209

Query: 974  ENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNSTPDS 1153
            E+QR  T +  + N +GRD+DMERALE QAQLIGQ+             FR+NN+STPDS
Sbjct: 210  ESQRDATGSNHHLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSSTPDS 269

Query: 1154 CEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKES-------RNVYTS 1312
            CEPGN SDVTEERDE+KPQA S A  +TSQDQ  K + EDV   +ES          +  
Sbjct: 270  CEPGNHSDVTEERDEVKPQAPSAAGILTSQDQGTKLDDEDVHFNEESSQTLPTISTTHLH 329

Query: 1313 GFQSPMQDQKNKNTLAYELPATDFAFPSAKGTQNEQLEKYSYHPPSH 1453
            G    +Q+Q   + LAYE  A DF FP AK   +++  +   +P SH
Sbjct: 330  GDMECLQEQNRCSMLAYESLAPDFVFPMAKENLHQEFLENQSYPLSH 376



 Score =  139 bits (351), Expect = 4e-30
 Identities = 84/181 (46%), Positives = 111/181 (61%)
 Frame = +2

Query: 1448 SHGETSGSRNENQALILHEAASNGLGGVLEALQQAKLSIQQKLNKLPLLESGSVGKPFEP 1627
            S GE+S S++++ AL+  E  SN LGGVLEALQQA+LS+Q KLN+LPL+E GS+G+  EP
Sbjct: 459  SKGESSRSQDKHYALVPRET-SNELGGVLEALQQARLSLQHKLNRLPLIEGGSIGRAIEP 517

Query: 1628 SAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANFLGSGSQVHPDTRVAVTGSVLSSS 1807
            S  + RA ++++IPVGCAGLF+VP D++      ANFLGS SQ                 
Sbjct: 518  SFPSTRAWERVEIPVGCAGLFRVPADYQLGTATEANFLGSDSQ----------------- 560

Query: 1808 SNQFVPNSYMDTRLNVSNGDHRFLTNPLVETGSRVLDTRLNVSNGDHRFLTNPLVETGSR 1987
                + N Y DT    + GD RFLT+P ++TGS V          D  FLT+P  ETGSR
Sbjct: 561  --SSLKNYYPDTGFVANPGD-RFLTSPYLKTGSSVPT--------DDSFLTSPYRETGSR 609

Query: 1988 V 1990
            +
Sbjct: 610  I 610


>ref|XP_004496183.1| PREDICTED: uncharacterized protein LOC101514253 isoform X2 [Cicer
            arietinum] gi|502118270|ref|XP_004496184.1| PREDICTED:
            uncharacterized protein LOC101514253 isoform X3 [Cicer
            arietinum] gi|502118272|ref|XP_004496185.1| PREDICTED:
            uncharacterized protein LOC101514253 isoform X4 [Cicer
            arietinum]
          Length = 660

 Score =  334 bits (856), Expect = 1e-88
 Identities = 245/614 (39%), Positives = 335/614 (54%), Gaps = 52/614 (8%)
 Frame = +2

Query: 245  SEIRKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRK 424
            S  R T+ MEDS +MT+E+LRARLL+ERS+SR+ARQR  EL K+VAELE QL+ V+LQRK
Sbjct: 8    SVTRVTSCMEDSTSMTIEFLRARLLAERSISRSARQRTAELEKKVAELEEQLRTVTLQRK 67

Query: 425  KAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVRRDD 601
             AEKATADVLAILE+ GISDLSE  DS SD ++ P  S VSN S  E E   +SK RR +
Sbjct: 68   MAEKATADVLAILEDQGISDLSEELDSGSDIDI-PYESGVSNESSKEGERYRSSKERRHE 126

Query: 602  SEEFSGSE-LECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLG 775
            S+E   S  ++ SP   RSLSWK  +DSP SLE                      K   G
Sbjct: 127  SDELYDSHVVDSSPVSNRSLSWKGRHDSPRSLE--KYKTSNIRRRNSFSSVSSSPKHHQG 184

Query: 776  KSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEK 955
            KSCR+IR R+ RS  E+S++ S+  +   N   + S+G PN S  GS I R +S+I    
Sbjct: 185  KSCRKIRHRQNRSVVEESRDKSVKDNFQENDFVSSSEGYPNRSVDGSNILRIESKI---- 240

Query: 956  AVLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNN 1135
              L+G E++  + N   + +  GR +DME+ALE QAQLI +              FR+NN
Sbjct: 241  --LEGDESEVNLVNKNHHVDRCGRKEDMEKALEHQAQLIDRFGAMEKAQREWEEKFRENN 298

Query: 1136 NS-TPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSE------AEDVGIRKES 1294
            NS TPDSC+PGN SD+TE+++E K Q    +  +TS  QE K+E      +E++  + E+
Sbjct: 299  NSTTPDSCDPGNHSDMTEDKEESKAQIPYSSKAVTSNAQEDKAEPGGVRSSEEI-FKSEA 357

Query: 1295 RNVYTSGFQSPMQDQKNKNTLAY---ELPATDFAFPSAKGTQNE-------QLEKYSYHP 1444
            R+V    +     D  N+N+  +    L   +       G Q E       Q  + +YH 
Sbjct: 358  RDVMPKSYDD-TSDYNNQNSPTFRTSNLLGQENLHSPLNGNQTESSVNSHPQSSEVNYHD 416

Query: 1445 P------------------SHG---ETSGSRNENQ--ALILHEAASNGLGGVLEALQQAK 1555
            P                   HG   +   SRN+N   AL+  E  S+   G+LE+L+QA+
Sbjct: 417  PHGRGYPDSKPTLSFPKYIQHGSLHQNDSSRNKNDLYALVFRE-QSHEFNGILESLKQAR 475

Query: 1556 LSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRAN 1735
            LS+QQ+LN+LPL+ES   G   +PSA   ++  + DIPVG +GLF++PTDF  E T R  
Sbjct: 476  LSLQQELNRLPLVESSHKG--IKPSAFVGKSEGRFDIPVGFSGLFRLPTDFSDEATSRFG 533

Query: 1736 FL----GSGSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDH----RFLTN-P 1888
                  G GS  + + R         +S  QFV N Y  TR+++S  D     R+L N P
Sbjct: 534  VRDSAGGFGSNFYHNNR-----GTSRTSDVQFVANPYYGTRMSLSANDQAHTTRYLENGP 588

Query: 1889 LVETGSRVLDTRLN 1930
            + ++     D  LN
Sbjct: 589  ISDSKKTPFDPFLN 602


>ref|XP_004496182.1| PREDICTED: uncharacterized protein LOC101514253 isoform X1 [Cicer
            arietinum]
          Length = 663

 Score =  333 bits (855), Expect = 1e-88
 Identities = 244/611 (39%), Positives = 334/611 (54%), Gaps = 52/611 (8%)
 Frame = +2

Query: 254  RKTTLMEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAE 433
            R T+ MEDS +MT+E+LRARLL+ERS+SR+ARQR  EL K+VAELE QL+ V+LQRK AE
Sbjct: 14   RVTSCMEDSTSMTIEFLRARLLAERSISRSARQRTAELEKKVAELEEQLRTVTLQRKMAE 73

Query: 434  KATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVRRDDSEE 610
            KATADVLAILE+ GISDLSE  DS SD ++ P  S VSN S  E E   +SK RR +S+E
Sbjct: 74   KATADVLAILEDQGISDLSEELDSGSDIDI-PYESGVSNESSKEGERYRSSKERRHESDE 132

Query: 611  FSGSE-LECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSC 784
               S  ++ SP   RSLSWK  +DSP SLE                      K   GKSC
Sbjct: 133  LYDSHVVDSSPVSNRSLSWKGRHDSPRSLE--KYKTSNIRRRNSFSSVSSSPKHHQGKSC 190

Query: 785  RQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVL 964
            R+IR R+ RS  E+S++ S+  +   N   + S+G PN S  GS I R +S+I      L
Sbjct: 191  RKIRHRQNRSVVEESRDKSVKDNFQENDFVSSSEGYPNRSVDGSNILRIESKI------L 244

Query: 965  KGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNS- 1141
            +G E++  + N   + +  GR +DME+ALE QAQLI +              FR+NNNS 
Sbjct: 245  EGDESEVNLVNKNHHVDRCGRKEDMEKALEHQAQLIDRFGAMEKAQREWEEKFRENNNST 304

Query: 1142 TPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSE------AEDVGIRKESRNV 1303
            TPDSC+PGN SD+TE+++E K Q    +  +TS  QE K+E      +E++  + E+R+V
Sbjct: 305  TPDSCDPGNHSDMTEDKEESKAQIPYSSKAVTSNAQEDKAEPGGVRSSEEI-FKSEARDV 363

Query: 1304 YTSGFQSPMQDQKNKNTLAY---ELPATDFAFPSAKGTQNE-------QLEKYSYHPP-- 1447
                +     D  N+N+  +    L   +       G Q E       Q  + +YH P  
Sbjct: 364  MPKSYDD-TSDYNNQNSPTFRTSNLLGQENLHSPLNGNQTESSVNSHPQSSEVNYHDPHG 422

Query: 1448 ----------------SHG---ETSGSRNENQ--ALILHEAASNGLGGVLEALQQAKLSI 1564
                             HG   +   SRN+N   AL+  E  S+   G+LE+L+QA+LS+
Sbjct: 423  RGYPDSKPTLSFPKYIQHGSLHQNDSSRNKNDLYALVFRE-QSHEFNGILESLKQARLSL 481

Query: 1565 QQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANFL- 1741
            QQ+LN+LPL+ES   G   +PSA   ++  + DIPVG +GLF++PTDF  E T R     
Sbjct: 482  QQELNRLPLVESSHKG--IKPSAFVGKSEGRFDIPVGFSGLFRLPTDFSDEATSRFGVRD 539

Query: 1742 ---GSGSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDH----RFLTN-PLVE 1897
               G GS  + + R         +S  QFV N Y  TR+++S  D     R+L N P+ +
Sbjct: 540  SAGGFGSNFYHNNR-----GTSRTSDVQFVANPYYGTRMSLSANDQAHTTRYLENGPISD 594

Query: 1898 TGSRVLDTRLN 1930
            +     D  LN
Sbjct: 595  SKKTPFDPFLN 605


>ref|XP_004496186.1| PREDICTED: uncharacterized protein LOC101514253 isoform X5 [Cicer
            arietinum]
          Length = 645

 Score =  330 bits (847), Expect = 1e-87
 Identities = 242/606 (39%), Positives = 331/606 (54%), Gaps = 52/606 (8%)
 Frame = +2

Query: 269  MEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKATAD 448
            MEDS +MT+E+LRARLL+ERS+SR+ARQR  EL K+VAELE QL+ V+LQRK AEKATAD
Sbjct: 1    MEDSTSMTIEFLRARLLAERSISRSARQRTAELEKKVAELEEQLRTVTLQRKMAEKATAD 60

Query: 449  VLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVRRDDSEEFSGSE 625
            VLAILE+ GISDLSE  DS SD ++ P  S VSN S  E E   +SK RR +S+E   S 
Sbjct: 61   VLAILEDQGISDLSEELDSGSDIDI-PYESGVSNESSKEGERYRSSKERRHESDELYDSH 119

Query: 626  -LECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQIRR 799
             ++ SP   RSLSWK  +DSP SLE                      K   GKSCR+IR 
Sbjct: 120  VVDSSPVSNRSLSWKGRHDSPRSLE--KYKTSNIRRRNSFSSVSSSPKHHQGKSCRKIRH 177

Query: 800  REARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLKGLEN 979
            R+ RS  E+S++ S+  +   N   + S+G PN S  GS I R +S+I      L+G E+
Sbjct: 178  RQNRSVVEESRDKSVKDNFQENDFVSSSEGYPNRSVDGSNILRIESKI------LEGDES 231

Query: 980  QRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNS-TPDSC 1156
            +  + N   + +  GR +DME+ALE QAQLI +              FR+NNNS TPDSC
Sbjct: 232  EVNLVNKNHHVDRCGRKEDMEKALEHQAQLIDRFGAMEKAQREWEEKFRENNNSTTPDSC 291

Query: 1157 EPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSE------AEDVGIRKESRNVYTSGF 1318
            +PGN SD+TE+++E K Q    +  +TS  QE K+E      +E++  + E+R+V    +
Sbjct: 292  DPGNHSDMTEDKEESKAQIPYSSKAVTSNAQEDKAEPGGVRSSEEI-FKSEARDVMPKSY 350

Query: 1319 QSPMQDQKNKNTLAY---ELPATDFAFPSAKGTQNE-------QLEKYSYHPP------- 1447
                 D  N+N+  +    L   +       G Q E       Q  + +YH P       
Sbjct: 351  DD-TSDYNNQNSPTFRTSNLLGQENLHSPLNGNQTESSVNSHPQSSEVNYHDPHGRGYPD 409

Query: 1448 -----------SHG---ETSGSRNENQ--ALILHEAASNGLGGVLEALQQAKLSIQQKLN 1579
                        HG   +   SRN+N   AL+  E  S+   G+LE+L+QA+LS+QQ+LN
Sbjct: 410  SKPTLSFPKYIQHGSLHQNDSSRNKNDLYALVFRE-QSHEFNGILESLKQARLSLQQELN 468

Query: 1580 KLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANFL----GS 1747
            +LPL+ES   G   +PSA   ++  + DIPVG +GLF++PTDF  E T R        G 
Sbjct: 469  RLPLVESSHKG--IKPSAFVGKSEGRFDIPVGFSGLFRLPTDFSDEATSRFGVRDSAGGF 526

Query: 1748 GSQVHPDTRVAVTGSVLSSSSNQFVPNSYMDTRLNVSNGDH----RFLTN-PLVETGSRV 1912
            GS  + + R         +S  QFV N Y  TR+++S  D     R+L N P+ ++    
Sbjct: 527  GSNFYHNNR-----GTSRTSDVQFVANPYYGTRMSLSANDQAHTTRYLENGPISDSKKTP 581

Query: 1913 LDTRLN 1930
             D  LN
Sbjct: 582  FDPFLN 587


>ref|XP_007143822.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris]
            gi|561017012|gb|ESW15816.1| hypothetical protein
            PHAVU_007G104500g [Phaseolus vulgaris]
          Length = 652

 Score =  325 bits (833), Expect = 5e-86
 Identities = 219/580 (37%), Positives = 316/580 (54%), Gaps = 34/580 (5%)
 Frame = +2

Query: 272  EDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKATADV 451
            EDS AMT+E+LRARLLSERS+S++ARQRA+ELA++V ELE QL+ V LQRK AEKATADV
Sbjct: 18   EDSTAMTIEFLRARLLSERSISKSARQRADELAEKVMELEEQLRMVILQRKMAEKATADV 77

Query: 452  LAILENHGISDLSEAFDSSSDQEVTPPGSRVSNSMNEEETSVNSKVRRDDSEEFSGSELE 631
            LAILE+ GIS +S+ FDS SD E     S  +    E+E  + SK R+  S+E SGS  +
Sbjct: 78   LAILESQGISGVSDEFDSGSDLENPFDSSMSNECAKEDEGPMKSKGRQHGSDEMSGSNED 137

Query: 632  CS-PFPRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQIRRREA 808
             S    +SLSWK  +D  HSLE                      K RLGKSCR+IR R+ 
Sbjct: 138  SSLVSSKSLSWKGRHDLSHSLEKYKTKSTNVRRQSSFSSFSSSPKHRLGKSCRKIRHRQP 197

Query: 809  RSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLKGLENQRK 988
            RS  E+S+   + ++   N + + S+G PN  + GS I + +S+I EE        ++  
Sbjct: 198  RSVMEESRGKFVHVNCQVNELVSSSEGFPNFRDGGSNILKIESKIQEEDG------SEAN 251

Query: 989  VTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNSTPDSCEPGN 1168
            + +   + +G+GR+ +ME+ALE QA+LI Q+             FR+NN++TPDSC+PGN
Sbjct: 252  LLSKNHHIDGYGRENEMEKALEHQAELIDQYEAMEKAQREWEEKFRENNSTTPDSCDPGN 311

Query: 1169 QSDVTEERDEIKPQAASLAVTITSQDQEPKSEAEDVGIRKESRNVYTSGFQSPMQDQKN- 1345
             SD+TE++DE K Q    A  +TS+ +E K E   V + +E              D  + 
Sbjct: 312  HSDMTEDKDEGKVQIPYAAKVVTSKAEESKGEPGGVCLSEEKLKAEGREIMPKKHDDTDV 371

Query: 1346 -KNTLAYELPATDF-----AFPSAKGTQNE----------------QLEKYSYHPPSHG- 1456
             +N  +     +DF     +    KG QNE                Q    S+    HG 
Sbjct: 372  YRNQKSTTFSTSDFLGQENSHSPLKGNQNEILVNGHSQSSDMNHLDQGRHSSFPTDIHGV 431

Query: 1457 ----ETSGSRNENQALILHEAASNGLGGVLEALQQAKLSIQQKLNKLPLLESGSVGKPFE 1624
                + S ++ +  AL+  E  S+   GVLE+L+QA++S+QQ+LN+LP++E G   KP  
Sbjct: 432  QHQHDASKNQKDLYALVTRE-QSHQFDGVLESLKQARISLQQELNRLPVVEGGYTAKPL- 489

Query: 1625 PSAAAIRAGDKLDIPVGCAGLFKVPTDFKFEPTPRANF----LGSGSQVHPDTRVAVTGS 1792
            PS +  +  D+ +IP G +GLF++PTDF  E TPR N      G GS  H      + G+
Sbjct: 490  PSVS--KNEDRFEIPFGFSGLFRLPTDFSDEATPRFNVRDPTTGFGSNYH------LNGT 541

Query: 1793 VLSSSSNQFVPNSYMDTRLNVS-NGDHRFLTNPLVETGSR 1909
            +  +S  QF  N     ++ +S + + + L    +E GSR
Sbjct: 542  MSRTSVGQFFTNPPHSGKMLMSPSANDQALATRYLENGSR 581


>gb|EYU19796.1| hypothetical protein MIMGU_mgv1a003492mg [Mimulus guttatus]
          Length = 581

 Score =  308 bits (790), Expect = 5e-81
 Identities = 220/526 (41%), Positives = 284/526 (53%), Gaps = 14/526 (2%)
 Frame = +2

Query: 269  MEDSNAMTVEYLRARLLSERSVSRTARQRAEELAKRVAELEAQLKAVSLQRKKAEKATAD 448
            ME+SNAMT+E+LRARLLSERSVS++ARQRA+EL+KRVAEL  QL  VSLQRKKAEKATAD
Sbjct: 1    MEESNAMTIEFLRARLLSERSVSKSARQRADELSKRVAELTEQLNFVSLQRKKAEKATAD 60

Query: 449  VLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN-SMNEEETSVNSKVRRDDSEEFSGSE 625
            VLA+LENHGISD+SE FDS S+Q+ +P   +  N S+  +ETS N K R++++E +S SE
Sbjct: 61   VLAMLENHGISDVSEEFDSCSEQDESPHELKARNSSLVIQETSTNHKPRKNETEAYSSSE 120

Query: 626  LECSPF--PRSLSWKSSNDSPHSLEXXXXXXXXXXXXXXXXXXXXXAKQRLGKSCRQIRR 799
            +E  P    RSLSWKS+ D P                         + +R GKSCR+IR 
Sbjct: 121  IESCPSIGSRSLSWKSTKD-PQRHSPEKKKYIDSVRRRTSFSSNGSSAKRAGKSCRRIRH 179

Query: 800  REARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGSETGSEIRREDSEIGEEKAVLKGLEN 979
            RE RS  E+ +N       +S  V  CS      + T S + R ++E  E          
Sbjct: 180  RETRS-IEELQNVDTEKAVNSRDVCNCSSNGEPVALTESPVLRSNNEAQE---------- 228

Query: 980  QRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHXXXXXXXXXXXXXFRDNNNS--TPDS 1153
                +N   Y NG     DME AL+ QAQLIGQ+             FR+NNNS  T DS
Sbjct: 229  ----SNIGHYFNG-----DMESALQHQAQLIGQYEEEEKAQREWEDKFRENNNSGGTQDS 279

Query: 1154 CEPGNQSDVTEERDEIKPQAASLA-VTITSQDQEPKSEAEDVGIRKESRNVYTSGFQSPM 1330
            C+PGN SDVTEE  E+KP   S A  T+ + +QE K E +   I K    V     +   
Sbjct: 280  CDPGNHSDVTEELYEMKPPKQSFASETVCTDNQETKQEPQ---ISKSLPPVTYDNHKVNS 336

Query: 1331 QDQKNKNTLAYELPATDFAFPSAK-GTQNEQLEK------YSYHPPSHGETSGSRNENQA 1489
            Q+QK    L  E  AT+F+FP++K  + N+  EK         HP     +S SR   + 
Sbjct: 337  QEQK----LVGESSATEFSFPTSKEKSDNDSSEKQHEASALRTHPSLQLSSSSSR---EL 389

Query: 1490 LILHEAASNGLGGVLEALQQAKLSIQQKLNKLPLLESGSV-GKPFEPSAAAIRAGDKLDI 1666
             I+    SN LG VLEALQ+AKLS+ QKLN LP    G+      +PS       D   I
Sbjct: 390  SIMPRETSNNLGSVLEALQRAKLSLNQKLNNLPPSAGGATSSSAVKPSNLETDKVDSWRI 449

Query: 1667 PVGCAGLFKVPTDFKFEPTPRANFLGSGSQVHPDTRVAVTGSVLSS 1804
            P+   GLF++P D++FE        G     H   R  +T  +  S
Sbjct: 450  PICSPGLFRLPIDYQFEANNPRALSGDSFLTHVTNRPFITPEIQRS 495


>ref|XP_006606288.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X5
            [Glycine max]
          Length = 595

 Score =  292 bits (747), Expect = 4e-76
 Identities = 210/542 (38%), Positives = 284/542 (52%), Gaps = 43/542 (7%)
 Frame = +2

Query: 371  KRVAELEAQLKAVSLQRKKAEKATADVLAILENHGISDLSEAFDSSSDQEVTPPGSRVSN 550
            K+V +LE QLK V LQRK AEKATADVLAILE+ GISD+SE FDS SD E  P  S VSN
Sbjct: 5    KKVMDLEEQLKTVILQRKMAEKATADVLAILESEGISDVSEEFDSGSDLE-NPCDSSVSN 63

Query: 551  SMNEE-ETSVNSKVRRDDSEEFSGSELECSPFP-RSLSWKSSNDSPHSLEXXXXXXXXXX 724
               +E E  ++SK R+  S++  GS ++ SP   +SLSWK  +DS HSLE          
Sbjct: 64   ECAKEGEEPMSSKGRQHGSDKMPGSNVDSSPVSSKSLSWKGRHDSSHSLEKYKTSNLRRQ 123

Query: 725  XXXXXXXXXXXAKQRLGKSCRQIRRREARSTAEDSKNGSLTLDPSSNGVATCSDGLPNGS 904
                        K R GKSCR+IR R+ R   E+S+N     +     +A+ S G PN S
Sbjct: 124  SSFSSISSSP--KHRQGKSCRKIRHRQIRLVVEESRNKFANHEKE---LASLSKGFPNFS 178

Query: 905  ETGSEIRREDSEIGEEKAVLKGLENQRKVTNAVQYHNGHGRDKDMERALESQAQLIGQHX 1084
              GS I + +SEI EE        +     N   + +G+GR+KDME+ALE QAQLI Q+ 
Sbjct: 179  GGGSNIPKIESEIQEEGG------SGANPLNKNHHVDGYGREKDMEKALEHQAQLIDQYE 232

Query: 1085 XXXXXXXXXXXXFRDNNNSTPDSCEPGNQSDVTEERDEIKPQAASLAVTITSQDQEPKSE 1264
                        FR+NN++TPDSC+PGN SD+TE++DE K      A  +TS  QE K E
Sbjct: 233  AMEKVQREWEEKFRENNSTTPDSCDPGNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGE 292

Query: 1265 AEDVGI-----RKESRNVY--TSGFQSPMQDQKNKNTLAYELPATDFAFPSAKGTQNE-- 1417
               V +     + E+R++   T        DQKN      +L     + P  KG QNE  
Sbjct: 293  PRGVCLSEEKFKAEARDIMPKTHDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESS 352

Query: 1418 ----------------------QLEKYSYHPPSHG---ETSGSRNENQ--ALILHEAASN 1516
                                      YS+    HG   +   SRN+    AL+ HE   +
Sbjct: 353  VNGHFQPSVMNHQDPGRHGYHDSKPTYSFPTDIHGVQHQNDASRNKTDLFALVTHE-QPH 411

Query: 1517 GLGGVLEALQQAKLSIQQKLNKLPLLESGSVGKPFEPSAAAIRAGDKLDIPVGCAGLFKV 1696
               GVLE+L+QA++S+QQ+L +LPL+ESG   K   PSA+  ++ D+ ++PVGC+GLF++
Sbjct: 412  KFNGVLESLKQARISLQQELKRLPLVESGYTAK---PSASFSKSEDRFEVPVGCSGLFRI 468

Query: 1697 PTDFKFEPTPRANF----LGSGSQVHPDTRVAVTGSVLSSSSNQFVPN-SYMDTRLNVSN 1861
            PTDF    T R N      G GS  H +  ++ T      S  QF P+  Y DT+L++  
Sbjct: 469  PTDFSDGATARFNVKDPTAGFGSNFHLNRAMSRT------SDGQFFPSLPYPDTQLSLPA 522

Query: 1862 GD 1867
             D
Sbjct: 523  ND 524


Top