BLASTX nr result

ID: Ephedra28_contig00006881 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00006881
         (1789 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ16104.1| hypothetical protein PRUPE_ppa000399mg [Prunus pe...   196   3e-47
gb|EXB72261.1| hypothetical protein L484_009144 [Morus notabilis]     177   2e-41
ref|XP_006482303.1| PREDICTED: putative nuclear matrix constitue...   169   3e-39
ref|XP_006430826.1| hypothetical protein CICLE_v10013467mg [Citr...   168   8e-39
emb|CAN74990.1| hypothetical protein VITISV_008657 [Vitis vinifera]   167   1e-38
ref|XP_002278531.2| PREDICTED: putative nuclear matrix constitue...   165   7e-38
emb|CAN74873.1| hypothetical protein VITISV_038920 [Vitis vinifera]   165   7e-38
ref|XP_006849769.1| hypothetical protein AMTR_s00024p00252300 [A...   164   2e-37
ref|XP_006373467.1| hypothetical protein POPTR_0017s14050g [Popu...   158   8e-36
ref|XP_002329317.1| predicted protein [Populus trichocarpa] gi|5...   158   8e-36
gb|EOY04287.1| Nuclear matrix constituent protein 1-like protein...   156   3e-35
gb|EOY04286.1| Nuclear matrix constituent protein 1-like protein...   156   3e-35
ref|XP_004169820.1| PREDICTED: LOW QUALITY PROTEIN: putative nuc...   155   4e-35
ref|XP_002525969.1| DNA double-strand break repair rad50 ATPase,...   155   7e-35
ref|XP_004141494.1| PREDICTED: putative nuclear matrix constitue...   154   1e-34
ref|XP_003520054.1| PREDICTED: putative nuclear matrix constitue...   153   3e-34
ref|XP_006574886.1| PREDICTED: putative nuclear matrix constitue...   152   3e-34
gb|EOY02173.1| Nuclear matrix constituent protein-related, putat...   152   6e-34
ref|XP_006300299.1| hypothetical protein CARUB_v10019693mg [Caps...   151   1e-33
gb|EOY02176.1| Nuclear matrix constituent protein-related, putat...   150   1e-33

>gb|EMJ16104.1| hypothetical protein PRUPE_ppa000399mg [Prunus persica]
          Length = 1208

 Score =  196 bits (498), Expect = 3e-47
 Identities = 153/547 (27%), Positives = 269/547 (49%), Gaps = 47/547 (8%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E+++++EKW   EE+RLK E++  ++ ++++ + L+L +E+FE H+E +++ L E  + E
Sbjct: 578  EQKEEVEKWKHVEEERLKSEKVMAQDHIQREQDDLKLAKESFEAHMEHEKSVLDEKAQSE 637

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
             + +L ++E    E + DM+ R EEM+K L E+E  F +E+ERE+  ++  +E+ +R++E
Sbjct: 638  RSQMLHELETRKRELEIDMQNRLEEMEKPLREREKSFAEERERELDNVNYLREVARREME 697

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            ++K++                  E++  EI+ DI EL +  +KL++QR+  +KE+E  IS
Sbjct: 698  EIKVERLKIEKEREEADANKEHLERQHIEIRKDIDELLDLSQKLRDQREQFIKERESFIS 757

Query: 544  QCDQLKRLEN--EL--NIVDCDLKQFNEAHSNTQITPFDKAGPS---------------- 663
              ++ K   N  E+    V  +L+   E   N ++ P  + G                  
Sbjct: 758  FIEKFKSCTNCGEMISEFVLSNLRPLAEI-ENAEVIPPPRLGDDYLKGGFNENLAQRQNN 816

Query: 664  ------DSK---ASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEG-----QNLISA 801
                  DS+   + G +SW+++C SK+FN SP   K+   + +   NE      QN+ ++
Sbjct: 817  EISLGIDSRSPVSGGTISWLRKCTSKIFNLSPGK-KIEFGSPQNLANEAPFSGEQNVEAS 875

Query: 802  EVVSGLEVEKEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNF 981
            +   G+E E E +  +  +      D   V  +  +   EAV       H+  +  + + 
Sbjct: 876  KRGCGIENEAELSFGVASDSF----DVQRVQSDNRIREVEAVQYPSPDEHSNMNSEAPDL 931

Query: 982  DQTNLASSDASGAMSKDKSKG-----KVFKRTRSIKAVVEDAKAILESSV---DKEMSDG 1137
             + +   SD  G   K   +G        KRTRS+KAVV+DAKAIL  +    D E ++G
Sbjct: 932  PEDS-QPSDLKGGCQKPSRRGGRRGRPAVKRTRSVKAVVKDAKAILGEAFETNDSEYANG 990

Query: 1138 DQLKDQIDAVVE--GGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGR--KSRVSVEP 1305
               +D +D   E  GG                    +DK S++ GRKRGR   S+++V  
Sbjct: 991  -TAEDSVDMHTESHGGSSL-----------------ADKRSARNGRKRGRAQTSQIAVS- 1031

Query: 1306 DPEDAETQSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRR-HTATSSMAPQAVSTK 1482
              +D+E +S+  +G ++++RR+  +  +      PG  RYNLRR  T  +  A  A    
Sbjct: 1032 GGDDSEGRSDSVMGAQRKKRREKVIPAEQ----APGESRYNLRRPKTGVTVAAASASRDL 1087

Query: 1483 EDDNAAE 1503
              DN  E
Sbjct: 1088 VKDNEEE 1094


>gb|EXB72261.1| hypothetical protein L484_009144 [Morus notabilis]
          Length = 1203

 Score =  177 bits (448), Expect = 2e-41
 Identities = 155/558 (27%), Positives = 247/558 (44%), Gaps = 39/558 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E++++ EK    EE+RLK E+   ++ + ++ E L L RE+F  + E ++  L+E  + E
Sbjct: 586  EQKEEFEKLKEIEEERLKNEKAAAQDHIRREQEELNLARESFSAYTEHEKTLLAEKEKSE 645

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
             + ++   E    E +TDM+ R EE++K L EKE  F++E++RE+  I+  +++ +RD+E
Sbjct: 646  RSQMIHDYEVRKRELETDMQNRLEEIEKPLREKEKSFEEERKRELDNINYLRDVARRDME 705

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            ++K +                  E+   EI+ DI EL +   KL++QR+  +KE+E  IS
Sbjct: 706  ELKFERLKIEKERHEADTNKEHLERHRVEIRKDIEELFDLSNKLKDQREQFIKERERFIS 765

Query: 544  QCDQLKRLENELNIVD----CDLKQFNEAHSNTQITPFDKAG------------------ 657
              D+LK   N   IV      DL+   E   N ++ P  K                    
Sbjct: 766  FVDELKGCNNCSEIVSEFVLSDLRSLVEI-ENVEVLPMPKLADYAKGGVIGDLAASKKPS 824

Query: 658  -----PSDSKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGLE 822
                 P    + G +SW+++C +K+F    SPGK SE+   +      NL   E   G  
Sbjct: 825  SDTFDPKSPVSGGTMSWLRKCTTKIFKL--SPGKKSESTSVR------NLAEEEPFLG-- 874

Query: 823  VEKEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQTNLAS 1002
               EH    P  ++  ++ EAE+         ++   + ++R T   +     D +N+ S
Sbjct: 875  ---EHNLEEPPKKVLSSEIEAEL---SFAAASDSFDVQASIRETEAGQDPSADDVSNINS 928

Query: 1003 -----------SDASGAMSKD-KSKGKVFKRTRSIKAVVEDAKAILESSVDKEMSDGDQL 1146
                       SD  G   +  + KGKV  RT S++AVVEDAKA+L          G+ L
Sbjct: 929  QGPEAPEDSQPSDLKGEKKRPRRGKGKV-SRTLSVEAVVEDAKALL----------GEDL 977

Query: 1147 KDQIDAVVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGRKSRVSVEPDPEDAET 1326
            K        G                   E     + + GR R  ++ VS E D  D+E 
Sbjct: 978  KLNDGGYQNGNAEDSANTNAGSQGGSIIAEKKPFYARKRGRPRTSQATVS-EHDGYDSEE 1036

Query: 1327 QSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAES 1506
            +SE   G RKR R     D   T    P  RRYNLRR  +  + AP           A  
Sbjct: 1037 RSE--AGRRKRMR-----DKVPTVEQAPAERRYNLRRPKSQDAAAPV---------KASR 1080

Query: 1507 SGKDESQKMEEGSLNRVA 1560
            S +++ Q  +E  L+ +A
Sbjct: 1081 SKENQQQVTDEAGLSSIA 1098


>ref|XP_006482303.1| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like [Citrus sinensis]
          Length = 1175

 Score =  169 bits (429), Expect = 3e-39
 Identities = 156/585 (26%), Positives = 261/585 (44%), Gaps = 36/585 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E+ +K+EK    EE+R+K ++    + ++++ E L + +E+F+  ++ +++ ++E    E
Sbjct: 556  EQTEKLEKEKLSEEERIKRDKQLAEDHIKREWEALEVAKESFKATMDHEQSMITEKAESE 615

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
               LL   E +  + ++DM  R EE++K L EKE  F++EKERE+  I+  ++I ++++E
Sbjct: 616  RRQLLHDFELQKRKLESDMLNRQEELEKDLKEKERLFEEEKERELSNINYLRDIARKEME 675

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            +MKL+                  E E   I+ DI  L    + L+EQR+ ++KE++  ++
Sbjct: 676  EMKLERLKLEKEKQEVDSHRKHLEGEQVGIRKDIDMLVGLTKMLKEQREQIVKERDRFLN 735

Query: 544  QCDQLKRLENELNI----VDCDLKQ----------------FNEAHSNTQITPFDKAGPS 663
              ++ K+ E+   I    V  DL Q                +     N++I+P D     
Sbjct: 736  FVEKQKKCEHCAEITSEFVLSDLVQEIVKSEVPPLPRVANDYVNEKKNSEISP-DVLASG 794

Query: 664  DSKASGRLSWIQRCASKLFNQSPSPGK----VSENNGEKDGNEGQ-NLISAEVVSGLEVE 828
               ++G +SW+++C SK+F  SPS       V E   E   + GQ  L  +    G   E
Sbjct: 795  SPASAGTISWLRKCTSKIFKLSPSKKDENTVVRELTEETPSSGGQTKLQESSRRLGQTNE 854

Query: 829  KEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQTNLASSD 1008
             + + A+  +        +E         +   V+ QN  +     P V   Q N   SD
Sbjct: 855  PDLSFAIVNDSFDAQRFHSETSTREVEADQHKQVDGQN--NLNGKAPEV---QENSQPSD 909

Query: 1009 ASGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQIDAVVEGGXXX 1188
             +      K       RTRS+KAVV+DAKAIL      E+++ + L    D  V+     
Sbjct: 910  LNHGRQPRKRGRPRVSRTRSVKAVVQDAKAILGEGF--ELTESENLNGNADDSVQ----- 962

Query: 1189 XXXXXXXXXXXENRREGS--DKLSSQVGRKRGRKSRVSV---EPDPEDAETQSELSIGGR 1353
                       E+R E S  DK +S+  RKR R     +   E D +D+E QS   + G+
Sbjct: 963  --------EAAESRGEPSLDDKGTSRNARKRNRAQSSQITTSEHDVDDSEAQSGSVVVGQ 1014

Query: 1354 KRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAESSGKDESQKM 1533
             R+RR+      +    TP   RYNLRR    +  A  +   KE +  +E         +
Sbjct: 1015 PRKRRQKV----DPAEQTPVPTRYNLRRPKTGAPAAAVSEPNKEKEEVSEG----VRGAL 1066

Query: 1534 EEGSLNRVADEP------QDNMSGEQPVREDGFENDERSQDIQEN 1650
            E+  +N  A  P       DN    Q VR    +N + S+   EN
Sbjct: 1067 EDEIVNSKAAPPNSVGVFSDNGRSSQLVRCGAVDNKDASKQFVEN 1111


>ref|XP_006430826.1| hypothetical protein CICLE_v10013467mg [Citrus clementina]
            gi|557532883|gb|ESR44066.1| hypothetical protein
            CICLE_v10013467mg [Citrus clementina]
          Length = 1166

 Score =  168 bits (425), Expect = 8e-39
 Identities = 154/584 (26%), Positives = 261/584 (44%), Gaps = 36/584 (6%)
 Frame = +1

Query: 7    ERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKEE 186
            E +K+EK    EE+R+K ++    + ++++ E L + +E+F+  ++ +++ ++E    E 
Sbjct: 548  ETEKLEKEKLSEEERIKRDKQLAEDHIKREWEALEVAKESFKATMDHEQSMITEKAESER 607

Query: 187  ADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIEQ 366
              LL   E +  + ++DM+ R EE++K L EKE  F++EKERE+  I+  ++I ++++E+
Sbjct: 608  RQLLHDFELQKRKLESDMQNRQEELEKDLKEKERLFEEEKERELSNINYLRDIARKEMEE 667

Query: 367  MKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIISQ 546
            MKL+                  E E   I+ DI  L    + L+EQR+ ++KE++  ++ 
Sbjct: 668  MKLERLKLEKEKQEVDSHRKHLEGEQVGIRKDIDMLVGLTKMLKEQREQIVKERDRFLNF 727

Query: 547  CDQLKRLENELNI----VDCDLKQ----------------FNEAHSNTQITPFDKAGPSD 666
             ++ K+ E+   I    V  DL Q                +     N++++P D      
Sbjct: 728  VEKQKKCEHCAEITSEFVLSDLVQEIVKSEVPPLPRVANDYVNEKKNSEMSP-DVLASGS 786

Query: 667  SKASGRLSWIQRCASKLFNQSPSP----GKVSENNGEKDGNEGQ-NLISAEVVSGLEVEK 831
              ++G +SW+++C SK+F  SPS       V E   E   + GQ  L  +    G   E 
Sbjct: 787  PASAGTISWLRKCTSKIFKLSPSKKGENTVVRELTEETPSSGGQTKLQESSRRLGQTNEP 846

Query: 832  EHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQTNLASSDA 1011
            + + A+  +        +E         +   V+ QN  +     P V   Q N   SD 
Sbjct: 847  DLSFAIVNDSFDAQRYHSETSTREVEADQHKQVDGQN--NLNGKAPEV---QENSQPSDL 901

Query: 1012 SGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQIDAVVEGGXXXX 1191
            +      K       RTRS+KAVV+DAKAIL      E+++ + L    D  V+      
Sbjct: 902  NHGRQPRKRGRPRVSRTRSVKAVVQDAKAILGEGF--ELTESENLNGNADDSVQ------ 953

Query: 1192 XXXXXXXXXXENRREGS--DKLSSQVGRKRGRKSRVSV---EPDPEDAETQSELSIGGRK 1356
                      E+R E S  DK +S+  RKR       +   E D +D+E QS   + G+ 
Sbjct: 954  -------EAAESRGEPSLDDKGTSRNARKRNHAQSSQITTSEHDVDDSEAQSGSVVVGQP 1006

Query: 1357 RQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAESSGKDESQKME 1536
            R+RR+      +    TP   RYNLRR    +  A  +   KE +  +E         +E
Sbjct: 1007 RKRRQKV----DPAEQTPVPTRYNLRRPKTGAPAAAVSEPNKEKEEVSEG----VRGALE 1058

Query: 1537 EGSLNRVADEP------QDNMSGEQPVREDGFENDERSQDIQEN 1650
            +  +N  A  P       DN    Q VR    +N++ S+   EN
Sbjct: 1059 DEIVNSKAAPPNSVGVFSDNGRSSQLVRCGAVDNNDASKQFVEN 1102


>emb|CAN74990.1| hypothetical protein VITISV_008657 [Vitis vinifera]
          Length = 1140

 Score =  167 bits (424), Expect = 1e-38
 Identities = 149/558 (26%), Positives = 258/558 (46%), Gaps = 55/558 (9%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E+R+K+EK    EE+RLK E++  ++ ++++ E+L+L +E+F   +E +++ LSE  + E
Sbjct: 542  EQREKLEKLKHSEEERLKTEKLATQDYIQREFESLKLAKESFAASMEHEQSVLSEKAQSE 601

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
            ++ ++   E    E +TD++ R EE++KQL E+E  F++E+ERE+  ++  +E+ ++++E
Sbjct: 602  KSQMIHDFELLKRELETDIQNRQEELEKQLQEREKVFEEERERELNNVNYLREVARQEME 661

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            ++KL+                  ++   E++ DI EL +   KL++QR+   KE+E  I+
Sbjct: 662  EVKLERLRIEKEKQEVAANKKHLDEHQFEMRKDIDELVSLSRKLKDQRELFSKERERFIA 721

Query: 544  QCDQLKRLEN----ELNIVDCDLKQFNEAHSNTQITPFDK-------------------- 651
              +Q K  +N        V  DL+   E   N ++ P  +                    
Sbjct: 722  FVEQQKSCKNCGEITCEFVLSDLQPLPEI-ENVEVPPLPRLADRYFKGSVQGNMAASERQ 780

Query: 652  --------AGPSDSKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEV 807
                     G     + G +S++++C SK+FN   SPGK  E          QNL  A  
Sbjct: 781  NIEMTPGIVGSGSPTSGGTISFLRKCTSKIFNL--SPGKKIEVAAI------QNLTEAPE 832

Query: 808  VSGLEVEKEHTAALPENQIHGNDDEAEV---VDNPSVHVKEAVVERQNLRHTRKSRPSVN 978
             S   + +      P  ++   +DE E    + N S  V+   ++  N     ++   ++
Sbjct: 833  PSRQAIVE------PSKRLGSTEDEPEPSFRIANDSFDVQR--IQSDNSIKEVEAGQDLS 884

Query: 979  FDQTNLAS-----------SDASGAMSKDKSKGKV-FKRTRSIKAVVEDAKAILESSVDK 1122
             D++N+ S           SD  GA  K   + K    RTRS+KAVV DAKAIL  S++ 
Sbjct: 885  IDESNIDSKALELQQHSQHSDLKGARRKPGKRSKQRIHRTRSVKAVVRDAKAILGESLEL 944

Query: 1123 EMSDGDQLKDQIDAVVEGGXXXXXXXXXXXXXXENRREGS--DKLSSQVGRKRGR---KS 1287
              ++      +  A +                 E+R E S  DK + + GRKR R     
Sbjct: 945  SENEHPNGNPEDSAHMN---------------DESRGESSFADKGTPRNGRKRQRAYTSQ 989

Query: 1288 RVSVEPDPEDAETQSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQ 1467
             +  E D +D+E +S+  +  R+ +RR+           T G  RYNLRR   T ++A  
Sbjct: 990  TMVSEQDGDDSEGRSDSVMARRQGKRRQKVPPAVQ----TLGQERYNLRRPKNTVTVAAA 1045

Query: 1468 AVST---KEDDNAAESSG 1512
              ST   K  +   + SG
Sbjct: 1046 KSSTNLHKRKETETDGSG 1063


>ref|XP_002278531.2| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like [Vitis vinifera]
          Length = 1213

 Score =  165 bits (417), Expect = 7e-38
 Identities = 153/562 (27%), Positives = 262/562 (46%), Gaps = 54/562 (9%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            +E++K+EK H  EE+RLK+E++   E ++++LE +R+E+E+F   ++       E LRK 
Sbjct: 584  DEKEKLEKLHLSEEERLKKEKLAMEEHIQRELEAVRIEKESFAAIMKH------EQLRKR 637

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
            + ++             +M+ R +E+QK+L E+E  F++E+ERE+  I+  KE+ +R+IE
Sbjct: 638  DLEI-------------EMQNRQDEIQKRLQERERAFEEERERELNNINHLKEVARREIE 684

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            +MK +                Q E    E++ DI EL     KL++QR+  +KE++  ++
Sbjct: 685  EMKTERRRIEKEKQEVLLNKRQLEGHQLEMRKDIDELGILSRKLKDQREQFIKERDRFLT 744

Query: 544  QCDQLKRLE-----------NELNIVDCDLKQFNEAHSNTQI--TPFDKAGPSD------ 666
              D+ K  +           N+L + + +++ F   +   +   +P      SD      
Sbjct: 745  FVDKHKTCKNCGEITREFVLNDLQLPEMEVEAFPLPNLADEFLNSPQGNMAASDGTNVKI 804

Query: 667  ---------SKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGL 819
                     S + GR+S++++CA+K+FN SPS  K SE+ G +   E   L+  +V    
Sbjct: 805  STGEIDLVSSGSGGRMSFLRKCATKIFNLSPS--KKSEHVGVQVLREESPLLDLQV---- 858

Query: 820  EVEKEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSR-------PSVN 978
             +EK    ++    I   +DE E    PS  +     + Q L      R        SV+
Sbjct: 859  NLEKAEGPSIVGQSI--AEDELE----PSFGIANDSFDIQQLHSDSVMREVDGGHAQSVD 912

Query: 979  FDQTNLASSDASGAMSKDKSKGKVFK------------RTRSIKAVVEDAKAILESSVDK 1122
               +N+ S +  G     +S+ K  +            RTRS+K VVEDAKA L  + + 
Sbjct: 913  -GVSNMGSKEQEGPEDSQQSELKSGRRKPGRKRRTGVHRTRSVKNVVEDAKAFLGETPEI 971

Query: 1123 EMSDGDQLKDQIDAVVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGR--KSRVS 1296
               +GD+  +      E G              E     ++K +S + RKR R   SR++
Sbjct: 972  PELNGDERPNDSTYTNEEG--------------ERETSHAEKAASTITRKRQRAPSSRIT 1017

Query: 1297 -VEPDPEDAETQSE-LSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQA 1470
              E D  D+E +S+ ++ GGR ++R+  A   Q     TPG +RYNLRRH    ++A   
Sbjct: 1018 ESEQDAADSEGRSDSVTAGGRGKRRQTVAPVVQ-----TPGEKRYNLRRHKTAGTVATAQ 1072

Query: 1471 VST---KEDDNAAESSGKDESQ 1527
             S    K D+   +    +  Q
Sbjct: 1073 ASANLPKRDEKGGDGGDDNTLQ 1094


>emb|CAN74873.1| hypothetical protein VITISV_038920 [Vitis vinifera]
          Length = 1234

 Score =  165 bits (417), Expect = 7e-38
 Identities = 145/552 (26%), Positives = 262/552 (47%), Gaps = 44/552 (7%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            +E++K+EK H  EE+RLK+E++   E ++++LE +R+E+E+F   ++ ++  LSE  + +
Sbjct: 602  DEKEKLEKLHLSEEERLKKEKLAMEEHIQRELEAVRIEKESFAAIMKHEQVTLSEKAQND 661

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
             + +L+  E    + + +M+ R +E+QK+L E+E  F++E+ERE+  I+  KE+ +R+IE
Sbjct: 662  HSQMLRDFELRKRDLEIEMQNRQDEIQKRLQERERAFEEERERELNNINHLKEVARREIE 721

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            +MK +                Q E    E++ DI EL     KL++QR+  +KE++  ++
Sbjct: 722  EMKTERRRIEKEKQEVLLNKRQLEGHQLEMRKDIDELGILSRKLKDQREQFIKERDRFLT 781

Query: 544  QCDQLKRLE-----------NELNIVDCDLKQFNEAHSNTQI--TPFDKAGPSD------ 666
              D+ K  +           N+L + + +++ F   +   +   +P      SD      
Sbjct: 782  FVDKHKTCKNCGEITREFVLNDLQLPEMEVEAFPLPNLADEFLNSPQGNMAASDGTNVKI 841

Query: 667  ---------SKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGL 819
                     S + GR+S++++CA+K+FN SPS  K SE+ G +   E   L+  +V    
Sbjct: 842  XTGEIDLVSSGSGGRMSFLRKCATKIFNLSPS--KKSEHVGVQVLREESPLLDLQV---- 895

Query: 820  EVEKEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSR-------PSVN 978
             +EK    ++    I   +DE E    PS  +     + Q L      R        SV+
Sbjct: 896  NLEKAEGPSIVGQSI--AEDELE----PSFGIANDSFDIQQLHSDSVMREVDGGHAQSVD 949

Query: 979  FDQTNLASSDASGAMSKDKSKGKVFKRT--RSIKAVVEDAKAILESSVDKEMSDGDQLKD 1152
               +N+ S +  G     +S+ K  +R   R  +  V   +++      K + +GD+  +
Sbjct: 950  -GVSNMGSKEQEGPEDSQQSELKSGRRKPGRKRRTGVHRTRSV------KNVLNGDERPN 1002

Query: 1153 QIDAVVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGR--KSRVS-VEPDPEDAE 1323
                  E G              E     ++K +S + RKR R   SR++  E D  D+E
Sbjct: 1003 DSTYTNEEG--------------ERETSHAEKAASTITRKRQRAPSSRITESEQDAADSE 1048

Query: 1324 TQSE-LSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVST---KEDD 1491
             +S+ ++ GGR ++R+  A   Q     TPG +RYNLRRH    ++A    S    K D+
Sbjct: 1049 GRSDSVTAGGRGKRRQTVAPVVQ-----TPGEKRYNLRRHKTAGTVATAQASANLPKRDE 1103

Query: 1492 NAAESSGKDESQ 1527
               +    +  Q
Sbjct: 1104 KGGDGGDDNTLQ 1115


>ref|XP_006849769.1| hypothetical protein AMTR_s00024p00252300 [Amborella trichopoda]
            gi|548853344|gb|ERN11350.1| hypothetical protein
            AMTR_s00024p00252300 [Amborella trichopoda]
          Length = 1290

 Score =  164 bits (414), Expect = 2e-37
 Identities = 156/653 (23%), Positives = 290/653 (44%), Gaps = 75/653 (11%)
 Frame = +1

Query: 7    ERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKEE 186
            E+ +  K   +EE +LK E     E+ +++ E L L++ +F  ++  +R+ + ++ R+E 
Sbjct: 614  EKDEFLKRKCEEELKLKREEQKTSEKFQREYEALELQKNSFTENMNHERSVILQNARRER 673

Query: 187  ADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIEQ 366
             D++++ E + N  ++ ++ R E+M+KQ  EKE +FQ+ +ER  ++I  ++E+ Q+++E+
Sbjct: 674  DDMIREFELQKNALESSIQNRREDMEKQFLEKERDFQEVRERMWKEIEAQRELAQKEMEE 733

Query: 367  MKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIISQ 546
            MKL+                  E E  EI+ D+ +L     KL+EQR+ L +E++ I+S+
Sbjct: 734  MKLERTKLGRERQEVALSKKHVEGERLEIQKDVEQLHILTTKLKEQREELRRERDRILSR 793

Query: 547  CDQLKRLENE-LNIVD----CDLKQFNEAHSN--------------TQITPFDKAGPS-- 663
             + LKR + + +++ D     +L+ F E  +N                +      GPS  
Sbjct: 794  IEHLKRGQGDSIDVTDGLALSELQSFKEFENNGGNLLPRLLDGYMKESMQGRSNVGPSNL 853

Query: 664  -----------DSKASGRLSWIQRCAS------------KLFNQSPSPGKVSENNGEKDG 774
                       +S +  R SW+Q+C S            ++ NQ  SP  V  +  +   
Sbjct: 854  MEETPPLGAVLNSTSPARFSWLQKCKSIFKLSPGKRLDEQVTNQEKSPSDVEADADQILE 913

Query: 775  NEGQNLISA-------EVVSGLEVEK----EHTAALPENQIHGNDDEAEVVDNPSVHVKE 921
            N+   L+S        E+  G+++ +       AA PE+   G+++E  V  + +   + 
Sbjct: 914  NDSGGLVSGGANYDEPEISVGIQISQAVDFHRRAASPESIGRGDEEETVVTPSAADGTQS 973

Query: 922  AVVERQNLRHTRKSRPSVNFDQTNLASSDASGAMSKDKSKG--KVFKRTRSIKAVVEDAK 1095
             ++E Q         PS + + ++  S+ A G   K   +G  K+ +RTRS+K VV+++K
Sbjct: 974  DMLEMQ-------EGPSASAEISH-PSAAAGGRARKKPRRGAPKLTRRTRSVKDVVKESK 1025

Query: 1096 AILESSVDKEMSDGDQLKDQIDAVVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKR 1275
            AIL  S ++  ++ ++                          E+ +   D     + +K 
Sbjct: 1026 AILGESSEELKTEEEE--------------------------ESAQANVDSKGQPIVKKG 1059

Query: 1276 GRK-----SRVSVEPDPEDAETQSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRH 1440
            GRK     +  ++    +DA++QSE    GR ++R+      Q      PG RRYNLR  
Sbjct: 1060 GRKRQHPTTSRTMSEQQQDADSQSESVTRGRSKRRQIEPSHIQ-----PPGGRRYNLRHS 1114

Query: 1441 T--------ATSSMAPQAVSTKEDDNAAESSGKDESQKMEEGSLNRV-ADEPQ----DNM 1581
            T          S      V+T  D+N ++   K   + +E  + N +  DEP     +N 
Sbjct: 1115 TLEKHVENPVGSQALASKVTTDADENHSQHVTKSPGEVVEGQTSNHIHPDEPSIESLENA 1174

Query: 1582 SGEQPVREDGFENDERSQDIQENGGESSFNCFLEVSSHGILKSETYTVSQEDE 1740
             G       G E     + +Q    ES      E S+  ++  ET    +E +
Sbjct: 1175 HG-------GGEAKTDVRMLQHTKFESIVEIHREFSTQKVIMIETGGALEETD 1220


>ref|XP_006373467.1| hypothetical protein POPTR_0017s14050g [Populus trichocarpa]
            gi|550320289|gb|ERP51264.1| hypothetical protein
            POPTR_0017s14050g [Populus trichocarpa]
          Length = 1150

 Score =  158 bits (399), Expect = 8e-36
 Identities = 146/555 (26%), Positives = 255/555 (45%), Gaps = 36/555 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E+++K EK+   EE+R++ ER      ++++LE L++ +E+FE ++E +R+ ++E  + E
Sbjct: 549  EQKEKFEKYRLSEEERIRNERKETENYIKRELEALQVAKESFEANMEHERSVMAEKAQNE 608

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
               +L  IE +  E + +++ R EEM + L EKE  F++E+ERE + I+  +++ +R++E
Sbjct: 609  RNQMLHSIEMQKTELENELQKRQEEMDRLLQEKEKLFEEEREREFKNINFLRDVARREME 668

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
             MKL+                  +++  E++ DI +L N   KL++ R+  +KEKE  I 
Sbjct: 669  DMKLERLRIEKEKQEVDEKKRHLQEQQIEMREDIDKLGNLSRKLKDHREQFIKEKERFIV 728

Query: 544  QCDQLKRLEN--EL--NIVDCDLKQFNEAHS----------NTQITPFD----------- 648
              +Q K  +N  EL    V  DL    E             N  +T  D           
Sbjct: 729  FVEQNKGCKNCGELTSEFVLSDLISSQEIEKADALPTSKLVNNHVTTDDGNPAASEKHDS 788

Query: 649  KAGPSDSKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGLEVE 828
            +  P+ + +   +SW+++C SK+     S GK  E    ++  +G  L S E V+  E+ 
Sbjct: 789  EMSPTLAHSVSPVSWLRKCTSKILKF--SAGKRIEPAALQNLTDGTPL-SGEQVNAEEMS 845

Query: 829  K-----EHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQTN 993
            K     E+   L    ++ + D   V+ + S+   EA  +      +  +  +    + +
Sbjct: 846  KRLDFTENEPELSFAIVNDSLDAQRVLSDTSIREVEAGHDLSINDQSNNNGTAPEIQEDS 905

Query: 994  LASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVD-KEMSDGDQLKDQIDAVV 1170
              S        + + + +V  RTRS+K VV+DAKA+L  +++  E  D   LK       
Sbjct: 906  QPSGLKHDPQPRKRGRPRV-SRTRSVKEVVQDAKALLGGALELNEAEDSGHLKS------ 958

Query: 1171 EGGXXXXXXXXXXXXXXENRREGS--DKLSSQVGRKRGR--KSRVSV-EPDPEDAETQSE 1335
                             E+R E S  DK   +  RKR R   S++SV +   +D+E  S+
Sbjct: 959  -----------------ESRDESSLADKGGPRNARKRNRTQTSQISVSDRYGDDSEGHSD 1001

Query: 1336 LSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAESSGK 1515
                G +R+RR+  V  Q     T G  +YNLRR      +A   V    + N  +    
Sbjct: 1002 SVTAGDRRKRRQKVVPNQ-----TQGQTQYNLRRREL--GVAVVTVKASSNLNNEKEKED 1054

Query: 1516 DESQKMEEGSLNRVA 1560
            D     ++G+L R A
Sbjct: 1055 DGVSSPQDGNLLRSA 1069


>ref|XP_002329317.1| predicted protein [Populus trichocarpa]
            gi|566213280|ref|XP_006373468.1| nuclear matrix
            constituent protein 1 [Populus trichocarpa]
            gi|550320290|gb|ERP51265.1| nuclear matrix constituent
            protein 1 [Populus trichocarpa]
          Length = 1156

 Score =  158 bits (399), Expect = 8e-36
 Identities = 146/555 (26%), Positives = 255/555 (45%), Gaps = 36/555 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E+++K EK+   EE+R++ ER      ++++LE L++ +E+FE ++E +R+ ++E  + E
Sbjct: 555  EQKEKFEKYRLSEEERIRNERKETENYIKRELEALQVAKESFEANMEHERSVMAEKAQNE 614

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
               +L  IE +  E + +++ R EEM + L EKE  F++E+ERE + I+  +++ +R++E
Sbjct: 615  RNQMLHSIEMQKTELENELQKRQEEMDRLLQEKEKLFEEEREREFKNINFLRDVARREME 674

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
             MKL+                  +++  E++ DI +L N   KL++ R+  +KEKE  I 
Sbjct: 675  DMKLERLRIEKEKQEVDEKKRHLQEQQIEMREDIDKLGNLSRKLKDHREQFIKEKERFIV 734

Query: 544  QCDQLKRLEN--EL--NIVDCDLKQFNEAHS----------NTQITPFD----------- 648
              +Q K  +N  EL    V  DL    E             N  +T  D           
Sbjct: 735  FVEQNKGCKNCGELTSEFVLSDLISSQEIEKADALPTSKLVNNHVTTDDGNPAASEKHDS 794

Query: 649  KAGPSDSKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGLEVE 828
            +  P+ + +   +SW+++C SK+     S GK  E    ++  +G  L S E V+  E+ 
Sbjct: 795  EMSPTLAHSVSPVSWLRKCTSKILKF--SAGKRIEPAALQNLTDGTPL-SGEQVNAEEMS 851

Query: 829  K-----EHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQTN 993
            K     E+   L    ++ + D   V+ + S+   EA  +      +  +  +    + +
Sbjct: 852  KRLDFTENEPELSFAIVNDSLDAQRVLSDTSIREVEAGHDLSINDQSNNNGTAPEIQEDS 911

Query: 994  LASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVD-KEMSDGDQLKDQIDAVV 1170
              S        + + + +V  RTRS+K VV+DAKA+L  +++  E  D   LK       
Sbjct: 912  QPSGLKHDPQPRKRGRPRV-SRTRSVKEVVQDAKALLGGALELNEAEDSGHLKS------ 964

Query: 1171 EGGXXXXXXXXXXXXXXENRREGS--DKLSSQVGRKRGR--KSRVSV-EPDPEDAETQSE 1335
                             E+R E S  DK   +  RKR R   S++SV +   +D+E  S+
Sbjct: 965  -----------------ESRDESSLADKGGPRNARKRNRTQTSQISVSDRYGDDSEGHSD 1007

Query: 1336 LSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAESSGK 1515
                G +R+RR+  V  Q     T G  +YNLRR      +A   V    + N  +    
Sbjct: 1008 SVTAGDRRKRRQKVVPNQ-----TQGQTQYNLRRREL--GVAVVTVKASSNLNNEKEKED 1060

Query: 1516 DESQKMEEGSLNRVA 1560
            D     ++G+L R A
Sbjct: 1061 DGVSSPQDGNLLRSA 1075


>gb|EOY04287.1| Nuclear matrix constituent protein 1-like protein, putative isoform 2
            [Theobroma cacao]
          Length = 1102

 Score =  156 bits (394), Expect = 3e-35
 Identities = 149/594 (25%), Positives = 264/594 (44%), Gaps = 39/594 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            ++ +K EK    EE+RLK E+    + ++++L+ L + +E F   +E +++ ++E    E
Sbjct: 553  QQTEKFEKQKLAEEERLKNEKQVAEDYIKRELDALEVAKETFAATMEHEQSVIAEKAESE 612

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
             +  L  +E +  + ++DM+ R EEM+K+L E +  F++EKERE+ KI+  +E+ +R++E
Sbjct: 613  RSQRLHDLELQKRKLESDMQNRFEEMEKELGESKKSFEEEKERELDKINHLREVARRELE 672

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            ++K +                  E +  EI+ DI +L +  +KL++QR+  +KE+   IS
Sbjct: 673  ELKQERLKIEKEEQEVNASKMHLEGQQIEIRKDIDDLVDISKKLKDQREHFIKERNRFIS 732

Query: 544  ------QCDQLKRLENELNIVDCDLKQFNE------------------AHSNTQITPFDK 651
                   C     + +E  + D    Q  E                  A  N  ++   K
Sbjct: 733  FVEKHKSCKNCGEMTSEFMLSDLQSLQKIEDEEVLPLPSLADDYISGNAFRNLAVSKRQK 792

Query: 652  ------AGPSDSKASGRLSWIQRCASKLFNQSPS----PGKVSENNGEKDGNEGQNLISA 801
                   G     + G +SW+++C SK+F  SP     P  V++ N E   + GQ  ++ 
Sbjct: 793  DEISPPVGSGSPVSGGTMSWLRKCTSKIFKLSPGKNIEPHAVTKLNVEAPLSGGQ--VNM 850

Query: 802  EVVSGLEVEKEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNF 981
            E +S +E E E + A     +  +  +++         ++  ++ Q+   +++    V  
Sbjct: 851  EGMSNVEHEPELSIAAATESLDVHRVQSDTSTRDVDAGQDLSIDNQSNIDSKELE--VLG 908

Query: 982  DQTNLASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQID 1161
            D  N  S    G   + + + +V KRTRS+KAVV+DA+AI+  ++  E ++ +     +D
Sbjct: 909  DSQN--SDFNRGNQLRKRGRPRV-KRTRSVKAVVKDAEAIIGKAL--ESNELEHPNGNLD 963

Query: 1162 AVVEGGXXXXXXXXXXXXXXENRREGS--DKLSSQVGRKRGR---KSRVSVEPDPEDAET 1326
            +                   E+R E    D  +S+  RKR R     +   E D  D+  
Sbjct: 964  S--------------GHANAESRDESGLFDGGTSRNARKRNRAQTSQKTESEQDGVDS-G 1008

Query: 1327 QSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAES 1506
             S+  + G++R+RR+  V        TPG  RYNLRR     ++A     T  D N    
Sbjct: 1009 HSDSIVAGQQRKRRQKVV----LAMPTPGEARYNLRRPKTGVTVA----KTTSDVNRENE 1060

Query: 1507 SGKDESQKMEEGSLNRVADEPQDNMSGEQPVREDGFENDERSQDIQENGGESSF 1668
              KD             A +  +      PV E+G        D  ENGG + F
Sbjct: 1061 GAKD-------------AGDQVNYSKAPMPVSENG--------DASENGGSAHF 1093


>gb|EOY04286.1| Nuclear matrix constituent protein 1-like protein, putative isoform 1
            [Theobroma cacao]
          Length = 1177

 Score =  156 bits (394), Expect = 3e-35
 Identities = 149/594 (25%), Positives = 264/594 (44%), Gaps = 39/594 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            ++ +K EK    EE+RLK E+    + ++++L+ L + +E F   +E +++ ++E    E
Sbjct: 553  QQTEKFEKQKLAEEERLKNEKQVAEDYIKRELDALEVAKETFAATMEHEQSVIAEKAESE 612

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
             +  L  +E +  + ++DM+ R EEM+K+L E +  F++EKERE+ KI+  +E+ +R++E
Sbjct: 613  RSQRLHDLELQKRKLESDMQNRFEEMEKELGESKKSFEEEKERELDKINHLREVARRELE 672

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            ++K +                  E +  EI+ DI +L +  +KL++QR+  +KE+   IS
Sbjct: 673  ELKQERLKIEKEEQEVNASKMHLEGQQIEIRKDIDDLVDISKKLKDQREHFIKERNRFIS 732

Query: 544  ------QCDQLKRLENELNIVDCDLKQFNE------------------AHSNTQITPFDK 651
                   C     + +E  + D    Q  E                  A  N  ++   K
Sbjct: 733  FVEKHKSCKNCGEMTSEFMLSDLQSLQKIEDEEVLPLPSLADDYISGNAFRNLAVSKRQK 792

Query: 652  ------AGPSDSKASGRLSWIQRCASKLFNQSPS----PGKVSENNGEKDGNEGQNLISA 801
                   G     + G +SW+++C SK+F  SP     P  V++ N E   + GQ  ++ 
Sbjct: 793  DEISPPVGSGSPVSGGTMSWLRKCTSKIFKLSPGKNIEPHAVTKLNVEAPLSGGQ--VNM 850

Query: 802  EVVSGLEVEKEHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNF 981
            E +S +E E E + A     +  +  +++         ++  ++ Q+   +++    V  
Sbjct: 851  EGMSNVEHEPELSIAAATESLDVHRVQSDTSTRDVDAGQDLSIDNQSNIDSKELE--VLG 908

Query: 982  DQTNLASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQID 1161
            D  N  S    G   + + + +V KRTRS+KAVV+DA+AI+  ++  E ++ +     +D
Sbjct: 909  DSQN--SDFNRGNQLRKRGRPRV-KRTRSVKAVVKDAEAIIGKAL--ESNELEHPNGNLD 963

Query: 1162 AVVEGGXXXXXXXXXXXXXXENRREGS--DKLSSQVGRKRGR---KSRVSVEPDPEDAET 1326
            +                   E+R E    D  +S+  RKR R     +   E D  D+  
Sbjct: 964  S--------------GHANAESRDESGLFDGGTSRNARKRNRAQTSQKTESEQDGVDS-G 1008

Query: 1327 QSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAES 1506
             S+  + G++R+RR+  V        TPG  RYNLRR     ++A     T  D N    
Sbjct: 1009 HSDSIVAGQQRKRRQKVV----LAMPTPGEARYNLRRPKTGVTVA----KTTSDVNRENE 1060

Query: 1507 SGKDESQKMEEGSLNRVADEPQDNMSGEQPVREDGFENDERSQDIQENGGESSF 1668
              KD             A +  +      PV E+G        D  ENGG + F
Sbjct: 1061 GAKD-------------AGDQVNYSKAPMPVSENG--------DASENGGSAHF 1093


>ref|XP_004169820.1| PREDICTED: LOW QUALITY PROTEIN: putative nuclear matrix constituent
            protein 1-like protein-like [Cucumis sativus]
          Length = 1204

 Score =  155 bits (393), Expect = 4e-35
 Identities = 157/656 (23%), Positives = 271/656 (41%), Gaps = 74/656 (11%)
 Frame = +1

Query: 7    ERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKEE 186
            ++++ EK    EE+RLK ER+     + ++ E L+L +E+F   +E +++ ++E  + + 
Sbjct: 569  QKEEFEKRIFSEEERLKSERLETEAYIHREQENLKLAQESFAASMEHEKSAIAEKAQSDR 628

Query: 187  ADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIEQ 366
            + ++   + +  E ++ M+ R EEM++   EK+  F++EKERE++ I   +++ +R++++
Sbjct: 629  SQMMHDFDLQKRELESAMQNRVEEMERGFREKDKLFKEEKERELENIKFLRDVARREMDE 688

Query: 367  MKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS- 543
            +KL+                  E++  EI+ DI EL     KL++QR+ L+ E++  IS 
Sbjct: 689  LKLERLKTEKERQEAEANKEHLERQRIEIRKDIEELLELSNKLKDQRERLVAERDRFISY 748

Query: 544  -----QCDQLKRLENELNIVDCD-LKQFNEAH--------------SNTQITPFDKAGPS 663
                  C     + +E  + D   L  F  A                  Q++P    G S
Sbjct: 749  VDKHVTCKNCGEIASEFVLSDLQYLDGFENADVLNLPGLPDKYMEIQGLQVSPGGNLGIS 808

Query: 664  DSK---------------ASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLIS 798
            D K               ++G +SW+++C SK+F  SP   K+     EK  +E      
Sbjct: 809  DVKNGELTPGGAGQKSPISAGTISWLRKCTSKIFKFSPGK-KIVSPAFEKQDDEA----- 862

Query: 799  AEVVSGLEVEKEH-TAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHT---RKSR 966
                    V  EH   A P  ++   +DE E+    S+ +    ++ + ++     R   
Sbjct: 863  -------PVSDEHDDLAEPSKRMSVGEDEVEL----SLAIASDSLDDRRIQSDVSGRDVE 911

Query: 967  PSVNF---DQTNLASSDASGAMSKDKSKGKVFK-----------RTRSIKAVVEDAKAIL 1104
            PS N    +Q+N+ S     A+    S  +  K           RTRS+KAVVEDAKAI+
Sbjct: 912  PSQNLSIDNQSNIVSKAPEVAVDSQPSDVREIKXRPKRGKPKINRTRSVKAVVEDAKAII 971

Query: 1105 ESSVDKEMSDGDQLKDQIDAVVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGRK 1284
                        +L+    A    G              E+   G     +   R R   
Sbjct: 972  -----------GELQPTQQAEYPNGNAEDSSQLNNESRDESSLAGKGTQRNLRKRTRANS 1020

Query: 1285 SRVSVEPDPEDAETQSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAP 1464
            S++  E D +D+E +S   + G+ R+RR+ A             +RYNLRR    +S  P
Sbjct: 1021 SQIMGENDHDDSEVRSGSVVEGQPRKRRQRAAPAVRA-----PEKRYNLRRKVVGASKEP 1075

Query: 1465 QAVSTKEDDNAAESSGKDE---------SQKMEEGSLN----------RVADEPQDNMSG 1587
              +S KE +     + ++E         +  M   S N           V D   D ++G
Sbjct: 1076 SNIS-KEHEEVGTVNRREEDVHYSKVRPTPSMGVASDNAGSAHLVRCGTVQDNQDDGVAG 1134

Query: 1588 EQPVREDGFENDERSQDIQENGGESSFNCFLEVSSHGILKSET-YTVSQEDEARSE 1752
               +  D     E      EN G        +   HG  +SE+   V  ED+   E
Sbjct: 1135 TSKISIDMVSQSEEVNGSPENAG--------KYEDHGEYRSESCEEVGNEDDDDDE 1182


>ref|XP_002525969.1| DNA double-strand break repair rad50 ATPase, putative [Ricinus
            communis] gi|223534701|gb|EEF36393.1| DNA double-strand
            break repair rad50 ATPase, putative [Ricinus communis]
          Length = 1163

 Score =  155 bits (391), Expect = 7e-35
 Identities = 136/520 (26%), Positives = 242/520 (46%), Gaps = 42/520 (8%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            E+R+K EK    EE+R+K E+    + V ++ E L + +E+FE ++E +R+ L+E    E
Sbjct: 562  EQREKFEKQKASEEERIKHEKQNVEDYVIREREALEIAKESFEANMEHERSALAEKALSE 621

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
               +L + E + +E   D++++ E M+K L EKE  F++EKERE++ I+  +++ +R++E
Sbjct: 622  RQQMLHEFELQKSELGNDLQIKQEGMEKVLQEKEKLFEEEKERELKNINFLRDLARREME 681

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            +MK +                  +++  E+++DI +L +  +KL++ R+  +KEKE  I 
Sbjct: 682  EMKFERLRIEKERQEIEENKKHLQEQQLEMRDDIDKLGDLSKKLKDHREQFVKEKERFIL 741

Query: 544  QCDQLKRLEN--------------------------ELNIVDCDLKQFNEAHSNTQITPF 645
              +Q K  +N                             ++       N+  + T +   
Sbjct: 742  FVEQHKSCKNCGEITSEFVLSDLISSQEIEKAVLLPNQGLIQSATGNCNQNLAATAVQDN 801

Query: 646  DKAGPSDSKASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGLEV 825
            D   PS  +++  +SW+++C SK+F+ SP          + +    QNL +  +    E 
Sbjct: 802  D-ISPSAGRSASPVSWLRKCTSKIFSFSPG--------NKMEPAAVQNLTAPLLAEDREE 852

Query: 826  EKEH---TAALPENQIH-GND--DEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQ 987
              +    TA  PE     GND  D   +  + S+   EAV +      +  +  ++   +
Sbjct: 853  PSKRLDFTAHEPELSFTIGNDSLDVQRIQSDSSIREAEAVQDFSIDDKSNINNEAIQVPE 912

Query: 988  TNLASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAILESSVD--KEMSDGDQLKDQID 1161
                S+   G     + + +V  RTRS+KAVV+DAKAIL  S++   E  D   LK    
Sbjct: 913  GTQPSNVKLGRQIHKRGRPRV-SRTRSMKAVVQDAKAILGESLELNTETEDSSHLK---- 967

Query: 1162 AVVEGGXXXXXXXXXXXXXXENRREG--SDKLSSQVGRKR--GRKSRVSV----EPDPED 1317
                                E+R E   +D+  S+  RKR   R S+ +V    + D ++
Sbjct: 968  -------------------AESRGESNLADEKISRNARKRKSTRASQNTVSEHGDGDGDE 1008

Query: 1318 AETQSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRR 1437
            +E  S+    G++R+R++     Q     TPG +RYNLRR
Sbjct: 1009 SEGHSDSITAGKRRKRQQKVAIVQ-----TPGEKRYNLRR 1043


>ref|XP_004141494.1| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like [Cucumis sativus]
          Length = 1205

 Score =  154 bits (389), Expect = 1e-34
 Identities = 157/658 (23%), Positives = 269/658 (40%), Gaps = 76/658 (11%)
 Frame = +1

Query: 7    ERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKEE 186
            ++++ EK    EE+RLK ER+     + ++ E L+L +E+F   +E +++ ++E  + + 
Sbjct: 569  QKEEFEKRIFSEEERLKSERLETEAYIHREQENLKLAQESFAASMEHEKSAIAEKAQSDR 628

Query: 187  ADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIEQ 366
            + ++   + +  E ++ M+ R EEM++   EK+  F++EKERE++ I   +++ +R++++
Sbjct: 629  SQMMHDFDLQKRELESAMQNRVEEMERGFREKDKLFKEEKERELENIKFLRDVARREMDE 688

Query: 367  MKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIISQ 546
            +KL+                  E++  EI+ DI EL     KL++QR+ L+ E++  IS 
Sbjct: 689  LKLERLKTEKERQEAEANKEHLERQRIEIRKDIEELLELSNKLKDQRERLVAERDRFISY 748

Query: 547  CDQ---------------------LKRLENE--LNIVDCDLKQFNEAHSNTQITPFDKAG 657
             D+                     L   EN   LN+     K          ++P    G
Sbjct: 749  VDKHVTCKNCGEIASEFVLSDLQYLDGFENADVLNLPGLPDKYMEIQGLQVSVSPGGNLG 808

Query: 658  PSDSK---------------ASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNL 792
             SD K               ++G +SW+++C SK+F  SP   K+     EK  +E    
Sbjct: 809  ISDVKNGELTPGGAGQKSPISAGTISWLRKCTSKIFKFSPGK-KIVSPAFEKQDDEAP-- 865

Query: 793  ISAEVVSGLEVEKEHT-AALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSR- 966
                      V  EH   A P  ++   +DE E+    S+ +    ++ + ++     R 
Sbjct: 866  ----------VSDEHDDLAEPSKRMSVGEDEVEL----SLAIASDSLDDRRIQSDVSGRD 911

Query: 967  --PSVNF---DQTNLAS-----------SDASGAMSKDKSKGKVFKRTRSIKAVVEDAKA 1098
              PS N    +Q+N+ S           SD      + K       RTRS+KAVVEDAKA
Sbjct: 912  VEPSQNLSIDNQSNIVSKVPEVAVDSQPSDVRENKKRPKRGKPKINRTRSVKAVVEDAKA 971

Query: 1099 ILESSVDKEMSDGDQLKDQIDAVVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRG 1278
            I+            +L+    A    G              E+   G     +   R R 
Sbjct: 972  II-----------GELQPTQQAEYPNGNAEDSSQLNNESRDESSLAGKGTQRNLRKRTRA 1020

Query: 1279 RKSRVSVEPDPEDAETQSELSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSM 1458
              S++  E D +D+E +S   + G+ R+RR+ A             +RYNLRR    +S 
Sbjct: 1021 NSSQIMGENDHDDSEVRSGSVVEGQPRKRRQRAAPAVRA-----PEKRYNLRRKVVGASK 1075

Query: 1459 APQAVSTKEDDNAAESSGKDE---------SQKMEEGSLN----------RVADEPQDNM 1581
             P  +S KE +     + ++E         +  M   S N           V D   D +
Sbjct: 1076 EPSNIS-KEHEEVGTVNRREEDVHYSRVRPTPSMGVASDNAGSAHLVRCGTVQDNQDDGV 1134

Query: 1582 SGEQPVREDGFENDERSQDIQENGGESSFNCFLEVSSHGILKSET-YTVSQEDEARSE 1752
            +G   +  D     E      EN G        +   HG  +SE+   V  ED+   E
Sbjct: 1135 AGTSKISIDMVSQSEEVNGSPENAG--------KYEDHGEYRSESCEEVGNEDDDDDE 1184


>ref|XP_003520054.1| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like isoform X1 [Glycine max]
          Length = 1210

 Score =  153 bits (386), Expect = 3e-34
 Identities = 145/556 (26%), Positives = 255/556 (45%), Gaps = 41/556 (7%)
 Frame = +1

Query: 7    ERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKEE 186
            E++ + K+   EE+RLK E+ + ++ ++K+LE L  E+E+F   ++ ++  LSE ++ E+
Sbjct: 575  EKESLRKFQNSEEERLKSEKQHMQDHIKKELEMLESEKESFRDSMKQEKHLLSEKVKNEK 634

Query: 187  ADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIEQ 366
            A +LQ  E +    + +++ R EEM+K L E+E  FQ+E +RE+  I+  K++ +++ E+
Sbjct: 635  AQMLQDFELKMRNLENEIQKRQEEMEKDLQERERNFQEEMQRELDNINNLKDVTEKEWEE 694

Query: 367  MKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIISQ 546
            +K +                Q +    E+  D   L N   K++++R+ L+ E++  +  
Sbjct: 695  VKAEGIRLENERKVLESNKQQLKSGQHEMHEDSEMLMNLSRKVKKERERLVAERKHFLEL 754

Query: 547  CDQLK------RLENELNIVDCDLKQFNE-----------AHSNTQITPFDKAGPSDSKA 675
             ++L+       +  +  + D  L  F E            + N      D    S+   
Sbjct: 755  VEKLRSCKGCGEVVRDFVVSDIQLPDFKERVAIPSPISPVLNDNPPKNSQDNIAASEFNI 814

Query: 676  SGR---LSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGLEVEKEHTAA 846
            SG    +SW+++C +K+FN SPS  K ++  G  D   G + +S    S   +++E   +
Sbjct: 815  SGSVKPVSWLRKCTTKIFNLSPS--KRADAVGALD-MPGTSPLSDVNFSVENIDEELPTS 871

Query: 847  LPENQIHGNDDEAEVV--------DNP---SVHVKEAVVERQNLRHTRKSRPS--VNFDQ 987
            LP        DE +          D P   S ++ + V +  +L     SR    V+ D 
Sbjct: 872  LPNIGARVIFDERQPAGGMAHHSSDTPHLQSDNIGKEVGDEYSLSVGDHSRVDSFVDGDP 931

Query: 988  TNLASSDASGAMSKDKSKGKV-FKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQIDA 1164
             +   S       K   K K    RTRS+KAVVE+AK  L     K++ +        D 
Sbjct: 932  GDSQQSVPKLGRRKPGRKSKSGIARTRSVKAVVEEAKEFL-GKAPKKIENASLQSLNTDH 990

Query: 1165 VVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVG-----RKRGRKSRVS-VEPDPEDAET 1326
            +                  E+ RE S      +G     R+R + SR++  E +  D+E 
Sbjct: 991  I-----------------REDSREDSSHTEKAIGNTRRKRQRAQTSRITESEQNAGDSEG 1033

Query: 1327 QSE-LSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAE 1503
            QS+ ++ GGR+++R+  A   Q T     G +RYNLRRH     +A +  ST+   NA +
Sbjct: 1034 QSDSITAGGRRKKRQTVAPLTQVT-----GEKRYNLRRH----KIAGKDSSTQNISNATK 1084

Query: 1504 SSGKDESQKMEEGSLN 1551
            S  K+ +    EG  N
Sbjct: 1085 SVEKEAAAGKLEGDKN 1100


>ref|XP_006574886.1| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like isoform X2 [Glycine max]
          Length = 1211

 Score =  152 bits (385), Expect = 3e-34
 Identities = 145/556 (26%), Positives = 255/556 (45%), Gaps = 41/556 (7%)
 Frame = +1

Query: 7    ERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKEE 186
            E++ + K+   EE+RLK E+ + ++ ++K+LE L  E+E+F   ++ ++  LSE ++ E+
Sbjct: 575  EKESLRKFQNSEEERLKSEKQHMQDHIKKELEMLESEKESFRDSMKQEKHLLSEKVKNEK 634

Query: 187  ADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIEQ 366
            A +LQ  E +    + +++ R EEM+K L E+E  FQ+E +RE+  I+  K++ +++ E+
Sbjct: 635  AQMLQDFELKMRNLENEIQKRQEEMEKDLQERERNFQEEMQRELDNINNLKDVTEKEWEE 694

Query: 367  MKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIISQ 546
            +K +                Q +    E+  D   L N   K++++R+ L+ E++  +  
Sbjct: 695  VKAEGIRLENERKVLESNKQQLKSGQHEMHEDSEMLMNLSRKVKKERERLVAERKHFLEL 754

Query: 547  CDQLK------RLENELNIVDCDLKQFNE-----------AHSNTQITPFDKAGPSDSKA 675
             ++L+       +  +  + D  L  F E            + N      D    S+   
Sbjct: 755  VEKLRSCKGCGEVVRDFVVSDIQLPDFKERVAIPSPISPVLNDNPPKNSQDNIAASEFNI 814

Query: 676  SGR---LSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGLEVEKEHTAA 846
            SG    +SW+++C +K+FN SPS  K ++  G  D   G + +S    S   +++E   +
Sbjct: 815  SGSVKPVSWLRKCTTKIFNLSPS--KRADAVGALD-MPGTSPLSDVNFSVENIDEELPTS 871

Query: 847  LPENQIHGNDDEAEVV--------DNP---SVHVKEAVVERQNLRHTRKSRPS--VNFDQ 987
            LP        DE +          D P   S ++ + V +  +L     SR    V+ D 
Sbjct: 872  LPNIGARVIFDERQPAGGMAHHSSDTPHLQSDNIGKEVGDEYSLSVGDHSRVDSFVDGDP 931

Query: 988  TNLASSDASGAMSKDKSKGKV-FKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQIDA 1164
             +   S       K   K K    RTRS+KAVVE+AK  L     K++ +        D 
Sbjct: 932  GDSQQSVPKLGRRKPGRKSKSGIARTRSVKAVVEEAKEFL-GKAPKKIENASLQSLNTDH 990

Query: 1165 VVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVG-----RKRGRKSRVS-VEPDPEDAET 1326
            +                  E+ RE S      +G     R+R + SR++  E +  D+E 
Sbjct: 991  I-----------------REDSREDSSHTEKAIGNTRRKRQRAQTSRITESEQNAGDSEG 1033

Query: 1327 QSE-LSIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAE 1503
            QS+ ++ GGR+++R+  A   Q T     G +RYNLRRH  +   A +  ST+   NA +
Sbjct: 1034 QSDSITAGGRRKKRQTVAPLTQVT-----GEKRYNLRRHKIS---AGKDSSTQNISNATK 1085

Query: 1504 SSGKDESQKMEEGSLN 1551
            S  K+ +    EG  N
Sbjct: 1086 SVEKEAAAGKLEGDKN 1101


>gb|EOY02173.1| Nuclear matrix constituent protein-related, putative isoform 3
            [Theobroma cacao]
          Length = 1080

 Score =  152 bits (383), Expect = 6e-34
 Identities = 140/525 (26%), Positives = 241/525 (45%), Gaps = 35/525 (6%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            EE+ K EK+   EE+RLK+E    R+ V +++E++RL++E+FE  ++ +++ L E  + E
Sbjct: 589  EEKDKFEKFRHSEEERLKKEESAMRDYVCREMESIRLQKESFEASMKHEKSVLLEEAQNE 648

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
               +LQ  E +    +TD++ R ++ QK L E+ + F++ KERE+  +   KE V+R++E
Sbjct: 649  HIKMLQDFELQKMNLETDLQNRFDQKQKDLQERIVAFEEVKERELANMRCSKEDVEREME 708

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            +++                  +  ++  E++ DI EL     +L++QR+  ++E+   + 
Sbjct: 709  EIRSARLAVEREKQEVAINRDKLNEQQQEMRKDIDELGILSSRLKDQREHFIRERHSFLE 768

Query: 544  QCDQLKRLENELNIV-DCDLKQFNEAH-SNTQITPFDK------------AGPSDSK--- 672
              ++LK  +    I  D  L  F      + +I P  +             G S  K   
Sbjct: 769  FVEKLKSCKTCGEITRDFVLSNFQLPDVEDREIVPLPRLADELIRNHQGYLGASGVKNIK 828

Query: 673  -----------ASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGL 819
                       ++GR+SW+++C +K+F  S SP K +E+  E  G     L + E    +
Sbjct: 829  RSPEAYSQYPESAGRMSWLRKCTTKIF--SISPTKRNESKAEGPG----ELTNKEAGGNI 882

Query: 820  -EVEKEHTAALPENQIHG---NDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQ 987
             E   E +  +P + I+      D+   VD+ S           +L H+          +
Sbjct: 883  HEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS---------GPSLDHSYTDSKVQEVPE 933

Query: 988  TNLASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAIL-ESSVDKEMSDGDQLKDQIDA 1164
             +  S   SG     +       RTRS+KAVVEDAK  L ES  + E S+  Q  D   A
Sbjct: 934  DSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGESPEEPEPSESVQPDDISHA 993

Query: 1165 -VVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGRKSRVS-VEPDPEDAETQSEL 1338
              V  G              ENR   + +      R+R + S+++  E D  D+E +S+ 
Sbjct: 994  NEVSAG---------VSTHSENRARNNAR-----KRRRPQDSKITDTELDAADSEGRSDS 1039

Query: 1339 SIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAV 1473
               G +R+R+++A  G      TPG +RYNLRR    S  +P  +
Sbjct: 1040 VTTGGQRKRQQTAAQGLQ----TPGEKRYNLRRPKLHSQGSPSLI 1080


>ref|XP_006300299.1| hypothetical protein CARUB_v10019693mg [Capsella rubella]
            gi|482569009|gb|EOA33197.1| hypothetical protein
            CARUB_v10019693mg [Capsella rubella]
          Length = 1130

 Score =  151 bits (381), Expect = 1e-33
 Identities = 146/620 (23%), Positives = 277/620 (44%), Gaps = 36/620 (5%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            ++++K+E+ +  EE+RLK+E+    E ++++LE L + + +F   +E +R+ LS+    E
Sbjct: 544  DQKEKLERQNHLEEERLKKEKQAANENMQRELEALEVAKASFAETMEHERSMLSKKAESE 603

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
             + LL +IE    + ++DM+ + EE +++L  KE  F++E+E+++  I+  ++I  +++ 
Sbjct: 604  RSQLLHEIEMRNGKLESDMQAKLEERERELQAKEKLFEEEREKDLSNINYLRDIASKEMA 663

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
             MK +                  E++  EI+ D+ +L    +KL+EQR+  + E+   +S
Sbjct: 664  DMKNERHRIVKEKLEVDASKNHLEEQQTEIRKDVEDLVALTKKLKEQREQFISERSRFLS 723

Query: 544  ------QCDQLKRLENELNIVDCDLKQF-----------NEA--HSNTQITPFDKAGPSD 666
                   C+    L +EL + + D  +            NE        I+P   AG   
Sbjct: 724  SMESNRNCNPCGELLHELVLPEIDNVEMPNMSKLANILDNEVPRQEIRDISP-TAAGLGL 782

Query: 667  SKASGRLSWIQRCASKLFNQSP---SPGKVSENNGEKDG-------NEGQNLISAEVVSG 816
              A G +SW+++C SK+   SP   +   V+ N  +++        N G +     V + 
Sbjct: 783  PVAGGTVSWLRKCTSKILKLSPIKMAEPSVTWNLADQEQPADQANVNSGPSSTPQAVTNS 842

Query: 817  LEVEK-EHTAALPENQIHGNDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQTN 993
             +V+K E      E ++   + +    D  +++ K   V   +L                
Sbjct: 843  FDVQKAESETGTKEVEVTNVNSDG---DQSNINSKAQEVASDSL---------------- 883

Query: 994  LASSDASGAMSKDKSKGKV-FKRTRSIKAVVEDAKAILESSVDKEMSDGDQLKDQIDAVV 1170
              S+  +   S+ + K K   +RTRS+K VVEDAKAI   S+D  + + +   + I+A  
Sbjct: 884  --SNQNADGQSRMRGKAKARTRRTRSVKDVVEDAKAIYGESID--LCEPNDSTENIEA-- 937

Query: 1171 EGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGRKSRV---SVEPDPEDAETQSELS 1341
                             E  R  SD+ +S+ GRKRGR   +   + E D  +++ +S+  
Sbjct: 938  -----------NDGSMGEPGR--SDRATSKNGRKRGRVGSLRTCTTEQDGNESDGKSDSV 984

Query: 1342 IGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRHTATSSMAPQAVSTKEDDNAAESSGKDE 1521
             GG ++++R+  V  +  G      +RYNLRR    +     +    E   A +  G   
Sbjct: 985  TGGAQQRKRRQKVASEQQGEVV--GQRYNLRRPRRVTGETTLSKKHNETSGAQQDEGVYC 1042

Query: 1522 SQKMEEGSLNRVADEPQDNMSGEQPVREDGFENDERSQDIQENGGESSFNCFLEVSSHGI 1701
            +Q   E S+     +  + +S      ED  ++ +      +  GES      E  S  +
Sbjct: 1043 AQTTVEASVGVAVSD--NGVSTNVVQHEDTADSQDTDAGSPKRTGES------EAMSEDV 1094

Query: 1702 LKSETYTVS--QEDEARSEN 1755
             K+     S  ++DE+ +E+
Sbjct: 1095 HKTPQRADSDGEDDESDAEH 1114


>gb|EOY02176.1| Nuclear matrix constituent protein-related, putative isoform 6
            [Theobroma cacao]
          Length = 1179

 Score =  150 bits (380), Expect = 1e-33
 Identities = 144/549 (26%), Positives = 252/549 (45%), Gaps = 41/549 (7%)
 Frame = +1

Query: 4    EERKKIEKWHTDEEKRLKEERIYHREQVEKDLETLRLEREAFERHVESDRAQLSESLRKE 183
            EE+ K EK+   EE+RLK+E    R+ V +++E++RL++E+FE  ++ +++ L E  + E
Sbjct: 570  EEKDKFEKFRHSEEERLKKEESAMRDYVCREMESIRLQKESFEASMKHEKSVLLEEAQNE 629

Query: 184  EADLLQKIEREGNEWKTDMELRAEEMQKQLHEKEIEFQKEKEREMQKIHEEKEIVQRDIE 363
               +LQ  E +    +TD++ R ++ QK L E+ + F++ KERE+  +   KE V+R++E
Sbjct: 630  HIKMLQDFELQKMNLETDLQNRFDQKQKDLQERIVAFEEVKERELANMRCSKEDVEREME 689

Query: 364  QMKLDXXXXXXXXXXXXXXXXQAEKEWAEIKNDIVELQNQREKLQEQRKSLLKEKEGIIS 543
            +++                  +  ++  E++ DI EL     +L++QR+  ++E+   + 
Sbjct: 690  EIRSARLAVEREKQEVAINRDKLNEQQQEMRKDIDELGILSSRLKDQREHFIRERHSFLE 749

Query: 544  QCDQLKRLENELNIV-DCDLKQFNEAH-SNTQITPFDK------------AGPSDSK--- 672
              ++LK  +    I  D  L  F      + +I P  +             G S  K   
Sbjct: 750  FVEKLKSCKTCGEITRDFVLSNFQLPDVEDREIVPLPRLADELIRNHQGYLGASGVKNIK 809

Query: 673  -----------ASGRLSWIQRCASKLFNQSPSPGKVSENNGEKDGNEGQNLISAEVVSGL 819
                       ++GR+SW+++C +K+F  S SP K +E+  E  G     L + E    +
Sbjct: 810  RSPEAYSQYPESAGRMSWLRKCTTKIF--SISPTKRNESKAEGPG----ELTNKEAGGNI 863

Query: 820  -EVEKEHTAALPENQIHG---NDDEAEVVDNPSVHVKEAVVERQNLRHTRKSRPSVNFDQ 987
             E   E +  +P + I+      D+   VD+ S           +L H+          +
Sbjct: 864  HEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS---------GPSLDHSYTDSKVQEVPE 914

Query: 988  TNLASSDASGAMSKDKSKGKVFKRTRSIKAVVEDAKAIL-ESSVDKEMSDGDQLKDQIDA 1164
             +  S   SG     +       RTRS+KAVVEDAK  L ES  + E S+  Q  D   A
Sbjct: 915  DSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGESPEEPEPSESVQPDDISHA 974

Query: 1165 -VVEGGXXXXXXXXXXXXXXENRREGSDKLSSQVGRKRGRKSRVS-VEPDPEDAETQSEL 1338
              V  G              ENR   + +      R+R + S+++  E D  D+E +S+ 
Sbjct: 975  NEVSAG---------VSTHSENRARNNAR-----KRRRPQDSKITDTELDAADSEGRSDS 1020

Query: 1339 SIGGRKRQRRKSAVDGQNTGSGTPGSRRYNLRRH----TATSSMAPQAV--STKEDDNAA 1500
               G +R+R+++A  G      TPG +RYNLRR     TA +++A   +  + +E D   
Sbjct: 1021 VTTGGQRKRQQTAAQGLQ----TPGEKRYNLRRPKLTVTAKAALASSDLLKTRQEPDGGV 1076

Query: 1501 ESSGKDESQ 1527
               G  +++
Sbjct: 1077 VEGGVSDTE 1085

