BLASTX nr result

ID: Rauwolfia21_contig00005873 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00005873
         (2345 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN74873.1| hypothetical protein VITISV_038920 [Vitis vinifera]   500   e-139
ref|XP_002278531.2| PREDICTED: putative nuclear matrix constitue...   492   e-136
gb|EOY02174.1| Nuclear matrix constituent protein-related, putat...   452   e-124
gb|EOY02176.1| Nuclear matrix constituent protein-related, putat...   448   e-123
gb|EOY02175.1| Nuclear matrix constituent protein-related, putat...   448   e-123
gb|EOY02171.1| Nuclear matrix constituent protein-related, putat...   448   e-123
ref|XP_006437755.1| hypothetical protein CICLE_v10030538mg [Citr...   441   e-121
gb|EMJ28278.1| hypothetical protein PRUPE_ppa000415mg [Prunus pe...   441   e-121
ref|XP_006484395.1| PREDICTED: putative nuclear matrix constitue...   440   e-120
emb|CAN74990.1| hypothetical protein VITISV_008657 [Vitis vinifera]   420   e-114
gb|EOY04286.1| Nuclear matrix constituent protein 1-like protein...   414   e-113
gb|EOY02173.1| Nuclear matrix constituent protein-related, putat...   411   e-112
gb|EOY02172.1| Nuclear matrix constituent protein-related, putat...   411   e-112
ref|XP_002524388.1| ATP binding protein, putative [Ricinus commu...   404   e-109
dbj|BAA20407.1| nuclear matrix constituent protein 1 [Daucus car...   402   e-109
dbj|BAF64424.1| nuclear matrix constituent protein 1-like [Petro...   402   e-109
dbj|BAF64421.1| nuclear matrix constituent protein 1-like [Apium...   401   e-109
ref|XP_004297151.1| PREDICTED: putative nuclear matrix constitue...   400   e-108
dbj|BAF64423.1| nuclear matrix constituent protein 1-like [Foeni...   398   e-108
gb|EXB53970.1| hypothetical protein L484_022938 [Morus notabilis]     397   e-108

>emb|CAN74873.1| hypothetical protein VITISV_038920 [Vitis vinifera]
          Length = 1234

 Score =  500 bits (1288), Expect = e-139
 Identities = 319/749 (42%), Positives = 439/749 (58%), Gaps = 48/749 (6%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK ELEK++ D++++E+ + E  E+L + E+ER+EH RLQ EL++EI+K R Q       
Sbjct: 509  LKDELEKIRADITEQELQIHEETERLKVTEEERSEHHRLQLELKQEIDKCRHQEEMLQKE 568

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ+R+ FE++WE LDEKRA   +E+++I +EK+  EKL  SE+++LKKEKLA EE+
Sbjct: 569  REDLKQERIMFEKDWEALDEKRAVITKEMREIGDEKEKLEKLHLSEEERLKKEKLAMEEH 628

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            IQRELEA+R+EKESFA  MK+E+  LSEKAQ +H Q+L++FE R+RDLE EMQ +Q++++
Sbjct: 629  IQRELEAVRIEKESFAAIMKHEQVTLSEKAQNDHSQMLRDFELRKRDLEIEMQNRQDEIQ 688

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K  QE+ERAF+EER++  +NI+ LKE  ++E+E +  ER R++KEKQ++  N++ LE +Q
Sbjct: 689  KRLQERERAFEEERERELNNINHLKEVARREIEEMKTERRRIEKEKQEVLLNKRQLEGHQ 748

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            LEM+ DI ELGILS+KLKDQRE F+ +R RFL+FV++ K CKNCGEI R +VL+DLQL E
Sbjct: 749  LEMRKDIDELGILSRKLKDQREQFIKERDRFLTFVDKHKTCKNCGEITREFVLNDLQLPE 808

Query: 1443 MEHKETSPFPTLGDELLEK-----ASSYGTDMNRSPAE--KKSSDSGGRFSWLQKCTSKF 1285
            ME  E  P P L DE L       A+S GT++     E    SS SGGR S+L+KC +K 
Sbjct: 809  ME-VEAFPLPNLADEFLNSPQGNMAASDGTNVKIXTGEIDLVSSGSGGRMSFLRKCATKI 867

Query: 1284 RKFSPSAK--HFVPENL--EPALSDRLVVAANTEGPSTAATA---DKGKANILGAEPSCE 1126
               SPS K  H   + L  E  L D  V     EGPS    +   D+ + +   A  S +
Sbjct: 868  FNLSPSKKSEHVGVQVLREESPLLDLQVNLEKAEGPSIVGQSIAEDELEPSFGIANDSFD 927

Query: 1125 ICILG----------------DNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRR 994
            I  L                 D  SN   + Q+  ED              +  + G+ R
Sbjct: 928  IQQLHSDSVMREVDGGHAQSVDGVSNMGSKEQEGPEDSQQSELKSGRRKPGRKRRTGVHR 987

Query: 993  TRSVKAVVEDAAVILGKTPGEPILNQQRQKDA--SDDEGSRGESSLLDK----IPRKRTW 832
            TRSVK V+                  +R  D+  +++EG R E+S  +K    I RKR  
Sbjct: 988  TRSVKNVLN---------------GDERPNDSTYTNEEGER-ETSHAEKAASTITRKRQR 1031

Query: 831  AQTSRKVGSEQDGHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVA 652
            A +SR   SEQD  DSEGRS+SVTAGG  KRRQTVAP  Q P EKRYNLR HK  GT   
Sbjct: 1032 APSSRITESEQDAADSEGRSDSVTAGGRGKRRQTVAPVVQTPGEKRYNLRRHKTAGTVAT 1091

Query: 651  AGTSVD--KNGKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVES 478
            A  S +  K  ++  +GG+        N +  S+P L   S+N   TP+   T+ K+VE 
Sbjct: 1092 AQASANLPKRDEKGGDGGDDNTLQTKANPKAASSPSL-ADSDNPKTTPLVHVTTLKSVEI 1150

Query: 477  REFD-----KFRTQQ--NGAEDNAIAAENQTMDFSEEANGTPDFN---EEEHGSTLNSXX 328
            RE+      +F+T     G  D+A  AEN  M+  +E  G P      E+E+GS  +   
Sbjct: 1151 REYSPDRVVRFKTVDIVGGNNDSARLAEN--MELRQEIPGNPGDTPGYEDENGSMSHEED 1208

Query: 327  XXXXXXXXXXDVHPGEVSVSRKLWNFFTS 241
                        HPG+ S+ +KLWNFFT+
Sbjct: 1209 DNSDEDESE---HPGDASIGKKLWNFFTT 1234


>ref|XP_002278531.2| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like [Vitis vinifera]
          Length = 1213

 Score =  492 bits (1267), Expect = e-136
 Identities = 320/750 (42%), Positives = 436/750 (58%), Gaps = 49/750 (6%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK ELEK++ D++++E+ + E  E+L + E+ER+EH RLQ EL++EI+K R Q       
Sbjct: 491  LKDELEKIRADITEQELQIHEETERLKVTEEERSEHHRLQLELKQEIDKCRHQEEMLQKE 550

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ+R+ FE++WE LDEKRA   +E+++I +EK+  EKL  SE+++LKKEKLA EE+
Sbjct: 551  REDLKQERIMFEKDWEALDEKRAVITKEMREIGDEKEKLEKLHLSEEERLKKEKLAMEEH 610

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            IQRELEA+R+EKESFA  MK+E                   + R+RDLE EMQ +Q++++
Sbjct: 611  IQRELEAVRIEKESFAAIMKHE-------------------QLRKRDLEIEMQNRQDEIQ 651

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K  QE+ERAF+EER++  +NI+ LKE  ++E+E +  ER R++KEKQ++  N++ LE +Q
Sbjct: 652  KRLQERERAFEEERERELNNINHLKEVARREIEEMKTERRRIEKEKQEVLLNKRQLEGHQ 711

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            LEM+ DI ELGILS+KLKDQRE F+ +R RFL+FV++ K CKNCGEI R +VL+DLQL E
Sbjct: 712  LEMRKDIDELGILSRKLKDQREQFIKERDRFLTFVDKHKTCKNCGEITREFVLNDLQLPE 771

Query: 1443 MEHKETSPFPTLGDELLEK-----ASSYGTDMNRSPAE--KKSSDSGGRFSWLQKCTSKF 1285
            ME  E  P P L DE L       A+S GT++  S  E    SS SGGR S+L+KC +K 
Sbjct: 772  ME-VEAFPLPNLADEFLNSPQGNMAASDGTNVKISTGEIDLVSSGSGGRMSFLRKCATKI 830

Query: 1284 RKFSPSAK--HFVPENL--EPALSDRLVVAANTEGPSTAATA---DKGKANILGAEPSCE 1126
               SPS K  H   + L  E  L D  V     EGPS    +   D+ + +   A  S +
Sbjct: 831  FNLSPSKKSEHVGVQVLREESPLLDLQVNLEKAEGPSIVGQSIAEDELEPSFGIANDSFD 890

Query: 1125 ICILG----------------DNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRR 994
            I  L                 D  SN   + Q+  ED              +  + G+ R
Sbjct: 891  IQQLHSDSVMREVDGGHAQSVDGVSNMGSKEQEGPEDSQQSELKSGRRKPGRKRRTGVHR 950

Query: 993  TRSVKAVVEDAAVILGKTPGEPILN-QQRQKDA--SDDEGSRGESSLLDK----IPRKRT 835
            TRSVK VVEDA   LG+TP  P LN  +R  D+  +++EG R E+S  +K    I RKR 
Sbjct: 951  TRSVKNVVEDAKAFLGETPEIPELNGDERPNDSTYTNEEGER-ETSHAEKAASTITRKRQ 1009

Query: 834  WAQTSRKVGSEQDGHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAV 655
             A +SR   SEQD  DSEGRS+SVTAGG  KRRQTVAP  Q P EKRYNLR HK  GT  
Sbjct: 1010 RAPSSRITESEQDAADSEGRSDSVTAGGRGKRRQTVAPVVQTPGEKRYNLRRHKTAGTVA 1069

Query: 654  AAGTSVD--KNGKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVE 481
             A  S +  K  ++  +GG+        N +  S+P L   S+N   TP+   T+ K+VE
Sbjct: 1070 TAQASANLPKRDEKGGDGGDDNTLQTKANPKAASSPSL-ADSDNPKTTPLVHVTTLKSVE 1128

Query: 480  SREFD-----KFRTQQ--NGAEDNAIAAENQTMDFSEEANGTPDFN---EEEHGSTLNSX 331
             RE+      +F+T     G  D+A  AEN  M+  +E  G P      E+E+GS  +  
Sbjct: 1129 IREYSPDRVVRFKTVDIVGGNNDSARLAEN--MELRQEIPGNPGDTPGYEDENGSMSHEE 1186

Query: 330  XXXXXXXXXXXDVHPGEVSVSRKLWNFFTS 241
                         HPG+ S+ +KLWNFFT+
Sbjct: 1187 DDNSDEDESE---HPGDASIGKKLWNFFTT 1213


>gb|EOY02174.1| Nuclear matrix constituent protein-related, putative isoform 4
            [Theobroma cacao]
          Length = 1195

 Score =  452 bits (1162), Expect = e-124
 Identities = 285/721 (39%), Positives = 417/721 (57%), Gaps = 20/721 (2%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+  + SQ+E+ +RE ++KL + E+ER+EH RLQ EL+++I+  R Q       
Sbjct: 496  LKDEIDKIGAETSQQELRIREESQKLKITEEERSEHIRLQSELKQQIDSCRHQEELLLKE 555

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R  FE+EWE LDEKRA    + ++I EEK  FEK R SE+++LKKE+ A  +Y
Sbjct: 556  HEDLKQQRENFEKEWEVLDEKRAEITMQRKEIVEEKDKFEKFRHSEEERLKKEESAMRDY 615

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            + RE+E+IRL+KESF  +MK+EKS L E+AQ EH ++LQ+FE ++ +LET++Q + +  +
Sbjct: 616  VCREMESIRLQKESFEASMKHEKSVLLEEAQNEHIKMLQDFELQKMNLETDLQNRFDQKQ 675

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            KD QE+  AF+E +++   N+   KE+V++EME + + R  +++EKQ++  NR  L E Q
Sbjct: 676  KDLQERIVAFEEVKERELANMRCSKEDVEREMEEIRSARLAVEREKQEVAINRDKLNEQQ 735

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
             EM+ DI ELGILS +LKDQREHF+ +R  FL FVE+LK+CK CGEI R +VLS+ QL +
Sbjct: 736  QEMRKDIDELGILSSRLKDQREHFIRERHSFLEFVEKLKSCKTCGEITRDFVLSNFQLPD 795

Query: 1443 MEHKETSPFPTLGDELLEKASSY-----GTDMNRSP-AEKKSSDSGGRFSWLQKCTSKFR 1282
            +E +E  P P L DEL+     Y       ++ RSP A  +  +S GR SWL+KCT+K  
Sbjct: 796  VEDREIVPLPRLADELIRNHQGYLGASGVKNIKRSPEAYSQYPESAGRMSWLRKCTTKIF 855

Query: 1281 KFSPSAKHFVPENLEPALSDRLV---VAANTEGPSTAATADKGKANILGAEPSCEICILG 1111
              SP+ ++         L+++     +      PS     D     +L ++   ++    
Sbjct: 856  SISPTKRNESKAEGPGELTNKEAGGNIHEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS 915

Query: 1110 D---NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVILGKT 940
                + S  + ++Q+V ED              +  + G+ RTRSVKAVVEDA + LG++
Sbjct: 916  GPSLDHSYTDSKVQEVPEDSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGES 975

Query: 939  PGEPILNQQRQKD--ASDDEGSRGESSLLD----KIPRKRTWAQTSRKVGSEQDGHDSEG 778
            P EP  ++  Q D  +  +E S G S+  +       RKR   Q S+   +E D  DSEG
Sbjct: 976  PEEPEPSESVQPDDISHANEVSAGVSTHSENRARNNARKRRRPQDSKITDTELDAADSEG 1035

Query: 777  RSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVD-KNGKETTEGGN 601
            RS+SVT GG RKR+QT A   Q P EKRYNLR  K   TA AA  S D    ++  +GG 
Sbjct: 1036 RSDSVTTGGQRKRQQTAAQGLQTPGEKRYNLRRPKLTVTAKAALASSDLLKTRQEPDGG- 1094

Query: 600  SGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVESREFDKFRTQQNGAEDNAIA 421
                       V    V +T  ENR+   V Q T+ K VE  E +KF+T  +  +DNA A
Sbjct: 1095 -----------VVEGGVSDT--ENRSSNLV-QVTTLKNVEIVE-EKFKTSVD-VDDNANA 1138

Query: 420  AEN-QTMDFSEEANGTPDFNEEEHGSTLNSXXXXXXXXXXXXDVHPGEVSVSRKLWNFFT 244
            A+   ++D SEE     + NE++  S+++               HPGEVS+ +K+W FFT
Sbjct: 1139 AKPVGSVDLSEEVGTAENGNEDQSVSSIDEDEDDSDDEIE----HPGEVSIGKKIWTFFT 1194

Query: 243  S 241
            S
Sbjct: 1195 S 1195


>gb|EOY02176.1| Nuclear matrix constituent protein-related, putative isoform 6
            [Theobroma cacao]
          Length = 1179

 Score =  448 bits (1153), Expect = e-123
 Identities = 283/723 (39%), Positives = 416/723 (57%), Gaps = 22/723 (3%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+  + SQ+E+ +RE ++KL + E+ER+EH RLQ EL+++I+  R Q       
Sbjct: 477  LKDEIDKIGAETSQQELRIREESQKLKITEEERSEHIRLQSELKQQIDSCRHQEELLLKE 536

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R  FE+EWE LDEKRA    + ++I EEK  FEK R SE+++LKKE+ A  +Y
Sbjct: 537  HEDLKQQRENFEKEWEVLDEKRAEITMQRKEIVEEKDKFEKFRHSEEERLKKEESAMRDY 596

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            + RE+E+IRL+KESF  +MK+EKS L E+AQ EH ++LQ+FE ++ +LET++Q + +  +
Sbjct: 597  VCREMESIRLQKESFEASMKHEKSVLLEEAQNEHIKMLQDFELQKMNLETDLQNRFDQKQ 656

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            KD QE+  AF+E +++   N+   KE+V++EME + + R  +++EKQ++  NR  L E Q
Sbjct: 657  KDLQERIVAFEEVKERELANMRCSKEDVEREMEEIRSARLAVEREKQEVAINRDKLNEQQ 716

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
             EM+ DI ELGILS +LKDQREHF+ +R  FL FVE+LK+CK CGEI R +VLS+ QL +
Sbjct: 717  QEMRKDIDELGILSSRLKDQREHFIRERHSFLEFVEKLKSCKTCGEITRDFVLSNFQLPD 776

Query: 1443 MEHKETSPFPTLGDELLEKASSY-----GTDMNRSP-AEKKSSDSGGRFSWLQKCTSKFR 1282
            +E +E  P P L DEL+     Y       ++ RSP A  +  +S GR SWL+KCT+K  
Sbjct: 777  VEDREIVPLPRLADELIRNHQGYLGASGVKNIKRSPEAYSQYPESAGRMSWLRKCTTKIF 836

Query: 1281 KFSPSAKHFVPENLEPALSDRLV---VAANTEGPSTAATADKGKANILGAEPSCEICILG 1111
              SP+ ++         L+++     +      PS     D     +L ++   ++    
Sbjct: 837  SISPTKRNESKAEGPGELTNKEAGGNIHEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS 896

Query: 1110 D---NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVILGKT 940
                + S  + ++Q+V ED              +  + G+ RTRSVKAVVEDA + LG++
Sbjct: 897  GPSLDHSYTDSKVQEVPEDSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGES 956

Query: 939  PGEPILNQQRQKD--ASDDEGSRGESSLLD----KIPRKRTWAQTSRKVGSEQDGHDSEG 778
            P EP  ++  Q D  +  +E S G S+  +       RKR   Q S+   +E D  DSEG
Sbjct: 957  PEEPEPSESVQPDDISHANEVSAGVSTHSENRARNNARKRRRPQDSKITDTELDAADSEG 1016

Query: 777  RSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVD-KNGKETTEGGN 601
            RS+SVT GG RKR+QT A   Q P EKRYNLR  K   TA AA  S D    ++  +GG 
Sbjct: 1017 RSDSVTTGGQRKRQQTAAQGLQTPGEKRYNLRRPKLTVTAKAALASSDLLKTRQEPDGG- 1075

Query: 600  SGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVE--SREFDKFRTQQNGAEDNA 427
                       V    V +T  ENR+   V Q T+ K VE    +  +F+T  +  +DNA
Sbjct: 1076 -----------VVEGGVSDT--ENRSSNLV-QVTTLKNVEIVEEKVVRFKTSVD-VDDNA 1120

Query: 426  IAAEN-QTMDFSEEANGTPDFNEEEHGSTLNSXXXXXXXXXXXXDVHPGEVSVSRKLWNF 250
             AA+   ++D SEE     + NE++  S+++               HPGEVS+ +K+W F
Sbjct: 1121 NAAKPVGSVDLSEEVGTAENGNEDQSVSSIDEDEDDSDDEIE----HPGEVSIGKKIWTF 1176

Query: 249  FTS 241
            FTS
Sbjct: 1177 FTS 1179


>gb|EOY02175.1| Nuclear matrix constituent protein-related, putative isoform 5
            [Theobroma cacao]
          Length = 1188

 Score =  448 bits (1153), Expect = e-123
 Identities = 283/723 (39%), Positives = 416/723 (57%), Gaps = 22/723 (3%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+  + SQ+E+ +RE ++KL + E+ER+EH RLQ EL+++I+  R Q       
Sbjct: 486  LKDEIDKIGAETSQQELRIREESQKLKITEEERSEHIRLQSELKQQIDSCRHQEELLLKE 545

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R  FE+EWE LDEKRA    + ++I EEK  FEK R SE+++LKKE+ A  +Y
Sbjct: 546  HEDLKQQRENFEKEWEVLDEKRAEITMQRKEIVEEKDKFEKFRHSEEERLKKEESAMRDY 605

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            + RE+E+IRL+KESF  +MK+EKS L E+AQ EH ++LQ+FE ++ +LET++Q + +  +
Sbjct: 606  VCREMESIRLQKESFEASMKHEKSVLLEEAQNEHIKMLQDFELQKMNLETDLQNRFDQKQ 665

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            KD QE+  AF+E +++   N+   KE+V++EME + + R  +++EKQ++  NR  L E Q
Sbjct: 666  KDLQERIVAFEEVKERELANMRCSKEDVEREMEEIRSARLAVEREKQEVAINRDKLNEQQ 725

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
             EM+ DI ELGILS +LKDQREHF+ +R  FL FVE+LK+CK CGEI R +VLS+ QL +
Sbjct: 726  QEMRKDIDELGILSSRLKDQREHFIRERHSFLEFVEKLKSCKTCGEITRDFVLSNFQLPD 785

Query: 1443 MEHKETSPFPTLGDELLEKASSY-----GTDMNRSP-AEKKSSDSGGRFSWLQKCTSKFR 1282
            +E +E  P P L DEL+     Y       ++ RSP A  +  +S GR SWL+KCT+K  
Sbjct: 786  VEDREIVPLPRLADELIRNHQGYLGASGVKNIKRSPEAYSQYPESAGRMSWLRKCTTKIF 845

Query: 1281 KFSPSAKHFVPENLEPALSDRLV---VAANTEGPSTAATADKGKANILGAEPSCEICILG 1111
              SP+ ++         L+++     +      PS     D     +L ++   ++    
Sbjct: 846  SISPTKRNESKAEGPGELTNKEAGGNIHEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS 905

Query: 1110 D---NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVILGKT 940
                + S  + ++Q+V ED              +  + G+ RTRSVKAVVEDA + LG++
Sbjct: 906  GPSLDHSYTDSKVQEVPEDSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGES 965

Query: 939  PGEPILNQQRQKD--ASDDEGSRGESSLLD----KIPRKRTWAQTSRKVGSEQDGHDSEG 778
            P EP  ++  Q D  +  +E S G S+  +       RKR   Q S+   +E D  DSEG
Sbjct: 966  PEEPEPSESVQPDDISHANEVSAGVSTHSENRARNNARKRRRPQDSKITDTELDAADSEG 1025

Query: 777  RSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVD-KNGKETTEGGN 601
            RS+SVT GG RKR+QT A   Q P EKRYNLR  K   TA AA  S D    ++  +GG 
Sbjct: 1026 RSDSVTTGGQRKRQQTAAQGLQTPGEKRYNLRRPKLTVTAKAALASSDLLKTRQEPDGG- 1084

Query: 600  SGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVE--SREFDKFRTQQNGAEDNA 427
                       V    V +T  ENR+   V Q T+ K VE    +  +F+T  +  +DNA
Sbjct: 1085 -----------VVEGGVSDT--ENRSSNLV-QVTTLKNVEIVEEKVVRFKTSVD-VDDNA 1129

Query: 426  IAAEN-QTMDFSEEANGTPDFNEEEHGSTLNSXXXXXXXXXXXXDVHPGEVSVSRKLWNF 250
             AA+   ++D SEE     + NE++  S+++               HPGEVS+ +K+W F
Sbjct: 1130 NAAKPVGSVDLSEEVGTAENGNEDQSVSSIDEDEDDSDDEIE----HPGEVSIGKKIWTF 1185

Query: 249  FTS 241
            FTS
Sbjct: 1186 FTS 1188


>gb|EOY02171.1| Nuclear matrix constituent protein-related, putative isoform 1
            [Theobroma cacao]
          Length = 1198

 Score =  448 bits (1153), Expect = e-123
 Identities = 283/723 (39%), Positives = 416/723 (57%), Gaps = 22/723 (3%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+  + SQ+E+ +RE ++KL + E+ER+EH RLQ EL+++I+  R Q       
Sbjct: 496  LKDEIDKIGAETSQQELRIREESQKLKITEEERSEHIRLQSELKQQIDSCRHQEELLLKE 555

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R  FE+EWE LDEKRA    + ++I EEK  FEK R SE+++LKKE+ A  +Y
Sbjct: 556  HEDLKQQRENFEKEWEVLDEKRAEITMQRKEIVEEKDKFEKFRHSEEERLKKEESAMRDY 615

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            + RE+E+IRL+KESF  +MK+EKS L E+AQ EH ++LQ+FE ++ +LET++Q + +  +
Sbjct: 616  VCREMESIRLQKESFEASMKHEKSVLLEEAQNEHIKMLQDFELQKMNLETDLQNRFDQKQ 675

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            KD QE+  AF+E +++   N+   KE+V++EME + + R  +++EKQ++  NR  L E Q
Sbjct: 676  KDLQERIVAFEEVKERELANMRCSKEDVEREMEEIRSARLAVEREKQEVAINRDKLNEQQ 735

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
             EM+ DI ELGILS +LKDQREHF+ +R  FL FVE+LK+CK CGEI R +VLS+ QL +
Sbjct: 736  QEMRKDIDELGILSSRLKDQREHFIRERHSFLEFVEKLKSCKTCGEITRDFVLSNFQLPD 795

Query: 1443 MEHKETSPFPTLGDELLEKASSY-----GTDMNRSP-AEKKSSDSGGRFSWLQKCTSKFR 1282
            +E +E  P P L DEL+     Y       ++ RSP A  +  +S GR SWL+KCT+K  
Sbjct: 796  VEDREIVPLPRLADELIRNHQGYLGASGVKNIKRSPEAYSQYPESAGRMSWLRKCTTKIF 855

Query: 1281 KFSPSAKHFVPENLEPALSDRLV---VAANTEGPSTAATADKGKANILGAEPSCEICILG 1111
              SP+ ++         L+++     +      PS     D     +L ++   ++    
Sbjct: 856  SISPTKRNESKAEGPGELTNKEAGGNIHEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS 915

Query: 1110 D---NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVILGKT 940
                + S  + ++Q+V ED              +  + G+ RTRSVKAVVEDA + LG++
Sbjct: 916  GPSLDHSYTDSKVQEVPEDSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGES 975

Query: 939  PGEPILNQQRQKD--ASDDEGSRGESSLLD----KIPRKRTWAQTSRKVGSEQDGHDSEG 778
            P EP  ++  Q D  +  +E S G S+  +       RKR   Q S+   +E D  DSEG
Sbjct: 976  PEEPEPSESVQPDDISHANEVSAGVSTHSENRARNNARKRRRPQDSKITDTELDAADSEG 1035

Query: 777  RSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVD-KNGKETTEGGN 601
            RS+SVT GG RKR+QT A   Q P EKRYNLR  K   TA AA  S D    ++  +GG 
Sbjct: 1036 RSDSVTTGGQRKRQQTAAQGLQTPGEKRYNLRRPKLTVTAKAALASSDLLKTRQEPDGG- 1094

Query: 600  SGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVE--SREFDKFRTQQNGAEDNA 427
                       V    V +T  ENR+   V Q T+ K VE    +  +F+T  +  +DNA
Sbjct: 1095 -----------VVEGGVSDT--ENRSSNLV-QVTTLKNVEIVEEKVVRFKTSVD-VDDNA 1139

Query: 426  IAAEN-QTMDFSEEANGTPDFNEEEHGSTLNSXXXXXXXXXXXXDVHPGEVSVSRKLWNF 250
             AA+   ++D SEE     + NE++  S+++               HPGEVS+ +K+W F
Sbjct: 1140 NAAKPVGSVDLSEEVGTAENGNEDQSVSSIDEDEDDSDDEIE----HPGEVSIGKKIWTF 1195

Query: 249  FTS 241
            FTS
Sbjct: 1196 FTS 1198


>ref|XP_006437755.1| hypothetical protein CICLE_v10030538mg [Citrus clementina]
            gi|557539951|gb|ESR50995.1| hypothetical protein
            CICLE_v10030538mg [Citrus clementina]
          Length = 1222

 Score =  441 bits (1135), Expect = e-121
 Identities = 265/733 (36%), Positives = 413/733 (56%), Gaps = 32/733 (4%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K++ + +Q+E+ ++E  +KL +NE+E++E  RLQ +L+++IE  R Q       
Sbjct: 495  LKVEIDKIESENAQQELQIQEECQKLKINEEEKSELLRLQSQLKQQIETYRHQQELLLKE 554

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                +QDR KFE+EWE LDEKR    +E ++I +EK+  EKL+ S +++LKKE+ A  +Y
Sbjct: 555  HEDLQQDREKFEKEWEVLDEKRDEINKEQEKIADEKKKLEKLQHSAEERLKKEECAMRDY 614

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            +QRE+EAIRL+KE+F  TM++E+  LSEKA+ +  ++L+EFE +R + E E+  +++ ME
Sbjct: 615  VQREIEAIRLDKEAFEATMRHEQLVLSEKAKNDRRKMLEEFEMQRMNQEAELLNRRDKME 674

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K+ QE+ R F+E+R++  ++I+ LKE  + E++ + +ER +L+KEK ++  NR+ L+E Q
Sbjct: 675  KELQERTRTFEEKRERVLNDIAHLKEVAEGEIQEIKSERDQLEKEKHEVKVNREKLQEQQ 734

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            L M+ DI EL IL ++L   RE F  ++ RFL FVE+  +CKNCGE+ R++V+S+LQL +
Sbjct: 735  LGMRKDIDELDILCRRLYGDREQFKREKERFLEFVEKHTSCKNCGEMMRAFVISNLQLPD 794

Query: 1443 MEHKETSPFPTLGDELL-----EKASSYGTDMNRSPAEKK--SSDSGGRFSWLQKCTSKF 1285
             E +   P P + +  L     + A+ Y ++++ S        +DSGGR SWL+KCTSK 
Sbjct: 795  DEARNDIPLPQVAERCLGNLQGDVAAPYDSNISNSHGGMNLGRADSGGRMSWLRKCTSKI 854

Query: 1284 RKFSPSAKH-----FVPENLEPALSDRLVVAANTEGPSTAATADKGKANILGAEPSCEIC 1120
               SP  K       + E  EP  +   ++    EGP    + +    +    EP     
Sbjct: 855  FSISPIKKSEHISTSMLEEEEPQSAVPTIMQEKAEGPGVLVSKEAIGYSSPEDEPQSSFR 914

Query: 1119 ILGDNQSNGND---------------RIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRS 985
            ++ D+ +   D               +++ V ED              +  + G+ RTRS
Sbjct: 915  LVNDSTNREVDDEYAPSVDGHSYMDSKVEDVAEDSQQSELRSGKRRPGRKRKSGVNRTRS 974

Query: 984  VKAVVEDAAVILGKTPGEPILNQQRQKDASDDEGSRGESSLLDKIPRKRTWAQTSRKVGS 805
            +KA VEDA + LG++P    LN   Q    D +G    +     + +KR   QTS+   S
Sbjct: 975  LKAAVEDAKLFLGESPEGAGLNASFQAH-EDSQGISSHTQEASNMAKKRRRPQTSKTTQS 1033

Query: 804  EQDGHDSEGRSESVTA-GGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKN 628
            E+DG  SEG S+SVTA GG RKRRQTVA   Q P E+RYNLR HK     +A   S D +
Sbjct: 1034 EKDGAGSEGYSDSVTAGGGRRKRRQTVATVSQTPGERRYNLRRHKTSSAVLALEASADLS 1093

Query: 627  GKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVESRE--FDKFRT 454
                T    + P     N +  S       +ENR  T + Q TS K++E  +    +F++
Sbjct: 1094 KANKTVAEVTNPVEVVSNPKSASTFPPAVLNENRKSTHLAQVTSVKSMELSQDRAVRFKS 1153

Query: 453  QQNGAEDNAIAAEN-QTMDFSEEANGTPDF-NEEEHGSTLNSXXXXXXXXXXXXDVHPGE 280
              N  ++NA A ++ +    SEE NGT ++ +E+E+G  +                HPGE
Sbjct: 1154 TTNIVDENADAPKSIENTVLSEEVNGTSEYVDEDENGGRVLEDEEDDDDDSD----HPGE 1209

Query: 279  VSVSRKLWNFFTS 241
             S+ +KLWNFFTS
Sbjct: 1210 ASIGKKLWNFFTS 1222


>gb|EMJ28278.1| hypothetical protein PRUPE_ppa000415mg [Prunus persica]
          Length = 1198

 Score =  441 bits (1135), Expect = e-121
 Identities = 282/738 (38%), Positives = 417/738 (56%), Gaps = 37/738 (5%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+K +  Q E+ +RE  EKL++ ++ER+EH RLQ EL++EI+  R Q       
Sbjct: 488  LKEEIQKIKDENVQLELQIREEREKLVITQEERSEHLRLQSELQQEIKTYRLQNELLSKE 547

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R KFEEEWE LDE++A  +R L++I EEK+  EKL+ +E+++LK+EK A ++Y
Sbjct: 548  AEDLKQQREKFEEEWENLDERKAEISRGLEKIVEEKEKLEKLQGTEEERLKEEKHAMQDY 607

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            I+REL+ + LEKESFA  M+ E+ A++EKAQ +H Q++Q+FE+++R+LE +MQ +Q++ME
Sbjct: 608  IKRELDNLNLEKESFAAKMRNEQFAIAEKAQFQHSQMVQDFESQKRELEVDMQNRQQEME 667

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K  QE ERAF+EE+D+   NI+ LKE  +K+ E + +E+ R++KE++++  N+K +E  Q
Sbjct: 668  KHLQEMERAFEEEKDREYTNINFLKEVAEKKSEELRSEKYRMEKEREELALNKKQVEVNQ 727

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            LEM+ DI +L +LSKK+K QRE  + +RGRFL+FVE++K+CK+CGE+ R +VLSDLQ+  
Sbjct: 728  LEMRKDIDQLAMLSKKIKHQREQLIEERGRFLAFVEKIKSCKDCGEMTREFVLSDLQVPG 787

Query: 1443 MEHK-ETSPFPTLGDELLEKASSYGTDMNRSPAEKKSSDSGGRFSWLQKCTSKFRKFSPS 1267
            M H  E    P L DE L+ + +     + S  + +  +SG   S L+KC S   K SP 
Sbjct: 788  MYHHIEAVSLPRLSDEFLKNSQA-----DLSAPDLEYPESGWGTSLLRKCKSMVSKVSPI 842

Query: 1266 AKHFVPENLEPALSDRLVVAANTEGPSTAATADKGKANILGAEPSCEIC----------- 1120
             K    E++  A+S  L        P +    ++G    +G E   E             
Sbjct: 843  KKM---EHITDAVSTELP-------PLSTMKVNEGARGHIGHEDEPEPSFRMPNDAISQP 892

Query: 1119 ILGDNQSNGND---------------RIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRS 985
            +  DN +   D               +++ V +D              +  +  + RTR+
Sbjct: 893  LPSDNTTKEVDDGYAPSIDDHSFIDSKVKDVPDDSEQSELKSYQCKPGRGRKSRLSRTRT 952

Query: 984  VKAVVEDAAVILGKTPGEPILNQQRQKDASD-DEGSRGESSLLDK----IPRKRTWAQTS 820
            VKA VE+A + L  T  EP        D+S+  E SRG+SS ++K    I RKR  AQ+S
Sbjct: 953  VKATVEEAKIFLRDTLEEPSNASMLPNDSSNIHEESRGDSSFVEKANTSIGRKRRRAQSS 1012

Query: 819  RKVGSEQDGHDSEGRSESVT-AGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGT 643
            R   SEQD  DSEGRS SVT AGG RKRRQ++A + Q P E+RYNLRH K  G+  AA  
Sbjct: 1013 RITESEQDDCDSEGRSGSVTTAGGRRKRRQSIASSVQAPGEQRYNLRHRKTAGSVTAAPA 1072

Query: 642  SVDKNGKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVE--SREF 469
            + D   +   E G  G    P+++       L  A E      + Q T+ K+VE      
Sbjct: 1073 AADLKKRRKEEAGGGGAEPNPESVS-----SLGMAGETGQTAQLMQVTTSKSVEFSQERV 1127

Query: 468  DKFRTQQNGAEDNAIAAEN--QTMDFSEEANGTPDFNEEEHGSTLNSXXXXXXXXXXXXD 295
             +F T ++  + NA  A    +  + S E NGTP     E GS  N+             
Sbjct: 1128 VRFSTPEDIVDGNAADAAKTVENTELSGEDNGTP-----ESGSGNNTVGESDDDYDDEE- 1181

Query: 294  VHPGEVSVSRKLWNFFTS 241
              PGE S+ +K+WNF T+
Sbjct: 1182 -RPGEASIRKKIWNFLTT 1198


>ref|XP_006484395.1| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like [Citrus sinensis]
          Length = 1222

 Score =  440 bits (1132), Expect = e-120
 Identities = 267/733 (36%), Positives = 413/733 (56%), Gaps = 32/733 (4%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K++ +  Q+E+ ++E  +KL +NE+E++E  RLQ +L+++IE  R Q       
Sbjct: 495  LKVEIDKIESENVQQELQIQEECQKLKINEEEKSELLRLQSQLKQQIETYRHQQELLLKE 554

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                +QDR KFE+EWE LDEKR    +E ++I +EK+  EKL+ S +++LKKE+ A  +Y
Sbjct: 555  HEDLQQDREKFEKEWEVLDEKRDEINKEQEKIADEKKKLEKLQHSAEERLKKEECAMRDY 614

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            +QRE+EAIRL+KE+F  TM++E+  LSEKA+ +  ++L+EFE +R + E E+  +++ ME
Sbjct: 615  VQREIEAIRLDKEAFEATMRHEQLVLSEKAKNDRRKMLEEFEMQRMNQEAELLNRRDKME 674

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K+ QE+ R F+E+R++  ++I+ LKE  + E++ + +ER +L+KEK ++  NR+ L+E Q
Sbjct: 675  KELQERTRTFEEKRERVLNDIAHLKEVAEGEIQEIKSERDQLEKEKHEVKVNREKLQEQQ 734

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            L M+ DI EL IL ++L   RE F  ++ RFL FVE+  +CKNCGE+ R++V+S+LQL +
Sbjct: 735  LGMRKDIDELDILCRRLYGDREQFKREKERFLEFVEKHTSCKNCGEMMRAFVISNLQLPD 794

Query: 1443 MEHKETSPFPTLGDELL-----EKASSYGTDMNRSPAEKK--SSDSGGRFSWLQKCTSKF 1285
             E +   P P + +  L     + A+ Y ++++ S        +DSGG  SWL+KCTSK 
Sbjct: 795  DEARNDIPLPQVAERCLGNRQGDVAAPYDSNISNSHGGMNLGRADSGGHMSWLRKCTSKI 854

Query: 1284 RKFSPSAKH-----FVPENLEPALSDRLVVAANTEGPSTAATADKGKANILGAEPSCEIC 1120
               SP  K       + E  EP  +   ++    EGP    + +    +    EP     
Sbjct: 855  FSISPIKKSEHISTSMLEEEEPQSAVPTIMQEKAEGPGVLVSKEAIGYSSPEDEPQSSFR 914

Query: 1119 ILGDNQSNGND---------------RIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRS 985
            ++ D+ +   D               +++ V ED              +  + G+ RTRS
Sbjct: 915  LVNDSTNREMDDEYAPSVDGHSYMDSKVEDVAEDSQQSELRSGKRRPGRKRKSGVNRTRS 974

Query: 984  VKAVVEDAAVILGKTPGEPILNQQRQKDASDDEGSRGESSLLDKIPRKRTWAQTSRKVGS 805
            VKA VEDA + LG++P    LN   Q    D +G    +     + +KR   QTS+   S
Sbjct: 975  VKAAVEDAKLFLGESPEGAGLNASFQAH-EDSQGISSHTQEASNMAKKRRRPQTSKTTQS 1033

Query: 804  EQDGHDSEGRSESVTA-GGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKN 628
            E+DG DSEG S+SVTA GG RKRRQTVA   Q P E+RYNLR HK     +A   S D +
Sbjct: 1034 EKDGADSEGYSDSVTAGGGRRKRRQTVATVSQTPGERRYNLRRHKTSSAVLALEASADLS 1093

Query: 627  GKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVE-SRE-FDKFRT 454
                T    + P     N +  S       +EN   T + Q TS K++E SR+   +F++
Sbjct: 1094 KANKTVAEVTNPVEVVSNPKSASTFPPAVLNENGKSTHLAQVTSVKSMELSRDRAVRFKS 1153

Query: 453  QQNGAEDNAIAAEN-QTMDFSEEANGTPDF-NEEEHGSTLNSXXXXXXXXXXXXDVHPGE 280
              N  ++NA A ++ +    SEE NGT ++ +E+E+G  +                HPGE
Sbjct: 1154 TTNIVDENADAPKSIENTVLSEEVNGTSEYVDEDENGGRVLEDEEDDDDDSD----HPGE 1209

Query: 279  VSVSRKLWNFFTS 241
             S+ +KLWNFFTS
Sbjct: 1210 ASIGKKLWNFFTS 1222


>emb|CAN74990.1| hypothetical protein VITISV_008657 [Vitis vinifera]
          Length = 1140

 Score =  420 bits (1080), Expect = e-114
 Identities = 264/660 (40%), Positives = 383/660 (58%), Gaps = 35/660 (5%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK   EK+++++ ++++ + E  E+L + E+ER+E  RLQ EL++EIEK R +       
Sbjct: 449  LKAVAEKIRVEIEEQKLKVHEEREQLEITEEERSEFLRLQSELKQEIEKYRLEKEVLLKE 508

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                K  R  FE EWE LDEK A   ++L  + E+++  EKL+ SE+++LK EKLAT++Y
Sbjct: 509  VEDLKLQRETFEREWEVLDEKXAEIEKDLIDVSEQREKLEKLKHSEEERLKTEKLATQDY 568

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            IQRE E+++L KESFA +M++E+S LSEKAQ E  Q++ +FE  +R+LET++Q +QE++E
Sbjct: 569  IQREFESLKLAKESFAASMEHEQSVLSEKAQSEKSQMIHDFELLKRELETDIQNRQEELE 628

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K  QE+E+ F+EER++  +N++ L+E  ++EME V  ER R++KEKQ++ +N+KHL+E+Q
Sbjct: 629  KQLQEREKVFEEERERELNNVNYLREVARQEMEEVKLERLRIEKEKQEVAANKKHLDEHQ 688

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQ-LA 1447
             EM+ DI EL  LS+KLKDQRE F  +R RF++FVE+ K+CKNCGEI   +VLSDLQ L 
Sbjct: 689  FEMRKDIDELVSLSRKLKDQRELFSKERERFIAFVEQQKSCKNCGEITCEFVLSDLQPLP 748

Query: 1446 EMEHKETSPFPTLGDELLE------KASSYGTDMNRSP--AEKKSSDSGGRFSWLQKCTS 1291
            E+E+ E  P P L D   +       A+S   ++  +P      S  SGG  S+L+KCTS
Sbjct: 749  EIENVEVPPLPRLADRYFKGSVQGNMAASERQNIEMTPGIVGSGSPTSGGTISFLRKCTS 808

Query: 1290 KFRKFSPSAKHFV---------PENLEPAL---SDRLVVAANTEGPSTAATADKGKANIL 1147
            K    SP  K  V         PE    A+   S RL    +   PS     D      +
Sbjct: 809  KIFNLSPGKKIEVAAIQNLTEAPEPSRQAIVEPSKRLGSTEDEPEPSFRIANDSFDVQRI 868

Query: 1146 GAEPSCEICILGD----NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVK 979
             ++ S +    G     ++SN + +  ++ +               K ++  I RTRSVK
Sbjct: 869  QSDNSIKEVEAGQDLSIDESNIDSKALELQQHSQHSDLKGARRKPGKRSKQRIHRTRSVK 928

Query: 978  AVVEDAAVILGKT---PGEPILNQQRQKDASDDEGSRGESSLLDK----IPRKRTWAQTS 820
            AVV DA  ILG++         N   +  A  ++ SRGESS  DK      RKR  A TS
Sbjct: 929  AVVRDAKAILGESLELSENEHPNGNPEDSAHMNDESRGESSFADKGTPRNGRKRQRAYTS 988

Query: 819  RKVGSEQDGHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTS 640
            + + SEQDG DSEGRS+SV A    KRRQ V PA Q   ++RYNLR  KN  T  AA +S
Sbjct: 989  QTMVSEQDGDDSEGRSDSVMARRQGKRRQKVPPAVQTLGQERYNLRRPKNTVTVAAAKSS 1048

Query: 639  VDKNGKETTEGGNSGPAGAPQNI-EVTSAPVLETA--SENRTQTPVDQNTSYKTVESREF 469
             + + ++ TE   SG  G  + I +  +AP       SEN   T V Q  +++T+    F
Sbjct: 1049 TNLHKRKETETDGSGAGGTGEEIPDCNAAPATSVGLISENGGSTHVLQVETFETIVDVHF 1108


>gb|EOY04286.1| Nuclear matrix constituent protein 1-like protein, putative isoform 1
            [Theobroma cacao]
          Length = 1177

 Score =  414 bits (1064), Expect = e-113
 Identities = 273/748 (36%), Positives = 407/748 (54%), Gaps = 47/748 (6%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E+EK++++  +K + + E N++L + E+ER+E+ RLQ EL+EEIEK R         
Sbjct: 460  LKAEVEKIRVENEEKLLKMHEENDRLRVTEEERSEYLRLQLELKEEIEKCRLSEELLLKE 519

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                K+ +  FE EWE LDEKR    +EL+ I ++ + FEK + +E+++LK EK   E+Y
Sbjct: 520  VEDLKRQKENFEREWEELDEKRLEIEKELKNISQQTEKFEKQKLAEEERLKNEKQVAEDY 579

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            I+REL+A+ + KE+FA TM++E+S ++EKA+ E  Q L + E ++R LE++MQ + E+ME
Sbjct: 580  IKRELDALEVAKETFAATMEHEQSVIAEKAESERSQRLHDLELQKRKLESDMQNRFEEME 639

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K+  E +++F+EE+++  D I+ L+E  ++E+E +  ER +++KE+Q++ +++ HLE  Q
Sbjct: 640  KELGESKKSFEEEKERELDKINHLREVARRELEELKQERLKIEKEEQEVNASKMHLEGQQ 699

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQ-LA 1447
            +E++ DI +L  +SKKLKDQREHF+ +R RF+SFVE+ K+CKNCGE+   ++LSDLQ L 
Sbjct: 700  IEIRKDIDDLVDISKKLKDQREHFIKERNRFISFVEKHKSCKNCGEMTSEFMLSDLQSLQ 759

Query: 1446 EMEHKETSPFPTLGDELLE-------KASSYGTDMNRSPAEKKSSDSGGRFSWLQKCTSK 1288
            ++E +E  P P+L D+ +          S    D    P    S  SGG  SWL+KCTSK
Sbjct: 760  KIEDEEVLPLPSLADDYISGNAFRNLAVSKRQKDEISPPVGSGSPVSGGTMSWLRKCTSK 819

Query: 1287 FRKFSPSAKHFVPENLEPALSDRLVVAA-------NTEGPST---------AATADKGKA 1156
              K SP       +N+EP    +L V A       N EG S          AA  +    
Sbjct: 820  IFKLSPG------KNIEPHAVTKLNVEAPLSGGQVNMEGMSNVEHEPELSIAAATESLDV 873

Query: 1155 NILGAEPSCEICILG-----DNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRT 991
            + + ++ S      G     DNQSN + +  +V+ D             RK  +  ++RT
Sbjct: 874  HRVQSDTSTRDVDAGQDLSIDNQSNIDSKELEVLGD-SQNSDFNRGNQLRKRGRPRVKRT 932

Query: 990  RSVKAVVEDAAVILGKTPGEPILNQQRQKDASDDEG-----SRGESSLLD----KIPRKR 838
            RSVKAVV+DA  I+GK       N+    + + D G     SR ES L D    +  RKR
Sbjct: 933  RSVKAVVKDAEAIIGKALES---NELEHPNGNLDSGHANAESRDESGLFDGGTSRNARKR 989

Query: 837  TWAQTSRKVGSEQDGHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTA 658
              AQTS+K  SEQDG DS G S+S+ AG  RKRRQ V  A   P E RYNLR  K G T 
Sbjct: 990  NRAQTSQKTESEQDGVDS-GHSDSIVAGQQRKRRQKVVLAMPTPGEARYNLRRPKTGVTV 1048

Query: 657  VAAGTSVDKNGKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQN--TSYKTV 484
                + V++         N G   A   +  + AP+           PV +N   S    
Sbjct: 1049 AKTTSDVNRE--------NEGAKDAGDQVNYSKAPM-----------PVSENGDASENGG 1089

Query: 483  ESREFDKFRTQQNGAEDNAIAAENQTMD--FSEEANGTPDFNEE-----EHGSTLNSXXX 325
             +    +  T ++  + +A A +    D   SEE N  P+   E     ++ S   S   
Sbjct: 1090 SAHFLQQCETARDTNDGDADATKKLAADAALSEEVNTAPEGVGEYGDGNDYRSDSRSEGL 1149

Query: 324  XXXXXXXXXDVHPGEVSVSRKLWNFFTS 241
                     + HPGEVS+ +KLWNFFT+
Sbjct: 1150 KDEDEDEDDEEHPGEVSMGKKLWNFFTT 1177


>gb|EOY02173.1| Nuclear matrix constituent protein-related, putative isoform 3
            [Theobroma cacao]
          Length = 1080

 Score =  411 bits (1057), Expect = e-112
 Identities = 237/575 (41%), Positives = 347/575 (60%), Gaps = 18/575 (3%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+  + SQ+E+ +RE ++KL + E+ER+EH RLQ EL+++I+  R Q       
Sbjct: 496  LKDEIDKIGAETSQQELRIREESQKLKITEEERSEHIRLQSELKQQIDSCRHQEELLLKE 555

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R  FE+EWE LDEKRA    + ++I EEK  FEK R SE+++LKKE+ A  +Y
Sbjct: 556  HEDLKQQRENFEKEWEVLDEKRAEITMQRKEIVEEKDKFEKFRHSEEERLKKEESAMRDY 615

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            + RE+E+IRL+KESF  +MK+EKS L E+AQ EH ++LQ+FE ++ +LET++Q + +  +
Sbjct: 616  VCREMESIRLQKESFEASMKHEKSVLLEEAQNEHIKMLQDFELQKMNLETDLQNRFDQKQ 675

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            KD QE+  AF+E +++   N+   KE+V++EME + + R  +++EKQ++  NR  L E Q
Sbjct: 676  KDLQERIVAFEEVKERELANMRCSKEDVEREMEEIRSARLAVEREKQEVAINRDKLNEQQ 735

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
             EM+ DI ELGILS +LKDQREHF+ +R  FL FVE+LK+CK CGEI R +VLS+ QL +
Sbjct: 736  QEMRKDIDELGILSSRLKDQREHFIRERHSFLEFVEKLKSCKTCGEITRDFVLSNFQLPD 795

Query: 1443 MEHKETSPFPTLGDELLEKASSY-----GTDMNRSP-AEKKSSDSGGRFSWLQKCTSKFR 1282
            +E +E  P P L DEL+     Y       ++ RSP A  +  +S GR SWL+KCT+K  
Sbjct: 796  VEDREIVPLPRLADELIRNHQGYLGASGVKNIKRSPEAYSQYPESAGRMSWLRKCTTKIF 855

Query: 1281 KFSPSAKHFVPENLEPALSDRLV---VAANTEGPSTAATADKGKANILGAEPSCEICILG 1111
              SP+ ++         L+++     +      PS     D     +L ++   ++    
Sbjct: 856  SISPTKRNESKAEGPGELTNKEAGGNIHEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS 915

Query: 1110 D---NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVILGKT 940
                + S  + ++Q+V ED              +  + G+ RTRSVKAVVEDA + LG++
Sbjct: 916  GPSLDHSYTDSKVQEVPEDSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGES 975

Query: 939  PGEPILNQQRQKD--ASDDEGSRGESSLLD----KIPRKRTWAQTSRKVGSEQDGHDSEG 778
            P EP  ++  Q D  +  +E S G S+  +       RKR   Q S+   +E D  DSEG
Sbjct: 976  PEEPEPSESVQPDDISHANEVSAGVSTHSENRARNNARKRRRPQDSKITDTELDAADSEG 1035

Query: 777  RSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHK 673
            RS+SVT GG RKR+QT A   Q P EKRYNLR  K
Sbjct: 1036 RSDSVTTGGQRKRQQTAAQGLQTPGEKRYNLRRPK 1070


>gb|EOY02172.1| Nuclear matrix constituent protein-related, putative isoform 2
            [Theobroma cacao]
          Length = 1079

 Score =  411 bits (1057), Expect = e-112
 Identities = 237/575 (41%), Positives = 347/575 (60%), Gaps = 18/575 (3%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+  + SQ+E+ +RE ++KL + E+ER+EH RLQ EL+++I+  R Q       
Sbjct: 496  LKDEIDKIGAETSQQELRIREESQKLKITEEERSEHIRLQSELKQQIDSCRHQEELLLKE 555

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ R  FE+EWE LDEKRA    + ++I EEK  FEK R SE+++LKKE+ A  +Y
Sbjct: 556  HEDLKQQRENFEKEWEVLDEKRAEITMQRKEIVEEKDKFEKFRHSEEERLKKEESAMRDY 615

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            + RE+E+IRL+KESF  +MK+EKS L E+AQ EH ++LQ+FE ++ +LET++Q + +  +
Sbjct: 616  VCREMESIRLQKESFEASMKHEKSVLLEEAQNEHIKMLQDFELQKMNLETDLQNRFDQKQ 675

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            KD QE+  AF+E +++   N+   KE+V++EME + + R  +++EKQ++  NR  L E Q
Sbjct: 676  KDLQERIVAFEEVKERELANMRCSKEDVEREMEEIRSARLAVEREKQEVAINRDKLNEQQ 735

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
             EM+ DI ELGILS +LKDQREHF+ +R  FL FVE+LK+CK CGEI R +VLS+ QL +
Sbjct: 736  QEMRKDIDELGILSSRLKDQREHFIRERHSFLEFVEKLKSCKTCGEITRDFVLSNFQLPD 795

Query: 1443 MEHKETSPFPTLGDELLEKASSY-----GTDMNRSP-AEKKSSDSGGRFSWLQKCTSKFR 1282
            +E +E  P P L DEL+     Y       ++ RSP A  +  +S GR SWL+KCT+K  
Sbjct: 796  VEDREIVPLPRLADELIRNHQGYLGASGVKNIKRSPEAYSQYPESAGRMSWLRKCTTKIF 855

Query: 1281 KFSPSAKHFVPENLEPALSDRLV---VAANTEGPSTAATADKGKANILGAEPSCEICILG 1111
              SP+ ++         L+++     +      PS     D     +L ++   ++    
Sbjct: 856  SISPTKRNESKAEGPGELTNKEAGGNIHEKAGEPSLRIPGDSINNQLLQSDKIGKVDDRS 915

Query: 1110 D---NQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVILGKT 940
                + S  + ++Q+V ED              +  + G+ RTRSVKAVVEDA + LG++
Sbjct: 916  GPSLDHSYTDSKVQEVPEDSQQSERKSGRRKPGRKPKSGLNRTRSVKAVVEDAKLFLGES 975

Query: 939  PGEPILNQQRQKD--ASDDEGSRGESSLLD----KIPRKRTWAQTSRKVGSEQDGHDSEG 778
            P EP  ++  Q D  +  +E S G S+  +       RKR   Q S+   +E D  DSEG
Sbjct: 976  PEEPEPSESVQPDDISHANEVSAGVSTHSENRARNNARKRRRPQDSKITDTELDAADSEG 1035

Query: 777  RSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHK 673
            RS+SVT GG RKR+QT A   Q P EKRYNLR  K
Sbjct: 1036 RSDSVTTGGQRKRQQTAAQGLQTPGEKRYNLRRPK 1070


>ref|XP_002524388.1| ATP binding protein, putative [Ricinus communis]
            gi|223536349|gb|EEF37999.1| ATP binding protein, putative
            [Ricinus communis]
          Length = 1172

 Score =  404 bits (1037), Expect = e-109
 Identities = 268/722 (37%), Positives = 390/722 (54%), Gaps = 21/722 (2%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK + EK++ ++S +E  + E +E L L  DER EH RLQ EL++E+EK R Q       
Sbjct: 491  LKDDCEKIRSEISNQEQQIGEKSENLKLTNDERLEHLRLQAELKQELEKCRHQEEYILKE 550

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                K++R  FE+E E L+EKRA  ++EL +I EE++ F++L+ + +++LKKE+ A +EY
Sbjct: 551  AEELKEERKNFEKELEVLEEKRAQLSKELNEITEEREKFKQLQYTMEERLKKEENAMKEY 610

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
             Q+ELE +R+EKE F +  + E+  +S++A+ EHDQ++Q+FE++R   E ++  ++E+ME
Sbjct: 611  TQKELETVRVEKEYFEMRKRNEQQVISKQAKTEHDQMVQDFESQRSTFEADLVSRREEME 670

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K  +E+ERAFQ +RD+    I+  KE  QKE+E +  ER  ++KEKQ++  N++ L+  Q
Sbjct: 671  KGLRERERAFQLQRDRELKEINYSKEAAQKELEEIRIERHVIEKEKQEVAKNKEELDGQQ 730

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
              M+ DI EL +LS KL+DQRE  + +R  FL+FVE+ K+CKNCG++   ++LSDL   +
Sbjct: 731  FGMRKDIDELVMLSNKLRDQREQVIRERNHFLAFVEKHKSCKNCGDVTAEFILSDLLPPD 790

Query: 1443 MEHKETSPFPTLGDELLEKASSYGTDMNRSPAEKKSSDSGGRFSWLQKCTSKFRKFSPSA 1264
            ME ++        DEL +   S G    +    +   +S    SW +KCTSK    SP  
Sbjct: 791  MEDRKILLLQERADELRDVQDSPGALNVKKSQGELDLNSQECVSWFRKCTSKIFSISPKK 850

Query: 1263 KHFVPENLEPAL----SDRLVVAANTEGPSTAATADKGKANILGAEPSCEICIL------ 1114
               + + L P L    +D L   A  E        D+ + +      S EI  L      
Sbjct: 851  ---IEQVLAPVLAEEKTDALGTLARKEASRNGVPGDESRPSFGTTHDSVEIQQLQFDSIK 907

Query: 1113 --GDNQS---NGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAVIL 949
              GD  S   + +  +   VED              K  +GG+ RTRSVKAVVEDA + L
Sbjct: 908  VEGDGNSISFDDHSNVDSKVEDSGPSKLKSSQRKPGKRRKGGLNRTRSVKAVVEDAKLFL 967

Query: 948  GKTPGEPILNQQRQKDASDDEGSRGESS----LLDKIPRKRTWAQTSRKVGSEQDGHDSE 781
            GK+  EP       +  SD+  SRG S+    L   IPRKR          SEQ+  DSE
Sbjct: 968  GKSAEEP-------EYISDE--SRGISTHTEKLASNIPRKRERTPAE----SEQNAGDSE 1014

Query: 780  GRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKNGKETTEGGN 601
            G S+SVT GG RKRRQ V P    P +KRYNLR HK      A   SV    KE+  G  
Sbjct: 1015 GFSDSVTTGGRRKRRQMVVPT-ITPGQKRYNLRRHK---VDQALSGSVKTGEKESDGGDA 1070

Query: 600  SGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVESREFDKFRTQQNGAEDNAIA 421
            + P   P   E  SA  L  ASE              T +S +  KF T+     D A A
Sbjct: 1071 AEPIPKP---ETVSALSLGVASE--------------TEKSTDLVKFSTE--NVNDQADA 1111

Query: 420  AEN-QTMDFSEEANGTPDFN-EEEHGSTLNSXXXXXXXXXXXXDVHPGEVSVSRKLWNFF 247
             ++ +  + SEE N T ++  E+E+GST++             + HPGEVS+ +K+W FF
Sbjct: 1112 TKSVEITELSEEVNDTSEYGVEDENGSTIHEDTQEDCDDDDESE-HPGEVSIGKKIWTFF 1170

Query: 246  TS 241
            T+
Sbjct: 1171 TT 1172


>dbj|BAA20407.1| nuclear matrix constituent protein 1 [Daucus carota]
          Length = 1119

 Score =  402 bits (1034), Expect = e-109
 Identities = 274/731 (37%), Positives = 393/731 (53%), Gaps = 30/731 (4%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E+EK +    ++ + L E  E+L + E+ER E  RLQ EL++EIE  R Q       
Sbjct: 408  LKAEIEKDRASTEEQRLKLSEEIERLKITEEERLELARLQSELKQEIENCRHQRELLLKE 467

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ++M+FE+EWE LDE+R A  ++L+ I  +K+ FEKL+ SE+D+L  +KL TE Y
Sbjct: 468  EDELKQEKMRFEKEWEDLDERRTALMKDLKDITVQKENFEKLKHSEEDRLNNKKLDTESY 527

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            +Q+EL+A+RL K+SFA TM++EK+ L+E+   E  Q+L +FE  +R+LET++  ++EDME
Sbjct: 528  VQKELDALRLTKDSFAATMEHEKAVLAERTSSEKKQMLNDFELWKRELETKLFNEREDME 587

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
               + +E+ F EER+K  +NI+ +KE + KE E +  ERSR+ KEKQ+I  ++KHL+E  
Sbjct: 588  NALRLREKQFDEEREKELNNINYIKEVISKEREDIKLERSRIAKEKQEILMHQKHLDEQH 647

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQ-LA 1447
            + MQ DI +L  LS+KLKDQRE F  +R  F+ FVE  K+CKNCGE+   +V+SDLQ LA
Sbjct: 648  VVMQKDIGQLVSLSEKLKDQREQFFKERECFIRFVESQKSCKNCGEMTSEFVVSDLQSLA 707

Query: 1446 EMEHKETSPFPTLGDELLEKASSYGTDMNRSPAEK-----KSSDSGGRFSWLQKCTSKFR 1282
            E+E+ +    P L +  L +      D N S          S  SGG  SWLQKCTSK  
Sbjct: 708  ELENLKALSVPQLAENYLRQDLQGTPDKNLSTVTPGAVGLGSPASGGTKSWLQKCTSKIF 767

Query: 1281 KFSPSAKHFVPE-------NLEPALSDRLVVAANTEGPSTAATADKGKANI----LGAEP 1135
             FS S K+  P+       ++E + +  L      E PS  A       N+       E 
Sbjct: 768  IFSASKKNNSPDQNTSRRLHVEASPNKLLNTEVIPELPSGVAGETLEMQNMQVSNSNREM 827

Query: 1134 SCEICILGDNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAV 955
               + + G  QSN + +    VED              K  +G +RR RS K V E+A  
Sbjct: 828  ESNLNLSGTEQSNIDSKALD-VEDSQQSDVRAGNRKPGKRAKGRVRRKRSAKEVAEEAKT 886

Query: 954  ILGKTPGEPI-LNQQRQKD---ASDDEGSRGESSLLDK---IPRKRTWAQTSRKVGSEQD 796
            +L     +PI LN+    +   ++    SRG+SSL+ K     RKR  +Q S+    +  
Sbjct: 887  VL----ADPIELNENEHSNGLASAYTNESRGDSSLVGKRTRNSRKRNPSQPSQSAAGDV- 941

Query: 795  GHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKN---G 625
            G DSEG S+SVTAGG +KRR+ V PA Q P   RYNLR HK     VA G   D N    
Sbjct: 942  GADSEGHSDSVTAGGRQKRRRKVVPAVQAPT-GRYNLRRHKTAAPLVANGALSDPNKGKE 1000

Query: 624  KETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVD-QNTSYKTVESREFDKFRTQQ 448
            KE  +GG  G    P  ++  +  V  T  + R     +  +  +  + +       T +
Sbjct: 1001 KEIDDGGGIGEE-IPDEVDGNTHLVQVTTLKKRINVVNEFSSAGFHGINA-------TSE 1052

Query: 447  NGAEDNAIAAENQTMDFSEEANGTPDFNE--EEHGSTLNSXXXXXXXXXXXXDVHPGEVS 274
            +   D A    + TM  SEE NGTP+ +   +  G T  +              HPGEVS
Sbjct: 1053 SQDRDAANQLVSDTM-LSEEVNGTPEQSRGYQNQGDTSGAEGEDEDGDEVE---HPGEVS 1108

Query: 273  VSRKLWNFFTS 241
            + +K+W F T+
Sbjct: 1109 MRKKVWKFLTT 1119


>dbj|BAF64424.1| nuclear matrix constituent protein 1-like [Petroselinum crispum]
          Length = 1119

 Score =  402 bits (1033), Expect = e-109
 Identities = 275/731 (37%), Positives = 392/731 (53%), Gaps = 30/731 (4%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E+EK +    ++ + L E  E+L + E+ER E  RLQ EL++EIE  R Q       
Sbjct: 408  LKAEIEKGRASTEEQRLKLSEEIERLKITEEERLELARLQSELKQEIENCRHQRELLLKE 467

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ++M+FE+EWE LDE+R A  ++L+ I  +K+ FEKL+ SE+D+L  +KL TE Y
Sbjct: 468  EDELKQEKMRFEKEWEDLDERRTALMKDLKDITVQKENFEKLKHSEEDRLNNKKLDTESY 527

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            +Q+EL+A+RL K+SFA TM++EK+ L+E+   E  Q+L +FE  +R+LET++  ++EDME
Sbjct: 528  VQKELDALRLTKDSFAATMEHEKAVLAERTSSEKKQMLNDFELWKRELETKLFNEREDME 587

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
               + +E+ F EER+K  +NI+ +KE   KE E +  ERSR+ KEKQ+I  ++KHL+E  
Sbjct: 588  NALRLREKQFDEEREKELNNINYIKEVFSKEREDIKLERSRIAKEKQEILMHQKHLDEQH 647

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQ-LA 1447
            + MQ DI +L  LS+KLKDQRE F  +R  F+ FVE  K+CKNCGE+   +V+SDLQ LA
Sbjct: 648  VVMQKDIGQLVSLSEKLKDQREQFFKERECFIRFVESQKSCKNCGEMTSEFVVSDLQSLA 707

Query: 1446 EMEHKETSPFPTLGDELLEKASSYGTDMNRSPAEK-----KSSDSGGRFSWLQKCTSKFR 1282
            E+E+ +    P L +  L +      D N S          S  SGG  SWLQKCTSK  
Sbjct: 708  ELENLKALSVPQLAENYLRQDLQGTPDKNLSTVTPGAVGLGSPASGGTKSWLQKCTSKIF 767

Query: 1281 KFSPSAKHFVPE-------NLEPALSDRLVVAANTEGPSTAATADKGKANI----LGAEP 1135
             FS S K+  P+       ++E + +  L      E PS  A       N+       E 
Sbjct: 768  IFSASKKNNSPDQNTSRRLHVEASPNKLLNTEVIPELPSGVAGETLEMQNMQVSNSNREM 827

Query: 1134 SCEICILGDNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAV 955
               + + G  QSN + +    VED              K  +G +RR RS K V E+A  
Sbjct: 828  ESNLNLSGTEQSNIDSKALD-VEDSQQSDVRAGNRKPGKRAKGRVRRKRSAKEVAEEAKT 886

Query: 954  ILGKTPGEPI-LNQQRQKD---ASDDEGSRGESSLLDK---IPRKRTWAQTSRKVGSEQD 796
            +L     +PI LN+    +   ++    SRG+SSL+ K     RKR  +Q S+    E  
Sbjct: 887  VL----ADPIELNENEHSNGLASAYTNESRGDSSLVGKRTRNSRKRNPSQPSQSAAGEV- 941

Query: 795  GHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKN---G 625
            G DSEG S+SVTAGG +KRR+ V PA Q P   RYNLR HK     VA G   D N    
Sbjct: 942  GADSEGHSDSVTAGGRQKRRRKVVPAVQAPT-GRYNLRRHKTAAPLVANGALSDPNKGKE 1000

Query: 624  KETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVD-QNTSYKTVESREFDKFRTQQ 448
            KE  +GG  G    P  ++  +  V  T  + R     +  +  +  + +       T +
Sbjct: 1001 KEIDDGGGIGEE-IPDEVDGNTHLVQVTTLKKRINVVNEFSSAGFHGINA-------TSE 1052

Query: 447  NGAEDNAIAAENQTMDFSEEANGTPDFNE--EEHGSTLNSXXXXXXXXXXXXDVHPGEVS 274
            +   D A    + TM  SEE NGTP+ +   +  G T  +              HPGEVS
Sbjct: 1053 SQDRDAANQLVSDTM-LSEEVNGTPEQSRGYQNQGDTSGAEGEDEDGDEVE---HPGEVS 1108

Query: 273  VSRKLWNFFTS 241
            + +K+W F T+
Sbjct: 1109 MRKKVWKFLTT 1119


>dbj|BAF64421.1| nuclear matrix constituent protein 1-like [Apium graveolens]
          Length = 1119

 Score =  401 bits (1030), Expect = e-109
 Identities = 274/731 (37%), Positives = 392/731 (53%), Gaps = 30/731 (4%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E+EK +    ++ + L E  E+L + E+ER E  RLQ EL++EIE  R Q       
Sbjct: 408  LKAEIEKARASTEEQRLKLSEEIERLKITEEERLELARLQSELKQEIENCRHQRELLLKE 467

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ++M+FE+EWE LDE+R A  ++L+ I  +K+ FEKL+ SE+D+L  +KL TE Y
Sbjct: 468  EDELKQEKMRFEKEWEDLDERRTALMKDLKDITVQKENFEKLKHSEEDRLNNKKLDTESY 527

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            +Q+EL+A+RL K+SFA TM++EK+ L+E+   E  Q+L +FE  +R+LET++  ++EDME
Sbjct: 528  VQKELDALRLTKDSFAATMEHEKAVLAERTSSEKKQMLNDFELWKRELETKLFNEREDME 587

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
               + +E+ F EER+K  +NI+ LKE + KE E +  ERSR+ KEKQ+I  ++KHL+E  
Sbjct: 588  NALRLREKQFDEEREKELNNINYLKEVISKEREDIKLERSRIAKEKQEILMHQKHLDEQH 647

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQ-LA 1447
            + MQ DI +L  LS+KLKDQRE F  +R  F+ FVE  K+CKNCGE+   +V+SDLQ LA
Sbjct: 648  VVMQKDIGQLVSLSEKLKDQREQFFKERECFIRFVESQKSCKNCGEMTSEFVVSDLQSLA 707

Query: 1446 EMEHKETSPFPTLGDELLEKASSYGTDMNRSPAEK-----KSSDSGGRFSWLQKCTSKFR 1282
            E+E+ +    P L +  L +      D N S          S  SGG  SWLQKCTSK  
Sbjct: 708  ELENLKALSVPQLAENYLRQDLQGTPDKNLSTVTPGAVGLGSPASGGTKSWLQKCTSKIF 767

Query: 1281 KFSPSAKHFVPE-------NLEPALSDRLVVAANTEGPSTAATADKGKANI----LGAEP 1135
             FS S K+  P+       ++E + +  L      E PS  A       N+       E 
Sbjct: 768  IFSASKKNNSPDQNTSRRLHVEASPNKLLNTEVIPELPSGVAGETLEMQNMQVSNSNREM 827

Query: 1134 SCEICILGDNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAV 955
               + + G  QSN + +    VED              K  +G +RR RS K V E+A  
Sbjct: 828  ESNLNLSGTEQSNIDSKALD-VEDSQQSDVRAGNRKPGKRAKGRVRRKRSAKEVAEEAKT 886

Query: 954  ILGKTPGEPI-LNQQRQKD---ASDDEGSRGESSLLDK---IPRKRTWAQTSRKVGSEQD 796
            +L     +PI LN+    +   ++    SRG+SSL+ K     RKR  +Q  +    +  
Sbjct: 887  VL----ADPIELNENEHSNGLASAYTNESRGDSSLVGKRTRNSRKRNPSQPFQSAAGDV- 941

Query: 795  GHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKN---G 625
            G DSEG S+SVTAGG +KRR+ V PA Q P   RYNLR HK     VA G   D N    
Sbjct: 942  GADSEGHSDSVTAGGPQKRRRKVVPAVQAPT-GRYNLRRHKTAAPLVANGALSDPNKGKE 1000

Query: 624  KETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVD-QNTSYKTVESREFDKFRTQQ 448
            KE  +GG  G    P  ++  +  V  T  + R     +  +  +  + +       T +
Sbjct: 1001 KEIDDGGGIGEE-IPDEVDGNTHLVQVTTLKKRINVVNEFSSAGFHGINA-------TSE 1052

Query: 447  NGAEDNAIAAENQTMDFSEEANGTPDFNE--EEHGSTLNSXXXXXXXXXXXXDVHPGEVS 274
            +   D A    + TM  SEE NGTP+ +   +  G T  +              HPGEVS
Sbjct: 1053 SQDRDAANQLVSDTM-LSEEVNGTPEQSRGYQNQGDTSGAEGEDEDGDEVE---HPGEVS 1108

Query: 273  VSRKLWNFFTS 241
            + +K+W F T+
Sbjct: 1109 MRKKVWKFLTT 1119


>ref|XP_004297151.1| PREDICTED: putative nuclear matrix constituent protein 1-like
            protein-like [Fragaria vesca subsp. vesca]
          Length = 1148

 Score =  400 bits (1027), Expect = e-108
 Identities = 275/730 (37%), Positives = 394/730 (53%), Gaps = 29/730 (3%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E++K+K +  Q E  +REG EK  + E E+++H RLQ EL++EI   R Q       
Sbjct: 479  LKDEIQKIKDENVQLEQQIREGREKHAITEKEKSDHLRLQSELQQEINNYRLQNELLLKE 538

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ+R KFE+EWE LDE+RA    EL+++ EEK+  E+L+  E ++LK+E+ A E+Y
Sbjct: 539  AEDLKQEREKFEKEWEDLDERRAKVDGELRKVVEEKEQLERLQCIEAERLKEERKAVEDY 598

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
             QRE+E ++ E+ESF   M   + ALSEKAQ EH Q++Q+FE+RRRDLET+MQK+Q+ M 
Sbjct: 599  RQREIENLKQERESFTAKMTNGQIALSEKAQSEHAQMVQDFESRRRDLETDMQKRQDKMV 658

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K  QE+E AF+EE+D+   NI+ LK    K+ E + +ER+  +KE++ +   +K LE  Q
Sbjct: 659  KQLQERETAFEEEKDREYTNINFLKGVADKQREELLSERNTNEKEREALALQKKELEANQ 718

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            LEM+ DI +L  LSKK+K QRE  + +RGRFL+FVER+K+CK+CGEI R +VLSDLQ+  
Sbjct: 719  LEMREDIDQLDKLSKKIKCQREQLIEERGRFLAFVERVKSCKDCGEITREFVLSDLQVPG 778

Query: 1443 M---------EHKETSPFPTLGDELLEKAS------------SYGTDMNRSPAEKKSSDS 1327
            M         EHKE+      G++L +K                 T++ R PA +K  + 
Sbjct: 779  MYNVEAVPNSEHKESG----WGEKLQQKCKLVVSKVTSNKKLDVSTELPRPPAMQKGKE- 833

Query: 1326 GGRFSWLQKCTSKFRKFSPSAKHFVPENLEPALSDRLVVAANTEGPSTAATADKGKANIL 1147
                        K      +  H   EN EP  S R     N    + AA AD     + 
Sbjct: 834  -----------PKLLASEEARGHSSHEN-EPQPSLR---RCNDSANAEAAVADNNCKAVD 878

Query: 1146 GAEPSCEICILGDNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVE 967
            G  PS       D+ S  + + Q + ED              +  +  + RT SVKAVVE
Sbjct: 879  GYAPSI------DDYSFISSQEQDIPEDSEQSELKSGRRKPARGRKSRLSRTHSVKAVVE 932

Query: 966  DAAVILGKTPGEPILNQQRQKDASDDEGSRGESSLLDKIPRKRTWAQTSRKVGSEQDGHD 787
            DA   LG+TP EP       + +  +EG    +S+  K PR R     S +V SEQD  D
Sbjct: 933  DAKKFLGETP-EPSNASLLNESSYINEGDSSFTSIGRKRPRPR-----SSRVESEQDDCD 986

Query: 786  SEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKNGKETTEG 607
            SEGRS SVTAGGHRKRRQ VA A Q P  +RYNLR+ K  GT  AA  +     +   E 
Sbjct: 987  SEGRSGSVTAGGHRKRRQPVASAVQTPGGQRYNLRNRKTAGTLAAASAAPHLKSRRKEE- 1045

Query: 606  GNSGPAGAPQNIEVTSAPVLETASEN--RTQTPVDQNT------SYKTVESREFDKFRTQ 451
             +   +   + I+VT+   +E+  E   R  TP  ++T      + K VE  E     T+
Sbjct: 1046 -SKPESVGAELIQVTTLKPVESTEERVVRFATPEPRDTVNGKADATKLVEEAELS---TE 1101

Query: 450  QNGAEDNAIAAENQTMDFSEEANGTPDFNEEEHGSTLNSXXXXXXXXXXXXDVHPGEVSV 271
             NG E ++ +   ++ D S + +G  D+++E+                     HPG+VS+
Sbjct: 1102 LNGTE-SSHSTGGESGDSSGDESG-DDYDDED---------------------HPGQVSI 1138

Query: 270  SRKLWNFFTS 241
             +K+W FF++
Sbjct: 1139 GKKIWTFFST 1148


>dbj|BAF64423.1| nuclear matrix constituent protein 1-like [Foeniculum vulgare]
          Length = 1119

 Score =  398 bits (1023), Expect = e-108
 Identities = 272/731 (37%), Positives = 393/731 (53%), Gaps = 30/731 (4%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            LK E+EK +    ++ + L E  E+L + E+ER E  RLQ EL++EIE  R Q       
Sbjct: 408  LKAEIEKDRASTEEQRLKLSEEIERLKITEEERLELARLQSELKQEIENCRHQRELLLKE 467

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                KQ++M+FE+EWE LDE+R A  ++L+ I  +K+ FEKL+ SE+D+L  +KL TE Y
Sbjct: 468  EDELKQEKMRFEKEWEDLDERRTALMKDLKDITVQKENFEKLKHSEEDRLNNKKLDTESY 527

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
            +Q+EL+A+RL K+SFA TM++EK+ L+E+   E  Q+L +FE  +R+LET++  ++EDME
Sbjct: 528  VQKELDALRLTKDSFAATMEHEKAVLAERTSSEKKQMLNDFELWKRELETKLFNEREDME 587

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
               + +E+ F EER+K  + I+ +KE + KE E +  ERSR+ KEKQ+I  ++KHL+E  
Sbjct: 588  NALRLREKQFDEEREKELNTINYIKEVISKEREDIKLERSRIAKEKQEILMHQKHLDEQH 647

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQ-LA 1447
            + MQ DI +L  LS+KLKDQRE F  +R  F+ FVE  K+CKNCGE+   +V+SDLQ LA
Sbjct: 648  VVMQKDIGQLVSLSEKLKDQREQFFKERECFIRFVESQKSCKNCGEMTSEFVVSDLQSLA 707

Query: 1446 EMEHKETSPFPTLGDELLEKASSYGTDMNRSPAEK-----KSSDSGGRFSWLQKCTSKFR 1282
            E+E+ +    P L +  L +      D N S          S  SGG  SWLQKCTSK  
Sbjct: 708  ELENLKALSVPQLAENYLRQDLQGTPDKNLSTVTPGAVGLGSPASGGTKSWLQKCTSKIF 767

Query: 1281 KFSPSAKHFVPE-------NLEPALSDRLVVAANTEGPSTAATADKGKANI----LGAEP 1135
             FS S K+  P+       ++E + +  L      E PS  A  +    N+       E 
Sbjct: 768  IFSASKKNNSPDQNTSRRLHVEASPNKLLNTEVIPELPSGVAGENLEMQNMQVSNSNREM 827

Query: 1134 SCEICILGDNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRTRSVKAVVEDAAV 955
               + + G  QSN + +    VED              K  +G +RR RS K V E+A  
Sbjct: 828  ESNLNLSGTEQSNIDSKALD-VEDSQQSDVRAGNRKPGKRAKGRVRRKRSAKEVAEEAKT 886

Query: 954  ILGKTPGEPI-LNQQRQKD---ASDDEGSRGESSLLDK---IPRKRTWAQTSRKVGSEQD 796
            +L     +PI LN+    +   ++    SRG+SSL+ K     RKR  +Q S+    +  
Sbjct: 887  VL----ADPIELNENEHSNGLASAYTNESRGDSSLVGKRTRNSRKRNPSQPSQSAAGDV- 941

Query: 795  GHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAAGTSVDKN---G 625
            G +SEG S+SVTAGG +KRR+ V PA Q P   RYNLR HK     VA G   D N    
Sbjct: 942  GANSEGHSDSVTAGGPQKRRRKVVPAVQAPT-GRYNLRRHKTAAPLVANGALSDPNKGKE 1000

Query: 624  KETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVD-QNTSYKTVESREFDKFRTQQ 448
            KE  +GG  G    P  ++  +  V  T  + R     +  +  +  + +       T +
Sbjct: 1001 KEIDDGGGIGEE-IPDEVDGNTHLVQVTTLKKRINVVNEFSSAGFHGINA-------TSE 1052

Query: 447  NGAEDNAIAAENQTMDFSEEANGTPDFNE--EEHGSTLNSXXXXXXXXXXXXDVHPGEVS 274
            +   D A    + TM  SEE NGTP+ +   +  G T  +              HPGEVS
Sbjct: 1053 SQDRDAANQLVSDTM-LSEEVNGTPEQSRGYQNQGDTSGAEGEDEDGDEVE---HPGEVS 1108

Query: 273  VSRKLWNFFTS 241
            + +K+W F T+
Sbjct: 1109 MRKKVWKFLTT 1119


>gb|EXB53970.1| hypothetical protein L484_022938 [Morus notabilis]
          Length = 1663

 Score =  397 bits (1021), Expect = e-108
 Identities = 259/705 (36%), Positives = 387/705 (54%), Gaps = 38/705 (5%)
 Frame = -3

Query: 2343 LKGELEKMKLDVSQKEMLLREGNEKLILNEDERTEHTRLQKELREEIEKSRAQXXXXXXX 2164
            L  E+EK+K +  Q E+ +RE +E   +   ER+EH RLQ EL++EIEK R Q       
Sbjct: 503  LLAEVEKIKAENIQLELQIREESESKRITNKERSEHVRLQLELKQEIEKYRGQSELLSIE 562

Query: 2163 XXXXKQDRMKFEEEWETLDEKRAAFARELQQIEEEKQMFEKLRRSEDDKLKKEKLATEEY 1984
                K+++  FE+EWE LD+KR+  ++EL+++ EEK+  EKLR  E+ +LK+EK A  E+
Sbjct: 563  AKELKEEKENFEQEWEDLDKKRSVISKELRELAEEKEKLEKLRHLEEHRLKEEKHAVHEF 622

Query: 1983 IQRELEAIRLEKESFAVTMKYEKSALSEKAQGEHDQLLQEFEARRRDLETEMQKKQEDME 1804
             QRELE ++ EK+S A  M+ E+  LSEKAQ EH Q++Q+FE RRR+LE+E+Q ++E+ME
Sbjct: 623  RQRELEDLKREKDSLAAKMEMEQLTLSEKAQLEHSQMIQDFELRRRNLESEIQNQREEME 682

Query: 1803 KDAQEKERAFQEERDKADDNISRLKEEVQKEMEIVSAERSRLQKEKQDITSNRKHLEEYQ 1624
            K   E+ERAF++ER++  +NI  LK    KE E +  ER R++K+++ +T N++  ++ +
Sbjct: 683  KLLYERERAFEDERERELNNIKYLKGVAHKEREELKLERHRIEKQREQLTLNKEQFKQNE 742

Query: 1623 LEMQNDIHELGILSKKLKDQREHFLHQRGRFLSFVERLKNCKNCGEIARSYVLSDLQLAE 1444
            LEMQNDI +L  LSKK+KDQRE  L  R +FL+FVE++K C++ GE+ R   +S+  + E
Sbjct: 743  LEMQNDIDQLATLSKKVKDQREELLKDRAQFLAFVEKVKTCRDGGEVERELSVSNFHVPE 802

Query: 1443 MEHKETSPFPTLGDELLEKASSYGTDMNRSPAEKKSSDSGGRFSWLQKCTSKFRKFSPS- 1267
            + H   +P PTL +E LE +       + + +   SS SGGR SWLQKCTS F K SP+ 
Sbjct: 803  VSHGNAAPLPTLHEEHLENSPD-----DLAVSNLGSSKSGGRMSWLQKCTSVF-KLSPNK 856

Query: 1266 -AKHF---VPENLEPALSDRLVVAANTEGPSTAATADKG--------------KANILGA 1141
             ++H    +P  L P+ + ++      + P+  +   +G                +++  
Sbjct: 857  ISEHVLAPIPIELPPSSAAQVKTDEKAKEPALGSDGVRGPDISEDRPPAPLRISNDVVNV 916

Query: 1140 EPSCEICILG----------DNQSNGNDRIQQVVEDXXXXXXXXXXXXXRKNTQGGIRRT 991
            +      I+G          D+ SN + +++   ED              +  + G+ RT
Sbjct: 917  QRVQVTNIVGEIHDGYAPSVDDHSNLDSKVEAAPEDSLQSESKSALRKPSRRHKSGLHRT 976

Query: 990  RSVKAVVEDAAVILGKTPGEPILNQQRQKDASD----DEGSRGESSLLDK--IPRKRTWA 829
             SV+A VEDA   LGKT  EP          SD    +E SR +S  ++K    RKR  +
Sbjct: 977  HSVQAAVEDAKAFLGKTLEEP--GSSATIPPSDSYNINEESRDDSVHIEKGNTARKRQRS 1034

Query: 828  QTSRKVGSEQDGHDSEGRSESVTAGGHRKRRQTVAPAPQNPVEKRYNLRHHKNGGTAVAA 649
            QTS    SEQD  DSE  S SVTAG  RKR+QTVA   Q P E+RYN R  K     + +
Sbjct: 1035 QTSHISESEQDVGDSEACSGSVTAGRRRKRQQTVASGLQTPGEERYNFRPRKKLCPNMIS 1094

Query: 648  GTSVD-KNGKETTEGGNSGPAGAPQNIEVTSAPVLETASENRTQTPVDQNTSYKTVESRE 472
            G   D K  +E   GG+  P  A  N E  S  + E A ++          + KTVE  E
Sbjct: 1095 GMVKDLKKTREKEAGGSRTPCVA-ANPEAVSVSLTEVAQKSPETKQTVHVITTKTVEFSE 1153

Query: 471  --FDKFRTQQNGAEDNAIAAENQTMDFSEEANGTPDFNEEEHGST 343
                +F T ++  +    A   +    S E NGT +  +E+  ++
Sbjct: 1154 NKIVRFITSEDIGDSTDAAESVENTKLSMEINGTSECGDEDENNS 1198

