BLASTX nr result

ID: Sinomenium22_contig00005639 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00005639
         (2222 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21104.3| unnamed protein product [Vitis vinifera]              462   e-127
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   375   e-101
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   375   e-101
ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma...   353   2e-94
ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [...   353   2e-94
ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma...   353   2e-94
ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma...   353   2e-94
ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putativ...   340   2e-90
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   326   3e-86
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   306   3e-80
emb|CAN76638.1| hypothetical protein VITISV_027480 [Vitis vinifera]   261   7e-67
ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812...   258   6e-66
ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812...   258   6e-66
ref|XP_006596086.1| PREDICTED: uncharacterized protein LOC100812...   258   6e-66
ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812...   258   6e-66
ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812...   258   6e-66
ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812...   258   6e-66
ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816...   251   1e-63
ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816...   247   1e-62
ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816...   247   1e-62

>emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score =  462 bits (1190), Expect = e-127
 Identities = 303/780 (38%), Positives = 410/780 (52%), Gaps = 43/780 (5%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            H+  K+K  +  + +  K Q+  + DC  SQW+DVPSK    C+  C+    +   GR+N
Sbjct: 20   HIVQKEKNISFHQNEKSKGQNHKKIDCHASQWKDVPSKVIVSCDMKCVRPSVDGLGGRKN 79

Query: 2032 VKGQPI------------ETAAVVIN-EIQEAESLNEQQMSNGFSGCSAPAITEVTGEVN 1892
             + QP             +TAA   N  +QE   L EQ+MSN  SGCSAPA+T+ + EVN
Sbjct: 80   DEDQPAMYGRKNDEDQLADTAAKRFNGNLQEINCLKEQEMSNISSGCSAPAVTQASIEVN 139

Query: 1891 NVGSCTMDAR---YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSY 1721
            N+ SCT+DA      ND V DE SGI KCWSSD+  D+ERS E +  + K    K G S 
Sbjct: 140  NMDSCTVDAGDTGCANDLVVDEASGIEKCWSSDDALDSERSAEFLGFTCKTSFIKEGSSK 199

Query: 1720 GLPALSSEGPIDDLR-SGNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKW 1544
             L   SS   ID+L+   + R K++ N S     +++  +   K+E   K  K+KK  K 
Sbjct: 200  ALANQSSRSLIDELKFRDSFRWKRVRNESHTGLAIHEKNSHSPKIERGLKTRKRKKTMKM 259

Query: 1543 KRLDATFPATGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNF 1364
            K L+A+FPA+G SS HY+     G  E                 G    CG  + G S F
Sbjct: 260  KMLNASFPASGFSSGHYEHTECAGSAEWRSFSYKDVDTLLQCELGTSHTCGACTIGPS-F 318

Query: 1363 KRXXXXXXXXXXXXXXRDRHGIHEHHGDWEDY-SRKKLKDEMLCSRN---PKLLGENRFK 1196
            KR              RD   I+      + Y ++ K K E L        K +G +R  
Sbjct: 319  KRRRSTLSSAKNFSRKRDVDKIYADREGEDGYQAQSKGKTEFLSIHEVSGAKRIGPDR-- 376

Query: 1195 WCCNTDANRHFPRRETNQVSARKAVKDDSAGSVQHSNFFINHIGMCDRQPRPVVCGNSGI 1016
                 +A R F  +E +     KAVK +S G V+ S+     + + +R+ +PVVCG  G+
Sbjct: 377  ---TAEAFRQFCMQEPSHT---KAVKYNSVGCVKESSCL--KLDVSNRREKPVVCGKYGV 428

Query: 1015 ISNGKLTSGQTKPTKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCFR--------- 863
            ISNGKL     KP KI  LS +LK ARRC +S N+EP   S+ + KK+  R         
Sbjct: 429  ISNGKLAIDVPKPAKIFSLSRVLKTARRCTLSANDEPRLTSMRQLKKARLRGSNGCVNEI 488

Query: 862  --------SEYNNTVFKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLN 707
                    +E  N     + + +  + E +K   S +  C  EL M ++++  G  +   
Sbjct: 489  SNLMKEKENEIQNATRCDERNPDNSMEEAEKAVISGDTRCADELLMSKQEKAYGSKKD-- 546

Query: 706  QDILKRFSSAPLKSKFKEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSC 527
                  + S  LK K+KE RKRSL EL+GKG   S    F++       +        S 
Sbjct: 547  ----DSYHSTRLKRKYKEIRKRSLYELTGKGKSPSSGNAFVKIPKHAPQKKSG-----SV 597

Query: 526  LLKGTDDNQCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVC 347
             L+  +D++    E  +  S+KSIK+ + ++FI D+DAFCCVCGSSN+DEINCLLECS C
Sbjct: 598  GLENAEDSKHSMSESYKVNSKKSIKEHRFESFISDTDAFCCVCGSSNKDEINCLLECSRC 657

Query: 346  SIRVHQACYGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW 167
             IRVHQACYGVS+ PKGRWYCRPC ++SKNIVCVLCGYGGGAMTRAL++ NIVKS LK W
Sbjct: 658  LIRVHQACYGVSRVPKGRWYCRPCRTSSKNIVCVLCGYGGGAMTRALRTRNIVKSLLKVW 717

Query: 166  DITEHS---SSAVLKKNWHKQTNVMKNSSGLARSE--VHNSIIAGALDPIIKQWVHMVCG 2
            +I   S   SS   +    K   +  + SGL      +HN+I AG LD  +KQWVHMVCG
Sbjct: 718  NIETESWPKSSVPPEALQDKLGTLDSSRSGLENESFPIHNTITAGILDSTVKQWVHMVCG 777


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  375 bits (962), Expect = e-101
 Identities = 275/804 (34%), Positives = 390/804 (48%), Gaps = 69/804 (8%)
 Frame = -2

Query: 2206 NLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAE-VFNGRRNV 2030
            +L++K  +  +   +  Q    N C  SQW+DVPSK KG     C++  AE + +GR N+
Sbjct: 1013 DLREKIISSDQRAKVTGQVRKSNVCHASQWKDVPSKYKGVSTVACLDLSAEDLLDGRGNI 1072

Query: 2029 KGQPIE-TAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMD---AR 1862
             GQ  + T+      ++  +SL EQ+MSN  SGCSA A+T  + + NN+ S T D   AR
Sbjct: 1073 DGQLGDATSKCSYGTMKIRDSLKEQEMSNISSGCSAAAVTHTSVQGNNLDSTTPDVGNAR 1132

Query: 1861 YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDD 1682
            Y+N H+ DEGSGI KCWSSD+  ++ERS E +  + K +++K G S  +  LSS   +D+
Sbjct: 1133 YINKHIVDEGSGIDKCWSSDDALESERSAEFLGSNCKTNLSKEGSSKNINNLSSRSLLDE 1192

Query: 1681 LRSGN-IRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLS 1505
            L+  N +  KK   ++     V+   N K K+E   K GK+K+  K K L    P  G S
Sbjct: 1193 LKLLNSLTWKKNRKQTHTRLAVHGKINFK-KIERGVKTGKKKRARKIKMLVPQCPTGGPS 1251

Query: 1504 SVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXX 1325
            +V Y  P  +G D +              N   Q  C   +       +           
Sbjct: 1252 TVPYKYP--KGTDSLPFSSEDVEMH----NPSFQETCISGACSPQPISKCGRSLSSSKEL 1305

Query: 1324 XXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNP----KLLGENRFKWCCNTDANRHFPR 1157
               RD H I++   D  DY             NP    +  G   F     +D  R    
Sbjct: 1306 FRKRDLHMIYDDR-DGNDYQ---------IEANPCKIHEFSGIKEFGRAWTSDCTRKSQM 1355

Query: 1156 RETNQVSARKAVKDDSAGSVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKP 977
             E   V  +  V+  S G ++  +     + +C R+ RPVVCG  G I N +L    ++P
Sbjct: 1356 AEPTHVHTKDGVRCRSFGCMKALSS--GEVNICSRKVRPVVCGKYGEICN-ELIGDVSRP 1412

Query: 976  TKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCF----------------RSEYNNT 845
             KIVPLS ILK +RR  +    +       E KK+ F                +S  +++
Sbjct: 1413 AKIVPLSRILKTSRRDTLPNTCDSKQTFPDELKKAIFCGSDAGYNGFSNLKEEKSAIHHS 1472

Query: 844  VFKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKS 665
                +++ +  L E +K   +  +    E SM +K  D    ++ ++   K F+ +  K 
Sbjct: 1473 SICNEMNVDLSLEEDEKMFTNGVD---EENSMLEKKLDHKSKKNCSKLNRKVFTKS--KP 1527

Query: 664  KFKEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEE 485
            K KE RKRSL EL+  G K +     L K  +C    K    + S    G+  N   + E
Sbjct: 1528 KSKEIRKRSLCELTDNGKKSTSESFSLVKISKCM--PKMEAGKVSKNAVGSKQNIRASSE 1585

Query: 484  LCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKA 305
                ++ + +       +++DSDAFCCVCG SN+DEINCL+ECS C I+VHQACYGVSK 
Sbjct: 1586 ----VNSEKLNPEHRSLYVMDSDAFCCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKV 1641

Query: 304  PKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDI------------ 161
            PKG WYCRPC +NS++IVCVLCGYGGGAMT AL+S  IVK  LKAW+I            
Sbjct: 1642 PKGHWYCRPCRTNSRDIVCVLCGYGGGAMTCALRSRTIVKGLLKAWNIETDSRHKNAVSS 1701

Query: 160  ------------------------------TEHSSSAVLKKNWHKQTNVMKNSSGLARS- 74
                                          TE  S+A  K ++  Q +V++ SSG A + 
Sbjct: 1702 AQIMEDDLNMLHSSGPMLESSMLPVSRPVNTEPLSTAAWKMDFPNQLDVLQKSSGNANNV 1761

Query: 73   EVHNSIIAGALDPIIKQWVHMVCG 2
            +VHNSI AGA D  +KQWVHMVCG
Sbjct: 1762 KVHNSITAGAFDSTVKQWVHMVCG 1785


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  375 bits (962), Expect = e-101
 Identities = 275/804 (34%), Positives = 390/804 (48%), Gaps = 69/804 (8%)
 Frame = -2

Query: 2206 NLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAE-VFNGRRNV 2030
            +L++K  +  +   +  Q    N C  SQW+DVPSK KG     C++  AE + +GR N+
Sbjct: 1014 DLREKIISSDQRAKVTGQVRKSNVCHASQWKDVPSKYKGVSTVACLDLSAEDLLDGRGNI 1073

Query: 2029 KGQPIE-TAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMD---AR 1862
             GQ  + T+      ++  +SL EQ+MSN  SGCSA A+T  + + NN+ S T D   AR
Sbjct: 1074 DGQLGDATSKCSYGTMKIRDSLKEQEMSNISSGCSAAAVTHTSVQGNNLDSTTPDVGNAR 1133

Query: 1861 YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDD 1682
            Y+N H+ DEGSGI KCWSSD+  ++ERS E +  + K +++K G S  +  LSS   +D+
Sbjct: 1134 YINKHIVDEGSGIDKCWSSDDALESERSAEFLGSNCKTNLSKEGSSKNINNLSSRSLLDE 1193

Query: 1681 LRSGN-IRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLS 1505
            L+  N +  KK   ++     V+   N K K+E   K GK+K+  K K L    P  G S
Sbjct: 1194 LKLLNSLTWKKNRKQTHTRLAVHGKINFK-KIERGVKTGKKKRARKIKMLVPQCPTGGPS 1252

Query: 1504 SVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXX 1325
            +V Y  P  +G D +              N   Q  C   +       +           
Sbjct: 1253 TVPYKYP--KGTDSLPFSSEDVEMH----NPSFQETCISGACSPQPISKCGRSLSSSKEL 1306

Query: 1324 XXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNP----KLLGENRFKWCCNTDANRHFPR 1157
               RD H I++   D  DY             NP    +  G   F     +D  R    
Sbjct: 1307 FRKRDLHMIYDDR-DGNDYQ---------IEANPCKIHEFSGIKEFGRAWTSDCTRKSQM 1356

Query: 1156 RETNQVSARKAVKDDSAGSVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKP 977
             E   V  +  V+  S G ++  +     + +C R+ RPVVCG  G I N +L    ++P
Sbjct: 1357 AEPTHVHTKDGVRCRSFGCMKALSS--GEVNICSRKVRPVVCGKYGEICN-ELIGDVSRP 1413

Query: 976  TKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCF----------------RSEYNNT 845
             KIVPLS ILK +RR  +    +       E KK+ F                +S  +++
Sbjct: 1414 AKIVPLSRILKTSRRDTLPNTCDSKQTFPDELKKAIFCGSDAGYNGFSNLKEEKSAIHHS 1473

Query: 844  VFKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKS 665
                +++ +  L E +K   +  +    E SM +K  D    ++ ++   K F+ +  K 
Sbjct: 1474 SICNEMNVDLSLEEDEKMFTNGVD---EENSMLEKKLDHKSKKNCSKLNRKVFTKS--KP 1528

Query: 664  KFKEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEE 485
            K KE RKRSL EL+  G K +     L K  +C    K    + S    G+  N   + E
Sbjct: 1529 KSKEIRKRSLCELTDNGKKSTSESFSLVKISKCM--PKMEAGKVSKNAVGSKQNIRASSE 1586

Query: 484  LCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKA 305
                ++ + +       +++DSDAFCCVCG SN+DEINCL+ECS C I+VHQACYGVSK 
Sbjct: 1587 ----VNSEKLNPEHRSLYVMDSDAFCCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKV 1642

Query: 304  PKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDI------------ 161
            PKG WYCRPC +NS++IVCVLCGYGGGAMT AL+S  IVK  LKAW+I            
Sbjct: 1643 PKGHWYCRPCRTNSRDIVCVLCGYGGGAMTCALRSRTIVKGLLKAWNIETDSRHKNAVSS 1702

Query: 160  ------------------------------TEHSSSAVLKKNWHKQTNVMKNSSGLARS- 74
                                          TE  S+A  K ++  Q +V++ SSG A + 
Sbjct: 1703 AQIMEDDLNMLHSSGPMLESSMLPVSRPVNTEPLSTAAWKMDFPNQLDVLQKSSGNANNV 1762

Query: 73   EVHNSIIAGALDPIIKQWVHMVCG 2
            +VHNSI AGA D  +KQWVHMVCG
Sbjct: 1763 KVHNSITAGAFDSTVKQWVHMVCG 1786


>ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma cacao]
            gi|508782152|gb|EOY29408.1| Uncharacterized protein
            isoform 9 [Theobroma cacao]
          Length = 1619

 Score =  353 bits (905), Expect = 2e-94
 Identities = 264/803 (32%), Positives = 381/803 (47%), Gaps = 66/803 (8%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            H+  K++ + + +   +K Q   +  C  SQWRDVPSKQK  C    I   AEV +    
Sbjct: 639  HVIPKERTSLLYQGGKVKGQLPVRIACHASQWRDVPSKQKEACKMTRINPSAEVLDASGC 698

Query: 2032 VKGQPIETAAVVINE-IQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDAR-- 1862
             + Q  +     I   +  A S   Q MSN  SGCSAP +T+ + EVNN+ S T+DA   
Sbjct: 699  AEDQHGDAGMRCIGSAVNRAASFKGQDMSNISSGCSAPDVTQASIEVNNMDSSTIDAEDN 758

Query: 1861 -YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRG---FSYGLPALSSEG 1694
             Y+ND V DEGSGI KC SS++  ++ERS   +  S +  +  +G      G P+ S   
Sbjct: 759  GYMNDLVVDEGSGIDKCCSSNDAHESERSAAFIGVSCRSKIRTKGSPRIPNGQPSFSL-- 816

Query: 1693 PIDDLRS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPA 1517
             +D+L+   ++  KK  N+   + T     N  +K+    K GK+K+  K++ LDA FP 
Sbjct: 817  -LDELKLIDSLTWKKGKNQIYTSITGSGRTNHLKKIRRGSKAGKRKRTVKFRTLDAAFPP 875

Query: 1516 TGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXX 1337
              +S  H  S N   Q                 +    I  G+  +G ++  +       
Sbjct: 876  K-VSFRHCSSNNGSPQ----------LPSRSSKDWQTLIPSGLEPHGDTDLIQPGELFSA 924

Query: 1336 XXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPR 1157
                    D HG++      EDY + +LK +    + P++ G  + K     D+   F  
Sbjct: 925  KIVSQKR-DLHGVYNDQDGEEDY-QPELKCDARFGKIPEVSGRKKLKRAGAFDS---FES 979

Query: 1156 RETNQVSARKAVKDDSAGSVQHSNFFIN-HIGMCDRQPRPVVCGNSGIISNGKLTSGQTK 980
              T++   R   K  ++ +V     F +  +  CD++ RP+VCG  G I + K  + + +
Sbjct: 980  LGTSKSILRTVEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1039

Query: 979  PTKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTVFKRDVDTETFLFEK 800
            P KIVPLS +LK   +C + ++ +P        KK            +R   T  F  +K
Sbjct: 1040 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKK------------RRPKSTVYFDLKK 1087

Query: 799  KKEG----FS-RNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSKF-------- 659
             +E     FS  +EV    +   +K   SG  +  N   L          K+        
Sbjct: 1088 AEENGGNQFSVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIA 1147

Query: 658  --------KEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDN 503
                    KE RKRSL EL+GKG +     + L +  +C  + K R       LK T D 
Sbjct: 1148 YNRSNIRCKEIRKRSLYELTGKGKESGSDSHPLMEISKCMPKMKVRKS-----LKETGDV 1202

Query: 502  QCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQAC 323
            + H        + KSI   +C + IVDSD FCCVCGSSN+DE NCLLECS CSIRVHQAC
Sbjct: 1203 ESHGHRSSNMNAEKSIMQTRCSS-IVDSDVFCCVCGSSNKDEFNCLLECSRCSIRVHQAC 1261

Query: 322  YGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDI------ 161
            YG+ K P+G WYCRPC ++SK+ VCVLCGYGGGAMT+AL+S   VK  LKAW+I      
Sbjct: 1262 YGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGLLKAWNIEAECGP 1321

Query: 160  --TEHSSSAVL---------------------------KKNWHKQTNVMKNS-SGLARSE 71
              T +S+  VL                           K +   Q ++++NS    ++  
Sbjct: 1322 KSTNYSAETVLDDQSLVVSNSFCNLQFKDLELSRTASWKLDVQNQLDIIRNSPCPDSKLN 1381

Query: 70   VHNSIIAGALDPIIKQWVHMVCG 2
            ++NS+ AG LD  +KQWVHMVCG
Sbjct: 1382 LYNSVTAGVLDSTVKQWVHMVCG 1404


>ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
            gi|508782151|gb|EOY29407.1| Uncharacterized protein
            isoform 8, partial [Theobroma cacao]
          Length = 2068

 Score =  353 bits (905), Expect = 2e-94
 Identities = 264/803 (32%), Positives = 381/803 (47%), Gaps = 66/803 (8%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            H+  K++ + + +   +K Q   +  C  SQWRDVPSKQK  C    I   AEV +    
Sbjct: 1005 HVIPKERTSLLYQGGKVKGQLPVRIACHASQWRDVPSKQKEACKMTRINPSAEVLDASGC 1064

Query: 2032 VKGQPIETAAVVINE-IQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDAR-- 1862
             + Q  +     I   +  A S   Q MSN  SGCSAP +T+ + EVNN+ S T+DA   
Sbjct: 1065 AEDQHGDAGMRCIGSAVNRAASFKGQDMSNISSGCSAPDVTQASIEVNNMDSSTIDAEDN 1124

Query: 1861 -YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRG---FSYGLPALSSEG 1694
             Y+ND V DEGSGI KC SS++  ++ERS   +  S +  +  +G      G P+ S   
Sbjct: 1125 GYMNDLVVDEGSGIDKCCSSNDAHESERSAAFIGVSCRSKIRTKGSPRIPNGQPSFSL-- 1182

Query: 1693 PIDDLRS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPA 1517
             +D+L+   ++  KK  N+   + T     N  +K+    K GK+K+  K++ LDA FP 
Sbjct: 1183 -LDELKLIDSLTWKKGKNQIYTSITGSGRTNHLKKIRRGSKAGKRKRTVKFRTLDAAFPP 1241

Query: 1516 TGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXX 1337
              +S  H  S N   Q                 +    I  G+  +G ++  +       
Sbjct: 1242 K-VSFRHCSSNNGSPQ----------LPSRSSKDWQTLIPSGLEPHGDTDLIQPGELFSA 1290

Query: 1336 XXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPR 1157
                    D HG++      EDY + +LK +    + P++ G  + K     D+   F  
Sbjct: 1291 KIVSQKR-DLHGVYNDQDGEEDY-QPELKCDARFGKIPEVSGRKKLKRAGAFDS---FES 1345

Query: 1156 RETNQVSARKAVKDDSAGSVQHSNFFIN-HIGMCDRQPRPVVCGNSGIISNGKLTSGQTK 980
              T++   R   K  ++ +V     F +  +  CD++ RP+VCG  G I + K  + + +
Sbjct: 1346 LGTSKSILRTVEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1405

Query: 979  PTKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTVFKRDVDTETFLFEK 800
            P KIVPLS +LK   +C + ++ +P        KK            +R   T  F  +K
Sbjct: 1406 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKK------------RRPKSTVYFDLKK 1453

Query: 799  KKEG----FS-RNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSKF-------- 659
             +E     FS  +EV    +   +K   SG  +  N   L          K+        
Sbjct: 1454 AEENGGNQFSVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIA 1513

Query: 658  --------KEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDN 503
                    KE RKRSL EL+GKG +     + L +  +C  + K R       LK T D 
Sbjct: 1514 YNRSNIRCKEIRKRSLYELTGKGKESGSDSHPLMEISKCMPKMKVRKS-----LKETGDV 1568

Query: 502  QCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQAC 323
            + H        + KSI   +C + IVDSD FCCVCGSSN+DE NCLLECS CSIRVHQAC
Sbjct: 1569 ESHGHRSSNMNAEKSIMQTRCSS-IVDSDVFCCVCGSSNKDEFNCLLECSRCSIRVHQAC 1627

Query: 322  YGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDI------ 161
            YG+ K P+G WYCRPC ++SK+ VCVLCGYGGGAMT+AL+S   VK  LKAW+I      
Sbjct: 1628 YGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGLLKAWNIEAECGP 1687

Query: 160  --TEHSSSAVL---------------------------KKNWHKQTNVMKNS-SGLARSE 71
              T +S+  VL                           K +   Q ++++NS    ++  
Sbjct: 1688 KSTNYSAETVLDDQSLVVSNSFCNLQFKDLELSRTASWKLDVQNQLDIIRNSPCPDSKLN 1747

Query: 70   VHNSIIAGALDPIIKQWVHMVCG 2
            ++NS+ AG LD  +KQWVHMVCG
Sbjct: 1748 LYNSVTAGVLDSTVKQWVHMVCG 1770


>ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508782146|gb|EOY29402.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 2104

 Score =  353 bits (905), Expect = 2e-94
 Identities = 264/803 (32%), Positives = 381/803 (47%), Gaps = 66/803 (8%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            H+  K++ + + +   +K Q   +  C  SQWRDVPSKQK  C    I   AEV +    
Sbjct: 1005 HVIPKERTSLLYQGGKVKGQLPVRIACHASQWRDVPSKQKEACKMTRINPSAEVLDASGC 1064

Query: 2032 VKGQPIETAAVVINE-IQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDAR-- 1862
             + Q  +     I   +  A S   Q MSN  SGCSAP +T+ + EVNN+ S T+DA   
Sbjct: 1065 AEDQHGDAGMRCIGSAVNRAASFKGQDMSNISSGCSAPDVTQASIEVNNMDSSTIDAEDN 1124

Query: 1861 -YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRG---FSYGLPALSSEG 1694
             Y+ND V DEGSGI KC SS++  ++ERS   +  S +  +  +G      G P+ S   
Sbjct: 1125 GYMNDLVVDEGSGIDKCCSSNDAHESERSAAFIGVSCRSKIRTKGSPRIPNGQPSFSL-- 1182

Query: 1693 PIDDLRS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPA 1517
             +D+L+   ++  KK  N+   + T     N  +K+    K GK+K+  K++ LDA FP 
Sbjct: 1183 -LDELKLIDSLTWKKGKNQIYTSITGSGRTNHLKKIRRGSKAGKRKRTVKFRTLDAAFPP 1241

Query: 1516 TGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXX 1337
              +S  H  S N   Q                 +    I  G+  +G ++  +       
Sbjct: 1242 K-VSFRHCSSNNGSPQ----------LPSRSSKDWQTLIPSGLEPHGDTDLIQPGELFSA 1290

Query: 1336 XXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPR 1157
                    D HG++      EDY + +LK +    + P++ G  + K     D+   F  
Sbjct: 1291 KIVSQKR-DLHGVYNDQDGEEDY-QPELKCDARFGKIPEVSGRKKLKRAGAFDS---FES 1345

Query: 1156 RETNQVSARKAVKDDSAGSVQHSNFFIN-HIGMCDRQPRPVVCGNSGIISNGKLTSGQTK 980
              T++   R   K  ++ +V     F +  +  CD++ RP+VCG  G I + K  + + +
Sbjct: 1346 LGTSKSILRTVEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1405

Query: 979  PTKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTVFKRDVDTETFLFEK 800
            P KIVPLS +LK   +C + ++ +P        KK            +R   T  F  +K
Sbjct: 1406 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKK------------RRPKSTVYFDLKK 1453

Query: 799  KKEG----FS-RNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSKF-------- 659
             +E     FS  +EV    +   +K   SG  +  N   L          K+        
Sbjct: 1454 AEENGGNQFSVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIA 1513

Query: 658  --------KEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDN 503
                    KE RKRSL EL+GKG +     + L +  +C  + K R       LK T D 
Sbjct: 1514 YNRSNIRCKEIRKRSLYELTGKGKESGSDSHPLMEISKCMPKMKVRKS-----LKETGDV 1568

Query: 502  QCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQAC 323
            + H        + KSI   +C + IVDSD FCCVCGSSN+DE NCLLECS CSIRVHQAC
Sbjct: 1569 ESHGHRSSNMNAEKSIMQTRCSS-IVDSDVFCCVCGSSNKDEFNCLLECSRCSIRVHQAC 1627

Query: 322  YGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDI------ 161
            YG+ K P+G WYCRPC ++SK+ VCVLCGYGGGAMT+AL+S   VK  LKAW+I      
Sbjct: 1628 YGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGLLKAWNIEAECGP 1687

Query: 160  --TEHSSSAVL---------------------------KKNWHKQTNVMKNS-SGLARSE 71
              T +S+  VL                           K +   Q ++++NS    ++  
Sbjct: 1688 KSTNYSAETVLDDQSLVVSNSFCNLQFKDLELSRTASWKLDVQNQLDIIRNSPCPDSKLN 1747

Query: 70   VHNSIIAGALDPIIKQWVHMVCG 2
            ++NS+ AG LD  +KQWVHMVCG
Sbjct: 1748 LYNSVTAGVLDSTVKQWVHMVCG 1770


>ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590572148|ref|XP_007011782.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572172|ref|XP_007011784.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572176|ref|XP_007011785.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572180|ref|XP_007011786.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572184|ref|XP_007011787.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782144|gb|EOY29400.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782145|gb|EOY29401.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782148|gb|EOY29404.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782150|gb|EOY29406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1738

 Score =  353 bits (905), Expect = 2e-94
 Identities = 264/803 (32%), Positives = 381/803 (47%), Gaps = 66/803 (8%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            H+  K++ + + +   +K Q   +  C  SQWRDVPSKQK  C    I   AEV +    
Sbjct: 639  HVIPKERTSLLYQGGKVKGQLPVRIACHASQWRDVPSKQKEACKMTRINPSAEVLDASGC 698

Query: 2032 VKGQPIETAAVVINE-IQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDAR-- 1862
             + Q  +     I   +  A S   Q MSN  SGCSAP +T+ + EVNN+ S T+DA   
Sbjct: 699  AEDQHGDAGMRCIGSAVNRAASFKGQDMSNISSGCSAPDVTQASIEVNNMDSSTIDAEDN 758

Query: 1861 -YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRG---FSYGLPALSSEG 1694
             Y+ND V DEGSGI KC SS++  ++ERS   +  S +  +  +G      G P+ S   
Sbjct: 759  GYMNDLVVDEGSGIDKCCSSNDAHESERSAAFIGVSCRSKIRTKGSPRIPNGQPSFSL-- 816

Query: 1693 PIDDLRS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPA 1517
             +D+L+   ++  KK  N+   + T     N  +K+    K GK+K+  K++ LDA FP 
Sbjct: 817  -LDELKLIDSLTWKKGKNQIYTSITGSGRTNHLKKIRRGSKAGKRKRTVKFRTLDAAFPP 875

Query: 1516 TGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXX 1337
              +S  H  S N   Q                 +    I  G+  +G ++  +       
Sbjct: 876  K-VSFRHCSSNNGSPQ----------LPSRSSKDWQTLIPSGLEPHGDTDLIQPGELFSA 924

Query: 1336 XXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPR 1157
                    D HG++      EDY + +LK +    + P++ G  + K     D+   F  
Sbjct: 925  KIVSQKR-DLHGVYNDQDGEEDY-QPELKCDARFGKIPEVSGRKKLKRAGAFDS---FES 979

Query: 1156 RETNQVSARKAVKDDSAGSVQHSNFFIN-HIGMCDRQPRPVVCGNSGIISNGKLTSGQTK 980
              T++   R   K  ++ +V     F +  +  CD++ RP+VCG  G I + K  + + +
Sbjct: 980  LGTSKSILRTVEKSYNSNAVHCIKAFSSLEVTFCDKKDRPIVCGEYGEICSRKFATDELR 1039

Query: 979  PTKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTVFKRDVDTETFLFEK 800
            P KIVPLS +LK   +C + ++ +P        KK            +R   T  F  +K
Sbjct: 1040 PAKIVPLSRVLKNTEQCTLQKSCKPKSTLRKSKKK------------RRPKSTVYFDLKK 1087

Query: 799  KKEG----FS-RNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSKF-------- 659
             +E     FS  +EV    +   +K   SG  +  N   L          K+        
Sbjct: 1088 AEENGGNQFSVSHEVSGCHVEEGKKTCVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIA 1147

Query: 658  --------KEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDN 503
                    KE RKRSL EL+GKG +     + L +  +C  + K R       LK T D 
Sbjct: 1148 YNRSNIRCKEIRKRSLYELTGKGKESGSDSHPLMEISKCMPKMKVRKS-----LKETGDV 1202

Query: 502  QCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQAC 323
            + H        + KSI   +C + IVDSD FCCVCGSSN+DE NCLLECS CSIRVHQAC
Sbjct: 1203 ESHGHRSSNMNAEKSIMQTRCSS-IVDSDVFCCVCGSSNKDEFNCLLECSRCSIRVHQAC 1261

Query: 322  YGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDI------ 161
            YG+ K P+G WYCRPC ++SK+ VCVLCGYGGGAMT+AL+S   VK  LKAW+I      
Sbjct: 1262 YGILKVPRGHWYCRPCRTSSKDTVCVLCGYGGGAMTQALRSRAFVKGLLKAWNIEAECGP 1321

Query: 160  --TEHSSSAVL---------------------------KKNWHKQTNVMKNS-SGLARSE 71
              T +S+  VL                           K +   Q ++++NS    ++  
Sbjct: 1322 KSTNYSAETVLDDQSLVVSNSFCNLQFKDLELSRTASWKLDVQNQLDIIRNSPCPDSKLN 1381

Query: 70   VHNSIIAGALDPIIKQWVHMVCG 2
            ++NS+ AG LD  +KQWVHMVCG
Sbjct: 1382 LYNSVTAGVLDSTVKQWVHMVCG 1404


>ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
            gi|223540953|gb|EEF42511.1| mixed-lineage leukemia
            protein, mll, putative [Ricinus communis]
          Length = 1125

 Score =  340 bits (872), Expect = 2e-90
 Identities = 265/820 (32%), Positives = 379/820 (46%), Gaps = 83/820 (10%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            HM  K  A +  +   LK +        TSQW+DVP K K  C   C ++ A+    R  
Sbjct: 17   HMASKVNAISFDQCGMLKGELPKNATFHTSQWKDVPRKLKRVCEVACAKQSADTSLKREY 76

Query: 2032 VKGQPIETAAVVIN-EIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTM--DAR 1862
              GQ  + AA   +  +  A S  EQ MSN  SGCS PA+T+ + E  NV S T+  ++ 
Sbjct: 77   KLGQLGDNAANCFDGAVAAAASFKEQDMSNISSGCSTPAVTQASTEFTNVESSTVVGNSG 136

Query: 1861 YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDD 1682
             +N+ V DEGSGI KCWSSD+ F+++RS +    + K ++   G        SS   +D+
Sbjct: 137  CINNLVVDEGSGIDKCWSSDDAFESDRSADFHGSTCKKNLVYMGSHNTAVNKSSRSLLDE 196

Query: 1681 LR-SGNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLS 1505
            ++   ++  KK  N+  +  TV+   N  Q+ +   K GK+K+    K  DA    T   
Sbjct: 197  VKLMDSLTWKKGQNQKHNGITVHGKNNHSQEFDRGLKTGKRKREIIPKVSDAPL-GTAAP 255

Query: 1504 SVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCG-----ISSNGH---SNFKRXXX 1349
             +H   P   G  +               +  +Q++        +S  H   +N K    
Sbjct: 256  MLHGKYPEYGGTAD-----------WPCLSENVQMVSAGQESSQTSGAHCVKANPKDGNC 304

Query: 1348 XXXXXXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANR 1169
                       RD H ++ + GD E      +  +       ++LG  +F+     D + 
Sbjct: 305  MQSVSKSLSRNRDLHRLY-NAGDGEANPHNDINHDDNSCEVLEILGRKKFRSIHAADLSI 363

Query: 1168 HFPRRETNQVSARKAVKDDSAGSVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSG 989
             F R++  Q    KA K DS   ++ S+    H+  C  + +PV CG  G I NG L   
Sbjct: 364  QFQRQDCTQAVGEKAGKYDSLDRIKASS--AQHL--CHGKAKPVACGKYGEIVNGNLNGD 419

Query: 988  QTKPTKIVPLSLILKKARRCVVSENEEPIPVSVSET------KKSCFRSEYNNTVFK--- 836
             +KP KIV L  +LK A++C + +  +P   S  E         +CF    N T  K   
Sbjct: 420  VSKPAKIVSLDKVLKTAQKCSLPKICKPGLTSSKEIGTNFSWSNACFGKFSNLTKEKEHG 479

Query: 835  -------RDVDTETFLFEKKKEGFSR-NEVCLAELSMPQKDRDS---GCPRSLNQDILKR 689
                   +D++  T L EK+   F+  +E    E+SM +K       GC       IL  
Sbjct: 480  RNVALLCKDMNVRTSL-EKRSNSFANYDEQSADEVSMLEKSEGKNGRGCV------ILDT 532

Query: 688  FSSAPLKSKFKEARKRSLDELSGKGNKLSPA-----KNFL---RKSLRCSFRTKHRFHEH 533
             + A  +SK++E RKRSL EL+ KG   SP      KNF    +  L  + R   + H++
Sbjct: 533  IAHAQSRSKYRETRKRSLYELTLKGKSSSPKMVSRKKNFKYVPKMKLGKTLRNSEKSHDN 592

Query: 532  SCLLKGTDDNQCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECS 353
                +  D  +C  E+                  I D D+FC VC SSN+DE+NCLLEC 
Sbjct: 593  GS--QKVDPKRCAREQK--------------HLSITDMDSFCSVCRSSNKDEVNCLLECR 636

Query: 352  VCSIRVHQACYGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLK 173
             CSIRVHQACYGVS+ PKG WYCRPC +++K+IVCVLCGYGGGAMT AL+S  IVK  LK
Sbjct: 637  RCSIRVHQACYGVSRVPKGHWYCRPCRTSAKDIVCVLCGYGGGAMTLALRSRTIVKGLLK 696

Query: 172  AWDI------------------------------------------TEHSSSAVLKKNWH 119
            AW++                                           E S+S V  K+  
Sbjct: 697  AWNLEIESVAKNAISSPEILHHEMSMLHSSGPGPENRSYPVLRPVNIEPSTSTVCNKDVQ 756

Query: 118  KQTNVMKNSSG-LARSEVHNSIIAGALDPIIKQWVHMVCG 2
               +++ NS G L+  +V+NSI AG LD  +KQWVHMVCG
Sbjct: 757  NHLDILPNSLGHLSNLKVNNSITAGVLDSTVKQWVHMVCG 796


>ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca
            subsp. vesca]
          Length = 2169

 Score =  326 bits (835), Expect = 3e-86
 Identities = 259/809 (32%), Positives = 380/809 (46%), Gaps = 70/809 (8%)
 Frame = -2

Query: 2218 NVHMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGR 2039
            N H+  KDK  ++     L  + +  N   TSQWRDVPSK KG  +   ++R A +F+  
Sbjct: 1062 NSHLIPKDKTVSLDHKRKLSGEVTKNNAYHTSQWRDVPSKVKGVSDVTRVDRLANLFDAT 1121

Query: 2038 RNVKGQPIETAAVVIN-EIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDAR 1862
            R  + +  +T     N  +Q A+S+ E ++SN  SGCSAP +++ + E NN+ S T D  
Sbjct: 1122 REDREKLGDTCVKCFNGTVQIADSMKEHEVSNISSGCSAPVVSQPSIEFNNMESSTNDP- 1180

Query: 1861 YVNDH------VFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSS 1700
               DH      V DEGSGI K WSSD+  ++ERS + +  +G   + K G    L   SS
Sbjct: 1181 --GDHGCGSNFVVDEGSGIDKAWSSDDALESERSAKFLASTGS-SLKKVGAPKNLNHESS 1237

Query: 1699 EGPIDDLRSGN-IRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATF 1523
               +DDL+  N +  +K  ++ P    +       Q LE   K+GK+K+    + L+A+ 
Sbjct: 1238 SCLLDDLKLLNSLTWQKGRDQIPAGLALRDKDKHLQNLEQGLKIGKRKRELALE-LNASC 1296

Query: 1522 PATGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXX 1343
              +  S V  ++ NS G  +              S      + G      S+  R     
Sbjct: 1297 SNSDSSRVRQENHNSNGTSQFTSQPSKSLMMLSTSRKSGTHVTGNCITQSSSKPRLHISS 1356

Query: 1342 XXXXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRN----PKLLGENRFKWCCNTDA 1175
                       R  +H+ H D E       + E+    N    P++ G    K  C+++A
Sbjct: 1357 SAKKLLL----RSDLHKLHDDKESEVNNVFQTELNGGANNHELPEVSGGKTCKRDCSSNA 1412

Query: 1174 NRHFPRRETNQVSARKAVKDDSAGSVQ-HSNFFINHIGMCDRQPRPVVCGNSGIISNGKL 998
             R F  +E    S+RK  K     SV    +     + +  R+ RP+VCG  G +++G  
Sbjct: 1413 FRQFQIQE----SSRKDTKRTKYNSVDGFKSTCSQQVKIGHRKARPIVCGIYGELTDGSS 1468

Query: 997  TSGQTKPTKIVPLSLILKKARRCVVSE--NEEPIPVSVSETKKSCFRSEYNNTVFKRDVD 824
            T   +KP K+VPLS +L  +R+C++ +  N +    S S  KK    +   NT    D+ 
Sbjct: 1469 TGRMSKPAKLVPLSRVLNSSRKCILPKLCNSK----SSSMRKKKLGGAAICNTY---DLK 1521

Query: 823  TE-------------TFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLNQDILKRFS 683
            TE             T + +KKKE          EL   +K  D    +   +  L   +
Sbjct: 1522 TEKYKCHDAMVKVNDTSMRKKKKECSPGEREIHKELFSMEKQGDVQSEKDHQK--LDSIT 1579

Query: 682  SAPLKSKFKEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDN 503
               L+ K KE RKRS+ E + KG+      + + K    +FR  +   +   +  G D  
Sbjct: 1580 HTQLQMKPKEIRKRSIYEFTEKGDDTGFKSSSVSKI--SNFRPAN---DGKLVNTGEDSG 1634

Query: 502  QCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQAC 323
                  LC+  ++ S ++ +C     DSD  CCVCGSSN+DEIN LLECS CS+RVHQAC
Sbjct: 1635 ------LCQHSAKNSTQEHRCHCNC-DSDPICCVCGSSNQDEINILLECSQCSVRVHQAC 1687

Query: 322  YGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAWDIT----- 158
            YGVSK PKG W CRPC  +SK+IVCVLCGYGGGAMT+AL+S  I  S L+AW+I      
Sbjct: 1688 YGVSKVPKGCWSCRPCRMSSKDIVCVLCGYGGGAMTQALRSQTIAVSILRAWNIETECGP 1747

Query: 157  ----------------------EHSSSAVL---------------KKNWHKQTNVMKNSS 89
                                   HS S+ L               K+    + + ++NS 
Sbjct: 1748 KNELCSIKTLQKDSTGLHCSGYRHSESSSLFVSQQSGQPLAAAHCKRGMSYRVDGVENSP 1807

Query: 88   GLARSEVHNSIIAGALDPIIKQWVHMVCG 2
             +++++VHNSI  G +D   KQWVHMVCG
Sbjct: 1808 SVSKTKVHNSITMGLVDSATKQWVHMVCG 1836


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score =  306 bits (783), Expect = 3e-80
 Identities = 245/762 (32%), Positives = 351/762 (46%), Gaps = 54/762 (7%)
 Frame = -2

Query: 2125 SQWRDVPSKQKGCCNAPCIERPAEVFNGRRNVKGQPIETAAVVINEIQEAESLNEQQMSN 1946
            SQWRDVPSK K      C +  AE  N     K                  S  E + SN
Sbjct: 1021 SQWRDVPSKVKRVSTTMCRDSSAECINVTMQTKN-----------------SSKENETSN 1063

Query: 1945 GFSGCSAPAITEVTGEVNNVGSCTMDARY---VNDHVFDEGSGIAKCWSSDETFDNERST 1775
              SG SAPA+T+++ EVN       DA     V++ V DEGSGI KCWSSD+   +ERS 
Sbjct: 1064 ISSGSSAPAVTQLSVEVNKTDYSCADAGNTGCVSNLVVDEGSGIDKCWSSDDARGSERSE 1123

Query: 1774 ETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRSGN-IRLKKLPNRSPDACTVYKSFNCK 1598
            +    + K    + G S      SS   +D+L+  N +  KK P +      + +  +  
Sbjct: 1124 DFHGDNCKTSFTESGSSKNANCKSSRSLLDELKLINSLTWKKGPKQIQTGTFLNEEDHLS 1183

Query: 1597 QKLESDFKVGKQKKPTKWKRLDATFPATGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXS 1418
             KL    K GK+ +       D +    G +S  + S  SQ   +IH             
Sbjct: 1184 IKLNRCLKKGKKNRDCSSLVHDES--NEGTNSAEFPSSASQ---QIH------SLSSHRK 1232

Query: 1417 NHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXXRDRHGIHEHHGDWEDYSRKKLKDEML 1238
            N G       S +   N +               RD + I         Y+ K+ KD   
Sbjct: 1233 NFG-------SCSNQQNSEHRLTTFSTMKKPSRKRDIYKI---------YNDKEEKDVSS 1276

Query: 1237 CSRNPKLLGENRFKWCCNTDANRHFPRRETNQVSARKAVKDDSAGSVQHSNFFINHIGMC 1058
            C   P++    R+K  C + +N      E     +R   K +S G ++ S     +   C
Sbjct: 1277 CE-TPEISAAKRYKKDCTSTSNGRSLIEEQTHGGSRTKNKYNSIGCMRSSLNCQANTRHC 1335

Query: 1057 DRQPRPVVCGNSGIISNGKLTSGQTKPTKIVPLSLILKKARRCVVSENEEPIPVSVSETK 878
              + +P+VCG  G +S+G+L    +KP KIVPLS +L  ARRC + +NE+    S+   K
Sbjct: 1336 --KSKPIVCGKYGELSDGELVGNMSKPAKIVPLSRVLMLARRCTLPKNEKRTFTSIRGMK 1393

Query: 877  ------------KSCFRSEYNNTVFKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDR 734
                        ++   S  ++      ++ ETFL   K     R++    +LSM + +R
Sbjct: 1394 THSDGADGFHRLRTEKESRSHDAAVSGKLNNETFLEIMKNRCSGRDDKFAEDLSMLEIER 1453

Query: 733  DSGCPRSLNQDILKRFSSAPLKSKFKEARKRSLDELSGKGNKLSPAKNFLRKSLRCSFRT 554
                     +D +   + A LKS+ KE RKRS+ EL+  G         L K+ +CS   
Sbjct: 1454 HENEKACGKEDSI---AHARLKSRSKEIRKRSIYELAVDGEAPHNKTLSLSKASKCSPEV 1510

Query: 553  KHRFHEHSCLLKGTDDNQCHAEELCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEI 374
                      + G  ++  H   LCE +++KS  D+   +  V S++FCCVCGSS++D+ 
Sbjct: 1511 SKG------TILGNGEDGTHG--LCE-VAQKS-PDQIWSSLPV-SESFCCVCGSSDKDDT 1559

Query: 373  NCLLECSVCSIRVHQACYGVSKAPKGRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLN 194
            N LLEC++C I+VHQACYGVS+APKG WYCRPC ++S+NIVCVLCGYGGGAMTRAL+S  
Sbjct: 1560 NNLLECNICLIKVHQACYGVSRAPKGHWYCRPCRTSSRNIVCVLCGYGGGAMTRALRSRT 1619

Query: 193  IVKSFLKAWDI--------------------------------------TEHSSSAVLKK 128
            IVKS L+ W++                                      T+  +S V K 
Sbjct: 1620 IVKSLLRVWNVETEWKALSVKDLETLTRLNSSGPEREEGTSFPMCQPENTKPLASVVCKM 1679

Query: 127  NWHKQTNVMKNSSGLARSEVHNSIIAGALDPIIKQWVHMVCG 2
            +     +V++NS  + + +V NSI AG LD   KQWVHMVCG
Sbjct: 1680 DMPYNVDVLRNSLCVKKLKVDNSITAGFLDSTTKQWVHMVCG 1721


>emb|CAN76638.1| hypothetical protein VITISV_027480 [Vitis vinifera]
          Length = 578

 Score =  261 bits (668), Expect = 7e-67
 Identities = 196/572 (34%), Positives = 274/572 (47%), Gaps = 38/572 (6%)
 Frame = -2

Query: 2212 HMNLKDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSKQKGCCNAPCIERPAEVFNGRRN 2033
            H+  K+K  +  + +  K Q+  + DC  SQW+DVPSK    C+  C+    +   GR+N
Sbjct: 20   HIVQKEKNISFHQNEKSKGQNHKKIDCHASQWKDVPSKVIVSCDMKCVRPSVDGLGGRKN 79

Query: 2032 VKGQPI------------ETAAVVIN-EIQEAESLNEQQMSNGFSGCSAPAITEVTGEVN 1892
             + QP             +TAA   N  +QE   L EQ+MSN  SGCSAPA+T+ + EVN
Sbjct: 80   DEDQPAMYGRKNDEDQLADTAAKRFNGNLQEINCLKEQEMSNISSGCSAPAVTQASIEVN 139

Query: 1891 NVGSCTMDAR---YVNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSY 1721
            N+ SCT+DA      ND V DE SGI KCWSSD+  D+ERS E +  + K    K G S 
Sbjct: 140  NMDSCTVDAGDTGCANDLVVDEASGIEKCWSSDDALDSERSAEFLGFTCKTSFIKEGSSK 199

Query: 1720 GLPALSSEGPIDDLR-SGNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKW 1544
             L   SS   ID+L+   + R K++ N S     +++  +   K+E   K  K+KK  K 
Sbjct: 200  ALANQSSRSLIDELKFRDSFRWKRVRNESHTGLAIHEKNSHSPKIERGLKTRKRKKTMKM 259

Query: 1543 KRLDATFPATGLSSVHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNF 1364
            K L+A+FPA+G SS HY+     G  E                 G    CG  + G S F
Sbjct: 260  KMLNASFPASGFSSGHYEHTKCAGSAEWRSFSYKDVDTLLQCELGTSHTCGACTIGPS-F 318

Query: 1363 KRXXXXXXXXXXXXXXRDRHGIHEHHGDWEDY-SRKKLKDEMLCSRN---PKLLGENRFK 1196
            KR              RD   I+      + Y ++ K K E L        K +G +R  
Sbjct: 319  KRRRSTLSSAKNFSRKRDVDKIYADREGEDGYQAQSKGKTEFLSIHEVSGAKRIGPDR-- 376

Query: 1195 WCCNTDANRHFPRRETNQVSARKAVKDDSAGSVQHSNFFINHIGMCDRQPRPVVCGNSGI 1016
                 +A R F  +E +     KAVK +S G V+ S+     + + +R+ +PVVCG  G+
Sbjct: 377  ---TAEAFRQFCMQEPSHT---KAVKYNSVGCVKESSCL--KLDVSNRREKPVVCGKYGV 428

Query: 1015 ISNGKLTSGQTKPTKIVPLSLILKKARRCVVSENEEPIPVSVSETKKSCFR--------- 863
            ISNGKL     KP KI  LS +LK ARRC +S N+EP   S+ + KK+  R         
Sbjct: 429  ISNGKLAIDVPKPAKIFSLSRVLKTARRCTLSANDEPRLTSMRQLKKARLRGSNGCVNEI 488

Query: 862  --------SEYNNTVFKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLN 707
                    +E  N     + + +  + E +K   S +  C  EL M  ++R  G  +   
Sbjct: 489  SNLMKEKENEIQNATRCDERNPDNSMEEAEKAVISGDTRCADELLMSXQERAYGSKKD-- 546

Query: 706  QDILKRFSSAPLKSKFKEARKRSLDELSGKGN 611
                  + S  LK K+KE RKRSL EL+GKGN
Sbjct: 547  ----DSYXSTRLKRKYKEIRKRSLYELTGKGN 574


>ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812602 isoform X6 [Glycine
            max]
          Length = 1870

 Score =  258 bits (660), Expect = 6e-66
 Identities = 249/793 (31%), Positives = 343/793 (43%), Gaps = 62/793 (7%)
 Frame = -2

Query: 2194 KANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNVKG 2024
            K   I +   L  Q S +   +T QWRDVPSK +K  C+A  + + A   +  G+ +V+ 
Sbjct: 813  KGENIEQGGKLDGQDSIKIGFRTPQWRDVPSKVRKAVCDATSLGQTATGMDWEGQDSVQL 872

Query: 2023 QPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSC---TMDARYVN 1853
              I         I   +   EQ+ SN  SGCSAP +T+ + EVN +  C    +D  +VN
Sbjct: 873  GNISMKRFK-RTIDMGDMSKEQENSNVSSGCSAPVVTQASLEVNKIEPCMGDAVDTGFVN 931

Query: 1852 DHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRS 1673
            + V DEGSGI K WSSD     E+S E + GS      K  +   L        +DDL+ 
Sbjct: 932  NLVVDEGSGIDKGWSSDLV---EKSDEFL-GSSSGSCLKNDYLRVLNDQPCCNLLDDLKL 987

Query: 1672 -GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSSVH 1496
              ++  KK  N++    +     N  QK++   K GK++K    + LDA+  +   S +H
Sbjct: 988  LDSLIWKKGWNQNNFVLSSNCKSNQSQKVKKGLK-GKKRKRNLVRILDASLSSEFPSLLH 1046

Query: 1495 YDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXX 1316
                N +     +                LQ     SSN  S  +               
Sbjct: 1047 --KKNEEVTGICNSSSSCSKEMQMRPLSSLQK----SSNKSSFVQPSNKQKHTAFSSKFL 1100

Query: 1315 RDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQVS 1136
              ++ +++H      Y  +   D    +  P + G  + K    +D    F  +E     
Sbjct: 1101 SCKNHLNKHQSYKVGYESESSSDAEFRTL-PGVSGSKKLKKDLTSDCFEQFQMQEP---- 1155

Query: 1135 ARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKPTKIVP 962
            A +  ++D     S +  N          R  RPVVCG  G IS+G L     KP KIV 
Sbjct: 1156 AYEEPENDKLRPFSCRKEN--------AHRITRPVVCGKYGEISSGHLAREVQKPVKIVS 1207

Query: 961  LSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTV------FKRDVDTETFLFEK 800
            L  +LK ++RC    N +PIP S  + K+    +   +         K   +T+  +F  
Sbjct: 1208 LRKVLKSSKRCTGHTNGKPIPTSKKKWKRLSIGTSSGHCCGNPGLKIKEHNETQNAIF-- 1265

Query: 799  KKEGFSRNEVCLAELSMPQKDRDSGCP---------RSLNQDILKRFSSAPLKSKFKEAR 647
                F++  V   +LSM   DR    P         ++   + +   +   LK K KE R
Sbjct: 1266 ----FNKTNV---DLSMEDLDRGGKPPVVYKGKRDAKAKQGNSVGNRAYVSLKVKNKEIR 1318

Query: 646  K-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEELCEGI 470
            K RS+ EL+ K  K+    N                             Q     LC   
Sbjct: 1319 KQRSITELTAKETKVMDMMN---------------------------SAQDQEPGLCSTA 1351

Query: 469  SRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKAPK-GR 293
            SR SI+     A I +SDAFCCVC SS+ D+IN LLECS C IRVHQACYGVS  PK   
Sbjct: 1352 SRNSIQGHMNIATI-NSDAFCCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSS 1410

Query: 292  WYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW------------------ 167
            W CRPC +NSKNIVCVLCGYGGGAMTRA+ S  IVKS LK W                  
Sbjct: 1411 WCCRPCRTNSKNIVCVLCGYGGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTTSHEVFE 1470

Query: 166  ----------DITEHSSSAVLK-KNWHKQTNVMK-------NSSGLARSEVHNSIIAGAL 41
                      D  E    +VLK K     T++MK         + ++  +VHNSI    L
Sbjct: 1471 KEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVSNFKVHNSITEAVL 1530

Query: 40   DPIIKQWVHMVCG 2
            DP +KQW+HMVCG
Sbjct: 1531 DPTVKQWIHMVCG 1543


>ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812602 isoform X5 [Glycine
            max]
          Length = 1872

 Score =  258 bits (660), Expect = 6e-66
 Identities = 249/793 (31%), Positives = 343/793 (43%), Gaps = 62/793 (7%)
 Frame = -2

Query: 2194 KANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNVKG 2024
            K   I +   L  Q S +   +T QWRDVPSK +K  C+A  + + A   +  G+ +V+ 
Sbjct: 815  KGENIEQGGKLDGQDSIKIGFRTPQWRDVPSKVRKAVCDATSLGQTATGMDWEGQDSVQL 874

Query: 2023 QPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSC---TMDARYVN 1853
              I         I   +   EQ+ SN  SGCSAP +T+ + EVN +  C    +D  +VN
Sbjct: 875  GNISMKRFK-RTIDMGDMSKEQENSNVSSGCSAPVVTQASLEVNKIEPCMGDAVDTGFVN 933

Query: 1852 DHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRS 1673
            + V DEGSGI K WSSD     E+S E + GS      K  +   L        +DDL+ 
Sbjct: 934  NLVVDEGSGIDKGWSSDLV---EKSDEFL-GSSSGSCLKNDYLRVLNDQPCCNLLDDLKL 989

Query: 1672 -GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSSVH 1496
              ++  KK  N++    +     N  QK++   K GK++K    + LDA+  +   S +H
Sbjct: 990  LDSLIWKKGWNQNNFVLSSNCKSNQSQKVKKGLK-GKKRKRNLVRILDASLSSEFPSLLH 1048

Query: 1495 YDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXX 1316
                N +     +                LQ     SSN  S  +               
Sbjct: 1049 --KKNEEVTGICNSSSSCSKEMQMRPLSSLQK----SSNKSSFVQPSNKQKHTAFSSKFL 1102

Query: 1315 RDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQVS 1136
              ++ +++H      Y  +   D    +  P + G  + K    +D    F  +E     
Sbjct: 1103 SCKNHLNKHQSYKVGYESESSSDAEFRTL-PGVSGSKKLKKDLTSDCFEQFQMQEP---- 1157

Query: 1135 ARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKPTKIVP 962
            A +  ++D     S +  N          R  RPVVCG  G IS+G L     KP KIV 
Sbjct: 1158 AYEEPENDKLRPFSCRKEN--------AHRITRPVVCGKYGEISSGHLAREVQKPVKIVS 1209

Query: 961  LSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTV------FKRDVDTETFLFEK 800
            L  +LK ++RC    N +PIP S  + K+    +   +         K   +T+  +F  
Sbjct: 1210 LRKVLKSSKRCTGHTNGKPIPTSKKKWKRLSIGTSSGHCCGNPGLKIKEHNETQNAIF-- 1267

Query: 799  KKEGFSRNEVCLAELSMPQKDRDSGCP---------RSLNQDILKRFSSAPLKSKFKEAR 647
                F++  V   +LSM   DR    P         ++   + +   +   LK K KE R
Sbjct: 1268 ----FNKTNV---DLSMEDLDRGGKPPVVYKGKRDAKAKQGNSVGNRAYVSLKVKNKEIR 1320

Query: 646  K-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEELCEGI 470
            K RS+ EL+ K  K+    N                             Q     LC   
Sbjct: 1321 KQRSITELTAKETKVMDMMN---------------------------SAQDQEPGLCSTA 1353

Query: 469  SRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKAPK-GR 293
            SR SI+     A I +SDAFCCVC SS+ D+IN LLECS C IRVHQACYGVS  PK   
Sbjct: 1354 SRNSIQGHMNIATI-NSDAFCCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSS 1412

Query: 292  WYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW------------------ 167
            W CRPC +NSKNIVCVLCGYGGGAMTRA+ S  IVKS LK W                  
Sbjct: 1413 WCCRPCRTNSKNIVCVLCGYGGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTTSHEVFE 1472

Query: 166  ----------DITEHSSSAVLK-KNWHKQTNVMK-------NSSGLARSEVHNSIIAGAL 41
                      D  E    +VLK K     T++MK         + ++  +VHNSI    L
Sbjct: 1473 KEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVSNFKVHNSITEAVL 1532

Query: 40   DPIIKQWVHMVCG 2
            DP +KQW+HMVCG
Sbjct: 1533 DPTVKQWIHMVCG 1545


>ref|XP_006596086.1| PREDICTED: uncharacterized protein LOC100812602 isoform X4 [Glycine
            max]
          Length = 1976

 Score =  258 bits (660), Expect = 6e-66
 Identities = 249/793 (31%), Positives = 343/793 (43%), Gaps = 62/793 (7%)
 Frame = -2

Query: 2194 KANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNVKG 2024
            K   I +   L  Q S +   +T QWRDVPSK +K  C+A  + + A   +  G+ +V+ 
Sbjct: 951  KGENIEQGGKLDGQDSIKIGFRTPQWRDVPSKVRKAVCDATSLGQTATGMDWEGQDSVQL 1010

Query: 2023 QPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSC---TMDARYVN 1853
              I         I   +   EQ+ SN  SGCSAP +T+ + EVN +  C    +D  +VN
Sbjct: 1011 GNISMKRFK-RTIDMGDMSKEQENSNVSSGCSAPVVTQASLEVNKIEPCMGDAVDTGFVN 1069

Query: 1852 DHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRS 1673
            + V DEGSGI K WSSD     E+S E + GS      K  +   L        +DDL+ 
Sbjct: 1070 NLVVDEGSGIDKGWSSDLV---EKSDEFL-GSSSGSCLKNDYLRVLNDQPCCNLLDDLKL 1125

Query: 1672 -GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSSVH 1496
              ++  KK  N++    +     N  QK++   K GK++K    + LDA+  +   S +H
Sbjct: 1126 LDSLIWKKGWNQNNFVLSSNCKSNQSQKVKKGLK-GKKRKRNLVRILDASLSSEFPSLLH 1184

Query: 1495 YDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXX 1316
                N +     +                LQ     SSN  S  +               
Sbjct: 1185 --KKNEEVTGICNSSSSCSKEMQMRPLSSLQK----SSNKSSFVQPSNKQKHTAFSSKFL 1238

Query: 1315 RDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQVS 1136
              ++ +++H      Y  +   D    +  P + G  + K    +D    F  +E     
Sbjct: 1239 SCKNHLNKHQSYKVGYESESSSDAEFRTL-PGVSGSKKLKKDLTSDCFEQFQMQEP---- 1293

Query: 1135 ARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKPTKIVP 962
            A +  ++D     S +  N          R  RPVVCG  G IS+G L     KP KIV 
Sbjct: 1294 AYEEPENDKLRPFSCRKEN--------AHRITRPVVCGKYGEISSGHLAREVQKPVKIVS 1345

Query: 961  LSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTV------FKRDVDTETFLFEK 800
            L  +LK ++RC    N +PIP S  + K+    +   +         K   +T+  +F  
Sbjct: 1346 LRKVLKSSKRCTGHTNGKPIPTSKKKWKRLSIGTSSGHCCGNPGLKIKEHNETQNAIF-- 1403

Query: 799  KKEGFSRNEVCLAELSMPQKDRDSGCP---------RSLNQDILKRFSSAPLKSKFKEAR 647
                F++  V   +LSM   DR    P         ++   + +   +   LK K KE R
Sbjct: 1404 ----FNKTNV---DLSMEDLDRGGKPPVVYKGKRDAKAKQGNSVGNRAYVSLKVKNKEIR 1456

Query: 646  K-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEELCEGI 470
            K RS+ EL+ K  K+    N                             Q     LC   
Sbjct: 1457 KQRSITELTAKETKVMDMMN---------------------------SAQDQEPGLCSTA 1489

Query: 469  SRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKAPK-GR 293
            SR SI+     A I +SDAFCCVC SS+ D+IN LLECS C IRVHQACYGVS  PK   
Sbjct: 1490 SRNSIQGHMNIATI-NSDAFCCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSS 1548

Query: 292  WYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW------------------ 167
            W CRPC +NSKNIVCVLCGYGGGAMTRA+ S  IVKS LK W                  
Sbjct: 1549 WCCRPCRTNSKNIVCVLCGYGGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTTSHEVFE 1608

Query: 166  ----------DITEHSSSAVLK-KNWHKQTNVMK-------NSSGLARSEVHNSIIAGAL 41
                      D  E    +VLK K     T++MK         + ++  +VHNSI    L
Sbjct: 1609 KEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVSNFKVHNSITEAVL 1668

Query: 40   DPIIKQWVHMVCG 2
            DP +KQW+HMVCG
Sbjct: 1669 DPTVKQWIHMVCG 1681


>ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812602 isoform X3 [Glycine
            max]
          Length = 2006

 Score =  258 bits (660), Expect = 6e-66
 Identities = 249/793 (31%), Positives = 343/793 (43%), Gaps = 62/793 (7%)
 Frame = -2

Query: 2194 KANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNVKG 2024
            K   I +   L  Q S +   +T QWRDVPSK +K  C+A  + + A   +  G+ +V+ 
Sbjct: 949  KGENIEQGGKLDGQDSIKIGFRTPQWRDVPSKVRKAVCDATSLGQTATGMDWEGQDSVQL 1008

Query: 2023 QPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSC---TMDARYVN 1853
              I         I   +   EQ+ SN  SGCSAP +T+ + EVN +  C    +D  +VN
Sbjct: 1009 GNISMKRFK-RTIDMGDMSKEQENSNVSSGCSAPVVTQASLEVNKIEPCMGDAVDTGFVN 1067

Query: 1852 DHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRS 1673
            + V DEGSGI K WSSD     E+S E + GS      K  +   L        +DDL+ 
Sbjct: 1068 NLVVDEGSGIDKGWSSDLV---EKSDEFL-GSSSGSCLKNDYLRVLNDQPCCNLLDDLKL 1123

Query: 1672 -GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSSVH 1496
              ++  KK  N++    +     N  QK++   K GK++K    + LDA+  +   S +H
Sbjct: 1124 LDSLIWKKGWNQNNFVLSSNCKSNQSQKVKKGLK-GKKRKRNLVRILDASLSSEFPSLLH 1182

Query: 1495 YDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXX 1316
                N +     +                LQ     SSN  S  +               
Sbjct: 1183 --KKNEEVTGICNSSSSCSKEMQMRPLSSLQK----SSNKSSFVQPSNKQKHTAFSSKFL 1236

Query: 1315 RDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQVS 1136
              ++ +++H      Y  +   D    +  P + G  + K    +D    F  +E     
Sbjct: 1237 SCKNHLNKHQSYKVGYESESSSDAEFRTL-PGVSGSKKLKKDLTSDCFEQFQMQEP---- 1291

Query: 1135 ARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKPTKIVP 962
            A +  ++D     S +  N          R  RPVVCG  G IS+G L     KP KIV 
Sbjct: 1292 AYEEPENDKLRPFSCRKEN--------AHRITRPVVCGKYGEISSGHLAREVQKPVKIVS 1343

Query: 961  LSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTV------FKRDVDTETFLFEK 800
            L  +LK ++RC    N +PIP S  + K+    +   +         K   +T+  +F  
Sbjct: 1344 LRKVLKSSKRCTGHTNGKPIPTSKKKWKRLSIGTSSGHCCGNPGLKIKEHNETQNAIF-- 1401

Query: 799  KKEGFSRNEVCLAELSMPQKDRDSGCP---------RSLNQDILKRFSSAPLKSKFKEAR 647
                F++  V   +LSM   DR    P         ++   + +   +   LK K KE R
Sbjct: 1402 ----FNKTNV---DLSMEDLDRGGKPPVVYKGKRDAKAKQGNSVGNRAYVSLKVKNKEIR 1454

Query: 646  K-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEELCEGI 470
            K RS+ EL+ K  K+    N                             Q     LC   
Sbjct: 1455 KQRSITELTAKETKVMDMMN---------------------------SAQDQEPGLCSTA 1487

Query: 469  SRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKAPK-GR 293
            SR SI+     A I +SDAFCCVC SS+ D+IN LLECS C IRVHQACYGVS  PK   
Sbjct: 1488 SRNSIQGHMNIATI-NSDAFCCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSS 1546

Query: 292  WYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW------------------ 167
            W CRPC +NSKNIVCVLCGYGGGAMTRA+ S  IVKS LK W                  
Sbjct: 1547 WCCRPCRTNSKNIVCVLCGYGGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTTSHEVFE 1606

Query: 166  ----------DITEHSSSAVLK-KNWHKQTNVMK-------NSSGLARSEVHNSIIAGAL 41
                      D  E    +VLK K     T++MK         + ++  +VHNSI    L
Sbjct: 1607 KEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVSNFKVHNSITEAVL 1666

Query: 40   DPIIKQWVHMVCG 2
            DP +KQW+HMVCG
Sbjct: 1667 DPTVKQWIHMVCG 1679


>ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812602 isoform X2 [Glycine
            max]
          Length = 2007

 Score =  258 bits (660), Expect = 6e-66
 Identities = 249/793 (31%), Positives = 343/793 (43%), Gaps = 62/793 (7%)
 Frame = -2

Query: 2194 KANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNVKG 2024
            K   I +   L  Q S +   +T QWRDVPSK +K  C+A  + + A   +  G+ +V+ 
Sbjct: 950  KGENIEQGGKLDGQDSIKIGFRTPQWRDVPSKVRKAVCDATSLGQTATGMDWEGQDSVQL 1009

Query: 2023 QPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSC---TMDARYVN 1853
              I         I   +   EQ+ SN  SGCSAP +T+ + EVN +  C    +D  +VN
Sbjct: 1010 GNISMKRFK-RTIDMGDMSKEQENSNVSSGCSAPVVTQASLEVNKIEPCMGDAVDTGFVN 1068

Query: 1852 DHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRS 1673
            + V DEGSGI K WSSD     E+S E + GS      K  +   L        +DDL+ 
Sbjct: 1069 NLVVDEGSGIDKGWSSDLV---EKSDEFL-GSSSGSCLKNDYLRVLNDQPCCNLLDDLKL 1124

Query: 1672 -GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSSVH 1496
              ++  KK  N++    +     N  QK++   K GK++K    + LDA+  +   S +H
Sbjct: 1125 LDSLIWKKGWNQNNFVLSSNCKSNQSQKVKKGLK-GKKRKRNLVRILDASLSSEFPSLLH 1183

Query: 1495 YDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXX 1316
                N +     +                LQ     SSN  S  +               
Sbjct: 1184 --KKNEEVTGICNSSSSCSKEMQMRPLSSLQK----SSNKSSFVQPSNKQKHTAFSSKFL 1237

Query: 1315 RDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQVS 1136
              ++ +++H      Y  +   D    +  P + G  + K    +D    F  +E     
Sbjct: 1238 SCKNHLNKHQSYKVGYESESSSDAEFRTL-PGVSGSKKLKKDLTSDCFEQFQMQEP---- 1292

Query: 1135 ARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKPTKIVP 962
            A +  ++D     S +  N          R  RPVVCG  G IS+G L     KP KIV 
Sbjct: 1293 AYEEPENDKLRPFSCRKEN--------AHRITRPVVCGKYGEISSGHLAREVQKPVKIVS 1344

Query: 961  LSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTV------FKRDVDTETFLFEK 800
            L  +LK ++RC    N +PIP S  + K+    +   +         K   +T+  +F  
Sbjct: 1345 LRKVLKSSKRCTGHTNGKPIPTSKKKWKRLSIGTSSGHCCGNPGLKIKEHNETQNAIF-- 1402

Query: 799  KKEGFSRNEVCLAELSMPQKDRDSGCP---------RSLNQDILKRFSSAPLKSKFKEAR 647
                F++  V   +LSM   DR    P         ++   + +   +   LK K KE R
Sbjct: 1403 ----FNKTNV---DLSMEDLDRGGKPPVVYKGKRDAKAKQGNSVGNRAYVSLKVKNKEIR 1455

Query: 646  K-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEELCEGI 470
            K RS+ EL+ K  K+    N                             Q     LC   
Sbjct: 1456 KQRSITELTAKETKVMDMMN---------------------------SAQDQEPGLCSTA 1488

Query: 469  SRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKAPK-GR 293
            SR SI+     A I +SDAFCCVC SS+ D+IN LLECS C IRVHQACYGVS  PK   
Sbjct: 1489 SRNSIQGHMNIATI-NSDAFCCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSS 1547

Query: 292  WYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW------------------ 167
            W CRPC +NSKNIVCVLCGYGGGAMTRA+ S  IVKS LK W                  
Sbjct: 1548 WCCRPCRTNSKNIVCVLCGYGGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTTSHEVFE 1607

Query: 166  ----------DITEHSSSAVLK-KNWHKQTNVMK-------NSSGLARSEVHNSIIAGAL 41
                      D  E    +VLK K     T++MK         + ++  +VHNSI    L
Sbjct: 1608 KEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVSNFKVHNSITEAVL 1667

Query: 40   DPIIKQWVHMVCG 2
            DP +KQW+HMVCG
Sbjct: 1668 DPTVKQWIHMVCG 1680


>ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812602 isoform X1 [Glycine
            max]
          Length = 2008

 Score =  258 bits (660), Expect = 6e-66
 Identities = 249/793 (31%), Positives = 343/793 (43%), Gaps = 62/793 (7%)
 Frame = -2

Query: 2194 KANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNVKG 2024
            K   I +   L  Q S +   +T QWRDVPSK +K  C+A  + + A   +  G+ +V+ 
Sbjct: 951  KGENIEQGGKLDGQDSIKIGFRTPQWRDVPSKVRKAVCDATSLGQTATGMDWEGQDSVQL 1010

Query: 2023 QPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSC---TMDARYVN 1853
              I         I   +   EQ+ SN  SGCSAP +T+ + EVN +  C    +D  +VN
Sbjct: 1011 GNISMKRFK-RTIDMGDMSKEQENSNVSSGCSAPVVTQASLEVNKIEPCMGDAVDTGFVN 1069

Query: 1852 DHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDLRS 1673
            + V DEGSGI K WSSD     E+S E + GS      K  +   L        +DDL+ 
Sbjct: 1070 NLVVDEGSGIDKGWSSDLV---EKSDEFL-GSSSGSCLKNDYLRVLNDQPCCNLLDDLKL 1125

Query: 1672 -GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSSVH 1496
              ++  KK  N++    +     N  QK++   K GK++K    + LDA+  +   S +H
Sbjct: 1126 LDSLIWKKGWNQNNFVLSSNCKSNQSQKVKKGLK-GKKRKRNLVRILDASLSSEFPSLLH 1184

Query: 1495 YDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXXXX 1316
                N +     +                LQ     SSN  S  +               
Sbjct: 1185 --KKNEEVTGICNSSSSCSKEMQMRPLSSLQK----SSNKSSFVQPSNKQKHTAFSSKFL 1238

Query: 1315 RDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQVS 1136
              ++ +++H      Y  +   D    +  P + G  + K    +D    F  +E     
Sbjct: 1239 SCKNHLNKHQSYKVGYESESSSDAEFRTL-PGVSGSKKLKKDLTSDCFEQFQMQEP---- 1293

Query: 1135 ARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVVCGNSGIISNGKLTSGQTKPTKIVP 962
            A +  ++D     S +  N          R  RPVVCG  G IS+G L     KP KIV 
Sbjct: 1294 AYEEPENDKLRPFSCRKEN--------AHRITRPVVCGKYGEISSGHLAREVQKPVKIVS 1345

Query: 961  LSLILKKARRCVVSENEEPIPVSVSETKKSCFRSEYNNTV------FKRDVDTETFLFEK 800
            L  +LK ++RC    N +PIP S  + K+    +   +         K   +T+  +F  
Sbjct: 1346 LRKVLKSSKRCTGHTNGKPIPTSKKKWKRLSIGTSSGHCCGNPGLKIKEHNETQNAIF-- 1403

Query: 799  KKEGFSRNEVCLAELSMPQKDRDSGCP---------RSLNQDILKRFSSAPLKSKFKEAR 647
                F++  V   +LSM   DR    P         ++   + +   +   LK K KE R
Sbjct: 1404 ----FNKTNV---DLSMEDLDRGGKPPVVYKGKRDAKAKQGNSVGNRAYVSLKVKNKEIR 1456

Query: 646  K-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEELCEGI 470
            K RS+ EL+ K  K+    N                             Q     LC   
Sbjct: 1457 KQRSITELTAKETKVMDMMN---------------------------SAQDQEPGLCSTA 1489

Query: 469  SRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGVSKAPK-GR 293
            SR SI+     A I +SDAFCCVC SS+ D+IN LLECS C IRVHQACYGVS  PK   
Sbjct: 1490 SRNSIQGHMNIATI-NSDAFCCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSS 1548

Query: 292  WYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW------------------ 167
            W CRPC +NSKNIVCVLCGYGGGAMTRA+ S  IVKS LK W                  
Sbjct: 1549 WCCRPCRTNSKNIVCVLCGYGGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTTSHEVFE 1608

Query: 166  ----------DITEHSSSAVLK-KNWHKQTNVMK-------NSSGLARSEVHNSIIAGAL 41
                      D  E    +VLK K     T++MK         + ++  +VHNSI    L
Sbjct: 1609 KEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVSNFKVHNSITEAVL 1668

Query: 40   DPIIKQWVHMVCG 2
            DP +KQW+HMVCG
Sbjct: 1669 DPTVKQWIHMVCG 1681


>ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816713 isoform X1 [Glycine
            max]
          Length = 2032

 Score =  251 bits (641), Expect = 1e-63
 Identities = 246/803 (30%), Positives = 344/803 (42%), Gaps = 70/803 (8%)
 Frame = -2

Query: 2200 KDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNV 2030
            K K   I +   L  Q S +    T QWRDVPSK +K  C+A  +++ A   +  G+  V
Sbjct: 975  KGKNILIEQGGKLDGQDSIKIGFHTPQWRDVPSKVRKAVCDATSLDQTATGLDWEGQDGV 1034

Query: 2029 KGQPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDA---RY 1859
            +   I         I   +   EQ+ SN  SGCSAP +T+ + EVN + SCT DA    +
Sbjct: 1035 QLGNISMKRFK-RTIDMGDISKEQKSSNVSSGCSAPVVTQASVEVNKIDSCTDDAVDTGF 1093

Query: 1858 VNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDL 1679
            VN+ V DEGSGI + WSSD     ERS E + GS      K  +   L        +DDL
Sbjct: 1094 VNNLVVDEGSGIDQGWSSDLV---ERSDEFL-GSTTGSCLKNDYLRVLYDQPCCNLLDDL 1149

Query: 1678 RS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSS 1502
            +   ++  KK  N++    +     N  QK++   K GK++K    + +DA       SS
Sbjct: 1150 KLLDSLIWKKGRNQNHFVLSSNCKTNQSQKVKKVLK-GKKRKRNVVRIVDA-------SS 1201

Query: 1501 VHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXX 1322
                  N +G     +             H L  +   SSN  S  +             
Sbjct: 1202 SLLHKKNEEGAG---ICNSSSSLSREMQMHSLSSLKK-SSNKSSFVQPSNKQKHTAYSSK 1257

Query: 1321 XXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQ 1142
                ++ +++H      Y  +   D    +  P + G  + +   ++D    F  +E   
Sbjct: 1258 FLSCKNRLNKHQSFKVGYESESSSDAEFHTL-PGVSGTKKLEKDLSSDCFEQFQMQEL-- 1314

Query: 1141 VSARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVV-CGNSGIISNGKLTSGQTKPTK 971
              A +  ++D     S +  N          R  RPVV CG  G ISNG L     KP K
Sbjct: 1315 --AYEEPENDKLRPFSCRKEN--------AHRITRPVVVCGKYGEISNGHLAREVQKPAK 1364

Query: 970  IVPLSLILKKARRCVVSENEEPIPVSVSETKK--------SCFRS---------EYNNTV 842
            IV LS +LK ++RC+   N +P   S  + K+         C R+         E  NT+
Sbjct: 1365 IVSLSKVLKSSKRCMGHTNGKPRLTSKKKWKRLSIETSSGHCCRNPGLKIKEHNETENTI 1424

Query: 841  FKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSK 662
            F  + + +  + + ++ G             P   +     ++   D +   ++  LK K
Sbjct: 1425 FLNETNVDVSMEDLERGG-----------KPPAVYKGKRDAKAKQGDSVGNRANISLKVK 1473

Query: 661  FKEARK-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEE 485
             KE RK RS++EL+ K  K+                               D  +C  ++
Sbjct: 1474 NKEIRKQRSINELTAKETKVM------------------------------DMTKCAQDQ 1503

Query: 484  ---LCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGV 314
               LC   SR SI+     + I +SDAFCCVC  S  D+INCLLECS C IRVHQACYGV
Sbjct: 1504 EPGLCGTKSRNSIQGHTSISTI-NSDAFCCVCRRSTNDKINCLLECSRCLIRVHQACYGV 1562

Query: 313  SKAPK-GRWYCRPCSSNSKNIVCVLCGYGGGAMTRALKSLNIVKSFLKAW---------- 167
            S  PK   W CRPC +NSKNI CVLCGYGGGAMTRA+ S  IVKS LK W          
Sbjct: 1563 STLPKKSSWCCRPCRTNSKNIACVLCGYGGGAMTRAIMSHTIVKSLLKVWNCEKDGMPRD 1622

Query: 166  ------------------DITEHSSSAVLK----------KNWHKQTNVMKNSSGLARSE 71
                              D  E    +VLK           N     ++    +  +  +
Sbjct: 1623 TTSCEVLEKEIDAFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSNFK 1682

Query: 70   VHNSIIAGALDPIIKQWVHMVCG 2
            VHNSI  G LDP +KQW+HMVCG
Sbjct: 1683 VHNSITEGVLDPTVKQWIHMVCG 1705


>ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816713 isoform X3 [Glycine
            max]
          Length = 2033

 Score =  247 bits (631), Expect = 1e-62
 Identities = 247/806 (30%), Positives = 345/806 (42%), Gaps = 73/806 (9%)
 Frame = -2

Query: 2200 KDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNV 2030
            K K   I +   L  Q S +    T QWRDVPSK +K  C+A  +++ A   +  G+  V
Sbjct: 973  KGKNILIEQGGKLDGQDSIKIGFHTPQWRDVPSKVRKAVCDATSLDQTATGLDWEGQDGV 1032

Query: 2029 KGQPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDA---RY 1859
            +   I         I   +   EQ+ SN  SGCSAP +T+ + EVN + SCT DA    +
Sbjct: 1033 QLGNISMKRFK-RTIDMGDISKEQKSSNVSSGCSAPVVTQASVEVNKIDSCTDDAVDTGF 1091

Query: 1858 VNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDL 1679
            VN+ V DEGSGI + WSSD     ERS E + GS      K  +   L        +DDL
Sbjct: 1092 VNNLVVDEGSGIDQGWSSDLV---ERSDEFL-GSTTGSCLKNDYLRVLYDQPCCNLLDDL 1147

Query: 1678 RS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSS 1502
            +   ++  KK  N++    +     N  QK++   K GK++K    + +DA       SS
Sbjct: 1148 KLLDSLIWKKGRNQNHFVLSSNCKTNQSQKVKKVLK-GKKRKRNVVRIVDA-------SS 1199

Query: 1501 VHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXX 1322
                  N +G     +             H L  +   SSN  S  +             
Sbjct: 1200 SLLHKKNEEGAG---ICNSSSSLSREMQMHSLSSLKK-SSNKSSFVQPSNKQKHTAYSSK 1255

Query: 1321 XXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQ 1142
                ++ +++H      Y  +   D    +  P + G  + +   ++D    F  +E   
Sbjct: 1256 FLSCKNRLNKHQSFKVGYESESSSDAEFHTL-PGVSGTKKLEKDLSSDCFEQFQMQEL-- 1312

Query: 1141 VSARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVV-CGNSGIISNGKLTSGQTKPTK 971
              A +  ++D     S +  N          R  RPVV CG  G ISNG L     KP K
Sbjct: 1313 --AYEEPENDKLRPFSCRKEN--------AHRITRPVVVCGKYGEISNGHLAREVQKPAK 1362

Query: 970  IVPLSLILKKARRCVVSENEEPIPVSVSETKK--------SCFRS---------EYNNTV 842
            IV LS +LK ++RC+   N +P   S  + K+         C R+         E  NT+
Sbjct: 1363 IVSLSKVLKSSKRCMGHTNGKPRLTSKKKWKRLSIETSSGHCCRNPGLKIKEHNETENTI 1422

Query: 841  FKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSK 662
            F  + + +  + + ++ G             P   +     ++   D +   ++  LK K
Sbjct: 1423 FLNETNVDVSMEDLERGG-----------KPPAVYKGKRDAKAKQGDSVGNRANISLKVK 1471

Query: 661  FKEARK-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEE 485
             KE RK RS++EL+ K  K+                               D  +C  ++
Sbjct: 1472 NKEIRKQRSINELTAKETKVM------------------------------DMTKCAQDQ 1501

Query: 484  ---LCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGV 314
               LC   SR SI+     + I +SDAFCCVC  S  D+INCLLECS C IRVHQACYGV
Sbjct: 1502 EPGLCGTKSRNSIQGHTSISTI-NSDAFCCVCRRSTNDKINCLLECSRCLIRVHQACYGV 1560

Query: 313  SKAP-KGRWYCRPCSSNSKNIV---CVLCGYGGGAMTRALKSLNIVKSFLKAW------- 167
            S  P K  W CRPC +NSKNIV   CVLCGYGGGAMTRA+ S  IVKS LK W       
Sbjct: 1561 STLPKKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMTRAIMSHTIVKSLLKVWNCEKDGM 1620

Query: 166  ---------------------DITEHSSSAVLK----------KNWHKQTNVMKNSSGLA 80
                                 D  E    +VLK           N     ++    +  +
Sbjct: 1621 PRDTTSCEVLEKEIDAFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFS 1680

Query: 79   RSEVHNSIIAGALDPIIKQWVHMVCG 2
              +VHNSI  G LDP +KQW+HMVCG
Sbjct: 1681 NFKVHNSITEGVLDPTVKQWIHMVCG 1706


>ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816713 isoform X2 [Glycine
            max]
          Length = 2035

 Score =  247 bits (631), Expect = 1e-62
 Identities = 247/806 (30%), Positives = 345/806 (42%), Gaps = 73/806 (9%)
 Frame = -2

Query: 2200 KDKANAIGRYDNLKRQSSTQNDCQTSQWRDVPSK-QKGCCNAPCIERPAEVFN--GRRNV 2030
            K K   I +   L  Q S +    T QWRDVPSK +K  C+A  +++ A   +  G+  V
Sbjct: 975  KGKNILIEQGGKLDGQDSIKIGFHTPQWRDVPSKVRKAVCDATSLDQTATGLDWEGQDGV 1034

Query: 2029 KGQPIETAAVVINEIQEAESLNEQQMSNGFSGCSAPAITEVTGEVNNVGSCTMDA---RY 1859
            +   I         I   +   EQ+ SN  SGCSAP +T+ + EVN + SCT DA    +
Sbjct: 1035 QLGNISMKRFK-RTIDMGDISKEQKSSNVSSGCSAPVVTQASVEVNKIDSCTDDAVDTGF 1093

Query: 1858 VNDHVFDEGSGIAKCWSSDETFDNERSTETVNGSGKLDMAKRGFSYGLPALSSEGPIDDL 1679
            VN+ V DEGSGI + WSSD     ERS E + GS      K  +   L        +DDL
Sbjct: 1094 VNNLVVDEGSGIDQGWSSDLV---ERSDEFL-GSTTGSCLKNDYLRVLYDQPCCNLLDDL 1149

Query: 1678 RS-GNIRLKKLPNRSPDACTVYKSFNCKQKLESDFKVGKQKKPTKWKRLDATFPATGLSS 1502
            +   ++  KK  N++    +     N  QK++   K GK++K    + +DA       SS
Sbjct: 1150 KLLDSLIWKKGRNQNHFVLSSNCKTNQSQKVKKVLK-GKKRKRNVVRIVDA-------SS 1201

Query: 1501 VHYDSPNSQGQDEIHLXXXXXXXXXXXSNHGLQIMCGISSNGHSNFKRXXXXXXXXXXXX 1322
                  N +G     +             H L  +   SSN  S  +             
Sbjct: 1202 SLLHKKNEEGAG---ICNSSSSLSREMQMHSLSSLKK-SSNKSSFVQPSNKQKHTAYSSK 1257

Query: 1321 XXRDRHGIHEHHGDWEDYSRKKLKDEMLCSRNPKLLGENRFKWCCNTDANRHFPRRETNQ 1142
                ++ +++H      Y  +   D    +  P + G  + +   ++D    F  +E   
Sbjct: 1258 FLSCKNRLNKHQSFKVGYESESSSDAEFHTL-PGVSGTKKLEKDLSSDCFEQFQMQEL-- 1314

Query: 1141 VSARKAVKDDSAG--SVQHSNFFINHIGMCDRQPRPVV-CGNSGIISNGKLTSGQTKPTK 971
              A +  ++D     S +  N          R  RPVV CG  G ISNG L     KP K
Sbjct: 1315 --AYEEPENDKLRPFSCRKEN--------AHRITRPVVVCGKYGEISNGHLAREVQKPAK 1364

Query: 970  IVPLSLILKKARRCVVSENEEPIPVSVSETKK--------SCFRS---------EYNNTV 842
            IV LS +LK ++RC+   N +P   S  + K+         C R+         E  NT+
Sbjct: 1365 IVSLSKVLKSSKRCMGHTNGKPRLTSKKKWKRLSIETSSGHCCRNPGLKIKEHNETENTI 1424

Query: 841  FKRDVDTETFLFEKKKEGFSRNEVCLAELSMPQKDRDSGCPRSLNQDILKRFSSAPLKSK 662
            F  + + +  + + ++ G             P   +     ++   D +   ++  LK K
Sbjct: 1425 FLNETNVDVSMEDLERGG-----------KPPAVYKGKRDAKAKQGDSVGNRANISLKVK 1473

Query: 661  FKEARK-RSLDELSGKGNKLSPAKNFLRKSLRCSFRTKHRFHEHSCLLKGTDDNQCHAEE 485
             KE RK RS++EL+ K  K+                               D  +C  ++
Sbjct: 1474 NKEIRKQRSINELTAKETKVM------------------------------DMTKCAQDQ 1503

Query: 484  ---LCEGISRKSIKDRKCQAFIVDSDAFCCVCGSSNEDEINCLLECSVCSIRVHQACYGV 314
               LC   SR SI+     + I +SDAFCCVC  S  D+INCLLECS C IRVHQACYGV
Sbjct: 1504 EPGLCGTKSRNSIQGHTSISTI-NSDAFCCVCRRSTNDKINCLLECSRCLIRVHQACYGV 1562

Query: 313  SKAP-KGRWYCRPCSSNSKNIV---CVLCGYGGGAMTRALKSLNIVKSFLKAW------- 167
            S  P K  W CRPC +NSKNIV   CVLCGYGGGAMTRA+ S  IVKS LK W       
Sbjct: 1563 STLPKKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMTRAIMSHTIVKSLLKVWNCEKDGM 1622

Query: 166  ---------------------DITEHSSSAVLK----------KNWHKQTNVMKNSSGLA 80
                                 D  E    +VLK           N     ++    +  +
Sbjct: 1623 PRDTTSCEVLEKEIDAFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFS 1682

Query: 79   RSEVHNSIIAGALDPIIKQWVHMVCG 2
              +VHNSI  G LDP +KQW+HMVCG
Sbjct: 1683 NFKVHNSITEGVLDPTVKQWIHMVCG 1708

