BLASTX nr result

ID: Akebia27_contig00023759 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00023759
         (1431 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21105.3| unnamed protein product [Vitis vinifera]              346   2e-92
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   275   4e-71
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   275   4e-71
ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, part...   275   4e-71
ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [...   272   3e-70
ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma...   272   3e-70
ref|XP_002519906.1| conserved hypothetical protein [Ricinus comm...   250   1e-63
ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prun...   240   1e-60
ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Popu...   234   9e-59
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   225   3e-56
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   202   4e-49
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   191   5e-46
ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816...   178   5e-42
ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816...   178   5e-42
ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816...   178   5e-42
ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812...   174   7e-41
ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812...   174   9e-41
ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812...   174   9e-41
ref|XP_006596086.1| PREDICTED: uncharacterized protein LOC100812...   174   9e-41
ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812...   174   9e-41

>emb|CBI21105.3| unnamed protein product [Vitis vinifera]
          Length = 1012

 Score =  346 bits (887), Expect = 2e-92
 Identities = 188/401 (46%), Positives = 243/401 (60%), Gaps = 8/401 (1%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXX-----RNQMAMNAGQNFHPYVAQELRSMNSGVI 362
            MDN+WQ KC                       RNQM +NAG+ F P +A E RS   G+I
Sbjct: 1    MDNAWQVKCSSSWQSATPPSMPSSSQHPPQESRNQMEINAGRYF-PTIAHEQRSAALGMI 59

Query: 363  QDSMVFNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDN 542
            Q+ +  NTL++GS R    ELGNSFLALLSGP   +QC++Q L + KP   S KLP+  +
Sbjct: 60   QEPLFSNTLNLGSYRSGHAELGNSFLALLSGPPSLLQCDLQQLLNPKPICTSNKLPVYSS 119

Query: 543  GLMAGGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQM 722
             +     G GV  +P G LS+  GY+   SG++  P++SS TA ++ CS  S L+  LQ 
Sbjct: 120  SVTVSTAGSGVPHAPTGSLSENLGYQKPRSGMDFCPIVSSTTAVSTNCSSTSVLHDALQA 179

Query: 723  GNLNRYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPS 902
             NLN  +S++ K  IHH V  N+   + SS + GW   T   + G+ + TN+ AS K PS
Sbjct: 180  ANLNLQSSDLAKATIHHMVPRNEKVREFSSLKGGWPVNTGSANFGKLHGTNIHASQKRPS 239

Query: 903  NRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLC 1082
               S + DH + F SG PRVFC  T GDLL+SNT LLGVVC CH  HMS++ FCEHS L 
Sbjct: 240  EASSSLCDHQATFTSGCPRVFCFGTSGDLLLSNTGLLGVVCLCHCWHMSVSKFCEHSELR 299

Query: 1083 AVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNT 1262
             VNPGDAVR++SGET+AQWR+ YF KFGIRVP+D SGWDWP+GISAT G +K   +VP+ 
Sbjct: 300  DVNPGDAVRMDSGETIAQWRKQYFQKFGIRVPEDQSGWDWPEGISATAGFLKSSVTVPSL 359

Query: 1263 SKNSEMLRRIDPFVGSSA---RSGQPWNSFVSPNNSHAEQS 1376
             K S+    +   VGSS    R  QPW++ V P N    Q+
Sbjct: 360  YKKSD----LSHLVGSSGDLLRFEQPWDNVVFPKNPRTGQN 396


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  275 bits (703), Expect = 4e-71
 Identities = 161/394 (40%), Positives = 217/394 (55%), Gaps = 1/394 (0%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+NSWQ KC                  RNQ  M++G   +P+    LRS   G +QDS V
Sbjct: 1    MENSWQIKCGSSTQPMASSTSLET---RNQREMDSGYCSYPHGTHGLRSSGRGKVQDSSV 57

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKP-SMDSTKLPMPDNGLMA 554
             N     SCR    ELGNSFLALLS P   +QC+ +  S+ K  +  S+KLP     +++
Sbjct: 58   PNIRIGSSCRQGNAELGNSFLALLSAPPSLLQCDFKEQSNLKSFNASSSKLPFDGGVVIS 117

Query: 555  GGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
               G GV     GLLS+ Q  +N+ +G    P+ SSR  + S CS    L+  L+  N++
Sbjct: 118  TSVGSGVPPIANGLLSECQSNQNVQNGAS--PIFSSRVVANSNCSTKYGLHDGLETVNVS 175

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSNRKS 914
              +S++ K  IH  V  N+   D SS +  W   T+     +   + +  S K P    S
Sbjct: 176  LQSSDLAKAIIHQLVSSNERAKDFSSIKGKW-HNTSLGHAAKIPSSCIPISHKEPLQSNS 234

Query: 915  FVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAVNP 1094
             +    SA  S  PRV CL   G+LL+SNT LLG+VCSCH  H S+  FCEH GL  VNP
Sbjct: 235  SLPCLPSACTSECPRVICLGASGNLLLSNTGLLGIVCSCHHFHTSVAKFCEHLGLYDVNP 294

Query: 1095 GDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSKNS 1274
            GDAVR+ESGET+AQWR+LYF KFGIRVPDD +GWDWP+ +SA  GLVK   +  N    S
Sbjct: 295  GDAVRMESGETIAQWRKLYFRKFGIRVPDDQTGWDWPEALSAPAGLVKSSMAASNMPNYS 354

Query: 1275 EMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQS 1376
            ++ + +    G   + GQPW+S V P N + +++
Sbjct: 355  DLAKLVSS-SGGLIKRGQPWDSIVYPKNPYTDKN 387


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  275 bits (703), Expect = 4e-71
 Identities = 161/394 (40%), Positives = 217/394 (55%), Gaps = 1/394 (0%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+NSWQ KC                  RNQ  M++G   +P+    LRS   G +QDS V
Sbjct: 1    MENSWQIKCGSSTQPMASSTSLET---RNQREMDSGYCSYPHGTHGLRSSGRGKVQDSSV 57

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKP-SMDSTKLPMPDNGLMA 554
             N     SCR    ELGNSFLALLS P   +QC+ +  S+ K  +  S+KLP     +++
Sbjct: 58   PNIRIGSSCRQGNAELGNSFLALLSAPPSLLQCDFKEQSNLKSFNASSSKLPFDGGVVIS 117

Query: 555  GGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
               G GV     GLLS+ Q  +N+ +G    P+ SSR  + S CS    L+  L+  N++
Sbjct: 118  TSVGSGVPPIANGLLSECQSNQNVQNGAS--PIFSSRVVANSNCSTKYGLHDGLETVNVS 175

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSNRKS 914
              +S++ K  IH  V  N+   D SS +  W   T+     +   + +  S K P    S
Sbjct: 176  LQSSDLAKAIIHQLVSSNERAKDFSSIKGKW-HNTSLGHAAKIPSSCIPISHKEPLQSNS 234

Query: 915  FVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAVNP 1094
             +    SA  S  PRV CL   G+LL+SNT LLG+VCSCH  H S+  FCEH GL  VNP
Sbjct: 235  SLPCLPSACTSECPRVICLGASGNLLLSNTGLLGIVCSCHHFHTSVAKFCEHLGLYDVNP 294

Query: 1095 GDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSKNS 1274
            GDAVR+ESGET+AQWR+LYF KFGIRVPDD +GWDWP+ +SA  GLVK   +  N    S
Sbjct: 295  GDAVRMESGETIAQWRKLYFRKFGIRVPDDQTGWDWPEALSAPAGLVKSSMAASNMPNYS 354

Query: 1275 EMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQS 1376
            ++ + +    G   + GQPW+S V P N + +++
Sbjct: 355  DLAKLVSS-SGGLIKRGQPWDSIVYPKNPYTDKN 387


>ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, partial [Citrus clementina]
            gi|557553576|gb|ESR63590.1| hypothetical protein
            CICLE_v10010345mg, partial [Citrus clementina]
          Length = 938

 Score =  275 bits (703), Expect = 4e-71
 Identities = 161/394 (40%), Positives = 217/394 (55%), Gaps = 1/394 (0%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+NSWQ KC                  RNQ  M++G   +P+    LRS   G +QDS V
Sbjct: 1    MENSWQIKCGSSTQPMASSTSLET---RNQREMDSGYCSYPHGTHGLRSSGRGKVQDSSV 57

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKP-SMDSTKLPMPDNGLMA 554
             N     SCR    ELGNSFLALLS P   +QC+ +  S+ K  +  S+KLP     +++
Sbjct: 58   PNIRIGSSCRQGNAELGNSFLALLSAPPSLLQCDFKEQSNLKSFNASSSKLPFDGGVVIS 117

Query: 555  GGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
               G GV     GLLS+ Q  +N+ +G    P+ SSR  + S CS    L+  L+  N++
Sbjct: 118  TSVGSGVPPIANGLLSECQSNQNVQNGAS--PIFSSRVVANSNCSTKYGLHDGLETVNVS 175

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSNRKS 914
              +S++ K  IH  V  N+   D SS +  W   T+     +   + +  S K P    S
Sbjct: 176  LQSSDLAKAIIHQLVSSNERAKDFSSIKGKW-HNTSLGHAAKIPSSCIPISHKEPLQSNS 234

Query: 915  FVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAVNP 1094
             +    SA  S  PRV CL   G+LL+SNT LLG+VCSCH  H S+  FCEH GL  VNP
Sbjct: 235  SLPCLPSACTSECPRVICLGASGNLLLSNTGLLGIVCSCHHFHTSVAKFCEHLGLYDVNP 294

Query: 1095 GDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSKNS 1274
            GDAVR+ESGET+AQWR+LYF KFGIRVPDD +GWDWP+ +SA  GLVK   +  N    S
Sbjct: 295  GDAVRMESGETIAQWRKLYFRKFGIRVPDDQTGWDWPEALSAPAGLVKSSMAASNMPNYS 354

Query: 1275 EMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQS 1376
            ++ + +    G   + GQPW+S V P N + +++
Sbjct: 355  DLAKLVSS-SGGLIKRGQPWDSIVYPKNPYTDKN 387


>ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
            gi|508782151|gb|EOY29407.1| Uncharacterized protein
            isoform 8, partial [Theobroma cacao]
          Length = 2068

 Score =  272 bits (695), Expect = 3e-70
 Identities = 164/396 (41%), Positives = 219/396 (55%), Gaps = 3/396 (0%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXX-RNQMAMNAGQNFHPYVAQELRSMNSGVIQDSM 374
            MDNSW+ K                    +NQM +N+GQ FH +VAQ+L S   G ++D M
Sbjct: 1    MDNSWRIKFDSTLQSSMPSMASSASQEPQNQMVINSGQYFHQHVAQDLSSTLHGRMRDPM 60

Query: 375  VFNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMA 554
              N+ ++ S +    E  NSFLALLSG    +QC+ Q LSS K    S  + + D     
Sbjct: 61   PPNSSNLCSIKSNHSEQANSFLALLSGSPSLLQCDFQELSSRKVFNASRSVNIND----- 115

Query: 555  GGDGCGVQLSPVG--LLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGN 728
                 G ++ P+   LLS+    +N  +G   V  + SR   +S  S VS L+  L   N
Sbjct: 116  ----FGSEIPPIAGALLSETLSNQNTQNGANSV--VPSRLVLSSTGSGVSFLHGSLHASN 169

Query: 729  LNRYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSNR 908
             N   S++ K   H R+ G +   D+ +    W   ++    G     N+Q S K     
Sbjct: 170  SNLQTSDLAKVVNHLRLPGTEKVKDVPTLNGDWYGTSSTTKAGNLYSKNIQMSTKRAEEL 229

Query: 909  KSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAV 1088
             S  SD SS   SG PRVFCL T G LL+SNT LLG+VCSCH  H S++ FCEHSGLC V
Sbjct: 230  NSSTSDQSSTNLSGCPRVFCLGTGGYLLLSNTGLLGIVCSCHFFHTSVSKFCEHSGLCDV 289

Query: 1089 NPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSK 1268
            NPGDAVR+ESGET+AQWR+LYF KFGIRVP+D+SGWDWP+G+  T GLVK  A+ P  SK
Sbjct: 290  NPGDAVRMESGETIAQWRKLYFEKFGIRVPEDHSGWDWPEGLLPTAGLVKSSATEPKISK 349

Query: 1269 NSEMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQS 1376
             S ++ +    VGSS    +  ++ +SP+N    Q+
Sbjct: 350  TSHLVNQ----VGSSQGLSRCMDNTMSPSNPQTGQN 381


>ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508782146|gb|EOY29402.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 2104

 Score =  272 bits (695), Expect = 3e-70
 Identities = 164/396 (41%), Positives = 219/396 (55%), Gaps = 3/396 (0%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXX-RNQMAMNAGQNFHPYVAQELRSMNSGVIQDSM 374
            MDNSW+ K                    +NQM +N+GQ FH +VAQ+L S   G ++D M
Sbjct: 1    MDNSWRIKFDSTLQSSMPSMASSASQEPQNQMVINSGQYFHQHVAQDLSSTLHGRMRDPM 60

Query: 375  VFNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMA 554
              N+ ++ S +    E  NSFLALLSG    +QC+ Q LSS K    S  + + D     
Sbjct: 61   PPNSSNLCSIKSNHSEQANSFLALLSGSPSLLQCDFQELSSRKVFNASRSVNIND----- 115

Query: 555  GGDGCGVQLSPVG--LLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGN 728
                 G ++ P+   LLS+    +N  +G   V  + SR   +S  S VS L+  L   N
Sbjct: 116  ----FGSEIPPIAGALLSETLSNQNTQNGANSV--VPSRLVLSSTGSGVSFLHGSLHASN 169

Query: 729  LNRYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSNR 908
             N   S++ K   H R+ G +   D+ +    W   ++    G     N+Q S K     
Sbjct: 170  SNLQTSDLAKVVNHLRLPGTEKVKDVPTLNGDWYGTSSTTKAGNLYSKNIQMSTKRAEEL 229

Query: 909  KSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAV 1088
             S  SD SS   SG PRVFCL T G LL+SNT LLG+VCSCH  H S++ FCEHSGLC V
Sbjct: 230  NSSTSDQSSTNLSGCPRVFCLGTGGYLLLSNTGLLGIVCSCHFFHTSVSKFCEHSGLCDV 289

Query: 1089 NPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSK 1268
            NPGDAVR+ESGET+AQWR+LYF KFGIRVP+D+SGWDWP+G+  T GLVK  A+ P  SK
Sbjct: 290  NPGDAVRMESGETIAQWRKLYFEKFGIRVPEDHSGWDWPEGLLPTAGLVKSSATEPKISK 349

Query: 1269 NSEMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQS 1376
             S ++ +    VGSS    +  ++ +SP+N    Q+
Sbjct: 350  TSHLVNQ----VGSSQGLSRCMDNTMSPSNPQTGQN 381


>ref|XP_002519906.1| conserved hypothetical protein [Ricinus communis]
            gi|223540952|gb|EEF42510.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 903

 Score =  250 bits (639), Expect = 1e-63
 Identities = 149/385 (38%), Positives = 214/385 (55%), Gaps = 2/385 (0%)
 Frame = +3

Query: 279  RNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMVFNTLDIGSCRPEMPELGNSFLALLSGP 458
            R+Q   N GQ F  +  Q+LR+   G + D     T  +  C     +LGNSFLALLSGP
Sbjct: 20   RDQTGRNPGQYFISHAGQDLRTQVHGRMLDP----TFPLSPCSSSHADLGNSFLALLSGP 75

Query: 459  SPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAGGDGCGVQLSPVGLLSQRQGYENMGSGV 638
            +  +Q + Q  S+SKP   S KLP+ ++ +     G  +  +     S+   Y+NM SG 
Sbjct: 76   ASLLQFDFQEFSNSKPLNTSIKLPI-ESSIAVSPTGSQIPPTSSWKPSENGSYQNMQSGA 134

Query: 639  EHVPLISSRTASTSACSFVSDLNAKLQMGNLNRYNSEITKQDIHHRVQGNQTGVDLSSPQ 818
            +  PLISSR  +TS     S     L   +++   S++ K  +H  V GN+   D +  +
Sbjct: 135  DLCPLISSRATTTSNFGSNSVFPNGLPAASISLQGSDLAKTVLHDAVLGNEKLKDFTYLR 194

Query: 819  WGWLTRTNFPSVGQHNLTNVQASMKVPSNRKSFVSDHSSAFASGRPRVFCLATRGDLLIS 998
                  ++  ++   N+ N Q   K+P   +S  S +SS F SG PRVFC+   GDLL+S
Sbjct: 195  GELHNISDANAIKLQNVNN-QMPQKLPLAAESSASINSSRFPSGCPRVFCMDRSGDLLLS 253

Query: 999  NTFLLGVVCSCHGSHMSITTFCEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVP 1178
            NT LLG++CSCH  HMS++ FCEHSGL  +NPGDA+ ++SGET+AQWR+LYF KFGIRVP
Sbjct: 254  NTGLLGILCSCHCFHMSVSKFCEHSGLWNINPGDAIHMDSGETIAQWRKLYFQKFGIRVP 313

Query: 1179 DDNSGWDWPDGISATGGLVKCKASVPNTSKNSEMLRRIDPFVGSSARSGQPWNSFVSPNN 1358
            +D SGWDWP+G+     L++   S+ +  K +  +  + P   + ARSG+P +  V  N 
Sbjct: 314  EDQSGWDWPEGLPLAASLMRSGVSMSSMPKKTACINLVAP-SEALARSGRPLSDAVVKN- 371

Query: 1359 SHAEQSFL--EKPSKKVMQTPQQRN 1427
                  FL  + P    +   QQRN
Sbjct: 372  ------FLADQNPVIDALHDEQQRN 390


>ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prunus persica]
            gi|462423471|gb|EMJ27734.1| hypothetical protein
            PRUPE_ppa025154mg [Prunus persica]
          Length = 893

 Score =  240 bits (613), Expect = 1e-60
 Identities = 143/360 (39%), Positives = 203/360 (56%), Gaps = 2/360 (0%)
 Frame = +3

Query: 354  GVIQDSMVFNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPM 533
            G +QD ++ + L  GS R E   LGNSFLALLSG S   QC+ Q LS+ KP   S K+  
Sbjct: 3    GRLQDPLLASKLYSGSHRSEHANLGNSFLALLSGSSSVFQCDFQELSNPKPISTSCKILP 62

Query: 534  PDNGLMAGGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAK 713
              N  +  G G  + ++  G+LS+    +N+ SG +    +SSR+  +S+C+  S L+  
Sbjct: 63   DSNNFIVNGIGSAIPVTSSGVLSENLNGQNLQSGADSCTKVSSRSVPSSSCASNSVLH-- 120

Query: 714  LQMGNLNRYNSEITKQDIHHRVQGNQT--GVDLSSPQWGWLTRTNFPSVGQHNLTNVQAS 887
                  +  +S++ K    + V G++   G    S +W  ++  +    G+    N+Q S
Sbjct: 121  ------DLQSSDLAKVVTRNMVLGSEKVKGSFSLSGEWHGVSPAD---TGKACGANIQTS 171

Query: 888  MKVPSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCE 1067
             K+P      +S+ +S+F +G PRVFC  T G LL+SNT L+G+VCSCH  HMS+  FCE
Sbjct: 172  KKLPVEGNFVISNQASSFMNGCPRVFCSTTSGYLLLSNTGLVGIVCSCHCLHMSVLKFCE 231

Query: 1068 HSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKA 1247
            HSGL  VNPG AVR+++GET+AQW +LYFL  GIRVP D S WDWP+G+SAT GLVK   
Sbjct: 232  HSGLYGVNPGHAVRMDNGETIAQWCKLYFLNSGIRVPGDRSEWDWPEGLSATAGLVKSSL 291

Query: 1248 SVPNTSKNSEMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQSFLEKPSKKVMQTPQQRN 1427
            S+PN S +   L  +    G SA S Q  +      N    Q+ +       ++  QQRN
Sbjct: 292  SMPNMSND---LSHMVCSSGGSASSQQSLDGVALSKNLFTNQNLV----VGAVENKQQRN 344


>ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Populus trichocarpa]
            gi|550317856|gb|ERP49556.1| hypothetical protein
            POPTR_0018s02180g [Populus trichocarpa]
          Length = 868

 Score =  234 bits (596), Expect = 9e-59
 Identities = 148/357 (41%), Positives = 204/357 (57%), Gaps = 3/357 (0%)
 Frame = +3

Query: 366  DSMVFNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNG 545
            DSMV N  ++ S      +LGNSFLALLSGP+    C+   L + K    S+++P  D G
Sbjct: 3    DSMVSNIPNLSSYSGNC-DLGNSFLALLSGPASFSPCDFHELPNPKQFSASSRVPSEDTG 61

Query: 546  LMAGGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMG 725
             +    G    L    + S     +N  +G    P++SS+ ASTS     S L   LQ  
Sbjct: 62   SLFNASGSRAPLMSSRIPSGNLSNQNQRNGAN--PVVSSKCASTSN----SVLQHCLQGA 115

Query: 726  NLNRYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSN 905
            N   ++S++ K  IH++V  N+   D SS +  W + TN  +  +   TN Q   K+   
Sbjct: 116  NFAMHSSDLAKAVIHYKVSDNEKVKDSSSLRGEWRS-TNPANAVKLPDTNCQMPGKLALE 174

Query: 906  RKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCA 1085
             +  VS +SSA ++  PRVFCL   G+LL+S+T LLG++CSCH  HMS++ FCEHSGL  
Sbjct: 175  PELSVSKNSSALSNQYPRVFCLGKSGELLLSSTGLLGILCSCHCFHMSVSKFCEHSGLWN 234

Query: 1086 VNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTS 1265
            VNPG AV +E+GET+AQWR+LYF KFGIRVP+D SGWDWP+G+  T  LV     +P  S
Sbjct: 235  VNPGVAVHMENGETIAQWRKLYFQKFGIRVPEDQSGWDWPEGLPLTASLVHSSVPLP-LS 293

Query: 1266 KNSEMLRRIDPFVGSS---ARSGQPWNSFVSPNNSHAEQSFLEKPSKKVMQTPQQRN 1427
            K+S+     +  VGSS    RSGQP +S V P N   + +  + P   V+   Q+RN
Sbjct: 294  KHSD----CNHLVGSSEGLVRSGQPIDSVVFPKNPLTDYNLNQNPVFDVLD-KQKRN 345


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score =  225 bits (574), Expect = 3e-56
 Identities = 150/411 (36%), Positives = 208/411 (50%), Gaps = 3/411 (0%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXX-RNQMAMNAGQNFHPYVAQELRSMNSGVIQDSM 374
            M+NSWQ KC                   RNQ   NAG   + +  ++L     G +QD +
Sbjct: 1    MENSWQVKCGSTLQSSAPSLASSSSQELRNQTERNAGYYSYSHDPRDLSLKVLGTVQDPL 60

Query: 375  VFNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMA 554
            + N  D+   +     LGNSFLALLSGP   +QC+ + LS+SK   D + +       + 
Sbjct: 61   LPNYPDLSFQKSGHVNLGNSFLALLSGPPSLLQCDFKELSNSKLMSDGSSV-------IV 113

Query: 555  GGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
               G G+ L   G   +    +N+  GV   P  S+R    S C+  S L   LQ     
Sbjct: 114  NAIGNGIPLRFSGSPLEYMSEQNLQPGVAFSPSNSTRGVEASNCNTNSVLPG-LQ----- 167

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKVPSNRKS 914
              + ++    +H  V  ++      S    W      P+ G+ + T VQ S        S
Sbjct: 168  --SPDVETTTVHCMVPSSEKAKGSLSINGEWHGAVA-PNTGKLSSTKVQTSQMKSLEENS 224

Query: 915  FVSDH--SSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAV 1088
             +S+   SS   S  PRVFCL T G LLISNT LLG+VCSCH  HMS+  FCEHSGLC V
Sbjct: 225  SISNQYQSSKVLSECPRVFCLGTGGYLLISNTGLLGIVCSCHSLHMSVLKFCEHSGLCGV 284

Query: 1089 NPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSK 1268
            NPGDAV +++G+T+AQWR+LYF KFGIRV ++   WDWP+G+SAT GLVK + ++PN S 
Sbjct: 285  NPGDAVCMDNGQTIAQWRKLYFQKFGIRVSEEQIDWDWPEGLSATSGLVKSRTTLPNIS- 343

Query: 1269 NSEMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQSFLEKPSKKVMQTPQQ 1421
                   +    G  +RSGQ  ++ +  +N H  QS +   S+   +   Q
Sbjct: 344  ------HLAHSSGGLSRSGQLSDNAML-SNLHTNQSMVIDASQNKQKRDAQ 387


>ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda]
            gi|548856405|gb|ERN14258.1| hypothetical protein
            AMTR_s00033p00150780 [Amborella trichopoda]
          Length = 2123

 Score =  202 bits (513), Expect = 4e-49
 Identities = 146/416 (35%), Positives = 196/416 (47%), Gaps = 12/416 (2%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNF-HPYVAQELRSMNSGVIQDSM 374
            MDNSW  K                   RNQ  MNA Q F H Y             Q++ 
Sbjct: 1    MDNSWPGKVGPSWPGPPSSVP------RNQFEMNADQYFLHTYA------------QEAN 42

Query: 375  VFNTLDIGS----CRPEMPELGNSFLALLSG-PSPQIQCEVQHLSSSKPSMDSTKLPMPD 539
            V NT++ GS    C+   PE  NSF++LL+G PS QI  E Q L+SS+  M +T  P+  
Sbjct: 43   VSNTMNFGSTPYNCKMANPEFANSFISLLAGGPSQQICGEFQQLTSSRSGMATTSPPIN- 101

Query: 540  NGLMAGGDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRT--ASTSACSFVSDLNAK 713
                                      EN+ +G E   +I SR   A  S    V + +  
Sbjct: 102  --------------------------ENIVNGPELYQVIGSRNPLAFNSGRGLVFN-DGN 134

Query: 714  LQMGNLNRYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMK 893
            LQ  + + + S   KQ        +   V   SP    +  TN       ++++     K
Sbjct: 135  LQPKSSHLHGSNAAKQVFSDHTPRDNEIVSQRSPIQWLIGTTNTKQQNNAHISSY-TRFK 193

Query: 894  VPSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHS 1073
            +PS+ K  V D +S+   G  R +CL   GDLL+     LG+VCSCHG HMS+  FCEHS
Sbjct: 194  LPSDSKCDVIDQASSIVKGLTRAYCLGKSGDLLLIEGGHLGIVCSCHGLHMSVAKFCEHS 253

Query: 1074 GLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASV 1253
            G   +NPG+AVR  SGETVAQWRR  ++K GI++PDD +GWDWPDG +A  G  K K++ 
Sbjct: 254  GSSVINPGEAVRTGSGETVAQWRRENYIKLGIKLPDDTAGWDWPDGSTANAGKPKYKSAC 313

Query: 1254 ----PNTSKNSEMLRRIDPFVGSSARSGQPWNSFVSPNNSHAEQSFLEKPSKKVMQ 1409
                 N  KNS + R   PF G   RS QPWN+  S N      + LE  + +  +
Sbjct: 314  IQKNQNIEKNSGVSRHGYPFDG-QPRSEQPWNNANSFNYPRGGLAILESSASRTTE 368


>ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca
            subsp. vesca]
          Length = 2169

 Score =  191 bits (486), Expect = 5e-46
 Identities = 141/425 (33%), Positives = 188/425 (44%), Gaps = 63/425 (14%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+N W +KC                  RNQM  NAG       +Q+LR    G +QD ++
Sbjct: 1    MENRWMNKCNSTLPPAASSSSSQEQ--RNQME-NAGYLSCQRGSQDLRPAMLGWLQDPLL 57

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +    GS R E   LGNSFL+LLSG    +Q   Q  S+S+P   S K+    N  +  
Sbjct: 58   SSIQSSGSHRSEHVNLGNSFLSLLSGSPSLLQRGFQDFSNSQPICTSGKILPVGNNSILN 117

Query: 558  GDGCGVQLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSAC---SFVSDL-------- 704
                 + LS  GL S++  + N+ SG +     SS+   +S C   S + DL        
Sbjct: 118  STQSTIPLSSTGLPSEKLSWTNLQSGTDFCHNGSSKVVPSSICASNSVLHDLQSSDLAKV 177

Query: 705  --------NAKLQMG--------------NLNRYNSEITKQDIHHRVQGNQTGVDLS-SP 815
                    N KL+                  N  NSE    + +  +      V  S SP
Sbjct: 178  VICHTGPVNEKLESSYALSREWHCAGPASRANIQNSERMPLEANSFISNQAYRVCHSASP 237

Query: 816  QWGWLTRTNFPSVGQHNLTNVQASMKVPSNRKSFVSDHSSA------------------- 938
               W  R            ++  S  +P    SF+S H+S+                   
Sbjct: 238  ADSWKARRE----------SIHTSQTIPLEANSFISYHASSLWHGTNPADNGKACRANIE 287

Query: 939  ----------FASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTFCEHSGLCAV 1088
                      F +G PRVFC  T G LL SNT  LG+VCSCH   MS   FCEHSGL  V
Sbjct: 288  TSPKMPQASSFMNGCPRVFCSTTSGYLLFSNTGFLGIVCSCHSFRMSAFKFCEHSGLYGV 347

Query: 1089 NPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKCKASVPNTSK 1268
            NPGDA+R++SGET++QW +LY  KFGIR+P D S WDWP+ +SAT  L+K    +P  S 
Sbjct: 348  NPGDAIRMDSGETISQWCKLYLPKFGIRIPGDKSEWDWPEELSATASLMKRSVPMPKISN 407

Query: 1269 NSEML 1283
            +S  L
Sbjct: 408  SSSDL 412


>ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816713 isoform X1 [Glycine
            max]
          Length = 2032

 Score =  178 bits (452), Expect = 5e-42
 Identities = 136/423 (32%), Positives = 201/423 (47%), Gaps = 16/423 (3%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+++W+ KC                    +  +N     +P     LRS   G  Q  + 
Sbjct: 1    MESAWKRKCDSPFQPSTSAAVPSAPVPEPEPEINTSHCLYPQFPHGLRSKFVGGKQCPVY 60

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      + G+SFL+LL  P   +Q E   LS+ K  + S          + G
Sbjct: 61   QSFPHSITHGSGQADTGSSFLSLLYAPPSLLQHESWDLSNRKLCISSCDCTAAIGNSVVG 120

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  + S VGL+++     N+ S V   P ISSR              A + + N +
Sbjct: 121  SIESGTFRTSGVGLMTENLINRNLQSWVTTFPEISSR--------------AMVGLKNSS 166

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTN-----------VQ 881
             +        + H +Q + T    + P  G   R +F S GQ   T+           VQ
Sbjct: 167  SF--------VFHDIQSSNTATQPTIPG-GEKARESFSSSGQCQGTSPACSLNVCWSDVQ 217

Query: 882  ASMKVPSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTF 1061
             +  V   + S  S +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+  F
Sbjct: 218  TTPTVALEQSS--SKYATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSVAKF 275

Query: 1062 CEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKC 1241
            CEHSGL  V+PG+AVR+ESGET++QW++ YFLKFGIR   + + WDWP+ +S TG L++ 
Sbjct: 276  CEHSGLYGVDPGEAVRMESGETISQWQKQYFLKFGIRSLGNENEWDWPEVLSTTGSLMRS 335

Query: 1242 KASVPNTSKNSEMLRRIDPFVGSSA---RSGQPWNSFVSPNNSHAEQS-FLEKPSKKVMQ 1409
             AS  + SK +     +   + SSA   RS +  +  V P N+HA+ + F++  S K   
Sbjct: 336  NASAFDMSKTN-----LSHMLSSSAVMSRSAKSSDYAVFPKNAHADNNLFIDALSGKQAT 390

Query: 1410 TPQ 1418
            T Q
Sbjct: 391  TIQ 393


>ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816713 isoform X3 [Glycine
            max]
          Length = 2033

 Score =  178 bits (452), Expect = 5e-42
 Identities = 136/423 (32%), Positives = 201/423 (47%), Gaps = 16/423 (3%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+++W+ KC                    +  +N     +P     LRS   G  Q  + 
Sbjct: 1    MESAWKRKCDSPFQPSTSAAVPSAPVPEPEPEINTSHCLYPQFPHGLRSKFVGGKQCPVY 60

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      + G+SFL+LL  P   +Q E   LS+ K  + S          + G
Sbjct: 61   QSFPHSITHGSGQADTGSSFLSLLYAPPSLLQHESWDLSNRKLCISSCDCTAAIGNSVVG 120

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  + S VGL+++     N+ S V   P ISSR              A + + N +
Sbjct: 121  SIESGTFRTSGVGLMTENLINRNLQSWVTTFPEISSR--------------AMVGLKNSS 166

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTN-----------VQ 881
             +        + H +Q + T    + P  G   R +F S GQ   T+           VQ
Sbjct: 167  SF--------VFHDIQSSNTATQPTIPG-GEKARESFSSSGQCQGTSPACSLNVCWSDVQ 217

Query: 882  ASMKVPSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTF 1061
             +  V   + S  S +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+  F
Sbjct: 218  TTPTVALEQSS--SKYATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSVAKF 275

Query: 1062 CEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKC 1241
            CEHSGL  V+PG+AVR+ESGET++QW++ YFLKFGIR   + + WDWP+ +S TG L++ 
Sbjct: 276  CEHSGLYGVDPGEAVRMESGETISQWQKQYFLKFGIRSLGNENEWDWPEVLSTTGSLMRS 335

Query: 1242 KASVPNTSKNSEMLRRIDPFVGSSA---RSGQPWNSFVSPNNSHAEQS-FLEKPSKKVMQ 1409
             AS  + SK +     +   + SSA   RS +  +  V P N+HA+ + F++  S K   
Sbjct: 336  NASAFDMSKTN-----LSHMLSSSAVMSRSAKSSDYAVFPKNAHADNNLFIDALSGKQAT 390

Query: 1410 TPQ 1418
            T Q
Sbjct: 391  TIQ 393


>ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816713 isoform X2 [Glycine
            max]
          Length = 2035

 Score =  178 bits (452), Expect = 5e-42
 Identities = 136/423 (32%), Positives = 201/423 (47%), Gaps = 16/423 (3%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M+++W+ KC                    +  +N     +P     LRS   G  Q  + 
Sbjct: 1    MESAWKRKCDSPFQPSTSAAVPSAPVPEPEPEINTSHCLYPQFPHGLRSKFVGGKQCPVY 60

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      + G+SFL+LL  P   +Q E   LS+ K  + S          + G
Sbjct: 61   QSFPHSITHGSGQADTGSSFLSLLYAPPSLLQHESWDLSNRKLCISSCDCTAAIGNSVVG 120

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  + S VGL+++     N+ S V   P ISSR              A + + N +
Sbjct: 121  SIESGTFRTSGVGLMTENLINRNLQSWVTTFPEISSR--------------AMVGLKNSS 166

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTN-----------VQ 881
             +        + H +Q + T    + P  G   R +F S GQ   T+           VQ
Sbjct: 167  SF--------VFHDIQSSNTATQPTIPG-GEKARESFSSSGQCQGTSPACSLNVCWSDVQ 217

Query: 882  ASMKVPSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSITTF 1061
             +  V   + S  S +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+  F
Sbjct: 218  TTPTVALEQSS--SKYATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSVAKF 275

Query: 1062 CEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGLVKC 1241
            CEHSGL  V+PG+AVR+ESGET++QW++ YFLKFGIR   + + WDWP+ +S TG L++ 
Sbjct: 276  CEHSGLYGVDPGEAVRMESGETISQWQKQYFLKFGIRSLGNENEWDWPEVLSTTGSLMRS 335

Query: 1242 KASVPNTSKNSEMLRRIDPFVGSSA---RSGQPWNSFVSPNNSHAEQS-FLEKPSKKVMQ 1409
             AS  + SK +     +   + SSA   RS +  +  V P N+HA+ + F++  S K   
Sbjct: 336  NASAFDMSKTN-----LSHMLSSSAVMSRSAKSSDYAVFPKNAHADNNLFIDALSGKQAT 390

Query: 1410 TPQ 1418
            T Q
Sbjct: 391  TIQ 393


>ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812602 isoform X2 [Glycine
            max]
          Length = 2007

 Score =  174 bits (442), Expect = 7e-41
 Identities = 124/374 (33%), Positives = 176/374 (47%), Gaps = 15/374 (4%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M++ W+ KC                    ++  N     +P VA  LRS   G  Q  + 
Sbjct: 1    MESPWERKCDSPLQPSTSATVLSAPPPETEI--NTSYCLYPQVAHGLRSKFVGGKQGHVY 58

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      +  NSFL+LL GP   +Q E + LS  K    S          + G
Sbjct: 59   QSFPHSTAHGSGQADTRNSFLSLLYGPPSLLQHEFRDLSDRKLCFSSGDCTAAIGNSVVG 118

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  Q S VGL+++     N+ S V   P ISSR              A + + N N
Sbjct: 119  SIESGTFQTSGVGLMTENLINHNLQSRVTTFPEISSR--------------AMVGLNNSN 164

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKV------ 896
             +        + H +Q + T +    P      R +F S GQ   T   +S+ V      
Sbjct: 165  NF--------VFHDIQSSNTAIQPPIPG-SEKARESFSSPGQCQGTIPASSLNVCCSDIQ 215

Query: 897  --------PSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSI 1052
                    PS+ K     +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+
Sbjct: 216  TTQTIALEPSSSK-----YATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSV 270

Query: 1053 TTFCEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGL 1232
              FCEHSGL  ++PG+AVR+ESGET++QW++LYFLKFGIR   + + WDWPD +S  G L
Sbjct: 271  LKFCEHSGLHGIDPGEAVRMESGETISQWQKLYFLKFGIRSLGNENEWDWPDVLSTRGSL 330

Query: 1233 VKCKASVPNTSKNS 1274
            ++  +S  + SK +
Sbjct: 331  MRSNSSAFDMSKTN 344


>ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812602 isoform X6 [Glycine
            max]
          Length = 1870

 Score =  174 bits (441), Expect = 9e-41
 Identities = 124/374 (33%), Positives = 176/374 (47%), Gaps = 15/374 (4%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M++ W+ KC                    +  +N     +P VA  LRS   G  Q  + 
Sbjct: 1    MESPWERKCDSPLQPSTSATVLSAPPPETK-EINTSYCLYPQVAHGLRSKFVGGKQGHVY 59

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      +  NSFL+LL GP   +Q E + LS  K    S          + G
Sbjct: 60   QSFPHSTAHGSGQADTRNSFLSLLYGPPSLLQHEFRDLSDRKLCFSSGDCTAAIGNSVVG 119

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  Q S VGL+++     N+ S V   P ISSR              A + + N N
Sbjct: 120  SIESGTFQTSGVGLMTENLINHNLQSRVTTFPEISSR--------------AMVGLNNSN 165

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKV------ 896
             +        + H +Q + T +    P      R +F S GQ   T   +S+ V      
Sbjct: 166  NF--------VFHDIQSSNTAIQPPIPG-SEKARESFSSPGQCQGTIPASSLNVCCSDIQ 216

Query: 897  --------PSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSI 1052
                    PS+ K     +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+
Sbjct: 217  TTQTIALEPSSSK-----YATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSV 271

Query: 1053 TTFCEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGL 1232
              FCEHSGL  ++PG+AVR+ESGET++QW++LYFLKFGIR   + + WDWPD +S  G L
Sbjct: 272  LKFCEHSGLHGIDPGEAVRMESGETISQWQKLYFLKFGIRSLGNENEWDWPDVLSTRGSL 331

Query: 1233 VKCKASVPNTSKNS 1274
            ++  +S  + SK +
Sbjct: 332  MRSNSSAFDMSKTN 345


>ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812602 isoform X5 [Glycine
            max]
          Length = 1872

 Score =  174 bits (441), Expect = 9e-41
 Identities = 124/374 (33%), Positives = 176/374 (47%), Gaps = 15/374 (4%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M++ W+ KC                    +  +N     +P VA  LRS   G  Q  + 
Sbjct: 1    MESPWERKCDSPLQPSTSATVLSAPPPETK-EINTSYCLYPQVAHGLRSKFVGGKQGHVY 59

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      +  NSFL+LL GP   +Q E + LS  K    S          + G
Sbjct: 60   QSFPHSTAHGSGQADTRNSFLSLLYGPPSLLQHEFRDLSDRKLCFSSGDCTAAIGNSVVG 119

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  Q S VGL+++     N+ S V   P ISSR              A + + N N
Sbjct: 120  SIESGTFQTSGVGLMTENLINHNLQSRVTTFPEISSR--------------AMVGLNNSN 165

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKV------ 896
             +        + H +Q + T +    P      R +F S GQ   T   +S+ V      
Sbjct: 166  NF--------VFHDIQSSNTAIQPPIPG-SEKARESFSSPGQCQGTIPASSLNVCCSDIQ 216

Query: 897  --------PSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSI 1052
                    PS+ K     +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+
Sbjct: 217  TTQTIALEPSSSK-----YATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSV 271

Query: 1053 TTFCEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGL 1232
              FCEHSGL  ++PG+AVR+ESGET++QW++LYFLKFGIR   + + WDWPD +S  G L
Sbjct: 272  LKFCEHSGLHGIDPGEAVRMESGETISQWQKLYFLKFGIRSLGNENEWDWPDVLSTRGSL 331

Query: 1233 VKCKASVPNTSKNS 1274
            ++  +S  + SK +
Sbjct: 332  MRSNSSAFDMSKTN 345


>ref|XP_006596086.1| PREDICTED: uncharacterized protein LOC100812602 isoform X4 [Glycine
            max]
          Length = 1976

 Score =  174 bits (441), Expect = 9e-41
 Identities = 124/374 (33%), Positives = 176/374 (47%), Gaps = 15/374 (4%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M++ W+ KC                    +  +N     +P VA  LRS   G  Q  + 
Sbjct: 1    MESPWERKCDSPLQPSTSATVLSAPPPETK-EINTSYCLYPQVAHGLRSKFVGGKQGHVY 59

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      +  NSFL+LL GP   +Q E + LS  K    S          + G
Sbjct: 60   QSFPHSTAHGSGQADTRNSFLSLLYGPPSLLQHEFRDLSDRKLCFSSGDCTAAIGNSVVG 119

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  Q S VGL+++     N+ S V   P ISSR              A + + N N
Sbjct: 120  SIESGTFQTSGVGLMTENLINHNLQSRVTTFPEISSR--------------AMVGLNNSN 165

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKV------ 896
             +        + H +Q + T +    P      R +F S GQ   T   +S+ V      
Sbjct: 166  NF--------VFHDIQSSNTAIQPPIPG-SEKARESFSSPGQCQGTIPASSLNVCCSDIQ 216

Query: 897  --------PSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSI 1052
                    PS+ K     +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+
Sbjct: 217  TTQTIALEPSSSK-----YATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSV 271

Query: 1053 TTFCEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGL 1232
              FCEHSGL  ++PG+AVR+ESGET++QW++LYFLKFGIR   + + WDWPD +S  G L
Sbjct: 272  LKFCEHSGLHGIDPGEAVRMESGETISQWQKLYFLKFGIRSLGNENEWDWPDVLSTRGSL 331

Query: 1233 VKCKASVPNTSKNS 1274
            ++  +S  + SK +
Sbjct: 332  MRSNSSAFDMSKTN 345


>ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812602 isoform X3 [Glycine
            max]
          Length = 2006

 Score =  174 bits (441), Expect = 9e-41
 Identities = 124/374 (33%), Positives = 176/374 (47%), Gaps = 15/374 (4%)
 Frame = +3

Query: 198  MDNSWQSKCXXXXXXXXXXXXXXXXXXRNQMAMNAGQNFHPYVAQELRSMNSGVIQDSMV 377
            M++ W+ KC                    +  +N     +P VA  LRS   G  Q  + 
Sbjct: 1    MESPWERKCDSPLQPSTSATVLSAPPPETK-EINTSYCLYPQVAHGLRSKFVGGKQGHVY 59

Query: 378  FNTLDIGSCRPEMPELGNSFLALLSGPSPQIQCEVQHLSSSKPSMDSTKLPMPDNGLMAG 557
             +     +      +  NSFL+LL GP   +Q E + LS  K    S          + G
Sbjct: 60   QSFPHSTAHGSGQADTRNSFLSLLYGPPSLLQHEFRDLSDRKLCFSSGDCTAAIGNSVVG 119

Query: 558  GDGCGV-QLSPVGLLSQRQGYENMGSGVEHVPLISSRTASTSACSFVSDLNAKLQMGNLN 734
                G  Q S VGL+++     N+ S V   P ISSR              A + + N N
Sbjct: 120  SIESGTFQTSGVGLMTENLINHNLQSRVTTFPEISSR--------------AMVGLNNSN 165

Query: 735  RYNSEITKQDIHHRVQGNQTGVDLSSPQWGWLTRTNFPSVGQHNLTNVQASMKV------ 896
             +        + H +Q + T +    P      R +F S GQ   T   +S+ V      
Sbjct: 166  NF--------VFHDIQSSNTAIQPPIPG-SEKARESFSSPGQCQGTIPASSLNVCCSDIQ 216

Query: 897  --------PSNRKSFVSDHSSAFASGRPRVFCLATRGDLLISNTFLLGVVCSCHGSHMSI 1052
                    PS+ K     +++ F SG PRVFC+   G LL+SNT LLG+VCSCH  HMS+
Sbjct: 217  TTQTIALEPSSSK-----YATPFMSGCPRVFCMGKSGHLLLSNTGLLGIVCSCHCCHMSV 271

Query: 1053 TTFCEHSGLCAVNPGDAVRLESGETVAQWRRLYFLKFGIRVPDDNSGWDWPDGISATGGL 1232
              FCEHSGL  ++PG+AVR+ESGET++QW++LYFLKFGIR   + + WDWPD +S  G L
Sbjct: 272  LKFCEHSGLHGIDPGEAVRMESGETISQWQKLYFLKFGIRSLGNENEWDWPDVLSTRGSL 331

Query: 1233 VKCKASVPNTSKNS 1274
            ++  +S  + SK +
Sbjct: 332  MRSNSSAFDMSKTN 345


Top