BLASTX nr result

ID: Papaver31_contig00004923 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00004923
         (1574 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010257928.1| PREDICTED: uncharacterized protein LOC104597...   220   3e-54
ref|XP_010257925.1| PREDICTED: uncharacterized protein LOC104597...   220   3e-54
ref|XP_010261085.1| PREDICTED: uncharacterized protein LOC104599...   189   5e-45
ref|XP_010261008.1| PREDICTED: uncharacterized protein LOC104599...   189   5e-45
ref|XP_012464097.1| PREDICTED: uncharacterized protein LOC105783...   147   3e-32
gb|KHG21027.1| Formate--tetrahydrofolate ligase [Gossypium arbor...   145   7e-32
ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-rela...   143   4e-31
ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-rela...   143   4e-31
ref|XP_006573172.1| PREDICTED: uncharacterized protein MAL13P1.3...   135   8e-29
ref|XP_008446474.1| PREDICTED: dentin sialophosphoprotein isofor...   135   1e-28
ref|XP_006573175.1| PREDICTED: uncharacterized protein MAL13P1.3...   128   1e-26
ref|XP_003520134.1| PREDICTED: uncharacterized protein LOC100778...   128   2e-26
gb|KRH70993.1| hypothetical protein GLYMA_02G122700 [Glycine max...   127   2e-26
gb|KRH70991.1| hypothetical protein GLYMA_02G122700 [Glycine max...   127   2e-26
ref|XP_012478806.1| PREDICTED: uncharacterized protein LOC105794...   127   2e-26
ref|XP_012478809.1| PREDICTED: uncharacterized protein LOC105794...   127   2e-26
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   119   1e-23
ref|XP_010554264.1| PREDICTED: uncharacterized protein LOC104824...   118   2e-23
gb|KHG16888.1| Polyribonucleotide nucleotidyltransferase [Gossyp...   117   2e-23
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   117   4e-23

>ref|XP_010257928.1| PREDICTED: uncharacterized protein LOC104597869 isoform X2 [Nelumbo
            nucifera]
          Length = 415

 Score =  220 bits (560), Expect = 3e-54
 Identities = 149/428 (34%), Positives = 209/428 (48%), Gaps = 7/428 (1%)
 Frame = -2

Query: 1534 EEHGNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVT 1355
            ++ G + R V GL D    +D +N  E K  + I  ++ PS +  LS+K  + YTDK+V 
Sbjct: 27   KQTGENVRNVKGLHDFVSMDDLINGREGKIGDHIPTYVLPSGEIKLSEKVTKFYTDKSVM 86

Query: 1354 ECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNELVKDI 1175
            ECE+PELIVCFKEG Y ++KDIC+DEG+PS DK   ENG V  K    + +         
Sbjct: 87   ECEVPELIVCFKEGPYHVVKDICVDEGVPSQDKILTENGQVDCKPCSMHSD--------- 137

Query: 1174 GHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYES 995
                + N D   +M G+    S++ +  V++  E         +  F  +  N D     
Sbjct: 138  ---LDVNSDLTKQMVGSVTLDSDVMKSLVQSDCEKNTDSQCNSKDLFQKDEKNADVE--- 191

Query: 994  CNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCD--TDGSA 821
                 D++ H   L      E++ +  K K                 +  SC   T+  +
Sbjct: 192  -----DEIAHAHILDKKVMSENMLSVGKLK-----------------TEKSCPELTNFDS 229

Query: 820  SRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPT-T 644
            +  Q   +QDMS +G  ANS  PS A + D S   N+V         ++  DS+P  + T
Sbjct: 230  NGEQQAHNQDMSREGTLANSAVPSPAAESDSSNPDNKVPLNSKVENRSITFDSNPSTSAT 289

Query: 643  SGREEDPNTQKSEFQRAIHTV-NILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXXXX 476
            SGR E  + QK++  + +HT+ N   LE+   +S TASSRSFFIQHGHGE          
Sbjct: 290  SGRVE--SKQKADSPQPLHTLLNTSRLEDGPVESLTASSRSFFIQHGHGESSFSAVGPMS 347

Query: 475  XPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWK 296
              + Y+ P                   SFAFP+LHSEWNSSPVKM K D+RH RKHR WK
Sbjct: 348  GSITYSGPIPYSGSISLRSDSSTTSNRSFAFPILHSEWNSSPVKMAKADQRHFRKHRRWK 407

Query: 295  LCFPCCRY 272
            + F CC +
Sbjct: 408  MNFLCCSF 415


>ref|XP_010257925.1| PREDICTED: uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera] gi|720006276|ref|XP_010257926.1| PREDICTED:
            uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera] gi|720006279|ref|XP_010257927.1| PREDICTED:
            uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera]
          Length = 453

 Score =  220 bits (560), Expect = 3e-54
 Identities = 149/428 (34%), Positives = 209/428 (48%), Gaps = 7/428 (1%)
 Frame = -2

Query: 1534 EEHGNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVT 1355
            ++ G + R V GL D    +D +N  E K  + I  ++ PS +  LS+K  + YTDK+V 
Sbjct: 65   KQTGENVRNVKGLHDFVSMDDLINGREGKIGDHIPTYVLPSGEIKLSEKVTKFYTDKSVM 124

Query: 1354 ECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNELVKDI 1175
            ECE+PELIVCFKEG Y ++KDIC+DEG+PS DK   ENG V  K    + +         
Sbjct: 125  ECEVPELIVCFKEGPYHVVKDICVDEGVPSQDKILTENGQVDCKPCSMHSD--------- 175

Query: 1174 GHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYES 995
                + N D   +M G+    S++ +  V++  E         +  F  +  N D     
Sbjct: 176  ---LDVNSDLTKQMVGSVTLDSDVMKSLVQSDCEKNTDSQCNSKDLFQKDEKNADVE--- 229

Query: 994  CNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCD--TDGSA 821
                 D++ H   L      E++ +  K K                 +  SC   T+  +
Sbjct: 230  -----DEIAHAHILDKKVMSENMLSVGKLK-----------------TEKSCPELTNFDS 267

Query: 820  SRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPT-T 644
            +  Q   +QDMS +G  ANS  PS A + D S   N+V         ++  DS+P  + T
Sbjct: 268  NGEQQAHNQDMSREGTLANSAVPSPAAESDSSNPDNKVPLNSKVENRSITFDSNPSTSAT 327

Query: 643  SGREEDPNTQKSEFQRAIHTV-NILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXXXX 476
            SGR E  + QK++  + +HT+ N   LE+   +S TASSRSFFIQHGHGE          
Sbjct: 328  SGRVE--SKQKADSPQPLHTLLNTSRLEDGPVESLTASSRSFFIQHGHGESSFSAVGPMS 385

Query: 475  XPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWK 296
              + Y+ P                   SFAFP+LHSEWNSSPVKM K D+RH RKHR WK
Sbjct: 386  GSITYSGPIPYSGSISLRSDSSTTSNRSFAFPILHSEWNSSPVKMAKADQRHFRKHRRWK 445

Query: 295  LCFPCCRY 272
            + F CC +
Sbjct: 446  MNFLCCSF 453


>ref|XP_010261085.1| PREDICTED: uncharacterized protein LOC104599949 isoform X2 [Nelumbo
            nucifera]
          Length = 447

 Score =  189 bits (481), Expect = 5e-45
 Identities = 144/430 (33%), Positives = 208/430 (48%), Gaps = 9/430 (2%)
 Frame = -2

Query: 1534 EEH-GNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTV 1358
            E+H G S R V GL D   +++ +N  EN++ +S   ++ PS +  LS+K    YTDK V
Sbjct: 65   EKHTGESLRNVKGLHDFVRTDNLINGKENETGDSAPMYVLPSGETKLSEKVTGFYTDKVV 124

Query: 1357 TECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELF--TYLESSNELV 1184
             ECELP+L V FKE  Y ++KDICIDEG+PS+DK   EN  V  K  F  T L+ +++L 
Sbjct: 125  MECELPDLTVGFKEDPYRVVKDICIDEGVPSLDKILTENDEVDYKSCFPHTGLDVNSDLT 184

Query: 1183 KDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEG-CNIDS 1007
            K              E D     ++E+                     + LVE  CN D 
Sbjct: 185  K--------------EKDSVLPSLNEM---------------------KSLVESYCNKDI 209

Query: 1006 PYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDG 827
                CN    +V H KD  +V+ EE  T  +   + +    P+ + D  +S     +   
Sbjct: 210  -LNQCN---SEVLHQKD-EYVD-EEDKTAHNSTDEVIPGSVPLGKLDTEDSYIKPSNFGS 263

Query: 826  SASRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVG-EGETAGTVTLNSDSSPPP 650
            +  ++QSN  QD S++  +      S   + D+S  +N+V    +     T+ S     P
Sbjct: 264  NKDQQQSN--QDSSKEAPAEKYGISSPTEESDDSNPANKVPFNNKVENGSTIMSFHPSKP 321

Query: 649  TTSGREEDPNTQKSEFQRAIHTV-NILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXX 482
            TT  REE   + K++  + +H + ++  LE+   DS T SSRS  IQHGHGE        
Sbjct: 322  TT--REE--TSTKADSPQPLHILLSMSRLEDGTVDSLTGSSRSLCIQHGHGESSFSAAGP 377

Query: 481  XXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRG 302
                + Y+ P                   SFAFP+LHSEWNSSPVKM K +RR+  KHRG
Sbjct: 378  MSGSITYSGPVPYSGSISLRSDSSTTSTRSFAFPILHSEWNSSPVKMAKANRRNFHKHRG 437

Query: 301  WKLCFPCCRY 272
            W++   CCR+
Sbjct: 438  WRMNLLCCRF 447


>ref|XP_010261008.1| PREDICTED: uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967160|ref|XP_010261016.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967163|ref|XP_010261023.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967167|ref|XP_010261033.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967171|ref|XP_010261043.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967174|ref|XP_010261051.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967177|ref|XP_010261058.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967180|ref|XP_010261067.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera] gi|719967184|ref|XP_010261077.1| PREDICTED:
            uncharacterized protein LOC104599949 isoform X1 [Nelumbo
            nucifera]
          Length = 465

 Score =  189 bits (481), Expect = 5e-45
 Identities = 144/430 (33%), Positives = 208/430 (48%), Gaps = 9/430 (2%)
 Frame = -2

Query: 1534 EEH-GNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTV 1358
            E+H G S R V GL D   +++ +N  EN++ +S   ++ PS +  LS+K    YTDK V
Sbjct: 83   EKHTGESLRNVKGLHDFVRTDNLINGKENETGDSAPMYVLPSGETKLSEKVTGFYTDKVV 142

Query: 1357 TECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELF--TYLESSNELV 1184
             ECELP+L V FKE  Y ++KDICIDEG+PS+DK   EN  V  K  F  T L+ +++L 
Sbjct: 143  MECELPDLTVGFKEDPYRVVKDICIDEGVPSLDKILTENDEVDYKSCFPHTGLDVNSDLT 202

Query: 1183 KDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEG-CNIDS 1007
            K              E D     ++E+                     + LVE  CN D 
Sbjct: 203  K--------------EKDSVLPSLNEM---------------------KSLVESYCNKDI 227

Query: 1006 PYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDG 827
                CN    +V H KD  +V+ EE  T  +   + +    P+ + D  +S     +   
Sbjct: 228  -LNQCN---SEVLHQKD-EYVD-EEDKTAHNSTDEVIPGSVPLGKLDTEDSYIKPSNFGS 281

Query: 826  SASRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVG-EGETAGTVTLNSDSSPPP 650
            +  ++QSN  QD S++  +      S   + D+S  +N+V    +     T+ S     P
Sbjct: 282  NKDQQQSN--QDSSKEAPAEKYGISSPTEESDDSNPANKVPFNNKVENGSTIMSFHPSKP 339

Query: 649  TTSGREEDPNTQKSEFQRAIHTV-NILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXX 482
            TT  REE   + K++  + +H + ++  LE+   DS T SSRS  IQHGHGE        
Sbjct: 340  TT--REE--TSTKADSPQPLHILLSMSRLEDGTVDSLTGSSRSLCIQHGHGESSFSAAGP 395

Query: 481  XXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRG 302
                + Y+ P                   SFAFP+LHSEWNSSPVKM K +RR+  KHRG
Sbjct: 396  MSGSITYSGPVPYSGSISLRSDSSTTSTRSFAFPILHSEWNSSPVKMAKANRRNFHKHRG 455

Query: 301  WKLCFPCCRY 272
            W++   CCR+
Sbjct: 456  WRMNLLCCRF 465


>ref|XP_012464097.1| PREDICTED: uncharacterized protein LOC105783281 [Gossypium raimondii]
            gi|823262692|ref|XP_012464099.1| PREDICTED:
            uncharacterized protein LOC105783281 [Gossypium
            raimondii] gi|763813583|gb|KJB80435.1| hypothetical
            protein B456_013G097400 [Gossypium raimondii]
            gi|763813584|gb|KJB80436.1| hypothetical protein
            B456_013G097400 [Gossypium raimondii]
          Length = 505

 Score =  147 bits (370), Expect = 3e-32
 Identities = 124/441 (28%), Positives = 185/441 (41%), Gaps = 34/441 (7%)
 Frame = -2

Query: 1492 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1313
            D S+S    +    K       F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVHDFSNGNEKEVRDFVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1312 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL-FTY--LESSNELVKDIGHTTEPNIDCQ 1142
            +Y ++KDICIDEG+P+ D    E+ V    E  F+Y   +  NEL+K++  T  P  D  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDMPMQDIS 206

Query: 1141 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 968
            F  +  NQ   ++D   G+ K  D +    D+ +          I + +     D  D+ 
Sbjct: 207  FSPE-ENQSGKDIDNECGSNKKLDADTYMQDIALSLEENKSNKGIPNEW-----DPRDLL 260

Query: 967  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 788
              +D+     E      SKE   +     + E     S  +S D       +QS E+   
Sbjct: 261  VTRDMKDDAMEMMSNDGSKELFTLGDILSLPELATLKSEAMSPDCKSDRIEQQSFENSSK 320

Query: 787  SE------------------------DGE-----SANSLRPSTAVQEDESTNSNQVGEGE 695
             E                        +G       A  + P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASAVEESNNLILSAPALVSTAEGSDIGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 694  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 515
              G++T +S SS P  TSG+  +   +     +         LEE +    S +  +Q G
Sbjct: 379  -TGSITFDSRSSAP--TSGKGSNKPLEAGRTSK---------LEETADQPFSSN--LQSG 424

Query: 514  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 335
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPIAYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 334  PDRRHLRKHRGWKLCFPCCRY 272
             DRR  R+HRGW+  F CCR+
Sbjct: 485  ADRRQYRRHRGWRQGFLCCRF 505


>gb|KHG21027.1| Formate--tetrahydrofolate ligase [Gossypium arboreum]
          Length = 505

 Score =  145 bits (367), Expect = 7e-32
 Identities = 123/441 (27%), Positives = 187/441 (42%), Gaps = 34/441 (7%)
 Frame = -2

Query: 1492 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1313
            D S+S    +    K    I  F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVHDFSNGNEKEVRDIVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1312 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL-FTY--LESSNELVKDIGHTTEPNIDCQ 1142
            +Y ++KDICIDEG+P+ D    E+ V    E  F+Y   +  NEL+K++  T  P  +  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDIPMQNIS 206

Query: 1141 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 968
            F  +  NQ   ++D   G+ K  + +    D+ +          I + +     D  D+ 
Sbjct: 207  FSPE-ENQSGKDIDNDCGSNKKLNADTYMQDIALSLEENKSNKGIPNEW-----DPRDLL 260

Query: 967  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 788
              +D+     E      SKE   +       E     S  +S D     + +QS E+   
Sbjct: 261  VTRDMKDDATEMMSNEGSKELFILGDILSFPELTTLKSEAMSPDFKSDRNEQQSFENSSK 320

Query: 787  SE-----DGESANSL------------------------RPSTAVQEDESTNSNQVGEGE 695
             E     + E +N+L                         P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASEVEDSNNLILSAPALASTAEGSDSGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 694  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 515
              G++T +S SS P +  G  E   T ++             LEE +    S +  +Q G
Sbjct: 379  -TGSITFDSRSSAPTSGKGSSEPLETGRTS-----------KLEETADQPFSSN--LQSG 424

Query: 514  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 335
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPITYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 334  PDRRHLRKHRGWKLCFPCCRY 272
             D+R  R+HRGW+  F CCR+
Sbjct: 485  ADQRQYRRHRGWRQGFLCCRF 505


>ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|590698568|ref|XP_007045751.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
            gi|590698571|ref|XP_007045752.1| 18S pre-ribosomal
            assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] gi|508709685|gb|EOY01582.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709686|gb|EOY01583.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1|
            18S pre-ribosomal assembly protein gar2-related, putative
            isoform 2 [Theobroma cacao]
          Length = 470

 Score =  143 bits (361), Expect = 4e-31
 Identities = 135/456 (29%), Positives = 201/456 (44%), Gaps = 52/456 (11%)
 Frame = -2

Query: 1483 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1313
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 30   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 89

Query: 1312 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNELVKDIGHTTEPNIDCQFEM 1133
            +Y ++KDICIDEG+P+ DK   E G+  +K    +L S  E  +D    TE     + E 
Sbjct: 90   TYHVVKDICIDEGVPTQDKFLFETGM-DEKIDCNFLPSEKE--QDSQLMTE-----KLET 141

Query: 1132 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 956
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 142  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 201

Query: 955  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 821
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 202  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 261

Query: 820  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 725
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 262  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEES 321

Query: 724  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 560
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 322  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 376

Query: 559  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 380
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 377  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 434

Query: 379  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 272
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 435  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao] gi|508709684|gb|EOY01581.1| 18S
            pre-ribosomal assembly protein gar2-related, putative
            isoform 1 [Theobroma cacao]
          Length = 527

 Score =  143 bits (361), Expect = 4e-31
 Identities = 135/456 (29%), Positives = 201/456 (44%), Gaps = 52/456 (11%)
 Frame = -2

Query: 1483 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1313
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 87   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 146

Query: 1312 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNELVKDIGHTTEPNIDCQFEM 1133
            +Y ++KDICIDEG+P+ DK   E G+  +K    +L S  E  +D    TE     + E 
Sbjct: 147  TYHVVKDICIDEGVPTQDKFLFETGM-DEKIDCNFLPSEKE--QDSQLMTE-----KLET 198

Query: 1132 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 956
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 199  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 258

Query: 955  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 821
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 259  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 318

Query: 820  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 725
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 319  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEES 378

Query: 724  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 560
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 379  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 433

Query: 559  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 380
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 434  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 491

Query: 379  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 272
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 492  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527


>ref|XP_006573172.1| PREDICTED: uncharacterized protein MAL13P1.304-like isoform X2
            [Glycine max] gi|571434350|ref|XP_006573173.1| PREDICTED:
            uncharacterized protein MAL13P1.304-like isoform X3
            [Glycine max] gi|571434352|ref|XP_006573174.1| PREDICTED:
            uncharacterized protein MAL13P1.304-like isoform X4
            [Glycine max] gi|571434354|ref|XP_003516473.2| PREDICTED:
            uncharacterized protein MAL13P1.304-like isoform X1
            [Glycine max] gi|734428580|gb|KHN44807.1| hypothetical
            protein glysoja_038982 [Glycine soja]
            gi|947127284|gb|KRH75138.1| hypothetical protein
            GLYMA_01G064900 [Glycine max] gi|947127285|gb|KRH75139.1|
            hypothetical protein GLYMA_01G064900 [Glycine max]
            gi|947127286|gb|KRH75140.1| hypothetical protein
            GLYMA_01G064900 [Glycine max] gi|947127287|gb|KRH75141.1|
            hypothetical protein GLYMA_01G064900 [Glycine max]
            gi|947127288|gb|KRH75142.1| hypothetical protein
            GLYMA_01G064900 [Glycine max] gi|947127289|gb|KRH75143.1|
            hypothetical protein GLYMA_01G064900 [Glycine max]
            gi|947127290|gb|KRH75144.1| hypothetical protein
            GLYMA_01G064900 [Glycine max]
          Length = 517

 Score =  135 bits (341), Expect = 8e-29
 Identities = 122/425 (28%), Positives = 179/425 (42%), Gaps = 27/425 (6%)
 Frame = -2

Query: 1465 NVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEGSYSIIKDIC 1286
            N  E+   +  +P  +P+   DL +  ++ Y DKTVTECE P L VC+KE +Y ++KDIC
Sbjct: 104  NDVESVKRSLTSPISNPAEGRDLPRNSVDGYMDKTVTECE-PHLEVCYKESNYHVVKDIC 162

Query: 1285 IDEGLPSVDKTFRENGVVPDKELFTYLESSNELVKDIGHTTEPNIDCQ-FEMDGANQCVS 1109
            +DEG+ + DK    N V      F + ES     K   +T+   +     E    N  +S
Sbjct: 163  VDEGVLNKDKVMFVNTVDEKAHNFFHSESYENKEKQKDNTSIKALSLTPTEEKAHNFFLS 222

Query: 1108 ELDEGNVKARDE------NMVSDDLKVQTRFLVEGCNIDSPYESCNI-------DGDDVQ 968
            E  E   K +D       ++   + K    F  E         S N+       + D+V 
Sbjct: 223  ESYENKEKQKDNISINVLSLTPTEEKAHNFFPSESKEKQKDNTSINVLSLTPTEESDEVH 282

Query: 967  HNKDLPFVEREESLTTSSKEKDRVESEF-PIQEC-----DNNNSSRVSCDTDGSASRRQS 806
             N D P     +    + K    V  E  P+ E      D      VS D  G    + S
Sbjct: 283  ANHDQPKGLMHKDGDATEKISGNVNKEMKPLPEDKVLLQDLLTEDSVSSDDKGE---QIS 339

Query: 805  NESQDMSEDGESANSLR------PSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPTT 644
            NE +  S+   S N++       PS A+ +DES N N + E E++   T   D S P + 
Sbjct: 340  NEPELHSQSEGSKNTVEEAILESPSLALADDESNNDNMLSEKESS---THQLDPSRP-SD 395

Query: 643  SGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHGHGEXXXXXXXXXXXPLA 464
             G+EE       +      T+  +  + D Q  +     I H  GE            ++
Sbjct: 396  CGKEECHQAGVCKCDEIQQTMKPVEGKSDDQAVTGH---IHHSLGEASFSSIGPMSGRIS 452

Query: 463  YTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRG-WKLCF 287
            Y+ P                   SFAFP++ SEWNSSPV+M K DR+H RK R  W+  F
Sbjct: 453  YSGPVPYSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRKHFRKQRWCWRDGF 512

Query: 286  PCCRY 272
             CC++
Sbjct: 513  LCCKF 517


>ref|XP_008446474.1| PREDICTED: dentin sialophosphoprotein isoform X3 [Cucumis melo]
          Length = 445

 Score =  135 bits (340), Expect = 1e-28
 Identities = 131/439 (29%), Positives = 198/439 (45%), Gaps = 25/439 (5%)
 Frame = -2

Query: 1513 RKVPGLDDLSDSEDSVNVAENKSANSINPFLDP---SCDDDLSQKEMELYTDKTVTECEL 1343
            R+   LDD +D +D            +  F+ P   SC  DLS+++ ELY +K++ EC+L
Sbjct: 68   RECLDLDDFNDYDD------------VKAFVSPLNNSCKVDLSEEDSELYMEKSIVECQL 115

Query: 1342 PELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNELVKDIGHTT 1163
            PELIVC+KE   +I+KDICID+G P  DK F  + +  D+E             D+    
Sbjct: 116  PELIVCYKENICNIVKDICIDDGTPR-DKLFCGSSL--DEE-------------DVCSIN 159

Query: 1162 EPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNID 983
             P  D +      ++ V EL + ++ A D++  S+    +          DSP +  + D
Sbjct: 160  PPTKDWK------DESVGELKQRDMFASDDSEHSESFGSK----------DSPNQCDSKD 203

Query: 982  -------GDDVQH--NKDLPFVER-EESLT--TSSKEKDRVESEFPIQECDNNNSSRVS- 842
                     DV +  + D+P  +   ESL   T +K K   +SE   Q C     S V  
Sbjct: 204  LASTPEAEYDVAYFTDNDMPMTDLVTESLKPLTDNKIKPHPQSE---QVCIETTCSEVPV 260

Query: 841  ----CDTDGSASRRQSNESQDMSED---GESANSLRPSTAVQEDESTNSNQVGEGETAGT 683
                 D     +R  ++ES   +ED    +SAN+   S +V   E+T+SN +   + +  
Sbjct: 261  LAHVADESFGNTRETTSESITSAEDPKNSDSANAPSTSASVGCKETTSSNPLASADKSEP 320

Query: 682  VTLNSDSSPPPTTSGREEDPNTQKSEFQ--RAIHTVNILGLEEDSQTASSRSFFIQHGHG 509
               N+ S+P      R E  +  + E++  R     N      DS T SS    +Q G G
Sbjct: 321  QCHNTSSNPK-----RVEYEDLPRVEYEDIRKTEVGNF-----DSHTVSSE---VQQGVG 367

Query: 508  EXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPD 329
            E            ++ +                     SFAFP+L +EWNSSPV+M KPD
Sbjct: 368  E-TSFSVAPLGSLMSNSGRIGYSGSISHRSDSSTTSTRSFAFPILQTEWNSSPVRMAKPD 426

Query: 328  RRHLRKHRGWKLCFPCCRY 272
            R+HL+KHRGW+    CCR+
Sbjct: 427  RKHLQKHRGWRHGILCCRF 445


>ref|XP_006573175.1| PREDICTED: uncharacterized protein MAL13P1.304-like isoform X5
            [Glycine max]
          Length = 484

 Score =  128 bits (322), Expect = 1e-26
 Identities = 116/413 (28%), Positives = 176/413 (42%), Gaps = 15/413 (3%)
 Frame = -2

Query: 1465 NVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEGSYSIIKDIC 1286
            N  E+   +  +P  +P+   DL +  ++ Y DKTVTECE P L VC+KE +Y ++KDIC
Sbjct: 104  NDVESVKRSLTSPISNPAEGRDLPRNSVDGYMDKTVTECE-PHLEVCYKESNYHVVKDIC 162

Query: 1285 IDEGLPSVDKTFRENGVVPDKELFTYLES--SNELVKDIGHTTEPNIDCQFEMDGANQCV 1112
            +DEG+ + DK    N V      F + ES  + E  KD                  N  +
Sbjct: 163  VDEGVLNKDKVMFVNTVDEKAHNFFHSESYENKEKQKD------------------NISI 204

Query: 1111 SELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKDLPFVEREE 932
            + L     + +  N    + K + +   +  +I+    +   + D+V  N D P     +
Sbjct: 205  NVLSLTPTEEKAHNFFPSESKEKQK---DNTSINVLSLTPTEESDEVHANHDQPKGLMHK 261

Query: 931  SLTTSSKEKDRVESEF-PIQEC-----DNNNSSRVSCDTDGSASRRQSNESQDMSEDGES 770
                + K    V  E  P+ E      D      VS D  G    + SNE +  S+   S
Sbjct: 262  DGDATEKISGNVNKEMKPLPEDKVLLQDLLTEDSVSSDDKGE---QISNEPELHSQSEGS 318

Query: 769  ANSLR------PSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPTTSGREEDPNTQKS 608
             N++       PS A+ +DES N N + E E++   T   D S P +  G+EE       
Sbjct: 319  KNTVEEAILESPSLALADDESNNDNMLSEKESS---THQLDPSRP-SDCGKEECHQAGVC 374

Query: 607  EFQRAIHTVNILGLEEDSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXX 428
            +      T+  +  + D Q  +     I H  GE            ++Y+ P        
Sbjct: 375  KCDEIQQTMKPVEGKSDDQAVTGH---IHHSLGEASFSSIGPMSGRISYSGPVPYSGSIS 431

Query: 427  XXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRG-WKLCFPCCRY 272
                       SFAFP++ SEWNSSPV+M K DR+H RK R  W+  F CC++
Sbjct: 432  LRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRKHFRKQRWCWRDGFLCCKF 484


>ref|XP_003520134.1| PREDICTED: uncharacterized protein LOC100778990 isoform X1 [Glycine
            max] gi|571439806|ref|XP_006574962.1| PREDICTED:
            uncharacterized protein LOC100778990 isoform X2 [Glycine
            max] gi|571439809|ref|XP_006574963.1| PREDICTED:
            uncharacterized protein LOC100778990 isoform X3 [Glycine
            max] gi|571439811|ref|XP_006574964.1| PREDICTED:
            uncharacterized protein LOC100778990 isoform X4 [Glycine
            max] gi|571439813|ref|XP_006574965.1| PREDICTED:
            uncharacterized protein LOC100778990 isoform X5 [Glycine
            max] gi|734397083|gb|KHN29948.1| hypothetical protein
            glysoja_014766 [Glycine soja] gi|947122782|gb|KRH70988.1|
            hypothetical protein GLYMA_02G122700 [Glycine max]
            gi|947122783|gb|KRH70989.1| hypothetical protein
            GLYMA_02G122700 [Glycine max] gi|947122784|gb|KRH70990.1|
            hypothetical protein GLYMA_02G122700 [Glycine max]
          Length = 485

 Score =  128 bits (321), Expect = 2e-26
 Identities = 118/413 (28%), Positives = 173/413 (41%), Gaps = 15/413 (3%)
 Frame = -2

Query: 1465 NVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEGSYSIIKDIC 1286
            N  E+   +  +P  +P    +      + Y DKTVT+CE P L VC+KE +Y ++KDIC
Sbjct: 104  NDIESVKRSPTSPISNPVKGRNSPWNPEDGYMDKTVTQCE-PHLEVCYKESNYHVVKDIC 162

Query: 1285 IDEGLPSVDKTFRENGVVPDKEL---FTYLESSNELVKDIGHTTEPNIDCQFEMDGA-NQ 1118
            IDEG+   DK    N   PD E    F   +S     K   +T+   +      + A N 
Sbjct: 163  IDEGVLKKDKVMFLN---PDDEKAHNFFPSDSYENKEKQKDNTSIGVLSLIPTGEKAHNF 219

Query: 1117 CVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKDLPFVER 938
              SE  E   K +D   +               N+ S   +   D D   H++    + +
Sbjct: 220  FPSESYENKEKQKDNTSI---------------NVLSLTPTKESDKDPANHDQPKDLMHK 264

Query: 937  EESLTTSSKEKDRVESEFPIQE-----CDNNNSSRVSCDTDGSASRRQSNESQDMSEDGE 773
            +E  T   K    V  E P+ E      D      VS D  G    + SNE +  S+  E
Sbjct: 265  DEDAT--EKVSSNVNKETPLPEDKVLLQDLLAQDSVSSDDKG---EQISNEPELHSQPEE 319

Query: 772  SANSLR------PSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPTTSGREEDPNTQK 611
            S N++       PS A+++DES N N + E    G+ T   D S  P+  G+E+      
Sbjct: 320  SKNTVEEAILETPSLALEDDESNNDNVLSE---KGSFTHQLDPS-VPSDCGKEDCHQAGV 375

Query: 610  SEFQRAIHTVNILGLEEDSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXX 431
             +      T+  +  + D Q  +     ++H  GE            ++Y+ P       
Sbjct: 376  CKCDEIQQTMKPVEGKSDDQAVTGT---VRHSLGEASFSAIGPMSGRISYSGPVPYSGSI 432

Query: 430  XXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 272
                        SFAFP++ SEWNSSPV+M K DR H RK R WK  F CC++
Sbjct: 433  SLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRSHYRKQRWWKDSFLCCKF 485


>gb|KRH70993.1| hypothetical protein GLYMA_02G122700 [Glycine max]
            gi|947122788|gb|KRH70994.1| hypothetical protein
            GLYMA_02G122700 [Glycine max] gi|947122789|gb|KRH70995.1|
            hypothetical protein GLYMA_02G122700 [Glycine max]
            gi|947122790|gb|KRH70996.1| hypothetical protein
            GLYMA_02G122700 [Glycine max]
          Length = 367

 Score =  127 bits (320), Expect = 2e-26
 Identities = 114/383 (29%), Positives = 163/383 (42%), Gaps = 15/383 (3%)
 Frame = -2

Query: 1375 YTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTYL 1205
            Y DKTVT+CE P L VC+KE +Y ++KDICIDEG+   DK    N   PD E    F   
Sbjct: 16   YMDKTVTQCE-PHLEVCYKESNYHVVKDICIDEGVLKKDKVMFLN---PDDEKAHNFFPS 71

Query: 1204 ESSNELVKDIGHTTEPNIDCQFEMDGA-NQCVSELDEGNVKARDENMVSDDLKVQTRFLV 1028
            +S     K   +T+   +      + A N   SE  E   K +D   +            
Sbjct: 72   DSYENKEKQKDNTSIGVLSLIPTGEKAHNFFPSESYENKEKQKDNTSI------------ 119

Query: 1027 EGCNIDSPYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQE-----CDN 863
               N+ S   +   D D   H++    + ++E  T   K    V  E P+ E      D 
Sbjct: 120  ---NVLSLTPTKESDKDPANHDQPKDLMHKDEDAT--EKVSSNVNKETPLPEDKVLLQDL 174

Query: 862  NNSSRVSCDTDGSASRRQSNESQDMSEDGESANSLR------PSTAVQEDESTNSNQVGE 701
                 VS D  G    + SNE +  S+  ES N++       PS A+++DES N N + E
Sbjct: 175  LAQDSVSSDDKG---EQISNEPELHSQPEESKNTVEEAILETPSLALEDDESNNDNVLSE 231

Query: 700  GETAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQ 521
                G+ T   D S  P+  G+E+       +      T+  +  + D Q  +     ++
Sbjct: 232  ---KGSFTHQLDPS-VPSDCGKEDCHQAGVCKCDEIQQTMKPVEGKSDDQAVTGT---VR 284

Query: 520  HGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKM 341
            H  GE            ++Y+ P                   SFAFP++ SEWNSSPV+M
Sbjct: 285  HSLGEASFSAIGPMSGRISYSGPVPYSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRM 344

Query: 340  VKPDRRHLRKHRGWKLCFPCCRY 272
             K DR H RK R WK  F CC++
Sbjct: 345  AKADRSHYRKQRWWKDSFLCCKF 367


>gb|KRH70991.1| hypothetical protein GLYMA_02G122700 [Glycine max]
            gi|947122786|gb|KRH70992.1| hypothetical protein
            GLYMA_02G122700 [Glycine max]
          Length = 444

 Score =  127 bits (320), Expect = 2e-26
 Identities = 114/383 (29%), Positives = 163/383 (42%), Gaps = 15/383 (3%)
 Frame = -2

Query: 1375 YTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTYL 1205
            Y DKTVT+CE P L VC+KE +Y ++KDICIDEG+   DK    N   PD E    F   
Sbjct: 93   YMDKTVTQCE-PHLEVCYKESNYHVVKDICIDEGVLKKDKVMFLN---PDDEKAHNFFPS 148

Query: 1204 ESSNELVKDIGHTTEPNIDCQFEMDGA-NQCVSELDEGNVKARDENMVSDDLKVQTRFLV 1028
            +S     K   +T+   +      + A N   SE  E   K +D   +            
Sbjct: 149  DSYENKEKQKDNTSIGVLSLIPTGEKAHNFFPSESYENKEKQKDNTSI------------ 196

Query: 1027 EGCNIDSPYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQE-----CDN 863
               N+ S   +   D D   H++    + ++E  T   K    V  E P+ E      D 
Sbjct: 197  ---NVLSLTPTKESDKDPANHDQPKDLMHKDEDAT--EKVSSNVNKETPLPEDKVLLQDL 251

Query: 862  NNSSRVSCDTDGSASRRQSNESQDMSEDGESANSLR------PSTAVQEDESTNSNQVGE 701
                 VS D  G    + SNE +  S+  ES N++       PS A+++DES N N + E
Sbjct: 252  LAQDSVSSDDKG---EQISNEPELHSQPEESKNTVEEAILETPSLALEDDESNNDNVLSE 308

Query: 700  GETAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQ 521
                G+ T   D S  P+  G+E+       +      T+  +  + D Q  +     ++
Sbjct: 309  ---KGSFTHQLDPS-VPSDCGKEDCHQAGVCKCDEIQQTMKPVEGKSDDQAVTGT---VR 361

Query: 520  HGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKM 341
            H  GE            ++Y+ P                   SFAFP++ SEWNSSPV+M
Sbjct: 362  HSLGEASFSAIGPMSGRISYSGPVPYSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRM 421

Query: 340  VKPDRRHLRKHRGWKLCFPCCRY 272
             K DR H RK R WK  F CC++
Sbjct: 422  AKADRSHYRKQRWWKDSFLCCKF 444


>ref|XP_012478806.1| PREDICTED: uncharacterized protein LOC105794265 isoform X1 [Gossypium
            raimondii] gi|823157856|ref|XP_012478807.1| PREDICTED:
            uncharacterized protein LOC105794265 isoform X1
            [Gossypium raimondii] gi|823157858|ref|XP_012478808.1|
            PREDICTED: uncharacterized protein LOC105794265 isoform
            X1 [Gossypium raimondii] gi|763763266|gb|KJB30520.1|
            hypothetical protein B456_005G147700 [Gossypium
            raimondii] gi|763763269|gb|KJB30523.1| hypothetical
            protein B456_005G147700 [Gossypium raimondii]
          Length = 518

 Score =  127 bits (320), Expect = 2e-26
 Identities = 121/446 (27%), Positives = 188/446 (42%), Gaps = 42/446 (9%)
 Frame = -2

Query: 1483 DSEDSVNVAENKSANSINPFLDP---SCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1313
            D   SVN   N +      F+ P   S  +  S ++   Y DK+V E  LPEL+VC+KE 
Sbjct: 86   DCSMSVNDFSNGNEKEARDFVPPNSHSLKNMGSFQDSVFYLDKSVMEYALPELVVCYKES 145

Query: 1312 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNEL-VKDIGHTTEPNIDCQFE 1136
            +Y ++KDICIDEG+P+ DK F  + VV  K    +L S  +   K +   +E +I  Q  
Sbjct: 146  AYHVVKDICIDEGVPTQDK-FLFDSVVDKKSDCNFLPSEEDQDSKLLKEKSESDISMQAG 204

Query: 1135 MDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVE----------GCNIDSPYESCNI 986
                 +   + D  N +  ++  +SD         +E           C+ +    S  +
Sbjct: 205  SMYPEENQMDKDIDNERDSNKKTISDKCTQDISLSLEENEPKNRIPSQCDTEDLILSRKM 264

Query: 985  DGDDVQHNKDLPFVE----------REESLTTSSKEKDRVESEFPIQECDNNNSSR---- 848
              D ++  +D    E           E S           +S+   Q+C  N+  +    
Sbjct: 265  TDDTMKMARDDVSKELFTLGELLSMPELSTVKPKAMSSNCKSDGIKQQCFQNSKEKEVMV 324

Query: 847  ----VSCDTDGSASRRQS--------NESQDMSEDGESANSLRPSTAVQEDESTNSNQVG 704
                VS D +   S +++        + +++M    E A    P T+     S+  N+V 
Sbjct: 325  MPPLVSADKESDNSSKETILSASAPVSVAEEMDSRKEEATMFSPVTS-----SSLVNEVS 379

Query: 703  EGE--TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSF 530
            +     A ++    DSS    TS + E  +    E   A+ T +   LE+ +   SS + 
Sbjct: 380  DDSKLAARSIAFGFDSS--ALTSSKNEGCHNLDRE---ALETGHTPKLEDIADQPSSNN- 433

Query: 529  FIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSP 350
             +Q G+GE            ++Y+ P                   SFAFP+L SEWNSSP
Sbjct: 434  -LQCGNGESSFSAAGLVTGLISYSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSP 492

Query: 349  VKMVKPDRRHLRKHRGWKLCFPCCRY 272
            V+M K DRRH RKHRGW+    CCR+
Sbjct: 493  VRMAKADRRHYRKHRGWRQGLLCCRF 518


>ref|XP_012478809.1| PREDICTED: uncharacterized protein LOC105794265 isoform X2 [Gossypium
            raimondii] gi|823157862|ref|XP_012478810.1| PREDICTED:
            uncharacterized protein LOC105794265 isoform X2
            [Gossypium raimondii] gi|763763265|gb|KJB30519.1|
            hypothetical protein B456_005G147700 [Gossypium
            raimondii] gi|763763268|gb|KJB30522.1| hypothetical
            protein B456_005G147700 [Gossypium raimondii]
          Length = 466

 Score =  127 bits (320), Expect = 2e-26
 Identities = 121/446 (27%), Positives = 188/446 (42%), Gaps = 42/446 (9%)
 Frame = -2

Query: 1483 DSEDSVNVAENKSANSINPFLDP---SCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1313
            D   SVN   N +      F+ P   S  +  S ++   Y DK+V E  LPEL+VC+KE 
Sbjct: 34   DCSMSVNDFSNGNEKEARDFVPPNSHSLKNMGSFQDSVFYLDKSVMEYALPELVVCYKES 93

Query: 1312 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLESSNEL-VKDIGHTTEPNIDCQFE 1136
            +Y ++KDICIDEG+P+ DK F  + VV  K    +L S  +   K +   +E +I  Q  
Sbjct: 94   AYHVVKDICIDEGVPTQDK-FLFDSVVDKKSDCNFLPSEEDQDSKLLKEKSESDISMQAG 152

Query: 1135 MDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVE----------GCNIDSPYESCNI 986
                 +   + D  N +  ++  +SD         +E           C+ +    S  +
Sbjct: 153  SMYPEENQMDKDIDNERDSNKKTISDKCTQDISLSLEENEPKNRIPSQCDTEDLILSRKM 212

Query: 985  DGDDVQHNKDLPFVE----------REESLTTSSKEKDRVESEFPIQECDNNNSSR---- 848
              D ++  +D    E           E S           +S+   Q+C  N+  +    
Sbjct: 213  TDDTMKMARDDVSKELFTLGELLSMPELSTVKPKAMSSNCKSDGIKQQCFQNSKEKEVMV 272

Query: 847  ----VSCDTDGSASRRQS--------NESQDMSEDGESANSLRPSTAVQEDESTNSNQVG 704
                VS D +   S +++        + +++M    E A    P T+     S+  N+V 
Sbjct: 273  MPPLVSADKESDNSSKETILSASAPVSVAEEMDSRKEEATMFSPVTS-----SSLVNEVS 327

Query: 703  EGE--TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSF 530
            +     A ++    DSS    TS + E  +    E   A+ T +   LE+ +   SS + 
Sbjct: 328  DDSKLAARSIAFGFDSS--ALTSSKNEGCHNLDRE---ALETGHTPKLEDIADQPSSNN- 381

Query: 529  FIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSP 350
             +Q G+GE            ++Y+ P                   SFAFP+L SEWNSSP
Sbjct: 382  -LQCGNGESSFSAAGLVTGLISYSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSP 440

Query: 349  VKMVKPDRRHLRKHRGWKLCFPCCRY 272
            V+M K DRRH RKHRGW+    CCR+
Sbjct: 441  VRMAKADRRHYRKHRGWRQGLLCCRF 466


>ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 586

 Score =  119 bits (297), Expect = 1e-23
 Identities = 110/458 (24%), Positives = 187/458 (40%), Gaps = 24/458 (5%)
 Frame = -2

Query: 1573 DEILSGKETELYTEEHGNSFR----------KVPGLDDLSDSEDSVNVAENKSANSINPF 1424
            D+   G    ++++  GN F            +P  + L   +D     EN++ +S +PF
Sbjct: 152  DDQNGGLSNIIHSKRGGNPFECDTKDRDQPWNIPEYESLGFLDDK----ENETIDSDSPF 207

Query: 1423 LDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRE 1244
               S   +L       Y+DK VT+ ELPEL VC++E +++++KDIC+DEG+P+VDK   E
Sbjct: 208  TSHS---ELFDSNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLIE 264

Query: 1243 NGVVPDKELFTYLESSNELVKDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMV 1064
            +           +++  E   +    T  ++D    +   +Q  S  D  N+    +   
Sbjct: 265  SWKDGQPSTSVSVDADEEQQSN----TRKSVDMGSTIASVSQDSSFKDAKNIAVTHDT-- 318

Query: 1063 SDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEF 884
              +++     +  G N  S   + N D D   + +DL  +   +  T +S++   + +  
Sbjct: 319  --EIEATGAPVPNGFN-PSLENNANKDADKDSYLEDLLMIFGSKCTTNASEKPSSLNTVV 375

Query: 883  PIQECD--NNNSSRVSCDTDGSASRRQSNESQDMSEDGESANSLRPSTAV---------Q 737
             ++E +   ++  + +   D   S +       +S  G++ N       V          
Sbjct: 376  RVEESNIKTSDGDQSTLQPDQVPSEQTLKSQTAVSASGQTNNKGNIKEGVGTSIFDVNLT 435

Query: 736  EDESTNSNQVGEGETAGTVTLNSDSSPPPTTSGREE---DPNTQKSEFQRAIHTVNILGL 566
            + EST + + G G       L  DS  P   S  +    D N+  S+   A    N    
Sbjct: 436  KPESTKTTEGGVGN------LPEDSHMPKAVSVHKNGNSDNNSASSQVPFANTADNAHQQ 489

Query: 565  EEDSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFA 386
              +SQ  ++       G               + Y+ P                   SFA
Sbjct: 490  HLESQNMANGQSHFADGEASFSAARGPISGS-ITYSGPISYSGSVSLRSESSTTSTRSFA 548

Query: 385  FPVLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 272
            FPVL +EWNSSPV+M K +RR L K +GWK    CCR+
Sbjct: 549  FPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGILCCRF 586


>ref|XP_010554264.1| PREDICTED: uncharacterized protein LOC104824052 [Tarenaya
            hassleriana] gi|729400952|ref|XP_010554265.1| PREDICTED:
            uncharacterized protein LOC104824052 [Tarenaya
            hassleriana] gi|729400956|ref|XP_010554266.1| PREDICTED:
            uncharacterized protein LOC104824052 [Tarenaya
            hassleriana] gi|729400959|ref|XP_010554267.1| PREDICTED:
            uncharacterized protein LOC104824052 [Tarenaya
            hassleriana] gi|729400962|ref|XP_010554269.1| PREDICTED:
            uncharacterized protein LOC104824052 [Tarenaya
            hassleriana] gi|729400964|ref|XP_010554270.1| PREDICTED:
            uncharacterized protein LOC104824052 [Tarenaya
            hassleriana]
          Length = 487

 Score =  118 bits (295), Expect = 2e-23
 Identities = 111/456 (24%), Positives = 188/456 (41%), Gaps = 31/456 (6%)
 Frame = -2

Query: 1546 ELYTEEHGNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFL----DPSCDDDLSQKEME 1379
            EL+++  GN            D E   N++ ++S+  ++  +    DP   D L ++   
Sbjct: 49   ELFSDTKGNKL----------DGEKDGNLSGHRSSECLSDKIKEAGDPGKGDSLGKERSV 98

Query: 1378 LYTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTYLES 1199
             Y DK V +C+LPE++ C++E +  ++KDIC+DEG+P   + F    +  D + F   E 
Sbjct: 99   FYMDKNVMDCDLPEIVACYEEKTCHVVKDICVDEGVPL--QKFLSKKI--DSDQFG-SED 153

Query: 1198 SNELVKDIGHTTEP-------------NIDCQFEMDGANQCVSELDEGNVKARDENMVSD 1058
            S E   ++   T P              +  + + D  N  +S  +      + E  V D
Sbjct: 154  SMEAEVNVDVLTGPMTGNLESESLDIGPVKAEEDSDFCNASISGSEILTADKKSEGEVKD 213

Query: 1057 DLKVQTRFLVEGCNIDSPYESCNIDGD----DVQHNKDLPFVEREESLTTSSKEK----D 902
            D       + +   +D+P  S   + D        ++DL  +E  ++   S KE     D
Sbjct: 214  D--ANYTIVKDMPALDAPGISPESNSDRHIGGRGESEDLSEMEEVKTGGISGKESLTLAD 271

Query: 901  RVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQ------DMSEDGESANSLRPSTAV 740
             +  E   ++  N NS+     +   +  ++S+E+       D  +D ++  +L    A 
Sbjct: 272  VMSMEDGEKKPFNRNSNGPEEQSSQESRGKRSSETATLAPEVDEMDDSKTVEALSTVMAD 331

Query: 739  QEDESTNSNQVGEGETAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 560
             E   T   + GE ET+  +     +     ++ RE    T+  E QR+     +   + 
Sbjct: 332  AEAYETEKAKNGEEETSSCIENAEVTVHSLGSTSREATKTTRTGEPQRSESESYVSRHKF 391

Query: 559  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 380
              +  +   F    G GE            + Y+ P                   SFAFP
Sbjct: 392  TLEDPTDHPFPSGSGIGETSFSAAEPVSGHITYSGPISFSGSLSVRSDGSTTSTRSFAFP 451

Query: 379  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 272
            VL +EWNSSPV+M K DRRHLR  + WK    CCR+
Sbjct: 452  VLQTEWNSSPVRMAKADRRHLRSQKSWKHSLLCCRF 487


>gb|KHG16888.1| Polyribonucleotide nucleotidyltransferase [Gossypium arboreum]
          Length = 408

 Score =  117 bits (294), Expect = 2e-23
 Identities = 107/407 (26%), Positives = 175/407 (42%), Gaps = 32/407 (7%)
 Frame = -2

Query: 1396 SQKEMELYTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKEL 1217
            S ++   Y +++V E  LPEL+VC+KE +Y ++KDICID+G+ + DK   ++G   +K L
Sbjct: 30   SIQDSVFYLNRSVMESNLPELVVCYKESTYHVVKDICIDDGVLTKDKFLFDSGA-NEKFL 88

Query: 1216 FTYLESSNELVKDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTR 1037
             + ++   +LVK+       N+    E    NQ   ++D+   K   +  +  D  +Q  
Sbjct: 89   PSEMDLEAQLVKE-------NLKAHPE---GNQSGKDIDD---KCSTKKKLDADTCIQDV 135

Query: 1036 FLVEGC--NIDSPYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECD- 866
             L+E    N   PY+    D  D+  ++++    +E+++   +++  +      + E   
Sbjct: 136  SLLEESESNKGIPYQC---DSKDLILSREM----KEDAVKMITEDVSKKLYTLGLGELLL 188

Query: 865  NNNSSRVSCDTDGSASRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVGEGETAG 686
             +  S V  +   S  R    + Q+     E   ++ P+     +ES N N+        
Sbjct: 189  MSEMSTVKAEIVCSDCRSDGTQQQNFQNLSEKEATVMPALVSPVEESNNGNEEAILSAPA 248

Query: 685  TVTLNSDSS---------PPPTTSGREE-----------DPNTQKSEFQRAIHTVNILGL 566
             V+   +S           P   S  EE           D + Q S   R  H +++  L
Sbjct: 249  LVSAAEESEHGKWEATLISPVLASASEESTGSRIVDEVSDSSAQTSSKDRCCHNLDLEPL 308

Query: 565  EEDS---------QTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXX 413
               S         Q  SS    +Q G+GE            + Y+ P             
Sbjct: 309  ASGSTPKVEDPADQLLSSN---LQRGYGECSFSAAGL----ITYSGPIAYSGSLSHRSDS 361

Query: 412  XXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 272
                  SFAFP+L SEWNSSPV+M K +RRH RKHRGW+  F CCR+
Sbjct: 362  STTSTRSFAFPILQSEWNSSPVRMAKAERRHYRKHRGWRQGFLCCRF 408


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  117 bits (292), Expect = 4e-23
 Identities = 113/440 (25%), Positives = 180/440 (40%), Gaps = 12/440 (2%)
 Frame = -2

Query: 1555 KETELYTEEHGNSFRKVPGLD-DLSDSEDSVNVAENKSANSINPFLDPSCD---DDLSQK 1388
            ++ EL   E+G +   V  L  D    E+  N A  K  ++ +      CD   D   +K
Sbjct: 31   EDAELKVPENGKNNNNVCELFYDTRSGEEWENEAGKKVRDTSH-----DCDANVDSPEKK 85

Query: 1387 EMELYTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTY 1208
            +   Y DK VT C+LPE++VC+KE +Y I+KDIC+DEG+P           V +K LF  
Sbjct: 86   DPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVP-----------VQEKFLFGE 134

Query: 1207 LES-SNELVKDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFL 1031
             +S  +   +D+    + N++   E   A   +S++D+      D     D  +      
Sbjct: 135  KDSVKSSSTEDLMKADKTNVN-PSETKSAEDSISKVDDSEF-CNDHKTDRDVEESSGEDF 192

Query: 1030 VEGCNIDSPYESCN-IDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQEC----- 869
             +     S Y   + I  ++V+ +        E     +SK++  +  +   +EC     
Sbjct: 193  ADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSKDEVAISQDNDSKECLTLGD 252

Query: 868  -DNNNSSRVSCDTDGSASRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVGEGET 692
              +    + S + D  +S     +S    +D E   SL  +    E E T   + GE + 
Sbjct: 253  ILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEK-RSLETTAIETELEKTEEPKQGEEKL 311

Query: 691  AGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHGH 512
            +   T  + S  P  T    E P T+    Q  +    +    ED + +SSR      G 
Sbjct: 312  SSVST--TTSQEPNKTCNEPEKPETENHHQQNCL----VENSYEDDKFSSSR-----FGE 360

Query: 511  GEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKP 332
                          + Y+ P                   SFAFP+L SEWNSSPV+M K 
Sbjct: 361  TSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKA 420

Query: 331  DRRHLRKHRGWKLCFPCCRY 272
            D+R  R+  GW+    CCR+
Sbjct: 421  DKR--RQKGGWRHTLLCCRF 438

