BLASTX nr result

ID: Papaver32_contig00011162 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00011162
         (2335 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010257928.1 PREDICTED: uncharacterized protein LOC104597869 i...   220   8e-61
XP_010257925.1 PREDICTED: uncharacterized protein LOC104597869 i...   220   2e-60
XP_010261085.1 PREDICTED: uncharacterized protein LOC104599949 [...   189   4e-49
OMO72168.1 hypothetical protein COLO4_27800 [Corchorus olitorius]     149   1e-34
XP_018810844.1 PREDICTED: uncharacterized protein LOC108983598 i...   146   1e-33
XP_018810843.1 PREDICTED: uncharacterized protein LOC108983598 i...   146   1e-33
XP_016705234.1 PREDICTED: uncharacterized protein LOC107920186 [...   145   2e-33
EOY01582.1 18S pre-ribosomal assembly protein gar2-related, puta...   144   3e-33
XP_007045750.2 PREDICTED: uncharacterized protein LOC18610175 is...   144   5e-33
XP_012464097.1 PREDICTED: uncharacterized protein LOC105783281 [...   144   6e-33
EOY01581.1 18S pre-ribosomal assembly protein gar2-related, puta...   144   8e-33
XP_017971961.1 PREDICTED: uncharacterized protein LOC18610175 is...   144   1e-32
XP_017971960.1 PREDICTED: uncharacterized protein LOC18610175 is...   144   1e-32
XP_007045751.2 PREDICTED: uncharacterized protein LOC18610175 is...   144   1e-32
XP_017607955.1 PREDICTED: uncharacterized protein LOC108454133 [...   143   1e-32
KHG21027.1 Formate--tetrahydrofolate ligase [Gossypium arboreum]      143   1e-32
XP_016675638.1 PREDICTED: uncharacterized protein LOC107894985 [...   142   3e-32
XP_016900313.1 PREDICTED: uncharacterized protein LOC103489197 i...   137   5e-31
XP_006573172.1 PREDICTED: uncharacterized protein LOC100796112 [...   134   2e-29
XP_016667087.1 PREDICTED: uncharacterized protein LOC107887384 i...   132   2e-29

>XP_010257928.1 PREDICTED: uncharacterized protein LOC104597869 isoform X2 [Nelumbo
            nucifera]
          Length = 415

 Score =  220 bits (561), Expect = 8e-61
 Identities = 151/428 (35%), Positives = 210/428 (49%), Gaps = 7/428 (1%)
 Frame = -3

Query: 1559 EEHGNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVT 1380
            ++ G + R V GL D    +D +N  E K  + I  ++ PS +  LS+K  + YTDK+V 
Sbjct: 27   KQTGENVRNVKGLHDFVSMDDLINGREGKIGDHIPTYVLPSGEIKLSEKVTKFYTDKSVM 86

Query: 1379 ECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDI 1200
            ECE+PELIVCFKEG Y ++KDIC+DEG+PS DK   ENG V  K    C   S+      
Sbjct: 87   ECEVPELIVCFKEGPYHVVKDICVDEGVPSQDKILTENGQVDCKP---CSMHSD------ 137

Query: 1199 GHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYES 1020
                + N D   +M G+    S++ +  V++  E         +  F  +  N D     
Sbjct: 138  ---LDVNSDLTKQMVGSVTLDSDVMKSLVQSDCEKNTDSQCNSKDLFQKDEKNADVE--- 191

Query: 1019 CNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCD--TDGSA 846
                 D++ H   L      E++ +  K K                 +  SC   T+  +
Sbjct: 192  -----DEIAHAHILDKKVMSENMLSVGKLK-----------------TEKSCPELTNFDS 229

Query: 845  SRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPT-T 669
            +  Q   +QDMS +G  ANS  PS A + D S   N+V         ++  DS+P  + T
Sbjct: 230  NGEQQAHNQDMSREGTLANSAVPSPAAESDSSNPDNKVPLNSKVENRSITFDSNPSTSAT 289

Query: 668  SGREEDPNTQKSEFQRAIHTV-NILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXXXX 501
            SGR E  + QK++  + +HT+ N   LE+   +S TASSRSFFIQHGHGE          
Sbjct: 290  SGRVE--SKQKADSPQPLHTLLNTSRLEDGPVESLTASSRSFFIQHGHGESSFSAVGPMS 347

Query: 500  XPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWK 321
              + Y+ P                   SFAFP+LHSEWNSSPVKM K D+RH RKHR WK
Sbjct: 348  GSITYSGPIPYSGSISLRSDSSTTSNRSFAFPILHSEWNSSPVKMAKADQRHFRKHRRWK 407

Query: 320  LCFPCCRY 297
            + F CC +
Sbjct: 408  MNFLCCSF 415


>XP_010257925.1 PREDICTED: uncharacterized protein LOC104597869 isoform X1 [Nelumbo
            nucifera] XP_010257926.1 PREDICTED: uncharacterized
            protein LOC104597869 isoform X1 [Nelumbo nucifera]
            XP_010257927.1 PREDICTED: uncharacterized protein
            LOC104597869 isoform X1 [Nelumbo nucifera]
          Length = 453

 Score =  220 bits (561), Expect = 2e-60
 Identities = 151/428 (35%), Positives = 210/428 (49%), Gaps = 7/428 (1%)
 Frame = -3

Query: 1559 EEHGNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVT 1380
            ++ G + R V GL D    +D +N  E K  + I  ++ PS +  LS+K  + YTDK+V 
Sbjct: 65   KQTGENVRNVKGLHDFVSMDDLINGREGKIGDHIPTYVLPSGEIKLSEKVTKFYTDKSVM 124

Query: 1379 ECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDI 1200
            ECE+PELIVCFKEG Y ++KDIC+DEG+PS DK   ENG V  K    C   S+      
Sbjct: 125  ECEVPELIVCFKEGPYHVVKDICVDEGVPSQDKILTENGQVDCKP---CSMHSD------ 175

Query: 1199 GHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYES 1020
                + N D   +M G+    S++ +  V++  E         +  F  +  N D     
Sbjct: 176  ---LDVNSDLTKQMVGSVTLDSDVMKSLVQSDCEKNTDSQCNSKDLFQKDEKNADVE--- 229

Query: 1019 CNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCD--TDGSA 846
                 D++ H   L      E++ +  K K                 +  SC   T+  +
Sbjct: 230  -----DEIAHAHILDKKVMSENMLSVGKLK-----------------TEKSCPELTNFDS 267

Query: 845  SRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVGEGETAGTVTLNSDSSPPPT-T 669
            +  Q   +QDMS +G  ANS  PS A + D S   N+V         ++  DS+P  + T
Sbjct: 268  NGEQQAHNQDMSREGTLANSAVPSPAAESDSSNPDNKVPLNSKVENRSITFDSNPSTSAT 327

Query: 668  SGREEDPNTQKSEFQRAIHTV-NILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXXXX 501
            SGR E  + QK++  + +HT+ N   LE+   +S TASSRSFFIQHGHGE          
Sbjct: 328  SGRVE--SKQKADSPQPLHTLLNTSRLEDGPVESLTASSRSFFIQHGHGESSFSAVGPMS 385

Query: 500  XPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWK 321
              + Y+ P                   SFAFP+LHSEWNSSPVKM K D+RH RKHR WK
Sbjct: 386  GSITYSGPIPYSGSISLRSDSSTTSNRSFAFPILHSEWNSSPVKMAKADQRHFRKHRRWK 445

Query: 320  LCFPCCRY 297
            + F CC +
Sbjct: 446  MNFLCCSF 453


>XP_010261085.1 PREDICTED: uncharacterized protein LOC104599949 [Nelumbo nucifera]
            XP_019053958.1 PREDICTED: uncharacterized protein
            LOC104599949 [Nelumbo nucifera] XP_019053959.1 PREDICTED:
            uncharacterized protein LOC104599949 [Nelumbo nucifera]
            XP_019053961.1 PREDICTED: uncharacterized protein
            LOC104599949 [Nelumbo nucifera] XP_019053962.1 PREDICTED:
            uncharacterized protein LOC104599949 [Nelumbo nucifera]
            XP_019053963.1 PREDICTED: uncharacterized protein
            LOC104599949 [Nelumbo nucifera] XP_019053964.1 PREDICTED:
            uncharacterized protein LOC104599949 [Nelumbo nucifera]
          Length = 447

 Score =  189 bits (480), Expect = 4e-49
 Identities = 144/429 (33%), Positives = 205/429 (47%), Gaps = 8/429 (1%)
 Frame = -3

Query: 1559 EEH-GNSFRKVPGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTV 1383
            E+H G S R V GL D   +++ +N  EN++ +S   ++ PS +  LS+K    YTDK V
Sbjct: 65   EKHTGESLRNVKGLHDFVRTDNLINGKENETGDSAPMYVLPSGETKLSEKVTGFYTDKVV 124

Query: 1382 TECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKD 1203
             ECELP+L V FKE  Y ++KDICIDEG+PS+DK   EN  V  K  F            
Sbjct: 125  MECELPDLTVGFKEDPYRVVKDICIDEGVPSLDKILTENDEVDYKSCFP----------- 173

Query: 1202 IGHT-TEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEG-CNIDSP 1029
              HT  + N D   E D     ++E+                     + LVE  CN D  
Sbjct: 174  --HTGLDVNSDLTKEKDSVLPSLNEM---------------------KSLVESYCNKDI- 209

Query: 1028 YESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGS 849
               CN    +V H KD  +V+ EE  T  +   + +    P+ + D  +S     +   +
Sbjct: 210  LNQCN---SEVLHQKD-EYVD-EEDKTAHNSTDEVIPGSVPLGKLDTEDSYIKPSNFGSN 264

Query: 848  ASRRQSNESQDMSEDGESANSLRPSTAVQEDESTNSNQVG-EGETAGTVTLNSDSSPPPT 672
              ++QSN  QD S++  +      S   + D+S  +N+V    +     T+ S     PT
Sbjct: 265  KDQQQSN--QDSSKEAPAEKYGISSPTEESDDSNPANKVPFNNKVENGSTIMSFHPSKPT 322

Query: 671  TSGREEDPNTQKSEFQRAIH-TVNILGLEE---DSQTASSRSFFIQHGHGEXXXXXXXXX 504
            T  REE   + K++  + +H  +++  LE+   DS T SSRS  IQHGHGE         
Sbjct: 323  T--REE--TSTKADSPQPLHILLSMSRLEDGTVDSLTGSSRSLCIQHGHGESSFSAAGPM 378

Query: 503  XXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGW 324
               + Y+ P                   SFAFP+LHSEWNSSPVKM K +RR+  KHRGW
Sbjct: 379  SGSITYSGPVPYSGSISLRSDSSTTSTRSFAFPILHSEWNSSPVKMAKANRRNFHKHRGW 438

Query: 323  KLCFPCCRY 297
            ++   CCR+
Sbjct: 439  RMNLLCCRF 447


>OMO72168.1 hypothetical protein COLO4_27800 [Corchorus olitorius]
          Length = 503

 Score =  149 bits (376), Expect = 1e-34
 Identities = 135/472 (28%), Positives = 203/472 (43%), Gaps = 51/472 (10%)
 Frame = -3

Query: 1559 EEHGNSFRKVPGLDDLSDS---------------EDSVNVAENKSANSINPFLDPSCDDD 1425
            E+     R + G D  SDS               + S++V E  + N    F D    D 
Sbjct: 52   EKQNGVMRDIKGNDGDSDSLCLENTRDGWPASKLDSSMHVNEFGNGNE-KEFRDFVTSDS 110

Query: 1424 LSQKEME------LYTDKTVTECELPELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENG 1263
             S K+M+       Y DK+V EC+LPEL+VC+KE +Y ++KDICIDEG+P+ DK   E+ 
Sbjct: 111  HSSKKMDSLQGSVFYLDKSVMECDLPELVVCYKENTYHVVKDICIDEGVPTQDKFLFESD 170

Query: 1262 V--------------VPDKELFTCLKSSNELVKDIGHTTEPN--IDC----QFEMDGANQ 1143
            +              V +K+        ++  K+I +  + N  +D     Q E +  NQ
Sbjct: 171  MNEKNNCNFLPSCKLVEEKQDIPISSPEDQSGKNIDNGCDFNEKLDADACRQDESNKGNQ 230

Query: 1142 CVSELDEGNVKARDENM--VSDDLKVQTRFL------VEGCNIDSPYESCNIDGDDVQHN 987
            C  E      K +DE M  + DDL  +   L       E   + S   S     D ++  
Sbjct: 231  CDFEDFMMKRKVKDEEMKTIPDDLSKELFTLGELLSMTELSTVTSKAMSSECKSDGIE-- 288

Query: 986  KDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDMSE 807
                    ++S+ +SS+++  V         ++NN++    D  G  S   + ES +  E
Sbjct: 289  --------QQSIQSSSEKEVNVNPPSVFVAEESNNNTEAMLDAPGLISA--AGESDNGKE 338

Query: 806  DGESANSLRPSTAVQEDESTNSNQVGEGETAGT--VTLNSDSSPPPTTSGREEDPNTQKS 633
            D    ++ + S + +   +T SN+V +     T  +T N  SS P  T+ ++E       
Sbjct: 339  DAIPISTSQVSVSEESTNNTLSNEVSDDNRLETESITFNFGSSAP--TNSKDECRPNLNC 396

Query: 632  EFQRAIHTVNILGLEEDSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXX 453
            E      T     LE+ +    S    +Q G GE            ++Y+ P        
Sbjct: 397  ELPETGTTPK---LEDTADQPISN--ILQRGTGETSFSASGPVTGLISYSGPIAYSGSLS 451

Query: 452  XXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
                       SFAFPVL SEWNSSPV+M K DRRH RKHRGW+    CCR+
Sbjct: 452  LRSDSSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRHGLFCCRF 503


>XP_018810844.1 PREDICTED: uncharacterized protein LOC108983598 isoform X2 [Juglans
            regia]
          Length = 509

 Score =  146 bits (369), Expect = 1e-33
 Identities = 134/453 (29%), Positives = 192/453 (42%), Gaps = 44/453 (9%)
 Frame = -3

Query: 1523 LDDL-SDSEDSVN--VAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIV 1353
            +DDL ++SED V   VA +  ++    F D         K+ +   DK V ECELPEL V
Sbjct: 80   MDDLKNESEDEVRDFVASHTHSSRKTGFFD---------KDSDFVMDKGVMECELPELTV 130

Query: 1352 CFKEGSYSIIKDICIDEGLPSVDKTF------------------RENGVVPDKELFTCLK 1227
            C+KE  Y ++KDICIDEG+PS +K                     +N V+  ++  T + 
Sbjct: 131  CYKENGYHVVKDICIDEGVPSQEKILFGSGRDTKTVLIVHPPEKDQNKVLLKEKEDTEIY 190

Query: 1226 SSNELVKDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEG 1047
            S +EL+    + ++ N   QF+     Q   +  E  +    E  +    K+    ++E 
Sbjct: 191  SPDELMFSSENDSKKNSANQFDSKDLIQTEEDSTESILNDATEERLLPGNKLP---MLER 247

Query: 1046 CNIDSPYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVS 867
                      NID  DV+ +     +  E  +  S      VE        ++NNS R+S
Sbjct: 248  DKCAFHLNCLNIDSKDVEQHPS-QVISGENVILASPALVSGVE--------ESNNSGRIS 298

Query: 866  C---DTDGSASRRQSNESQDMSEDG----ESANSLRPSTAVQ----------EDESTNSN 738
                 T   A++  +N + D    G     SA     ST  Q           +ES NS+
Sbjct: 299  MLASSTSVYAAKESNNSAVDSMLAGPALVSSAEETNHSTGAQILATPNLVSAAEESNNSS 358

Query: 737  QVGE-----GETAGTVTLNSDSSPPPTTSGREEDPNTQK-SEFQRAIHTVNILGLEEDSQ 576
             V E      E  G +T +SDS  P   S R+E P TQ  S+F+  I   +     +   
Sbjct: 359  PVNEFFYNSKEERGGITFDSDSLAP-AASARQEGPETQNTSKFENLISDSHDTDSRQLHH 417

Query: 575  TASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLH 396
            +    SF    G GE            + Y+ P                   SFAFP+L 
Sbjct: 418  SQGETSFSAA-GQGEEVFPVVGTFSSLINYSGPIAYSGNVSLRSDSSATSTRSFAFPILQ 476

Query: 395  SEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            SEWNSSPV+M K DRRHLRKH+ W+  F CCR+
Sbjct: 477  SEWNSSPVRMAKADRRHLRKHKCWRKGFLCCRF 509


>XP_018810843.1 PREDICTED: uncharacterized protein LOC108983598 isoform X1 [Juglans
            regia]
          Length = 518

 Score =  146 bits (369), Expect = 1e-33
 Identities = 134/453 (29%), Positives = 192/453 (42%), Gaps = 44/453 (9%)
 Frame = -3

Query: 1523 LDDL-SDSEDSVN--VAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIV 1353
            +DDL ++SED V   VA +  ++    F D         K+ +   DK V ECELPEL V
Sbjct: 89   MDDLKNESEDEVRDFVASHTHSSRKTGFFD---------KDSDFVMDKGVMECELPELTV 139

Query: 1352 CFKEGSYSIIKDICIDEGLPSVDKTF------------------RENGVVPDKELFTCLK 1227
            C+KE  Y ++KDICIDEG+PS +K                     +N V+  ++  T + 
Sbjct: 140  CYKENGYHVVKDICIDEGVPSQEKILFGSGRDTKTVLIVHPPEKDQNKVLLKEKEDTEIY 199

Query: 1226 SSNELVKDIGHTTEPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEG 1047
            S +EL+    + ++ N   QF+     Q   +  E  +    E  +    K+    ++E 
Sbjct: 200  SPDELMFSSENDSKKNSANQFDSKDLIQTEEDSTESILNDATEERLLPGNKLP---MLER 256

Query: 1046 CNIDSPYESCNIDGDDVQHNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVS 867
                      NID  DV+ +     +  E  +  S      VE        ++NNS R+S
Sbjct: 257  DKCAFHLNCLNIDSKDVEQHPS-QVISGENVILASPALVSGVE--------ESNNSGRIS 307

Query: 866  C---DTDGSASRRQSNESQDMSEDG----ESANSLRPSTAVQ----------EDESTNSN 738
                 T   A++  +N + D    G     SA     ST  Q           +ES NS+
Sbjct: 308  MLASSTSVYAAKESNNSAVDSMLAGPALVSSAEETNHSTGAQILATPNLVSAAEESNNSS 367

Query: 737  QVGE-----GETAGTVTLNSDSSPPPTTSGREEDPNTQK-SEFQRAIHTVNILGLEEDSQ 576
             V E      E  G +T +SDS  P   S R+E P TQ  S+F+  I   +     +   
Sbjct: 368  PVNEFFYNSKEERGGITFDSDSLAP-AASARQEGPETQNTSKFENLISDSHDTDSRQLHH 426

Query: 575  TASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLH 396
            +    SF    G GE            + Y+ P                   SFAFP+L 
Sbjct: 427  SQGETSFSAA-GQGEEVFPVVGTFSSLINYSGPIAYSGNVSLRSDSSATSTRSFAFPILQ 485

Query: 395  SEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            SEWNSSPV+M K DRRHLRKH+ W+  F CCR+
Sbjct: 486  SEWNSSPVRMAKADRRHLRKHKCWRKGFLCCRF 518


>XP_016705234.1 PREDICTED: uncharacterized protein LOC107920186 [Gossypium hirsutum]
            XP_016705235.1 PREDICTED: uncharacterized protein
            LOC107920186 [Gossypium hirsutum] XP_016705236.1
            PREDICTED: uncharacterized protein LOC107920186
            [Gossypium hirsutum]
          Length = 505

 Score =  145 bits (367), Expect = 2e-33
 Identities = 123/441 (27%), Positives = 184/441 (41%), Gaps = 34/441 (7%)
 Frame = -3

Query: 1517 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D S+S    +    K       F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVLDFSNGNEKEVRDFVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTCLKSSNELVKDIGHTTEPNIDCQ 1167
            +Y ++KDICIDEG+P+ D    E+ V    E    +      NEL+K++  T  P  D  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDMPMQDIS 206

Query: 1166 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 993
            F  +  NQ   ++D   G+ K  D +    D+ +          I + +     D  D+ 
Sbjct: 207  FSPE-ENQSGKDIDNEGGSNKKLDADTYMQDIALSLEENKSNKGISNEW-----DPRDLL 260

Query: 992  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 813
              +D+     E      SKE   +     + E     S  +S D       +QS E+   
Sbjct: 261  VTRDMKDDAMEMMSNDGSKELFTLGDILSLPELTTLKSEAMSPDCKSDRIEQQSFENSSK 320

Query: 812  SE------------------------DGE-----SANSLRPSTAVQEDESTNSNQVGEGE 720
             E                        +G       A  + P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASAVEESNNLILSAPALVSTAEGSDSGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 719  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 540
              G++T +S SS P  TSG+  +         + + T     LEE +    S +  +Q+G
Sbjct: 379  -TGSITFDSRSSAP--TSGKGSN---------KPLETGRTSKLEETADQPFSSN--LQNG 424

Query: 539  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 360
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPIAYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 359  PDRRHLRKHRGWKLCFPCCRY 297
             DRR  R+HRGW+  F CCR+
Sbjct: 485  ADRRQYRRHRGWRQGFLCCRF 505


>EOY01582.1 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
            [Theobroma cacao] EOY01583.1 18S pre-ribosomal assembly
            protein gar2-related, putative isoform 2 [Theobroma
            cacao] EOY01584.1 18S pre-ribosomal assembly protein
            gar2-related, putative isoform 2 [Theobroma cacao]
          Length = 470

 Score =  144 bits (363), Expect = 3e-33
 Identities = 134/456 (29%), Positives = 199/456 (43%), Gaps = 52/456 (11%)
 Frame = -3

Query: 1508 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 30   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 89

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEM 1158
            +Y ++KDICIDEG+P+ DK   E G+    E   C    +E  +D    TE     + E 
Sbjct: 90   TYHVVKDICIDEGVPTQDKFLFETGM---DEKIDCNFLPSEKEQDSQLMTE-----KLET 141

Query: 1157 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 981
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 142  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 201

Query: 980  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 846
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 202  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 261

Query: 845  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 750
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 262  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEES 321

Query: 749  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 585
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 322  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 376

Query: 584  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 405
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 377  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 434

Query: 404  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 435  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>XP_007045750.2 PREDICTED: uncharacterized protein LOC18610175 isoform X4 [Theobroma
            cacao] XP_007045752.2 PREDICTED: uncharacterized protein
            LOC18610175 isoform X4 [Theobroma cacao]
          Length = 470

 Score =  144 bits (362), Expect = 5e-33
 Identities = 134/456 (29%), Positives = 199/456 (43%), Gaps = 52/456 (11%)
 Frame = -3

Query: 1508 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 30   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 89

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEM 1158
            +Y ++KDICIDEG+P+ DK   E G+    E   C    +E  +D    TE     + E 
Sbjct: 90   TYHVVKDICIDEGVPTQDKFLFETGM---DEKIDCNFLPSEKEQDSQLMTE-----KLET 141

Query: 1157 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 981
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 142  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 201

Query: 980  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 846
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 202  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 261

Query: 845  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 750
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 262  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTPEES 321

Query: 749  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 585
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 322  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 376

Query: 584  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 405
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 377  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 434

Query: 404  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 435  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470


>XP_012464097.1 PREDICTED: uncharacterized protein LOC105783281 [Gossypium raimondii]
            XP_012464099.1 PREDICTED: uncharacterized protein
            LOC105783281 [Gossypium raimondii] KJB80435.1
            hypothetical protein B456_013G097400 [Gossypium
            raimondii] KJB80436.1 hypothetical protein
            B456_013G097400 [Gossypium raimondii]
          Length = 505

 Score =  144 bits (363), Expect = 6e-33
 Identities = 122/441 (27%), Positives = 182/441 (41%), Gaps = 34/441 (7%)
 Frame = -3

Query: 1517 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D S+S    +    K       F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVHDFSNGNEKEVRDFVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTCLKSSNELVKDIGHTTEPNIDCQ 1167
            +Y ++KDICIDEG+P+ D    E+ V    E    +      NEL+K++  T  P  D  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDMPMQDIS 206

Query: 1166 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 993
            F  +  NQ   ++D   G+ K  D +    D+ +          I + +     D  D+ 
Sbjct: 207  FSPE-ENQSGKDIDNECGSNKKLDADTYMQDIALSLEENKSNKGIPNEW-----DPRDLL 260

Query: 992  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 813
              +D+     E      SKE   +     + E     S  +S D       +QS E+   
Sbjct: 261  VTRDMKDDAMEMMSNDGSKELFTLGDILSLPELATLKSEAMSPDCKSDRIEQQSFENSSK 320

Query: 812  SE------------------------DGE-----SANSLRPSTAVQEDESTNSNQVGEGE 720
             E                        +G       A  + P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASAVEESNNLILSAPALVSTAEGSDIGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 719  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 540
              G++T +S SS P  TSG+  +   +     +         LEE +    S +  +Q G
Sbjct: 379  -TGSITFDSRSSAP--TSGKGSNKPLEAGRTSK---------LEETADQPFSSN--LQSG 424

Query: 539  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 360
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPIAYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 359  PDRRHLRKHRGWKLCFPCCRY 297
             DRR  R+HRGW+  F CCR+
Sbjct: 485  ADRRQYRRHRGWRQGFLCCRF 505


>EOY01581.1 18S pre-ribosomal assembly protein gar2-related, putative isoform 1
            [Theobroma cacao]
          Length = 527

 Score =  144 bits (363), Expect = 8e-33
 Identities = 134/456 (29%), Positives = 199/456 (43%), Gaps = 52/456 (11%)
 Frame = -3

Query: 1508 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 87   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEM 1158
            +Y ++KDICIDEG+P+ DK   E G+    E   C    +E  +D    TE     + E 
Sbjct: 147  TYHVVKDICIDEGVPTQDKFLFETGM---DEKIDCNFLPSEKEQDSQLMTE-----KLET 198

Query: 1157 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 981
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 199  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 258

Query: 980  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 846
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 259  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 318

Query: 845  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 750
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 319  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEES 378

Query: 749  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 585
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 379  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 433

Query: 584  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 405
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 434  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 491

Query: 404  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 492  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527


>XP_017971961.1 PREDICTED: uncharacterized protein LOC18610175 isoform X3 [Theobroma
            cacao]
          Length = 527

 Score =  144 bits (362), Expect = 1e-32
 Identities = 134/456 (29%), Positives = 199/456 (43%), Gaps = 52/456 (11%)
 Frame = -3

Query: 1508 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 87   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEM 1158
            +Y ++KDICIDEG+P+ DK   E G+    E   C    +E  +D    TE     + E 
Sbjct: 147  TYHVVKDICIDEGVPTQDKFLFETGM---DEKIDCNFLPSEKEQDSQLMTE-----KLET 198

Query: 1157 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 981
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 199  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 258

Query: 980  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 846
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 259  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 318

Query: 845  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 750
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 319  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTPEES 378

Query: 749  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 585
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 379  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 433

Query: 584  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 405
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 434  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 491

Query: 404  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 492  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527


>XP_017971960.1 PREDICTED: uncharacterized protein LOC18610175 isoform X2 [Theobroma
            cacao]
          Length = 538

 Score =  144 bits (362), Expect = 1e-32
 Identities = 134/456 (29%), Positives = 199/456 (43%), Gaps = 52/456 (11%)
 Frame = -3

Query: 1508 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 98   DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 157

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEM 1158
            +Y ++KDICIDEG+P+ DK   E G+    E   C    +E  +D    TE     + E 
Sbjct: 158  TYHVVKDICIDEGVPTQDKFLFETGM---DEKIDCNFLPSEKEQDSQLMTE-----KLET 209

Query: 1157 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 981
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 210  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 269

Query: 980  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 846
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 270  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 329

Query: 845  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 750
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 330  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTPEES 389

Query: 749  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 585
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 390  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 444

Query: 584  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 405
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 445  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 502

Query: 404  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 503  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 538


>XP_007045751.2 PREDICTED: uncharacterized protein LOC18610175 isoform X1 [Theobroma
            cacao]
          Length = 543

 Score =  144 bits (362), Expect = 1e-32
 Identities = 134/456 (29%), Positives = 199/456 (43%), Gaps = 52/456 (11%)
 Frame = -3

Query: 1508 DSEDSVNVAENKSANSINPFL---DPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D   SVN   N +   +  F+    PS  +  S +    Y DK+V ECELPEL+VC+KE 
Sbjct: 103  DCSISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKES 162

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEM 1158
            +Y ++KDICIDEG+P+ DK   E G+    E   C    +E  +D    TE     + E 
Sbjct: 163  TYHVVKDICIDEGVPTQDKFLFETGM---DEKIDCNFLPSEKEQDSQLMTE-----KLET 214

Query: 1157 DGANQCVSELDEGNVKARD-ENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKD 981
            D   Q VS     N   +D +N    + KV T   ++  ++       N    +   +KD
Sbjct: 215  DMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKD 274

Query: 980  LPFV-----EREESLTTS-SKEKDRVESEFPIQECDNNNSSRVS--CDTDG-------SA 846
            L        +  + +T   SKE   +     + E    NS  +S  C +DG       S+
Sbjct: 275  LMLTRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSS 334

Query: 845  SRRQS----------NESQDMSEDG-----------ESANS-------LRPSTAVQEDES 750
            S+++            ES+D +E+            E  +S       + P+     +ES
Sbjct: 335  SKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTPEES 394

Query: 749  TNSNQVGEGE-----TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEE 585
            T+S+ V E         G++T N DSS P  TS ++E  +   SE    + T +   LE 
Sbjct: 395  TSSSLVNEVSYDNKLETGSITFNLDSSAP--TSSKDECHHNLDSE---PLGTGSTPKLEV 449

Query: 584  DSQTASSRSFFIQHGHGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFP 405
             +  + S +  +Q G GE            ++Y+ P                   SFAFP
Sbjct: 450  AADQSISNN--LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFP 507

Query: 404  VLHSEWNSSPVKMVKPDRRHLRKHRGWKLCFPCCRY 297
            +L SEWN SPV+M K DRRH RKH+GW+    CCR+
Sbjct: 508  ILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 543


>XP_017607955.1 PREDICTED: uncharacterized protein LOC108454133 [Gossypium arboreum]
            XP_017607956.1 PREDICTED: uncharacterized protein
            LOC108454133 [Gossypium arboreum] XP_017607957.1
            PREDICTED: uncharacterized protein LOC108454133
            [Gossypium arboreum] XP_017607959.1 PREDICTED:
            uncharacterized protein LOC108454133 [Gossypium arboreum]
          Length = 505

 Score =  143 bits (360), Expect = 1e-32
 Identities = 121/441 (27%), Positives = 184/441 (41%), Gaps = 34/441 (7%)
 Frame = -3

Query: 1517 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D S+S    +    K    I  F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVHDFSNGNEKEVRDIVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTCLKSSNELVKDIGHTTEPNIDCQ 1167
            +Y ++KDICIDEG+P+ D    E+ V    E    +      NEL+K++  T  P  +  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDIPMQNIS 206

Query: 1166 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 993
            F  +  NQ   ++D   G+ K  + +    D+ +          I + +     D  D+ 
Sbjct: 207  FSPE-ENQSGKDIDNDCGSNKKLNADTYMQDIALSLEENKSNKGIPNEW-----DPRDLL 260

Query: 992  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 813
              +D+     E      SKE   +       E     S  +S D     + +QS E+   
Sbjct: 261  VTRDMKDDAMEMMSNEGSKELFILGDILSFPELTTLKSEAMSPDFKSDRNEQQSFENSSK 320

Query: 812  SE-----DGESANSL------------------------RPSTAVQEDESTNSNQVGEGE 720
             E     + E +N+L                         P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASEVEDSNNLILSAPALASTAEGSDSGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 719  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 540
              G++T +S SS P +  G  E   T ++             LEE +    S +  +Q G
Sbjct: 379  -TGSITFDSRSSAPTSGKGSSEPLETGRTS-----------KLEETADQPFSSN--LQSG 424

Query: 539  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 360
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPITYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 359  PDRRHLRKHRGWKLCFPCCRY 297
             D+R  R+HRGW+  F CCR+
Sbjct: 485  ADQRQYRRHRGWRQGFLCCRF 505


>KHG21027.1 Formate--tetrahydrofolate ligase [Gossypium arboreum]
          Length = 505

 Score =  143 bits (360), Expect = 1e-32
 Identities = 121/441 (27%), Positives = 184/441 (41%), Gaps = 34/441 (7%)
 Frame = -3

Query: 1517 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D S+S    +    K    I  F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVHDFSNGNEKEVRDIVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTCLKSSNELVKDIGHTTEPNIDCQ 1167
            +Y ++KDICIDEG+P+ D    E+ V    E    +      NEL+K++  T  P  +  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDIPMQNIS 206

Query: 1166 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 993
            F  +  NQ   ++D   G+ K  + +    D+ +          I + +     D  D+ 
Sbjct: 207  FSPE-ENQSGKDIDNDCGSNKKLNADTYMQDIALSLEENKSNKGIPNEW-----DPRDLL 260

Query: 992  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 813
              +D+     E      SKE   +       E     S  +S D     + +QS E+   
Sbjct: 261  VTRDMKDDATEMMSNEGSKELFILGDILSFPELTTLKSEAMSPDFKSDRNEQQSFENSSK 320

Query: 812  SE-----DGESANSL------------------------RPSTAVQEDESTNSNQVGEGE 720
             E     + E +N+L                         P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASEVEDSNNLILSAPALASTAEGSDSGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 719  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 540
              G++T +S SS P +  G  E   T ++             LEE +    S +  +Q G
Sbjct: 379  -TGSITFDSRSSAPTSGKGSSEPLETGRTS-----------KLEETADQPFSSN--LQSG 424

Query: 539  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 360
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPITYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 359  PDRRHLRKHRGWKLCFPCCRY 297
             D+R  R+HRGW+  F CCR+
Sbjct: 485  ADQRQYRRHRGWRQGFLCCRF 505


>XP_016675638.1 PREDICTED: uncharacterized protein LOC107894985 [Gossypium hirsutum]
            XP_016675640.1 PREDICTED: uncharacterized protein
            LOC107894985 [Gossypium hirsutum] XP_016675641.1
            PREDICTED: uncharacterized protein LOC107894985
            [Gossypium hirsutum]
          Length = 505

 Score =  142 bits (357), Expect = 3e-32
 Identities = 122/441 (27%), Positives = 186/441 (42%), Gaps = 34/441 (7%)
 Frame = -3

Query: 1517 DLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVCFKEG 1338
            D S+S    +    K    I  F   S  +  S ++   Y DK+V +CELPEL+VC+KE 
Sbjct: 87   DCSNSVHDFSNGNEKEVRDIVTFNSHSSKNMDSFQDSVFYLDKSVMDCELPELVVCYKES 146

Query: 1337 SYSIIKDICIDEGLPSVDKTFRENGVVPDKEL---FTCLKSSNELVKDIGHTTEPNIDCQ 1167
            +Y ++KDICIDEG+P+ D    E+ V    E    +      NEL+K++  T  P  +  
Sbjct: 147  TYHVVKDICIDEGVPTQDMFLFESSVDEKSECNFSYPKKDQDNELMKEMSETDIPMQNIS 206

Query: 1166 FEMDGANQCVSELDE--GNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQ 993
            F ++  NQ   ++D   G+ K  + +    D+ +          I + +     D  D+ 
Sbjct: 207  FSLE-ENQSGKDIDNDCGSNKKLNADTHMQDIALSLEENKSNKGIPNEW-----DPRDLL 260

Query: 992  HNKDLPFVEREESLTTSSKEKDRVESEFPIQECDNNNSSRVSCDTDGSASRRQSNESQDM 813
              +D+     E      SKE   +     + E     S  +S D     + +QS E+   
Sbjct: 261  VTRDMKDDAMEMMSNEGSKELFILGDILSLPELTTLKSEAMSPDCKSDRNEQQSFENSSK 320

Query: 812  SE-----DGESANSL------------------------RPSTAVQEDESTNSNQVGEGE 720
             E     + E +N+L                         P+ A    E+T+S  V E  
Sbjct: 321  KEVIVASEVEDSNNLILSAPALASTAEGSDSGKGEATPISPAPASASLEATSSGLVNE-- 378

Query: 719  TAGTVTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHG 540
              G++T +S SS   TTSG+              + T     LEE +    S +  +Q G
Sbjct: 379  -TGSITFDSRSS--ATTSGKGS---------SEPLETGRTSKLEETADQPFSSN--LQSG 424

Query: 539  HGEXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVK 360
            +GE            ++Y+ P                   SFAFP+L SEWNSSPV+M K
Sbjct: 425  NGESSFSAAGPLTGLISYSGPITYSGNLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAK 484

Query: 359  PDRRHLRKHRGWKLCFPCCRY 297
             D+R  R+HRGW+  F CCR+
Sbjct: 485  ADQRQYRRHRGWRQGFLCCRF 505


>XP_016900313.1 PREDICTED: uncharacterized protein LOC103489197 isoform X3 [Cucumis
            melo]
          Length = 445

 Score =  137 bits (345), Expect = 5e-31
 Identities = 132/439 (30%), Positives = 197/439 (44%), Gaps = 25/439 (5%)
 Frame = -3

Query: 1538 RKVPGLDDLSDSEDSVNVAENKSANSINPFLDP---SCDDDLSQKEMELYTDKTVTECEL 1368
            R+   LDD +D +D            +  F+ P   SC  DLS+++ ELY +K++ EC+L
Sbjct: 68   RECLDLDDFNDYDD------------VKAFVSPLNNSCKVDLSEEDSELYMEKSIVECQL 115

Query: 1367 PELIVCFKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTT 1188
            PELIVC+KE   +I+KDICID+G P  DK F             C  S +E  +D+    
Sbjct: 116  PELIVCYKENICNIVKDICIDDGTPR-DKLF-------------CGSSLDE--EDVCSIN 159

Query: 1187 EPNIDCQFEMDGANQCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNID 1008
             P        D  ++ V EL + ++ A D++  S+    +          DSP +  + D
Sbjct: 160  PPT------KDWKDESVGELKQRDMFASDDSEHSESFGSK----------DSPNQCDSKD 203

Query: 1007 -------GDDVQH--NKDLPFVER-EESL--TTSSKEKDRVESEFPIQECDNNNSSRVS- 867
                     DV +  + D+P  +   ESL   T +K K   +SE   Q C     S V  
Sbjct: 204  LASTPEAEYDVAYFTDNDMPMTDLVTESLKPLTDNKIKPHPQSE---QVCIETTCSEVPV 260

Query: 866  ----CDTDGSASRRQSNESQDMSED---GESANSLRPSTAVQEDESTNSNQVGEGETAGT 708
                 D     +R  ++ES   +ED    +SAN+   S +V   E+T+SN +   + +  
Sbjct: 261  LAHVADESFGNTRETTSESITSAEDPKNSDSANAPSTSASVGCKETTSSNPLASADKSEP 320

Query: 707  VTLNSDSSPPPTTSGREEDPNTQKSEFQ--RAIHTVNILGLEEDSQTASSRSFFIQHGHG 534
               N+ S+P      R E  +  + E++  R     N      DS T SS    +Q G G
Sbjct: 321  QCHNTSSNPK-----RVEYEDLPRVEYEDIRKTEVGNF-----DSHTVSSE---VQQGVG 367

Query: 533  EXXXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPD 354
            E            ++ +                     SFAFP+L +EWNSSPV+M KPD
Sbjct: 368  E-TSFSVAPLGSLMSNSGRIGYSGSISHRSDSSTTSTRSFAFPILQTEWNSSPVRMAKPD 426

Query: 353  RRHLRKHRGWKLCFPCCRY 297
            R+HL+KHRGW+    CCR+
Sbjct: 427  RKHLQKHRGWRHGILCCRF 445


>XP_006573172.1 PREDICTED: uncharacterized protein LOC100796112 [Glycine max]
            XP_006573173.1 PREDICTED: uncharacterized protein
            LOC100796112 [Glycine max] XP_006573174.1 PREDICTED:
            uncharacterized protein LOC100796112 [Glycine max]
            XP_003516473.2 PREDICTED: uncharacterized protein
            LOC100796112 [Glycine max] XP_014629246.1 PREDICTED:
            uncharacterized protein LOC100796112 [Glycine max]
            XP_014629249.1 PREDICTED: uncharacterized protein
            LOC100796112 [Glycine max] KHN44807.1 hypothetical
            protein glysoja_038982 [Glycine soja] KRH75138.1
            hypothetical protein GLYMA_01G064900 [Glycine max]
            KRH75139.1 hypothetical protein GLYMA_01G064900 [Glycine
            max] KRH75140.1 hypothetical protein GLYMA_01G064900
            [Glycine max] KRH75141.1 hypothetical protein
            GLYMA_01G064900 [Glycine max] KRH75142.1 hypothetical
            protein GLYMA_01G064900 [Glycine max] KRH75143.1
            hypothetical protein GLYMA_01G064900 [Glycine max]
            KRH75144.1 hypothetical protein GLYMA_01G064900 [Glycine
            max]
          Length = 517

 Score =  134 bits (336), Expect = 2e-29
 Identities = 135/498 (27%), Positives = 203/498 (40%), Gaps = 27/498 (5%)
 Frame = -3

Query: 1709 LYTEEQHAEDLVAVVDSINCVGNKSGNFTNPFLDPLTDEILSGKETELYTEEHGNSFRKV 1530
            L   EQ  +     ++S +C+ N+        ++P++    S K+ E        SF K 
Sbjct: 54   LENNEQGLDSSQYNMESADCMKNEYEAKVKDIVEPVSH---SSKDME--------SFMKF 102

Query: 1529 PGLDDLSDSEDSVNVAENKSANSINPFLDPSCDDDLSQKEMELYTDKTVTECELPELIVC 1350
            P            N  E+   +  +P  +P+   DL +  ++ Y DKTVTECE P L VC
Sbjct: 103  P------------NDVESVKRSLTSPISNPAEGRDLPRNSVDGYMDKTVTECE-PHLEVC 149

Query: 1349 FKEGSYSIIKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDC 1170
            +KE +Y ++KDIC+DEG+ + DK    N V      F   +S     K   +T+   +  
Sbjct: 150  YKESNYHVVKDICVDEGVLNKDKVMFVNTVDEKAHNFFHSESYENKEKQKDNTSIKALSL 209

Query: 1169 Q-FEMDGANQCVSELDEGNVKARDE------NMVSDDLKVQTRFLVEGCNIDSPYESCNI 1011
               E    N  +SE  E   K +D       ++   + K    F  E         S N+
Sbjct: 210  TPTEEKAHNFFLSESYENKEKQKDNISINVLSLTPTEEKAHNFFPSESKEKQKDNTSINV 269

Query: 1010 -------DGDDVQHNKDLPFVEREESLTTSSKEKDRVESEF-PIQEC-----DNNNSSRV 870
                   + D+V  N D P     +    + K    V  E  P+ E      D      V
Sbjct: 270  LSLTPTEESDEVHANHDQPKGLMHKDGDATEKISGNVNKEMKPLPEDKVLLQDLLTEDSV 329

Query: 869  SCDTDGSASRRQSNESQDMSEDGESANSLR------PSTAVQEDESTNSNQVGEGETAGT 708
            S D  G    + SNE +  S+   S N++       PS A+ +DES N N + E E++  
Sbjct: 330  SSDDKGE---QISNEPELHSQSEGSKNTVEEAILESPSLALADDESNNDNMLSEKESS-- 384

Query: 707  VTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHGHGEX 528
             T   D S P +  G+EE       +      T+  +  + D Q  +     I H  GE 
Sbjct: 385  -THQLDPSRP-SDCGKEECHQAGVCKCDEIQQTMKPVEGKSDDQAVTGH---IHHSLGEA 439

Query: 527  XXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRR 348
                       ++Y+ P                   SFAFP++ SEWNSSPV+M K DR+
Sbjct: 440  SFSSIGPMSGRISYSGPVPYSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRK 499

Query: 347  HLRKHRG-WKLCFPCCRY 297
            H RK R  W+  F CC++
Sbjct: 500  HFRKQRWCWRDGFLCCKF 517


>XP_016667087.1 PREDICTED: uncharacterized protein LOC107887384 isoform X2 [Gossypium
            hirsutum] XP_016667088.1 PREDICTED: uncharacterized
            protein LOC107887384 isoform X2 [Gossypium hirsutum]
          Length = 464

 Score =  132 bits (333), Expect = 2e-29
 Identities = 123/437 (28%), Positives = 190/437 (43%), Gaps = 37/437 (8%)
 Frame = -3

Query: 1496 SVNVAENKSANSINPFLDP---SCDDDLSQKEMELYTDKTVTECELPELIVCFKEGSYSI 1326
            SVN   N +      F+ P   S  +  S ++   Y DK+V EC LPEL+VC+KE +Y +
Sbjct: 38   SVNDFSNGNEKEARDFVPPNSHSLKNMGSFQDSVFYLDKSVMECALPELVVCYKESAYHV 97

Query: 1325 IKDICIDEGLPSVDKTFRENGVVPDKELFTCLKSSNELVKDIGHTTEPNIDCQFEMDGAN 1146
            +KDICIDEG+P+ DK   ++GV  DK+       S E         +P  D   +     
Sbjct: 98   VKDICIDEGVPTQDKFLFDSGV--DKKSDCNFLPSEEDQDSKLLKEKPESDISMQAGSMY 155

Query: 1145 QCVSELDEGNVKARDENMVSDDLKVQTRFLVEGCNIDSPYESCNIDGDDVQHNKDLPFVE 966
               +++D+ N +  ++  +SD         +E     +   S   D +D+  ++ +    
Sbjct: 156  PEENQMDKDNERDSNKKTISDKYTQDISLSLEENEPKNRIPS-QCDTEDLILSRKMMDDT 214

Query: 965  REESLTTSSKEKDRVESEFPIQECDNNNSSRVS--CDTDGSASR--RQSNESQDM----- 813
             + +    SKE   +     + E        +S  C +DG   +  + S E + M     
Sbjct: 215  MKMARDDVSKELFTLGELLSMPEFSTVKPEALSSHCTSDGIKQQCFQNSKEKEVMVMPPL 274

Query: 812  -SEDGESANSLR--------PSTAVQEDES-----------TNSNQVGEGE-----TAGT 708
             S D ES NS +        P +  +E +S           T+S+ V E        A +
Sbjct: 275  VSADKESNNSCKETILSASAPVSVAEEMDSVKGEATMFSPATSSSLVNEVSDDSKLAARS 334

Query: 707  VTLNSDSSPPPTTSGREEDPNTQKSEFQRAIHTVNILGLEEDSQTASSRSFFIQHGHGEX 528
            +    DSS    TS ++E  +    E   A+ T +   LE+ +   SS +  +Q G+GE 
Sbjct: 335  IAFGFDSS--ALTSSKDEGCHNLDRE---ALETGHTPKLEDIADQPSSNN--LQCGNGES 387

Query: 527  XXXXXXXXXXPLAYTDPAXXXXXXXXXXXXXXXXXXSFAFPVLHSEWNSSPVKMVKPDRR 348
                       ++Y+ P                   SFAFP+L SEWNSSPV+M K DRR
Sbjct: 388  SFSAAGLVTGLISYSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRR 447

Query: 347  HLRKHRGWKLCFPCCRY 297
            H RKHRGW+    CCR+
Sbjct: 448  HYRKHRGWRQGLLCCRF 464

