BLASTX nr result

ID: Chrysanthemum22_contig00033785 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00033785
         (1085 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OTG04692.1| hypothetical protein HannXRQ_Chr12g0365221 [Helia...   273   1e-86
ref|XP_023762037.1| uncharacterized protein LOC111910438 [Lactuc...   273   2e-86
gb|OTG17667.1| hypothetical protein HannXRQ_Chr08g0214641 [Helia...   263   2e-82
ref|XP_023771694.1| uncharacterized protein LOC111920357 [Lactuc...   240   2e-73
gb|KVH96319.1| hypothetical protein Ccrd_001594 [Cynara carduncu...   233   8e-71
ref|XP_022022681.1| uncharacterized protein LOC110922792 [Helian...   227   1e-68
ref|XP_022014075.1| uncharacterized protein LOC110913562 [Helian...   226   4e-68
ref|XP_022863961.1| uncharacterized protein LOC111383992 [Olea e...   153   5e-40
emb|CDP00526.1| unnamed protein product [Coffea canephora]            151   1e-39
ref|XP_017221798.1| PREDICTED: uncharacterized protein LOC108198...   144   2e-36
gb|KZV34480.1| hypothetical protein F511_30016 [Dorcoceras hygro...   137   8e-34
ref|XP_019238648.1| PREDICTED: uncharacterized protein LOC109218...   133   3e-32
ref|XP_016474350.1| PREDICTED: uncharacterized protein LOC107796...   133   3e-32
gb|PIN25922.1| hypothetical protein CDL12_01336 [Handroanthus im...   133   5e-32
ref|XP_002514530.1| PREDICTED: uncharacterized protein LOC826452...   133   1e-31
ref|XP_011078544.1| uncharacterized protein LOC105162245 [Sesamu...   132   1e-31
ref|XP_021616013.1| uncharacterized protein LOC110617498 isoform...   129   3e-30
ref|XP_012081985.1| uncharacterized protein LOC105641938 [Jatrop...   128   4e-30
dbj|GAU48521.1| hypothetical protein TSUD_242990 [Trifolium subt...   127   9e-30
ref|XP_012478840.1| PREDICTED: uncharacterized protein LOC105794...   127   2e-29

>gb|OTG04692.1| hypothetical protein HannXRQ_Chr12g0365221 [Helianthus annuus]
          Length = 317

 Score =  273 bits (699), Expect = 1e-86
 Identities = 162/323 (50%), Positives = 205/323 (63%), Gaps = 19/323 (5%)
 Frame = +1

Query: 169  KNRKMKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSD 348
            K + +KTEQ+NT+F+SN  +           C+ HP  +S VGIC YCL +KL+ L C +
Sbjct: 4    KGKSVKTEQENTKFFSNSNYFPSSSDLP---CRKHPSNSSSVGICAYCLNEKLIELVCVE 60

Query: 349  CGEQRITPCSCSGSEFSSYRNSYSTLDVGSVG---RISFLIDNDKGCTNGDDTKSIFSNM 519
            CGEQR+  CSCS S+  SYRNS  T+DVGSVG   R+SFLI+N+K  +N D+ K++FS+M
Sbjct: 61   CGEQRL--CSCSCSDLDSYRNSSCTVDVGSVGSVGRMSFLIENEK-VSNADEPKTLFSHM 117

Query: 520  LKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKGEID-----GXXXX 684
             KLKQSE   T+DVVLLKRSNSCVVEVKK+ G WRI K FK  R+K         G    
Sbjct: 118  -KLKQSE---TEDVVLLKRSNSCVVEVKKSNGFWRIGKLFKKKREKDGFSERNRGGLDQK 173

Query: 685  XXXXFMD-------SNINHEVSSMDCSSAKVSNSKELESRKSD----LMDFENGVVVKDV 831
                 +D       S+ +HEV S+ C SAK S+  E E RKS     L+DF+NG  VK+ 
Sbjct: 174  SEDYVIDVSRSRSLSSFHHEVGSIACLSAKNSDFNEFELRKSGFKGGLVDFKNGFRVKES 233

Query: 832  NLCKMXXXXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDLVPSGGMGSSSCRFMVNERGI 1011
            +  ++                  +SKTE+SV K+ D+++   SGG+GSSSCR MVNERGI
Sbjct: 234  DFSRIDDDDEFIDLKIDLS---NRSKTEHSVLKKYDSLE---SGGVGSSSCRIMVNERGI 287

Query: 1012 KKVKNNHMKAWKWIFKHHSGKKD 1080
            KKVKNNHMKAWKWIF HHSGK D
Sbjct: 288  KKVKNNHMKAWKWIFNHHSGKND 310


>ref|XP_023762037.1| uncharacterized protein LOC111910438 [Lactuca sativa]
 gb|PLY86879.1| hypothetical protein LSAT_8X37860 [Lactuca sativa]
          Length = 330

 Score =  273 bits (699), Expect = 2e-86
 Identities = 155/331 (46%), Positives = 199/331 (60%), Gaps = 26/331 (7%)
 Frame = +1

Query: 169  KNRKMKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSD 348
            + + +K+E +NT F+S+  +           CK HPF +S VG+CPYCLK+KLMNL CSD
Sbjct: 4    RGKSVKSEHENTNFFSSSNYFSSSLDLP---CKKHPFNSSSVGVCPYCLKEKLMNLVCSD 60

Query: 349  CGEQRITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKL 528
            CGEQR++ CSCS  + SSYRNS  ++ VGSVGR+SF I+N+KGC NGD+ K++ S+M ++
Sbjct: 61   CGEQRLSSCSCS--DVSSYRNSSCSMGVGSVGRLSFFIENEKGC-NGDEKKTLLSHMNQI 117

Query: 529  KQSEQSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKGE--------------- 663
               E ++T DVV LKRS+SCVVEVKK+   WRI K FK  R+K                 
Sbjct: 118  SMIE-TETDDVVFLKRSSSCVVEVKKSNSFWRIGKIFKKKREKERCSERNNRGGFDHARE 176

Query: 664  ---IDGXXXXXXXXFMDSNINHEVSSMDCSSAKVSNSKELESRKSDL----MDFENGVVV 822
               +D         FMD N  HEV S+ CS+AKVS+  + ESR S      +DFE+G  V
Sbjct: 177  VCVMDVSRSRSLSSFMDGNFGHEVGSVACSTAKVSDFNQSESRMSGFRGGSIDFESGFSV 236

Query: 823  KDVNLCKMXXXXXXXXXXXXXXXXX----GKSKTEYSVFKECDTMDLVPSGGMGSSSCRF 990
            KD +  +M                      KS  + SVFK+ D  +L    GMGSSSCR 
Sbjct: 237  KDSDFIRMDDDDDDDDDDSEFIDLKIDLSDKSTKDESVFKKYDPPELTCGDGMGSSSCRV 296

Query: 991  MVNERGIKKVKNNHMKAWKWIFKHHSGKKDL 1083
             +NER I+KVKNNH KAWK IFKHHSGKKDL
Sbjct: 297  TLNEREIEKVKNNHTKAWKGIFKHHSGKKDL 327


>gb|OTG17667.1| hypothetical protein HannXRQ_Chr08g0214641 [Helianthus annuus]
 gb|OTG17669.1| hypothetical protein HannXRQ_Chr08g0214671 [Helianthus annuus]
          Length = 310

 Score =  263 bits (671), Expect = 2e-82
 Identities = 151/319 (47%), Positives = 195/319 (61%), Gaps = 13/319 (4%)
 Frame = +1

Query: 163  ETKNRKMKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTC 342
            + + + +KTEQ    F SN           N SCK HP  +S+VGIC YCL +KL+ L C
Sbjct: 2    KNRGKSVKTEQVKPDFASNSNKFSQSSSSTNLSCKKHPKNSSQVGICSYCLSEKLIKLVC 61

Query: 343  SDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNML 522
            SDCGEQRI+ CSCS S FSSYRNS  T+DVGSVGRISFLI+N+KG +NGD+ K++FS+ L
Sbjct: 62   SDCGEQRISSCSCSCSGFSSYRNSSCTMDVGSVGRISFLIENEKG-SNGDEPKTLFSH-L 119

Query: 523  KLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKGE------------I 666
            K+KQ E   T+DVVL KRS+SCVVEVKK  G W+I K+FK  R+K               
Sbjct: 120  KMKQGE---TEDVVLFKRSSSCVVEVKKTNGFWKIGKYFKKKREKEGSSERSRVGSDQIS 176

Query: 667  DGXXXXXXXXFMDSNINHEVSSMDCSSAKVSNSKELESRKSDLMDFENGVVVKDVNLCKM 846
            D         FM    +HE  ++ CSSAK+S+  E+E+RK+    F+ G++  D +    
Sbjct: 177  DVSRSRSLSSFMGDKFHHETCNVACSSAKISDFSEIEARKN---GFKGGLMDDDED---- 229

Query: 847  XXXXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDLV-PSGGMGSSSCRFMVNERGIKKVK 1023
                              KSKTE+SVFK  D +DL    GG+GSSSCR  VN+RGIKKVK
Sbjct: 230  ------SEFIDLKIDLLEKSKTEHSVFKMYDQLDLTGDCGGIGSSSCRITVNDRGIKKVK 283

Query: 1024 NNHMKAWKWIFKHHSGKKD 1080
            N+++K+W W F+ HS   D
Sbjct: 284  NSNVKSWNWNFRDHSRNND 302


>ref|XP_023771694.1| uncharacterized protein LOC111920357 [Lactuca sativa]
 gb|PLY79406.1| hypothetical protein LSAT_3X58841 [Lactuca sativa]
          Length = 327

 Score =  240 bits (612), Expect = 2e-73
 Identities = 149/306 (48%), Positives = 181/306 (59%), Gaps = 32/306 (10%)
 Frame = +1

Query: 262  CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
            CK HP  +S VGIC YCLKD+LM L CSDCGEQR++ CSCS  + SSYRNS  T+DVGS+
Sbjct: 33   CKKHPS-SSPVGICAYCLKDRLMKLVCSDCGEQRLSSCSCS--DVSSYRNSSCTVDVGSI 89

Query: 442  GRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSW 621
            GRISFLI+N+KG  +GD+ K++FS+M   KQ+++ +T+DV+LLKRSNSCVVEVKK+ G W
Sbjct: 90   GRISFLIENEKG-GSGDEQKTLFSHM---KQTKKRETEDVILLKRSNSCVVEVKKSNGFW 145

Query: 622  RICKFFKNMRK-------KGEI-------DGXXXXXXXXFMDSNINHE---VSSMDCSSA 750
            RI K FK  ++       K EI       D         F   N +HE   VS M  SSA
Sbjct: 146  RIGKLFKKKKREKDGFDEKSEIWVTDCAMDVSRSRSLCSFRGGNFDHEGGSVSDMAYSSA 205

Query: 751  KVSNSKELESRKSDL----MDFENGVVVKDVNLCKMXXXXXXXXXXXXXXXXXGKSKTEY 918
            K+S+  E E RKS      MDFE+G   K+    ++                  KSKTE+
Sbjct: 206  KISDFNESEPRKSGFRGGFMDFESGFSAKESEFSRIHDDSGFIDLKLDLSD---KSKTEH 262

Query: 919  SVFKECDTMDLVPSGGMG-----------SSSCRFMVNERGIKKVKNNHMKAWKWIFKHH 1065
            SVFK        PS G G           SSSCR  VN+RGIKK    H K WKWIFK H
Sbjct: 263  SVFKN-------PSDGSGGGGGCGGGGGVSSSCRITVNDRGIKKGSKGHSKVWKWIFKQH 315

Query: 1066 SGKKDL 1083
            SGKKDL
Sbjct: 316  SGKKDL 321


>gb|KVH96319.1| hypothetical protein Ccrd_001594 [Cynara cardunculus var. scolymus]
          Length = 331

 Score =  233 bits (595), Expect = 8e-71
 Identities = 144/303 (47%), Positives = 177/303 (58%), Gaps = 29/303 (9%)
 Frame = +1

Query: 262  CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
            CK HP  +S VGIC YCLKD+LM L CSDCGEQR++ CSCS  + SSYRNS  T+DVGSV
Sbjct: 33   CKKHPS-SSPVGICAYCLKDRLMKLVCSDCGEQRLSSCSCS--DVSSYRNSSCTVDVGSV 89

Query: 442  GRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSW 621
            GRISFLI+N+KG +  +  K++FS++ ++K+ E   T+DV+LLKRSNSCVVEVKK+ G W
Sbjct: 90   GRISFLIENEKGGSGDEQQKTLFSHIKQIKKRE---TEDVILLKRSNSCVVEVKKSNGFW 146

Query: 622  RICKFFKNMRKK---------GEIDGXXXXXXXXFMD-------------SNINHE---V 726
            RI K FK  R+K         G  D          MD             +N +HE   V
Sbjct: 147  RIGKLFKKKREKEGCRERNRDGFDDKSEIWVSDCVMDVSRSRSLCSFRGGANFDHEGGSV 206

Query: 727  SSMDCSSAKVSNSKELESRKSD----LMDFENGVVVKDVNLCKMXXXXXXXXXXXXXXXX 894
            S M  SSAK+S+  E E RKS     LMDFE+G   K+    ++                
Sbjct: 207  SDMAYSSAKISDFNESEPRKSGFRGGLMDFESGFAAKESEFSRIHDDSRFIDLKLDLSD- 265

Query: 895  XGKSKTEYSVFKECDTMDLVPSGGMGSSSCRFMVNERGIKKVKNNHMKAWKWIFKHHSGK 1074
              +SK E+ VFK          GG GSSSCR  VN+RGIKK    H K WKWIFK HSGK
Sbjct: 266  --ESKPEHPVFKNPPDGG-GGGGGGGSSSCRITVNDRGIKKGSKGHSKVWKWIFKQHSGK 322

Query: 1075 KDL 1083
            KD+
Sbjct: 323  KDM 325


>ref|XP_022022681.1| uncharacterized protein LOC110922792 [Helianthus annuus]
 gb|OTF86468.1| hypothetical protein HannXRQ_Chr17g0551111 [Helianthus annuus]
 gb|OTF86504.1| hypothetical protein HannXRQ_Chr17g0551511 [Helianthus annuus]
          Length = 322

 Score =  227 bits (579), Expect = 1e-68
 Identities = 143/327 (43%), Positives = 187/327 (57%), Gaps = 22/327 (6%)
 Frame = +1

Query: 169  KNRKMKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSD 348
            + + ++++Q+   F S+  +           CK HP  T +VGIC YCLKD+LM L CS+
Sbjct: 4    RGKSVESDQEYNNFNSDY-NFYYSSSSSGVQCKKHPSST-QVGICAYCLKDRLMKLVCSE 61

Query: 349  CGEQRITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKL 528
            CGEQR++ CSCS  + SSYRNS  T+DVGSVGRISFLI+N+K   +GD+ K++FS++   
Sbjct: 62   CGEQRLSSCSCS--DVSSYRNSSCTVDVGSVGRISFLIENEKA-GSGDEQKTLFSHV--- 115

Query: 529  KQSEQSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKG--------------EI 666
            KQS++ +T+DV +LKRSNSCVVEVKK+ G WRI K FK  R+K                +
Sbjct: 116  KQSKKVETEDVFMLKRSNSCVVEVKKSNGFWRIGKLFKKKREKEGFRERNHPDWVSECGM 175

Query: 667  DGXXXXXXXXFMDSNINHE---VSSMDCSSAKVSNSKELESRKS----DLMDFENGVVVK 825
            D         +   N +H+   VS M  SSAK+S+  E E RKS     LMDFE G   K
Sbjct: 176  DVSRCRSLCSYRGGNFDHDGGSVSDMRLSSAKISDFNESEPRKSGFRGGLMDFETGFSAK 235

Query: 826  DVNLCKMXXXXXXXXXXXXXXXXXGKSKTEYSVFKE-CDTMDLVPSGGMGSSSCRFMVNE 1002
            +    ++                  +SKT+YSVFK   D      +G  GSSSCR  +NE
Sbjct: 236  ESEFSRI---HDDSSFIDLKLDLSDRSKTDYSVFKNPSDVGGCAGAGDGGSSSCRITINE 292

Query: 1003 RGIKKVKNNHMKAWKWIFKHHSGKKDL 1083
            RGIKK    H K WKWIFK    KKDL
Sbjct: 293  RGIKKGSKGHSKVWKWIFKQ---KKDL 316


>ref|XP_022014075.1| uncharacterized protein LOC110913562 [Helianthus annuus]
          Length = 335

 Score =  226 bits (577), Expect = 4e-68
 Identities = 147/345 (42%), Positives = 185/345 (53%), Gaps = 40/345 (11%)
 Frame = +1

Query: 169  KNRKMKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSD 348
            + + +++E + T FY +           N  CK HP  +S VGIC YCLKD+LM L CSD
Sbjct: 4    RGKSVESETEYTNFYQDYNFYNSSSS--NIPCKKHPS-SSPVGICAYCLKDRLMKLVCSD 60

Query: 349  CGEQRITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKL 528
            CGEQR++ CSCS  + SSYRNS  T+DVGSVGRISFLI+ND       D +S+F      
Sbjct: 61   CGEQRLSSCSCS--DVSSYRNSSCTVDVGSVGRISFLIEND-------DQRSLFD----F 107

Query: 529  KQSEQSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKGE--------------I 666
            K+  + +T+DV++ KRSNSCVVEVKK+ G WRI K FK  R+K E              +
Sbjct: 108  KKQSKKETEDVLMFKRSNSCVVEVKKSHGFWRIGKLFKKRREKEECRERNSEIWVNDCGM 167

Query: 667  DGXXXXXXXXFMDSNINHE---VSSMDCSSAKVSNSKELESRKSD----LMDFENGVVVK 825
            D         F     +HE   VS M  SSAK+S+  E E RKS     LMDFE+G   K
Sbjct: 168  DVSRSRSLCSFRGGGFDHEGGSVSDMAFSSAKISDFNESEPRKSGFRGGLMDFEHGFSAK 227

Query: 826  DVNLCKMXXXXXXXXXXXXXXXXXGKSKTEYSVFKECD------TMDLVPSGGMG----- 972
            +    ++                  +SKTEYSVF + +       +++   GG G     
Sbjct: 228  ESEFSRIHDDSSFIDLKLDLSD---RSKTEYSVFNKTEYPVFKSPLEVGGCGGGGGSGGG 284

Query: 973  --------SSSCRFMVNERGIKKVKNNHMKAWKWIFKHHSGKKDL 1083
                    SSSCR  VNERGIKK    H K WKWIFK HSGKKDL
Sbjct: 285  VGGGGLLSSSSCRITVNERGIKKGSKGHSKVWKWIFKQHSGKKDL 329


>ref|XP_022863961.1| uncharacterized protein LOC111383992 [Olea europaea var. sylvestris]
          Length = 327

 Score =  153 bits (387), Expect = 5e-40
 Identities = 113/330 (34%), Positives = 155/330 (46%), Gaps = 34/330 (10%)
 Frame = +1

Query: 181  MKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQ 360
            MK   K    Y+N   +          C+ HP  +S VGIC YCLK+KL+ L C++CGEQ
Sbjct: 1    MKERGKAVEIYNN--DLDFNYSASEMPCRKHPS-SSSVGICAYCLKEKLVKLVCTECGEQ 57

Query: 361  RITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSE 540
            R++ CSC  S+  SYRNS S ++VGSVGRISFLI+N+K    G+       N+   ++ E
Sbjct: 58   RLSSCSC--SDIISYRNSCSAMEVGSVGRISFLIENEK---TGE-----LKNLKAKRKGE 107

Query: 541  QSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFF------KNMRKKGEIDGXXXXXXXXFM 702
            + K+++V+LL+RS+S  VEVKK+ G W+I + F      ++  K GE +           
Sbjct: 108  EEKSEEVILLRRSSSSCVEVKKSNGFWKIKRLFRKKKSKRDFEKNGEFESFDDKSETWVS 167

Query: 703  D----------SNINHEVSSMDCSSAKVSN-------SKELESRKSDL----------MD 801
            D           +   E S    SSAK+S+         E E RKS             D
Sbjct: 168  DIMGVSRSRSLCSFRDEASDYAFSSAKISDVTSGVFMDSESEPRKSGFEPRKSGFRGGFD 227

Query: 802  FENGVVVKDVNLCKMXXXXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDL-VPSGGMGSS 978
             E G   + V   K                    S    S FK+ D   + + +      
Sbjct: 228  AEIGTAKRGVYPVK-ESDFSAMDESAFIDLNLDLSADPKSDFKKSDQSGISLANMRSNGG 286

Query: 979  SCRFMVNERGIKKVKNNHMKAWKWIFKHHS 1068
            SCR  VNERG+KK    H K WKWIF+H S
Sbjct: 287  SCRITVNERGLKKGSKGH-KVWKWIFRHQS 315


>emb|CDP00526.1| unnamed protein product [Coffea canephora]
          Length = 274

 Score =  151 bits (381), Expect = 1e-39
 Identities = 103/284 (36%), Positives = 142/284 (50%), Gaps = 12/284 (4%)
 Frame = +1

Query: 262  CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
            CK HP  +S VGIC YCLKD+L+ L CS+CGEQR++ CSC  S+ SSYRNS  T DVGSV
Sbjct: 29   CKKHPS-SSSVGICAYCLKDRLVKLVCSECGEQRLSSCSC--SDISSYRNSSCTADVGSV 85

Query: 442  GRISFLIDNDKGCTNGDDTKSIFSNMLKL--KQSEQSKTQDVVLLKRSNSCVVEVKKNKG 615
            GRISFLIDN+K            + + KL  K+ +  K ++V+LL+RS+S  VEVK+ KG
Sbjct: 86   GRISFLIDNEK------------TELQKLHQKRKKGDKPEEVILLRRSSSSCVEVKRYKG 133

Query: 616  SWRICKFFKNMRKKG-EIDGXXXXXXXXFMDSNINHEVSSMDCSSAKVSNSKE--LESRK 786
             W+I + F+  ++KG E DG                     D    + S  +      ++
Sbjct: 134  FWKIKRLFRKKKQKGCEKDG-------------------QFDDKKPRKSGFRGFIFPVKE 174

Query: 787  SDLMDFENGVVVKDVNLCKMXXXXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDLVPSGG 966
            SD    ++   + D+ L                     +SK E   F+  D  D   +GG
Sbjct: 175  SDFSAMDDSAFI-DLKL-----------------DLSSESKPELPAFRTSDASDHGATGG 216

Query: 967  M-------GSSSCRFMVNERGIKKVKNNHMKAWKWIFKHHSGKK 1077
            +          SCR  VN+RG  K   N  K WKW  K +SG++
Sbjct: 217  LRGETFPCNGGSCRIAVNDRGGLKRGTNSYKVWKWFSKQYSGRR 260


>ref|XP_017221798.1| PREDICTED: uncharacterized protein LOC108198556 [Daucus carota subsp.
            sativus]
 gb|KZM85879.1| hypothetical protein DCAR_026699 [Daucus carota subsp. sativus]
          Length = 353

 Score =  144 bits (364), Expect = 2e-36
 Identities = 113/330 (34%), Positives = 150/330 (45%), Gaps = 62/330 (18%)
 Frame = +1

Query: 262  CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
            CK HP  TS VGICPYCLK++L+ L CSDCGEQR++ CSCS  + SSYRNS ST+++GSV
Sbjct: 26   CKKHPSSTS-VGICPYCLKERLIKLVCSDCGEQRLSSCSCS--DVSSYRNSCSTMEIGSV 82

Query: 442  GRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSW 621
            GRISFLI+N+K            ++ LK ++S+     +V++LKRSNS   EVK++   W
Sbjct: 83   GRISFLIENEK------------TDQLKAEKSDD----EVIMLKRSNSTCTEVKRSHKFW 126

Query: 622  RICKFFKNMRKK------------GEIDGXXXXXXXXFMDSNI-----NHEVSSMDCSSA 750
            +  K F+  R+K             +  G            N      +HE S    SSA
Sbjct: 127  KFGKLFRKKREKQSGVCEKSDMCVSDYMGVSRSRSLCSFRGNHFLHDPDHESSDFAFSSA 186

Query: 751  KVSNSKELESRKS----DLMDFENGVV-----------------VKDVNLCKMXXXXXXX 867
            K+S+  E E R+S     LMD E+  +                  +D NL K        
Sbjct: 187  KISDFNESEPRRSGFSKGLMDVESAKISDFSEPRKSGFSRGLLEPEDFNLKKCVFPESEF 246

Query: 868  XXXXXXXXXXGK------------SKTEYSVFKECDTMDLVPSGGM------------GS 975
                       K            +KTEYSV K  +        G               
Sbjct: 247  SGMDDSRFIDLKLDLSSSSEPKTEAKTEYSVSKMSEFPISSSDNGRKFGNLKENKMAGHG 306

Query: 976  SSCRFMVNERGIKKVKNNHMKAWKWIFKHH 1065
             SCR  VNE+ I K  +   K W+WIF HH
Sbjct: 307  GSCRITVNEKRINK-GSKGQKVWRWIFSHH 335


>gb|KZV34480.1| hypothetical protein F511_30016 [Dorcoceras hygrometricum]
          Length = 320

 Score =  137 bits (344), Expect = 8e-34
 Identities = 95/298 (31%), Positives = 141/298 (47%), Gaps = 24/298 (8%)
 Frame = +1

Query: 259  SCKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGS 438
            SCK HP  +S  GIC YCL+D+L+ L CSDCGEQR++ CSCS    SSYRNS ST++VGS
Sbjct: 29   SCKKHPSRSSSTGICAYCLRDRLVKLVCSDCGEQRLSSCSCSDIS-SSYRNSCSTMEVGS 87

Query: 439  VGRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGS 618
            VGRISFLI+N+K      D ++   ++   ++ E++  +  ++L+RS+S  VEVKKN+  
Sbjct: 88   VGRISFLIENEK-----VDLQNHHQHLNSKRRGEKT-GEAAIMLRRSSSSCVEVKKNRWF 141

Query: 619  WRICKFFKNMRKKG-EIDGXXXXXXXXFMDSNINHEVSSM-----DCSSAKVSNSKELES 780
            W I + F+  R KG E +G           S+I     S+       S + +    +L  
Sbjct: 142  WSIERLFRKKRNKGSEKNGEFCDEKSEIWLSDIVARSRSLCSLRGGKSLSFLDEGSDLGP 201

Query: 781  RKSDLMDFENGVVVKDVNLCKMXXXXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDLVPS 960
              + + D  +G+ +   N C +                           + C    +  S
Sbjct: 202  SSAKISDVTSGIFLDHENKCDLFSGLDFRGTTKREFAQIPSDPDRVHSIQTCSVFPVKES 261

Query: 961  --GGMGSS----------------SCRFMVNERGIKKVKNNHMKAWKWIFKHHSGKKD 1080
               GM  S                +C   VN RG      +  + WKWIFK+H+  K+
Sbjct: 262  EFSGMDESAFIDLKLDSLPEPKPETCGIRVNARG------SGSRVWKWIFKYHNSIKN 313


>ref|XP_019238648.1| PREDICTED: uncharacterized protein LOC109218730 [Nicotiana attenuata]
 gb|OIT21585.1| hypothetical protein A4A49_34246 [Nicotiana attenuata]
          Length = 350

 Score =  133 bits (335), Expect = 3e-32
 Identities = 112/322 (34%), Positives = 150/322 (46%), Gaps = 50/322 (15%)
 Frame = +1

Query: 262  CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYS--TLDVG 435
            CK HP  +S VGIC +CLK+KL+NL CS+CGEQR++ CSCS    SS  N+    T +VG
Sbjct: 28   CKKHPS-SSSVGICAFCLKEKLVNLVCSECGEQRLSSCSCSDHSSSSNNNNRKSCTAEVG 86

Query: 436  SVGRISFLIDNDKGCTNGDDTKSIFSNMLKLKQ-SEQSKTQDVVLLKRSNSCVVEVKKN- 609
            SVGRISFL++NDK       T  +   +   KQ +++ KT+ V+LL+RS+S  VE+KKN 
Sbjct: 87   SVGRISFLLENDK-------TDQLPQQLQPKKQTTKREKTEQVILLRRSSSSCVEIKKNS 139

Query: 610  KGSWRICKFFKNMRKKG------EIDGXXXXXXXXFM--------------DSNINHEVS 729
             G W+I + F+  +KKG      E+D          +                N   E S
Sbjct: 140  NGFWKIKRLFRKKKKKGCENGNNELDESSEIWVSDALAVSRSRSVCSLRGGGFNDTDESS 199

Query: 730  SMDCSSAKVSN------SKELESRKSDLMDFENGVV-------VKDVNL-------CKMX 849
                SSAK+S+          E RKS      + VV       VK+ +L        K+ 
Sbjct: 200  DYRFSSAKISDVTGGILMDSDEPRKSGFKGTLDSVVQNRSIFPVKESDLDDSAFIDLKLD 259

Query: 850  XXXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDLVPSGGMG----SSSCRFMVNE--RGI 1011
                              S   Y        +     GG G    S SCR  VNE  RGI
Sbjct: 260  LSSESKQDFPAGMRLSNGSDNGYGFVHSIGNL----RGGNGVFTHSGSCRMSVNEYDRGI 315

Query: 1012 KKVKNNHMKAWKWIFKHHSGKK 1077
            K+    + K WKWIF+  SG+K
Sbjct: 316  KRSGKGN-KVWKWIFRQSSGRK 336


>ref|XP_016474350.1| PREDICTED: uncharacterized protein LOC107796131 [Nicotiana tabacum]
          Length = 350

 Score =  133 bits (335), Expect = 3e-32
 Identities = 111/321 (34%), Positives = 149/321 (46%), Gaps = 49/321 (15%)
 Frame = +1

Query: 262  CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYS--TLDVG 435
            CK HP  +S +GIC +CLK+KL+NL CS+CGEQR++ CSCS    SS  N+    T +VG
Sbjct: 28   CKKHPS-SSSIGICAFCLKEKLVNLVCSECGEQRLSSCSCSDHSSSSNNNNRKSCTAEVG 86

Query: 436  SVGRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKN-K 612
            SVGRISFL++N+K     D T     +  K + +++ KT+ V+ L+RS+S  VE+KKN  
Sbjct: 87   SVGRISFLLENEK----TDQTPQ--QSQPKKQTTKREKTEQVIFLRRSSSSCVEIKKNSN 140

Query: 613  GSWRICKFFKNMRKKG------EIDGXXXXXXXXFM--------------DSNINHEVSS 732
            G W+I + FK  +KKG      E+D          +                N   E S 
Sbjct: 141  GFWKIKRLFKKKKKKGCENGNNELDESSEIWVSDALAVSRSRSVCSLRGGGLNDTDEGSD 200

Query: 733  MDCSSAKVSN------SKELESRKSDLMDFENGVV-------VKDVNL-------CKMXX 852
               SSAK+S+          E RKS      + VV       VK+ +L        K+  
Sbjct: 201  YRFSSAKISDVTGGILMDSDEPRKSGFKGTLDSVVPNRSIFPVKESDLDDSAFIDLKLDL 260

Query: 853  XXXXXXXXXXXXXXXGKSKTEYSVFKECDTMDLVPSGGMG----SSSCRFMVNE--RGIK 1014
                             S   Y        +     GG G    S SCR  VNE  RGIK
Sbjct: 261  SSESKQDFPTAMRLSNSSDNGYGFVHSIGNL----RGGNGVFTHSGSCRMSVNEFDRGIK 316

Query: 1015 KVKNNHMKAWKWIFKHHSGKK 1077
            +    + K WKWIFK  SG+K
Sbjct: 317  RSGKGN-KVWKWIFKQSSGRK 336


>gb|PIN25922.1| hypothetical protein CDL12_01336 [Handroanthus impetiginosus]
          Length = 374

 Score =  133 bits (335), Expect = 5e-32
 Identities = 68/133 (51%), Positives = 95/133 (71%)
 Frame = +1

Query: 262 CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
           CK HP  +S VGIC YCLKD+L+ L CS+CGEQR++ CSCS    SSYRNS ST++VGSV
Sbjct: 28  CKKHP-NSSSVGICAYCLKDRLVKLVCSECGEQRLSSCSCSDVS-SSYRNSCSTMEVGSV 85

Query: 442 GRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSW 621
           GR+SFLI+N+K  +  ++  +  +   K ++ E+ K++ V+ L+RSNS  VEVKK+ G W
Sbjct: 86  GRVSFLIENEKIESQQNNNNNNNNPSSKSRRGEE-KSEQVIFLRRSNSSCVEVKKSNGFW 144

Query: 622 RICKFFKNMRKKG 660
           +I + F+  R KG
Sbjct: 145 KIKRLFRKKRNKG 157


>ref|XP_002514530.1| PREDICTED: uncharacterized protein LOC8264526 [Ricinus communis]
 gb|EEF47636.1| conserved hypothetical protein [Ricinus communis]
          Length = 408

 Score =  133 bits (334), Expect = 1e-31
 Identities = 75/140 (53%), Positives = 91/140 (65%), Gaps = 4/140 (2%)
 Frame = +1

Query: 262 CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
           CK HP  +S VGIC YCLKD+L+ L CSDCGEQR++ CSC  SE SS RNS  T++VGSV
Sbjct: 27  CKKHPS-SSSVGICAYCLKDRLVKLVCSDCGEQRLSSCSC--SEISSNRNS-CTVEVGSV 82

Query: 442 GRISFLIDNDKGCTNGDDTKS----IFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKN 609
           GRISFLI+ND     G+ + S      +N  K K   +  T DV LLKRS+S  VE+K+ 
Sbjct: 83  GRISFLIENDNQRNEGNSSSSSQFQSQANSNKPKICREKTTDDVFLLKRSSSSCVEIKRK 142

Query: 610 KGSWRICKFFKNMRKKGEID 669
            G WRI K F   R+KG  D
Sbjct: 143 SGFWRIGKLFSKKREKGNND 162


>ref|XP_011078544.1| uncharacterized protein LOC105162245 [Sesamum indicum]
          Length = 382

 Score =  132 bits (332), Expect = 1e-31
 Identities = 71/133 (53%), Positives = 93/133 (69%)
 Frame = +1

Query: 262 CKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQRITPCSCSGSEFSSYRNSYSTLDVGSV 441
           CK HP  +S VGIC YCLKD+L+ L CS+CGEQR++ CSCS    SSYRNS ST++VGSV
Sbjct: 28  CKKHPS-SSSVGICAYCLKDRLVKLVCSECGEQRLSSCSCSDVS-SSYRNSCSTMEVGSV 85

Query: 442 GRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSEQSKTQDVVLLKRSNSCVVEVKKNKGSW 621
           GR+SFLI+N+K      D++   +   K K+ E+ K + V+ L+RS+S  VEVKK+ G W
Sbjct: 86  GRVSFLIENEK-----IDSQQSNNPNPKSKRGEE-KAEQVIFLRRSSSSCVEVKKSNGFW 139

Query: 622 RICKFFKNMRKKG 660
           RI + FK  R KG
Sbjct: 140 RIKRLFKKKRNKG 152


>ref|XP_021616013.1| uncharacterized protein LOC110617498 isoform X1 [Manihot esculenta]
 ref|XP_021616014.1| uncharacterized protein LOC110617498 isoform X1 [Manihot esculenta]
          Length = 391

 Score =  129 bits (323), Expect = 3e-30
 Identities = 75/160 (46%), Positives = 94/160 (58%)
 Frame = +1

Query: 181 MKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQ 360
           MK   K    Y N           +  CK HP  +S VGIC YCLKD+L+ L CSDCGEQ
Sbjct: 1   MKERGKAVEMYDNDMFQDYSISSSDLPCKKHPS-SSSVGICAYCLKDRLVKLVCSDCGEQ 59

Query: 361 RITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKLKQSE 540
           R++ CSC  SE SS RNS  T++VGSVGRISFLI+NDK          +FS+     ++ 
Sbjct: 60  RLSSCSC--SEISSNRNS-CTVEVGSVGRISFLIENDK-------KSEVFSHSNSKPKNN 109

Query: 541 QSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKG 660
             K  +V LLKRS+S  VE+K+  G WRI K F   ++KG
Sbjct: 110 GEKADEVFLLKRSSSSCVEIKRKGGFWRIGKLFGKKKEKG 149


>ref|XP_012081985.1| uncharacterized protein LOC105641938 [Jatropha curcas]
 gb|KDP29324.1| hypothetical protein JCGZ_18245 [Jatropha curcas]
          Length = 383

 Score =  128 bits (322), Expect = 4e-30
 Identities = 81/169 (47%), Positives = 101/169 (59%), Gaps = 6/169 (3%)
 Frame = +1

Query: 181 MKTEQKNTRFYSNLTHIXXXXXXXNPS-----CKIHPFYTSEVGICPYCLKDKLMNLTCS 345
           MK   K    Y+N ++        + S     CK HP  +S VGIC YCLKD+LMNL CS
Sbjct: 1   MKERGKAVEIYNNNSNSDMYFQDYSTSSSDLPCKKHPS-SSSVGICAYCLKDRLMNLVCS 59

Query: 346 DCGEQRITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLK 525
           DCGEQR++ CSC  SE SS RNS  T+DVGSVGRISFLI+ND+   N + + S       
Sbjct: 60  DCGEQRLSSCSC--SEISSNRNS-CTVDVGSVGRISFLIENDQ--RNNEISNS------- 107

Query: 526 LKQSEQSKTQDV-VLLKRSNSCVVEVKKNKGSWRICKFFKNMRKKGEID 669
            K     KT++V  LLKRS+S  VE+K+  G WRI K F   R+KG  +
Sbjct: 108 -KPKIGEKTEEVNFLLKRSSSSCVEIKRKSGFWRIGKLFSKKREKGSYE 155


>dbj|GAU48521.1| hypothetical protein TSUD_242990 [Trifolium subterraneum]
          Length = 356

 Score =  127 bits (318), Expect = 9e-30
 Identities = 76/184 (41%), Positives = 107/184 (58%), Gaps = 4/184 (2%)
 Frame = +1

Query: 181 MKTEQKNTRFYSNLTHIXXXXXXXNPSCKIHPFYTSEVGICPYCLKDKLMNLTCSDCGEQ 360
           MK   K    Y+N            P CK HP  +S  G+C YCLK++L+ L CSDCGEQ
Sbjct: 1   MKDRNKGVEAYTNEMDCYYSTSDFLP-CKKHPSSSSSSGVCAYCLKERLVKLVCSDCGEQ 59

Query: 361 RITPCSCSGS-EFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKLKQS 537
           R++ CSCS   + +S RNS S ++VGSVGR+SFLI+N+K  TN    + + SN     ++
Sbjct: 60  RLSSCSCSDDIDITSNRNSCS-VEVGSVGRVSFLIENEKNETN--PIQHLSSNSTSNSKT 116

Query: 538 E-QSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKK--GEIDGXXXXXXXXFMDS 708
           +   K + VV+L+RS+S  V++K++KG WRI K F+  +KK  G   G        +M  
Sbjct: 117 KIHDKEEQVVVLRRSSSNCVDIKRHKGFWRIGKLFRKNKKKDCGRSVGGFDEKSEIWMVD 176

Query: 709 NINH 720
           N NH
Sbjct: 177 NNNH 180


>ref|XP_012478840.1| PREDICTED: uncharacterized protein LOC105794282 [Gossypium
           raimondii]
 gb|KJB30544.1| hypothetical protein B456_005G149300 [Gossypium raimondii]
          Length = 405

 Score =  127 bits (319), Expect = 2e-29
 Identities = 72/163 (44%), Positives = 95/163 (58%), Gaps = 4/163 (2%)
 Frame = +1

Query: 181 MKTEQKNTRFYSNLTHIXXXXXXXNPS----CKIHPFYTSEVGICPYCLKDKLMNLTCSD 348
           MK   K    Y+N  +I         S    C+ HP  +S  G+C YCLKD+L+NL CSD
Sbjct: 1   MKERGKAVEVYNNNNNIDFFQDYSTSSDVVPCRKHP-QSSSAGVCAYCLKDRLINLVCSD 59

Query: 349 CGEQRITPCSCSGSEFSSYRNSYSTLDVGSVGRISFLIDNDKGCTNGDDTKSIFSNMLKL 528
           CGEQR++ CSCS    ++ RNS +  +VGSVGR+SFLI+N+    N  D +         
Sbjct: 60  CGEQRLSSCSCS-EIATNPRNSCAGGEVGSVGRVSFLIENE----NSRDHQVSNPKAKST 114

Query: 529 KQSEQSKTQDVVLLKRSNSCVVEVKKNKGSWRICKFFKNMRKK 657
                +K++DV+LLKRSNS  VE+KK  G WRI +FFK  R K
Sbjct: 115 SSGNNTKSEDVILLKRSNSSCVEIKKKNGFWRIGRFFKKKRDK 157


Top