BLASTX nr result

ID: Cheilocostus21_contig00020473

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00020473
         (743 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009381996.1| PREDICTED: uncharacterized protein LOC103970...   125   1e-29
ref|XP_009413758.1| PREDICTED: uncharacterized protein LOC103994...   121   3e-28
ref|XP_010920259.1| PREDICTED: uncharacterized protein LOC105044...    96   4e-19
ref|XP_009383982.1| PREDICTED: uncharacterized protein LOC103971...    94   1e-18
ref|XP_008807585.2| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    94   3e-18
gb|KFK29074.1| hypothetical protein AALP_AA7G085300 [Arabis alpina]    80   1e-13
ref|XP_018469334.1| PREDICTED: uncharacterized protein LOC108841...    79   2e-13
ref|XP_007160670.1| hypothetical protein PHAVU_001G007100g [Phas...    78   8e-13
ref|XP_010061047.1| PREDICTED: uncharacterized protein LOC104448...    78   9e-13
gb|KCW67946.1| hypothetical protein EUGRSUZ_F01643 [Eucalyptus g...    78   1e-12
ref|XP_023534934.1| uncharacterized protein LOC111796515 [Cucurb...    77   1e-12
ref|XP_022976330.1| uncharacterized protein LOC111476763 [Cucurb...    77   1e-12
ref|XP_020698077.1| uncharacterized protein LOC110110794 [Dendro...    77   1e-12
ref|XP_010268701.1| PREDICTED: uncharacterized protein LOC104605...    77   2e-12
ref|XP_022936402.1| uncharacterized protein LOC111443031 [Cucurb...    77   2e-12
ref|XP_021721176.1| uncharacterized protein LOC110688724 [Chenop...    77   2e-12
ref|XP_010533365.1| PREDICTED: uncharacterized protein LOC104809...    77   2e-12
ref|XP_015969843.1| uncharacterized protein LOC107493257 [Arachi...    76   3e-12
emb|CDY16999.1| BnaA08g03630D [Brassica napus]                         76   4e-12
ref|XP_013666538.2| uncharacterized protein LOC106371061 [Brassi...    76   4e-12

>ref|XP_009381996.1| PREDICTED: uncharacterized protein LOC103970083 [Musa acuminata
           subsp. malaccensis]
          Length = 451

 Score =  125 bits (313), Expect = 1e-29
 Identities = 67/106 (63%), Positives = 76/106 (71%)
 Frame = -3

Query: 435 DRHGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSA 256
           D H ++KKKK+     PP P+TR             WEQ K LLSCRS  A+QV+DPSS 
Sbjct: 40  DEHERKKKKKK----PPPTPRTRTLSPPSSSSSS--WEQFKSLLSCRSTAATQVHDPSST 93

Query: 255 VARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAGSISQHE 118
            ARL RAACGSSICA+RDVVHGNTRVVHRSDTDL SS+A SI+QHE
Sbjct: 94  AARLGRAACGSSICAIRDVVHGNTRVVHRSDTDL-SSEASSIAQHE 138


>ref|XP_009413758.1| PREDICTED: uncharacterized protein LOC103994997 [Musa acuminata
           subsp. malaccensis]
          Length = 444

 Score =  121 bits (303), Expect = 3e-28
 Identities = 64/107 (59%), Positives = 73/107 (68%), Gaps = 1/107 (0%)
 Frame = -3

Query: 435 DRHGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSA 256
           D   +RK++K+   P+PP     +           SWEQ K LLSCRS  ASQV+DPSS 
Sbjct: 22  DDRRERKRRKRKKKPSPPGSTRTLAPPSPSSSSSSSWEQFKSLLSCRSTAASQVHDPSST 81

Query: 255 VARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG-SISQHE 118
            ARL RAACGSSICALRDV+HGNTRVVHR DTDLSSS    S+SQHE
Sbjct: 82  AARLGRAACGSSICALRDVLHGNTRVVHRPDTDLSSSAGSVSVSQHE 128


>ref|XP_010920259.1| PREDICTED: uncharacterized protein LOC105044151 [Elaeis guineensis]
          Length = 441

 Score = 95.9 bits (237), Expect = 4e-19
 Identities = 54/104 (51%), Positives = 63/104 (60%), Gaps = 2/104 (1%)
 Frame = -3

Query: 435 DRHGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSA 256
           D H ++KK  Q  PP PP P                WEQ K LLSCR +  SQV++PS  
Sbjct: 18  DHHKRKKKPSQLPPPRPPPPPQPTKRPAPPSS----WEQFKSLLSCRISAPSQVHEPSKL 73

Query: 255 VARLSRAACGSSICALRDVVHGNTRVVHRSDTD--LSSSDAGSI 130
             R S  +CGSSICA RDVVHGNTRVVHRSDTD   S+SD+G +
Sbjct: 74  GGRSS--SCGSSICAFRDVVHGNTRVVHRSDTDHYSSASDSGPL 115


>ref|XP_009383982.1| PREDICTED: uncharacterized protein LOC103971641 [Musa acuminata
           subsp. malaccensis]
 ref|XP_009383983.1| PREDICTED: uncharacterized protein LOC103971641 [Musa acuminata
           subsp. malaccensis]
          Length = 431

 Score = 94.4 bits (233), Expect = 1e-18
 Identities = 55/100 (55%), Positives = 65/100 (65%)
 Frame = -3

Query: 417 KKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAVARLSR 238
           KKKK+  PP+PP  +T++            WEQ KGLLSCR   AS+V+DP+S   +L R
Sbjct: 54  KKKKKKPPPSPPT-ETQLPPSSSSS-----WEQFKGLLSCRITAASRVHDPASTAGKLGR 107

Query: 237 AACGSSICALRDVVHGNTRVVHRSDTDLSSSDAGSISQHE 118
           AACG SICALRDV      VVHRSDTD  +SDAGS SQ E
Sbjct: 108 AACGPSICALRDV------VVHRSDTD-PNSDAGSTSQRE 140


>ref|XP_008807585.2| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC103719889 [Phoenix dactylifera]
          Length = 435

 Score = 93.6 bits (231), Expect = 3e-18
 Identities = 57/104 (54%), Positives = 67/104 (64%), Gaps = 4/104 (3%)
 Frame = -3

Query: 429 HGKRK-KKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCR-SAEASQVYDPSSA 256
           HG+ K KKK + PP PP P               SWEQ K LLSCR +A ++QV+DPS  
Sbjct: 17  HGQHKRKKKPSQPPPPPQPAKT-------PAPPSSWEQFKSLLSCRIAAPSTQVHDPSKL 69

Query: 255 VARLSRAACGSSICALRDVVHGNTRVVHRSDTD--LSSSDAGSI 130
             R S  +CGSSICA RDVVHGNTRVVHRSDTD   S+SD+G +
Sbjct: 70  GGRSS--SCGSSICAFRDVVHGNTRVVHRSDTDRYSSASDSGPL 111


>gb|KFK29074.1| hypothetical protein AALP_AA7G085300 [Arabis alpina]
          Length = 411

 Score = 80.5 bits (197), Expect = 1e-13
 Identities = 43/104 (41%), Positives = 58/104 (55%), Gaps = 3/104 (2%)
 Frame = -3

Query: 438 LDRHGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSS 259
           L  + K+K KK+ TPP P + +T             SW Q+K LLSC+  E  +V+DPS 
Sbjct: 3   LSSNPKKKNKKKKTPPPPHMQKT--------ISSSSSWSQIKNLLSCKQIEGPRVHDPSK 54

Query: 258 AVAR---LSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
             A+    S A CGSS+C   D ++GN RV+HRSD    SS+ G
Sbjct: 55  ITAQGPYTSSALCGSSLCRFSDAIYGNARVIHRSDHSPESSNLG 98


>ref|XP_018469334.1| PREDICTED: uncharacterized protein LOC108841044 [Raphanus sativus]
          Length = 417

 Score = 79.3 bits (194), Expect = 2e-13
 Identities = 41/104 (39%), Positives = 59/104 (56%), Gaps = 3/104 (2%)
 Frame = -3

Query: 438 LDRHGKRKKKKQNTPP---APPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYD 268
           L    K+ + K+ TPP    P L QT+            SW Q+K LLSC+  E  +V+D
Sbjct: 17  LSSDSKKNRNKKTTPPPHQTPTLKQTQKLKQKTISSTSSSWSQIKNLLSCKQIEGPRVHD 76

Query: 267 PSSAVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
           PS    +++ ++CGSS+C   DV++GN RV+HRSD    SS+ G
Sbjct: 77  PS----KITLSSCGSSLCKFSDVIYGNARVIHRSDHSPESSNLG 116


>ref|XP_007160670.1| hypothetical protein PHAVU_001G007100g [Phaseolus vulgaris]
 gb|ESW32664.1| hypothetical protein PHAVU_001G007100g [Phaseolus vulgaris]
          Length = 408

 Score = 77.8 bits (190), Expect = 8e-13
 Identities = 42/103 (40%), Positives = 61/103 (59%)
 Frame = -3

Query: 432 RHGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAV 253
           +H K+K+K++  PP+                    W+Q+K LL+C+  E S+V+DPS   
Sbjct: 25  QHQKQKQKQKQKPPSS-------------------WDQIKNLLTCKQMEGSRVHDPSKGY 65

Query: 252 ARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAGSISQ 124
           +R+  +   SSIC+ RDVVHGNTRVVHRSD   SS ++ S+ Q
Sbjct: 66  SRIGSSC--SSICSFRDVVHGNTRVVHRSDN--SSPESSSLGQ 104


>ref|XP_010061047.1| PREDICTED: uncharacterized protein LOC104448832 [Eucalyptus
           grandis]
          Length = 454

 Score = 77.8 bits (190), Expect = 9e-13
 Identities = 42/96 (43%), Positives = 56/96 (58%)
 Frame = -3

Query: 423 KRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAVARL 244
           K+K+K++   PAPP  Q+              W+Q K LL+C+  E S V+DPS    + 
Sbjct: 54  KQKQKQKQKQPAPPPSQSS-------------WDQFKNLLTCKQVEGSAVHDPSKNNNKP 100

Query: 243 SRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
             ++C SSIC+ RDVVHGNTRVVHR+D    SS  G
Sbjct: 101 VGSSC-SSICSFRDVVHGNTRVVHRADNSPESSSVG 135


>gb|KCW67946.1| hypothetical protein EUGRSUZ_F01643 [Eucalyptus grandis]
          Length = 506

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 42/96 (43%), Positives = 56/96 (58%)
 Frame = -3

Query: 423 KRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAVARL 244
           K+K+K++   PAPP  Q+              W+Q K LL+C+  E S V+DPS    + 
Sbjct: 106 KQKQKQKQKQPAPPPSQSS-------------WDQFKNLLTCKQVEGSAVHDPSKNNNKP 152

Query: 243 SRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
             ++C SSIC+ RDVVHGNTRVVHR+D    SS  G
Sbjct: 153 VGSSC-SSICSFRDVVHGNTRVVHRADNSPESSSVG 187


>ref|XP_023534934.1| uncharacterized protein LOC111796515 [Cucurbita pepo subsp. pepo]
          Length = 404

 Score = 77.4 bits (189), Expect = 1e-12
 Identities = 42/101 (41%), Positives = 58/101 (57%), Gaps = 3/101 (2%)
 Frame = -3

Query: 429 HGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDP---SS 259
           H KR+KK +N  P PP  Q+              W+Q+K LL+C+  E S+V++P   S 
Sbjct: 8   HNKRRKKHKNPSPPPPSAQSS-------------WDQIKSLLTCKQIETSRVHEPVKRSP 54

Query: 258 AVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
           A ++L  +   SSIC+ RDVVHGN +VVHR+D    SS  G
Sbjct: 55  AYSKLGSSC--SSICSFRDVVHGNAKVVHRADNSPESSSVG 93


>ref|XP_022976330.1| uncharacterized protein LOC111476763 [Cucurbita maxima]
          Length = 410

 Score = 77.4 bits (189), Expect = 1e-12
 Identities = 42/101 (41%), Positives = 58/101 (57%), Gaps = 3/101 (2%)
 Frame = -3

Query: 429 HGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDP---SS 259
           H KR+KK +N  P PP  Q+              W+Q+K LL+C+  E S+V++P   S 
Sbjct: 8   HNKRRKKHKNPSPPPPSAQSS-------------WDQIKSLLTCKQIETSRVHEPVKRSP 54

Query: 258 AVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
           A ++L  +   SSIC+ RDVVHGN +VVHR+D    SS  G
Sbjct: 55  AYSKLGSSC--SSICSFRDVVHGNAKVVHRADNSPESSSVG 93


>ref|XP_020698077.1| uncharacterized protein LOC110110794 [Dendrobium catenatum]
 gb|PKU61131.1| hypothetical protein MA16_Dca025294 [Dendrobium catenatum]
          Length = 412

 Score = 77.4 bits (189), Expect = 1e-12
 Identities = 46/108 (42%), Positives = 62/108 (57%), Gaps = 5/108 (4%)
 Frame = -3

Query: 429 HGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEA-SQVYDPSSAV 253
           H +++KKKQ  PP PP P + +            W+  + LL+C+S    ++V+DPSSA 
Sbjct: 15  HRRKQKKKQQRPPPPPPPPSSLTLS---------WDHFRNLLNCKSTSGPTKVHDPSSAA 65

Query: 252 ARLSRAACG----SSICALRDVVHGNTRVVHRSDTDLSSSDAGSISQH 121
           A   +   G    SS CAL DVVHG+ RVVHRSDTD  +S  G  S+H
Sbjct: 66  AAGKQQRLGLRFGSSGCALGDVVHGSVRVVHRSDTDQYNSSGGD-SRH 112


>ref|XP_010268701.1| PREDICTED: uncharacterized protein LOC104605577 [Nelumbo nucifera]
          Length = 410

 Score = 77.0 bits (188), Expect = 2e-12
 Identities = 43/98 (43%), Positives = 57/98 (58%), Gaps = 2/98 (2%)
 Frame = -3

Query: 432 RHGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAV 253
           +H KRK+K+Q  P +                    W+Q+K LL+C+  E SQV+DPS  V
Sbjct: 35  QHSKRKRKQQKQPSS--------------------WDQIKSLLTCKQIEGSQVHDPSKNV 74

Query: 252 ARLSR--AACGSSICALRDVVHGNTRVVHRSDTDLSSS 145
              S+  A+C SSIC+ +DVVHGNTRVVHR+D    SS
Sbjct: 75  GGYSKLGASC-SSICSFKDVVHGNTRVVHRADNSPESS 111


>ref|XP_022936402.1| uncharacterized protein LOC111443031 [Cucurbita moschata]
          Length = 411

 Score = 76.6 bits (187), Expect = 2e-12
 Identities = 42/101 (41%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
 Frame = -3

Query: 429 HGKRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDP---SS 259
           H KR+K+ +N  P PP P +             SW+Q+K LL+C+  E S+V++P   S 
Sbjct: 8   HNKRRKRHKNPSPPPPPPPSA----------QSSWDQIKSLLTCKQIETSRVHEPVKRSP 57

Query: 258 AVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
           A ++L  +   SSIC+ RDVVHGN +VVHR+D    SS  G
Sbjct: 58  AYSKLGSSC--SSICSFRDVVHGNAKVVHRADNSPESSSVG 96


>ref|XP_021721176.1| uncharacterized protein LOC110688724 [Chenopodium quinoa]
          Length = 439

 Score = 76.6 bits (187), Expect = 2e-12
 Identities = 45/108 (41%), Positives = 62/108 (57%), Gaps = 6/108 (5%)
 Frame = -3

Query: 429 HGKRKKKKQNTP------PAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYD 268
           H K + KKQ TP      P P  P +              W+Q+K LL+C+  E S+V+D
Sbjct: 41  HHKEENKKQQTPVKQSTIPKPKQPSS--------------WDQIKNLLTCKQVEGSRVHD 86

Query: 267 PSSAVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAGSISQ 124
           PS   +  + ++CGS +C+ RDVVHGNTRVVHRSD   SS ++ S+ Q
Sbjct: 87  PSKGTSS-AISSCGS-MCSFRDVVHGNTRVVHRSDN--SSPESSSLGQ 130


>ref|XP_010533365.1| PREDICTED: uncharacterized protein LOC104809174 [Tarenaya
           hassleriana]
          Length = 451

 Score = 76.6 bits (187), Expect = 2e-12
 Identities = 46/113 (40%), Positives = 62/113 (54%), Gaps = 13/113 (11%)
 Frame = -3

Query: 423 KRKKKKQNTPPA------PPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPS 262
           K+KKK+   PP+      PP  +T+            SW Q+K LLSC+  E  +V+DPS
Sbjct: 28  KKKKKRPAKPPSQPQTLPPPRTRTQTQKHKPLFSSPSSWSQIKYLLSCKQIEGPRVHDPS 87

Query: 261 S-------AVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAGSISQ 124
                     A LS ++CGSS C L DVV+GN RVVHRSD   SS ++ ++ Q
Sbjct: 88  KIPHHGLYTSATLS-SSCGSSFCRLSDVVYGNARVVHRSDHSSSSPESSNLGQ 139


>ref|XP_015969843.1| uncharacterized protein LOC107493257 [Arachis duranensis]
          Length = 411

 Score = 76.3 bits (186), Expect = 3e-12
 Identities = 37/70 (52%), Positives = 51/70 (72%)
 Frame = -3

Query: 327 WEQLKGLLSCRSAEASQVYDPSSAVARLSRAACGSSICALRDVVHGNTRVVHRSDTDLSS 148
           WEQ+K L+SC+  E S+V+DPSS+ + +  +   SSIC+ RDVVHGNTRVVHR+D   SS
Sbjct: 50  WEQIKNLISCKQVEGSRVHDPSSSKSMMGSSC--SSICSFRDVVHGNTRVVHRADN--SS 105

Query: 147 SDAGSISQHE 118
            ++ S+ Q E
Sbjct: 106 PESSSLGQQE 115


>emb|CDY16999.1| BnaA08g03630D [Brassica napus]
          Length = 414

 Score = 75.9 bits (185), Expect = 4e-12
 Identities = 37/96 (38%), Positives = 56/96 (58%)
 Frame = -3

Query: 423 KRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAVARL 244
           K+K +K+ TP      QT+            SW Q+K LLSC+  E  +V++PS    ++
Sbjct: 24  KKKSRKKQTPLPQTQTQTQTLKKKTVQSSSSSWSQIKNLLSCKQIEGPRVHEPS----KI 79

Query: 243 SRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
           + ++CGSS+C   DV++GN RV+HRSD    SS+ G
Sbjct: 80  TSSSCGSSLCKFSDVIYGNARVIHRSDHSPGSSNLG 115


>ref|XP_013666538.2| uncharacterized protein LOC106371061 [Brassica napus]
          Length = 418

 Score = 75.9 bits (185), Expect = 4e-12
 Identities = 37/96 (38%), Positives = 56/96 (58%)
 Frame = -3

Query: 423 KRKKKKQNTPPAPPLPQTRVXXXXXXXXXXXSWEQLKGLLSCRSAEASQVYDPSSAVARL 244
           K+K +K+ TP      QT+            SW Q+K LLSC+  E  +V++PS    ++
Sbjct: 24  KKKSRKKQTPLPQTQTQTQTLKKKTVQSSSSSWSQIKNLLSCKQIEGPRVHEPS----KI 79

Query: 243 SRAACGSSICALRDVVHGNTRVVHRSDTDLSSSDAG 136
           + ++CGSS+C   DV++GN RV+HRSD    SS+ G
Sbjct: 80  TSSSCGSSLCKFSDVIYGNARVIHRSDHSPGSSNLG 115
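
The hit summary near the top of this report (identifier, truncated description, bit score, E value) can be pulled into a table for sorting or filtering. The following is a minimal sketch, not part of the BLAST output itself. It assumes the legacy text layout shown in this report, where the summary starts after the "Sequences producing significant alignments" line and ends at the first ">" record. The names Hit, parse_evalue, parse_hit_table and SAMPLE are hypothetical and introduced only for illustration.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    accession: str    # e.g. "ref|XP_009381996.1|"
    description: str  # possibly truncated with "..."
    bits: float       # bit score as printed in the summary
    evalue: float     # E value as printed in the summary


# One summary row: identifier, description, then the last two
# whitespace-separated tokens are the bit score and the E value.
HIT_LINE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(token: str) -> float:
    # Legacy BLAST may print very small E values as "e-100" (assumption:
    # a missing mantissa means 1), so prepend "1" before converting.
    return float("1" + token if token.startswith("e") else token)


def parse_hit_table(report: str) -> List[Hit]:
    hits: List[Hit] = []
    in_table = False
    for line in report.splitlines():
        if line.startswith("Sequences producing significant alignments"):
            in_table = True
            continue
        if not in_table:
            continue
        if line.startswith(">"):
            break  # first alignment record ends the summary table
        match = HIT_LINE.match(line)
        if match:
            hits.append(
                Hit(
                    match["acc"],
                    match["desc"],
                    float(match["bits"]),
                    parse_evalue(match["evalue"]),
                )
            )
    return hits


# Three rows copied from the summary table of this report.
SAMPLE = """\
Sequences producing significant alignments:                      (bits) Value

ref|XP_009381996.1| PREDICTED: uncharacterized protein LOC103970...   125   1e-29
gb|KFK29074.1| hypothetical protein AALP_AA7G085300 [Arabis alpina]    80   1e-13
emb|CDY16999.1| BnaA08g03630D [Brassica napus]                         76   4e-12

>ref|XP_009381996.1| PREDICTED: uncharacterized protein LOC103970083
"""

if __name__ == "__main__":
    for hit in parse_hit_table(SAMPLE):
        print(hit.accession, hit.bits, hit.evalue)
```

The pattern anchors on the last two tokens of each row rather than on fixed columns, so rows with short, untruncated descriptions are handled the same way as truncated ones.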

