BLASTX nr result

ID: Cheilocostus21_contig00035695

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00035695
         (1861 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009407935.1| PREDICTED: uncharacterized protein LOC103990...   346   e-108
ref|XP_018684442.1| PREDICTED: uncharacterized protein LOC103990...   309   2e-94
ref|XP_010942605.1| PREDICTED: uncharacterized protein LOC105060...   184   6e-47
ref|XP_018735943.1| cell surface glycoprotein (predicted) [Sugiy...    67   9e-08
ref|WP_064215345.1| hypothetical protein [Lactobacillus aviarius...    65   3e-07
gb|ATY59222.1| hypothetical protein A9K55_002533 [Cordyceps mili...    61   9e-06

>ref|XP_009407935.1| PREDICTED: uncharacterized protein LOC103990494 isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 541

 Score =  346 bits (887), Expect = e-108
 Identities = 229/497 (46%), Positives = 293/497 (58%), Gaps = 15/497 (3%)
 Frame = -2

Query: 1839 VKQKAFSYHIPDSMPVRNACHGSRGAWFLQIVASRLPDISMQQTVVYSSIN---RQYSSY 1669
            VKQK+F YHIPDSMP++ AC G R  WFL++ A+RL D+SM Q  + SS+N   R+ S Y
Sbjct: 57   VKQKSFLYHIPDSMPIKEACQGFRSTWFLRMDATRLTDVSMHQASIPSSMNQFRRRDSPY 116

Query: 1668 QGETIKPKSNKIPTSTNLMSPSNSASCKLQHQFSSTESDMDHKREPFISNM-PRSSTFNT 1492
              ET+ P S K+P S      SNS  C    +F  TE DMDH  E  ISNM   S+    
Sbjct: 117  LEETVTPNSYKLPAS------SNSNCCTPGGRFVCTEVDMDHILEGSISNMIDCSNASKA 170

Query: 1491 VDFTISSLTTTNVGNQEGKTATDHLKCDEAIVGVISSSSFSQKGV-KRMKLGHSIQGNGV 1315
            VDFTIS++T TN+ NQ G    D    DE  V V S    SQK + KR KL H IQGN V
Sbjct: 171  VDFTISTVTATNMINQAGNIVRDCSTWDEREVDVPSLPLISQKSLSKRRKLEHGIQGNVV 230

Query: 1314 GKEASDTRKTCKTG----FDSRKEVIRGNTLIDEAEYVIQREKVCQLEFPLERRQNMSSL 1147
            GKEA  + KT K+G     DS KEV + NT    A    Q++K    E P E+ Q+ +SL
Sbjct: 231  GKEAVISNKTRKSGVGSSLDSGKEVRKDNT----APNATQKDKAWVSEGPSEKEQHKNSL 286

Query: 1146 SKEETCSVDFSEKISVTGLILRYFSDNEVNSN-SIPKKNGRETENTAPQLNNFPKVQESH 970
            S EET S++ SEKISVTGLI RYFS+ E  S  S   KN  +T++   Q  +    QE H
Sbjct: 287  SMEETPSMNLSEKISVTGLISRYFSNQEEASTCSNLHKNVMDTQSVTVQPTSMLNTQEFH 346

Query: 969  LIGETGFA-TSIQNSADKKGSNDAESHLETACSFNINTSETNQFCSLSNKKTKSIQNNAT 793
            L   TGFA   +Q+SAD K SN+ ES   TACSFNIN  ET   C  SN   K  ++  T
Sbjct: 347  LNEATGFAMKDVQSSADDKSSNEGESDCNTACSFNIN-KETKHVCLTSNNVRKIAKSKPT 405

Query: 792  SSDGNRRRKATLSSVKKN--TTECSSHQADNVSSGGSKPPKSTNQTDGKCIYATPHTTPK 619
            +S+G  +  A +SS KK+    E   ++A N +S G+K   S++Q + K ++A P  T  
Sbjct: 406  TSEGKGQPSAIISSAKKDPRVAESRCYEAGNTTSCGNKSLISSSQVNRKLVHARPRRTST 465

Query: 618  LS--PSPFNKHNRAFNSYADRCEIGKHLVHAANNICSSVSEKKSAKSRWISRCRNMSATD 445
             S   SP +K  R  +S  D+ E+GK LV AAN I SSVS +K  +S +ISRC NMSA +
Sbjct: 466  FSSLSSPISKRKRILSS-NDKNEVGKRLVQAANKIYSSVSAEKLPQSSYISRCGNMSAPN 524

Query: 444  LIAIVKRFSFDVSDADD 394
             I I KRFSF+++D+DD
Sbjct: 525  SIVIPKRFSFEINDSDD 541


>ref|XP_018684442.1| PREDICTED: uncharacterized protein LOC103990494 isoform X2 [Musa
            acuminata subsp. malaccensis]
          Length = 525

 Score =  309 bits (792), Expect = 2e-94
 Identities = 211/467 (45%), Positives = 269/467 (57%), Gaps = 15/467 (3%)
 Frame = -2

Query: 1839 VKQKAFSYHIPDSMPVRNACHGSRGAWFLQIVASRLPDISMQQTVVYSSIN---RQYSSY 1669
            VKQK+F YHIPDSMP++ AC G R  WFL++ A+RL D+SM Q  + SS+N   R+ S Y
Sbjct: 57   VKQKSFLYHIPDSMPIKEACQGFRSTWFLRMDATRLTDVSMHQASIPSSMNQFRRRDSPY 116

Query: 1668 QGETIKPKSNKIPTSTNLMSPSNSASCKLQHQFSSTESDMDHKREPFISNM-PRSSTFNT 1492
              ET+ P S K+P S      SNS  C    +F  TE DMDH  E  ISNM   S+    
Sbjct: 117  LEETVTPNSYKLPAS------SNSNCCTPGGRFVCTEVDMDHILEGSISNMIDCSNASKA 170

Query: 1491 VDFTISSLTTTNVGNQEGKTATDHLKCDEAIVGVISSSSFSQKGV-KRMKLGHSIQGNGV 1315
            VDFTIS++T TN+ NQ G    D    DE  V V S    SQK + KR KL H IQGN V
Sbjct: 171  VDFTISTVTATNMINQAGNIVRDCSTWDEREVDVPSLPLISQKSLSKRRKLEHGIQGNVV 230

Query: 1314 GKEASDTRKTCKTG----FDSRKEVIRGNTLIDEAEYVIQREKVCQLEFPLERRQNMSSL 1147
            GKEA  + KT K+G     DS KEV + NT    A    Q++K    E P E+ Q+ +SL
Sbjct: 231  GKEAVISNKTRKSGVGSSLDSGKEVRKDNT----APNATQKDKAWVSEGPSEKEQHKNSL 286

Query: 1146 SKEETCSVDFSEKISVTGLILRYFSDNEVNSN-SIPKKNGRETENTAPQLNNFPKVQESH 970
            S EET S++ SEKISVTGLI RYFS+ E  S  S   KN  +T++   Q  +    QE H
Sbjct: 287  SMEETPSMNLSEKISVTGLISRYFSNQEEASTCSNLHKNVMDTQSVTVQPTSMLNTQEFH 346

Query: 969  LIGETGFA-TSIQNSADKKGSNDAESHLETACSFNINTSETNQFCSLSNKKTKSIQNNAT 793
            L   TGFA   +Q+SAD K SN+ ES   TACSFNIN  ET   C  SN   K  ++  T
Sbjct: 347  LNEATGFAMKDVQSSADDKSSNEGESDCNTACSFNIN-KETKHVCLTSNNVRKIAKSKPT 405

Query: 792  SSDGNRRRKATLSSVKKN--TTECSSHQADNVSSGGSKPPKSTNQTDGKCIYATPHTTPK 619
            +S+G  +  A +SS KK+    E   ++A N +S G+K   S++Q + K ++A P  T  
Sbjct: 406  TSEGKGQPSAIISSAKKDPRVAESRCYEAGNTTSCGNKSLISSSQVNRKLVHARPRRTST 465

Query: 618  LS--PSPFNKHNRAFNSYADRCEIGKHLVHAANNICSSVSEKKSAKS 484
             S   SP +K  R  +S  D+ E+GK LV AAN I SSVS +K  +S
Sbjct: 466  FSSLSSPISKRKRILSS-NDKNEVGKRLVQAANKIYSSVSAEKLPQS 511


>ref|XP_010942605.1| PREDICTED: uncharacterized protein LOC105060541 [Elaeis guineensis]
          Length = 584

 Score =  184 bits (467), Expect = 6e-47
 Identities = 150/517 (29%), Positives = 249/517 (48%), Gaps = 35/517 (6%)
 Frame = -2

Query: 1839 VKQKAFSYHIPDSMPVRNACHGSRGAWFLQIVASRLPDISMQQTVVYSSINR---QYSSY 1669
            VK+K   YH+PDSM ++ +  G  G WFL + AS+  D+  Q+  + S ++    Q  S 
Sbjct: 79   VKRKTHFYHLPDSMAIKKSYQGLAGTWFLHMNASQSVDMPRQEASIPSGLSHSTGQDFSN 138

Query: 1668 QGETIKPKSNKIPTSTNLMSPSNSASCKLQHQ-------FSSTESDMDHKREPFISNMPR 1510
            Q E ++P +   P   NL + +NS  C+ +++        +S E+       P +SN+  
Sbjct: 139  QREPVRPNAKVSPALENLTASNNSNQCEHRNREVMAIQVITSLETK---DSMPIVSNVTD 195

Query: 1509 SSTFNTVDFTISSLTTTNVGNQEGKTATDHLKCDEAIVGVISSSSFSQKGVKRMKLGHSI 1330
             S         +    TN+ +Q  K    H   +E  + V+S     ++  K+ K+    
Sbjct: 196  CSKSEKA----AEFRATNMKSQASKNGGHHSIFNERQLDVLSLPICGKEKRKKRKIWERG 251

Query: 1329 QGNGVGK-EASDTR--KTCKTGFDSRKEVI-----------RGNTLIDEAEYVIQREKVC 1192
              + VG     D R  K  +T      E+I           +  T+ DE++ + Q++K  
Sbjct: 252  NSSNVGSGHVKDIREEKLARTQKMHTPELISCLKPDKGKGTKDGTITDESQGMDQKDKNL 311

Query: 1191 QLEFPLERRQNMSSLSKEETCSVDFSEKISVTGLILRYFSD-NEVNSNSIPKKNGRETEN 1015
              E PL  R ++SS +  +T S+  S  IS++GLI RYFSD  EVNS S   KN + T+ 
Sbjct: 312  SSELPLLSRPHISSSTVRQTPSISLSGNISLSGLISRYFSDYEEVNSCSSLTKNIQNTKK 371

Query: 1014 TAPQLNNFPKVQESHLIGETGFAT-SIQNSADKKGSNDAESHLETACSFNINTSETNQF- 841
               Q N+  KV++      TG+A  + + S ++K S       +T CSFN+++    +  
Sbjct: 372  LTHQFNDVLKVEKLQQNEITGYANYAAEFSVNQKSSKRIALGSDTICSFNVSSDGVKRLE 431

Query: 840  ---CSLSNKKTKSIQNNATSSDGNRRRKATLSSVKKNTTECSSH----QADNVSSGGSKP 682
                ++SN++    +N   + D  RR K+ +  +K  +  C+      QAD+  +  +K 
Sbjct: 432  DYPMNISNREHD--KNEIVAFDRQRRTKSKM--MKDTSVACNDEYSPCQADHKKTFRNKF 487

Query: 681  PKSTNQTDGKCIYATPH-TTPKLSPSPFNKHNRAFNSYADRCEIGKHLVHAANNICSSVS 505
             +     + K +  TPH ++   +P  FNKH  A  S   R E+GK LV AANNI  S S
Sbjct: 488  SRGKQPVNKKKVCLTPHCSSVNSTPLSFNKHTGASRSKFGRNEVGKRLVQAANNIWFSAS 547

Query: 504  EKKSAKSRWISRCRNMSATDLIAIVKRFSFDVSDADD 394
             KKS +S ++SR  N+S  + +  V +  F++ ++DD
Sbjct: 548  GKKSLQSMFVSRSGNLSLPESVVPVNKLPFEIDESDD 584


>ref|XP_018735943.1| cell surface glycoprotein (predicted) [Sugiyamaella lignohabitans]
 gb|ANB13466.1| cell surface glycoprotein (predicted) [Sugiyamaella lignohabitans]
          Length = 2425

 Score = 67.4 bits (163), Expect = 9e-08
 Identities = 75/383 (19%), Positives = 144/383 (37%), Gaps = 1/383 (0%)
 Frame = -2

Query: 1629 TSTNLMSPSNSASCKLQHQFSSTESDMDHKREPFISNMPRSSTFNTVDFTISSLTTTNVG 1450
            TST+  S   S S   +H  +ST+       +PF  + P S+  +T   T S  T+T+V 
Sbjct: 1473 TSTSTSSKHTSTSTSSKHTSTSTKHSTSTSHQPF--SQPISTKSSTKHSTSSKTTSTHVT 1530

Query: 1449 NQEGKTATDHLKCDEAIVGVISSSSFSQKGVKRMKLGHSIQGNGVGKEA-SDTRKTCKTG 1273
            ++   T+T H    ++     ++S  + K     K    +        + + T KT  T 
Sbjct: 1531 SKSTSTSTKHSTTTKSTSSKGTTSRTTSKATTTSKTTSKLTSTTFKTTSKTTTSKTSSTS 1590

Query: 1272 FDSRKEVIRGNTLIDEAEYVIQREKVCQLEFPLERRQNMSSLSKEETCSVDFSEKISVTG 1093
              +  +    +    +      +           +  + +S +  +T S   S+  S T 
Sbjct: 1591 SKTTSKTTSTSKTTSKTSSTTSKTTSKTTSKTTSKTTSKTSSTTSKTTSKTTSKTSSTTS 1650

Query: 1092 LILRYFSDNEVNSNSIPKKNGRETENTAPQLNNFPKVQESHLIGETGFATSIQNSADKKG 913
                        S++  K   + T  T  +  +      S    +T   TS   +  K  
Sbjct: 1651 ------KTTSKTSSTTSKTTSKTTSKTTSKTTSKTSSTTSKTTSKTSSTTS--KTTSKTS 1702

Query: 912  SNDAESHLETACSFNINTSETNQFCSLSNKKTKSIQNNATSSDGNRRRKATLSSVKKNTT 733
            S  +++  +T+ + +  TS+T+   S +  KT S  +  TS   +   K T S     T+
Sbjct: 1703 STTSKTTSKTSSTTSKTTSKTSSTTSKTTSKTSSTTSKTTSKTSSTTSKTT-SKTSSTTS 1761

Query: 732  ECSSHQADNVSSGGSKPPKSTNQTDGKCIYATPHTTPKLSPSPFNKHNRAFNSYADRCEI 553
            + +S  +   S   SK   +T++T  K    T  TT K S +     ++  ++ +     
Sbjct: 1762 KTTSKTSSTTSKTTSKTSSTTSKTTSKTSSTTSKTTSKTSSTTSKTTSKTSSTTSKTTSK 1821

Query: 552  GKHLVHAANNICSSVSEKKSAKS 484
                     +  SS + K ++K+
Sbjct: 1822 TSSTTSKTTSKTSSTTSKTTSKT 1844


>ref|WP_064215345.1| hypothetical protein [Lactobacillus aviarius]
 gb|OAQ02506.1| hypothetical protein A3O10_07345 [Lactobacillus aviarius]
 gb|OAQ02510.1| hypothetical protein A3O11_00545 [Lactobacillus aviarius]
 gb|OAS78242.1| hypothetical protein A3O18_06875 [Lactobacillus aviarius]
 gb|PEG70191.1| hypothetical protein A3P04_07770 [Lactobacillus aviarius]
 gb|PEG73363.1| hypothetical protein A3O82_06715 [Lactobacillus aviarius]
          Length = 778

 Score = 65.5 bits (158), Expect = 3e-07
 Identities = 70/352 (19%), Positives = 141/352 (40%), Gaps = 9/352 (2%)
 Frame = -2

Query: 1698 SSINRQYSSYQGETIKPKSNKIPTSTNLMSPSNSASCKLQHQFSSTESDMDHKREPFISN 1519
            SS+    SS +  ++K  S+K  +S+ + S S+  S  ++   S   S         +S+
Sbjct: 383  SSLKSSSSSTKSSSVKSSSSKKSSSSKISSSSSQKSSNIESSLSKVSSSKQSSTSSNVSS 442

Query: 1518 MPRSSTFNTVDFTISSLTTTNVGNQEGKTATDHLKCDEAIVGVISSSSFSQKGVKRMKLG 1339
              +SS+ +    ++ S ++    +++  T++D           +SS   S          
Sbjct: 443  QLKSSSSSQKSSSVESSSSKTSSSKQSSTSSD-----------VSSQMKSS--------- 482

Query: 1338 HSIQGNGVGKEASDTRKTCKTGFDSRKEVIRGNTLIDEAEYVIQREKVCQLEFPLERRQN 1159
                 N   K++S    +     +S++     NT    +  +       Q     E   +
Sbjct: 483  -----NNSSKKSSSVENSSSKASNSKQSSTSSNT----SSQLKSSNSSSQKSSSAESSSS 533

Query: 1158 MSSLSKEETCSVDFSEKISVTGLILRYFSDNEVNSNSIPK-KNGRETENTAPQLNNFPKV 982
             +S SK+ + S + S ++  +    R  S  E +S+     K    + NT+ QL +    
Sbjct: 534  KASSSKQSSTSSNVSSQLKSSNSSSRKSSSTENSSSKTSSNKQSSTSSNTSSQLKSSNSS 593

Query: 981  QESHLIGETGFA--------TSIQNSADKKGSNDAESHLETACSFNINTSETNQFCSLSN 826
             +     E+  +        ++  N  +KK SN  +S  +T+ S   ++ E +   S S+
Sbjct: 594  NQKSSSAESSSSKTSSNKQSSTSSNGNNKKSSNVGDSSSKTSSSLKSSSGELSGKNSSSS 653

Query: 825  KKTKSIQNNATSSDGNRRRKATLSSVKKNTTECSSHQADNVSSGGSKPPKST 670
             K  S Q+ A     +   K+ LSS + N ++ ++    NV+  G+K   S+
Sbjct: 654  LKKSSTQDPADPGAKSSNSKSALSSTESNGSKKTADPDKNVADPGNKSSSSS 705


>gb|ATY59222.1| hypothetical protein A9K55_002533 [Cordyceps militaris]
          Length = 1578

 Score = 60.8 bits (146), Expect = 9e-06
 Identities = 72/372 (19%), Positives = 128/372 (34%), Gaps = 10/372 (2%)
 Frame = -2

Query: 1698 SSINRQYSSYQGETIKPKSNKIPTSTNLMSPSNSASCKLQHQFSSTES----------DM 1549
            SS      S +  +    ++  PT+++  S S++ S       SSTES          + 
Sbjct: 372  SSTETTALSTESTSSTESTSSTPTTSSTESTSSTESTSSTESTSSTESTSSTESTSSTES 431

Query: 1548 DHKREPFISNMPRSSTFNTVDFTISSLTTTNVGNQEGKTATDHLKCDEAIVGVISSSSFS 1369
                +   S    SST +T   T S+ +T    + E  ++T+            S+S  S
Sbjct: 432  TSSTKSTSSTESTSSTASTTSSTESTSSTATTSSTESTSSTESTSSTPTTSSTESTS--S 489

Query: 1368 QKGVKRMKLGHSIQGNGVGKEASDTRKTCKTGFDSRKEVIRGNTLIDEAEYVIQREKVCQ 1189
             +      +  S +     +  S T  T  T   S  E      +    E     E    
Sbjct: 490  TESTSSTPITSSTESMSSTESTSSTATTSSTKSTSSTESTSSTPITSSTESTSSTEST-- 547

Query: 1188 LEFPLERRQNMSSLSKEETCSVDFSEKISVTGLILRYFSDNEVNSNSIPKKNGRETENTA 1009
                       ++ S E T S + +     T       S  E  S++    +   T +T 
Sbjct: 548  -------SSTATTSSTESTSSTESTSSTPTTSSTTESTSSTESTSSTPTTSSTESTSSTT 600

Query: 1008 PQLNNFPKVQESHLIGETGFATSIQNSADKKGSNDAESHLETACSFNINTSETNQFCSLS 829
                +    + +     T   T   +S +   S    S  E+  S   +T  T+   S S
Sbjct: 601  SSTKSTSSTESTSSTPTTSSTTKSTSSTESTSSTPTTSSTESTSSTTSSTESTSSTESTS 660

Query: 828  NKKTKSIQNNATSSDGNRRRKATLSSVKKNTTECSSHQADNVSSGGSKPPKSTNQTDGKC 649
            +  T S    +TSS  +     T SS +  ++  SS ++ + +   S  P +++ T+   
Sbjct: 661  STPTTSSTTESTSSTESTSSTPTTSSTESTSSTTSSTESTSSTESTSSTPTTSSTTESTS 720

Query: 648  IYATPHTTPKLS 613
               +  +TP  S
Sbjct: 721  STESTSSTPTTS 732

