BLASTX nr result

ID: Cheilocostus21_contig00044430 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00044430
         (1918 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009403991.1| PREDICTED: uncharacterized protein LOC103987...   107   2e-20
ref|XP_009403990.1| PREDICTED: uncharacterized protein LOC103987...    81   5e-12
ref|XP_019704444.1| PREDICTED: uncharacterized protein LOC105041...    69   3e-08
ref|XP_019704447.1| PREDICTED: uncharacterized protein LOC105041...    66   2e-07
ref|XP_019704446.1| PREDICTED: uncharacterized protein LOC105041...    66   2e-07
ref|XP_019704445.1| PREDICTED: uncharacterized protein LOC105041...    66   2e-07

>ref|XP_009403991.1| PREDICTED: uncharacterized protein LOC103987420 isoform X2 [Musa
            acuminata subsp. malaccensis]
          Length = 851

 Score =  107 bits (267), Expect = 2e-20
 Identities = 143/552 (25%), Positives = 224/552 (40%), Gaps = 35/552 (6%)
 Frame = -3

Query: 1916 RCEGMKWLQNEKNEDLHHLVDDNFDGKGGSNVDAYSRTSATSSVEGYASPDYSIKDESFY 1737
            RC+G+    +EK+E+  HL  +NF G   +++    +T+  +S++GYA P+YS ++    
Sbjct: 386  RCKGLTQHLDEKDENQSHLAYNNFGGTEQNSIHVNLKTNGANSIDGYAMPEYSFENGWAS 445

Query: 1736 SDCSS---HSYDMVLFDSDSPESLLSVAMINKDSHG------------QALDDEGFCLHK 1602
             D S    H+ D+VL DS+ P  L  + + N DS G            QA+D  G C   
Sbjct: 446  PDVSCQNIHNADIVLLDSEPPGVLSCLLVGNDDSQGKDSSMHEMRTADQAIDC-GQCFSP 504

Query: 1601 LN-LSLERTAALPSDGRLHQDKPIAGNFSFPTINAYEDVANLGLISDISSNEQFTDESLK 1425
             + L  +  A + ++         AGNF F      +D+        +S++ + +D++L 
Sbjct: 505  TSELVPDEAAKVSNNDNFQHYNHSAGNFLFA-----DDLC-------MSTSGESSDQTL- 551

Query: 1424 ENIESPRNCCLYIESD---------DQASSTLKTGDWIIPSQYAITSHTALTDSDGQGST 1272
             N+E  RN   +I S          D + + L+  D  IP+     S     D  G+   
Sbjct: 552  -NVEE-RNIG-HIHSSLPTGVFSVHDTSCACLQHMDKSIPNTAYKNSAQMKNDIAGRCCV 608

Query: 1271 VILDRYYSNDDNTCSKPTCGEFEAEQQLIIDTVIMEKTNLFNCVV-VTDANQQFYDAEAF 1095
              LD +Y N+       T G        +    +  K +  +C   + D    F DA A 
Sbjct: 609  NNLDPHYRNE-------TSGMVSG----VAIVTMNSKLSCSHCSKDLPDMEYMFADAAA- 656

Query: 1094 KDEDAAPDSVFSLNIKANGTNDMFGVCHIHESYSNMSSAWCELKSGNLVIQNQSEMLSED 915
              E  + +S FS  +   G       CH    ++   + +             ++   + 
Sbjct: 657  --EGTSDNSYFSGTLSTLGP-----ACHSSTVHTGAPAEF------------DNDCTRDQ 697

Query: 914  GPENCLNHDSLEANRGFLVQGGKNLLEMMSQKEQSNNYHASGSFSETNSDLIGVDTYNLH 735
                C NH+ L    G +                          +ETNS++     Y  H
Sbjct: 698  ASFACSNHEILAEAMGDIT-------------------------AETNSEV----RYEAH 728

Query: 734  PMSLPKGDNTSTSTCYGESSEQTLPV--EERKKETEVCEKNALTKLASSGKSKDGNT--- 570
             +              G S  + +P   EERKK+ +    N  + +  SG +K+      
Sbjct: 729  LID-----------DVGTSMNRMVPDVDEERKKDMDDVANNLPSTMTISGNNKEETVVKI 777

Query: 569  ---GNIVFKSIAGSCTMVGALLVFLYLRRK-DSEKNSHTVVSLRAKRSSQEDRTQXXXXX 402
               G  V KSIAGS T++GALLVFL+LRRK D EKN HTV  L+ K + Q+  T      
Sbjct: 778  RSPGKKVLKSIAGSVTLIGALLVFLHLRRKSDKEKNYHTVTPLQTKETCQKVDTHNTVEV 837

Query: 401  XXXXXXYPGERL 366
                  YPGERL
Sbjct: 838  GKSDKKYPGERL 849


>ref|XP_009403990.1| PREDICTED: uncharacterized protein LOC103987420 isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 860

 Score = 80.9 bits (198), Expect = 5e-12
 Identities = 127/535 (23%), Positives = 209/535 (39%), Gaps = 37/535 (6%)
 Frame = -3

Query: 1916 RCEGMKWLQNEKNEDLHHLVDDNFDGKGGSNVDAYSRTSATSSVEGYASPDYSIKDESFY 1737
            RC+G+    +EK+E+  HL  +NF G   +++    +T+  +S++GYA P+YS ++    
Sbjct: 386  RCKGLTQHLDEKDENQSHLAYNNFGGTEQNSIHVNLKTNGANSIDGYAMPEYSFENGWAS 445

Query: 1736 SDCSS---HSYDMVLFDSDSPESLLSVAMINKDSHG------------QALDDEGFCLHK 1602
             D S    H+ D+VL DS+ P  L  + + N DS G            QA+D  G C   
Sbjct: 446  PDVSCQNIHNADIVLLDSEPPGVLSCLLVGNDDSQGKDSSMHEMRTADQAIDC-GQCFSP 504

Query: 1601 LN-LSLERTAALPSDGRLHQDKPIAGNFSFPTINAYEDVANLGLISDISSNEQFTDESLK 1425
             + L  +  A + ++         AGNF F      +D+        +S++ + +D++L 
Sbjct: 505  TSELVPDEAAKVSNNDNFQHYNHSAGNFLFA-----DDLC-------MSTSGESSDQTL- 551

Query: 1424 ENIESPRNCCLYIESD---------DQASSTLKTGDWIIPSQYAITSHTALTDSDGQGST 1272
             N+E  RN   +I S          D + + L+  D  IP+     S     D  G+   
Sbjct: 552  -NVEE-RNIG-HIHSSLPTGVFSVHDTSCACLQHMDKSIPNTAYKNSAQMKNDIAGRCCV 608

Query: 1271 VILDRYYSNDDNTCSKPTCGEFEAEQQLIIDTVIMEKTNLFNC----VVVTDANQQFYDA 1104
              LD +Y N+       T G       ++    I+   +  +C      + D    F DA
Sbjct: 609  NNLDPHYRNE-------TSG-------MVSGVAIVTMNSKLSCSHCSKDLPDMEYMFADA 654

Query: 1103 EAFKDEDAAPDSVFSLNIKANGTNDMFGVCHIHESYSNMSSAWCELKSGNLVIQNQSEML 924
             A   E  + +S FS  +   G       CH    ++   + +             ++  
Sbjct: 655  AA---EGTSDNSYFSGTLSTLGP-----ACHSSTVHTGAPAEF------------DNDCT 694

Query: 923  SEDGPENCLNHDSLEANRGFLVQGGKNLLEMMSQKEQSNNYHASGSFSETNSDLIGVDTY 744
             +     C NH+ L    G +                          +ETNS++     Y
Sbjct: 695  RDQASFACSNHEILAEAMGDIT-------------------------AETNSEV----RY 725

Query: 743  NLHPMSLPKGDNTSTSTCYGESSEQTLP--VEERKKETEVCEKNALTKLASSGKSKD--- 579
              H +              G S  + +P   EERKK+ +    N  + +  SG +K+   
Sbjct: 726  EAHLID-----------DVGTSMNRMVPDVDEERKKDMDDVANNLPSTMTISGNNKEETV 774

Query: 578  ---GNTGNIVFKSIAGSCTMVGALLVFLYLRRKDSEKNSHTVVSLRAKRSSQEDR 423
                + G  V KSIAGS T++GALLVFL+L   +          LR  R  +  R
Sbjct: 775  VKIRSPGKKVLKSIAGSVTLIGALLVFLHLSAGERAIKRKITTLLRPYRPKKHVR 829


>ref|XP_019704444.1| PREDICTED: uncharacterized protein LOC105041326 isoform X1 [Elaeis
            guineensis]
          Length = 1226

 Score = 68.9 bits (167), Expect = 3e-08
 Identities = 48/123 (39%), Positives = 64/123 (52%), Gaps = 8/123 (6%)
 Frame = -3

Query: 704  STSTCYGESSEQTLPVEERKKETEVCEKNALTKLASSGKSKDGNT------GNIVFKSIA 543
            S   C  E  E    +E+  +  E  EK ++T L S GK +DG T      G I+ KS+A
Sbjct: 1105 SAPGCAVEGEENNC-MEQISEGAEESEKESVTTLVSGGKRRDGTTSERHSSGKIMLKSVA 1163

Query: 542  GSCTMVGALLVFLYLRRK-DSEKN-SHTVVSLRAKRSSQEDRTQXXXXXXXXXXXYPGER 369
            G  T+VG+L + L+LRRK D EKN +  VV L+ ++   E  TQ           YPGER
Sbjct: 1164 GGITLVGSLFLLLHLRRKRDKEKNGAAVVVPLQIQKPGMEGSTQKKIEIGKSDSLYPGER 1223

Query: 368  LMF 360
            L F
Sbjct: 1224 LKF 1226


>ref|XP_019704447.1| PREDICTED: uncharacterized protein LOC105041326 isoform X4 [Elaeis
            guineensis]
 ref|XP_019704448.1| PREDICTED: uncharacterized protein LOC105041326 isoform X4 [Elaeis
            guineensis]
          Length = 1086

 Score = 66.2 bits (160), Expect = 2e-07
 Identities = 46/124 (37%), Positives = 64/124 (51%), Gaps = 9/124 (7%)
 Frame = -3

Query: 704  STSTCYGESSEQTLPVEERKKETEVCEKNALTKLASSGKSKDGNT------GNIVFKSIA 543
            S   C  E  E    +E+  +  E  EK ++T L S GK +DG T      G I+ KS+A
Sbjct: 964  SAPGCAVEGEENNC-MEQISEGAEESEKESVTTLVSGGKRRDGTTSERHSSGKIMLKSVA 1022

Query: 542  GSCTMVGALLVFLYL--RRKDSEKN-SHTVVSLRAKRSSQEDRTQXXXXXXXXXXXYPGE 372
            G  T+VG+L + L+L  R++D EKN +  VV L+ ++   E  TQ           YPGE
Sbjct: 1023 GGITLVGSLFLLLHLSRRKRDKEKNGAAVVVPLQIQKPGMEGSTQKKIEIGKSDSLYPGE 1082

Query: 371  RLMF 360
            RL F
Sbjct: 1083 RLKF 1086


>ref|XP_019704446.1| PREDICTED: uncharacterized protein LOC105041326 isoform X3 [Elaeis
            guineensis]
          Length = 1113

 Score = 66.2 bits (160), Expect = 2e-07
 Identities = 46/124 (37%), Positives = 64/124 (51%), Gaps = 9/124 (7%)
 Frame = -3

Query: 704  STSTCYGESSEQTLPVEERKKETEVCEKNALTKLASSGKSKDGNT------GNIVFKSIA 543
            S   C  E  E    +E+  +  E  EK ++T L S GK +DG T      G I+ KS+A
Sbjct: 991  SAPGCAVEGEENNC-MEQISEGAEESEKESVTTLVSGGKRRDGTTSERHSSGKIMLKSVA 1049

Query: 542  GSCTMVGALLVFLYL--RRKDSEKN-SHTVVSLRAKRSSQEDRTQXXXXXXXXXXXYPGE 372
            G  T+VG+L + L+L  R++D EKN +  VV L+ ++   E  TQ           YPGE
Sbjct: 1050 GGITLVGSLFLLLHLSRRKRDKEKNGAAVVVPLQIQKPGMEGSTQKKIEIGKSDSLYPGE 1109

Query: 371  RLMF 360
            RL F
Sbjct: 1110 RLKF 1113


>ref|XP_019704445.1| PREDICTED: uncharacterized protein LOC105041326 isoform X2 [Elaeis
            guineensis]
          Length = 1227

 Score = 66.2 bits (160), Expect = 2e-07
 Identities = 46/124 (37%), Positives = 64/124 (51%), Gaps = 9/124 (7%)
 Frame = -3

Query: 704  STSTCYGESSEQTLPVEERKKETEVCEKNALTKLASSGKSKDGNT------GNIVFKSIA 543
            S   C  E  E    +E+  +  E  EK ++T L S GK +DG T      G I+ KS+A
Sbjct: 1105 SAPGCAVEGEENNC-MEQISEGAEESEKESVTTLVSGGKRRDGTTSERHSSGKIMLKSVA 1163

Query: 542  GSCTMVGALLVFLYL--RRKDSEKN-SHTVVSLRAKRSSQEDRTQXXXXXXXXXXXYPGE 372
            G  T+VG+L + L+L  R++D EKN +  VV L+ ++   E  TQ           YPGE
Sbjct: 1164 GGITLVGSLFLLLHLSRRKRDKEKNGAAVVVPLQIQKPGMEGSTQKKIEIGKSDSLYPGE 1223

Query: 371  RLMF 360
            RL F
Sbjct: 1224 RLKF 1227


Top