BLASTX nr result

ID: Cheilocostus21_contig00048226 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00048226
         (1128 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009388528.1| PREDICTED: uncharacterized protein LOC103975...   201   1e-53
ref|XP_009388527.1| PREDICTED: uncharacterized protein LOC103975...   201   1e-53
ref|XP_009388529.1| PREDICTED: uncharacterized protein LOC103975...   175   9e-45
ref|XP_008792087.1| PREDICTED: uncharacterized protein LOC103708...    90   8e-16
ref|XP_010914918.1| PREDICTED: CRC domain-containing protein TSO...    81   1e-12
ref|XP_019704481.1| PREDICTED: CRC domain-containing protein TSO...    77   1e-11

>ref|XP_009388528.1| PREDICTED: uncharacterized protein LOC103975320 isoform X2 [Musa
            acuminata subsp. malaccensis]
          Length = 1077

 Score =  201 bits (511), Expect = 1e-53
 Identities = 146/396 (36%), Positives = 200/396 (50%), Gaps = 22/396 (5%)
 Frame = +3

Query: 6    RKSDFVRSSLSLTSVGRAKFSLQHSNDATIFPANVPRSGEHTSTLVVNLIHT--EIESEG 179
            RKSD ++SS SL     AKFSL+ S DAT    ++ R   H S   V L  +  E E EG
Sbjct: 71   RKSDILKSSQSLALDAVAKFSLRQSADATTLSGDMLRCKSHGSVSTVKLTPSGHESEGEG 130

Query: 180  NPSCDHESCVXXXXXXXXXXXXXXHICGNPSASSESLNQVAEMHKNVHRICTITDESRME 359
            NPS +HE+C                IC + S S E   Q AE+ +NVH  CT T+E+ +E
Sbjct: 131  NPSPEHETCGGSSTCVDAFLADTLEICDDDSVSPECSKQSAELPQNVHGSCTSTNENIIE 190

Query: 360  SCKSDFARLRSVSPSGSALVDEFVTDPEVLSDSSDLQSKQEVVLSETVQSDCLTCQEMNA 539
            S K+    L+SVSP  SA VD F+ DP   SDS DLQS+Q VVL + + S C++ +EMN+
Sbjct: 191  SNKNRSDHLQSVSPLASAFVDRFIADPSEHSDSPDLQSQQAVVLPQRLCSVCISTEEMNS 250

Query: 540  RIHVS--------------LASASAENDFAADEACFLGNPSLNDGESNTVDEAKEDNKAL 677
             IHVS               +S++AE D   DEA FL N SL D +    DE KE +K  
Sbjct: 251  EIHVSPMKNSFASEATEWLNSSSNAEKDLKRDEASFLVNLSLKDDKPKMADEGKESHKIA 310

Query: 678  LDEQFESTINNSYVAGHSFMEPAADQQPEVDVVDQNRTNYISSGLDNKGNGDAFHEKPSV 857
            LDE       ++ +     M P          V+QN+T+ +S  LD+KG  +   EKP+ 
Sbjct: 311  LDEPLVPITTDNCMDTDLLMAPQ---------VNQNKTDDVSLKLDDKGK-NVSDEKPNN 360

Query: 858  ISYCDSNARDLKSQMPSTHDPEKXXXXXXXXXXXXDVQAVDNDAPLSLKKNVVFSDSQIP 1037
             S    N++ +     +                  DVQ V +D  +S  + ++ +D+Q P
Sbjct: 361  TSDFCCNSQGILQICAAGEQGVLHSSPQSLPELLQDVQVVSHDPDVS-GEFILSADNQTP 419

Query: 1038 NDE------RGMRKRLQFEVVENQKAITEGQGQSTI 1127
             DE      RGMRKRLQFE +EN K  T+G+ +  I
Sbjct: 420  YDEDTAQLQRGMRKRLQFEAIENHK--TDGECKKLI 453


>ref|XP_009388527.1| PREDICTED: uncharacterized protein LOC103975320 isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 1090

 Score =  201 bits (511), Expect = 1e-53
 Identities = 146/396 (36%), Positives = 200/396 (50%), Gaps = 22/396 (5%)
 Frame = +3

Query: 6    RKSDFVRSSLSLTSVGRAKFSLQHSNDATIFPANVPRSGEHTSTLVVNLIHT--EIESEG 179
            RKSD ++SS SL     AKFSL+ S DAT    ++ R   H S   V L  +  E E EG
Sbjct: 71   RKSDILKSSQSLALDAVAKFSLRQSADATTLSGDMLRCKSHGSVSTVKLTPSGHESEGEG 130

Query: 180  NPSCDHESCVXXXXXXXXXXXXXXHICGNPSASSESLNQVAEMHKNVHRICTITDESRME 359
            NPS +HE+C                IC + S S E   Q AE+ +NVH  CT T+E+ +E
Sbjct: 131  NPSPEHETCGGSSTCVDAFLADTLEICDDDSVSPECSKQSAELPQNVHGSCTSTNENIIE 190

Query: 360  SCKSDFARLRSVSPSGSALVDEFVTDPEVLSDSSDLQSKQEVVLSETVQSDCLTCQEMNA 539
            S K+    L+SVSP  SA VD F+ DP   SDS DLQS+Q VVL + + S C++ +EMN+
Sbjct: 191  SNKNRSDHLQSVSPLASAFVDRFIADPSEHSDSPDLQSQQAVVLPQRLCSVCISTEEMNS 250

Query: 540  RIHVS--------------LASASAENDFAADEACFLGNPSLNDGESNTVDEAKEDNKAL 677
             IHVS               +S++AE D   DEA FL N SL D +    DE KE +K  
Sbjct: 251  EIHVSPMKNSFASEATEWLNSSSNAEKDLKRDEASFLVNLSLKDDKPKMADEGKESHKIA 310

Query: 678  LDEQFESTINNSYVAGHSFMEPAADQQPEVDVVDQNRTNYISSGLDNKGNGDAFHEKPSV 857
            LDE       ++ +     M P          V+QN+T+ +S  LD+KG  +   EKP+ 
Sbjct: 311  LDEPLVPITTDNCMDTDLLMAPQ---------VNQNKTDDVSLKLDDKGK-NVSDEKPNN 360

Query: 858  ISYCDSNARDLKSQMPSTHDPEKXXXXXXXXXXXXDVQAVDNDAPLSLKKNVVFSDSQIP 1037
             S    N++ +     +                  DVQ V +D  +S  + ++ +D+Q P
Sbjct: 361  TSDFCCNSQGILQICAAGEQGVLHSSPQSLPELLQDVQVVSHDPDVS-GEFILSADNQTP 419

Query: 1038 NDE------RGMRKRLQFEVVENQKAITEGQGQSTI 1127
             DE      RGMRKRLQFE +EN K  T+G+ +  I
Sbjct: 420  YDEDTAQLQRGMRKRLQFEAIENHK--TDGECKKLI 453


>ref|XP_009388529.1| PREDICTED: uncharacterized protein LOC103975320 isoform X3 [Musa
            acuminata subsp. malaccensis]
 ref|XP_018675959.1| PREDICTED: uncharacterized protein LOC103975320 isoform X3 [Musa
            acuminata subsp. malaccensis]
          Length = 986

 Score =  175 bits (444), Expect = 9e-45
 Identities = 129/360 (35%), Positives = 178/360 (49%), Gaps = 22/360 (6%)
 Frame = +3

Query: 114  RSGEHTSTLVVNLIHT--EIESEGNPSCDHESCVXXXXXXXXXXXXXXHICGNPSASSES 287
            R   H S   V L  +  E E EGNPS +HE+C                IC + S S E 
Sbjct: 3    RCKSHGSVSTVKLTPSGHESEGEGNPSPEHETCGGSSTCVDAFLADTLEICDDDSVSPEC 62

Query: 288  LNQVAEMHKNVHRICTITDESRMESCKSDFARLRSVSPSGSALVDEFVTDPEVLSDSSDL 467
              Q AE+ +NVH  CT T+E+ +ES K+    L+SVSP  SA VD F+ DP   SDS DL
Sbjct: 63   SKQSAELPQNVHGSCTSTNENIIESNKNRSDHLQSVSPLASAFVDRFIADPSEHSDSPDL 122

Query: 468  QSKQEVVLSETVQSDCLTCQEMNARIHVS--------------LASASAENDFAADEACF 605
            QS+Q VVL + + S C++ +EMN+ IHVS               +S++AE D   DEA F
Sbjct: 123  QSQQAVVLPQRLCSVCISTEEMNSEIHVSPMKNSFASEATEWLNSSSNAEKDLKRDEASF 182

Query: 606  LGNPSLNDGESNTVDEAKEDNKALLDEQFESTINNSYVAGHSFMEPAADQQPEVDVVDQN 785
            L N SL D +    DE KE +K  LDE       ++ +     M P          V+QN
Sbjct: 183  LVNLSLKDDKPKMADEGKESHKIALDEPLVPITTDNCMDTDLLMAPQ---------VNQN 233

Query: 786  RTNYISSGLDNKGNGDAFHEKPSVISYCDSNARDLKSQMPSTHDPEKXXXXXXXXXXXXD 965
            +T+ +S  LD+KG  +   EKP+  S    N++ +     +                  D
Sbjct: 234  KTDDVSLKLDDKGK-NVSDEKPNNTSDFCCNSQGILQICAAGEQGVLHSSPQSLPELLQD 292

Query: 966  VQAVDNDAPLSLKKNVVFSDSQIPNDE------RGMRKRLQFEVVENQKAITEGQGQSTI 1127
            VQ V +D  +S  + ++ +D+Q P DE      RGMRKRLQFE +EN K  T+G+ +  I
Sbjct: 293  VQVVSHDPDVS-GEFILSADNQTPYDEDTAQLQRGMRKRLQFEAIENHK--TDGECKKLI 349


>ref|XP_008792087.1| PREDICTED: uncharacterized protein LOC103708783 [Phoenix dactylifera]
          Length = 503

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 100/387 (25%), Positives = 163/387 (42%), Gaps = 29/387 (7%)
 Frame = +3

Query: 51   GRAKFSLQHSNDATIFPANVPRSGEHTSTLVVNLIH-TEIESEGNPSCDHESCVXXXXXX 227
            G  KF   ++N+AT    +   +G   S  ++     T+ E + +     + C       
Sbjct: 77   GAEKFPPVYANNATALSESAQINGPLISMSMIQFGPCTQKEGDNDSPSQDQPCSSPSSCV 136

Query: 228  XXXXXXXXHICGNPSASSESLN-QVAEMHKNVHRICTITDESRMESCKSDFARLRSVSPS 404
                      C N S S +  + Q A M + +    T  DE +++  K DF +LR +SPS
Sbjct: 137  AAFLADPLENCDNSSGSPDLCSKQAAIMPQPIQADITSADEKQLKCIKEDFHQLRPISPS 196

Query: 405  GSALVDEFVTDP-EVLSDSS--DLQSKQEVVLSETVQSDCLTCQEMNARIHVSL------ 557
               LV+E   D  + L+  +  +L S+Q     + +++D  + +E NA IHVS       
Sbjct: 197  --ILVNEITADSVDCLTSHALLNLHSEQASDPPQALRNDFTSIEETNAEIHVSDVKKCAI 254

Query: 558  --------ASASAENDFAADEACFLGNPSLNDGESNTVDEAKEDNKALLDEQFESTINNS 713
                    +S  AE D   +E+ F    +  + +    DEAK+  +   DEQ    ++  
Sbjct: 255  TKATKSLGSSYLAEEDPPREESSF--PVAQIESKVKMADEAKDIYERSHDEQ-PGPVSTD 311

Query: 714  YVAGHSFMEPAADQQPEVDVVDQNRTNYISSGLDNKGNGDAFHEK--PSVISYCDSNARD 887
             +A        A  Q  +  +DQN  +Y S GL ++      H K  PS +  C    +D
Sbjct: 312  AIA-------VACSQNGLKSMDQNMASYPSCGLKDEDRDVVGHNKASPSTLVMCPHGTQD 364

Query: 888  L-KSQMPSTHDPEKXXXXXXXXXXXXDVQAVDNDAPLSLKKNVVFSDSQIPNDE------ 1046
              K++  +   PE             D+Q VD          +V++++QIP D+      
Sbjct: 365  ANKNRAEAGKWPEFDSTPQWLPESLQDIQVVDEHLDDPGAICIVYAENQIPYDQEEGTQH 424

Query: 1047 -RGMRKRLQFEVVENQKAITEGQGQST 1124
             RGMRKRLQFE VEN +       +S+
Sbjct: 425  QRGMRKRLQFEAVENHQMSIVSNSESS 451


>ref|XP_010914918.1| PREDICTED: CRC domain-containing protein TSO1-like isoform X1 [Elaeis
            guineensis]
          Length = 1077

 Score = 80.9 bits (198), Expect = 1e-12
 Identities = 100/389 (25%), Positives = 159/389 (40%), Gaps = 31/389 (7%)
 Frame = +3

Query: 21   VRSSLSLTSVGRAKFSLQHSNDATIFPANVPRSGEHTS-TLVVNLIHTEIESEGNPSCDH 197
            ++ S  L   G  KF   ++N+AT    +   +G   S ++V +   T+ E + +     
Sbjct: 67   LKRSQHLLLNGAEKFPPGYANNATEMSESAQINGPLISMSMVQSGSCTQKEGDNDCPSQD 126

Query: 198  ESCVXXXXXXXXXXXXXXHICGNPSASSESLN-QVAEMHKNVHRICTITDESRMESCKSD 374
            + C                 C NPS S +  + Q AEM + +    T  DE +++  K D
Sbjct: 127  QPCSSPSSCVDAFLADPLENCDNPSGSPDLYSKQAAEMPQRIQADVTSVDEKQIKCSKED 186

Query: 375  FARLRSVSPSGSALVDEFVTDPEVLSDSSDLQSKQEVVLSE---TVQSDCLTCQEMNARI 545
            F +L+ +SPS   LV E   D      S  L +      S+    + +D ++ +E N  I
Sbjct: 187  FNQLQPISPS--ILVKEITADSMDCLTSPALPNPHLEQASDPPRALLNDFISTEETNPEI 244

Query: 546  HVS--------LASASAENDFAADEACFLGNPSLN--------DGESNTVDEAKEDNKAL 677
            +VS         A+ S  + + A+E    G P           + +    +EAK+ N+  
Sbjct: 245  YVSDVKKCTITKATKSLGSSYQAEE----GPPGEESSFPVAQFESKLKMANEAKDINERS 300

Query: 678  LDEQFESTINNSYVAGHSFMEPAADQQPEVDVVDQNRTNYISSGLDNKGNGDAFHEK--P 851
             DEQ    ++   +A           Q  +  +DQN  +Y S GL +K      H K  P
Sbjct: 301  CDEQ-PGPVSTDAIA-------VTCSQNGLQSMDQNMASYPSFGLKDKDCDVVGHNKASP 352

Query: 852  SVISYCDSNARDLKSQMPSTHD-PEKXXXXXXXXXXXXDVQAVDNDAPLSLKKNVVFSDS 1028
            S        A+D   ++       E             DVQAVD          ++F+++
Sbjct: 353  STSVMYPHGAQDANKKLAVDGKCAELDSTPQWLPESHEDVQAVDKHLDNPGAICIMFAEN 412

Query: 1029 QIPND-------ERGMRKRLQFEVVENQK 1094
            QIP D       +RGMRKRLQFE VEN++
Sbjct: 413  QIPYDLEEGTQHQRGMRKRLQFEAVENRR 441


>ref|XP_019704481.1| PREDICTED: CRC domain-containing protein TSO1-like isoform X2 [Elaeis
            guineensis]
          Length = 986

 Score = 77.4 bits (189), Expect = 1e-11
 Identities = 86/309 (27%), Positives = 131/309 (42%), Gaps = 30/309 (9%)
 Frame = +3

Query: 258  CGNPSASSESLN-QVAEMHKNVHRICTITDESRMESCKSDFARLRSVSPSGSALVDEFVT 434
            C NPS S +  + Q AEM + +    T  DE +++  K DF +L+ +SPS   LV E   
Sbjct: 56   CDNPSGSPDLYSKQAAEMPQRIQADVTSVDEKQIKCSKEDFNQLQPISPS--ILVKEITA 113

Query: 435  DPEVLSDSSDLQSKQEVVLSE---TVQSDCLTCQEMNARIHVS--------LASASAEND 581
            D      S  L +      S+    + +D ++ +E N  I+VS         A+ S  + 
Sbjct: 114  DSMDCLTSPALPNPHLEQASDPPRALLNDFISTEETNPEIYVSDVKKCTITKATKSLGSS 173

Query: 582  FAADEACFLGNPSLN--------DGESNTVDEAKEDNKALLDEQFESTINNSYVAGHSFM 737
            + A+E    G P           + +    +EAK+ N+   DEQ    ++   +A     
Sbjct: 174  YQAEE----GPPGEESSFPVAQFESKLKMANEAKDINERSCDEQ-PGPVSTDAIA----- 223

Query: 738  EPAADQQPEVDVVDQNRTNYISSGLDNKGNGDAFHEK--PSVISYCDSNARDLKSQMPST 911
                  Q  +  +DQN  +Y S GL +K      H K  PS        A+D   ++   
Sbjct: 224  --VTCSQNGLQSMDQNMASYPSFGLKDKDCDVVGHNKASPSTSVMYPHGAQDANKKLAVD 281

Query: 912  HD-PEKXXXXXXXXXXXXDVQAVDNDAPLSLKKNVVFSDSQIPND-------ERGMRKRL 1067
                E             DVQAVD          ++F+++QIP D       +RGMRKRL
Sbjct: 282  GKCAELDSTPQWLPESHEDVQAVDKHLDNPGAICIMFAENQIPYDLEEGTQHQRGMRKRL 341

Query: 1068 QFEVVENQK 1094
            QFE VEN++
Sbjct: 342  QFEAVENRR 350

