BLASTX nr result

ID: Cheilocostus21_contig00055516 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00055516
         (946 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022888992.1| uncharacterized protein LOC111404413 [Olea e...    50   9e-10
gb|PRQ29559.1| putative nucleotidyltransferase, Ribonuclease H [...    69   5e-09
emb|CAN68706.1| hypothetical protein VITISV_001642 [Vitis vinifera]    45   1e-08
gb|PKU66055.1| hypothetical protein MA16_Dca017376 [Dendrobium c...    50   1e-08
emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]    63   4e-07
gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao]    63   4e-07
gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobrom...    61   1e-06
gb|PKA51272.1| hypothetical protein AXF42_Ash010712 [Apostasia s...    60   2e-06
gb|EOY26248.1| Uncharacterized protein TCM_046829 [Theobroma cacao]    60   3e-06
gb|PKA61057.1| putative mitochondrial protein [Apostasia shenzhe...    60   3e-06
gb|EOX94045.1| DNA/RNA polymerases superfamily protein, partial ...    60   3e-06
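
The hit summary above is a fixed-layout table (identifier, truncated description, bit score, E-value), so it can be read programmatically. The following is a minimal sketch, assuming the legacy BLAST 2.2.x plain-text layout shown here; the names `SUMMARY_LINE` and `parse_summary` are hypothetical helpers introduced for illustration, not part of any BLAST tool or library.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# identifier, free-text description, integer bit score, E-value.
# Assumption: legacy BLAST may print E-values such as "e-100" with no
# leading digit, so that form is accepted as well.
SUMMARY_LINE = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+)\s+(\d*\.?\d*e[-+]?\d+|\d+\.?\d*)\s*$"
)


def parse_summary(lines):
    """Return a list of dicts, one per hit row; non-matching lines are skipped."""
    hits = []
    for line in lines:
        match = SUMMARY_LINE.match(line)
        if match is None:
            continue
        identifier, description, bits, evalue = match.groups()
        if evalue.startswith("e"):
            # Normalise "e-100" to "1e-100" so float() accepts it.
            evalue = "1" + evalue
        hits.append(
            {
                "id": identifier,
                "description": description,
                "bits": int(bits),
                "evalue": float(evalue),
            }
        )
    return hits


# Three rows copied from the table above, plus the header line to show
# that non-data lines are ignored.
sample = [
    "Sequences producing significant alignments:                      (bits) Value",
    "ref|XP_022888992.1| uncharacterized protein LOC111404413 [Olea e...    50   9e-10",
    "gb|PRQ29559.1| putative nucleotidyltransferase, Ribonuclease H [...    69   5e-09",
    "emb|CAN68706.1| hypothetical protein VITISV_001642 [Vitis vinifera]    45   1e-08",
]

hits = parse_summary(sample)
for hit in hits:
    print(hit["id"], hit["bits"], hit["evalue"])
```

Note that the table is ordered by E-value rather than bit score: rows whose alignments report `Expect(2)` or `Expect(3)` carry a combined E-value over several HSPs, which is why a 50-bit hit can rank above a 69-bit one.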

>ref|XP_022888992.1| uncharacterized protein LOC111404413 [Olea europaea var.
           sylvestris]
          Length = 530

 Score = 50.1 bits (118), Expect(2) = 9e-10
 Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 2/52 (3%)
 Frame = +1

Query: 61  DLVIHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFGRDKIVALVIDHF 210
           D ++ D YLF+G +  I +  +R FL  +LH  GL GHFGR+K +ALV D F
Sbjct: 265 DFLLRDGYLFRGTKLCIPRSLLRKFLVWELHAGGLDGHFGREKTIALVEDRF 316



 Score = 42.4 bits (98), Expect(2) = 9e-10
 Identities = 22/53 (41%), Positives = 32/53 (60%)
 Frame = +2

Query: 290 NMSLYTLLFFSH*P*KDICLDIVVSSPKVAQGHDSILVVVARFPR*YILSYVL 448
           N  +YT L   H P  D+ +D V+   K+A+GHDSI VVV RF +  +L  ++
Sbjct: 344 NTRIYTPLPVPHTPWCDLSMDFVLGLLKIARGHDSIFVVVDRFSKMLLLENIV 396


>gb|PRQ29559.1| putative nucleotidyltransferase, Ribonuclease H [Rosa chinensis]
          Length = 680

 Score = 68.6 bits (166), Expect = 5e-09
 Identities = 71/246 (28%), Positives = 107/246 (43%), Gaps = 20/246 (8%)
 Frame = +1

Query: 61  DLVIHDDYLFKGIQFFI--TQVRSFLF*KLHTRGLVGHFGRDKIVALVIDHFLGRQLSGI 234
           + V+ D +LFKG +  I  T VR FL  +LH  G+ GHFGRDK +ALV D F    L   
Sbjct: 262 EFVLQDGFLFKGTKLCIPCTSVRDFLILELHAGGIAGHFGRDKTIALVEDRFYWPSLKRD 321

Query: 235 LLKLFFNIIFVSCSSY*IEHEPLY-PVIFLTLALKRYLLGHCCKLAKSCSRT*FYLSCGC 411
           + K+         + +  ++  LY P+       +   +     L K+  R         
Sbjct: 322 VAKVVERCRTCQLAKHKRQNTGLYTPLPVPHTPWQDISMDFVLGLPKTTQRHDSIFVVVD 381

Query: 412 *VSKMIHFILCTKSKDTSHVAKLF---IFFENGMIA*IIICQCLRICKVLLKDPIENLWH 582
             SKM HF+ C+K+ D S VAKLF   +   +G+   I+  + +R      K     LWH
Sbjct: 382 RFSKMAHFLPCSKTFDASEVAKLFMDEVVRLHGLPKTIVSDRDVRFMSYFWK----TLWH 437

Query: 583 ----NMEFSPIFICRNMIKLRW*TTVWVTPYLYR----------WEKIRN*ELLLPTAEI 720
                ++FS  +  +         T   T  + R           E IR+ + +LP AE 
Sbjct: 438 MLGTKLKFSSAYHPQ---------TDGQTEVVNRSLGNLLRSLVGEHIRSWDSILPIAEF 488

Query: 721 DHNNSI 738
            +NNS+
Sbjct: 489 AYNNSV 494


>emb|CAN68706.1| hypothetical protein VITISV_001642 [Vitis vinifera]
          Length = 1082

 Score = 45.1 bits (105), Expect(3) = 1e-08
 Identities = 33/99 (33%), Positives = 56/99 (56%), Gaps = 6/99 (6%)
 Frame = +2

Query: 281  TK*NMSLYTLLFFSH*P*KDICLDIVVSSPKVAQGHDSILVVVARFPR*YILSYVLSPK- 457
            +K N+ LYT L     P +D+ +D V+  P+  QG  SI VVV RF +   +++ +  K 
Sbjct: 834  SKQNIGLYTSLPIPSKPWEDLSMDFVLGLPRTQQGFHSIFVVVDRFSK---MTHFIPCKK 890

Query: 458  -IPLTWLNYLFFLKMV*LHKLSFANVSE----YAKYF*K 559
             + +++++ LFF ++V LH L  + VS+    +  YF K
Sbjct: 891  ALNVSYVSTLFFKEVVRLHGLPQSIVSDRDVKFMSYFLK 929



 Score = 42.0 bits (97), Expect(3) = 1e-08
 Identities = 23/52 (44%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
 Frame = +1

Query: 61  DLVIHDDYLFKGIQFFI--TQVRSFLF*KLHTRGLVGHFGRDKIVALVIDHF 210
           D  I + YLF   +  +  T +R  +  +LH  G+ GHFGRDK +ALV DHF
Sbjct: 758 DFQILEGYLFYKNRLCLPRTSLRDHVIWELHGGGMGGHFGRDKTIALVGDHF 809



 Score = 21.2 bits (43), Expect(3) = 1e-08
 Identities = 10/24 (41%), Positives = 14/24 (58%)
 Frame = +3

Query: 207 FFGASIKWDITKIIF*YHICQLFK 278
           FF  S+K D+ K+I     CQ+ K
Sbjct: 809 FFWPSLKKDVWKVIKQCRACQVGK 832


>gb|PKU66055.1| hypothetical protein MA16_Dca017376 [Dendrobium catenatum]
          Length = 579

 Score = 50.1 bits (118), Expect(2) = 1e-08
 Identities = 27/67 (40%), Positives = 40/67 (59%), Gaps = 2/67 (2%)
 Frame = +1

Query: 16  NFEHICKIVQVGNYHDLVIHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFGRDKIV 189
           +F+HI +  Q  NY  L + D+YLF G +  I +  +R  L  + H  GL GHFGRDK +
Sbjct: 183 DFQHIWEKCQGSNYKQLHVKDEYLFYGKRLCIPKCSLRLALVTESHDGGLSGHFGRDKTI 242

Query: 190 ALVIDHF 210
            L+ ++F
Sbjct: 243 NLLFENF 249



 Score = 38.9 bits (89), Expect(2) = 1e-08
 Identities = 28/75 (37%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
 Frame = +2

Query: 290 NMSLYTLLFFSH*P*KDICLDIVVSSPKVAQGHDSILVVVARFPR*YILSYVLSPK-IPL 466
           N   YT L     P  DI LD VV  P   +  DSI+VVV RF +  ++ +V   K +  
Sbjct: 277 NAGFYTPLPIPSSPWVDISLDFVVGLPLTQRKKDSIMVVVDRFSK--MVHFVACTKTLDA 334

Query: 467 TWLNYLFFLKMV*LH 511
           T +  LFF++++ LH
Sbjct: 335 THVADLFFMEIIRLH 349


>emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]
          Length = 665

 Score = 62.8 bits (151), Expect = 4e-07
 Identities = 48/148 (32%), Positives = 67/148 (45%), Gaps = 3/148 (2%)
 Frame = +1

Query: 49  GNYHDLVIHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFGRDKIVALVIDHFLGRQ 222
           G Y +  +HD YLFKG    +    +R  +  +LH+RG   HFGRDK +A+  DHF    
Sbjct: 325 GAYPNFXLHDGYLFKGTXLCLXDXSLREQVIWELHSRGXAXHFGRDKTIAMTEDHFYWPS 384

Query: 223 LSGILLKLFFNIIFVSCSSY*IEHEPLY-PVIFLTLALKRYLLGHCCKLAKSCSRT*FYL 399
           L   + K          S    ++  LY P+       +   +     L K+  R     
Sbjct: 385 LKRDVTKNVSKCRTCQPSKGRKKNTGLYMPLPVPHEPWQELSIDFVLGLPKTFRRHDSIF 444

Query: 400 SCGC*VSKMIHFILCTKSKDTSHVAKLF 483
                 SKM+HFI C+K+ D  HVAKLF
Sbjct: 445 VMVDRFSKMVHFIPCSKTLDAVHVAKLF 472


>gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao]
          Length = 1392

 Score = 62.8 bits (151), Expect = 4e-07
 Identities = 53/148 (35%), Positives = 75/148 (50%), Gaps = 10/148 (6%)
 Frame = +1

Query: 70   IHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFGRDKIVALVIDHFLG---RQLSGI 234
            +H+DYLFKG Q  I +  +R  +  +LH  GL GHFGRDK +A+V D +     RQ    
Sbjct: 938  LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRQDVER 997

Query: 235  LLKLFFNIIFVSCSS-----Y*IEHEPLYPVIFLTLALKRYLLGHCCKLAKSCSRT*FYL 399
            L+K     +F   S+     Y    EP  P I L++    ++LG    L K+  R     
Sbjct: 998  LVKRCPTCLFGKGSAQNTGLYVPLPEPDAPWIHLSM---DFVLG----LPKTAKRFDSIF 1050

Query: 400  SCGC*VSKMIHFILCTKSKDTSHVAKLF 483
                  SKM HFI C ++ D +H+A+LF
Sbjct: 1051 VVVDRFSKMAHFIPCFRTSDATHIAELF 1078


>gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 647

 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 51/148 (34%), Positives = 76/148 (51%), Gaps = 10/148 (6%)
 Frame = +1

Query: 70  IHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFGRDKIVALVIDHF----LGRQLSG 231
           +H+DYLFKG Q  I +  +R  +  +LH  GL GHFGRDK +A+V D +    + R +  
Sbjct: 266 LHEDYLFKGNQLCILEGSLREQIIGELHGNGLGGHFGRDKTLAMVADRYYWPKMHRDVER 325

Query: 232 ILLK----LFFNIIFVSCSSY*IEHEPLYPVIFLTLALKRYLLGHCCKLAKSCSRT*FYL 399
           ++ +    LF      +   Y    EP  P I L++    ++LG   K+AK        +
Sbjct: 326 LVKRCSTCLFGKGSAQNTGLYVPLLEPDAPWIHLSM---DFVLG-LPKIAKGFDSIFVVV 381

Query: 400 SCGC*VSKMIHFILCTKSKDTSHVAKLF 483
                 SKM HFI C K+ D +H+A+LF
Sbjct: 382 YQ---FSKMAHFIPCFKTSDATHIAELF 406


>gb|PKA51272.1| hypothetical protein AXF42_Ash010712 [Apostasia shenzhenica]
          Length = 481

 Score = 60.1 bits (144), Expect = 2e-06
 Identities = 47/146 (32%), Positives = 72/146 (49%), Gaps = 4/146 (2%)
 Frame = +1

Query: 61  DLVIHDDYLFKGIQFFI--TQVRSFLF*KLHTRGLVGHFGRDKIVALVID-HFLGRQLSG 231
           D +I + YLF+G +  I  + +R FL  +LH+ G  GHFGRDK  ALV D ++  RQL  
Sbjct: 28  DYLIREGYLFRGPRLCIPDSSLREFLIQELHSGGAAGHFGRDKTAALVSDRYYWPRQLKD 87

Query: 232 ILLKLFFNIIFVSCSSY*IEHEPLY-PVIFLTLALKRYLLGHCCKLAKSCSRT*FYLSCG 408
           +  ++         +    ++  LY P+       +   +     L ++  R    L   
Sbjct: 88  V-ARIVSRCRTCQVAKGGKQNTRLYTPLPIPDRPWEDLSMDFVLGLPRTSRRHDCILVVV 146

Query: 409 C*VSKMIHFILCTKSKDTSHVAKLFI 486
              SKM HFI C+K+ D SH+A LF+
Sbjct: 147 DRFSKMAHFIPCSKTSDASHIATLFV 172


>gb|EOY26248.1| Uncharacterized protein TCM_046829 [Theobroma cacao]
          Length = 672

 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 55/171 (32%), Positives = 82/171 (47%), Gaps = 12/171 (7%)
 Frame = +1

Query: 7   FLHNFEHICKIVQVGNYHDLV--IHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFG 174
           F+ NF  I   +      D    +H+DYLFKG Q  I +  +R  +  +LH  GL GHFG
Sbjct: 246 FIRNFSSIMSPITESLKKDGFEWLHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFG 305

Query: 175 RDKIVALVIDHF----LGRQLSGILLK----LFFNIIFVSCSSY*IEHEPLYPVIFLTLA 330
           RDK +A+V D +    + R +  ++ +    LF      +   Y    EP  P I L++ 
Sbjct: 306 RDKTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSM- 364

Query: 331 LKRYLLGHCCKLAKSCSRT*FYLSCGC*VSKMIHFILCTKSKDTSHVAKLF 483
              ++LG   K AK        +      SKM HFI C ++ D +H+A+LF
Sbjct: 365 --DFVLG-LPKTAKGFDSIFVVVDR---FSKMAHFIPCFRTSDATHIAELF 409


>gb|PKA61057.1| putative mitochondrial protein [Apostasia shenzhenica]
          Length = 829

 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 47/146 (32%), Positives = 72/146 (49%), Gaps = 4/146 (2%)
 Frame = +1

Query: 61  DLVIHDDYLFKGIQFFI--TQVRSFLF*KLHTRGLVGHFGRDKIVALVID-HFLGRQLSG 231
           D +I + YLF+G +  I  + +R FL  +LH+ G  GHFGRDK  ALV D ++  RQL  
Sbjct: 376 DYLIREGYLFRGPRLCIPDSSLREFLIQELHSSGAAGHFGRDKTAALVSDRYYWPRQLKD 435

Query: 232 ILLKLFFNIIFVSCSSY*IEHEPLY-PVIFLTLALKRYLLGHCCKLAKSCSRT*FYLSCG 408
           +  ++         +    ++  LY P+       +   +     L ++  R    L   
Sbjct: 436 V-TRVVSRCRTCQVAKGGKQNTGLYTPLPIPDRPWEDLSMDFVLGLPRTSRRHGCILVVV 494

Query: 409 C*VSKMIHFILCTKSKDTSHVAKLFI 486
              SKM HFI C+K+ D SH+A LF+
Sbjct: 495 DRFSKMAHFIPCSKTSDASHIATLFV 520


>gb|EOX94045.1| DNA/RNA polymerases superfamily protein, partial [Theobroma cacao]
          Length = 624

 Score = 59.7 bits (143), Expect = 3e-06
 Identities = 50/148 (33%), Positives = 75/148 (50%), Gaps = 10/148 (6%)
 Frame = +1

Query: 70  IHDDYLFKGIQFFITQ--VRSFLF*KLHTRGLVGHFGRDKIVALVIDHF----LGRQLSG 231
           +H+DYLFKG Q  I +  +R  +  +LH  GL GHFGRDK +A+V D +    + R +  
Sbjct: 450 LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVER 509

Query: 232 ILLK----LFFNIIFVSCSSY*IEHEPLYPVIFLTLALKRYLLGHCCKLAKSCSRT*FYL 399
           ++ +    LF      +   Y    EP  P I L++    ++LG   K AK        +
Sbjct: 510 LVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSM---DFVLG-LPKTAKGFDSIFVVV 565

Query: 400 SCGC*VSKMIHFILCTKSKDTSHVAKLF 483
                 SKM HFI C ++ D +H+A+LF
Sbjct: 566 DR---FSKMAHFIPCFRTSDATHIAELF 590

