BLASTX nr result

ID: Catharanthus22_contig00018659 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00018659
         (252 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus pe...    99   8e-19
gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus pe...    98   1e-18
gb|EMJ21939.1| hypothetical protein PRUPE_ppa023598mg [Prunus pe...    97   2e-18
gb|EMJ21583.1| hypothetical protein PRUPE_ppa021778mg [Prunus pe...    97   2e-18
gb|EMJ17360.1| hypothetical protein PRUPE_ppa015308mg, partial [...    94   2e-17
gb|EMJ13768.1| hypothetical protein PRUPE_ppa015570mg, partial [...    93   3e-17
gb|EMJ28035.1| hypothetical protein PRUPE_ppa017565mg, partial [...    81   2e-13
gb|EMJ04159.1| hypothetical protein PRUPE_ppa015086mg [Prunus pe...    80   2e-13
ref|XP_004154771.1| PREDICTED: kanadaptin-like [Cucumis sativus]       80   4e-13
gb|EMJ02107.1| hypothetical protein PRUPE_ppb018196mg [Prunus pe...    78   1e-12
gb|EMS54598.1| Transposon Ty3-G Gag-Pol polyprotein [Triticum ur...    78   1e-12
gb|EMJ01397.1| hypothetical protein PRUPE_ppa016013mg, partial [...    78   1e-12
gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobrom...    77   2e-12
emb|CAN62756.1| hypothetical protein VITISV_011119 [Vitis vinifera]    77   2e-12
gb|EOY26248.1| Uncharacterized protein TCM_046829 [Theobroma cacao]    76   4e-12
gb|EOY16837.1| Uncharacterized protein TCM_035725 [Theobroma cacao]    76   5e-12
gb|EOY08377.1| Uncharacterized protein TCM_022739 [Theobroma cacao]    76   5e-12
gb|EOY01158.1| DNA/RNA polymerases superfamily protein [Theobrom...    76   5e-12
gb|EOX94045.1| DNA/RNA polymerases superfamily protein, partial ...    76   5e-12
gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao]    75   7e-12

>gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica]
          Length = 1485

 Score = 98.6 bits (244), Expect = 8e-19
 Identities = 45/81 (55%), Positives = 59/81 (72%)
 Frame = -2

Query: 251  RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
            ++P  D+F+++G+  KGNQLCIPVSSLREK+IRD          GRDKTI  + ERFYWP
Sbjct: 1073 QEPMADYFLNEGYLFKGNQLCIPVSSLREKLIRDLHGGGLSGHLGRDKTIAGMEERFYWP 1132

Query: 71   HLRKDVNKIVQRCYVYQTAKG 9
             L++DV  IV++CY  QT+KG
Sbjct: 1133 QLKRDVGTIVRKCYTCQTSKG 1153


>gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica]
          Length = 1493

 Score = 97.8 bits (242), Expect = 1e-18
 Identities = 45/81 (55%), Positives = 58/81 (71%)
 Frame = -2

Query: 251  RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
            ++P  D+F+ +G+  KGNQLCIPVSSLREK+IRD          GRDKTI  + ERFYWP
Sbjct: 1081 QEPMTDYFLTEGYLFKGNQLCIPVSSLREKLIRDLHGGGLSGHLGRDKTIAGMEERFYWP 1140

Query: 71   HLRKDVNKIVQRCYVYQTAKG 9
             L++DV  IV++CY  QT+KG
Sbjct: 1141 QLKRDVGTIVRKCYTCQTSKG 1161


>gb|EMJ21939.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica]
          Length = 1457

 Score = 97.4 bits (241), Expect = 2e-18
 Identities = 44/81 (54%), Positives = 59/81 (72%)
 Frame = -2

Query: 251  RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
            ++P  D+F+++G+  KGNQLCIPVSSLREK+I+D          GRDKTI  + ERFYWP
Sbjct: 1049 QEPMADYFLNEGYLFKGNQLCIPVSSLREKLIQDLHGGGLSGHLGRDKTIAGMKERFYWP 1108

Query: 71   HLRKDVNKIVQRCYVYQTAKG 9
             L++DV  IV++CY  QT+KG
Sbjct: 1109 QLKRDVGTIVRKCYTCQTSKG 1129


>gb|EMJ21583.1| hypothetical protein PRUPE_ppa021778mg [Prunus persica]
          Length = 1384

 Score = 97.1 bits (240), Expect = 2e-18
 Identities = 44/81 (54%), Positives = 59/81 (72%)
 Frame = -2

Query: 251  RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
            ++P  D+F+++G+  KGNQLCIPVSSLREK+I+D          GRDKTI  + ERFYWP
Sbjct: 998  QEPMADYFLNEGYLFKGNQLCIPVSSLREKLIQDLHGGGLSGHLGRDKTIAGMEERFYWP 1057

Query: 71   HLRKDVNKIVQRCYVYQTAKG 9
             L++DV  IV++CY  QT+KG
Sbjct: 1058 QLKRDVGTIVRKCYTCQTSKG 1078


>gb|EMJ17360.1| hypothetical protein PRUPE_ppa015308mg, partial [Prunus persica]
          Length = 1150

 Score = 94.0 bits (232), Expect = 2e-17
 Identities = 45/81 (55%), Positives = 57/81 (70%)
 Frame = -2

Query: 251  RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
            ++P  D+F+++G+  KGNQLCIPVSSLREK IRD          GRDKTIV   ERFYW 
Sbjct: 840  QEPMADYFLNEGYLFKGNQLCIPVSSLREKPIRDLHGGGLSGHLGRDKTIVGTEERFYWL 899

Query: 71   HLRKDVNKIVQRCYVYQTAKG 9
             L++DV  IV++CY  QT+KG
Sbjct: 900  QLKRDVGTIVRKCYSCQTSKG 920


>gb|EMJ13768.1| hypothetical protein PRUPE_ppa015570mg, partial [Prunus persica]
          Length = 541

 Score = 93.2 bits (230), Expect = 3e-17
 Identities = 43/81 (53%), Positives = 57/81 (70%)
 Frame = -2

Query: 251 RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
           ++P  D+F+++G+  KGNQLCIPVSSLREK+IRD          G DKTI  + E FYWP
Sbjct: 180 QEPMADYFLNEGYLFKGNQLCIPVSSLREKLIRDLHGGGLSGHLGCDKTIAGMEETFYWP 239

Query: 71  HLRKDVNKIVQRCYVYQTAKG 9
            L++DV  IV++CY  QT+KG
Sbjct: 240 QLKRDVGTIVRKCYTCQTSKG 260


>gb|EMJ28035.1| hypothetical protein PRUPE_ppa017565mg, partial [Prunus persica]
          Length = 914

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 39/80 (48%), Positives = 51/80 (63%)
 Frame = -2

Query: 248 QPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPH 69
           +P  DFF+++G+  K N LCIPVS LREK+ RD          GR  TIV + ERFYWP 
Sbjct: 555 EPMADFFLNEGYLFKANLLCIPVSPLREKLSRDLHGGGLSGYLGRYTTIVGLEERFYWPQ 614

Query: 68  LRKDVNKIVQRCYVYQTAKG 9
           L++ V  IV++CY+ Q  KG
Sbjct: 615 LKRYVGTIVRKCYICQVLKG 634


>gb|EMJ04159.1| hypothetical protein PRUPE_ppa015086mg [Prunus persica]
          Length = 606

 Score = 80.5 bits (197), Expect = 2e-13
 Identities = 38/76 (50%), Positives = 51/76 (67%)
 Frame = -2

Query: 236 DFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKD 57
           D+F+++G+  KGNQLCIPVSSLREK+IRD          G  K I  + ERFYWP L++D
Sbjct: 297 DYFLNEGYLFKGNQLCIPVSSLREKLIRDLHGEGLSGHLGLVKIIAGLEERFYWPQLKRD 356

Query: 56  VNKIVQRCYVYQTAKG 9
           V  IV +C++   +KG
Sbjct: 357 VGTIVCKCHICLVSKG 372


>ref|XP_004154771.1| PREDICTED: kanadaptin-like [Cucumis sativus]
          Length = 962

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 36/79 (45%), Positives = 51/79 (64%)
 Frame = -2

Query: 239 DDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRK 60
           +DF I +G+  KG+QLCIP +SLRE ++++          G+DKT  ++  RF+WP LR+
Sbjct: 74  EDFHIMEGYLFKGDQLCIPRTSLREALLKEAHSGGLAGHFGQDKTFEIISHRFFWPQLRR 133

Query: 59  DVNKIVQRCYVYQTAKGHS 3
           D N  V+RC  YQ AKG S
Sbjct: 134 DCNNFVKRCPTYQRAKGLS 152


>gb|EMJ02107.1| hypothetical protein PRUPE_ppb018196mg [Prunus persica]
          Length = 532

 Score = 78.2 bits (191), Expect = 1e-12
 Identities = 39/79 (49%), Positives = 49/79 (62%)
 Frame = -2

Query: 248 QPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPH 69
           Q   DF + DG+  KGN+LCI  +S REK+IRD          GRDKTI  + ER+YWP 
Sbjct: 318 QSVTDFHLSDGYLFKGNKLCILETSSREKLIRDLYRGGLSGHLGRDKTIASMEERYYWPQ 377

Query: 68  LRKDVNKIVQRCYVYQTAK 12
           L+K V KIVQ+ Y  Q +K
Sbjct: 378 LKKGVGKIVQKSYTCQVSK 396


>gb|EMS54598.1| Transposon Ty3-G Gag-Pol polyprotein [Triticum urartu]
          Length = 1704

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 35/77 (45%), Positives = 49/77 (63%)
 Frame = -2

Query: 239 DDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRK 60
           DD+ + DG+  K ++LCIP SSL +K++R+          GRDKTI  +  R++WP L++
Sbjct: 636 DDYLMQDGYLFKNDRLCIPKSSLHDKLVRELHSSDLSGHVGRDKTIANLEARYFWPQLKR 695

Query: 59  DVNKIVQRCYVYQTAKG 9
           D  K VQRC V QT KG
Sbjct: 696 DAGKFVQRCPVCQTCKG 712


>gb|EMJ01397.1| hypothetical protein PRUPE_ppa016013mg, partial [Prunus persica]
          Length = 1057

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 36/81 (44%), Positives = 50/81 (61%)
 Frame = -2

Query: 251 RQPCDDFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWP 72
           ++P  D+F+++G+  KGNQLCIPVSSLREK+IRD                      FYWP
Sbjct: 656 QEPVADYFLNEGYLFKGNQLCIPVSSLREKLIRDLHGGG-----------------FYWP 698

Query: 71  HLRKDVNKIVQRCYVYQTAKG 9
            L++D+  IV++CY  QT+KG
Sbjct: 699 QLKRDIGTIVRKCYTCQTSKG 719


>gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1452

 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 34/73 (46%), Positives = 49/73 (67%)
 Frame = -2

Query: 227  IHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVNK 48
            +H+ +  KGNQLCIP  SLRE++IR+          GRDKT+V+V +R+YWP +R+DV +
Sbjct: 998  LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLVMVADRYYWPKMRRDVER 1057

Query: 47   IVQRCYVYQTAKG 9
            +V+RC      KG
Sbjct: 1058 LVKRCPACLFGKG 1070


>emb|CAN62756.1| hypothetical protein VITISV_011119 [Vitis vinifera]
          Length = 465

 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 37/68 (54%), Positives = 47/68 (69%)
 Frame = -2

Query: 236 DFFIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKD 57
           +F +HDGF  +G QLCIP SSLRE++IR+          GRDKTI +V ER+YWP L++D
Sbjct: 395 NFLVHDGFLYRGTQLCIPRSSLREQLIRELYADXLGGHVGRDKTISLVDERYYWPQLKQD 454

Query: 56  VNKIVQRC 33
           V   V RC
Sbjct: 455 VGHFVXRC 462


>gb|EOY26248.1| Uncharacterized protein TCM_046829 [Theobroma cacao]
          Length = 672

 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 33/74 (44%), Positives = 49/74 (66%)
 Frame = -2

Query: 230 FIHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVN 51
           ++H+ +  KGNQLCIP  SLRE++IR+          GRDKT+ +V +R+YWP +R+DV 
Sbjct: 268 WLHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVE 327

Query: 50  KIVQRCYVYQTAKG 9
           ++V+RC      KG
Sbjct: 328 RLVKRCPACLFGKG 341


>gb|EOY16837.1| Uncharacterized protein TCM_035725 [Theobroma cacao]
          Length = 499

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 33/73 (45%), Positives = 48/73 (65%)
 Frame = -2

Query: 227 IHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVNK 48
           +H+ +  KGNQLCIP  SLRE++IR+          GRDKT+ +V +R+YWP +R+DV +
Sbjct: 45  LHEDYLFKGNQLCIPKGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVER 104

Query: 47  IVQRCYVYQTAKG 9
           +V+RC      KG
Sbjct: 105 LVKRCPACLFGKG 117


>gb|EOY08377.1| Uncharacterized protein TCM_022739 [Theobroma cacao]
          Length = 379

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 33/73 (45%), Positives = 48/73 (65%)
 Frame = -2

Query: 227 IHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVNK 48
           +H+ +  KGNQLCIP  SLRE++IR+          GRDKT+ +V +R+YWP +R+DV +
Sbjct: 45  LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVER 104

Query: 47  IVQRCYVYQTAKG 9
           +V+RC      KG
Sbjct: 105 LVKRCPACLFGKG 117


>gb|EOY01158.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 786

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 33/73 (45%), Positives = 48/73 (65%)
 Frame = -2

Query: 227 IHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVNK 48
           +H+ +  KGNQLCIP  SLRE++IR+          GRDKT+ +V +R+YWP +R+DV +
Sbjct: 450 LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVER 509

Query: 47  IVQRCYVYQTAKG 9
           +V+RC      KG
Sbjct: 510 LVKRCPACLFGKG 522


>gb|EOX94045.1| DNA/RNA polymerases superfamily protein, partial [Theobroma cacao]
          Length = 624

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 33/73 (45%), Positives = 48/73 (65%)
 Frame = -2

Query: 227 IHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVNK 48
           +H+ +  KGNQLCIP  SLRE++IR+          GRDKT+ +V +R+YWP +R+DV +
Sbjct: 450 LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVER 509

Query: 47  IVQRCYVYQTAKG 9
           +V+RC      KG
Sbjct: 510 LVKRCPACLFGKG 522


>gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao]
          Length = 1392

 Score = 75.5 bits (184), Expect = 7e-12
 Identities = 33/73 (45%), Positives = 48/73 (65%)
 Frame = -2

Query: 227  IHDGFFMKGNQLCIPVSSLREKVIRDXXXXXXXXXXGRDKTIVVVMERFYWPHLRKDVNK 48
            +H+ +  KGNQLCIP  SLRE++IR+          GRDKT+ +V +R+YWP +R+DV +
Sbjct: 938  LHEDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRQDVER 997

Query: 47   IVQRCYVYQTAKG 9
            +V+RC      KG
Sbjct: 998  LVKRCPTCLFGKG 1010
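

Note: the per-hit statistics above (Score, Expect, Identities, Positives, Frame) can be pulled out of a report like this one with a few regular expressions. The following is a minimal sketch, not an official parser. The helper names (parse_expect, floor_percent, parse_hsp_stats) and the SAMPLE string are illustrative assumptions, and the patterns are written only against the line layout seen in this report. The percentages printed here appear to be truncated rather than rounded (45/81 is shown as 55%), which floor_percent reproduces.

```python
import re

# Patterns for the statistics lines that open each alignment in this report.
# They are assumptions based on the layout shown above, not a full grammar.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*"
    r"Expect(?:\(\d+\))?\s*=\s*([0-9eE.+-]+)"
)
IDENT_RE = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(\d+)/(\d+)\s*\(\d+%\)"
)
FRAME_RE = re.compile(r"Frame\s*=\s*([+-]?\d)")


def parse_expect(text):
    """Convert an E-value string to float; legacy output may print 'e-100'."""
    if text.startswith(("e", "E")):
        text = "1" + text
    return float(text)


def floor_percent(matches, length):
    """Integer percentage, truncated the way this report prints it."""
    return matches * 100 // length


def parse_hsp_stats(report_text):
    """Return one dict per alignment block found in report_text."""
    hsps = []
    subject = None
    current = None
    for line in report_text.splitlines():
        if line.startswith(">"):
            # Definition line of the subject sequence.
            subject = line[1:].strip()
            continue
        m = SCORE_RE.search(line)
        if m:
            current = {
                "subject": subject,
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "expect": parse_expect(m.group(3)),
            }
            hsps.append(current)
            continue
        if current is None:
            continue
        m = IDENT_RE.search(line)
        if m:
            current["identities"] = (int(m.group(1)), int(m.group(2)))
            current["positives"] = (int(m.group(3)), int(m.group(4)))
            continue
        m = FRAME_RE.search(line)
        if m:
            current["frame"] = int(m.group(1))
    return hsps


# First alignment header from the report above, used as a small demo input.
SAMPLE = """\
>gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica]
          Length = 1485

 Score = 98.6 bits (244), Expect = 8e-19
 Identities = 45/81 (55%), Positives = 59/81 (72%)
 Frame = -2
"""

for hsp in parse_hsp_stats(SAMPLE):
    print(hsp["subject"], hsp["bits"], hsp["expect"], hsp["frame"])
```

For anything beyond quick inspection, a maintained parser for BLAST output is likely the safer choice, since other BLAST versions lay these lines out differently.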