BLASTX nr result

ID: Catharanthus22_contig00048247 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00048247
         (324 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003518524.1| PREDICTED: uncharacterized protein LOC100798...   113   2e-23
ref|XP_006452328.1| hypothetical protein CICLE_v10008166mg [Citr...   108   6e-22
ref|XP_004244123.1| PREDICTED: uncharacterized protein LOC101249...   108   6e-22
ref|XP_006346218.1| PREDICTED: uncharacterized protein LOC102582...   108   7e-22
gb|ESW14274.1| hypothetical protein PHAVU_008G267300g [Phaseolus...   107   2e-21
gb|EOY12373.1| Uncharacterized protein isoform 4 [Theobroma cacao]    105   5e-21
gb|EOY12370.1| Uncharacterized protein isoform 1 [Theobroma cacao]    105   5e-21
ref|XP_006858203.1| hypothetical protein AMTR_s00062p00174310 [A...   105   6e-21
ref|XP_004512818.1| PREDICTED: uncharacterized protein LOC101490...   105   6e-21
ref|XP_006595735.1| PREDICTED: uncharacterized protein LOC100812...   104   1e-20
ref|XP_003617526.1| hypothetical protein MTR_5g092580 [Medicago ...   103   2e-20
ref|XP_004491393.1| PREDICTED: uncharacterized protein LOC101503...   103   2e-20
ref|XP_002278240.2| PREDICTED: uncharacterized protein LOC100255...   100   2e-19
emb|CBI26632.3| unnamed protein product [Vitis vinifera]              100   2e-19
emb|CAN63914.1| hypothetical protein VITISV_004851 [Vitis vinifera]   100   2e-19
dbj|BAB01158.1| unnamed protein product [Arabidopsis thaliana]         97   3e-18
ref|NP_188685.2| uncharacterized protein [Arabidopsis thaliana] ...    97   3e-18
ref|XP_002522945.1| conserved hypothetical protein [Ricinus comm...    97   3e-18
gb|AAM96977.1| unknown protein [Arabidopsis thaliana] gi|2319842...    97   3e-18
ref|XP_002885379.1| predicted protein [Arabidopsis lyrata subsp....    96   4e-18

>ref|XP_003518524.1| PREDICTED: uncharacterized protein LOC100798619 [Glycine max]
          Length = 533

 Score =  113 bits (283), Expect = 2e-23
 Identities = 54/106 (50%), Positives = 75/106 (70%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+HG++++  A ++ +TK S S+   N K  K   S+Q +E WV PR+    PKDA KRR
Sbjct: 412 FSHGKSSRKKATKRNSTKNSVSK--SNNKTNKSNPSNQSSEGWVEPRSCTSLPKDAGKRR 469

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           VQA GQSAG W+T  +GR+VYV ++G+E+TG+ AYRQ R+ESG GF
Sbjct: 470 VQASGQSAGHWFTSPEGRKVYVNKSGEELTGRNAYRQYRKESGAGF 515


>ref|XP_006452328.1| hypothetical protein CICLE_v10008166mg [Citrus clementina]
           gi|568842498|ref|XP_006475183.1| PREDICTED:
           uncharacterized protein LOC102619494 [Citrus sinensis]
           gi|557555554|gb|ESR65568.1| hypothetical protein
           CICLE_v10008166mg [Citrus clementina]
          Length = 477

 Score =  108 bits (271), Expect = 6e-22
 Identities = 52/106 (49%), Positives = 72/106 (67%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F++GE++KP   QK  +K S++R R   K       S  +E WV+P++ +  PKDA KRR
Sbjct: 362 FSNGESSKPKGTQKINSKKSSTRGRNKSKK------SNASEGWVDPKSSSTAPKDAGKRR 415

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           V A  QSAG WYT  +GR+VY+ R+GQE++GQ AYRQ R+E+G GF
Sbjct: 416 VHATTQSAGHWYTSPEGRKVYISRSGQELSGQTAYRQYRKENGAGF 461


>ref|XP_004244123.1| PREDICTED: uncharacterized protein LOC101249283 [Solanum
           lycopersicum]
          Length = 463

 Score =  108 bits (271), Expect = 6e-22
 Identities = 56/107 (52%), Positives = 72/107 (67%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+H   +K  A QK     ++ + RKN+K     E SQG+E WVNP++ AG PKDA +RR
Sbjct: 341 FSHEGGSKRTA-QKSADGTNSRKSRKNVKQPNNVEESQGSERWVNPKSSAGIPKDAGRRR 399

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGFS 322
           VQAVG+SAG WYT  DGR+VYV  NGQE +G+ AY   R+E G GF+
Sbjct: 400 VQAVGKSAGHWYTNGDGRKVYVDNNGQEFSGRSAYICYRKEKG-GFN 445


>ref|XP_006346218.1| PREDICTED: uncharacterized protein LOC102582285 [Solanum tuberosum]
          Length = 463

 Score =  108 bits (270), Expect = 7e-22
 Identities = 57/104 (54%), Positives = 69/104 (66%), Gaps = 1/104 (0%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F H   +K  A QK      + + RKN+K     E SQG+E WVNP++ AG PKDA +RR
Sbjct: 340 FCHEGGSKRTA-QKSADVTDSRKSRKNVKQPNNVEESQGSERWVNPKSSAGIPKDAGRRR 398

Query: 182 VQAVG-QSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESG 310
           VQAVG +SAG WYT  DGR+VYV  NGQE +GQ AYR  R+ESG
Sbjct: 399 VQAVGSKSAGHWYTNGDGRKVYVANNGQEFSGQSAYRCYRKESG 442


>gb|ESW14274.1| hypothetical protein PHAVU_008G267300g [Phaseolus vulgaris]
          Length = 536

 Score =  107 bits (267), Expect = 2e-21
 Identities = 50/106 (47%), Positives = 72/106 (67%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+HGE+++  A ++ +TK S S+ +     +     S  +  WV+PR+ A +PKDA KRR
Sbjct: 413 FSHGESSRKRATKRNSTKNSVSKGKSKANKSIPANQSCASGDWVDPRSFASSPKDAGKRR 472

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           VQA GQSAG WYT  +G++VYV ++GQE+TG+ AY Q R+ESG  F
Sbjct: 473 VQASGQSAGHWYTSPEGKKVYVNKSGQELTGRGAYAQYRKESGKAF 518


>gb|EOY12373.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 452

 Score =  105 bits (263), Expect = 5e-21
 Identities = 52/106 (49%), Positives = 72/106 (67%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F++GE++K    QK   K  +  RRK  KN+K +E++  +E WV+ ++ A  PK+A KRR
Sbjct: 336 FSNGESSKQRGSQKGGGKKCSMSRRKKSKNSKAEETA--SEGWVDLKSSAAIPKNAGKRR 393

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           V A  Q AG WYT  +GR+VYV R+GQE++GQ AYR  R+ESG GF
Sbjct: 394 VHASDQPAGHWYTSPEGRKVYVSRSGQELSGQMAYRHYRKESGAGF 439


>gb|EOY12370.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 453

 Score =  105 bits (263), Expect = 5e-21
 Identities = 52/106 (49%), Positives = 72/106 (67%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F++GE++K    QK   K  +  RRK  KN+K +E++  +E WV+ ++ A  PK+A KRR
Sbjct: 337 FSNGESSKQRGSQKGGGKKCSMSRRKKSKNSKAEETA--SEGWVDLKSSAAIPKNAGKRR 394

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           V A  Q AG WYT  +GR+VYV R+GQE++GQ AYR  R+ESG GF
Sbjct: 395 VHASDQPAGHWYTSPEGRKVYVSRSGQELSGQMAYRHYRKESGAGF 440


>ref|XP_006858203.1| hypothetical protein AMTR_s00062p00174310 [Amborella trichopoda]
           gi|548862306|gb|ERN19670.1| hypothetical protein
           AMTR_s00062p00174310 [Amborella trichopoda]
          Length = 540

 Score =  105 bits (262), Expect = 6e-21
 Identities = 49/103 (47%), Positives = 67/103 (65%)
 Frame = +2

Query: 11  GEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRRVQA 190
           G   +P +   E  + S+ ++RK     K + + Q ++ WVNP++    PKDA KRRV A
Sbjct: 408 GGQNQPRSTLNEGNEGSSKKKRKTQSKGKAKRAPQTSDGWVNPKSEVNPPKDAGKRRVSA 467

Query: 191 VGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
            G S+G WYTG DGR+VYV +NGQE+TGQ AYR  R+ESG G+
Sbjct: 468 DGVSSGHWYTGEDGRKVYVTKNGQELTGQTAYRHYRKESGMGY 510


>ref|XP_004512818.1| PREDICTED: uncharacterized protein LOC101490882 [Cicer arietinum]
          Length = 492

 Score =  105 bits (262), Expect = 6e-21
 Identities = 54/102 (52%), Positives = 70/102 (68%), Gaps = 1/102 (0%)
 Frame = +2

Query: 14  EATKPPAKQKETTKPSTSRRRKNIKNTKVQ-ESSQGAESWVNPRTGAGNPKDASKRRVQA 190
           E     ++ K T + ST R     KN K +  SS  + +WV P++ +  PKDAS+RRVQA
Sbjct: 377 EIISSSSRNKTTKRNSTKRSVSKSKNEKSKLNSSNVSGNWVEPKSCSSMPKDASERRVQA 436

Query: 191 VGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTG 316
             QSAG WYTGSDGR+VYV ++GQE+TG+ AYRQ R+ESGTG
Sbjct: 437 SSQSAGHWYTGSDGRKVYVSKSGQELTGRNAYRQYRKESGTG 478


>ref|XP_006595735.1| PREDICTED: uncharacterized protein LOC100812954 [Glycine max]
          Length = 603

 Score =  104 bits (260), Expect = 1e-20
 Identities = 48/106 (45%), Positives = 69/106 (65%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+HGE+++  A ++  +K S S+       +     S  +  WV PR+    PKDA KRR
Sbjct: 480 FSHGESSRKKATKRNGSKNSVSKSNSKANKSNPANQSCASGGWVEPRSCTSLPKDAGKRR 539

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           VQA G+SAG W+T  +GR+VYV ++G+E+TG+ AYRQ R+ESG GF
Sbjct: 540 VQASGESAGHWFTSPEGRKVYVNKSGEELTGRNAYRQYRKESGAGF 585


>ref|XP_003617526.1| hypothetical protein MTR_5g092580 [Medicago truncatula]
           gi|355518861|gb|AET00485.1| hypothetical protein
           MTR_5g092580 [Medicago truncatula]
          Length = 587

 Score =  103 bits (258), Expect = 2e-20
 Identities = 52/98 (53%), Positives = 70/98 (71%), Gaps = 2/98 (2%)
 Frame = +2

Query: 32  AKQKETTKPSTSRRR-KNIKNTKVQ-ESSQGAESWVNPRTGAGNPKDASKRRVQAVGQSA 205
           + +K TTK + ++R     KN + +   S  + +WV P++ AG PKDA KRRVQA  QSA
Sbjct: 472 SSRKNTTKRNNTKRSVSKAKNAQCKLNPSNVSGNWVEPKSRAGMPKDAGKRRVQASSQSA 531

Query: 206 GRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           G WYTGSDGR+VYV ++GQE+TG+ AY+  R+ESGTGF
Sbjct: 532 GHWYTGSDGRKVYVSKSGQELTGRNAYKHYRKESGTGF 569


>ref|XP_004491393.1| PREDICTED: uncharacterized protein LOC101503265 [Cicer arietinum]
          Length = 501

 Score =  103 bits (257), Expect = 2e-20
 Identities = 51/106 (48%), Positives = 73/106 (68%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+ G +++  A ++ +TK S S+ +     +K+  S+    +WV P++    P+DA KRR
Sbjct: 386 FSSGTSSRKKATKRSSTKSSVSKSKNG--QSKLNPSNVSG-NWVEPKSCTSMPRDAGKRR 442

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           VQA  QSAG WYTGSDGR+VYV ++GQE+TG+ AYR  R+ESGTGF
Sbjct: 443 VQASSQSAGHWYTGSDGRKVYVNKSGQELTGRNAYRNYRKESGTGF 488


>ref|XP_002278240.2| PREDICTED: uncharacterized protein LOC100255618 [Vitis vinifera]
          Length = 470

 Score =  100 bits (250), Expect = 2e-19
 Identities = 48/103 (46%), Positives = 66/103 (64%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+HGEA+K    Q  +   ST + RKN +     E+   + SWVNP++ A  PK A K +
Sbjct: 346 FSHGEASKKQVNQDVSIGRSTMQARKNARKFNADEALNASGSWVNPKSCASIPKKAGKGQ 405

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESG 310
           V A GQSA RWYT  DGR+VYV ++GQE+TG  AYR  ++++G
Sbjct: 406 VHANGQSASRWYTSPDGRKVYVTKSGQELTGSMAYRHYKKDNG 448


>emb|CBI26632.3| unnamed protein product [Vitis vinifera]
          Length = 421

 Score =  100 bits (250), Expect = 2e-19
 Identities = 48/103 (46%), Positives = 66/103 (64%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+HGEA+K    Q  +   ST + RKN +     E+   + SWVNP++ A  PK A K +
Sbjct: 297 FSHGEASKKQVNQDVSIGRSTMQARKNARKFNADEALNASGSWVNPKSCASIPKKAGKGQ 356

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESG 310
           V A GQSA RWYT  DGR+VYV ++GQE+TG  AYR  ++++G
Sbjct: 357 VHANGQSASRWYTSPDGRKVYVTKSGQELTGSMAYRHYKKDNG 399


>emb|CAN63914.1| hypothetical protein VITISV_004851 [Vitis vinifera]
          Length = 510

 Score =  100 bits (250), Expect = 2e-19
 Identities = 48/103 (46%), Positives = 66/103 (64%)
 Frame = +2

Query: 2   FNHGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRR 181
           F+HGEA+K    Q  +   ST + RKN +     E+   + SWVNP++ A  PK A K +
Sbjct: 386 FSHGEASKKQVNQDVSIGRSTMQARKNARKFNADEALNASGSWVNPKSCASIPKKAGKGQ 445

Query: 182 VQAVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESG 310
           V A GQSA RWYT  DGR+VYV ++GQE+TG  AYR  ++++G
Sbjct: 446 VHANGQSASRWYTSPDGRKVYVTKSGQELTGSMAYRHYKKDNG 488


>dbj|BAB01158.1| unnamed protein product [Arabidopsis thaliana]
          Length = 486

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 45/100 (45%), Positives = 66/100 (66%)
 Frame = +2

Query: 20  TKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRRVQAVGQ 199
           +K  +K  +++  S  R +   K +K QES+  +E W+NP+T A  PKDA KRRV A   
Sbjct: 372 SKGSSKAGDSSSKSCRRGKTKSKVSKSQESAHNSEGWLNPKTRAAAPKDAGKRRVSADSG 431

Query: 200 SAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           SAG W+T  +GR+VY+ ++GQE +GQ AYR  ++E+G GF
Sbjct: 432 SAGHWFTSPEGRKVYISKSGQEFSGQSAYRCYKKENGVGF 471


>ref|NP_188685.2| uncharacterized protein [Arabidopsis thaliana]
           gi|332642866|gb|AEE76387.1| uncharacterized protein
           AT3G20490 [Arabidopsis thaliana]
          Length = 458

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 45/100 (45%), Positives = 66/100 (66%)
 Frame = +2

Query: 20  TKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRRVQAVGQ 199
           +K  +K  +++  S  R +   K +K QES+  +E W+NP+T A  PKDA KRRV A   
Sbjct: 344 SKGSSKAGDSSSKSCRRGKTKSKVSKSQESAHNSEGWLNPKTRAAAPKDAGKRRVSADSG 403

Query: 200 SAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           SAG W+T  +GR+VY+ ++GQE +GQ AYR  ++E+G GF
Sbjct: 404 SAGHWFTSPEGRKVYISKSGQEFSGQSAYRCYKKENGVGF 443


>ref|XP_002522945.1| conserved hypothetical protein [Ricinus communis]
           gi|223537757|gb|EEF39375.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 477

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 48/100 (48%), Positives = 66/100 (66%)
 Frame = +2

Query: 8   HGEATKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRRVQ 187
           +GEA+K     +   K ST  R K+ K + V+E+   ++ W++P+  A  PKDA KRRV 
Sbjct: 354 NGEASKKGGTCRNNNKDSTRGRSKS-KKSIVKEALPASQVWIDPKRSASIPKDAGKRRVH 412

Query: 188 AVGQSAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRES 307
           A GQ+AG WYT  +GR+VYV R+GQE+TGQ AYR  R+ S
Sbjct: 413 ANGQAAGHWYTSPEGRKVYVSRSGQELTGQMAYRHYRKAS 452


>gb|AAM96977.1| unknown protein [Arabidopsis thaliana] gi|23198428|gb|AAN15741.1|
           unknown protein [Arabidopsis thaliana]
          Length = 458

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 45/100 (45%), Positives = 66/100 (66%)
 Frame = +2

Query: 20  TKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRRVQAVGQ 199
           +K  +K  +++  S  R +   K +K QES+  +E W+NP+T A  PKDA KRRV A   
Sbjct: 344 SKGSSKAGDSSSKSCRRGKTKSKVSKSQESAHNSEGWLNPKTRAAAPKDAGKRRVSADSG 403

Query: 200 SAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           SAG W+T  +GR+VY+ ++GQE +GQ AYR  ++E+G GF
Sbjct: 404 SAGHWFTSPEGRKVYISKSGQEFSGQSAYRCYKKENGVGF 443


>ref|XP_002885379.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297331219|gb|EFH61638.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 482

 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 46/100 (46%), Positives = 66/100 (66%)
 Frame = +2

Query: 20  TKPPAKQKETTKPSTSRRRKNIKNTKVQESSQGAESWVNPRTGAGNPKDASKRRVQAVGQ 199
           +K  +K  +++  S  R +   K +K QES+  +E W+NP+T A  PKDA KRRV A   
Sbjct: 368 SKGSSKTGDSSSKSCRRGQTKSKFSKGQESAHNSEGWLNPKTRAAAPKDAGKRRVSANSG 427

Query: 200 SAGRWYTGSDGRRVYVGRNGQEMTGQFAYRQ*RRESGTGF 319
           SAG W+T  +GR+VY+ ++GQE +GQ AYR  R+E+G GF
Sbjct: 428 SAGHWFTSPEGRKVYISKSGQEFSGQSAYRCYRKENGGGF 467

