BLASTX nr result
ID: Catharanthus22_contig00025657
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00025657 (577 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006578457.1| PREDICTED: uncharacterized protein LOC102662... 112 5e-23 gb|EMJ23111.1| hypothetical protein PRUPE_ppa026797mg [Prunus pe... 111 1e-22 ref|XP_006574998.1| PREDICTED: uncharacterized protein LOC102661... 110 3e-22 gb|ESW25422.1| hypothetical protein PHAVU_003G034500g [Phaseolus... 106 4e-21 ref|XP_004490295.1| PREDICTED: uncharacterized protein LOC101498... 101 2e-19 ref|XP_002315977.2| hypothetical protein POPTR_0010s14360g [Popu... 90 5e-16 gb|EXB44961.1| hypothetical protein L484_026550 [Morus notabilis] 89 1e-15 gb|EOY32333.1| Uncharacterized protein TCM_040140 [Theobroma cacao] 88 1e-15 ref|XP_002514530.1| conserved hypothetical protein [Ricinus comm... 87 4e-15 ref|XP_006484367.1| PREDICTED: uncharacterized protein LOC102609... 83 6e-14 ref|XP_006437785.1| hypothetical protein CICLE_v10033580mg [Citr... 83 6e-14 ref|XP_002312350.2| hypothetical protein POPTR_0008s10900g [Popu... 75 9e-12 gb|EOY01483.1| Uncharacterized protein TCM_011354 [Theobroma cacao] 65 1e-08 >ref|XP_006578457.1| PREDICTED: uncharacterized protein LOC102662789 [Glycine max] Length = 419 Score = 112 bits (281), Expect = 5e-23 Identities = 75/174 (43%), Positives = 94/174 (54%), Gaps = 21/174 (12%) Frame = -3 Query: 575 SEPRKSGFRGVTEA-------ENGTDLKNLKKIGGPMELERT----YSVPNRSIFPVKES 429 +EPR+SGF GV E+G DLK K GG M+ Y NR +F ++ES Sbjct: 245 AEPRRSGFDGVERRDFLFDSYESGNDLKGGVKKGGLMDGGGDGGGFYGGVNRRVFSLRES 304 Query: 428 DFNAMDESAFIDLKLDLSSSESTRPSEFAAAVKINEEG-----SSENGNFLKNNEGN--- 273 DF MDES+FIDLKLD SS EF+AA N G S NGNF+ ++ G Sbjct: 305 DFKGMDESSFIDLKLDYSSESK---HEFSAAKMSNNMGGDTFSSFRNGNFMPHDVGGGSY 361 Query: 272 --FEAKIPLKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDEN 117 L + GSCR+TVNDR +KRG S K W+W RYHS G + K+DE+ Sbjct: 362 GGLVGDGVLTNGGSCRLTVNDRGIKRGRKSMKGWRWIFRYHSNWG-SSRKRDED 414 >gb|EMJ23111.1| hypothetical protein PRUPE_ppa026797mg [Prunus persica] Length = 418 Score = 111 bits (278), Expect = 1e-22 Identities = 75/168 (44%), Positives = 90/168 (53%), Gaps = 15/168 (8%) Frame = -3 Query: 575 SEPRKSGFRG-----VTEAENGTDLKNLKKIGGPMELERTYSVPNRSIFPVKESDFNAMD 411 +EPRKSGF G V +E GTD K KK G ++ +S R +F +KESDF+ MD Sbjct: 250 AEPRKSGFDGEKKDEVLASELGTDFKGAKKNGLMEAADKGFSGATRRVFSLKESDFSGMD 309 Query: 410 ESAFIDLKLDLSSSESTRPSEFAAAVKINEE------GSSENGNFLKNN-EGNFEAKIP- 255 ES FIDLKLD S+ EF+A N GS +F+ N G F I Sbjct: 310 ESGFIDLKLDYSAESK---PEFSAMKMGNITDTDSVFGSMRGSDFVANECGGPFGGLIGD 366 Query: 254 --LKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDEN 117 H GSCRITVNDR +KRG S K WKW R+H G T KKDE+ Sbjct: 367 GIFSHGGSCRITVNDRGIKRGRKSSKGWKWIFRHHPSWG-STRKKDED 413 >ref|XP_006574998.1| PREDICTED: uncharacterized protein LOC102661207 [Glycine max] Length = 416 Score = 110 bits (275), Expect = 3e-22 Identities = 76/177 (42%), Positives = 96/177 (54%), Gaps = 24/177 (13%) Frame = -3 Query: 575 SEPRKSGFRGVTEA---------ENGTDLKN-LKKIGGPMELERT----YSVPNRSIFPV 438 +EPR+SGF GV E+G+DLK +KK GG M+ Y NR +F + Sbjct: 239 AEPRRSGFDGVERRDFLFDNNNYESGSDLKGGVKKGGGLMDGVGDGGGFYGGVNRRVFSL 298 Query: 437 KESDFNAMDESAFIDLKLDLSSSESTRPSEFAAAVKINEEG-----SSENGNFLKNNEGN 273 +ESDF MDES+FIDLKLD SS EF+AA N G S GNF+ ++ G Sbjct: 299 RESDFKGMDESSFIDLKLDYSSESK---HEFSAAKMSNNMGGDTFSSFRGGNFMPHDGGG 355 Query: 272 -----FEAKIPLKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDEN 117 L + GSCRITVNDR +KRG S K W+W RYHS G + K+DE+ Sbjct: 356 GSYGGLVGDGVLTNGGSCRITVNDRGIKRGRKSMKGWRWIFRYHSNWG-SSRKRDED 411 >gb|ESW25422.1| hypothetical protein PHAVU_003G034500g [Phaseolus vulgaris] Length = 395 Score = 106 bits (265), Expect = 4e-21 Identities = 71/168 (42%), Positives = 95/168 (56%), Gaps = 16/168 (9%) Frame = -3 Query: 575 SEPRKSGFRG-----VTEAENGTDLKNLK----KIGGPMELERTYSVPNRSIFPVKESDF 423 +EPR+SGF G + + E+G DL NLK K GG M+ + + NR +F ++ESDF Sbjct: 227 AEPRRSGFDGERRDFLFDYESGNDL-NLKGGGKKSGGLMDGDGGFYGANRRVFSLRESDF 285 Query: 422 NAMDESAFIDLKLDLSSSESTRPSEFAAAVKIN---EEGSSENGNFLKNNEGNFEAKIP- 255 MDES+FIDLKLD S+ EF+ + N S GNF+ ++ G + Sbjct: 286 KGMDESSFIDLKLDYSAESK---HEFSGSKMRNLGDAFSSFRGGNFMAHDGGGSYGGLVG 342 Query: 254 ---LKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDE 120 L + GSCRITVNDR +KRG S K W+W RYHS G + K+DE Sbjct: 343 DGVLTNGGSCRITVNDRGIKRGRKSMKGWRWIFRYHSNWG-SSRKRDE 389 >ref|XP_004490295.1| PREDICTED: uncharacterized protein LOC101498006 [Cicer arietinum] Length = 389 Score = 101 bits (251), Expect = 2e-19 Identities = 69/156 (44%), Positives = 90/156 (57%), Gaps = 3/156 (1%) Frame = -3 Query: 575 SEPRKSGFRGVTEAENGTDLKNLKKIGGPMELERTYSVPNRSIFPVKESDFNAMDESAFI 396 +EPR+S F V E ENG K+ G +Y+V NR +F ++ESDF MDES+FI Sbjct: 245 AEPRRSDF--VFECENGR--KSEVDYG-------SYNVVNRRVFSLRESDFKGMDESSFI 293 Query: 395 DLKLDLSSSESTRPSEFAAAVKINEEGSSENGNFLKNNEGNF---EAKIPLKHSGSCRIT 225 DLKLD SS ES + F A K+ E S G+ GNF +A+ L GSCRIT Sbjct: 294 DLKLDYSS-ESKQHDLFNA--KMGENTLSAFGSI--RGAGNFMPHDAEGVLTSGGSCRIT 348 Query: 224 VNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDEN 117 VNDR +K G + K W+W RYHS G + K+D++ Sbjct: 349 VNDRGVKTGRKNMKGWRWIFRYHSNWGGSSRKRDQD 384 >ref|XP_002315977.2| hypothetical protein POPTR_0010s14360g [Populus trichocarpa] gi|550329796|gb|EEF02148.2| hypothetical protein POPTR_0010s14360g [Populus trichocarpa] Length = 397 Score = 89.7 bits (221), Expect = 5e-16 Identities = 60/163 (36%), Positives = 84/163 (51%), Gaps = 10/163 (6%) Frame = -3 Query: 575 SEPRKSGFRGVTEAENGTDLKNLKKIGGPMELERTYSVPN-RSIFPVKESDFNAMDESAF 399 +EPRKSGF G GT L+ P L+ +S PN R +F +KE +F +D+S F Sbjct: 240 AEPRKSGFDGEKRDATGTALE-------PERLDPGHSGPNTRRVFSLKEGNFTTVDDSGF 292 Query: 398 IDLKLDLSSSESTRPSEFAAAVKINEE----GSSENGNFLKNNE--GNFEAKIP---LKH 246 IDLK D S ES V +++ GS G+ + ++ G+F + + H Sbjct: 293 IDLKFDFPS-ESKADLSSVKMVSLSDSHSALGSLRGGDAVAQDQYGGHFGSLVGDGLFSH 351 Query: 245 SGSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDEN 117 GSCRITV+DR +KR S+K W+W R H KKDE+ Sbjct: 352 GGSCRITVSDRGIKRSRKSFKSWRWIFRQHPS----AKKKDED 390 >gb|EXB44961.1| hypothetical protein L484_026550 [Morus notabilis] Length = 406 Score = 88.6 bits (218), Expect = 1e-15 Identities = 65/165 (39%), Positives = 80/165 (48%), Gaps = 7/165 (4%) Frame = -3 Query: 575 SEPRKSGFRGVTEAENGTDLKNLKKIG--GPMELERTYSVPNRSIFPVKESD-FNAMDES 405 +EPRKSGF G E N T+ + G + + + NR +F +KESD F+ MDES Sbjct: 244 TEPRKSGFDG-GEINNNTNKREYYCYSDVGLVSDQFSNGGGNRRVFSLKESDYFSGMDES 302 Query: 404 AFIDLKLDLSSSESTRPSEFAAAVKINEEGS---SENGNFLKN-NEGNFEAKIPLKHSGS 237 FIDLKLD S+ P + + GS S N F N N GS Sbjct: 303 GFIDLKLDYSAESKAVPDHQFCSSSESAFGSMRASTNSEFFNGGNSYNESENGVFNRGGS 362 Query: 236 CRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDENSGDFI 102 CRITVNDR LK+G S K WKW R H G TS + GD + Sbjct: 363 CRITVNDRGLKKGRKSLKGWKWIFRQHLG----TSHGVKKDGDLM 403 >gb|EOY32333.1| Uncharacterized protein TCM_040140 [Theobroma cacao] Length = 399 Score = 88.2 bits (217), Expect = 1e-15 Identities = 60/166 (36%), Positives = 88/166 (53%), Gaps = 13/166 (7%) Frame = -3 Query: 575 SEPRKSGF----RGVTEAENG-TDLKNLKKIGGPMELERTYSVPNRSIFPVKESDFNAMD 411 +EPRKSGF R T E+ D+K ++K G M+L+ +S NR +F +KES F D Sbjct: 230 AEPRKSGFDSERRDSTFMESDIADIKAVRKGGVLMDLDGGFSSVNRRVFSLKESYFTGGD 289 Query: 410 ESAFIDLKLDL-SSSESTRPSEFAAAVKINEE--GSSENGNFLKNNEGN-----FEAKIP 255 +S FIDLK D S S+ P+ + + GS+ G+F+ + G Sbjct: 290 DSGFIDLKFDFQSESKGEIPALKKGGLSDSHSAFGSTRGGDFVPHESGGSIRNALVGDGA 349 Query: 254 LKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDEN 117 + GSCRITVN+R +K+ S+K W+W ++H T KKDE+ Sbjct: 350 FCNGGSCRITVNERGIKKSRKSFKGWRWIFKHHPNWS-GTRKKDED 394 >ref|XP_002514530.1| conserved hypothetical protein [Ricinus communis] gi|223546134|gb|EEF47636.1| conserved hypothetical protein [Ricinus communis] Length = 408 Score = 86.7 bits (213), Expect = 4e-15 Identities = 57/169 (33%), Positives = 85/169 (50%), Gaps = 12/169 (7%) Frame = -3 Query: 566 RKSGFRGVTEAENGTDLKNLKKIGGPMELERTYSVPN-RSIFPVKESDFNAMDESAFIDL 390 RKSGF ++G D + + P +L+ Y N R +F +KE +F +D+S FIDL Sbjct: 246 RKSGFSEAEPRKSGYDGEKKDTVLDPEKLDSCYGTNNPRRVFSLKEGNFTTIDDSGFIDL 305 Query: 389 KLDLSSSESTRPSEF--------AAAVKINEEGSSENGNFLKNN---EGNFEAKIPLKHS 243 K D SS+S + A + I+ GS G+FL + +G F + Sbjct: 306 KFDFPSSDSKSELPYIVKMGGGGALSDSISAFGSMRGGDFLAQDHYSDGLFS------NG 359 Query: 242 GSCRITVNDRALKRGTNSYKIWKWFSRYHSGGGRRTSKKDENSGDFIKS 96 GSCRITV+DR +KR S+K W+W R + + KK+E+ D + S Sbjct: 360 GSCRITVSDRGIKRSRKSFKSWRWIFRPNHPNAK---KKEEDQEDIVVS 405 >ref|XP_006484367.1| PREDICTED: uncharacterized protein LOC102609778 [Citrus sinensis] Length = 409 Score = 82.8 bits (203), Expect = 6e-14 Identities = 55/169 (32%), Positives = 81/169 (47%), Gaps = 17/169 (10%) Frame = -3 Query: 575 SEPRKSGF-------RGVTEAENGTDLKNLKKIGGPMELERTYSVPNRSIFPVKESDFNA 417 +EPRKSGF V E+E G+D + NR +F +KES F Sbjct: 251 AEPRKSGFDSDQKRDSSVVESEFGSDFNR--------------TAANRRVFSLKESYFTG 296 Query: 416 MDESAFIDLKLDLSSSESTRP--------SEFAAAVKINEEGSSENGNFLKNNEGNFEAK 261 ++S FIDLK DLSS S+ +A G G+FL ++ ++ Sbjct: 297 GEDSGFIDLKFDLSSESKLDHHSVKMGVLSDSNSAFGSMRGGGGGGGDFLASDH-QCGSR 355 Query: 260 IPLKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSG--GGRRTSKKDE 120 + L H GSCRITVN+R +K+ S+K W+W ++H GR+ +D+ Sbjct: 356 MLLSHGGSCRITVNERGIKKSRKSFKGWRWIFKHHPNWISGRKKDDEDQ 404 >ref|XP_006437785.1| hypothetical protein CICLE_v10033580mg [Citrus clementina] gi|557539981|gb|ESR51025.1| hypothetical protein CICLE_v10033580mg [Citrus clementina] Length = 495 Score = 82.8 bits (203), Expect = 6e-14 Identities = 55/169 (32%), Positives = 81/169 (47%), Gaps = 17/169 (10%) Frame = -3 Query: 575 SEPRKSGF-------RGVTEAENGTDLKNLKKIGGPMELERTYSVPNRSIFPVKESDFNA 417 +EPRKSGF V E+E G+D + NR +F +KES F Sbjct: 337 AEPRKSGFDSDQKRDSSVVESEFGSDFNR--------------TAANRRVFSLKESYFTG 382 Query: 416 MDESAFIDLKLDLSSSESTRP--------SEFAAAVKINEEGSSENGNFLKNNEGNFEAK 261 ++S FIDLK DLSS S+ +A G G+FL ++ ++ Sbjct: 383 GEDSGFIDLKFDLSSESKLDHHSVKMGVLSDSNSAFGSMRGGGGGGGDFLASDH-QCGSR 441 Query: 260 IPLKHSGSCRITVNDRALKRGTNSYKIWKWFSRYHSG--GGRRTSKKDE 120 + L H GSCRITVN+R +K+ S+K W+W ++H GR+ +D+ Sbjct: 442 MLLSHGGSCRITVNERGIKKSRKSFKGWRWIFKHHPNWISGRKKDDEDQ 490 >ref|XP_002312350.2| hypothetical protein POPTR_0008s10900g [Populus trichocarpa] gi|550332815|gb|EEE89717.2| hypothetical protein POPTR_0008s10900g [Populus trichocarpa] Length = 387 Score = 75.5 bits (184), Expect = 9e-12 Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 7/147 (4%) Frame = -3 Query: 575 SEPRKSGFRGVTEAENGTDLKNLKKIGGPMELERTYSVPNRSIFPVKESDFNAMDESAFI 396 +EPRKSGF G T L++ + G R +F +KE +F +D+S FI Sbjct: 240 AEPRKSGFDGEKRDTTSTALESERLDSGHD------GANTRRVFSLKEGNFTTVDDSGFI 293 Query: 395 DLKLDL---SSSESTRPSEFAAAVKINEEGSSENGNFLKNNE----GNFEAKIPLKHSGS 237 DLK D S ++ + +++ + GS G+ + ++ G+ P + S Sbjct: 294 DLKFDFPPESKADLSAVKMVSSSDSNSAFGSMRGGDVVAQDQYGGFGSLMGDGPCSNGSS 353 Query: 236 CRITVNDRALKRGTNSYKIWKWFSRYH 156 CRITV+DR +KR S+K W+W R H Sbjct: 354 CRITVSDRGIKRSRKSFKSWRWIFRQH 380 >gb|EOY01483.1| Uncharacterized protein TCM_011354 [Theobroma cacao] Length = 386 Score = 65.1 bits (157), Expect = 1e-08 Identities = 47/148 (31%), Positives = 77/148 (52%), Gaps = 16/148 (10%) Frame = -3 Query: 566 RKSGF----RGVTEAE-NGTDLKNLKKI---GGPMELERTYSVPNRSIFPVKESDFNAMD 411 RKSGF R T E + D++ ++K GG ++++ +S NR +F + ES F D Sbjct: 227 RKSGFDCERRDSTFKEFDIADIRGIRKSVGGGGLLDVDGGFSGANRRVFSLNESYFTGGD 286 Query: 410 ESAFIDLKLDLSSSE-----STRPSEFAAAVKINEEG---SSENGNFLKNNEGNFEAKIP 255 +S FIDLK D S + + F+A + + G + ++G + N EA Sbjct: 287 DSGFIDLKFDFQSESKGDVPAVKKGVFSAFGSMRDGGDFMTHKSGRSIDNALVGDEA--- 343 Query: 254 LKHSGSCRITVNDRALKRGTNSYKIWKW 171 + GSCR+TV++R +K+ S+K W+W Sbjct: 344 FCNGGSCRMTVDERGIKKSRRSFKGWRW 371