BLASTX nr result
ID: Catharanthus22_contig00038927
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00038927 (474 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002520529.1| pentatricopeptide repeat-containing protein,... 144 2e-32 gb|EMJ09415.1| hypothetical protein PRUPE_ppa018916mg, partial [... 138 9e-31 ref|XP_004240613.1| PREDICTED: pentatricopeptide repeat-containi... 137 1e-30 ref|XP_002314162.2| hypothetical protein POPTR_0009s04000g, part... 136 2e-30 ref|XP_003632146.1| PREDICTED: pentatricopeptide repeat-containi... 136 2e-30 emb|CBI16090.3| unnamed protein product [Vitis vinifera] 136 2e-30 ref|XP_004289402.1| PREDICTED: pentatricopeptide repeat-containi... 130 1e-28 gb|EOY33964.1| Pentatricopeptide repeat superfamily protein [The... 128 7e-28 ref|XP_004160205.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope... 122 5e-26 ref|XP_004143208.1| PREDICTED: pentatricopeptide repeat-containi... 122 5e-26 gb|ABK26521.1| unknown [Picea sitchensis] 109 3e-22 ref|XP_002280360.1| PREDICTED: pentatricopeptide repeat-containi... 108 6e-22 gb|EOX91779.1| Pentatricopeptide repeat (PPR) superfamily protei... 104 1e-20 ref|XP_002302000.2| pentatricopeptide repeat-containing family p... 103 2e-20 ref|XP_006846299.1| hypothetical protein AMTR_s00012p00252800 [A... 103 2e-20 gb|EPS72239.1| hypothetical protein M569_02517, partial [Genlise... 103 3e-20 ref|XP_006646734.1| PREDICTED: pentatricopeptide repeat-containi... 102 7e-20 gb|EOY05619.1| Pentatricopeptide repeat (PPR) superfamily protei... 102 7e-20 ref|XP_006342194.1| PREDICTED: pentatricopeptide repeat-containi... 101 9e-20 gb|EMJ17597.1| hypothetical protein PRUPE_ppa022530mg [Prunus pe... 101 1e-19 >ref|XP_002520529.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540371|gb|EEF41942.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 606 Score = 144 bits (362), Expect = 2e-32 Identities = 68/93 (73%), Positives = 82/93 (88%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAEA IN MP+ PGP+V+KALLSA VHGN+EIAV+SAR+L+EL P+DPATYI+L+ Sbjct: 514 GYLSEAEAIINCMPMDPGPSVYKALLSACLVHGNREIAVRSARKLLELWPDDPATYILLS 573 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N+LATEG W+DAA+VRKLMCDRGVRK PG SW+ Sbjct: 574 NMLATEGYWDDAADVRKLMCDRGVRKNPGYSWI 606 >gb|EMJ09415.1| hypothetical protein PRUPE_ppa018916mg, partial [Prunus persica] Length = 562 Score = 138 bits (347), Expect = 9e-31 Identities = 64/93 (68%), Positives = 81/93 (87%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EA+AF+++MPI+PGP+V+KALLSA +VHGNKEIA++SA++L EL PNDPATYI+L+ Sbjct: 470 GNLHEAQAFVDSMPIEPGPSVYKALLSACKVHGNKEIALRSAKKLQELWPNDPATYILLS 529 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 NVL T G W+DAA VRKLM DRG+RK PG SW+ Sbjct: 530 NVLVTGGCWDDAAGVRKLMYDRGIRKTPGHSWI 562 >ref|XP_004240613.1| PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like [Solanum lycopersicum] Length = 536 Score = 137 bits (345), Expect = 1e-30 Identities = 65/93 (69%), Positives = 78/93 (83%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE FINNMPI+PGP+V+K+LL+A Q+HGNK +AV SA++LVEL PNDPATY++LA Sbjct: 444 GHLHEAEDFINNMPIEPGPSVYKSLLNACQLHGNKGLAVISAKKLVELRPNDPATYVLLA 503 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 NVLA EGNW DA RKLM DRG+ KKPG SW+ Sbjct: 504 NVLALEGNWKDAEGQRKLMLDRGLSKKPGYSWL 536 >ref|XP_002314162.2| hypothetical protein POPTR_0009s04000g, partial [Populus trichocarpa] gi|550330984|gb|EEE88117.2| hypothetical protein POPTR_0009s04000g, partial [Populus trichocarpa] Length = 606 Score = 136 bits (343), Expect = 2e-30 Identities = 62/89 (69%), Positives = 80/89 (89%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAEAFIN+MPI P P+V+KALLSA+ VHGN+EIA +SA++L+EL PNDPATY++L+ Sbjct: 510 GYLNEAEAFINSMPIVPAPSVYKALLSASLVHGNREIAARSAKKLLELWPNDPATYVLLS 569 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPG 268 +VL +GNW+DAA++RKLMCDRG+RKKPG Sbjct: 570 SVLTVDGNWDDAADLRKLMCDRGLRKKPG 598 >ref|XP_003632146.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like [Vitis vinifera] Length = 628 Score = 136 bits (343), Expect = 2e-30 Identities = 60/93 (64%), Positives = 79/93 (84%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE FIN MPI+PGP+V+KALLSA QVHGN EIAV+ A++L+++CPNDP Y++L+ Sbjct: 536 GYLSEAEDFINTMPIEPGPSVYKALLSACQVHGNVEIAVRCAKKLLQMCPNDPVIYVLLS 595 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 NV AT G W++ A++RK+MCDRGVRK+PG SW+ Sbjct: 596 NVQATVGYWDNVASIRKVMCDRGVRKEPGYSWI 628 >emb|CBI16090.3| unnamed protein product [Vitis vinifera] Length = 458 Score = 136 bits (343), Expect = 2e-30 Identities = 60/93 (64%), Positives = 79/93 (84%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE FIN MPI+PGP+V+KALLSA QVHGN EIAV+ A++L+++CPNDP Y++L+ Sbjct: 366 GYLSEAEDFINTMPIEPGPSVYKALLSACQVHGNVEIAVRCAKKLLQMCPNDPVIYVLLS 425 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 NV AT G W++ A++RK+MCDRGVRK+PG SW+ Sbjct: 426 NVQATVGYWDNVASIRKVMCDRGVRKEPGYSWI 458 >ref|XP_004289402.1| PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Fragaria vesca subsp. vesca] Length = 558 Score = 130 bits (328), Expect = 1e-28 Identities = 60/93 (64%), Positives = 76/93 (81%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAEAF+ +MPI+PG +V++ALLSA+QVHGNKE+A +SA L +LCPND TYI+L+ Sbjct: 466 GNLHEAEAFVGSMPIEPGASVYRALLSASQVHGNKELAFRSATTLQQLCPNDHGTYILLS 525 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 NVL T G+W+DAA VRK M DRG+RK PG SW+ Sbjct: 526 NVLLTRGSWDDAAGVRKFMYDRGIRKTPGYSWI 558 >gb|EOY33964.1| Pentatricopeptide repeat superfamily protein [Theobroma cacao] Length = 623 Score = 128 bits (322), Expect = 7e-28 Identities = 61/91 (67%), Positives = 75/91 (82%) Frame = +2 Query: 8 LQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLANV 187 L EAEAFIN+MPI+PGP+V+KALLSA +V+GN EIA +SA RL+EL PNDPATY++L+ V Sbjct: 533 LNEAEAFINSMPIEPGPSVYKALLSACEVYGNIEIATRSANRLLELWPNDPATYVLLSKV 592 Query: 188 LATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 L +W+DAA V KLM DRGVRK PG SW+ Sbjct: 593 LKMGNDWDDAAGVCKLMSDRGVRKNPGCSWI 623 >ref|XP_004160205.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g11290-like [Cucumis sativus] Length = 616 Score = 122 bits (306), Expect = 5e-26 Identities = 54/93 (58%), Positives = 79/93 (84%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAEAFI ++PI+PG +++KALLSA +HGNK+IA+++A++L+EL P DPATYI+L+ Sbjct: 524 GKLYEAEAFIESIPIEPGISIYKALLSACLIHGNKDIAIRTAKKLLELYPYDPATYIMLS 583 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N L +G W+DAA++R+LM +RGV+K+PG SW+ Sbjct: 584 NALGRDGYWDDAASIRRLMSNRGVKKEPGFSWM 616 >ref|XP_004143208.1| PREDICTED: pentatricopeptide repeat-containing protein At1g11290-like [Cucumis sativus] Length = 616 Score = 122 bits (306), Expect = 5e-26 Identities = 54/93 (58%), Positives = 79/93 (84%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAEAFI ++PI+PG +++KALLSA +HGNK+IA+++A++L+EL P DPATYI+L+ Sbjct: 524 GKLYEAEAFIESIPIEPGISIYKALLSACLIHGNKDIAIRTAKKLLELYPYDPATYIMLS 583 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N L +G W+DAA++R+LM +RGV+K+PG SW+ Sbjct: 584 NALGRDGYWDDAASIRRLMSNRGVKKEPGFSWM 616 >gb|ABK26521.1| unknown [Picea sitchensis] Length = 370 Score = 109 bits (273), Expect = 3e-22 Identities = 47/93 (50%), Positives = 68/93 (73%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 GCL EA FIN MP++P +V+ +LL A +VHGN E+A ++ +L+EL P +P TY++L+ Sbjct: 149 GCLDEALNFINQMPVEPNASVWGSLLGACRVHGNIELAERAVEQLIELTPENPGTYVLLS 208 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N+ A G W+DA VRK+M DR V+K+PG SW+ Sbjct: 209 NIYAAAGRWDDAGKVRKMMKDRSVKKEPGCSWI 241 >ref|XP_002280360.1| PREDICTED: pentatricopeptide repeat-containing protein At1g25360 [Vitis vinifera] Length = 799 Score = 108 bits (271), Expect = 6e-22 Identities = 48/105 (45%), Positives = 72/105 (68%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G EA+ I MP++PGP +++ALL+ ++HGN ++ +Q+A RL EL P TY++L+ Sbjct: 578 GKFSEAKDMIETMPVEPGPPIWEALLAGCRIHGNMDLGIQAAERLFELMPQHDGTYVLLS 637 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV*FETRGITKLLD 316 N+ AT G W+D A VRKLM D+GV+K+PG SW+ E + L+D Sbjct: 638 NMYATVGRWDDVAKVRKLMRDKGVKKEPGCSWIEVENKVHVFLVD 682 >gb|EOX91779.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 718 Score = 104 bits (259), Expect = 1e-20 Identities = 50/93 (53%), Positives = 66/93 (70%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE I MP PG + ALLSA ++HGN E+A ++A +L+EL P++ Y++LA Sbjct: 494 GKLSEAERLIETMPFSPGSIGWAALLSACKMHGNIELASRAANQLLELEPSNAVPYVMLA 553 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N+ A+ G W +AA VRKLM DRGVRKKPG SW+ Sbjct: 554 NMYASSGKWEEAATVRKLMRDRGVRKKPGCSWI 586 >ref|XP_002302000.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550344162|gb|EEE81273.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 797 Score = 103 bits (258), Expect = 2e-20 Identities = 47/109 (43%), Positives = 72/109 (66%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G EA+ + +MP +PG +++ALL+ ++HGN ++ +++A RL EL P TY++L+ Sbjct: 576 GKFSEAKEVMESMPFEPGAPIWEALLAGCRIHGNIDLGIEAAERLFELKPQHDGTYVLLS 635 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV*FETRGITKLLDMRNH 328 N+ A G WND A VRKLM DRGV+K+PG SW+ E + + L+ NH Sbjct: 636 NMYAVAGQWNDMAKVRKLMRDRGVKKEPGCSWIEVENKVHSFLVGDANH 684 >ref|XP_006846299.1| hypothetical protein AMTR_s00012p00252800 [Amborella trichopoda] gi|548849069|gb|ERN07974.1| hypothetical protein AMTR_s00012p00252800 [Amborella trichopoda] Length = 627 Score = 103 bits (257), Expect = 2e-20 Identities = 49/93 (52%), Positives = 66/93 (70%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE IN MPIKP ++++ALLSA +VHGN +AV++A L+ELCP+D + Y++L+ Sbjct: 508 GYLNEAEELINKMPIKPEASIYRALLSACRVHGNMGMAVRAAGCLLELCPSDASAYVLLS 567 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 NV +G W D +VR LM GV K+PG SWV Sbjct: 568 NVFGEKGYWADKEHVRMLMGSNGVTKEPGYSWV 600 >gb|EPS72239.1| hypothetical protein M569_02517, partial [Genlisea aurea] Length = 786 Score = 103 bits (256), Expect = 3e-20 Identities = 49/98 (50%), Positives = 67/98 (68%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE I +P +PG +++ALLS ++H N ++AV +A RL EL P + TYI+LA Sbjct: 565 GKLTEAERVIEALPFRPGAPIWEALLSGCKLHRNMDLAVHAAERLFELIPENDGTYILLA 624 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV*FETR 295 N+ AT G W+ A VRKLM DRGV+K+PG SW+ E + Sbjct: 625 NMFATSGRWDQVAAVRKLMRDRGVKKEPGCSWLEVENK 662 >ref|XP_006646734.1| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like [Oryza brachyantha] Length = 569 Score = 102 bits (253), Expect = 7e-20 Identities = 47/92 (51%), Positives = 69/92 (75%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EAE IN+MPIKPG +V++ALLSA Q+HGN EIA+Q ++RL+EL P+D + ++ L+ Sbjct: 470 GYLNEAEYLINSMPIKPGASVYRALLSACQIHGNLEIAIQVSKRLIELNPHDSSVHVQLS 529 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSW 277 N A +G W++AA +R+ M +G+ K+P SW Sbjct: 530 NAFAGDGRWDNAAEIREAMSGKGIVKEP--SW 559 >gb|EOY05619.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao] Length = 788 Score = 102 bits (253), Expect = 7e-20 Identities = 48/109 (44%), Positives = 70/109 (64%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G EA+ + +MP +PG V++ALL+ + HGN ++ +Q+A RL+EL P +Y++L+ Sbjct: 567 GKFLEAKDVLTSMPFEPGAPVWEALLAGCRTHGNVDLGIQAAERLIELMPQHDGSYVLLS 626 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV*FETRGITKLLDMRNH 328 N+ AT G W+D A RKLM DRGV K+PG SWV E + L+D H Sbjct: 627 NMYATAGRWDDVAKTRKLMRDRGVHKEPGCSWVEVENKVHVFLVDDAVH 675 >ref|XP_006342194.1| PREDICTED: pentatricopeptide repeat-containing protein At1g25360-like [Solanum tuberosum] Length = 804 Score = 101 bits (252), Expect = 9e-20 Identities = 47/93 (50%), Positives = 64/93 (68%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G L EA+ I NMP KPG +++ALL+ + H N ++ V++A +L EL P TYI+LA Sbjct: 583 GRLLEAKEVIQNMPYKPGAPIWEALLAGCRTHRNVDLGVEAAEQLFELTPQHDGTYILLA 642 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N A G W+DAA VRKLM D+GV+K+PG SW+ Sbjct: 643 NTFAAAGRWDDAAKVRKLMRDQGVKKEPGCSWI 675 >gb|EMJ17597.1| hypothetical protein PRUPE_ppa022530mg [Prunus persica] Length = 689 Score = 101 bits (251), Expect = 1e-19 Identities = 45/93 (48%), Positives = 64/93 (68%) Frame = +2 Query: 2 GCLQEAEAFINNMPIKPGPTVFKALLSAAQVHGNKEIAVQSARRLVELCPNDPATYIVLA 181 G EA+ I +MP +PG +++ALL+ + HGN ++ +Q+A RL EL P TYI+L+ Sbjct: 468 GEFTEAKGLIESMPFEPGGPIWEALLAGCRTHGNMDLGIQAAERLFELVPQHDGTYILLS 527 Query: 182 NVLATEGNWNDAANVRKLMCDRGVRKKPGSSWV 280 N+ A G W+D A VRKLM DRGV+K+PG SW+ Sbjct: 528 NLYAAIGRWDDVAKVRKLMRDRGVKKEPGCSWI 560