BLASTX nr result
ID: Catharanthus22_contig00013377
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00013377 (2100 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006352332.1| PREDICTED: pentatricopeptide repeat-containi... 119 5e-24 gb|EMJ05969.1| hypothetical protein PRUPE_ppa015604mg [Prunus pe... 101 1e-18 ref|XP_006383060.1| pentatricopeptide repeat-containing family p... 98 2e-17 ref|XP_002327644.1| predicted protein [Populus trichocarpa] 98 2e-17 emb|CAN73672.1| hypothetical protein VITISV_031859 [Vitis vinifera] 97 4e-17 ref|XP_006425390.1| hypothetical protein CICLE_v10027592mg [Citr... 94 2e-16 gb|EXC35313.1| hypothetical protein L484_026636 [Morus notabilis] 93 5e-16 gb|ESW29877.1| hypothetical protein PHAVU_002G106000g [Phaseolus... 91 2e-15 ref|XP_003516541.1| PREDICTED: pentatricopeptide repeat-containi... 91 3e-15 ref|XP_003612228.1| Pentatricopeptide repeat-containing protein ... 90 3e-15 ref|XP_004158900.1| PREDICTED: pentatricopeptide repeat-containi... 88 2e-14 ref|XP_004135020.1| PREDICTED: pentatricopeptide repeat-containi... 88 2e-14 ref|XP_004250558.1| PREDICTED: pentatricopeptide repeat-containi... 87 3e-14 ref|XP_004512166.1| PREDICTED: pentatricopeptide repeat-containi... 86 8e-14 ref|XP_006857380.1| hypothetical protein AMTR_s00067p00130250 [A... 79 1e-11 ref|XP_006403930.1| hypothetical protein EUTSA_v10010283mg [Eutr... 76 7e-11 ref|NP_190700.2| pentatricopeptide repeat-containing protein [Ar... 76 7e-11 emb|CAB62654.1| putative protein [Arabidopsis thaliana] 76 7e-11 ref|XP_002877796.1| binding protein [Arabidopsis lyrata subsp. l... 75 1e-10 ref|XP_002267596.1| PREDICTED: pentatricopeptide repeat-containi... 72 7e-10 >ref|XP_006352332.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like isoform X1 [Solanum tuberosum] gi|565371484|ref|XP_006352333.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like isoform X2 [Solanum tuberosum] Length = 534 Score = 119 bits (298), Expect = 5e-24 Identities = 53/95 (55%), Positives = 72/95 (75%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 +F+CI FP F VNT+IKA +CSS+P A++ Y +R K+GF PNSF+FP L+SA ++ G Sbjct: 87 VFKCIHFPDTFSVNTVIKAYACSSLPDNAVVFYFQRLKNGFLPNSFTFPPLMSACARRGR 146 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVGF 1530 L+ G++CHGQ KN D VL VQNS++HFY+C GF Sbjct: 147 LDSGQKCHGQVVKNGVDGVLQVQNSLVHFYSCCGF 181 >gb|EMJ05969.1| hypothetical protein PRUPE_ppa015604mg [Prunus persica] Length = 568 Score = 101 bits (251), Expect = 1e-18 Identities = 45/90 (50%), Positives = 63/90 (70%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFRCI P FCVNT+IKA S SS+P A+++Y E ++GF P S++F L+ + +KMG Sbjct: 101 IFRCIDLPGTFCVNTVIKAYSVSSMPDQALVVYFEWLRNGFAPTSYTFVPLIGSCAKMGS 160 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFY 1545 + GR+CHGQ K+ D +L VQNS++H Y Sbjct: 161 VESGRKCHGQVVKHGLDSLLQVQNSLIHMY 190 >ref|XP_006383060.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550338637|gb|ERP60857.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 564 Score = 97.8 bits (242), Expect = 2e-17 Identities = 48/100 (48%), Positives = 65/100 (65%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IF+ I P F VN ++KA S SS P+ A++ Y E K GFCPNS++F SL +K+G Sbjct: 103 IFKFIASPGTFVVNNVVKAYSLSSEPNKALVFYFEMLKSGFCPNSYTFVSLFGCCAKVGC 162 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVGFDAGLS 1515 LG++ HGQA KN D +L V+NS++H Y C G D GL+ Sbjct: 163 AKLGKKYHGQAVKNGVDRILPVENSLIHCYGCCG-DMGLA 201 >ref|XP_002327644.1| predicted protein [Populus trichocarpa] Length = 564 Score = 97.8 bits (242), Expect = 2e-17 Identities = 48/100 (48%), Positives = 65/100 (65%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IF+ I P F VN ++KA S SS P+ A++ Y E K GFCPNS++F SL +K+G Sbjct: 103 IFKFIASPGTFVVNNVVKAYSLSSEPNKALVFYFEMLKSGFCPNSYTFVSLFGCCAKVGC 162 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVGFDAGLS 1515 LG++ HGQA KN D +L V+NS++H Y C G D GL+ Sbjct: 163 AKLGKKYHGQAVKNGVDRILPVENSLIHCYGCCG-DMGLA 201 >emb|CAN73672.1| hypothetical protein VITISV_031859 [Vitis vinifera] Length = 901 Score = 96.7 bits (239), Expect = 4e-17 Identities = 49/94 (52%), Positives = 58/94 (61%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFR I P CVN +IKA S SSV H A++ Y E ++GF NSF+FP L S K G Sbjct: 426 IFRSIDSPDTVCVNAVIKAYSISSVAHQALVFYFETLRNGFMCNSFTFPPLFSCCRKXGC 485 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVG 1533 + G + HGQA KN D VL VQNSM+H Y C G Sbjct: 486 VEYGEKFHGQAIKNGVDNVLDVQNSMVHMYGCCG 519 >ref|XP_006425390.1| hypothetical protein CICLE_v10027592mg [Citrus clementina] gi|557527380|gb|ESR38630.1| hypothetical protein CICLE_v10027592mg [Citrus clementina] Length = 563 Score = 94.0 bits (232), Expect = 2e-16 Identities = 43/94 (45%), Positives = 59/94 (62%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 +F+CI P FCVN ++KA S S VP A++ Y + K+GF PNS++F SL + +K G Sbjct: 103 VFKCINNPGTFCVNAVVKAYSNSCVPDQAVVFYFQMIKNGFMPNSYTFVSLFGSCAKTGC 162 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVG 1533 + G CHG A KN DF L V NS+++ Y C G Sbjct: 163 VERGGMCHGLALKNGVDFELPVMNSLINMYGCFG 196 >gb|EXC35313.1| hypothetical protein L484_026636 [Morus notabilis] Length = 577 Score = 92.8 bits (229), Expect = 5e-16 Identities = 41/94 (43%), Positives = 59/94 (62%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFR I FP FCVNT+++A S + A++ Y E ++GF PNS++F +++ +K+G Sbjct: 98 IFRYIDFPGAFCVNTVLRAYSVGFDSNQALIFYFESLRNGFSPNSYTFVTVLGCCAKLGS 157 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVG 1533 L G C GQA KN D L +QNS++H Y C G Sbjct: 158 LESGEMCRGQAIKNGVDSALQIQNSLIHMYGCCG 191 >gb|ESW29877.1| hypothetical protein PHAVU_002G106000g [Phaseolus vulgaris] Length = 583 Score = 90.9 bits (224), Expect = 2e-15 Identities = 43/94 (45%), Positives = 57/94 (60%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFR I FCVNT+I A S PH ++ Y GF PNS++F LV + ++ G Sbjct: 99 IFRHINSSDTFCVNTVIHAYCDSDAPHQTVIFYFRSLMRGFFPNSYTFVPLVGSCARTGC 158 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVG 1533 ++ G++CH QA KN D VL VQNS++H YAC G Sbjct: 159 VDSGKECHAQATKNGVDSVLPVQNSLIHMYACCG 192 >ref|XP_003516541.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Glycine max] Length = 579 Score = 90.5 bits (223), Expect = 3e-15 Identities = 44/94 (46%), Positives = 58/94 (61%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFR I FCVN +I+A S S P A++ Y GF PNS++F LV++ +KMG Sbjct: 95 IFRSINSLDTFCVNIVIQAYSNSHAPREAIVFYFRSLMRGFFPNSYTFVPLVASCAKMGC 154 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVG 1533 + G++CH QA KN D VL VQNS++H Y C G Sbjct: 155 IGSGKECHAQATKNGVDSVLPVQNSLIHMYVCCG 188 >ref|XP_003612228.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355513563|gb|AES95186.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 665 Score = 90.1 bits (222), Expect = 3e-15 Identities = 43/85 (50%), Positives = 56/85 (65%) Frame = -2 Query: 1784 FCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQ 1605 FCVNT+I + S VPH A++ Y K GF NS++F SL+SA SKM ++ G+ CHGQ Sbjct: 103 FCVNTVINSYCNSYVPHKAIVFYFSSLKIGFFANSYTFVSLISACSKMSCVDNGKMCHGQ 162 Query: 1604 AAKNEFDFVLLVQNSMLHFYACVGF 1530 A KN DFVL V+NS+ H Y G+ Sbjct: 163 AVKNGVDFVLPVENSLAHMYGSCGY 187 >ref|XP_004158900.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Cucumis sativus] Length = 547 Score = 87.8 bits (216), Expect = 2e-14 Identities = 44/92 (47%), Positives = 61/92 (66%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFR I+ P+ FCVN +IKA S S+VP A+ +Y E +G P+S++F SL SA + G Sbjct: 104 IFRHIKVPNTFCVNRVIKAYSLSTVPLEAVFVYFEWLGNGLRPDSYTFLSLFSACASFGC 163 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYAC 1539 GR+CHGQA KN D V+++ NS++H Y C Sbjct: 164 GASGRKCHGQAFKNGVDSVMVLGNSLIHMYGC 195 >ref|XP_004135020.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Cucumis sativus] Length = 575 Score = 87.8 bits (216), Expect = 2e-14 Identities = 44/92 (47%), Positives = 61/92 (66%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IFR I+ P+ FCVN +IKA S S+VP A+ +Y E +G P+S++F SL SA + G Sbjct: 103 IFRHIKVPNTFCVNRVIKAYSLSTVPLEAVFVYFEWLGNGLRPDSYTFLSLFSACASFGC 162 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYAC 1539 GR+CHGQA KN D V+++ NS++H Y C Sbjct: 163 GASGRKCHGQAFKNGVDSVMVLGNSLIHMYGC 194 >ref|XP_004250558.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Solanum lycopersicum] Length = 195 Score = 87.0 bits (214), Expect = 3e-14 Identities = 39/72 (54%), Positives = 54/72 (75%) Frame = -2 Query: 1745 SVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQAAKNEFDFVLLVQ 1566 ++P A++ Y +R K+GF PNSF+FP L+SA +K G L+ G++CHGQ KN D VL VQ Sbjct: 36 TLPDNAVVFYYQRLKNGFLPNSFTFPPLMSACAKTGSLDSGQKCHGQVMKNGVDGVLQVQ 95 Query: 1565 NSMLHFYACVGF 1530 NS++HFY+C GF Sbjct: 96 NSIVHFYSCCGF 107 >ref|XP_004512166.1| PREDICTED: pentatricopeptide repeat-containing protein At3g51320-like [Cicer arietinum] Length = 598 Score = 85.5 bits (210), Expect = 8e-14 Identities = 41/84 (48%), Positives = 55/84 (65%) Frame = -2 Query: 1784 FCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQ 1605 FCVNT+I + S VP+ A++ Y + K F PNS++F L+ + S MG ++ GR CH Q Sbjct: 115 FCVNTVINSYCNSYVPNKAIVFYFQSLKIRFFPNSYTFVPLIGSCSNMGCVDSGRMCHAQ 174 Query: 1604 AAKNEFDFVLLVQNSMLHFYACVG 1533 A KN DFVL VQNS++H YA G Sbjct: 175 AVKNGVDFVLPVQNSLVHMYASCG 198 >ref|XP_006857380.1| hypothetical protein AMTR_s00067p00130250 [Amborella trichopoda] gi|548861473|gb|ERN18847.1| hypothetical protein AMTR_s00067p00130250 [Amborella trichopoda] Length = 823 Score = 78.6 bits (192), Expect = 1e-11 Identities = 46/122 (37%), Positives = 63/122 (51%) Frame = -2 Query: 1847 CLSMGFFNYN*IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFP 1668 CLS +FR + P NT+IKA S SS P A+ Y E G PN+F+FP Sbjct: 70 CLSYALM----VFRQLNSPELRAYNTIIKALSLSSDPIQAISFYHEMVLKGVHPNNFTFP 125 Query: 1667 SLVSAYSKMGILNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVGFDAGLSSNFNVFVYG 1488 LV++ +K+ +N G +CH + K FD V+ V NS++H YAC + F V Sbjct: 126 PLVASCAKVTAINEGEKCHTEVVKRGFDQVIFVANSLVHMYACFKLISYARQVFYEMVER 185 Query: 1487 DF 1482 DF Sbjct: 186 DF 187 >ref|XP_006403930.1| hypothetical protein EUTSA_v10010283mg [Eutrema salsugineum] gi|557105049|gb|ESQ45383.1| hypothetical protein EUTSA_v10010283mg [Eutrema salsugineum] Length = 529 Score = 75.9 bits (185), Expect = 7e-11 Identities = 36/84 (42%), Positives = 49/84 (58%) Frame = -2 Query: 1784 FCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQ 1605 +C N + KA SS P A+ Y + RK GF P+++SF L K ++ G+ CHGQ Sbjct: 84 YCANPVFKAYLLSSTPQQALGFYFDIRKCGFVPDTYSFVPLFGCIEKTCCVDSGKMCHGQ 143 Query: 1604 AAKNEFDFVLLVQNSMLHFYACVG 1533 A K+ D VL VQNS++H Y C G Sbjct: 144 AIKHGCDQVLPVQNSLMHMYTCCG 167 >ref|NP_190700.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|122230198|sp|Q0WVU0.1|PP278_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g51320 gi|110741620|dbj|BAE98758.1| hypothetical protein [Arabidopsis thaliana] gi|332645257|gb|AEE78778.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 530 Score = 75.9 bits (185), Expect = 7e-11 Identities = 36/84 (42%), Positives = 51/84 (60%) Frame = -2 Query: 1784 FCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQ 1605 +C N + KA SS P A+ Y + + GF P+S++F SL+S K ++ G+ CHGQ Sbjct: 84 YCANPVFKAYLVSSSPKQALGFYFDILRFGFVPDSYTFVSLISCIEKTCCVDSGKMCHGQ 143 Query: 1604 AAKNEFDFVLLVQNSMLHFYACVG 1533 A K+ D VL VQNS++H Y C G Sbjct: 144 AIKHGCDQVLPVQNSLMHMYTCCG 167 >emb|CAB62654.1| putative protein [Arabidopsis thaliana] Length = 486 Score = 75.9 bits (185), Expect = 7e-11 Identities = 36/84 (42%), Positives = 51/84 (60%) Frame = -2 Query: 1784 FCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQ 1605 +C N + KA SS P A+ Y + + GF P+S++F SL+S K ++ G+ CHGQ Sbjct: 61 YCANPVFKAYLVSSSPKQALGFYFDILRFGFVPDSYTFVSLISCIEKTCCVDSGKMCHGQ 120 Query: 1604 AAKNEFDFVLLVQNSMLHFYACVG 1533 A K+ D VL VQNS++H Y C G Sbjct: 121 AIKHGCDQVLPVQNSLMHMYTCCG 144 >ref|XP_002877796.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297323634|gb|EFH54055.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 530 Score = 75.1 bits (183), Expect = 1e-10 Identities = 36/84 (42%), Positives = 51/84 (60%) Frame = -2 Query: 1784 FCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGILNLGRQCHGQ 1605 +C N + KA SS P A+ Y + + GF P++++F SLVS K ++ G+ CHGQ Sbjct: 84 YCANPVFKAYLVSSSPKQALGFYFDILRFGFVPDTYTFVSLVSCIEKTCCVDSGKMCHGQ 143 Query: 1604 AAKNEFDFVLLVQNSMLHFYACVG 1533 A K+ D VL VQNS++H Y C G Sbjct: 144 AIKHGCDQVLPVQNSLIHMYTCCG 167 >ref|XP_002267596.1| PREDICTED: pentatricopeptide repeat-containing protein At5g06540-like [Vitis vinifera] Length = 623 Score = 72.4 bits (176), Expect = 7e-10 Identities = 38/110 (34%), Positives = 59/110 (53%) Frame = -2 Query: 1814 IFRCIQFPSPFCVNTMIKACSCSSVPHMAMLLYIERRKDGFCPNSFSFPSLVSAYSKMGI 1635 IF IQ P+ F N MI+ S S P A Y++ ++ G P++ +FP LV + +K+ Sbjct: 75 IFSQIQNPNLFIFNAMIRGHSGSKNPDQAFHFYVQSQRQGLLPDNLTFPFLVKSCTKLHC 134 Query: 1634 LNLGRQCHGQAAKNEFDFVLLVQNSMLHFYACVGFDAGLSSNFNVFVYGD 1485 +++G Q HG K+ F+ + VQNS++H YA G + F Y D Sbjct: 135 ISMGSQAHGHIIKHGFEKDVYVQNSLVHMYATFGDTEAATLIFQRMYYVD 184