BLASTX nr result
ID: Catharanthus22_contig00027880
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00027880 (439 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value
ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containi...  133  3e-29
ref|XP_004295634.1| PREDICTED: pentatricopeptide repeat-containi...  129  3e-28
ref|XP_006355278.1| PREDICTED: pentatricopeptide repeat-containi...  127  2e-27
ref|XP_004244886.1| PREDICTED: pentatricopeptide repeat-containi...  123  2e-26
gb|EPS63069.1| hypothetical protein M569_11717 [Genlisea aurea]  121  8e-26
ref|XP_002531058.1| pentatricopeptide repeat-containing protein,...  119  3e-25
gb|EMJ13849.1| hypothetical protein PRUPE_ppa018206mg, partial [...  119  4e-25
gb|ESW19934.1| hypothetical protein PHAVU_006G167300g [Phaseolus...  116  3e-24
ref|XP_006468073.1| PREDICTED: pentatricopeptide repeat-containi...  115  5e-24
ref|XP_004485987.1| PREDICTED: pentatricopeptide repeat-containi...  115  8e-24
ref|XP_003541672.2| PREDICTED: pentatricopeptide repeat-containi...  111  1e-22
ref|XP_004158687.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...  108  6e-22
ref|XP_004134903.1| PREDICTED: pentatricopeptide repeat-containi...  108  6e-22
gb|EXB51999.1| hypothetical protein L484_019777 [Morus notabilis]  106  4e-21
gb|EOX96932.1| Tetratricopeptide repeat (TPR)-like superfamily p...  101  1e-19
ref|XP_006413827.1| hypothetical protein EUTSA_v10027143mg [Eutr...  87  3e-15
ref|XP_006836321.1| hypothetical protein AMTR_s00092p00064890 [A...  87  3e-15
ref|NP_001078414.1| pentatricopeptide repeat-containing protein ...  86  7e-15
emb|CAB45902.1| putative protein (fragment) [Arabidopsis thalian...
86 7e-15 ref|XP_002869909.1| binding protein [Arabidopsis lyrata subsp. l... 83 4e-14 >ref|XP_003632994.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Vitis vinifera] Length = 613 Score = 133 bits (334), Expect = 3e-29 Identities = 68/112 (60%), Positives = 83/112 (74%) Frame = +2 Query: 104 MHSNPKYSESSNLAEINPKPHLLNLTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKLR 283 MHSN + + +P+ H + TI+ P+ SPK YILKKCIALLLSCASS +K R Sbjct: 1 MHSN-QLGRQPLIPTHSPRKHF-SFTISTSTCPE-SPKSYILKKCIALLLSCASSKFKFR 57 Query: 284 QVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 Q+HAFSIRHGV L+NPDMGK+LIF L+S PMSYA +IF+QI+ PNIFTWN Sbjct: 58 QIHAFSIRHGVPLTNPDMGKYLIFTLLSFCSPMSYAHQIFSQIQNPNIFTWN 109 >ref|XP_004295634.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Fragaria vesca subsp. vesca] Length = 611 Score = 129 bits (325), Expect = 3e-28 Identities = 66/112 (58%), Positives = 82/112 (73%) Frame = +2 Query: 104 MHSNPKYSESSNLAEINPKPHLLNLTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKLR 283 +HS+ ++ +++L + NPK P +P P+IL+KCIALL SCASSN KL+ Sbjct: 6 IHSSLAHTVTADLTQ-NPK---------TTSFPSQTPLPFILQKCIALLQSCASSNSKLK 55 Query: 284 QVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 Q+HAFSIRHGV LSNPDMGKHLIF VSLS PMSYA IF+QI+ PN+FTWN Sbjct: 56 QIHAFSIRHGVPLSNPDMGKHLIFTSVSLSSPMSYAHHIFSQIKHPNVFTWN 107 >ref|XP_006355278.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Solanum tuberosum] Length = 585 Score = 127 bits (318), Expect = 2e-27 Identities = 60/77 (77%), Positives = 68/77 (88%) Frame = +2 Query: 209 SPKPYILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSY 388 S KPYI+KKCIALLLSCASS YK +QVHAFSIR + LS+P+MGK+LIF LVSLSGPM Y Sbjct: 5 STKPYIVKKCIALLLSCASSTYKFKQVHAFSIRRRIPLSSPEMGKYLIFTLVSLSGPMCY 64 Query: 389 ARKIFNQIEEPNIFTWN 439 A+KIFNQI+ PNIFTWN Sbjct: 65 AKKIFNQIQFPNIFTWN 81 >ref|XP_004244886.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Solanum lycopersicum] Length = 585 Score = 123 bits (309), Expect = 2e-26 
Identities = 59/77 (76%), Positives = 66/77 (85%) Frame = +2 Query: 209 SPKPYILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSY 388 S KPYI+KKCI LLLSCASS YK +QVHAFSIR + LSNP MGK+LIF LVSLSGPM Y Sbjct: 5 STKPYIVKKCITLLLSCASSTYKFKQVHAFSIRRRIPLSNPYMGKYLIFTLVSLSGPMCY 64 Query: 389 ARKIFNQIEEPNIFTWN 439 A++IFNQI+ PNIFTWN Sbjct: 65 AQQIFNQIQFPNIFTWN 81 >gb|EPS63069.1| hypothetical protein M569_11717 [Genlisea aurea] Length = 601 Score = 121 bits (304), Expect = 8e-26 Identities = 61/81 (75%), Positives = 68/81 (83%), Gaps = 1/81 (1%) Frame = +2 Query: 200 PQLSPKPYILKKCIALLLSCASSNY-KLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSG 376 P S + YILKKCIALLLSCASS+ KLRQVHAFSIRHGV+LS+P MGKHLIF LVSLS Sbjct: 16 PSESGRSYILKKCIALLLSCASSSVAKLRQVHAFSIRHGVSLSSPSMGKHLIFTLVSLSE 75 Query: 377 PMSYARKIFNQIEEPNIFTWN 439 PM YA K+F+QI PNIFTW+ Sbjct: 76 PMQYAHKVFDQIPHPNIFTWD 96 >ref|XP_002531058.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223529353|gb|EEF31319.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 341 Score = 119 bits (299), Expect = 3e-25 Identities = 62/113 (54%), Positives = 77/113 (68%), Gaps = 1/113 (0%) Frame = +2 Query: 104 MHSNPKYSESSNL-AEINPKPHLLNLTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKL 280 MHS P + L + NP + A + + +P PYI+KKCIALL CASS YKL Sbjct: 1 MHSTPPTDQQLVLHSSQNP----ITYFTAPKPASKENPIPYIVKKCIALLQICASSKYKL 56 Query: 281 RQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 +Q+HAFSIRHGV +NPDMGKHLI+ +VS+S PM+YA IF I+ PNIFTWN Sbjct: 57 QQIHAFSIRHGVLPNNPDMGKHLIYSIVSVSAPMTYAHNIFTLIQNPNIFTWN 109 >gb|EMJ13849.1| hypothetical protein PRUPE_ppa018206mg, partial [Prunus persica] Length = 604 Score = 119 bits (298), Expect = 4e-25 Identities = 58/96 (60%), Positives = 71/96 (73%) Frame = +2 Query: 152 NPKPHLLNLTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNP 331 NPK +L+ + PQ +P YIL+KCIALL CASS K++Q+HAFS+RHGV LS+P Sbjct: 6 NPKTLFSSLSAPSPTFPQ-NPIHYILQKCIALLQCCASSKLKMQQIHAFSVRHGVPLSSP 64 Query: 332 
DMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 DMGKHLIF VSL PM YA +IF+QI PN+FTWN Sbjct: 65 DMGKHLIFTTVSLKAPMPYAHQIFSQIRSPNVFTWN 100 >gb|ESW19934.1| hypothetical protein PHAVU_006G167300g [Phaseolus vulgaris] Length = 611 Score = 116 bits (291), Expect = 3e-24 Identities = 56/97 (57%), Positives = 66/97 (68%) Frame = +2 Query: 149 INPKPHLLNLTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSN 328 + P + + T PQ Y+L KCI LL S ASS YKLRQ+HAFSIRHGV+L N Sbjct: 11 VQSHPSMFHATNFLSTTPQNPLPYYLLTKCIVLLQSSASSKYKLRQIHAFSIRHGVSLHN 70 Query: 329 PDMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 PDM KHLIF +VSLS PMSYA +F +I PN+FTWN Sbjct: 71 PDMAKHLIFTIVSLSAPMSYAYNVFTRIHNPNVFTWN 107 >ref|XP_006468073.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Citrus sinensis] Length = 616 Score = 115 bits (289), Expect = 5e-24 Identities = 62/113 (54%), Positives = 81/113 (71%), Gaps = 1/113 (0%) Frame = +2 Query: 104 MHSN-PKYSESSNLAEINPKPHLLNLTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKL 280 MHS P Y E S+L L + T A+Q++P S +++KCI LL CASS +KL Sbjct: 6 MHSKQPSYEEISDLP--CRVKSLYHSTPASQENPITS----VVRKCITLLQVCASSKHKL 59 Query: 281 RQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 +QVHAFSIRHGV L+NPD+GK+LI+ +VSLS PMSYA IF+ +++PNIFTWN Sbjct: 60 KQVHAFSIRHGVPLNNPDLGKYLIYAIVSLSFPMSYAHNIFSHVQDPNIFTWN 112 >ref|XP_004485987.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cicer arietinum] Length = 610 Score = 115 bits (287), Expect = 8e-24 Identities = 59/106 (55%), Positives = 76/106 (71%), Gaps = 3/106 (2%) Frame = +2 Query: 131 SSNLAEI--NPKPHLLN-LTIAAQKDPQLSPKPYILKKCIALLLSCASSNYKLRQVHAFS 301 SS L+ + PK HL + +T + + +P +IL KCIALL CASS +KL+Q+HAFS Sbjct: 4 SSKLSSLFHTPKNHLSSFITFSTTSE---NPTSHILTKCIALLQYCASSKHKLKQIHAFS 60 Query: 302 IRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIFNQIEEPNIFTWN 439 IRHGV L+NPDMGK+LIF +VSLS PMSYA +F + PN+FTWN Sbjct: 61 IRHGVPLNNPDMGKYLIFTVVSLSAPMSYAYNVFTLLHNPNVFTWN 106 >ref|XP_003541672.2| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Glycine 
max] Length = 607 Score = 111 bits (277), Expect = 1e-22 Identities = 50/71 (70%), Positives = 59/71 (83%) Frame = +2 Query: 227 LKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIFN 406 L KCI+LL CASS +KL+Q+HAFSIRHGV+L+NPDMGKHLIF +VSLS PMSYA +F Sbjct: 33 LTKCISLLQFCASSKHKLKQIHAFSIRHGVSLNNPDMGKHLIFTIVSLSAPMSYAYNVFT 92 Query: 407 QIEEPNIFTWN 439 I PN+FTWN Sbjct: 93 VIHNPNVFTWN 103 >ref|XP_004158687.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis sativus] Length = 609 Score = 108 bits (271), Expect = 6e-22 Identities = 50/73 (68%), Positives = 59/73 (80%) Frame = +2 Query: 221 YILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKI 400 +IL+KCI+L+ C SS KL+Q+HAFSIRHGV NPD KHLIF LVSLS PMS+A +I Sbjct: 32 FILRKCISLVQLCGSSQSKLKQIHAFSIRHGVPPQNPDFNKHLIFALVSLSAPMSFAAQI 91 Query: 401 FNQIEEPNIFTWN 439 FNQI+ PNIFTWN Sbjct: 92 FNQIQAPNIFTWN 104 >ref|XP_004134903.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis sativus] Length = 609 Score = 108 bits (271), Expect = 6e-22 Identities = 50/73 (68%), Positives = 59/73 (80%) Frame = +2 Query: 221 YILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKI 400 +IL+KCI+L+ C SS KL+Q+HAFSIRHGV NPD KHLIF LVSLS PMS+A +I Sbjct: 32 FILRKCISLVQLCGSSQSKLKQIHAFSIRHGVPPQNPDFNKHLIFALVSLSAPMSFAAQI 91 Query: 401 FNQIEEPNIFTWN 439 FNQI+ PNIFTWN Sbjct: 92 FNQIQAPNIFTWN 104 >gb|EXB51999.1| hypothetical protein L484_019777 [Morus notabilis] Length = 623 Score = 106 bits (264), Expect = 4e-21 Identities = 50/77 (64%), Positives = 60/77 (77%) Frame = +2 Query: 209 SPKPYILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSY 388 +P P I+ K I+LL CASS KL Q+HAFSIRHGV L++PDMGKHLIF VSLS MSY Sbjct: 42 NPIPIIIAKYISLLQLCASSESKLMQIHAFSIRHGVPLADPDMGKHLIFTAVSLSASMSY 101 Query: 389 ARKIFNQIEEPNIFTWN 439 A +F+QI+ PNI+TWN Sbjct: 102 ANNVFSQIDRPNIYTWN 118 >gb|EOX96932.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] 
gi|508705037|gb|EOX96933.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|508705038|gb|EOX96934.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] Length = 616 Score = 101 bits (251), Expect = 1e-19 Identities = 46/77 (59%), Positives = 61/77 (79%) Frame = +2 Query: 209 SPKPYILKKCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSY 388 +P +I+KKCI+LL + SS KLRQ+HAFS+RHGV L++PD+GKHLI+ LVSLS PMSY Sbjct: 36 NPVSFIVKKCISLLQNYGSSELKLRQIHAFSLRHGVPLNDPDIGKHLIYSLVSLSTPMSY 95 Query: 389 ARKIFNQIEEPNIFTWN 439 IF++I+ N+F WN Sbjct: 96 PYSIFSRIQSSNVFIWN 112 >ref|XP_006413827.1| hypothetical protein EUTSA_v10027143mg [Eutrema salsugineum] gi|557114997|gb|ESQ55280.1| hypothetical protein EUTSA_v10027143mg [Eutrema salsugineum] Length = 595 Score = 86.7 bits (213), Expect = 3e-15 Identities = 45/76 (59%), Positives = 59/76 (77%), Gaps = 4/76 (5%) Frame = +2 Query: 224 ILKKCIALLLSCA-SSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSG--PMSYAR 394 ++ KCI LL +C SS KL++VHAFSIRHGV++S+ + GKHLIF LVSL PMSYA Sbjct: 14 MVDKCITLLQTCGVSSLTKLKKVHAFSIRHGVSISDAEFGKHLIFYLVSLPSPPPMSYAH 73 Query: 395 KIFNQIEEP-NIFTWN 439 K+F++IE+P N+F WN Sbjct: 74 KVFSKIEKPINVFIWN 89 >ref|XP_006836321.1| hypothetical protein AMTR_s00092p00064890 [Amborella trichopoda] gi|548838839|gb|ERM99174.1| hypothetical protein AMTR_s00092p00064890 [Amborella trichopoda] Length = 285 Score = 86.7 bits (213), Expect = 3e-15 Identities = 56/132 (42%), Positives = 74/132 (56%), Gaps = 20/132 (15%) Frame = +2 Query: 104 MHSNPKYSESSNLAEINPKPHLLNLTIAAQKDPQLSPKPYILK----------------- 232 MH NP S+S P+P ++ A K+P LSP P L Sbjct: 1 MHINPIPSQSFMPL---PRPS----SVPAFKNPTLSPLPSFLSLKPRTFTNSPYYPLPKT 53 Query: 233 ---KCIALLLSCASSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSGPMSYARKIF 403 +C+ALL +C +S ++Q+HAF +R+GVA S+P +GKHLIF LVSLS PM YA IF Sbjct: 54 IPNQCVALLQNC-NSLPSVKQIHAFGLRNGVAPSDPLVGKHLIFSLVSLSTPMRYALNIF 112 Query: 404 NQIEEPNIFTWN 439 + I+ PN FTWN Sbjct: 113 SHIQFPNAFTWN 124 
>ref|NP_001078414.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|223635630|sp|A8MQA3.2|PP330_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21065 gi|332658994|gb|AEE84394.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 595 Score = 85.5 bits (210), Expect = 7e-15 Identities = 45/76 (59%), Positives = 60/76 (78%), Gaps = 4/76 (5%) Frame = +2 Query: 224 ILKKCIALLLSCA-SSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSG--PMSYAR 394 +++KCI LL + SS KLRQ+HAFSIRHGV++S+ ++GKHLIF LVSL PMSYA Sbjct: 14 MVEKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAH 73 Query: 395 KIFNQIEEP-NIFTWN 439 K+F++IE+P N+F WN Sbjct: 74 KVFSKIEKPINVFIWN 89 >emb|CAB45902.1| putative protein (fragment) [Arabidopsis thaliana] gi|7268904|emb|CAB79107.1| putative protein (fragment) [Arabidopsis thaliana] Length = 1495 Score = 85.5 bits (210), Expect = 7e-15 Identities = 45/76 (59%), Positives = 60/76 (78%), Gaps = 4/76 (5%) Frame = +2 Query: 224 ILKKCIALLLSCA-SSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSG--PMSYAR 394 +++KCI LL + SS KLRQ+HAFSIRHGV++S+ ++GKHLIF LVSL PMSYA Sbjct: 14 MVEKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAH 73 Query: 395 KIFNQIEEP-NIFTWN 439 K+F++IE+P N+F WN Sbjct: 74 KVFSKIEKPINVFIWN 89 >ref|XP_002869909.1| binding protein [Arabidopsis lyrata subsp. lyrata] gi|297315745|gb|EFH46168.1| binding protein [Arabidopsis lyrata subsp. lyrata] Length = 595 Score = 82.8 bits (203), Expect = 4e-14 Identities = 44/76 (57%), Positives = 60/76 (78%), Gaps = 4/76 (5%) Frame = +2 Query: 224 ILKKCIALLLSCA-SSNYKLRQVHAFSIRHGVALSNPDMGKHLIFLLVSLSG--PMSYAR 394 +++KCI LL + SS KLRQ+HAFSIR+GV++S+ ++GKHLIF LVSL PMSYA Sbjct: 14 MVEKCINLLQTYGVSSLTKLRQIHAFSIRNGVSISDAELGKHLIFYLVSLPSPPPMSYAH 73 Query: 395 KIFNQIEEP-NIFTWN 439 K+F++IE+P N+F WN Sbjct: 74 KVFSKIEKPINVFIWN 89