BLASTX nr result
ID: Catharanthus23_contig00032487
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00032487 (309 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOX98874.1| Tetratricopeptide repeat-like superfamily protein... 137 2e-30 emb|CBI37746.3| unnamed protein product [Vitis vinifera] 137 2e-30 ref|XP_002279627.1| PREDICTED: pentatricopeptide repeat-containi... 137 2e-30 emb|CAN79511.1| hypothetical protein VITISV_014157 [Vitis vinifera] 133 2e-29 ref|XP_002328268.1| predicted protein [Populus trichocarpa] gi|5... 133 3e-29 gb|ABK95971.1| unknown [Populus trichocarpa] 133 3e-29 ref|XP_006486839.1| PREDICTED: pentatricopeptide repeat-containi... 132 5e-29 ref|XP_006422430.1| hypothetical protein CICLE_v10027827mg [Citr... 132 5e-29 gb|EXC31540.1| hypothetical protein L484_006572 [Morus notabilis] 126 3e-27 ref|XP_004231461.1| PREDICTED: pentatricopeptide repeat-containi... 125 6e-27 ref|XP_002520932.1| pentatricopeptide repeat-containing protein,... 125 8e-27 gb|EMJ00856.1| hypothetical protein PRUPE_ppa003439mg [Prunus pe... 122 5e-26 ref|XP_004168522.1| PREDICTED: pentatricopeptide repeat-containi... 119 5e-25 ref|XP_004140062.1| PREDICTED: pentatricopeptide repeat-containi... 118 9e-25 ref|XP_004292418.1| PREDICTED: pentatricopeptide repeat-containi... 117 2e-24 ref|XP_006409337.1| hypothetical protein EUTSA_v10023059mg [Eutr... 111 1e-22 sp|Q9SII7.2|PP159_ARATH RecName: Full=Pentatricopeptide repeat-c... 111 1e-22 ref|NP_179312.1| pentatricopeptide repeat-containing protein [Ar... 111 1e-22 ref|XP_006299587.1| hypothetical protein CARUB_v10015765mg [Caps... 109 3e-22 ref|XP_004228842.1| PREDICTED: pentatricopeptide repeat-containi... 108 7e-22 >gb|EOX98874.1| Tetratricopeptide repeat-like superfamily protein, putative [Theobroma cacao] Length = 742 Score = 137 bits (344), Expect = 2e-30 Identities = 66/102 (64%), Positives = 81/102 (79%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG P++ALAL +M+ QGLKPN V TLS LSACSHGGL+EEGLS ++++ EYG Sbjct: 528 GMNGLPREALALVPEMKLQGLKPNSVTTLSALSACSHGGLIEEGLSFLKSMVHEYGTVPG 587 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC+ID+L RAG+LDSA+ LI IP+ + GASAWGAIL Sbjct: 588 LEHYSCVIDMLGRAGKLDSAVELINHIPDGHKAGASAWGAIL 629 >emb|CBI37746.3| unnamed protein product [Vitis vinifera] Length = 2090 Score = 137 bits (344), Expect = 2e-30 Identities = 63/102 (61%), Positives = 84/102 (82%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +DALAL ++M+ GLKPN+V TLSVLSACSHGGLVEEGLS F ++Q++G Sbjct: 516 GMNGLARDALALLSEMKLHGLKPNVVTTLSVLSACSHGGLVEEGLSFFENMVQDHGVEPG 575 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC++D+L+RAG+L+SA+NLI+++P +R GA WGA+L Sbjct: 576 LEHYSCMVDMLSRAGKLNSAMNLIEKMPERMRDGAGLWGALL 617 >ref|XP_002279627.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17210-like [Vitis vinifera] Length = 742 Score = 137 bits (344), Expect = 2e-30 Identities = 63/102 (61%), Positives = 84/102 (82%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +DALAL ++M+ GLKPN+V TLSVLSACSHGGLVEEGLS F ++Q++G Sbjct: 530 GMNGLARDALALLSEMKLHGLKPNVVTTLSVLSACSHGGLVEEGLSFFENMVQDHGVEPG 589 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC++D+L+RAG+L+SA+NLI+++P +R GA WGA+L Sbjct: 590 LEHYSCMVDMLSRAGKLNSAMNLIEKMPERMRDGAGLWGALL 631 >emb|CAN79511.1| hypothetical protein VITISV_014157 [Vitis vinifera] Length = 1007 Score = 133 bits (335), Expect = 2e-29 Identities = 62/102 (60%), Positives = 81/102 (79%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +DALAL ++M+ GLKPN V TLSVLSACSHGGLVEEGLS F ++Q++G Sbjct: 530 GMNGLARDALALLSEMKLHGLKPNXVTTLSVLSACSHGGLVEEGLSFFENMVQDHGVEPG 589 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC++D+L RAG+L+ A+NLI+++P +R GA WGA+L Sbjct: 590 LEHYSCMVDMLXRAGKLNXAMNLIEKMPERMRDGAGLWGALL 631 >ref|XP_002328268.1| predicted protein [Populus trichocarpa] gi|566167839|ref|XP_006384846.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550341614|gb|ERP62643.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 749 Score = 133 bits (334), Expect = 3e-29 Identities = 62/102 (60%), Positives = 82/102 (80%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +ALAL A+M+ GLKPN V TLSVL+ACSHGGLVEEGLSLF++++QE G Sbjct: 529 GMNGLAHEALALFAEMKRHGLKPNPVTTLSVLAACSHGGLVEEGLSLFKSMVQELGLEPG 588 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 EHYSC++D+L RAG+LD+A+ +IK +P+ L+ GAS WG++L Sbjct: 589 FEHYSCMVDMLGRAGKLDTAIEVIKAMPHNLKNGASIWGSLL 630 >gb|ABK95971.1| unknown [Populus trichocarpa] Length = 749 Score = 133 bits (334), Expect = 3e-29 Identities = 62/102 (60%), Positives = 82/102 (80%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +ALAL A+M+ GLKPN V TLSVL+ACSHGGLVEEGLSLF++++QE G Sbjct: 529 GMNGLAHEALALFAEMKRHGLKPNPVTTLSVLAACSHGGLVEEGLSLFKSMVQELGLEPG 588 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 EHYSC++D+L RAG+LD+A+ +IK +P+ L+ GAS WG++L Sbjct: 589 FEHYSCMVDMLGRAGKLDTAIEVIKAMPDNLKNGASIWGSLL 630 >ref|XP_006486839.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17210-like [Citrus sinensis] Length = 755 Score = 132 bits (332), Expect = 5e-29 Identities = 63/102 (61%), Positives = 82/102 (80%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +ALAL A+M+ GL+PN V TLSVLSACSHGGLVEEGLS F +++Q++G Sbjct: 539 GMNGLAHEALALVAEMKLGGLQPNAVTTLSVLSACSHGGLVEEGLSFFNSMVQDHGVEPA 598 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC++D+LARAG LD A++LI ++P+ L+ ASAWGA+L Sbjct: 599 LEHYSCMVDMLARAGELDIAIDLINQMPDNLKATASAWGALL 640 >ref|XP_006422430.1| hypothetical protein CICLE_v10027827mg [Citrus clementina] gi|557524364|gb|ESR35670.1| hypothetical protein CICLE_v10027827mg [Citrus clementina] Length = 825 Score = 132 bits (332), Expect = 5e-29 Identities = 63/102 (61%), Positives = 82/102 (80%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +ALAL A+M+ GL+PN V TLSVLSACSHGGLVEEGLS F +++Q++G Sbjct: 529 GMNGLAHEALALVAEMKLGGLQPNAVTTLSVLSACSHGGLVEEGLSFFNSMVQDHGVEPA 588 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC++D+LARAG LD A++LI ++P+ L+ ASAWGA+L Sbjct: 589 LEHYSCMVDMLARAGELDIAIDLINQMPDNLKATASAWGALL 630 >gb|EXC31540.1| hypothetical protein L484_006572 [Morus notabilis] Length = 743 Score = 126 bits (316), Expect = 3e-27 Identities = 58/102 (56%), Positives = 79/102 (77%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMN ++ALALHA+ + GLKPN V TL VLSACSHGGL+EEGLS F ++ +++G Sbjct: 529 GMNSLAREALALHAETKLHGLKPNAVTTLCVLSACSHGGLLEEGLSFFNSMARDHGVEPT 588 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 LEHYSC++D+L+RAG L+SA++ IK++P L GA+AW A+L Sbjct: 589 LEHYSCVVDMLSRAGNLNSAMDFIKKMPEGLEAGANAWSAVL 630 >ref|XP_004231461.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17210-like [Solanum lycopersicum] Length = 752 Score = 125 bits (314), Expect = 6e-27 Identities = 61/102 (59%), Positives = 81/102 (79%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GM G P +ALAL +M+ GL+PN V LS+LSACSHGGLVEEG+SLF L+ ++ + Sbjct: 524 GMIGLPNEALALFHEMKVCGLRPNQVTALSLLSACSHGGLVEEGVSLFEELIWDHEVELV 583 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 +EHYSCL+DLLARAG++DSA+NLI ++ ++PGASAWGA+L Sbjct: 584 IEHYSCLVDLLARAGKVDSAMNLIGKLGVGVKPGASAWGALL 625 >ref|XP_002520932.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223539898|gb|EEF41477.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 757 Score = 125 bits (313), Expect = 8e-27 Identities = 57/102 (55%), Positives = 79/102 (77%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +ALAL A+M+S +KPN + LSVL+ACSHGGLVE GLS+F++++Q++G Sbjct: 545 GMNGLAHEALALLAQMKSHEIKPNALTYLSVLTACSHGGLVEMGLSVFKSMIQDHGVDPE 604 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 EHYSC++D+L+RAG+LD A+ LI+ +P R GAS WGA+L Sbjct: 605 FEHYSCMVDMLSRAGKLDDAMELIRMMPETFRAGASVWGALL 646 >gb|EMJ00856.1| hypothetical protein PRUPE_ppa003439mg [Prunus persica] Length = 574 Score = 122 bits (306), Expect = 5e-26 Identities = 60/105 (57%), Positives = 79/105 (75%), Gaps = 3/105 (2%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG +ALAL A+M+ GLKPN V LSVLSACSHGGLVEEG+SLF ++ Q++G R Sbjct: 355 GMNGLGHEALALLAEMKLYGLKPNAVTILSVLSACSHGGLVEEGVSLFNSMAQDHGVEPR 414 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGA---SAWGAIL 308 LEHY+C++D+L RAG+L +A+ IK P +L+ A +AWGA+L Sbjct: 415 LEHYTCVVDMLGRAGKLVTAMEFIKRFPQDLKAAARASNAWGALL 459 >ref|XP_004168522.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17210-like [Cucumis sativus] Length = 747 Score = 119 bits (297), Expect = 5e-25 Identities = 53/101 (52%), Positives = 75/101 (74%) Frame = +3 Query: 6 MNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVRL 185 +NG +AL L K++ G KPN V LS+LSACSHGGL+EEGLS F +++Q++G L Sbjct: 529 INGLAHEALMLFEKIKQNGTKPNAVTALSLLSACSHGGLIEEGLSFFTSMVQKHGIEPGL 588 Query: 186 EHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 EHYSC++D+L+RAG+ + AL LI+++P E+ GAS WG +L Sbjct: 589 EHYSCIVDMLSRAGKFNEALELIEKLPKEMEAGASIWGTLL 629 >ref|XP_004140062.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17210-like [Cucumis sativus] Length = 747 Score = 118 bits (295), Expect = 9e-25 Identities = 53/101 (52%), Positives = 75/101 (74%) Frame = +3 Query: 6 MNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVRL 185 +NG +AL L K++ G KPN V LS+LSACSHGGL+EEGLS F +++Q++G L Sbjct: 529 INGLAHEALMLFEKIKQNGTKPNAVTALSLLSACSHGGLMEEGLSFFTSMVQKHGIEPGL 588 Query: 186 EHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 EHYSC++D+L+RAG+ + AL LI+++P E+ GAS WG +L Sbjct: 589 EHYSCIVDMLSRAGKFNEALELIEKLPKEMEAGASIWGTLL 629 >ref|XP_004292418.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17210-like [Fragaria vesca subsp. vesca] Length = 750 Score = 117 bits (292), Expect = 2e-24 Identities = 60/104 (57%), Positives = 78/104 (75%), Gaps = 2/104 (1%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 GMNG ALAL +M+ +KPN V TLSVLSACSHGGLVEEGLS F +L+Q++G R Sbjct: 531 GMNGLAHKALALVEEMKLYAVKPNAVTTLSVLSACSHGGLVEEGLSFFNSLVQDHGVEPR 590 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRP-GA-SAWGAIL 308 LEHY+C++D+L RAG+L A++ IK+IP L+ GA +AW A+L Sbjct: 591 LEHYACVVDMLGRAGKLQMAMDFIKKIPQGLKAVGANAAWSALL 634 >ref|XP_006409337.1| hypothetical protein EUTSA_v10023059mg [Eutrema salsugineum] gi|557110499|gb|ESQ50790.1| hypothetical protein EUTSA_v10023059mg [Eutrema salsugineum] Length = 741 Score = 111 bits (277), Expect = 1e-22 Identities = 49/101 (48%), Positives = 77/101 (76%) Frame = +3 Query: 6 MNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVRL 185 M+G P ALA +M+ +G KPN V L+VLSAC+HGGL+++GL +F+++++++ L Sbjct: 526 MSGLPDKALASFEEMKREGYKPNAVTYLAVLSACNHGGLIKQGLMIFKSMVKDHNKP-SL 584 Query: 186 EHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 +HYSC++D+L+RAG +D A+ LIK +P ++ GASAWG+IL Sbjct: 585 QHYSCVVDMLSRAGEIDKAMELIKNLPEHVKAGASAWGSIL 625 >sp|Q9SII7.2|PP159_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17210 Length = 736 Score = 111 bits (277), Expect = 1e-22 Identities = 51/101 (50%), Positives = 77/101 (76%) Frame = +3 Query: 6 MNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVRL 185 +NG P ALAL +M+ +G PN V L+ LSAC+HGGLV++GL +F+++++E L Sbjct: 526 INGLPDKALALFDEMKQKGYTPNAVTYLAALSACNHGGLVKKGLMIFKSMVEE-DHKPSL 584 Query: 186 EHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 +HYSC++D+L+RAG +D+A+ LIK +P +++ GASAWGAIL Sbjct: 585 QHYSCIVDMLSRAGEIDTAVELIKNLPEDVKAGASAWGAIL 625 >ref|NP_179312.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|4584344|gb|AAD25139.1| putative selenium-binding protein [Arabidopsis thaliana] gi|330251504|gb|AEC06598.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 715 Score = 111 bits (277), Expect = 1e-22 Identities = 51/101 (50%), Positives = 77/101 (76%) Frame = +3 Query: 6 MNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVRL 185 +NG P ALAL +M+ +G PN V L+ LSAC+HGGLV++GL +F+++++E L Sbjct: 505 INGLPDKALALFDEMKQKGYTPNAVTYLAALSACNHGGLVKKGLMIFKSMVEE-DHKPSL 563 Query: 186 EHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 +HYSC++D+L+RAG +D+A+ LIK +P +++ GASAWGAIL Sbjct: 564 QHYSCIVDMLSRAGEIDTAVELIKNLPEDVKAGASAWGAIL 604 >ref|XP_006299587.1| hypothetical protein CARUB_v10015765mg [Capsella rubella] gi|482568296|gb|EOA32485.1| hypothetical protein CARUB_v10015765mg [Capsella rubella] Length = 740 Score = 109 bits (273), Expect = 3e-22 Identities = 50/101 (49%), Positives = 77/101 (76%) Frame = +3 Query: 6 MNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVRL 185 +NG P ALAL +M+ QG PN V L+ LSAC+HGGLV++GL +F+++++ + L Sbjct: 528 INGLPDKALALFEEMKRQGNTPNAVTYLAALSACNHGGLVKKGLMIFKSMVENDHKPL-L 586 Query: 186 EHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 +HYSC++D+L+RAG +D+A+ LIK++P +++ GAS WGAIL Sbjct: 587 QHYSCIVDMLSRAGEIDTAMELIKKLPEDVKAGASTWGAIL 627 >ref|XP_004228842.1| PREDICTED: pentatricopeptide repeat-containing protein At1g26900, mitochondrial-like [Solanum lycopersicum] Length = 544 Score = 108 bits (270), Expect = 7e-22 Identities = 52/102 (50%), Positives = 73/102 (71%) Frame = +3 Query: 3 GMNGPPQDALALHAKMRSQGLKPNLVITLSVLSACSHGGLVEEGLSLFRTLLQEYGAAVR 182 G++G +DA+AL +M +G +PN V L+V SACSHGGLV EG+S FR ++ EYG + Sbjct: 353 GVHGEAKDAIALFHRMEDEGFRPNEVTFLAVFSACSHGGLVAEGISCFRKMVLEYGLTPK 412 Query: 183 LEHYSCLIDLLARAGRLDSALNLIKEIPNELRPGASAWGAIL 308 +EHY CLID+L RAG L++A LIK++P + A+AW A+L Sbjct: 413 IEHYGCLIDILGRAGLLETARELIKDLP--IEGDATAWRALL 452