BLASTX nr result
ID: Catharanthus23_contig00035756
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00035756 (309 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004231149.1| PREDICTED: pentatricopeptide repeat-containi... 163 3e-38 ref|XP_006347950.1| PREDICTED: pentatricopeptide repeat-containi... 161 7e-38 gb|EOX96449.1| Pentatricopeptide repeat superfamily protein isof... 152 3e-35 ref|XP_002326871.1| predicted protein [Populus trichocarpa] 151 1e-34 ref|XP_006397667.1| hypothetical protein EUTSA_v10001725mg [Eutr... 146 2e-33 emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera] 145 7e-33 ref|XP_004308509.1| PREDICTED: pentatricopeptide repeat-containi... 144 9e-33 ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containi... 144 1e-32 gb|EPS70650.1| hypothetical protein M569_04107 [Genlisea aurea] 142 4e-32 ref|XP_006293876.1| hypothetical protein CARUB_v10022861mg [Caps... 140 2e-31 ref|XP_002511599.1| pentatricopeptide repeat-containing protein,... 140 2e-31 ref|NP_182015.1| Pentatricopeptide repeat-containing protein [Ar... 139 4e-31 ref|XP_002880144.1| pentatricopeptide repeat-containing protein ... 139 5e-31 gb|EXB37620.1| hypothetical protein L484_021826 [Morus notabilis] 138 7e-31 gb|ABE65907.1| pentatricopeptide repeat-containing protein [Arab... 137 2e-30 gb|EMJ05119.1| hypothetical protein PRUPE_ppa015039mg, partial [... 135 4e-30 ref|XP_003551717.1| PREDICTED: pentatricopeptide repeat-containi... 130 2e-28 gb|ESW11626.1| hypothetical protein PHAVU_008G046100g [Phaseolus... 128 7e-28 ref|XP_004986805.1| PREDICTED: pentatricopeptide repeat-containi... 126 3e-27 ref|XP_004489099.1| PREDICTED: pentatricopeptide repeat-containi... 122 5e-26 >ref|XP_004231149.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like [Solanum lycopersicum] Length = 605 Score = 163 bits (412), Expect = 3e-38 Identities = 77/103 (74%), Positives = 87/103 (84%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQPQEAL+LF E + EPD VTVVSVLPAIADLG L+LGNWVH +V++KKL Sbjct: 286 GGYCQNKQPQEALKLFHELQMGTTLEPDGVTVVSVLPAIADLGALDLGNWVHQYVKRKKL 345 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DRSSNVCTAL+DMYAKCGEI KA+ F+ IKVK +S WNALIN Sbjct: 346 DRSSNVCTALIDMYAKCGEIAKAREFFNEIKVKESSSWNALIN 388 >ref|XP_006347950.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like [Solanum tuberosum] Length = 603 Score = 161 bits (408), Expect = 7e-38 Identities = 76/103 (73%), Positives = 86/103 (83%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQPQEAL+LF E + EPD VTVVSVLPAIADLG L+LGNW+H +V+++KL Sbjct: 284 GGYCQNKQPQEALKLFHELQMGTTLEPDGVTVVSVLPAIADLGALDLGNWIHQYVKRRKL 343 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DRSSNVCTALVDMYAKCGEI KA+ F IKVK +S WNALIN Sbjct: 344 DRSSNVCTALVDMYAKCGEIAKAREFFDEIKVKESSSWNALIN 386 >gb|EOX96449.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] gi|508704554|gb|EOX96450.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] gi|508704555|gb|EOX96451.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] Length = 890 Score = 152 bits (385), Expect = 3e-35 Identities = 70/103 (67%), Positives = 83/103 (80%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQP EAL+LF E + + FEPD VT+VS+LPAIADLG L+LG WVHHFV++KKL Sbjct: 578 GGYCQNKQPHEALKLFHEMQSSTFFEPDKVTIVSILPAIADLGALDLGEWVHHFVQRKKL 637 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D++ NVCT LVDMYAKCGEI KAK +F + K + WNALIN Sbjct: 638 DKAINVCTGLVDMYAKCGEINKAKRIFYEMPEKEIASWNALIN 680 >ref|XP_002326871.1| predicted protein [Populus trichocarpa] Length = 581 Score = 151 bits (381), Expect = 1e-34 Identities = 71/103 (68%), Positives = 83/103 (80%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQP EAL+LFRE + + FEP++VTVVS+LPAIA LG LELG WVH FV++KKL Sbjct: 269 GGYCQNKQPHEALKLFRELQSSTVFEPNEVTVVSILPAIATLGALELGEWVHRFVQRKKL 328 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D + NVCT+LVDMY KCGEI KA+ VFS I K + WNALIN Sbjct: 329 DAAVNVCTSLVDMYLKCGEISKARKVFSEIPKKETATWNALIN 371 >ref|XP_006397667.1| hypothetical protein EUTSA_v10001725mg [Eutrema salsugineum] gi|557098740|gb|ESQ39120.1| hypothetical protein EUTSA_v10001725mg [Eutrema salsugineum] Length = 644 Score = 146 bits (369), Expect = 2e-33 Identities = 65/103 (63%), Positives = 82/103 (79%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQPQEA+RLF+E ++ EPDDVTVVSVLPAI+D G L LG W HHFV++KKL Sbjct: 246 GGYCQNKQPQEAIRLFQEMQATTSLEPDDVTVVSVLPAISDTGALSLGEWCHHFVQRKKL 305 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D+ VCTA++DMY+KCGEIEKA+ +F + K + WNA+I+ Sbjct: 306 DKMVKVCTAILDMYSKCGEIEKARKIFDEMPEKEVASWNAMIH 348 >emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera] Length = 751 Score = 145 bits (365), Expect = 7e-33 Identities = 68/102 (66%), Positives = 83/102 (81%) Frame = -1 Query: 306 GYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKLD 127 GY QNKQP EAL+LF E + ++ EPD+VT+VSVLPAIADLG L+LG WVH FVR+KKLD Sbjct: 430 GYXQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALDLGGWVHRFVRRKKLD 489 Query: 126 RSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 R++NV TAL+DMYAKCGEI K++ VF N+ K + WNALIN Sbjct: 490 RATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETASWNALIN 531 >ref|XP_004308509.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like [Fragaria vesca subsp. vesca] Length = 563 Score = 144 bits (364), Expect = 9e-33 Identities = 66/103 (64%), Positives = 82/103 (79%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQP EA+RLF E + ++ EPD VT+VS+LPAIADLG L+LG+WVH FV +KKL Sbjct: 246 GGYCQNKQPHEAVRLFHEMQSSTSLEPDAVTIVSILPAIADLGALDLGHWVHEFVERKKL 305 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D+ +N+ TALVDMYAKCGEI KA+ +F + K + WNALIN Sbjct: 306 DKLTNIYTALVDMYAKCGEITKARKLFDEMPEKETASWNALIN 348 >ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880 [Vitis vinifera] gi|297734603|emb|CBI16654.3| unnamed protein product [Vitis vinifera] Length = 577 Score = 144 bits (363), Expect = 1e-32 Identities = 68/102 (66%), Positives = 83/102 (81%) Frame = -1 Query: 306 GYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKLD 127 GY QNKQP EAL+LF E + ++ EPD+VT+VSVLPAIADLG L+LG WVH FVR+KKLD Sbjct: 256 GYRQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALDLGGWVHRFVRRKKLD 315 Query: 126 RSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 R++NV TAL+DMYAKCGEI K++ VF N+ K + WNALIN Sbjct: 316 RATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETASWNALIN 357 >gb|EPS70650.1| hypothetical protein M569_04107 [Genlisea aurea] Length = 564 Score = 142 bits (359), Expect = 4e-32 Identities = 65/103 (63%), Positives = 84/103 (81%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYC+NKQP EA+ LFRE L+Q F+PD VTVVS+LPAIA+LG ++LGN + F+++ +L Sbjct: 283 GGYCRNKQPHEAVALFRELLSQKRFDPDGVTVVSILPAIAELGAVDLGNRMFEFIKRNQL 342 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DRSSNV T+ VDM+AKCGEI KA+SVF +++ KV WNALIN Sbjct: 343 DRSSNVSTSAVDMFAKCGEISKARSVFDDLQTKVTCTWNALIN 385 >ref|XP_006293876.1| hypothetical protein CARUB_v10022861mg [Capsella rubella] gi|565472150|ref|XP_006293877.1| hypothetical protein CARUB_v10022861mg [Capsella rubella] gi|482562584|gb|EOA26774.1| hypothetical protein CARUB_v10022861mg [Capsella rubella] gi|482562585|gb|EOA26775.1| hypothetical protein CARUB_v10022861mg [Capsella rubella] Length = 596 Score = 140 bits (352), Expect = 2e-31 Identities = 61/103 (59%), Positives = 81/103 (78%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQP A+RLF+E ++ +PDDVT++SVLPAI+D G L LG W HHFV++KKL Sbjct: 287 GGYCQNKQPHAAIRLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGEWCHHFVQRKKL 346 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D+ VCTA++DMY+KCG+IEKAKS+F + K + WNA+I+ Sbjct: 347 DKKVKVCTAVLDMYSKCGKIEKAKSMFDEMPEKEVASWNAMIH 389 >ref|XP_002511599.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223548779|gb|EEF50268.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 429 Score = 140 bits (352), Expect = 2e-31 Identities = 67/103 (65%), Positives = 81/103 (78%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGY QN + EAL+LF E +++ FEPD VTVVSVLPAIADLG L+LG+W+H F R KK+ Sbjct: 255 GGYSQNNKSHEALKLFHEMQSRTLFEPDKVTVVSVLPAIADLGALDLGSWIHQFARLKKI 314 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DRS NVCTALVDMYAKCGE+ KA+ VF ++ K + WNALIN Sbjct: 315 DRSINVCTALVDMYAKCGEMLKARRVFDSMPKKEEASWNALIN 357 >ref|NP_182015.1| Pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546766|sp|Q1PEU4.2|PP201_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g44880 gi|2344896|gb|AAC31836.1| hypothetical protein [Arabidopsis thaliana] gi|330255385|gb|AEC10479.1| Pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 555 Score = 139 bits (350), Expect = 4e-31 Identities = 61/103 (59%), Positives = 80/103 (77%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQPQE +RLF+E ++ +PDDVT++SVLPAI+D G L LG W H FV++KKL Sbjct: 246 GGYCQNKQPQEGIRLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGEWCHCFVQRKKL 305 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D+ VCTA++DMY+KCGEIEKAK +F + K + WNA+I+ Sbjct: 306 DKKVKVCTAILDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIH 348 >ref|XP_002880144.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297325983|gb|EFH56403.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 555 Score = 139 bits (349), Expect = 5e-31 Identities = 61/103 (59%), Positives = 80/103 (77%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQPQEA+RLF+E ++ +PDDVT++SVLPAI+D G L LG W H FV++K L Sbjct: 246 GGYCQNKQPQEAIRLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGEWCHCFVQRKNL 305 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D+ VCTA++DMY+KCGEIEKAK +F + K + WNA+I+ Sbjct: 306 DKKVKVCTAILDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIH 348 >gb|EXB37620.1| hypothetical protein L484_021826 [Morus notabilis] Length = 594 Score = 138 bits (348), Expect = 7e-31 Identities = 62/102 (60%), Positives = 79/102 (77%) Frame = -1 Query: 306 GYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKLD 127 GYCQN QP EAL+LFRE + EP++VT+VS+LPAIADLG L+LG W+H FV+KK+ D Sbjct: 287 GYCQNNQPLEALKLFREMQDSTLLEPNEVTIVSILPAIADLGALDLGCWIHQFVQKKRFD 346 Query: 126 RSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 +CTAL+DMYAKCGE+EKAK++F + K + WNALIN Sbjct: 347 GLVKICTALIDMYAKCGEVEKAKTIFDEMPEKEIASWNALIN 388 >gb|ABE65907.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 555 Score = 137 bits (344), Expect = 2e-30 Identities = 60/103 (58%), Positives = 79/103 (76%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQNKQPQE + LF+E ++ +PDDVT++SVLPAI+D G L LG W H FV++KKL Sbjct: 246 GGYCQNKQPQEGITLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGEWCHCFVQRKKL 305 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 D+ VCTA++DMY+KCGEIEKAK +F + K + WNA+I+ Sbjct: 306 DKKVKVCTAILDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIH 348 >gb|EMJ05119.1| hypothetical protein PRUPE_ppa015039mg, partial [Prunus persica] Length = 487 Score = 135 bits (341), Expect = 4e-30 Identities = 67/103 (65%), Positives = 78/103 (75%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGY QNKQP EAL+LF E + + E D VTVVS+LPAIADLG L+LG WVH FVR+KKL Sbjct: 176 GGYSQNKQPHEALKLFHELQSNMSLELDGVTVVSILPAIADLGALDLGLWVHKFVRRKKL 235 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DR N+CTALVDMYAK GEI +AK +F + K + WNALIN Sbjct: 236 DRVINICTALVDMYAKRGEITEAKRLFDEMPEKETASWNALIN 278 >ref|XP_003551717.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like [Glycine max] Length = 599 Score = 130 bits (326), Expect = 2e-28 Identities = 62/103 (60%), Positives = 77/103 (74%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQN++ +AL LFRE T S EP++VTVV VLPA+ADLG L+LG W+H F +KKL Sbjct: 292 GGYCQNRRSHDALELFREMQTAS-VEPNEVTVVCVLPAVADLGALDLGRWIHRFALRKKL 350 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DRS+ + TAL+DMYAKCGEI KAK F + + + WNALIN Sbjct: 351 DRSARIGTALIDMYAKCGEITKAKLAFEGMTERETASWNALIN 393 >gb|ESW11626.1| hypothetical protein PHAVU_008G046100g [Phaseolus vulgaris] Length = 602 Score = 128 bits (322), Expect = 7e-28 Identities = 61/103 (59%), Positives = 77/103 (74%) Frame = -1 Query: 309 GGYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GGYCQN++ EAL LFRE T EP++VT++ VLPA+ADLG L+LG W+H F ++KK Sbjct: 294 GGYCQNRRSHEALELFREMQTVL-VEPNEVTILCVLPAVADLGALDLGGWIHRFAQRKKF 352 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DRS+ V TAL+DMYAKCGEI KAK VF + + + WNALIN Sbjct: 353 DRSARVGTALIDMYAKCGEITKAKLVFEEMTERETASWNALIN 395 >ref|XP_004986805.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like [Setaria italica] Length = 535 Score = 126 bits (316), Expect = 3e-27 Identities = 58/103 (56%), Positives = 79/103 (76%), Gaps = 1/103 (0%) Frame = -1 Query: 306 GYCQNKQPQEALRLFREFLTQS-NFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKL 130 GYC+N++ +AL+LFRE +QS FEP++VT+VSV+PAI D G ++LG WVH F R+K L Sbjct: 217 GYCRNRESGKALKLFRELQSQSCPFEPNEVTLVSVIPAITDTGAMDLGRWVHEFARRKGL 276 Query: 129 DRSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 DR +NV TAL+DMY KCG ++AK VF+ + K A+ WNA+IN Sbjct: 277 DRRANVATALIDMYLKCGNADEAKRVFNQLNPKDATCWNAIIN 319 >ref|XP_004489099.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like [Cicer arietinum] Length = 600 Score = 122 bits (306), Expect = 5e-26 Identities = 58/102 (56%), Positives = 74/102 (72%) Frame = -1 Query: 306 GYCQNKQPQEALRLFREFLTQSNFEPDDVTVVSVLPAIADLGILELGNWVHHFVRKKKLD 127 GYC+N++P +AL+LF E + E + VTVVSVLPA+ADL L+LG W+H FV++ +LD Sbjct: 292 GYCENRRPHDALKLFCEMRGSLDMEMNKVTVVSVLPAVADLSALDLGVWIHGFVQRNRLD 351 Query: 126 RSSNVCTALVDMYAKCGEIEKAKSVFSNIKVKVASVWNALIN 1 +VC ALVDMYAKCGEI KAK +F + K S WNALIN Sbjct: 352 EDVHVCNALVDMYAKCGEIGKAKLLFEEMNEKDTSSWNALIN 393