BLASTX nr result
ID: Catharanthus22_contig00026054
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00026054 (550 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOY21924.1| Pentatricopeptide repeat-containing protein, puta... 106 4e-21 ref|XP_006354771.1| PREDICTED: pentatricopeptide repeat-containi... 100 3e-19 emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera] 100 4e-19 ref|XP_004242174.1| PREDICTED: pentatricopeptide repeat-containi... 99 9e-19 ref|XP_003634263.1| PREDICTED: pentatricopeptide repeat-containi... 98 2e-18 emb|CBI15198.3| unnamed protein product [Vitis vinifera] 98 2e-18 ref|XP_004301139.1| PREDICTED: pentatricopeptide repeat-containi... 97 3e-18 ref|XP_002514728.1| pentatricopeptide repeat-containing protein,... 92 6e-17 ref|XP_006440635.1| hypothetical protein CICLE_v10024595mg [Citr... 92 1e-16 ref|XP_004167903.1| PREDICTED: pentatricopeptide repeat-containi... 89 9e-16 ref|NP_200948.1| pentatricopeptide repeat-containing protein [Ar... 69 1e-09 ref|XP_002864731.1| pentatricopeptide repeat-containing protein ... 68 1e-09 ref|XP_006394512.1| hypothetical protein EUTSA_v10005336mg [Eutr... 67 3e-09 gb|EMJ12012.1| hypothetical protein PRUPE_ppa016961mg, partial [... 63 5e-08 >gb|EOY21924.1| Pentatricopeptide repeat-containing protein, putative [Theobroma cacao] Length = 676 Score = 106 bits (264), Expect = 4e-21 Identities = 59/101 (58%), Positives = 71/101 (70%), Gaps = 2/101 (1%) Frame = +2 Query: 254 LNSKTQEEALEIFHS--ATINPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427 LNS+T +AL +F+S INP KNL+ +SAIIH LT AK Y ARCLIK LI+ LQ Sbjct: 41 LNSQTPHQALNLFNSNIKLINPSKNLEPYSAIIHVLTGAKLYTDARCLIKYLIKTLQSSL 100 Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 KPRRAC +F ALS+L + T NV+G LIIA EMGLI+E Sbjct: 101 KPRRACHLIFNALSKLQTSKFTPNVFGSLIIAFSEMGLIEE 141 >ref|XP_006354771.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61400-like [Solanum tuberosum] Length = 675 Score = 100 bits (248), Expect = 3e-19 Identities = 53/101 (52%), Positives = 72/101 (71%), Gaps = 2/101 (1%) Frame = +2 Query: 254 LNSKTQEEALEIFHSATI--NPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427 LN+KT EA+ +F+S + +P K+L +HSAIIH LT A+ Y+ ARCLIK LIE L+K Sbjct: 44 LNAKTCSEAMRLFNSTILRTDPTKDLTLHSAIIHYLTRARLYLDARCLIKRLIENLRKNS 103 Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 PR+ CS +F L ++ S S+ NV+GVLIIAL EMG +D+ Sbjct: 104 NPRKVCSLIFNDLGKIDS-GSSCNVFGVLIIALSEMGFVDD 143 >emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera] Length = 1060 Score = 99.8 bits (247), Expect = 4e-19 Identities = 63/145 (43%), Positives = 83/145 (57%), Gaps = 3/145 (2%) Frame = +2 Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQF-XXXXXXXXXXXXXXXXLNSKTQEEALEIFHSA 301 MLK FPPKS+ + +K + L +T +ALE+FHS Sbjct: 1 MLKSFPPKSRRIYAKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHSV 60 Query: 302 T--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQL 475 + + KN +++SAIIH LT AK Y ARCL++DLI+ LQK R+ R C SVF LS+L Sbjct: 61 SRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCLQKSRR-SRICCSVFNVLSRL 119 Query: 476 GSDNSTSNVYGVLIIALCEMGLIDE 550 S T NV+GVLIIA EMGL++E Sbjct: 120 ESSKFTPNVFGVLIIAFSEMGLVEE 144 >ref|XP_004242174.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61400-like [Solanum lycopersicum] Length = 691 Score = 98.6 bits (244), Expect = 9e-19 Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 2/101 (1%) Frame = +2 Query: 254 LNSKTQEEALEIFHSATI--NPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427 LN+K EA+ ++ SA + +P K+L +HSAIIH LT A+ Y+ ARCLIK LIE L+K Sbjct: 44 LNAKNCSEAMRLYDSAIVKTDPTKDLTLHSAIIHYLTRARLYLDARCLIKRLIENLRKNS 103 Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 PR+ CS VF L ++ S S+ NV+GVLIIAL EMG +D+ Sbjct: 104 NPRKVCSLVFNDLGKIDS-GSSCNVFGVLIIALSEMGFVDD 143 >ref|XP_003634263.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61400-like [Vitis vinifera] Length = 665 Score = 97.8 bits (242), Expect = 2e-18 Identities = 62/145 (42%), Positives = 82/145 (56%), Gaps = 3/145 (2%) Frame = +2 Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQF-XXXXXXXXXXXXXXXXLNSKTQEEALEIFHSA 301 MLK FPPKS+ + +K + L +T +ALE+FHS Sbjct: 1 MLKSFPPKSRRIYAKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHSV 60 Query: 302 T--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQL 475 + + KN +++SAIIH LT AK Y ARCL++DLI+ LQ R+ R C SVF LS+L Sbjct: 61 SRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCLQNSRR-SRICCSVFNVLSRL 119 Query: 476 GSDNSTSNVYGVLIIALCEMGLIDE 550 S T NV+GVLIIA EMGL++E Sbjct: 120 ESSKFTPNVFGVLIIAFSEMGLVEE 144 >emb|CBI15198.3| unnamed protein product [Vitis vinifera] Length = 948 Score = 97.8 bits (242), Expect = 2e-18 Identities = 62/145 (42%), Positives = 82/145 (56%), Gaps = 3/145 (2%) Frame = +2 Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQF-XXXXXXXXXXXXXXXXLNSKTQEEALEIFHSA 301 MLK FPPKS+ + +K + L +T +ALE+FHS Sbjct: 1 MLKSFPPKSRRIYAKHSSFISRPLSSSPSSSSSDSSPSSLPNSILTCRTANQALELFHSV 60 Query: 302 T--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQL 475 + + KN +++SAIIH LT AK Y ARCL++DLI+ LQ R+ R C SVF LS+L Sbjct: 61 SRRADLAKNPQLYSAIIHVLTGAKLYAKARCLMRDLIQCLQNSRR-SRICCSVFNVLSRL 119 Query: 476 GSDNSTSNVYGVLIIALCEMGLIDE 550 S T NV+GVLIIA EMGL++E Sbjct: 120 ESSKFTPNVFGVLIIAFSEMGLVEE 144 >ref|XP_004301139.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61400-like [Fragaria vesca subsp. vesca] Length = 631 Score = 96.7 bits (239), Expect = 3e-18 Identities = 53/101 (52%), Positives = 70/101 (69%), Gaps = 2/101 (1%) Frame = +2 Query: 254 LNSKTQEEALEIFHSAT--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFR 427 LN ++ EALEIF+SAT + K+ +++SAI H L AK Y+ AR LIK+LI++LQK Sbjct: 27 LNCRSPAEALEIFNSATKQVGARKDFQLYSAITHVLVSAKLYVKARLLIKELIQDLQKSC 86 Query: 428 KPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 K RRAC V+ ALS L + NV+GVLI AL EMGL++E Sbjct: 87 KSRRACELVYNALSGLERSRVSRNVFGVLINALSEMGLVEE 127 >ref|XP_002514728.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223546332|gb|EEF47834.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 517 Score = 92.4 bits (228), Expect = 6e-17 Identities = 62/146 (42%), Positives = 77/146 (52%), Gaps = 4/146 (2%) Frame = +2 Query: 125 MLKIFPPKSKSVLSKRAPIHPHRFLQFXXXXXXXXXXXXXXXXLNSKTQEEALEIFHSAT 304 MLK+FPPK L + + L+SKT E+AL+ F S Sbjct: 1 MLKLFPPKHSLKLVET---------KIRFFTSLSPPSDLTTIILHSKTPEQALDTFTSVL 51 Query: 305 I----NPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKPRRACSSVFKALSQ 472 NP K L ++SA+IH LT AK Y ARCL KDLI+ L + PRR S VF ALSQ Sbjct: 52 KQNPNNPTKKLHLYSAVIHYLTGAKVYPTARCLTKDLIQTLLQSCTPRRVNSLVFNALSQ 111 Query: 473 LGSDNSTSNVYGVLIIALCEMGLIDE 550 L +V+GVLIIA E+GL+DE Sbjct: 112 LRGSKFNPSVFGVLIIAFSEVGLVDE 137 >ref|XP_006440635.1| hypothetical protein CICLE_v10024595mg [Citrus clementina] gi|557542897|gb|ESR53875.1| hypothetical protein CICLE_v10024595mg [Citrus clementina] Length = 697 Score = 91.7 bits (226), Expect = 1e-16 Identities = 62/162 (38%), Positives = 77/162 (47%), Gaps = 20/162 (12%) Frame = +2 Query: 125 MLKIFPPKSKSVL------------------SKRAPIHPHRFLQFXXXXXXXXXXXXXXX 250 MLKIFPPK K S P P L Sbjct: 1 MLKIFPPKQKLTFIDIKINKQPLITLRSISESSSMPSTPFSSLSSSSSSSLPPRSNLTNA 60 Query: 251 XLNSKTQEEALEIFHSAT--INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424 LNSKT +AL +F+S++ +NP K+L +AI + L AK Y +ARCLIKD+ E L K Sbjct: 61 ILNSKTPNQALVLFNSSSKKLNPTKSLAPFAAIFYVLANAKLYKNARCLIKDVTENLLKS 120 Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 RKP C SVF AL+ L +V+ LIIA EMG I+E Sbjct: 121 RKPHHVCYSVFNALNSLEIPKFNPSVFSTLIIAFSEMGHIEE 162 >ref|XP_004167903.1| PREDICTED: pentatricopeptide repeat-containing protein At5g61400-like [Cucumis sativus] Length = 645 Score = 88.6 bits (218), Expect = 9e-16 Identities = 47/99 (47%), Positives = 64/99 (64%) Frame = +2 Query: 254 LNSKTQEEALEIFHSATINPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKFRKP 433 LN ++ +ALE F++A P KN++++SAIIH L +K HAR L+ DL++ L K KP Sbjct: 38 LNCRSPWKALEFFNAA---PEKNIQLYSAIIHVLVGSKLLSHARYLLNDLVQNLVKSHKP 94 Query: 434 RRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 AC F LS+L S T NVYG LII LC+M L++E Sbjct: 95 YHACQLAFSELSRLKSSKFTPNVYGELIIVLCKMELVEE 133 >ref|NP_200948.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75171473|sp|Q9FLJ4.1|PP440_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g61400 gi|9757861|dbj|BAB08495.1| unnamed protein product [Arabidopsis thaliana] gi|332010079|gb|AED97462.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 654 Score = 68.6 bits (166), Expect = 1e-09 Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 3/102 (2%) Frame = +2 Query: 254 LNSKTQEEALEIFHSAT---INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424 L ++ EEA ++F +++ ++ +L+ SA+IH LT A Y ARCLIK LIE L++ Sbjct: 49 LKCRSAEEAFKLFETSSRSRVSKSNDLQSFSAVIHVLTGAHKYTLARCLIKSLIERLKRH 108 Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 +P +F AL + S + V+ +LI+ EMGL +E Sbjct: 109 SEPSNMSHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFEE 150 >ref|XP_002864731.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297310566|gb|EFH40990.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 732 Score = 68.2 bits (165), Expect = 1e-09 Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 3/102 (2%) Frame = +2 Query: 254 LNSKTQEEALEIFHSATIN---PHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424 L ++ EEA +F +++I+ +L+ SA+IH LT A Y ARCLIK LIE L+++ Sbjct: 83 LKCRSAEEAFRLFETSSISRLSKTTDLQSFSAVIHVLTGAHKYTLARCLIKSLIERLRRY 142 Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 +P +F AL + S + V+ +LI+ EMGL ++ Sbjct: 143 SEPTNISHRLFNALEDIQSPKFSIGVFSLLIMEFLEMGLFED 184 >ref|XP_006394512.1| hypothetical protein EUTSA_v10005336mg [Eutrema salsugineum] gi|557091151|gb|ESQ31798.1| hypothetical protein EUTSA_v10005336mg [Eutrema salsugineum] Length = 665 Score = 67.0 bits (162), Expect = 3e-09 Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 3/102 (2%) Frame = +2 Query: 254 LNSKTQEEALEIFHSAT---INPHKNLKVHSAIIHCLTEAKSYIHARCLIKDLIEELQKF 424 L ++ EEAL +F +++ I+ +L+ SA+IH LT A+ + ARCLIK LIE L++ Sbjct: 55 LKCRSAEEALRLFETSSRLKISNINDLRSFSALIHVLTGAQKFTVARCLIKRLIESLRRQ 114 Query: 425 RKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 KP VF AL + + V+ +LI+ L EMG +E Sbjct: 115 NKPTNVSYRVFNALEDIQTPEFCIGVFSLLIMELVEMGWFEE 156 >gb|EMJ12012.1| hypothetical protein PRUPE_ppa016961mg, partial [Prunus persica] Length = 548 Score = 62.8 bits (151), Expect = 5e-08 Identities = 32/55 (58%), Positives = 39/55 (70%) Frame = +2 Query: 386 CLIKDLIEELQKFRKPRRACSSVFKALSQLGSDNSTSNVYGVLIIALCEMGLIDE 550 CLIKDLI +LQKF P AC VF AL+ L S + +V+GVLII L EMGL++E Sbjct: 1 CLIKDLIHDLQKFCNPSLACHLVFDALNCLESSRFSPDVFGVLIIGLSEMGLVEE 55