BLASTX nr result
ID: Catharanthus22_contig00035873
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00035873 (470 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002265420.1| PREDICTED: pentatricopeptide repeat-containi... 124 2e-26 ref|XP_006481967.1| PREDICTED: pentatricopeptide repeat-containi... 120 2e-25 gb|ACP39950.1| pentatricopeptide repeat protein [Gossypium hirsu... 117 2e-24 gb|EOY32165.1| Pentatricopeptide repeat protein isoform 1 [Theob... 116 3e-24 gb|EMJ22674.1| hypothetical protein PRUPE_ppa004164mg [Prunus pe... 115 6e-24 gb|EXB87349.1| Serine/threonine-protein phosphatase PP2A-4 catal... 115 8e-24 ref|XP_002526196.1| pentatricopeptide repeat-containing protein,... 103 2e-20 ref|XP_004295922.1| PREDICTED: pentatricopeptide repeat-containi... 100 2e-19 ref|XP_006430418.1| hypothetical protein CICLE_v10011492mg [Citr... 96 4e-18 emb|CBI31119.3| unnamed protein product [Vitis vinifera] 96 4e-18 ref|XP_002327460.1| predicted protein [Populus trichocarpa] 94 2e-17 gb|EOY32166.1| Pentatricopeptide repeat protein isoform 2 [Theob... 92 7e-17 ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citr... 79 6e-13 ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containi... 78 1e-12 ref|XP_006447073.1| hypothetical protein CICLE_v10018352mg [Citr... 77 2e-12 gb|EMJ07625.1| hypothetical protein PRUPE_ppa001946mg [Prunus pe... 77 3e-12 ref|XP_006847829.1| hypothetical protein AMTR_s00029p00051450 [A... 71 1e-10 gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis] 70 4e-10 ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containi... 70 4e-10 ref|XP_002517091.1| pentatricopeptide repeat-containing protein,... 69 5e-10 >ref|XP_002265420.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vinifera] Length = 537 Score = 124 bits (310), Expect = 2e-26 Identities = 60/114 (52%), Positives = 82/114 (71%) Frame = -2 Query: 343 MVSSFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPN 164 M++S SKYLKSS R LSLLDQ C++++ KQIQSHL VSGT+ DPFAAG++++ AV Sbjct: 1 MLASVGSKYLKSSRRVLSLLDQ-CVTMAHIKQIQSHLTVSGTLFDPFAAGRIISFCAVSA 59 Query: 163 YGDVSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 GD+SH Y LF +P R++++WNTM+R F + +P + L K M+ GFL NN Sbjct: 60 QGDISHAYLLFLSLPRRTSFIWNTMLRAFTDKKEPATVLSLYKYMLSTGFLPNN 113 >ref|XP_006481967.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Citrus sinensis] Length = 546 Score = 120 bits (301), Expect = 2e-25 Identities = 60/111 (54%), Positives = 82/111 (73%) Frame = -2 Query: 334 SFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGD 155 +FASK LK+S R++SLLDQ CL++ Q KQIQSHL VSGT+ DPFA GK++ + + GD Sbjct: 3 AFASKILKASKRSVSLLDQ-CLNMKQIKQIQSHLTVSGTLWDPFAVGKIIGFCSASDIGD 61 Query: 154 VSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 +SH Y LF + R+T++WNTMIR FAE +P+ A L K+M+++ FL NN Sbjct: 62 LSHGYRLFVCLQYRTTFIWNTMIRGFAEKNEPIKAFALYKQMLRSDFLPNN 112 >gb|ACP39950.1| pentatricopeptide repeat protein [Gossypium hirsutum] gi|227462998|gb|ACP39951.1| pentatricopeptide repeat protein [Gossypium hirsutum] Length = 532 Score = 117 bits (293), Expect = 2e-24 Identities = 58/109 (53%), Positives = 79/109 (72%) Frame = -2 Query: 328 ASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVS 149 ASKY+K+S R LSLL+Q C ++SQ KQ+ SHLIVS + DPFAAGK+++ FAV + D+S Sbjct: 3 ASKYVKASQRCLSLLEQ-CRTMSQIKQMHSHLIVSASRLDPFAAGKIISLFAVSSNADIS 61 Query: 148 HVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 H Y LF +P R+T++WNT+IR F E + A+ L K M++ GFL NN Sbjct: 62 HAYKLFLSLPHRTTFIWNTIIRIFVEKNENATALSLYKNMLQTGFLPNN 110 >gb|EOY32165.1| Pentatricopeptide repeat protein isoform 1 [Theobroma cacao] Length = 537 Score = 116 bits (291), Expect = 3e-24 Identities = 58/112 (51%), Positives = 83/112 (74%) Frame = -2 Query: 337 SSFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYG 158 SSFAS+++K+S R+L+LLD+ CL++SQ KQ+QSHLIVS T+ DP+AAG++++ AV Sbjct: 4 SSFASRFVKASRRSLALLDR-CLTMSQIKQMQSHLIVSATLLDPYAAGRIISFCAVSADA 62 Query: 157 DVSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 D+SH Y LF + R+T++WNT+IR F E AI L K M+++GFL NN Sbjct: 63 DLSHAYKLFLSLQHRTTFIWNTIIRAFVERNANATAISLYKNMLQSGFLPNN 114 >gb|EMJ22674.1| hypothetical protein PRUPE_ppa004164mg [Prunus persica] Length = 526 Score = 115 bits (288), Expect = 6e-24 Identities = 56/105 (53%), Positives = 76/105 (72%) Frame = -2 Query: 316 LKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYA 137 +KSS RALS+LDQ CL+++ KQIQSHL VSGT+ DP+AA K++ V N GD+ H + Sbjct: 1 MKSSKRALSILDQ-CLTMAHIKQIQSHLTVSGTLFDPYAAAKIITFCTVSNSGDLRHAFQ 59 Query: 136 LFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 LFR +P R+TY+WN +IR AEN + M A+ L +MI++G L NN Sbjct: 60 LFRHMPYRTTYIWNVVIRALAENNESMRAVSLYSDMIQSGLLPNN 104 >gb|EXB87349.1| Serine/threonine-protein phosphatase PP2A-4 catalytic subunit [Morus notabilis] Length = 783 Score = 115 bits (287), Expect = 8e-24 Identities = 58/105 (55%), Positives = 75/105 (71%) Frame = -2 Query: 316 LKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYA 137 +KSS R LSLLDQ CL+ +Q KQIQSHL VSGT+ DP+AA K++A + + V H Y Sbjct: 1 MKSSRRVLSLLDQ-CLTFTQIKQIQSHLAVSGTLFDPYAAAKIIAFCSTSDASYVCHAYR 59 Query: 136 LFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 LF +P R+TY+WNTMIR FAE + + A+ L K M++NGFL NN Sbjct: 60 LFHCMPYRTTYIWNTMIRAFAEGNEAIRALSLYKNMLENGFLPNN 104 >ref|XP_002526196.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223534500|gb|EEF36200.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 491 Score = 103 bits (258), Expect = 2e-20 Identities = 54/110 (49%), Positives = 77/110 (70%), Gaps = 1/110 (0%) Frame = -2 Query: 328 ASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVP-NYGDV 152 +S Y+K+S + L LLD+ C ++SQ KQIQ+HL VSGT+ DP+AA K+++ A+ N + Sbjct: 3 SSTYVKASKKTLFLLDK-CRTISQIKQIQTHLTVSGTLKDPYAAAKIISFCALSSNQFSL 61 Query: 151 SHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 SH Y LF + RST++WNT+IR FAE +P AI+L K M+ + FL NN Sbjct: 62 SHAYRLFLGLRHRSTFIWNTVIRAFAEKNEPRKAIMLFKNMLYSNFLPNN 111 >ref|XP_004295922.1| PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Fragaria vesca subsp. vesca] Length = 524 Score = 100 bits (250), Expect = 2e-19 Identities = 50/100 (50%), Positives = 72/100 (72%), Gaps = 1/100 (1%) Frame = -2 Query: 313 KSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVP-NYGDVSHVYA 137 ++S RALSLLDQ CL+++ KQ+QSHL VSGT+ DP+AA K+++ A+ N + H Y Sbjct: 3 RASKRALSLLDQ-CLTMAHIKQVQSHLAVSGTLFDPYAAAKVISFCALSHNPHHLRHAYH 61 Query: 136 LFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNG 17 LFRF+P+R+TY+WN MIRTF ++ P A+ L M++ G Sbjct: 62 LFRFMPTRTTYIWNLMIRTFTDSNDPTQALSLYTNMLRTG 101 >ref|XP_006430418.1| hypothetical protein CICLE_v10011492mg [Citrus clementina] gi|557532475|gb|ESR43658.1| hypothetical protein CICLE_v10011492mg [Citrus clementina] Length = 519 Score = 96.3 bits (238), Expect = 4e-18 Identities = 45/88 (51%), Positives = 62/88 (70%) Frame = -2 Query: 265 LSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFRFIPSRSTYMWNTMI 86 + Q KQIQSHL VSGT+ DPFA GK++ + + GD+SH Y LF + R+T++WNTMI Sbjct: 1 MKQIKQIQSHLTVSGTLWDPFAVGKIIGFCSASDIGDLSHGYRLFVCLQYRTTFIWNTMI 60 Query: 85 RTFAENGQPMNAILLSKEMIKNGFLLNN 2 R FAE +P+ A L K+M+++ FL NN Sbjct: 61 RGFAEKNEPIKAFALYKQMLRSDFLPNN 88 >emb|CBI31119.3| unnamed protein product [Vitis vinifera] Length = 512 Score = 96.3 bits (238), Expect = 4e-18 Identities = 43/88 (48%), Positives = 61/88 (69%) Frame = -2 Query: 265 LSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFRFIPSRSTYMWNTMI 86 ++ KQIQSHL VSGT+ DPFAAG++++ AV GD+SH Y LF +P R++++WNTM+ Sbjct: 1 MAHIKQIQSHLTVSGTLFDPFAAGRIISFCAVSAQGDISHAYLLFLSLPRRTSFIWNTML 60 Query: 85 RTFAENGQPMNAILLSKEMIKNGFLLNN 2 R F + +P + L K M+ GFL NN Sbjct: 61 RAFTDKKEPATVLSLYKYMLSTGFLPNN 88 >ref|XP_002327460.1| predicted protein [Populus trichocarpa] Length = 540 Score = 93.6 bits (231), Expect = 2e-17 Identities = 47/110 (42%), Positives = 74/110 (67%), Gaps = 1/110 (0%) Frame = -2 Query: 328 ASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPN-YGDV 152 AS+YLK+S + +SLLDQ L++SQ KQIQSHL V+ T+ DP+AA K+++ A N + Sbjct: 3 ASRYLKASKKTISLLDQKGLTISQLKQIQSHLTVTATLKDPYAAAKIISLHAHSNARSSL 62 Query: 151 SHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLNN 2 + LF + ++ST++WNTM++ F E + + A L K M+++ +L NN Sbjct: 63 FYAERLFLCLQNKSTFIWNTMMQAFVEKNEAVRAFSLYKHMLESNYLPNN 112 >gb|EOY32166.1| Pentatricopeptide repeat protein isoform 2 [Theobroma cacao] Length = 511 Score = 92.0 bits (227), Expect = 7e-17 Identities = 44/88 (50%), Positives = 61/88 (69%) Frame = -2 Query: 265 LSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFRFIPSRSTYMWNTMI 86 +SQ KQ+QSHLIVS T+ DP+AAG++++ AV D+SH Y LF + R+T++WNT+I Sbjct: 1 MSQIKQMQSHLIVSATLLDPYAAGRIISFCAVSADADLSHAYKLFLSLQHRTTFIWNTII 60 Query: 85 RTFAENGQPMNAILLSKEMIKNGFLLNN 2 R F E AI L K M+++GFL NN Sbjct: 61 RAFVERNANATAISLYKNMLQSGFLPNN 88 >ref|XP_006437177.1| hypothetical protein CICLE_v10031197mg [Citrus clementina] gi|557539373|gb|ESR50417.1| hypothetical protein CICLE_v10031197mg [Citrus clementina] Length = 534 Score = 79.0 bits (193), Expect = 6e-13 Identities = 45/110 (40%), Positives = 67/110 (60%) Frame = -2 Query: 355 SPTVMVSSFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGF 176 SPT M SK++ S LSLLD+ C S+ K+I +HLI +G DP AA ++LA F Sbjct: 9 SPTSM-----SKFI-SDQPLLSLLDKQCTSMKDLKKIHAHLIKTGLPKDPIAASRILA-F 61 Query: 175 AVPNYGDVSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMI 26 GD+++ Y +F I + ++WNT+IR F+++ P NAILL +M+ Sbjct: 62 CTSPAGDINYAYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDML 111 >ref|XP_006484869.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Citrus sinensis] Length = 534 Score = 78.2 bits (191), Expect = 1e-12 Identities = 44/110 (40%), Positives = 66/110 (60%) Frame = -2 Query: 355 SPTVMVSSFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGF 176 SPT M SK++ S LSLLD+ C S+ K+I +HLI +G DP AA ++L F Sbjct: 9 SPTSM-----SKFI-SDQPLLSLLDKQCTSMKDLKKIHAHLIKTGLAKDPIAASRILT-F 61 Query: 175 AVPNYGDVSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMI 26 GD+++ Y +F I + ++WNT+IR F+++ P NAILL +M+ Sbjct: 62 CTSPAGDINYAYLVFTQIKKPNLFIWNTIIRGFSQSSTPRNAILLFIDML 111 >ref|XP_006447073.1| hypothetical protein CICLE_v10018352mg [Citrus clementina] gi|568831618|ref|XP_006470058.1| PREDICTED: pentatricopeptide repeat-containing protein At5g56310-like [Citrus sinensis] gi|557549684|gb|ESR60313.1| hypothetical protein CICLE_v10018352mg [Citrus clementina] Length = 463 Score = 77.0 bits (188), Expect = 2e-12 Identities = 43/97 (44%), Positives = 57/97 (58%) Frame = -2 Query: 295 LSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFRFIPS 116 LSLL C S+ Q KQI + +I+S I D FAA +LLA A+ + GD+S+ LF I S Sbjct: 18 LSLLADKCKSMHQLKQIHAQMIISSRIQDHFAASRLLAFCALSSSGDLSYATRLFNSIQS 77 Query: 115 RSTYMWNTMIRTFAENGQPMNAILLSKEMIKNGFLLN 5 + +MWNT+IR A + P AI L M + GF N Sbjct: 78 PNHFMWNTLIRAQASSLNPDKAIFLYMNMRRTGFAPN 114 >gb|EMJ07625.1| hypothetical protein PRUPE_ppa001946mg [Prunus persica] Length = 738 Score = 76.6 bits (187), Expect = 3e-12 Identities = 36/109 (33%), Positives = 65/109 (59%) Frame = -2 Query: 352 PTVMVSSFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFA 173 P +F++ SS+ ALSL+DQ C S+ Q KQ+ + ++ +G + DP++A KL+ A Sbjct: 15 PNSSSPTFSTDLRFSSHPALSLIDQ-CTSIKQLKQVHAQMLRTGVLFDPYSASKLITASA 73 Query: 172 VPNYGDVSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMI 26 + ++ + + +F IP + Y WNT+IR +A + P +IL+ +M+ Sbjct: 74 LSSFSSLDYARQVFDQIPQPNVYTWNTLIRAYASSSDPAESILVFLDML 122 >ref|XP_006847829.1| hypothetical protein AMTR_s00029p00051450 [Amborella trichopoda] gi|548851134|gb|ERN09410.1| hypothetical protein AMTR_s00029p00051450 [Amborella trichopoda] Length = 139 Score = 71.2 bits (173), Expect = 1e-10 Identities = 29/89 (32%), Positives = 56/89 (62%) Frame = -2 Query: 268 SLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFRFIPSRSTYMWNTM 89 ++ + +Q+ +H+ +G DPF +LLA FA+ N+GD+ H A+F + + +M+NT+ Sbjct: 26 TMLELRQLHAHITTAGLSFDPFTISRLLALFAISNHGDIDHAQAIFAHFKNPTPFMYNTI 85 Query: 88 IRTFAENGQPMNAILLSKEMIKNGFLLNN 2 IR F+++ +P+ AI +M+ +G +N Sbjct: 86 IRGFSQSSEPVKAIQTFNQMLSSGIYPDN 114 >gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis] Length = 739 Score = 69.7 bits (169), Expect = 4e-10 Identities = 32/94 (34%), Positives = 58/94 (61%) Frame = -2 Query: 307 SYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFR 128 +Y LSL++Q C SL + KQI + ++ +G DPF+A KL+ A+ ++ + + + +F Sbjct: 31 NYPLLSLIEQ-CTSLKELKQIHAQMLRTGLFFDPFSASKLITVCAMSSFSSLDYAHQVFD 89 Query: 127 FIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMI 26 IP + Y WNT+IR +A + P+ +I++ M+ Sbjct: 90 QIPKPNLYTWNTIIRAYASSSDPIQSIVVFLRML 123 >ref|XP_006356395.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Solanum tuberosum] Length = 522 Score = 69.7 bits (169), Expect = 4e-10 Identities = 41/113 (36%), Positives = 66/113 (58%), Gaps = 1/113 (0%) Frame = -2 Query: 355 SPTVMVSSFASKYLKSSYRALSLLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGF 176 S +V S+ SK++ S L +L+ C +++ K+I +HLI SG I D A+ ++LA Sbjct: 2 SSSVCSSTSISKFI-SDQPYLHMLETKCTTMTDLKKIHAHLIKSGLIKDKIASSRVLAFS 60 Query: 175 AV-PNYGDVSHVYALFRFIPSRSTYMWNTMIRTFAENGQPMNAILLSKEMIKN 20 A P GD+++ +F I + + + WNT+IR F+E+ P AI L EM+ N Sbjct: 61 AKSPPIGDINYANLVFTHIENPNLFTWNTIIRGFSESSTPQYAIHLFIEMLNN 113 >ref|XP_002517091.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223543726|gb|EEF45254.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 606 Score = 69.3 bits (168), Expect = 5e-10 Identities = 32/89 (35%), Positives = 53/89 (59%) Frame = -2 Query: 289 LLDQPCLSLSQFKQIQSHLIVSGTIADPFAAGKLLAGFAVPNYGDVSHVYALFRFIPSRS 110 L+ + C + Q KQIQ+H+I++G I F ++LA A+ + GD+ H + LF I + Sbjct: 163 LIMESCTCMIQLKQIQAHMIITGLITHTFPVSRVLAFCALADTGDIRHAHLLFNQIEYPN 222 Query: 109 TYMWNTMIRTFAENGQPMNAILLSKEMIK 23 TY+WNTMIR F+ P+ + +M++ Sbjct: 223 TYIWNTMIRGFSNAKMPVMGLSFFWQMVR 251