BLASTX nr result
ID: Catharanthus22_contig00025649
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00025649
         (449 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_004233195.1| PREDICTED: pentatricopeptide repeat-containi...   106   4e-21
ref|XP_006353048.1| PREDICTED: pentatricopeptide repeat-containi...   103   2e-20
gb|EXB75175.1| hypothetical protein L484_025954 [Morus notabilis]      73   3e-11
emb|CBI31086.3| unnamed protein product [Vitis vinifera]               72   6e-11
ref|XP_002268853.1| PREDICTED: pentatricopeptide repeat-containi...    72   6e-11
emb|CAN66615.1| hypothetical protein VITISV_022030 [Vitis vinifera]    72   8e-11
ref|XP_006468372.1| PREDICTED: pentatricopeptide repeat-containi...    69   6e-10
ref|XP_006448816.1| hypothetical protein CICLE_v10014257mg [Citr...    69   6e-10
ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containi...    60   2e-07
ref|NP_193861.1| pentatricopeptide repeat-containing protein [Ar...    59   5e-07
emb|CAA17548.1| putative protein [Arabidopsis thaliana]                59   5e-07
gb|ESW10516.1| hypothetical protein PHAVU_009G216300g [Phaseolus...    59   7e-07
ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containi...    59   7e-07
ref|XP_006413798.1| hypothetical protein EUTSA_v10024394mg [Eutr...    59   9e-07
ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containi...    56   4e-06

>ref|XP_004233195.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Solanum lycopersicum]
          Length = 853

 Score = 106 bits (264), Expect = 4e-21
 Identities = 58/110 (52%), Positives = 74/110 (67%), Gaps = 8/110 (7%)
 Frame = -1

Query: 308 KNLSVFKFRSIQTSTARPFVANFPY--------TEEALASKIAPLLVSCSTPGLNGSSLH 153
           KN+ RSI + A N P+ TEE LASK+AP+L SC++ N L
Sbjct: 6   KNICSIYRRSISVAAAFSSKPNSPFIQDSVIHCTEEVLASKLAPILQSCNSSAEN---LG 62

Query: 152 SIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           S++R+G+QVHAQ+TVNGI+NLGILGTRILGMY+LCN+ DA LF+QL L
Sbjct: 63  SVIRKGEQVHAQVTVNGIDNLGILGTRILGMYVLCNRFIDAKKLFFQLRL 112

>ref|XP_006353048.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Solanum tuberosum]
          Length = 852

 Score = 103 bits (258), Expect = 2e-20
 Identities = 57/110 (51%), Positives = 73/110 (66%), Gaps = 8/110 (7%)
 Frame = -1

Query: 308 KNLSVFKFRSIQTSTARPFVANFPY--------TEEALASKIAPLLVSCSTPGLNGSSLH 153
           KN+ RSI + A N P+ TE+ LASK+AP+L SC+ N L
Sbjct: 6   KNICSIFRRSISVAAAFSSKPNSPFFQDSAFHNTEQVLASKLAPILQSCTNSTEN---LG 62

Query: 152 SIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           S++R+G+QVHAQ+TVNGI+NLGILGTRILGMY+LCN+ DA LF+QL L
Sbjct: 63  SVLRKGEQVHAQVTVNGIDNLGILGTRILGMYVLCNRFIDAKKLFFQLQL 112

>gb|EXB75175.1| hypothetical protein L484_025954 [Morus notabilis]
          Length = 850

 Score = 73.2 bits (178), Expect = 3e-11
 Identities = 34/86 (39%), Positives = 56/86 (65%)
 Frame = -1

Query: 260 RPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGIL 81
           +PF+ NFP TEEAL + +L +C H+++++G+Q+HAQ+ NGI+ ++
Sbjct: 36  KPFI-NFPRTEEALTNHFLSILQACCD--------HALLQQGRQIHAQVIANGISRKNLI 86

Query: 80  GTRILGMYILCNKHSDANNLFYQLNL 3
           GT+IL +Y+LC A N+FY+L+L
Sbjct: 87  GTKILAVYVLCGSFLYAKNVFYRLDL 112

>emb|CBI31086.3| unnamed protein product [Vitis vinifera]
          Length = 766

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 39/102 (38%), Positives = 62/102 (60%), Gaps = 3/102 (2%)
 Frame = -1

Query: 299 SVFKFRSIQTSTA---RPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQ 129
           + FK +S T++ +P + + +++LA ++ +L +C+ P S + +G+Q
Sbjct: 17  TTFKLKSFHTNSVNIGKPLQFSI-HNDDSLAPQLVSILQTCTDP--------SGLSQGRQ 67

Query: 128 VHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           HAQ+ VNGI GILGT++LGMY+LC DA N+FYQL L
Sbjct: 68  AHAQMLVNGIGYNGILGTKLLGMYVLCGAFLDAKNIFYQLRL 109

>ref|XP_002268853.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Vitis vinifera]
          Length = 853

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 39/102 (38%), Positives = 62/102 (60%), Gaps = 3/102 (2%)
 Frame = -1

Query: 299 SVFKFRSIQTSTA---RPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQ 129
           + FK +S T++ +P + + +++LA ++ +L +C+ P S + +G+Q
Sbjct: 17  TTFKLKSFHTNSVNIGKPLQFSI-HNDDSLAPQLVSILQTCTDP--------SGLSQGRQ 67

Query: 128 VHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           HAQ+ VNGI GILGT++LGMY+LC DA N+FYQL L
Sbjct: 68  AHAQMLVNGIGYNGILGTKLLGMYVLCGAFLDAKNIFYQLRL 109

>emb|CAN66615.1| hypothetical protein VITISV_022030 [Vitis vinifera]
          Length = 818

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 39/102 (38%), Positives = 61/102 (59%), Gaps = 3/102 (2%)
 Frame = -1

Query: 299 SVFKFRSIQTST---ARPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQ 129
           + FK +S T++ +P + + +++LA ++ +L +C+ P S + G+Q
Sbjct: 17  TTFKLKSFHTNSINIGKPLQFSI-HNDDSLAPQLVSILQTCTDP--------SGLSHGRQ 67

Query: 128 VHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           HAQ+ VNGI GILGT++LGMY+LC DA N+FYQL L
Sbjct: 68  AHAQMLVNGIGYNGILGTKLLGMYVLCGAFLDAKNIFYQLRL 109

>ref|XP_006468372.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Citrus sinensis]
          Length = 847

 Score = 68.9 bits (167), Expect = 6e-10
 Identities = 42/117 (35%), Positives = 66/117 (56%), Gaps = 5/117 (4%)
 Frame = -1

Query: 338 MYSR---SSIAACKNLSVFKFRSIQTSTAR--PFVANFPYTEEALASKIAPLLVSCSTPG 174
           MY R SS S FK +SI ++ + + T+ ALAS + +L +C+
Sbjct: 1   MYQRLITSSHKCLSTFSAFKCKSIHSNCEHFTNQLVSSHKTDTALASHLGSILEACAD-- 58

Query: 173 LNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           HS++++G+QVH+Q +NGI++ LG +ILGMY+LC DA N+F +L+L
Sbjct: 59  ------HSVLQQGRQVHSQFILNGISDNAALGAKILGMYVLCGGFIDAGNMFPRLDL 109

>ref|XP_006448816.1| hypothetical protein CICLE_v10014257mg [Citrus clementina]
 gi|557551427|gb|ESR62056.1| hypothetical protein CICLE_v10014257mg [Citrus clementina]
          Length = 848

 Score = 68.9 bits (167), Expect = 6e-10
 Identities = 42/117 (35%), Positives = 69/117 (58%), Gaps = 5/117 (4%)
 Frame = -1

Query: 338 MYSRSSIAACKNLSVF---KFRSIQTSTAR--PFVANFPYTEEALASKIAPLLVSCSTPG 174
           MY R ++ K LS+F K +SI ++ + + T+ ALAS + +L +C+
Sbjct: 1   MYQRLITSSHKCLSIFSAFKCKSIHSNCEHFTNQLVSSHKTDTALASHLGSILEACAD-- 58

Query: 173 LNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           HS++++G+QVH+Q +NGI++ LG +ILGMY+LC DA N+F +L+L
Sbjct: 59  ------HSVLQQGRQVHSQFILNGISDNAALGAKILGMYVLCGGFIDAGNMFPRLDL 109

>ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Cicer arietinum]
          Length = 875

 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 30/78 (38%), Positives = 52/78 (66%)
 Frame = -1

Query: 236 YTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMY 57
           + E++LA+++ + CS S+++R +Q+HA + V+G+++ LG+RILGMY
Sbjct: 65  FFEQSLAAQLECMFRDCSNFDA------SMVQRVRQIHAHVVVSGMSDSLTLGSRILGMY 118

Query: 56  ILCNKHSDANNLFYQLNL 3
           ILC + +DA NLF++L L
Sbjct: 119 ILCGRFNDAGNLFFRLQL 136

>ref|NP_193861.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75207660|sp|Q9STE1.1|PP333_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21300
 gi|3402749|emb|CAA20195.1| putative protein [Arabidopsis thaliana]
 gi|7268926|emb|CAB79129.1| putative protein [Arabidopsis thaliana]
 gi|332659037|gb|AEE84437.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 857

 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 30/78 (38%), Positives = 45/78 (57%)
 Frame = -1

Query: 236 YTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMY 57
           + EE + +++ LL +CS P L +R+G+QVHA + VN I+ RILGMY
Sbjct: 29  FLEETIPRRLSLLLQACSNPNL--------LRQGKQVHAFLIVNSISGDSYTDERILGMY 80

Query: 56  ILCNKHSDANNLFYQLNL 3
           +C SD +FY+L+L
Sbjct: 81  AMCGSFSDCGKMFYRLDL 98

>emb|CAA17548.1| putative protein [Arabidopsis thaliana]
          Length = 434

 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 30/78 (38%), Positives = 45/78 (57%)
 Frame = -1

Query: 236 YTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMY 57
           + EE + +++ LL +CS P L +R+G+QVHA + VN I+ RILGMY
Sbjct: 29  FLEETIPRRLSLLLQACSNPNL--------LRQGKQVHAFLIVNSISGDSYTDERILGMY 80

Query: 56  ILCNKHSDANNLFYQLNL 3
           +C SD +FY+L+L
Sbjct: 81  AMCGSFSDCGKMFYRLDL 98

>gb|ESW10516.1| hypothetical protein PHAVU_009G216300g [Phaseolus vulgaris]
          Length = 848

 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 36/118 (30%), Positives = 64/118 (54%), Gaps = 6/118 (5%)
 Frame = -1

Query: 338 MYSRSSIAACKNL--SVFKFRSIQTSTARPFVANF----PYTEEALASKIAPLLVSCSTP 177
           MY+ S++ + L S KF T+ ++ P T+++L + L +CS
Sbjct: 1   MYNTSNLCSIFRLAFSRSKFMHTATNICNNVISKSHLLPPETQDSLTPHLESLFRACSDA 60

Query: 176 GLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           S++++ +QVH Q+ V G++++ L +RILG+Y+LC + DA NLF++L L
Sbjct: 61  --------SLLQQVRQVHTQVVVGGMSDVCSLSSRILGLYVLCGRIKDAENLFFRLEL 110

>ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like [Glycine max]
          Length = 846

 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 29/83 (34%), Positives = 51/83 (61%)
 Frame = -1

Query: 251 VANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTR 72
           V + P T++ L +++ L +CS S++++ +QVH QI V G++++ L +R
Sbjct: 33  VMSKPETQDYLTTQLESLFRACSDA--------SVVQQARQVHTQIIVGGMSDVCALSSR 84

Query: 71  ILGMYILCNKHSDANNLFYQLNL 3
           +LG+Y+LC + SD NLF+ L L
Sbjct: 85  VLGLYVLCGRISDGGNLFFGLEL 107

>ref|XP_006413798.1| hypothetical protein EUTSA_v10024394mg [Eutrema salsugineum]
 gi|557114968|gb|ESQ55251.1| hypothetical protein EUTSA_v10024394mg [Eutrema salsugineum]
          Length = 842

 Score = 58.5 bits (140), Expect = 9e-07
 Identities = 31/85 (36%), Positives = 48/85 (56%)
 Frame = -1

Query: 257 PFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILG 78
           P + Y E+L ++ LL SCS +++R+G+QVHA + VN +++ +
Sbjct: 21  PMEYSSQYLVESLPRRLTLLLQSCSDS--------TLLRQGKQVHAFLIVNRVSSESYMA 72

Query: 77  TRILGMYILCNKHSDANNLFYQLNL 3
           RILGMY +C SD LFY+L+L
Sbjct: 73  ERILGMYAMCGSFSDCGKLFYRLDL 97

>ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like isoform X2 [Glycine max]
 gi|571476945|ref|XP_003535029.2| PREDICTED: pentatricopeptide repeat-containing protein At4g21300-like isoform X1 [Glycine max]
          Length = 848

 Score = 56.2 bits (134), Expect = 4e-06
 Identities = 34/117 (29%), Positives = 65/117 (55%), Gaps = 5/117 (4%)
 Frame = -1

Query: 338 MYSRSS-IAACKNLSVFKFRSIQTSTA----RPFVANFPYTEEALASKIAPLLVSCSTPG 174
           MY+R++ + + LS + + + T+T V P T ++L +++ L +CS
Sbjct: 1   MYNRTTNLCSIFRLSFSRSKLMHTATTSICNNNNVMAKPETLDSLTTQLESLFRACSDA- 59

Query: 173 LNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNL 3
           S++++ +QVH Q+ V G+ ++ +R+LG+Y+LC + DA NLF++L L
Sbjct: 60  -------SMVQQARQVHTQVIVGGMGDVCAPSSRVLGLYVLCGRFRDAGNLFFELEL 109