BLASTX nr result
ID: Catharanthus23_contig00006307
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00006307 (2909 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi... 332 4e-88 gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis] 327 2e-86 ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi... 326 4e-86 gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, put... 312 6e-82 emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] 310 3e-81 ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi... 308 1e-80 ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containi... 306 4e-80 ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr... 305 1e-79 ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutr... 305 1e-79 ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi... 303 2e-79 gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlise... 303 4e-79 ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi... 301 1e-78 ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi... 301 1e-78 ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containi... 299 4e-78 gb|ESW28907.1| hypothetical protein PHAVU_002G027800g [Phaseolus... 299 5e-78 ref|XP_002868835.1| pentatricopeptide repeat-containing protein ... 298 1e-77 ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi... 295 8e-77 ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi... 295 8e-77 ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi... 295 8e-77 ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar... 294 2e-76 >ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform 1 [Solanum lycopersicum] gi|460415472|ref|XP_004253082.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform 2 [Solanum lycopersicum] Length = 340 Score = 332 bits (852), Expect = 4e-88 Identities = 166/233 (71%), Positives = 195/233 (83%), Gaps = 2/233 (0%) Frame = +2 Query: 2000 DNESQGRGRGVVEDSDFLERFKLGFDRTKR-VNSDSK-ESPDQTADTTAEPTPEDADEIF 2173 +NESQ + + + DFL+RF+LGFDR + N++ K ES D PEDADEIF Sbjct: 102 NNESQMKSQ---DSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSEAPPAPPEDADEIF 158 Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV+GFCKAQK Sbjct: 159 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 218 Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533 DDA+RIFRKMQ NGI PNAFSYGI+++ L +GKRLDDA EFC EMLEAGH+PNV TF+ L Sbjct: 219 DDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVVTFVTL 278 Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 VD FC+EK LE+ +++I T+RQKGF++D+KAVRE+LDKKGPF+P+VWEAI GK Sbjct: 279 VDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLDKKGPFLPVVWEAILGK 331 >gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis] Length = 306 Score = 327 bits (838), Expect = 2e-86 Identities = 167/233 (71%), Positives = 185/233 (79%), Gaps = 6/233 (2%) Frame = +2 Query: 2012 QGRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTAEPTP------EDADEIF 2173 +GRG ED FLE+FKLG D +K +E P + A P P EDADEIF Sbjct: 70 RGRGPLTSEDDSFLEKFKLGLDSSK---DGMQEKPRREAARPKPPLPQPPPPPEDADEIF 126 Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM+EKGTIPEVVIYTAVV+GFCKAQKL Sbjct: 127 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKL 186 Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533 DDA+RIFRKMQSNGI PNAFSY +LVQ LC GKRL+D EFC EMLEAGH+PNVATF+GL Sbjct: 187 DDAVRIFRKMQSNGIEPNAFSYSVLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATFVGL 246 Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 VD C EKG+EE + +I LR KGF+L+EKAVRE+LDKK F P VWEAIFGK Sbjct: 247 VDGLCEEKGVEEAQGVIGKLRDKGFLLNEKAVREFLDKKASFSPSVWEAIFGK 299 >ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Solanum tuberosum] Length = 354 Score = 326 bits (835), Expect = 4e-86 Identities = 167/247 (67%), Positives = 193/247 (78%), Gaps = 7/247 (2%) Frame = +2 Query: 1973 RKSAQFGYGDNESQGRGRGVVEDSDFLERFKLGFDRTKRVNSDSK-------ESPDQTAD 2131 R+S + G +SQ + DFL+RF+LGFDR K N ++ ES D Sbjct: 107 RRSGENNGGQMKSQ-------DSEDFLKRFQLGFDR-KEENPNTNPALHPKGESSDSPVS 158 Query: 2132 TTAEPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 2311 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI Sbjct: 159 EAPPAPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 218 Query: 2312 YTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEML 2491 YTAVV+GF KAQK DDA+RIFRKMQ NGI PNAFSYGIL++ L +G RLDDA EFC EML Sbjct: 219 YTAVVDGFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEML 278 Query: 2492 EAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLV 2671 EAGH+PNV TF+ LVD FC+EK LE+ +++I T+RQKGF++D+KAVREYLDKKGPF+P+V Sbjct: 279 EAGHSPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREYLDKKGPFLPVV 338 Query: 2672 WEAIFGK 2692 WEAI GK Sbjct: 339 WEAILGK 345 >gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 345 Score = 312 bits (799), Expect = 6e-82 Identities = 156/221 (70%), Positives = 184/221 (83%), Gaps = 3/221 (1%) Frame = +2 Query: 2039 DSDFLERFKLGFD--RTKRVNSDSKESPDQTADTTAEPTP-EDADEIFKKMKETGLIPNA 2209 D +FLE+FKLG D R K+ + + + + +P+P +DADEIFKKMKETGLIPNA Sbjct: 118 DENFLEKFKLGLDNKRGKQPSDSEAAALLRRKEQEEKPSPPQDADEIFKKMKETGLIPNA 177 Query: 2210 VAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQS 2389 VAMLDGLCKDGL+QEAMKLFG MREKGTIPEVVIYTAVV+GFCKA KLDDA RIFRKMQS Sbjct: 178 VAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKMQS 237 Query: 2390 NGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEE 2569 G+TPN+FSY +L+Q L R +LDDA EFC EMLEAGH+PNV TF+GLVD C+EKG+EE Sbjct: 238 KGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTFVGLVDGLCKEKGVEE 297 Query: 2570 TESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 +S+I TL+QKGFVL++KAVR++LDKK PF PLVWEAIFGK Sbjct: 298 AQSVIGTLKQKGFVLNDKAVRQFLDKKAPFSPLVWEAIFGK 338 >emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] Length = 381 Score = 310 bits (793), Expect = 3e-81 Identities = 151/224 (67%), Positives = 183/224 (81%), Gaps = 2/224 (0%) Frame = +2 Query: 2027 GVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKETGLI 2200 G + FLERFKLG + +R + P +Q A+ E P++ADEIF+KMKE+GLI Sbjct: 151 GATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLI 210 Query: 2201 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 2380 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKA++LDDA+RIFRK Sbjct: 211 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRK 270 Query: 2381 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 2560 MQ+NGI+PNAFSY +L++ + +G RLD A +FC EMLEAGH+PNVAT + L+ FC+EKG Sbjct: 271 MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 330 Query: 2561 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 +EE +++I TL+QKG +D+KAVREYLDKKGP PLVWEA FGK Sbjct: 331 VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 374 >ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Vitis vinifera] Length = 380 Score = 308 bits (788), Expect = 1e-80 Identities = 150/224 (66%), Positives = 183/224 (81%), Gaps = 2/224 (0%) Frame = +2 Query: 2027 GVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKETGLI 2200 G + FLERFKLG + +R + P +Q A+ E P++ADEIF+KMKE+GLI Sbjct: 150 GATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLI 209 Query: 2201 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 2380 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKA++L+DA+RIFRK Sbjct: 210 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLNDAVRIFRK 269 Query: 2381 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 2560 MQ+NGI+PNAFSY +L++ + +G RLD A +FC EMLEAGH+PNVAT + L+ FC+EKG Sbjct: 270 MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 329 Query: 2561 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 +EE +++I TL+QKG +D+KAVREYLDKKGP PLVWEA FGK Sbjct: 330 VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 373 >ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Fragaria vesca subsp. vesca] Length = 309 Score = 306 bits (783), Expect = 4e-80 Identities = 151/222 (68%), Positives = 184/222 (82%), Gaps = 2/222 (0%) Frame = +2 Query: 2033 VEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTA-EPTP-EDADEIFKKMKETGLIPN 2206 ++DS FLE+ K+G +++KR E P + A+ +P P E+A+EIFKKMKETGLIPN Sbjct: 87 LQDSSFLEKLKMGLEKSKR------EKPQEAAEPPPPQPQPTEEANEIFKKMKETGLIPN 140 Query: 2207 AVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQ 2386 AVAMLDGLCKDGLVQEAMKLFG MREKGTIPEVVIYTAVVEGFCK +K +DA R+FRKMQ Sbjct: 141 AVAMLDGLCKDGLVQEAMKLFGSMREKGTIPEVVIYTAVVEGFCKGRKPEDAKRVFRKMQ 200 Query: 2387 SNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLE 2566 SNGI PNAFSY ++VQ LCR +++ DAAEFCGEMLEAGH+PNV TF+GLVD C+E G+E Sbjct: 201 SNGIVPNAFSYNVMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVGLVDGVCKENGVE 260 Query: 2567 ETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 ES+I L+Q+G+V++EKAVRE+LDK+ F P+VWEAIFGK Sbjct: 261 GGESVIGKLKQRGYVVNEKAVREFLDKRASFSPMVWEAIFGK 302 >ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina] gi|557524309|gb|ESR35615.1| hypothetical protein CICLE_v10028759mg [Citrus clementina] Length = 344 Score = 305 bits (780), Expect = 1e-79 Identities = 155/256 (60%), Positives = 195/256 (76%), Gaps = 8/256 (3%) Frame = +2 Query: 1949 FSPASQKNRKSAQFGYGDNESQGRGR-GVVEDSDFLERFKLGFDR-------TKRVNSDS 2104 F+ Q+ R Q N + + GV D +FL++FKL D+ + + Sbjct: 84 FNNYQQQQRPQQQSFQSPNRPRPKSPDGVQSDENFLDQFKLAIDKKPDNPQQNESLGERQ 143 Query: 2105 KESPDQTADTTAEPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE 2284 ++ P++ + +EP P++ADEIFKKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMRE Sbjct: 144 EQKPNRN-EPISEP-PQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMRE 201 Query: 2285 KGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDD 2464 KGTIPEVVIYTAVV+GFCKAQK DDA RIFRKMQSNGI PNAFSY +L+Q L + +L++ Sbjct: 202 KGTIPEVVIYTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEE 261 Query: 2465 AAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLD 2644 A E+C EMLEAGH+PNV TF+GLVD CREKG+E+ +S+I TL++KGF++++KAVRE+LD Sbjct: 262 AVEYCIEMLEAGHSPNVTTFVGLVDGLCREKGVEKAQSVIATLKEKGFLVNDKAVREFLD 321 Query: 2645 KKGPFMPLVWEAIFGK 2692 KK PF VWEAIFGK Sbjct: 322 KKAPFSSSVWEAIFGK 337 >ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum] gi|557091098|gb|ESQ31745.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum] Length = 295 Score = 305 bits (780), Expect = 1e-79 Identities = 150/232 (64%), Positives = 178/232 (76%) Frame = +2 Query: 1997 GDNESQGRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTAEPTPEDADEIFK 2176 G N ++ + D DFLE+FKLG + ++ K P Q P PED++EIFK Sbjct: 59 GSNSARPSQPAKLSDHDFLEQFKLGVKQDDSRKTEQK--PQQETSPEPLPAPEDSEEIFK 116 Query: 2177 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLD 2356 MKE GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVEGFCKA K++ Sbjct: 117 NMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIE 176 Query: 2357 DAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLV 2536 DA RIFRKMQ+NGI PNAFSYG+LVQ LC LDDA +FCGEMLE+GH+PNV+TF+GLV Sbjct: 177 DAKRIFRKMQTNGIVPNAFSYGVLVQGLCNCNMLDDAVDFCGEMLESGHSPNVSTFVGLV 236 Query: 2537 DCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 D CREKG+E+ +S I TL QKGF ++ KAV+E+++KK F L WEAIF K Sbjct: 237 DALCREKGVEQAQSAIDTLNQKGFAVNLKAVKEFMEKKASFPSLAWEAIFKK 288 >ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Citrus sinensis] Length = 387 Score = 303 bits (777), Expect = 2e-79 Identities = 153/229 (66%), Positives = 184/229 (80%), Gaps = 7/229 (3%) Frame = +2 Query: 2027 GVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTT-------AEPTPEDADEIFKKMK 2185 GV D +FL++FKL D+ K N ES Q + +EP P++ADEIFKKMK Sbjct: 154 GVQSDENFLDQFKLAIDK-KPGNPQQNESLGQRQEQKPNRNEPISEP-PQEADEIFKKMK 211 Query: 2186 ETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAI 2365 ETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVIYTAVV+GFCKAQK DDA Sbjct: 212 ETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAK 271 Query: 2366 RIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCF 2545 RIFRKMQSNGI PNAFSY +L+Q L + +L++A E+C EMLEAGH+PNV TF+GLVD Sbjct: 272 RIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGLVDGL 331 Query: 2546 CREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 CRE+G+E+ +S+I TL++KGF++++KAVRE+LDKK PF VWEAIFGK Sbjct: 332 CRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGK 380 >gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlisea aurea] Length = 272 Score = 303 bits (775), Expect = 4e-79 Identities = 150/223 (67%), Positives = 181/223 (81%), Gaps = 6/223 (2%) Frame = +2 Query: 2039 DSDFLERFKLGFDRT------KRVNSDSKESPDQTADTTAEPTPEDADEIFKKMKETGLI 2200 DSDFLERFKLGFDR + V S+ ++ + PE+ADEIF+KMKETGLI Sbjct: 41 DSDFLERFKLGFDRKTTTPPGRVVESEKAGGEEEKEEQQPLSPPENADEIFRKMKETGLI 100 Query: 2201 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 2380 PNAVAMLDGLCKDGLVQ+A+KLFG MREKG+IP+VV+YTAVVEGFCKAQK DDAIRIF+K Sbjct: 101 PNAVAMLDGLCKDGLVQDALKLFGTMREKGSIPDVVVYTAVVEGFCKAQKHDDAIRIFKK 160 Query: 2381 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 2560 M+SNGI PNAFSY IL++ LC GKRL+DA+ F EMLE G++PN+ATF GLV+ +C+EKG Sbjct: 161 MKSNGIAPNAFSYQILIRGLCDGKRLEDASGFTAEMLETGYSPNLATFTGLVNGWCQEKG 220 Query: 2561 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFG 2689 LEE ++L+ ++QKGF ++EKAVREYLDKKGPF VWEAI G Sbjct: 221 LEEAKTLVGAMKQKGFSVEEKAVREYLDKKGPFSSPVWEAILG 263 >ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Glycine max] Length = 395 Score = 301 bits (771), Expect = 1e-78 Identities = 159/233 (68%), Positives = 182/233 (78%), Gaps = 8/233 (3%) Frame = +2 Query: 2018 RGRGVVEDSDFLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIF 2173 R G DS FL +FKLGFD K VN + SK+S + +P P+DADEIF Sbjct: 158 RDAGQSGDS-FLNKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIF 215 Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353 KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ KA K Sbjct: 216 KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 275 Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533 DDA RIFRKMQS+G++PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV TF+GL Sbjct: 276 DDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGL 335 Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 VD FC EKG+EE +S I TL KGFV++EKAVR++LDKK PF P VWEAIFGK Sbjct: 336 VDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 388 >ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Glycine max] Length = 388 Score = 301 bits (771), Expect = 1e-78 Identities = 159/233 (68%), Positives = 182/233 (78%), Gaps = 8/233 (3%) Frame = +2 Query: 2018 RGRGVVEDSDFLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIF 2173 R G DS FL +FKLGFD K VN + SK+S + +P P+DADEIF Sbjct: 151 RDAGQSGDS-FLNKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIF 208 Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353 KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ KA K Sbjct: 209 KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 268 Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533 DDA RIFRKMQS+G++PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV TF+GL Sbjct: 269 DDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGL 328 Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 VD FC EKG+EE +S I TL KGFV++EKAVR++LDKK PF P VWEAIFGK Sbjct: 329 VDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 381 >ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Cicer arietinum] gi|502161087|ref|XP_004512019.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Cicer arietinum] Length = 371 Score = 299 bits (766), Expect = 4e-78 Identities = 160/259 (61%), Positives = 189/259 (72%), Gaps = 16/259 (6%) Frame = +2 Query: 1964 QKNRKSAQFGYGDNESQGRGRGVVEDS--------DFLERFKLGFDRTKRVNSDSKESPD 2119 ++ KS+Q G +GR V E S FL++FKLGFD K N ES Sbjct: 112 RRGSKSSQIDLGF-----QGRNVAEVSRDAGQLGDSFLDKFKLGFD-DKVGNHSEVESNG 165 Query: 2120 QTADTTA--------EPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGL 2275 QT + A EP P+DADEIFKKMKETGLIPNAVAMLDGLCKDG VQEA+KLFGL Sbjct: 166 QTEGSRASDTDQPAQEPMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGNVQEALKLFGL 225 Query: 2276 MREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKR 2455 MREKGTIPE+VIYTAVVEG+ KA K DDAIRIFRKMQSNGI+PNA+S+ +L+Q L + R Sbjct: 226 MREKGTIPEIVIYTAVVEGYTKAHKADDAIRIFRKMQSNGISPNAYSFTVLIQGLYKCSR 285 Query: 2456 LDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVRE 2635 L DA EFC EMLEAG++ NV TF+G+VD FC+E G+EE + +I TL +KGF DEKAVRE Sbjct: 286 LQDALEFCVEMLEAGYSLNVTTFVGVVDGFCKEDGVEEAKGVIKTLTEKGFAYDEKAVRE 345 Query: 2636 YLDKKGPFMPLVWEAIFGK 2692 +LDKK PF P +WEA+FGK Sbjct: 346 FLDKKAPFSPSIWEAVFGK 364 >gb|ESW28907.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris] gi|561030329|gb|ESW28908.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris] Length = 451 Score = 299 bits (765), Expect = 5e-78 Identities = 151/225 (67%), Positives = 175/225 (77%), Gaps = 10/225 (4%) Frame = +2 Query: 2048 FLERFKLGFD----------RTKRVNSDSKESPDQTADTTAEPTPEDADEIFKKMKETGL 2197 FL++FKL FD +K+ + +PDQ A EP P+DADEIFKKMKETGL Sbjct: 223 FLDKFKLAFDDKTVNLSEVAASKQSEEAKRSNPDQQAQ---EPVPQDADEIFKKMKETGL 279 Query: 2198 IPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFR 2377 IPNAVAMLDGLCKDGLVQEA+KLF LMREKGTIPE+VIYTAVVEG+ KA K DDA RIFR Sbjct: 280 IPNAVAMLDGLCKDGLVQEALKLFALMREKGTIPEIVIYTAVVEGYTKADKADDAKRIFR 339 Query: 2378 KMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREK 2557 KMQS+GI+PNAFSY ++VQ L + +RL DA EFC EMLEAGH+PNV TF+ LVD FC+EK Sbjct: 340 KMQSSGISPNAFSYTVIVQGLYKCRRLQDAFEFCVEMLEAGHSPNVTTFVSLVDGFCKEK 399 Query: 2558 GLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 G+EE + + TL KGF DEKAVR++LDKK PF P VWEAIFGK Sbjct: 400 GVEEAKDAVKTLTGKGFAFDEKAVRQFLDKKTPFSPSVWEAIFGK 444 >ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314671|gb|EFH45094.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 301 Score = 298 bits (762), Expect = 1e-77 Identities = 153/228 (67%), Positives = 177/228 (77%), Gaps = 2/228 (0%) Frame = +2 Query: 2015 GRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKE 2188 G+ + D FLE+FKLG VN DS+E+P +Q P PED+DEIFKKMKE Sbjct: 74 GKIDNTLSDDGFLEQFKLG------VNQDSQETPKPEQYPQDPLLP-PEDSDEIFKKMKE 126 Query: 2189 TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIR 2368 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVEGFCKA K++DA R Sbjct: 127 GGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKR 186 Query: 2369 IFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFC 2548 IFRKMQ+NGITPNAFSYG+LVQ L LDDA FC EMLE+GH+PN+ TF+GLVD C Sbjct: 187 IFRKMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALC 246 Query: 2549 REKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 REKG+E+ +S I L QKGF L+ KAV+E++DK+ PF L WEAIF K Sbjct: 247 REKGVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAWEAIFKK 294 >ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X3 [Glycine max] gi|571435834|ref|XP_006573590.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X4 [Glycine max] Length = 403 Score = 295 bits (755), Expect = 8e-77 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%) Frame = +2 Query: 2048 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 2203 FL++FKLGFD K VN + SK+S + +P P+DA+EIFKKMKETGLIP Sbjct: 175 FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 233 Query: 2204 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 2383 NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM Sbjct: 234 NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 293 Query: 2384 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 2563 QS+GI+PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV F+GLVD FC EKG+ Sbjct: 294 QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 353 Query: 2564 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK Sbjct: 354 EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 396 >ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Glycine max] Length = 431 Score = 295 bits (755), Expect = 8e-77 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%) Frame = +2 Query: 2048 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 2203 FL++FKLGFD K VN + SK+S + +P P+DA+EIFKKMKETGLIP Sbjct: 203 FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 261 Query: 2204 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 2383 NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM Sbjct: 262 NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 321 Query: 2384 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 2563 QS+GI+PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV F+GLVD FC EKG+ Sbjct: 322 QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 381 Query: 2564 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK Sbjct: 382 EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 424 >ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Glycine max] Length = 457 Score = 295 bits (755), Expect = 8e-77 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%) Frame = +2 Query: 2048 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 2203 FL++FKLGFD K VN + SK+S + +P P+DA+EIFKKMKETGLIP Sbjct: 229 FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 287 Query: 2204 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 2383 NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM Sbjct: 288 NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 347 Query: 2384 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 2563 QS+GI+PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV F+GLVD FC EKG+ Sbjct: 348 QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 407 Query: 2564 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK Sbjct: 408 EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 450 >ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|79326453|ref|NP_001031806.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g38150 gi|4467121|emb|CAB37555.1| putative protein [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1| putative protein [Arabidopsis thaliana] gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332661485|gb|AEE86885.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 302 Score = 294 bits (752), Expect = 2e-76 Identities = 149/227 (65%), Positives = 173/227 (76%), Gaps = 1/227 (0%) Frame = +2 Query: 2015 GRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPD-QTADTTAEPTPEDADEIFKKMKET 2191 G+ + D FLE+FKLG VN DS+E+P + P PED+DEIFKKMKE Sbjct: 75 GKSDTTLSDDGFLEQFKLG------VNQDSRETPKPEQYPQEPLPPPEDSDEIFKKMKEG 128 Query: 2192 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRI 2371 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVE FCKA K++DA RI Sbjct: 129 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRI 188 Query: 2372 FRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCR 2551 FRKMQ+NGI PNAFSYG+LVQ L LDDA FC EMLE+GH+PNV TF+ LVD CR Sbjct: 189 FRKMQNNGIAPNAFSYGVLVQGLYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCR 248 Query: 2552 EKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692 KG+E+ +S I TL QKGF ++ KAV+E++DK+ PF L WEAIF K Sbjct: 249 VKGVEQAQSAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKK 295