BLASTX nr result
ID: Catharanthus22_contig00007556
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007556 (1777 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...  332   2e-88
gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]    327   1e-86
ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi...  326   2e-86
gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, put...  312   3e-82
emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]  310   2e-81
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...  308   6e-81
ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containi...  306   2e-80
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...  305   5e-80
ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutr...  305   5e-80
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...  303   1e-79
gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlise...  303   2e-79
ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi...  301   6e-79
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...  301   6e-79
ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containi...  299   2e-78
gb|ESW28907.1| hypothetical protein PHAVU_002G027800g [Phaseolus...  299   3e-78
ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...  298   7e-78
ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi...  295   4e-77
ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi...
295 4e-77 ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi... 295 4e-77 ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar... 294 1e-76 >ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform 1 [Solanum lycopersicum] gi|460415472|ref|XP_004253082.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform 2 [Solanum lycopersicum] Length = 340 Score = 332 bits (852), Expect = 2e-88 Identities = 166/233 (71%), Positives = 195/233 (83%), Gaps = 2/233 (0%) Frame = +2 Query: 863 DNESQGRGRGVVEDSDFLERFKLGFDRTKR-VNSDSK-ESPDQTADTTAEPTPEDADEIF 1036 +NESQ + + + DFL+RF+LGFDR + N++ K ES D PEDADEIF Sbjct: 102 NNESQMKSQ---DSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSEAPPAPPEDADEIF 158 Query: 1037 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 1216 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV+GFCKAQK Sbjct: 159 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 218 Query: 1217 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 1396 DDA+RIFRKMQ NGI PNAFSYGI+++ L +GKRLDDA EFC EMLEAGH+PNV TF+ L Sbjct: 219 DDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVVTFVTL 278 Query: 1397 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 VD FC+EK LE+ +++I T+RQKGF++D+KAVRE+LDKKGPF+P+VWEAI GK Sbjct: 279 VDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLDKKGPFLPVVWEAILGK 331 >gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis] Length = 306 Score = 327 bits (838), Expect = 1e-86 Identities = 167/233 (71%), Positives = 185/233 (79%), Gaps = 6/233 (2%) Frame = +2 Query: 875 QGRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTAEPTP------EDADEIF 1036 +GRG ED FLE+FKLG D +K +E P + A P P EDADEIF Sbjct: 70 RGRGPLTSEDDSFLEKFKLGLDSSK---DGMQEKPRREAARPKPPLPQPPPPPEDADEIF 126 Query: 1037 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 1216 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM+EKGTIPEVVIYTAVV+GFCKAQKL Sbjct: 127 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKL 186 
Query: 1217 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 1396 DDA+RIFRKMQSNGI PNAFSY +LVQ LC GKRL+D EFC EMLEAGH+PNVATF+GL Sbjct: 187 DDAVRIFRKMQSNGIEPNAFSYSVLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATFVGL 246 Query: 1397 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 VD C EKG+EE + +I LR KGF+L+EKAVRE+LDKK F P VWEAIFGK Sbjct: 247 VDGLCEEKGVEEAQGVIGKLRDKGFLLNEKAVREFLDKKASFSPSVWEAIFGK 299 >ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Solanum tuberosum] Length = 354 Score = 326 bits (835), Expect = 2e-86 Identities = 167/247 (67%), Positives = 193/247 (78%), Gaps = 7/247 (2%) Frame = +2 Query: 836 RKSAQFGYGDNESQGRGRGVVEDSDFLERFKLGFDRTKRVNSDSK-------ESPDQTAD 994 R+S + G +SQ + DFL+RF+LGFDR K N ++ ES D Sbjct: 107 RRSGENNGGQMKSQ-------DSEDFLKRFQLGFDR-KEENPNTNPALHPKGESSDSPVS 158 Query: 995 TTAEPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 1174 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI Sbjct: 159 EAPPAPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 218 Query: 1175 YTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEML 1354 YTAVV+GF KAQK DDA+RIFRKMQ NGI PNAFSYGIL++ L +G RLDDA EFC EML Sbjct: 219 YTAVVDGFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEML 278 Query: 1355 EAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLV 1534 EAGH+PNV TF+ LVD FC+EK LE+ +++I T+RQKGF++D+KAVREYLDKKGPF+P+V Sbjct: 279 EAGHSPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREYLDKKGPFLPVV 338 Query: 1535 WEAIFGK 1555 WEAI GK Sbjct: 339 WEAILGK 345 >gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 345 Score = 312 bits (799), Expect = 3e-82 Identities = 156/221 (70%), Positives = 184/221 (83%), Gaps = 3/221 (1%) Frame = +2 Query: 902 DSDFLERFKLGFD--RTKRVNSDSKESPDQTADTTAEPTP-EDADEIFKKMKETGLIPNA 1072 D +FLE+FKLG D R K+ + + + + +P+P +DADEIFKKMKETGLIPNA Sbjct: 118 DENFLEKFKLGLDNKRGKQPSDSEAAALLRRKEQEEKPSPPQDADEIFKKMKETGLIPNA 177 Query: 1073 
VAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQS 1252 VAMLDGLCKDGL+QEAMKLFG MREKGTIPEVVIYTAVV+GFCKA KLDDA RIFRKMQS Sbjct: 178 VAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKMQS 237 Query: 1253 NGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEE 1432 G+TPN+FSY +L+Q L R +LDDA EFC EMLEAGH+PNV TF+GLVD C+EKG+EE Sbjct: 238 KGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTFVGLVDGLCKEKGVEE 297 Query: 1433 TESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 +S+I TL+QKGFVL++KAVR++LDKK PF PLVWEAIFGK Sbjct: 298 AQSVIGTLKQKGFVLNDKAVRQFLDKKAPFSPLVWEAIFGK 338 >emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] Length = 381 Score = 310 bits (793), Expect = 2e-81 Identities = 151/224 (67%), Positives = 183/224 (81%), Gaps = 2/224 (0%) Frame = +2 Query: 890 GVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKETGLI 1063 G + FLERFKLG + +R + P +Q A+ E P++ADEIF+KMKE+GLI Sbjct: 151 GATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLI 210 Query: 1064 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 1243 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKA++LDDA+RIFRK Sbjct: 211 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRK 270 Query: 1244 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 1423 MQ+NGI+PNAFSY +L++ + +G RLD A +FC EMLEAGH+PNVAT + L+ FC+EKG Sbjct: 271 MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 330 Query: 1424 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 +EE +++I TL+QKG +D+KAVREYLDKKGP PLVWEA FGK Sbjct: 331 VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 374 >ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Vitis vinifera] Length = 380 Score = 308 bits (788), Expect = 6e-81 Identities = 150/224 (66%), Positives = 183/224 (81%), Gaps = 2/224 (0%) Frame = +2 Query: 890 GVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKETGLI 1063 G + FLERFKLG + +R + P +Q A+ E P++ADEIF+KMKE+GLI Sbjct: 150 
GATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLI 209 Query: 1064 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 1243 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKA++L+DA+RIFRK Sbjct: 210 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLNDAVRIFRK 269 Query: 1244 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 1423 MQ+NGI+PNAFSY +L++ + +G RLD A +FC EMLEAGH+PNVAT + L+ FC+EKG Sbjct: 270 MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 329 Query: 1424 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 +EE +++I TL+QKG +D+KAVREYLDKKGP PLVWEA FGK Sbjct: 330 VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 373 >ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Fragaria vesca subsp. vesca] Length = 309 Score = 306 bits (783), Expect = 2e-80 Identities = 151/222 (68%), Positives = 184/222 (82%), Gaps = 2/222 (0%) Frame = +2 Query: 896 VEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTA-EPTP-EDADEIFKKMKETGLIPN 1069 ++DS FLE+ K+G +++KR E P + A+ +P P E+A+EIFKKMKETGLIPN Sbjct: 87 LQDSSFLEKLKMGLEKSKR------EKPQEAAEPPPPQPQPTEEANEIFKKMKETGLIPN 140 Query: 1070 AVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQ 1249 AVAMLDGLCKDGLVQEAMKLFG MREKGTIPEVVIYTAVVEGFCK +K +DA R+FRKMQ Sbjct: 141 AVAMLDGLCKDGLVQEAMKLFGSMREKGTIPEVVIYTAVVEGFCKGRKPEDAKRVFRKMQ 200 Query: 1250 SNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLE 1429 SNGI PNAFSY ++VQ LCR +++ DAAEFCGEMLEAGH+PNV TF+GLVD C+E G+E Sbjct: 201 SNGIVPNAFSYNVMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVGLVDGVCKENGVE 260 Query: 1430 ETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 ES+I L+Q+G+V++EKAVRE+LDK+ F P+VWEAIFGK Sbjct: 261 GGESVIGKLKQRGYVVNEKAVREFLDKRASFSPMVWEAIFGK 302 >ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina] gi|557524309|gb|ESR35615.1| hypothetical protein CICLE_v10028759mg [Citrus clementina] Length = 344 Score = 305 bits (780), Expect = 5e-80 Identities = 155/256 (60%), Positives = 195/256 (76%), Gaps = 8/256 (3%) 
Frame = +2 Query: 812 FSPASQKNRKSAQFGYGDNESQGRGR-GVVEDSDFLERFKLGFDR-------TKRVNSDS 967 F+ Q+ R Q N + + GV D +FL++FKL D+ + + Sbjct: 84 FNNYQQQQRPQQQSFQSPNRPRPKSPDGVQSDENFLDQFKLAIDKKPDNPQQNESLGERQ 143 Query: 968 KESPDQTADTTAEPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE 1147 ++ P++ + +EP P++ADEIFKKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMRE Sbjct: 144 EQKPNRN-EPISEP-PQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMRE 201 Query: 1148 KGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDD 1327 KGTIPEVVIYTAVV+GFCKAQK DDA RIFRKMQSNGI PNAFSY +L+Q L + +L++ Sbjct: 202 KGTIPEVVIYTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEE 261 Query: 1328 AAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLD 1507 A E+C EMLEAGH+PNV TF+GLVD CREKG+E+ +S+I TL++KGF++++KAVRE+LD Sbjct: 262 AVEYCIEMLEAGHSPNVTTFVGLVDGLCREKGVEKAQSVIATLKEKGFLVNDKAVREFLD 321 Query: 1508 KKGPFMPLVWEAIFGK 1555 KK PF VWEAIFGK Sbjct: 322 KKAPFSSSVWEAIFGK 337 >ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum] gi|557091098|gb|ESQ31745.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum] Length = 295 Score = 305 bits (780), Expect = 5e-80 Identities = 150/232 (64%), Positives = 178/232 (76%) Frame = +2 Query: 860 GDNESQGRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTAEPTPEDADEIFK 1039 G N ++ + D DFLE+FKLG + ++ K P Q P PED++EIFK Sbjct: 59 GSNSARPSQPAKLSDHDFLEQFKLGVKQDDSRKTEQK--PQQETSPEPLPAPEDSEEIFK 116 Query: 1040 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLD 1219 MKE GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVEGFCKA K++ Sbjct: 117 NMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIE 176 Query: 1220 DAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLV 1399 DA RIFRKMQ+NGI PNAFSYG+LVQ LC LDDA +FCGEMLE+GH+PNV+TF+GLV Sbjct: 177 DAKRIFRKMQTNGIVPNAFSYGVLVQGLCNCNMLDDAVDFCGEMLESGHSPNVSTFVGLV 236 Query: 1400 DCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 D CREKG+E+ +S I TL QKGF ++ KAV+E+++KK F L WEAIF K Sbjct: 237 
DALCREKGVEQAQSAIDTLNQKGFAVNLKAVKEFMEKKASFPSLAWEAIFKK 288 >ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Citrus sinensis] Length = 387 Score = 303 bits (777), Expect = 1e-79 Identities = 153/229 (66%), Positives = 184/229 (80%), Gaps = 7/229 (3%) Frame = +2 Query: 890 GVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTT-------AEPTPEDADEIFKKMK 1048 GV D +FL++FKL D+ K N ES Q + +EP P++ADEIFKKMK Sbjct: 154 GVQSDENFLDQFKLAIDK-KPGNPQQNESLGQRQEQKPNRNEPISEP-PQEADEIFKKMK 211 Query: 1049 ETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAI 1228 ETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVIYTAVV+GFCKAQK DDA Sbjct: 212 ETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAK 271 Query: 1229 RIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCF 1408 RIFRKMQSNGI PNAFSY +L+Q L + +L++A E+C EMLEAGH+PNV TF+GLVD Sbjct: 272 RIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGLVDGL 331 Query: 1409 CREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 CRE+G+E+ +S+I TL++KGF++++KAVRE+LDKK PF VWEAIFGK Sbjct: 332 CRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGK 380 >gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlisea aurea] Length = 272 Score = 303 bits (775), Expect = 2e-79 Identities = 150/223 (67%), Positives = 181/223 (81%), Gaps = 6/223 (2%) Frame = +2 Query: 902 DSDFLERFKLGFDRT------KRVNSDSKESPDQTADTTAEPTPEDADEIFKKMKETGLI 1063 DSDFLERFKLGFDR + V S+ ++ + PE+ADEIF+KMKETGLI Sbjct: 41 DSDFLERFKLGFDRKTTTPPGRVVESEKAGGEEEKEEQQPLSPPENADEIFRKMKETGLI 100 Query: 1064 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 1243 PNAVAMLDGLCKDGLVQ+A+KLFG MREKG+IP+VV+YTAVVEGFCKAQK DDAIRIF+K Sbjct: 101 PNAVAMLDGLCKDGLVQDALKLFGTMREKGSIPDVVVYTAVVEGFCKAQKHDDAIRIFKK 160 Query: 1244 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 1423 M+SNGI PNAFSY IL++ LC GKRL+DA+ F EMLE G++PN+ATF GLV+ +C+EKG Sbjct: 161 MKSNGIAPNAFSYQILIRGLCDGKRLEDASGFTAEMLETGYSPNLATFTGLVNGWCQEKG 220 Query: 1424 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFG 
1552 LEE ++L+ ++QKGF ++EKAVREYLDKKGPF VWEAI G Sbjct: 221 LEEAKTLVGAMKQKGFSVEEKAVREYLDKKGPFSSPVWEAILG 263 >ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Glycine max] Length = 395 Score = 301 bits (771), Expect = 6e-79 Identities = 159/233 (68%), Positives = 182/233 (78%), Gaps = 8/233 (3%) Frame = +2 Query: 881 RGRGVVEDSDFLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIF 1036 R G DS FL +FKLGFD K VN + SK+S + +P P+DADEIF Sbjct: 158 RDAGQSGDS-FLNKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIF 215 Query: 1037 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 1216 KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ KA K Sbjct: 216 KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 275 Query: 1217 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 1396 DDA RIFRKMQS+G++PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV TF+GL Sbjct: 276 DDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGL 335 Query: 1397 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 VD FC EKG+EE +S I TL KGFV++EKAVR++LDKK PF P VWEAIFGK Sbjct: 336 VDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 388 >ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Glycine max] Length = 388 Score = 301 bits (771), Expect = 6e-79 Identities = 159/233 (68%), Positives = 182/233 (78%), Gaps = 8/233 (3%) Frame = +2 Query: 881 RGRGVVEDSDFLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIF 1036 R G DS FL +FKLGFD K VN + SK+S + +P P+DADEIF Sbjct: 151 RDAGQSGDS-FLNKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIF 208 Query: 1037 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 1216 KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ KA K Sbjct: 209 KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 268 Query: 1217 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 1396 DDA RIFRKMQS+G++PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV TF+GL Sbjct: 269 
DDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGL 328 Query: 1397 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 VD FC EKG+EE +S I TL KGFV++EKAVR++LDKK PF P VWEAIFGK Sbjct: 329 VDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 381 >ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Cicer arietinum] gi|502161087|ref|XP_004512019.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Cicer arietinum] Length = 371 Score = 299 bits (766), Expect = 2e-78 Identities = 160/259 (61%), Positives = 189/259 (72%), Gaps = 16/259 (6%) Frame = +2 Query: 827 QKNRKSAQFGYGDNESQGRGRGVVEDS--------DFLERFKLGFDRTKRVNSDSKESPD 982 ++ KS+Q G +GR V E S FL++FKLGFD K N ES Sbjct: 112 RRGSKSSQIDLGF-----QGRNVAEVSRDAGQLGDSFLDKFKLGFD-DKVGNHSEVESNG 165 Query: 983 QTADTTA--------EPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGL 1138 QT + A EP P+DADEIFKKMKETGLIPNAVAMLDGLCKDG VQEA+KLFGL Sbjct: 166 QTEGSRASDTDQPAQEPMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGNVQEALKLFGL 225 Query: 1139 MREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKR 1318 MREKGTIPE+VIYTAVVEG+ KA K DDAIRIFRKMQSNGI+PNA+S+ +L+Q L + R Sbjct: 226 MREKGTIPEIVIYTAVVEGYTKAHKADDAIRIFRKMQSNGISPNAYSFTVLIQGLYKCSR 285 Query: 1319 LDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVRE 1498 L DA EFC EMLEAG++ NV TF+G+VD FC+E G+EE + +I TL +KGF DEKAVRE Sbjct: 286 LQDALEFCVEMLEAGYSLNVTTFVGVVDGFCKEDGVEEAKGVIKTLTEKGFAYDEKAVRE 345 Query: 1499 YLDKKGPFMPLVWEAIFGK 1555 +LDKK PF P +WEA+FGK Sbjct: 346 FLDKKAPFSPSIWEAVFGK 364 >gb|ESW28907.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris] gi|561030329|gb|ESW28908.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris] Length = 451 Score = 299 bits (765), Expect = 3e-78 Identities = 151/225 (67%), Positives = 175/225 (77%), Gaps = 10/225 (4%) Frame = +2 Query: 911 FLERFKLGFD----------RTKRVNSDSKESPDQTADTTAEPTPEDADEIFKKMKETGL 1060 FL++FKL FD +K+ + +PDQ A EP P+DADEIFKKMKETGL Sbjct: 223 
FLDKFKLAFDDKTVNLSEVAASKQSEEAKRSNPDQQAQ---EPVPQDADEIFKKMKETGL 279 Query: 1061 IPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFR 1240 IPNAVAMLDGLCKDGLVQEA+KLF LMREKGTIPE+VIYTAVVEG+ KA K DDA RIFR Sbjct: 280 IPNAVAMLDGLCKDGLVQEALKLFALMREKGTIPEIVIYTAVVEGYTKADKADDAKRIFR 339 Query: 1241 KMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREK 1420 KMQS+GI+PNAFSY ++VQ L + +RL DA EFC EMLEAGH+PNV TF+ LVD FC+EK Sbjct: 340 KMQSSGISPNAFSYTVIVQGLYKCRRLQDAFEFCVEMLEAGHSPNVTTFVSLVDGFCKEK 399 Query: 1421 GLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 G+EE + + TL KGF DEKAVR++LDKK PF P VWEAIFGK Sbjct: 400 GVEEAKDAVKTLTGKGFAFDEKAVRQFLDKKTPFSPSVWEAIFGK 444 >ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314671|gb|EFH45094.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 301 Score = 298 bits (762), Expect = 7e-78 Identities = 153/228 (67%), Positives = 177/228 (77%), Gaps = 2/228 (0%) Frame = +2 Query: 878 GRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKE 1051 G+ + D FLE+FKLG VN DS+E+P +Q P PED+DEIFKKMKE Sbjct: 74 GKIDNTLSDDGFLEQFKLG------VNQDSQETPKPEQYPQDPLLP-PEDSDEIFKKMKE 126 Query: 1052 TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIR 1231 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVEGFCKA K++DA R Sbjct: 127 GGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKR 186 Query: 1232 IFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFC 1411 IFRKMQ+NGITPNAFSYG+LVQ L LDDA FC EMLE+GH+PN+ TF+GLVD C Sbjct: 187 IFRKMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALC 246 Query: 1412 REKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 REKG+E+ +S I L QKGF L+ KAV+E++DK+ PF L WEAIF K Sbjct: 247 REKGVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAWEAIFKK 294 >ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X3 [Glycine max] gi|571435834|ref|XP_006573590.1| PREDICTED: pentatricopeptide repeat-containing 
protein At4g38150-like isoform X4 [Glycine max] Length = 403 Score = 295 bits (755), Expect = 4e-77 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%) Frame = +2 Query: 911 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 1066 FL++FKLGFD K VN + SK+S + +P P+DA+EIFKKMKETGLIP Sbjct: 175 FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 233 Query: 1067 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 1246 NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM Sbjct: 234 NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 293 Query: 1247 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 1426 QS+GI+PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV F+GLVD FC EKG+ Sbjct: 294 QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 353 Query: 1427 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK Sbjct: 354 EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 396 >ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Glycine max] Length = 431 Score = 295 bits (755), Expect = 4e-77 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%) Frame = +2 Query: 911 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 1066 FL++FKLGFD K VN + SK+S + +P P+DA+EIFKKMKETGLIP Sbjct: 203 FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 261 Query: 1067 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 1246 NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM Sbjct: 262 NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 321 Query: 1247 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 1426 QS+GI+PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV F+GLVD FC EKG+ Sbjct: 322 QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 381 Query: 1427 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK Sbjct: 382 
EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 424 >ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Glycine max] Length = 457 Score = 295 bits (755), Expect = 4e-77 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%) Frame = +2 Query: 911 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 1066 FL++FKLGFD K VN + SK+S + +P P+DA+EIFKKMKETGLIP Sbjct: 229 FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 287 Query: 1067 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 1246 NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM Sbjct: 288 NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 347 Query: 1247 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 1426 QS+GI+PNAFSY +L+Q L + RL DA EFC EMLEAGH+PNV F+GLVD FC EKG+ Sbjct: 348 QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 407 Query: 1427 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK Sbjct: 408 EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 450 >ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|79326453|ref|NP_001031806.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g38150 gi|4467121|emb|CAB37555.1| putative protein [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1| putative protein [Arabidopsis thaliana] gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332661485|gb|AEE86885.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 302 Score = 294 bits (752), Expect = 1e-76 Identities = 149/227 (65%), Positives = 173/227 (76%), Gaps = 1/227 (0%) Frame = +2 Query: 878 GRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPD-QTADTTAEPTPEDADEIFKKMKET 1054 G+ + D FLE+FKLG 
VN DS+E+P + P PED+DEIFKKMKE Sbjct: 75 GKSDTTLSDDGFLEQFKLG------VNQDSRETPKPEQYPQEPLPPPEDSDEIFKKMKEG 128 Query: 1055 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRI 1234 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVE FCKA K++DA RI Sbjct: 129 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRI 188 Query: 1235 FRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCR 1414 FRKMQ+NGI PNAFSYG+LVQ L LDDA FC EMLE+GH+PNV TF+ LVD CR Sbjct: 189 FRKMQNNGIAPNAFSYGVLVQGLYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCR 248 Query: 1415 EKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 1555 KG+E+ +S I TL QKGF ++ KAV+E++DK+ PF L WEAIF K Sbjct: 249 VKGVEQAQSAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKK 295