BLASTX nr result
ID: Rehmannia25_contig00006329
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia25_contig00006329 (949 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlise... 298 2e-78 ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi... 288 2e-75 ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi... 276 8e-72 ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr... 269 1e-69 ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi... 266 7e-69 gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, put... 265 1e-68 ref|XP_002868835.1| pentatricopeptide repeat-containing protein ... 259 8e-67 ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi... 259 1e-66 ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi... 259 1e-66 gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis] 259 1e-66 emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] 254 3e-65 ref|XP_002514391.1| pentatricopeptide repeat-containing protein,... 254 4e-65 gb|AFK36371.1| unknown [Lotus japonicus] 253 6e-65 ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutr... 253 7e-65 ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi... 253 7e-65 ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi... 252 2e-64 ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi... 252 2e-64 ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi... 252 2e-64 ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containi... 249 1e-63 ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar... 248 2e-63 >gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlisea aurea] Length = 272 Score = 298 bits (763), Expect = 2e-78 Identities = 151/222 (68%), Positives = 179/222 (80%), Gaps = 3/222 (1%) Frame = -3 Query: 659 PPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRG-ETDADFLERFKLGFDSKVEN 483 PPEPIPNRPLR +S S PK +R R N + E+D+DFLERFKLGFD K Sbjct: 1 PPEPIPNRPLRGRSV--ASRITPKSDRIRGSGNPRAAAAAESDSDFLERFKLGFDRKTTT 58 Query: 482 P--KIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQE 309 P ++ ++K+ E+ E +PLSPPE+ADEIF+KMKETGLIPNAVAMLDGLCKDGLVQ+ Sbjct: 59 PPGRVVESEKAGGEEEKEEQQPLSPPENADEIFRKMKETGLIPNAVAMLDGLCKDGLVQD 118 Query: 308 AMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVR 129 A+KLFG MREKG+IP+VVVYTAVVEGFCKA K DDA+RIFKKM+SNG+ PN FSYQ+L+R Sbjct: 119 ALKLFGTMREKGSIPDVVVYTAVVEGFCKAQKHDDAIRIFKKMKSNGIAPNAFSYQILIR 178 Query: 128 GLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 GL GKRLEDA GFT EMLE G+SPN+ATFTGLV+G+C+EKG Sbjct: 179 GLCDGKRLEDASGFTAEMLETGYSPNLATFTGLVNGWCQEKG 220 >ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform 1 [Solanum lycopersicum] gi|460415472|ref|XP_004253082.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform 2 [Solanum lycopersicum] Length = 340 Score = 288 bits (738), Expect = 2e-75 Identities = 150/245 (61%), Positives = 176/245 (71%), Gaps = 14/245 (5%) Frame = -3 Query: 698 FSSIDDGLGRSDYPP--EPIPNRPLRRQSY-PYGSPRIPKPN-----------RGREIEN 561 FS D S+YPP EPIPNRPLR S P+ + P+ R N Sbjct: 44 FSDYSDESAESNYPPPPEPIPNRPLRADSRRPFNPSQRQHPSNRSSPNHSTTFRRSSENN 103 Query: 560 QNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKM 381 ++ + + DFL+RF+LGFD K ENP + +S +E P +PPEDADEIFKKM Sbjct: 104 ESQMKSQDSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSE--APPAPPEDADEIFKKM 161 Query: 380 KETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDA 201 KETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA KFDDA Sbjct: 162 KETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDA 221 Query: 200 VRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDG 21 VRIF+KMQ NG+IPN FSY +++RGL GKRL+DA F +EMLEAGHSPN+ TF LVDG Sbjct: 222 VRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVVTFVTLVDG 281 Query: 20 YCREK 6 +C+EK Sbjct: 282 FCKEK 286 >ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Solanum tuberosum] Length = 354 Score = 276 bits (706), Expect = 8e-72 Identities = 147/256 (57%), Positives = 173/256 (67%), Gaps = 26/256 (10%) Frame = -3 Query: 695 SSIDDGLGRSDYPP--EPIPNRPLRRQSY-----------------PYGSPRIPKPNRGR 573 S+ D +S+YPP +PIPNRPLR S P + P Sbjct: 45 SNYSDEFTQSNYPPPPDPIPNRPLRGDSKRPLRDDSRRPLRDDFRRPLRADSSNNPTHST 104 Query: 572 EIE-----NQNSFRGETDADFLERFKLGFDSKVENPKIDSA--DKSIQSEKAENMEPLSP 414 + N + + DFL+RF+LGFD K ENP + A K S+ + P +P Sbjct: 105 TLRRSGENNGGQMKSQDSEDFLKRFQLGFDRKEENPNTNPALHPKGESSDSPVSEAPPAP 164 Query: 413 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 234 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVV+ Sbjct: 165 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVD 224 Query: 233 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 54 GF KA KFDDAVRIF+KMQ NG+IPN FSY +L+RGL G RL+DA+ F +EMLEAGHSP Sbjct: 225 GFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGHSP 284 Query: 53 NIATFTGLVDGYCREK 6 N+ TF LVDG+C+EK Sbjct: 285 NVVTFVTLVDGFCKEK 300 Score = 57.4 bits (137), Expect = 8e-06 Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 3/86 (3%) Frame = -3 Query: 410 EDADEIFKKMKETGLIPNAVA---MLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAV 240 +DA IF+KM+ G+IPNA + ++ GL + + +A + M E G P VV + + Sbjct: 233 DDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGHSPNVVTFVTL 292 Query: 239 VEGFCKAHKFDDAVRIFKKMQSNGVI 162 V+GFCK +DA + K ++ G I Sbjct: 293 VDGFCKEKSLEDAQNMIKTVRQKGFI 318 >ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina] gi|557524309|gb|ESR35615.1| hypothetical protein CICLE_v10028759mg [Citrus clementina] Length = 344 Score = 269 bits (687), Expect = 1e-69 Identities = 142/242 (58%), Positives = 173/242 (71%), Gaps = 23/242 (9%) Frame = -3 Query: 659 PPEPIPNRPLR-----------RQSYP-----YGSPRIPK------PNRGREIENQNSFR 546 PPEPIP+RPLR R+S+ Y + P+ PNR R Sbjct: 55 PPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQQQRPQQQSFQSPNRPRPKSPDGV-- 112 Query: 545 GETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLS-PPEDADEIFKKMKETG 369 ++D +FL++FKL D K +NP+ + + Q +K EP+S PP++ADEIFKKMKETG Sbjct: 113 -QSDENFLDQFKLAIDKKPDNPQQNESLGERQEQKPNRNEPISEPPQEADEIFKKMKETG 171 Query: 368 LIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIF 189 LIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA KFDDA RIF Sbjct: 172 LIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAKRIF 231 Query: 188 KKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCRE 9 +KMQSNG+ PN FSY +L++GL+ +LE+A + IEMLEAGHSPN+ TF GLVDG CRE Sbjct: 232 RKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGLVDGLCRE 291 Query: 8 KG 3 KG Sbjct: 292 KG 293 >ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Citrus sinensis] Length = 387 Score = 266 bits (681), Expect = 7e-69 Identities = 142/249 (57%), Positives = 175/249 (70%), Gaps = 22/249 (8%) Frame = -3 Query: 683 DGLGRSDY-PPEPIPNRPLR--------RQSYPYGSPRIPKPNRGREIENQNSFRG---- 543 D R+D PPEPIP+RPLR Q+ PR + ++ Q SF+ Sbjct: 89 DNDNRNDQNPPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQ-QQRPQQQSFQSPNGP 147 Query: 542 --------ETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLS-PPEDADEIF 390 ++D +FL++FKL D K NP+ + + Q +K EP+S PP++ADEIF Sbjct: 148 RPKSPDGVQSDENFLDQFKLAIDKKPGNPQQNESLGQRQEQKPNRNEPISEPPQEADEIF 207 Query: 389 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKF 210 KKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA KF Sbjct: 208 KKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 267 Query: 209 DDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGL 30 DDA RIF+KMQSNG+ PN FSY +L++GL+ +LE+A + IEMLEAGHSPN+ TF GL Sbjct: 268 DDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGL 327 Query: 29 VDGYCREKG 3 VDG CRE+G Sbjct: 328 VDGLCRERG 336 >gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 345 Score = 265 bits (678), Expect = 1e-68 Identities = 141/240 (58%), Positives = 168/240 (70%), Gaps = 16/240 (6%) Frame = -3 Query: 674 GRSDYPPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRG---------------- 543 G D PPEPIPNR L Q P+ +P + N +SF+ Sbjct: 58 GDGDKPPEPIPNRSLEGQR-PF-NPSFRETKGATLNSNGSSFQSFNTKFASDPNRKREDS 115 Query: 542 ETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLI 363 ++D +FLE+FKLG D+K DS ++ K + +P SPP+DADEIFKKMKETGLI Sbjct: 116 QSDENFLEKFKLGLDNKRGKQPSDSEAAALLRRKEQEEKP-SPPQDADEIFKKMKETGLI 174 Query: 362 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKK 183 PNAVAMLDGLCKDGL+QEAMKLFG MREKGTIPEVV+YTAVV+GFCKAHK DDA RIF+K Sbjct: 175 PNAVAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRK 234 Query: 182 MQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 MQS GV PN FSY VL++GL+ +L+DA F +EMLEAGHSPN+ TF GLVDG C+EKG Sbjct: 235 MQSKGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTFVGLVDGLCKEKG 294 >ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297314671|gb|EFH45094.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 301 Score = 259 bits (663), Expect = 8e-67 Identities = 141/234 (60%), Positives = 166/234 (70%), Gaps = 3/234 (1%) Frame = -3 Query: 695 SSIDDGLGRSDYPPEPIPNRPLR--RQSYPYGSPRIPKPNRGREIENQNSFRGETDADFL 522 S+ D G + PPEP+PNRPLR R S + P + + +I+N S D FL Sbjct: 32 STGDKGQEKQQNPPEPLPNRPLRGERSSNSHREPPARQAHDLGKIDNTLS-----DDGFL 86 Query: 521 ERFKLGFDS-KVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAM 345 E+FKLG + E PK + + +PL PPED+DEIFKKMKE GLIPNAVAM Sbjct: 87 EQFKLGVNQDSQETPKPEQYPQ----------DPLLPPEDSDEIFKKMKEGGLIPNAVAM 136 Query: 344 LDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGV 165 LDGLCKDGLVQEAMKLFGLMR+KGTIPEVV+YTAVVEGFCKAHK +DA RIF+KMQ+NG+ Sbjct: 137 LDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFRKMQTNGI 196 Query: 164 IPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 PN FSY VLV+GL++ L+DA F EMLE+GHSPNI TF GLVD CREKG Sbjct: 197 TPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALCREKG 250 >ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Glycine max] Length = 395 Score = 259 bits (662), Expect = 1e-66 Identities = 149/271 (54%), Positives = 174/271 (64%), Gaps = 40/271 (14%) Frame = -3 Query: 695 SSIDDGLGRSDYP-PEPIPNRPLRR-----------QSYPYGSPRIP------------- 591 SS D G SD EPIP+RPLR Q Y GS P Sbjct: 77 SSFKDN-GESDQSLSEPIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGPDEL 135 Query: 590 -KPNRGREIE---------NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEK 441 + N+ +I+ N G++ FL +FKLGFD K N +A K QSE+ Sbjct: 136 DQTNKSSKIDLAFQNTNVAKTNRDAGQSGDSFLNKFKLGFDDKTVNLSEVAASK--QSEE 193 Query: 440 AENMEPLSP-----PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREK 276 A+ P P P+DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREK Sbjct: 194 AKRSNPNQPAQESMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREK 253 Query: 275 GTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDA 96 GTIPE+V+YTAVVEG+ KAHK DDA RIF+KMQS+GV PN FSY VL++GL+ RL DA Sbjct: 254 GTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDA 313 Query: 95 YGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 + F +EMLEAGHSPN+ TF GLVDG+C EKG Sbjct: 314 FEFCVEMLEAGHSPNVTTFVGLVDGFCNEKG 344 >ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Glycine max] Length = 388 Score = 259 bits (662), Expect = 1e-66 Identities = 149/271 (54%), Positives = 174/271 (64%), Gaps = 40/271 (14%) Frame = -3 Query: 695 SSIDDGLGRSDYP-PEPIPNRPLRR-----------QSYPYGSPRIP------------- 591 SS D G SD EPIP+RPLR Q Y GS P Sbjct: 70 SSFKDN-GESDQSLSEPIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGPDEL 128 Query: 590 -KPNRGREIE---------NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEK 441 + N+ +I+ N G++ FL +FKLGFD K N +A K QSE+ Sbjct: 129 DQTNKSSKIDLAFQNTNVAKTNRDAGQSGDSFLNKFKLGFDDKTVNLSEVAASK--QSEE 186 Query: 440 AENMEPLSP-----PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREK 276 A+ P P P+DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREK Sbjct: 187 AKRSNPNQPAQESMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREK 246 Query: 275 GTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDA 96 GTIPE+V+YTAVVEG+ KAHK DDA RIF+KMQS+GV PN FSY VL++GL+ RL DA Sbjct: 247 GTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDA 306 Query: 95 YGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 + F +EMLEAGHSPN+ TF GLVDG+C EKG Sbjct: 307 FEFCVEMLEAGHSPNVTTFVGLVDGFCNEKG 337 >gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis] Length = 306 Score = 259 bits (661), Expect = 1e-66 Identities = 141/226 (62%), Positives = 160/226 (70%) Frame = -3 Query: 680 GLGRSDYPPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKLGF 501 G G SD P ++ R +S P +P RGR E D+ FLE+FKLG Sbjct: 43 GNGESDETTGPSFSQNPRERSRPN------RPPRGR-----GPLTSEDDS-FLEKFKLGL 90 Query: 500 DSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDG 321 DS + + + + K +P PPEDADEIFKKMKETGLIPNAVAMLDGLCKDG Sbjct: 91 DSSKDGMQ-EKPRREAARPKPPLPQPPPPPEDADEIFKKMKETGLIPNAVAMLDGLCKDG 149 Query: 320 LVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQ 141 LVQEAMKLFGLM+EKGTIPEVV+YTAVV+GFCKA K DDAVRIF+KMQSNG+ PN FSY Sbjct: 150 LVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKLDDAVRIFRKMQSNGIEPNAFSYS 209 Query: 140 VLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 VLV+GL GKRLED F +EMLEAGHSPN+ATF GLVDG C EKG Sbjct: 210 VLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATFVGLVDGLCEEKG 255 >emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera] Length = 381 Score = 254 bits (650), Expect = 3e-65 Identities = 139/248 (56%), Positives = 165/248 (66%), Gaps = 17/248 (6%) Frame = -3 Query: 695 SSIDDGLGRSDYPPEPIPNRPLRRQS--------YPYGSPRIPKPNRGREIENQNSFRGE 540 SS G G S PP PIPNRPLR + P +PK + F Sbjct: 85 SSSCGGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRASQASPFNQP 144 Query: 539 TDAD---------FLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFK 387 + A+ FLERFKLG K E P+ +SA E+ N PP++ADEIF+ Sbjct: 145 SPAEKVGATLEDGFLERFKLGVQKK-ERPQ-ESAAAQPSREQDANHGKEQPPQNADEIFR 202 Query: 386 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFD 207 KMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVVEGFCKA + D Sbjct: 203 KMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLD 262 Query: 206 DAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLV 27 DAVRIF+KMQ+NG+ PN FSY VL+RG++ G RL+ A F +EMLEAGHSPN+AT L+ Sbjct: 263 DAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLI 322 Query: 26 DGYCREKG 3 +C+EKG Sbjct: 323 HEFCKEKG 330 >ref|XP_002514391.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223546488|gb|EEF47987.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 313 Score = 254 bits (648), Expect = 4e-65 Identities = 135/232 (58%), Positives = 157/232 (67%), Gaps = 4/232 (1%) Frame = -3 Query: 686 DDGLGRSDYPPEPIPNRPLRRQSY----PYGSPRIPKPNRGREIENQNSFRGETDADFLE 519 DD + PP PIPNRPLR Q+ SPRIP+ N NQN + DFLE Sbjct: 39 DDASNVDNSPPHPIPNRPLRGQTSFNQSQSQSPRIPRRNT-----NQNHLSSD---DFLE 90 Query: 518 RFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLD 339 +FKL + + + + + E P PP DA++IF KMKETGLIPNAVAMLD Sbjct: 91 KFKLNKRNHKDEIPHQINNHTSKDENINKSSPPPPPPDANDIFNKMKETGLIPNAVAMLD 150 Query: 338 GLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIP 159 GLCKDGLVQEAMKLFGLMR+KGTIPEVVVYTAVV+GFCKAHK DDA RIFKKM NG+ P Sbjct: 151 GLCKDGLVQEAMKLFGLMRQKGTIPEVVVYTAVVDGFCKAHKTDDAKRIFKKMIDNGITP 210 Query: 158 NVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 N FSY V ++GL ++DA F +ML+AGHSPN+ TF GLVDG CREKG Sbjct: 211 NAFSYTVTIQGLCKCNAVDDAVDFCFQMLDAGHSPNVTTFVGLVDGLCREKG 262 >gb|AFK36371.1| unknown [Lotus japonicus] Length = 372 Score = 253 bits (647), Expect = 6e-65 Identities = 143/249 (57%), Positives = 171/249 (68%), Gaps = 32/249 (12%) Frame = -3 Query: 653 EPIPNRPLR--------RQSYPYGSPRIPKP----NRGR----EIENQNS-----FRGET 537 EPIPNR LR + Y GS R +P NRGR E+ N++S F+G Sbjct: 74 EPIPNRALRGTQPVNPHSREYNRGS-RSSRPRFDGNRGRPDDVEMTNKSSQTDIGFQGRN 132 Query: 536 DAD-----------FLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIF 390 +D FL++FKLGFD+K N +A + K+ N + PEDADEIF Sbjct: 133 MSDTNKVVNKLGDSFLDKFKLGFDNKAGNSSEVAASNLSEEAKSANSNQPAMPEDADEIF 192 Query: 389 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKF 210 KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+V+YTAVVEG+ KAHK Sbjct: 193 KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 252 Query: 209 DDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGL 30 DDA RIF+KMQSNG+ PN FSY VLV+GL RL+DA+ F +EMLEAGHSPN+ TF L Sbjct: 253 DDAKRIFRKMQSNGISPNAFSYTVLVQGLCKCSRLQDAFEFCVEMLEAGHSPNMTTFVDL 312 Query: 29 VDGYCREKG 3 VDG+ +E+G Sbjct: 313 VDGFVKEQG 321 >ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum] gi|557091098|gb|ESQ31745.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum] Length = 295 Score = 253 bits (646), Expect = 7e-65 Identities = 135/229 (58%), Positives = 160/229 (69%), Gaps = 1/229 (0%) Frame = -3 Query: 686 DDGLGRSDYPPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKL 507 D+ + PPEP+PNRPLR + RG + +D DFLE+FKL Sbjct: 36 DNSQQQQQNPPEPLPNRPLRGE-------------RGSNSARPSQPAKLSDHDFLEQFKL 82 Query: 506 GFDSKVENPKIDSADKSIQSEKAENM-EPLSPPEDADEIFKKMKETGLIPNAVAMLDGLC 330 G K D + K+ Q + E EPL PED++EIFK MKE GLIPNAVAMLDGLC Sbjct: 83 GV-------KQDDSRKTEQKPQQETSPEPLPAPEDSEEIFKNMKEGGLIPNAVAMLDGLC 135 Query: 329 KDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVF 150 KDGLVQEAMKLFGLMR+KGTIPEVV+YTAVVEGFCKAHK +DA RIF+KMQ+NG++PN F Sbjct: 136 KDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFRKMQTNGIVPNAF 195 Query: 149 SYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 SY VLV+GL + L+DA F EMLE+GHSPN++TF GLVD CREKG Sbjct: 196 SYGVLVQGLCNCNMLDDAVDFCGEMLESGHSPNVSTFVGLVDALCREKG 244 >ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Vitis vinifera] Length = 380 Score = 253 bits (646), Expect = 7e-65 Identities = 138/248 (55%), Positives = 165/248 (66%), Gaps = 17/248 (6%) Frame = -3 Query: 695 SSIDDGLGRSDYPPEPIPNRPLRRQS--------YPYGSPRIPKPNRGREIENQNSFRGE 540 SS G G S PP PIPNRPLR + P +PK + F Sbjct: 84 SSSSCGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRASQASPFNQP 143 Query: 539 TDAD---------FLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFK 387 + A+ FLERFKLG K E P+ +SA E+ N PP++ADEIF+ Sbjct: 144 SPAEKVGATLEDGFLERFKLGVQKK-ERPQ-ESAAAQPSREQDANHGKEQPPQNADEIFR 201 Query: 386 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFD 207 KMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVVEGFCKA + + Sbjct: 202 KMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLN 261 Query: 206 DAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLV 27 DAVRIF+KMQ+NG+ PN FSY VL+RG++ G RL+ A F +EMLEAGHSPN+AT L+ Sbjct: 262 DAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLI 321 Query: 26 DGYCREKG 3 +C+EKG Sbjct: 322 HEFCKEKG 329 >ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X3 [Glycine max] gi|571435834|ref|XP_006573590.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X4 [Glycine max] Length = 403 Score = 252 bits (643), Expect = 2e-64 Identities = 139/257 (54%), Positives = 171/257 (66%), Gaps = 40/257 (15%) Frame = -3 Query: 653 EPIPNRPLRRQ------------------SYP------YGSP-RIPKPNRGREIE----- 564 EPIP+RPLR + S+P +G P + K N+ +I+ Sbjct: 98 EPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQG 157 Query: 563 -----NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSP----- 414 N G++ FL++FKLGFD K N +A K QSE+A+ P P Sbjct: 158 TTNVAETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASK--QSEEAKRSNPNQPAQESM 215 Query: 413 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 234 P+DA+EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+V+YTAVVE Sbjct: 216 PQDANEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVE 275 Query: 233 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 54 G+ KAHK DDA RIF+KMQS+G+ PN FSY VL++GL+ RL DA+ F +EMLEAGHSP Sbjct: 276 GYTKAHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSP 335 Query: 53 NIATFTGLVDGYCREKG 3 N+ F GLVDG+C EKG Sbjct: 336 NVTAFVGLVDGFCNEKG 352 >ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X2 [Glycine max] Length = 431 Score = 252 bits (643), Expect = 2e-64 Identities = 139/257 (54%), Positives = 171/257 (66%), Gaps = 40/257 (15%) Frame = -3 Query: 653 EPIPNRPLRRQ------------------SYP------YGSP-RIPKPNRGREIE----- 564 EPIP+RPLR + S+P +G P + K N+ +I+ Sbjct: 126 EPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQG 185 Query: 563 -----NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSP----- 414 N G++ FL++FKLGFD K N +A K QSE+A+ P P Sbjct: 186 TTNVAETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASK--QSEEAKRSNPNQPAQESM 243 Query: 413 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 234 P+DA+EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+V+YTAVVE Sbjct: 244 PQDANEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVE 303 Query: 233 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 54 G+ KAHK DDA RIF+KMQS+G+ PN FSY VL++GL+ RL DA+ F +EMLEAGHSP Sbjct: 304 GYTKAHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSP 363 Query: 53 NIATFTGLVDGYCREKG 3 N+ F GLVDG+C EKG Sbjct: 364 NVTAFVGLVDGFCNEKG 380 >ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like isoform X1 [Glycine max] Length = 457 Score = 252 bits (643), Expect = 2e-64 Identities = 139/257 (54%), Positives = 171/257 (66%), Gaps = 40/257 (15%) Frame = -3 Query: 653 EPIPNRPLRRQ------------------SYP------YGSP-RIPKPNRGREIE----- 564 EPIP+RPLR + S+P +G P + K N+ +I+ Sbjct: 152 EPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQG 211 Query: 563 -----NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSP----- 414 N G++ FL++FKLGFD K N +A K QSE+A+ P P Sbjct: 212 TTNVAETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASK--QSEEAKRSNPNQPAQESM 269 Query: 413 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 234 P+DA+EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+V+YTAVVE Sbjct: 270 PQDANEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVE 329 Query: 233 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 54 G+ KAHK DDA RIF+KMQS+G+ PN FSY VL++GL+ RL DA+ F +EMLEAGHSP Sbjct: 330 GYTKAHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSP 389 Query: 53 NIATFTGLVDGYCREKG 3 N+ F GLVDG+C EKG Sbjct: 390 NVTAFVGLVDGFCNEKG 406 >ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Fragaria vesca subsp. vesca] Length = 309 Score = 249 bits (635), Expect = 1e-63 Identities = 136/226 (60%), Positives = 159/226 (70%), Gaps = 7/226 (3%) Frame = -3 Query: 659 PPEPIPNRPLRRQSYPYGSPRIPK-----PNRGREIENQNSFRGETDADFLERFKLGFD- 498 PPEPIPNRPLR Q P + + PN R EN N D+ FLE+ K+G + Sbjct: 45 PPEPIPNRPLRGQRASNPQPNLERRRESPPNLERRRENPNP--PLQDSSFLEKLKMGLEK 102 Query: 497 SKVENPKIDSADKSIQSEKAENMEPL-SPPEDADEIFKKMKETGLIPNAVAMLDGLCKDG 321 SK E P+ E AE P P E+A+EIFKKMKETGLIPNAVAMLDGLCKDG Sbjct: 103 SKREKPQ----------EAAEPPPPQPQPTEEANEIFKKMKETGLIPNAVAMLDGLCKDG 152 Query: 320 LVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQ 141 LVQEAMKLFG MREKGTIPEVV+YTAVVEGFCK K +DA R+F+KMQSNG++PN FSY Sbjct: 153 LVQEAMKLFGSMREKGTIPEVVIYTAVVEGFCKGRKPEDAKRVFRKMQSNGIVPNAFSYN 212 Query: 140 VLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 V+V+GL ++++DA F EMLEAGHSPN+ TF GLVDG C+E G Sbjct: 213 VMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVGLVDGVCKENG 258 Score = 62.8 bits (151), Expect = 2e-07 Identities = 29/89 (32%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Frame = -3 Query: 413 PEDADEIFKKMKETGLIPNAVA---MLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTA 243 PEDA +F+KM+ G++PNA + M+ GLC+ +++A + G M E G P V + Sbjct: 189 PEDAKRVFRKMQSNGIVPNAFSYNVMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVG 248 Query: 242 VVEGFCKAHKFDDAVRIFKKMQSNGVIPN 156 +V+G CK + + + K++ G + N Sbjct: 249 LVDGVCKENGVEGGESVIGKLKQRGYVVN 277 >ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|79326453|ref|NP_001031806.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g38150 gi|4467121|emb|CAB37555.1| putative protein [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1| putative protein [Arabidopsis thaliana] gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332661485|gb|AEE86885.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 302 Score = 248 bits (633), Expect = 2e-63 Identities = 134/221 (60%), Positives = 158/221 (71%), Gaps = 2/221 (0%) Frame = -3 Query: 659 PPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKLGF--DSKVE 486 PPEP+PNRPLR + S R P + + ++ +D FLE+FKLG DS+ E Sbjct: 45 PPEPLPNRPLRGERSS-NSHREPPARQAHNLGKSDTTL--SDDGFLEQFKLGVNQDSR-E 100 Query: 485 NPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA 306 PK + + EPL PPED+DEIFKKMKE GLIPNAVAMLDGLCKDGLVQEA Sbjct: 101 TPKPEQYPQ----------EPLPPPEDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEA 150 Query: 305 MKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRG 126 MKLFGLMR+KGTIPEVV+YTAVVE FCKAHK +DA RIF+KMQ+NG+ PN FSY VLV+G Sbjct: 151 MKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNGIAPNAFSYGVLVQG 210 Query: 125 LFSGKRLEDAYGFTIEMLEAGHSPNIATFTGLVDGYCREKG 3 L++ L+DA F EMLE+GHSPN+ TF LVD CR KG Sbjct: 211 LYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCRVKG 251