BLASTX nr result
ID: Mentha24_contig00013071
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00013071
         (725 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU44833.1| hypothetical protein MIMGU_mgv1a017808mg, partial...   349   6e-94
ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi...   334   2e-89
ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi...   332   1e-88
ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr...   322   1e-85
gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlise...   321   2e-85
ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   313   5e-83
gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]     306   6e-81
ref|XP_007031692.1| Pentatricopeptide repeat (PPR-like) superfam...   302   9e-80
ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr...   300   3e-79
ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi...   289   8e-76
ref|XP_007140836.1| hypothetical protein PHAVU_008G145600g [Phas...   288   1e-75
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   288   1e-75
ref|XP_007220233.1| hypothetical protein PRUPE_ppa001979mg [Prun...   287   3e-75
ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps...   286   5e-75
ref|XP_002324000.1| pentatricopeptide repeat-containing family p...   285   1e-74
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   283   3e-74
ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi...   277   2e-72
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   276   7e-72
ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [A...
267 2e-69 gb|EAY82798.1| hypothetical protein OsI_38004 [Oryza sativa Indi... 222 1e-55 >gb|EYU44833.1| hypothetical protein MIMGU_mgv1a017808mg, partial [Mimulus guttatus] Length = 659 Score = 349 bits (895), Expect = 6e-94 Identities = 171/245 (69%), Positives = 204/245 (83%), Gaps = 4/245 (1%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VYSTIIRGFGK+KK++SAMALFEWLKRKS E IQPNL+IYNSLLGA+K+A FDF++ Sbjct: 96 VYSTIIRGFGKDKKVDSAMALFEWLKRKSNEADSPIQPNLYIYNSLLGALKQAESFDFVD 155 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 +M+DMA G+ PNVVTYNTLMGIYIE KE+K +LFEEMP+KGI PSPAS+SIVL Y Sbjct: 156 DVMSDMAAKGLLPNVVTYNTLMGIYIEHRKEAKVFELFEEMPTKGIFPSPASYSIVLLAY 215 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDD---REDWEHEFSKLENFIITLCYQVMRRWL 531 RRLEDGFGAL F+V+ R+++++GEIG+D+D EDW EF+KLENF I +CYQVMRRWL Sbjct: 216 RRLEDGFGALTFFVEIRDKFQKGEIGKDNDGEEEEDWVDEFAKLENFTIRICYQVMRRWL 275 Query: 532 VKSENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREV-DDEI 708 V S+N S +VLRLL+EMD A L+ G EEHERLIWACTREEH +V KELY RIRE+ EI Sbjct: 276 VNSKNLSTEVLRLLKEMDKAGLQPGHEEHERLIWACTREEHYIVVKELYARIREMTSTEI 335 Query: 709 SVSVC 723 S+SVC Sbjct: 336 SLSVC 340 >ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Solanum tuberosum] Length = 740 Score = 334 bits (857), Expect = 2e-89 Identities = 160/241 (66%), Positives = 200/241 (82%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VYS++IRGFGK+KKL SAMAL EWL+R+S++ G I N+FIYNSLLGA+KEA K+DF++ Sbjct: 184 VYSSMIRGFGKDKKLNSAMALVEWLRRRSKDNIGSISLNVFIYNSLLGAIKEAGKYDFVD 243 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 K+M+DM GV PNVVTYNTLM IYIE+G+E +AL LF MP KG++PSPAS+S LF Y Sbjct: 244 KVMDDMVSEGVQPNVVTYNTLMRIYIEQGRELEALNLFRLMPKKGLSPSPASYSTALFAY 303 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RRLEDGFGA+ F+V+TR +Y+ GEIG ++ E+WE EF+KLENFI+ +CYQVMR+WLVK Sbjct: 304 
RRLEDGFGAITFFVETREKYQNGEIGNIEE-ENWEDEFAKLENFIVRICYQVMRQWLVKG 362 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN + +VL+LL +MD ARL+ R E+ERL+WACTREEH VVAKELY RIRE D EIS+SV Sbjct: 363 ENANTNVLKLLTDMDRARLQLSRAEYERLVWACTREEHHVVAKELYNRIRERDTEISLSV 422 Query: 721 C 723 C Sbjct: 423 C 423 Score = 60.8 bits (146), Expect = 4e-07 Identities = 35/136 (25%), Positives = 66/136 (48%) Frame = +1 Query: 4 YSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEK 183 Y ++ K K + A+ +++ + + I+PNL+ Y + KF+ ++ Sbjct: 535 YGALLSALEKGKLYDEALQVWKHMIKVG------IEPNLYAYTIMASIYTAQGKFNIVDS 588 Query: 184 IMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYR 363 I+ +M GV P VVT+N ++ G ES A + F+ M ++ ITP+ S+ +++ Sbjct: 589 IIKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNITPNEVSYEMLIEAL- 647 Query: 364 RLEDGFGALAFYVQTR 411 DG LA+ + R Sbjct: 648 -ANDGKPRLAYELYVR 662 >ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Solanum lycopersicum] Length = 742 Score = 332 bits (850), Expect = 1e-88 Identities = 160/242 (66%), Positives = 200/242 (82%), Gaps = 1/242 (0%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRK-SEETGGLIQPNLFIYNSLLGAVKEARKFDFL 177 VYS++IRGFGK+KKL SAMAL EWL+R+ ++ G I N+FIYNSLLGA+KEA K+DF+ Sbjct: 185 VYSSMIRGFGKDKKLNSAMALVEWLRRRRGKDNIGSISLNVFIYNSLLGAIKEAGKYDFV 244 Query: 178 EKIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFG 357 +K+M+DM GV PNVVTYNTLM YIE+G+E +AL+LF EMP KG+TPSPAS+S LF Sbjct: 245 DKVMDDMVSEGVQPNVVTYNTLMRTYIEQGRELEALKLFREMPKKGLTPSPASYSTALFA 304 Query: 358 YRRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVK 537 YRRLEDGFGA+ F+V+TR RY+ GEIG ++ E+WE EF+KLENFI+ +CYQVMR+WLVK Sbjct: 305 YRRLEDGFGAITFFVETRERYQNGEIGNIEE-ENWEDEFAKLENFIVRICYQVMRQWLVK 363 Query: 538 SENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVS 717 EN + +VL+LL +MD ARL+ R E+ERL+WACTREEH VVAKELY RIRE D +IS+S Sbjct: 364 GENANTNVLKLLTDMDRARLQLSRAEYERLVWACTREEHYVVAKELYNRIRERDTDISLS 423 Query: 718 VC 723 VC Sbjct: 
424 VC 425 Score = 61.6 bits (148), Expect = 3e-07 Identities = 35/136 (25%), Positives = 66/136 (48%) Frame = +1 Query: 4 YSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEK 183 Y ++ K K + A+ +++ + + I+PNL+ Y + KF+ ++ Sbjct: 537 YGALLSALEKGKLYDEALQVWKHMIKVG------IEPNLYAYTIMASIYTAQGKFNIVDS 590 Query: 184 IMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYR 363 I+ +M GV P VVT+N ++ G ES A + F+ M ++ ITP+ S+ +++ Sbjct: 591 IIKEMVTTGVEPTVVTFNAIISGCARNGMESVAYEWFQRMKTQNITPNEVSYEVLIEAL- 649 Query: 364 RLEDGFGALAFYVQTR 411 DG LA+ + R Sbjct: 650 -ANDGKPRLAYELYVR 664 >ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina] gi|568831365|ref|XP_006469938.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Citrus sinensis] gi|557549828|gb|ESR60457.1| hypothetical protein CICLE_v10014357mg [Citrus clementina] Length = 768 Score = 322 bits (824), Expect = 1e-85 Identities = 157/241 (65%), Positives = 196/241 (81%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+S++IRGFGKEK+ + AMAL EWLKRK ETGG I PNLF+YNSLLGAVK+++KF+ ++ Sbjct: 210 VHSSMIRGFGKEKRTDCAMALVEWLKRKKRETGGFIGPNLFVYNSLLGAVKQSQKFEEMD 269 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 +IMNDMA GV+PNVVTYNTLM IYIE+G+ +KAL + EE+ KG+TPS S+S L Y Sbjct: 270 RIMNDMAEEGVNPNVVTYNTLMAIYIEQGEGTKALNVLEEIKKKGLTPSAVSYSQALLAY 329 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+V+ R +Y +GEIG+ DD E+WE+EF KL++FII +CYQVMRRWLVK Sbjct: 330 RRMEDGNGALKFFVELREKYLKGEIGKGDD-ENWENEFVKLKDFIIRICYQVMRRWLVKD 388 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN S +VL+LL EMD A L+ + E+ERL+WACTREEH VVAKE Y RIRE DEIS+SV Sbjct: 389 ENLSTNVLKLLIEMDKAGLRPVKAEYERLVWACTREEHYVVAKEFYARIRERHDEISLSV 448 Query: 721 C 723 C Sbjct: 449 C 449 Score = 56.6 bits (135), Expect = 8e-06 Identities = 42/167 (25%), Positives = 74/167 (44%) Frame = +1 Query: 4 
YSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEK 183 Y ++ K K + A +++ + E PNL+ Y + KF+ +E Sbjct: 561 YGALLSALEKGKLYDEASRVWQHMLNVGAE------PNLYAYTIMASIFTAQGKFNLVEL 614 Query: 184 IMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGYR 363 I +MA + + P VVTYN ++ + G S A + F M + I+P+ ++ +++ Sbjct: 615 IFREMASSRIEPTVVTYNAIISACGQNGMSSAAYEWFHRMKVQNISPNEITYEMLIEALA 674 Query: 364 RLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITL 504 + DG LA+ + R R E E+ D EFS++ I L Sbjct: 675 K--DGKPRLAYDLYLRARNE--ELNLSSKAYDAILEFSQVYGATIDL 717 >gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlisea aurea] Length = 557 Score = 321 bits (822), Expect = 2e-85 Identities = 152/241 (63%), Positives = 195/241 (80%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VYST+IRG GKEK+++SAMALFEWL+RKS+E+G ++ NLF+YNSLLGA+K+A FD +E Sbjct: 38 VYSTVIRGLGKEKRIQSAMALFEWLQRKSKESGSKLKLNLFVYNSLLGAMKQAEAFDLVE 97 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 ++M M GVHPNVVT+N LMGI+IE+G E +AL+LF EM GI+PSPAS+S VL Y Sbjct: 98 EVMTKMGAEGVHPNVVTFNALMGIHIEQGNELRALELFREMLMMGISPSPASYSTVLNAY 157 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+E+G GA++F+++TRN+Y G++ DDD EDWE E SKLENF + +CYQVMRRWLVK Sbjct: 158 RRMENGSGAVSFFIETRNKYRNGDMANDDD-EDWELEISKLENFTLRICYQVMRRWLVKR 216 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 N S +VL+LL+EMDNA L E E+LIWACTRE+HC VAKELYTR+RE+ +IS+SV Sbjct: 217 GNFSTEVLKLLKEMDNAGLNCDPENLEKLIWACTREDHCAVAKELYTRVREMGADISLSV 276 Query: 721 C 723 C Sbjct: 277 C 277 >ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610 [Vitis vinifera] Length = 763 Score = 313 bits (801), Expect = 5e-83 Identities = 154/241 (63%), Positives = 191/241 (79%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VYST+IRGFG +K+L++AMAL EWLKRK +ET G PNLF+YNSLLGAVK++ KF +E Sbjct: 207 
VYSTMIRGFGTDKRLDAAMALVEWLKRK-KETNGSKGPNLFVYNSLLGAVKQSEKFALVE 265 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 K+MNDMA G+ PNVVTYNTLM IY+E+G+ +AL + EE+ G+ PSP S+S L Y Sbjct: 266 KVMNDMAREGILPNVVTYNTLMSIYLEQGRSVEALNILEEIQKNGLCPSPVSYSTALLVY 325 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+++ R Y +GEIG+D D EDWE+EF KL+NF I +CYQVMRRWLVK Sbjct: 326 RRMEDGHGALKFFIELRENYLKGEIGKDAD-EDWENEFVKLKNFTIRICYQVMRRWLVKE 384 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 N+S +L+LL +MDNA L+ GR E+ERL+WACTREEH VVAKELYTRIRE EIS+SV Sbjct: 385 GNQSPILLKLLADMDNAGLQPGRAEYERLVWACTREEHYVVAKELYTRIRERHTEISLSV 444 Query: 721 C 723 C Sbjct: 445 C 445 >gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis] Length = 737 Score = 306 bits (783), Expect = 6e-81 Identities = 149/241 (61%), Positives = 187/241 (77%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+ST+IRG G+EK L+ A AL EWLKRK EE GLI NLFIYNSLLGAVK++ +F +E Sbjct: 213 VFSTMIRGLGREKLLDPAFALLEWLKRKKEENNGLISLNLFIYNSLLGAVKQSEQFGEME 272 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 K++N MA GV PNVVTYNT+M I++E G+ +KAL + EE+ KG+TPSP S+S L Y Sbjct: 273 KVLNYMAQEGVVPNVVTYNTMMAIHLENGEGTKALSVLEEIRKKGLTPSPVSYSTALLAY 332 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+V+ R +Y++GE+G+DDD EDWE+EF KLENF I +CYQVMR WLV Sbjct: 333 RRMEDGHGALKFFVEIREKYQKGEMGKDDD-EDWENEFVKLENFTIRVCYQVMRHWLVNE 391 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 +N S +VL+LL +MD A + R EHERL+WACTREEH +VAKELY RIRE +IS+SV Sbjct: 392 DNLSTNVLKLLTKMDIAGIPPSRSEHERLLWACTREEHHLVAKELYDRIREGYSDISLSV 451 Query: 721 C 723 C Sbjct: 452 C 452 >ref|XP_007031692.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative [Theobroma cacao] gi|508710721|gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative [Theobroma 
cacao] Length = 741 Score = 302 bits (773), Expect = 9e-80 Identities = 142/241 (58%), Positives = 191/241 (79%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+S++I+GFG++ +++AMAL EWLKRK ++GG + PNLFIYNSLLGAVK +++F +E Sbjct: 184 VHSSMIKGFGRDNYMDAAMALVEWLKRKKNDSGGSVGPNLFIYNSLLGAVKHSKQFREME 243 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI+ DM GV PN+VTYN LM IY+E+G+ +KAL + EE+ KG +PSP S+S L Y Sbjct: 244 KILKDMEEEGVIPNIVTYNVLMAIYLEQGEATKALNVLEEIQEKGFSPSPVSYSTALLAY 303 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+++ R +Y +G++G+D D E+WE+EF KLENF + +C QVMRRWLVK Sbjct: 304 RRMEDGNGALKFFIELREKYVKGDLGKDAD-ENWEYEFVKLENFTVRICQQVMRRWLVKD 362 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN S +VL+LL++MDNA LK +E++ER+IWACT EEH VVAKELY+RIRE EIS+SV Sbjct: 363 ENLSTNVLKLLRDMDNAGLKLSKEDYERIIWACTCEEHYVVAKELYSRIRERHSEISLSV 422 Query: 721 C 723 C Sbjct: 423 C 423 >ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum] gi|557101036|gb|ESQ41399.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum] Length = 688 Score = 300 bits (769), Expect = 3e-79 Identities = 152/241 (63%), Positives = 181/241 (75%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VY +IRGFGK+K+L+ AMA+ +WLKRK E+GGLI PNLFIYNSLLGA+KE+R F E Sbjct: 165 VYCAMIRGFGKDKRLKPAMAVVDWLKRKKIESGGLIGPNLFIYNSLLGAMKESRGFGETE 224 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI++DM G+ PN+VTYNTLM IY+EEG+ KAL + + + KG PSP ++S L Y Sbjct: 225 KILSDMEEEGIVPNIVTYNTLMVIYMEEGEFHKALGILDLVKEKGFEPSPVTYSTALLVY 284 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RRLEDG GAL F+ + R +Y + EIG D D DWE EF KLENFI +CYQVMRRWLVK Sbjct: 285 RRLEDGMGALEFFAELREKYSKREIGNDAD-YDWEFEFVKLENFIGRICYQVMRRWLVKD 343 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN + +L+LL 
MDNA LK REEHERLIWACTREEH VV KELY RIRE EIS+SV Sbjct: 344 ENLTTKMLKLLNAMDNAGLKPSREEHERLIWACTREEHYVVGKELYKRIRERFPEISLSV 403 Query: 721 C 723 C Sbjct: 404 C 404 >ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Glycine max] Length = 808 Score = 289 bits (739), Expect = 8e-76 Identities = 140/241 (58%), Positives = 179/241 (74%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+STII GFGKEK+++SA+ LF W+K++ ET G PNLFIYN LLG VK++ +F +E Sbjct: 250 VFSTIISGFGKEKRMDSALILFNWMKKRKIETNGSFGPNLFIYNGLLGVVKQSGQFAEME 309 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 I+N+MA +G+ NVVTYNTLM IYIE+G+ KAL + EE+ G+TPSP S+S L Y Sbjct: 310 VILNEMAEDGIAYNVVTYNTLMAIYIEKGECDKALNMLEEIRRNGLTPSPVSYSQALLAY 369 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG+GAL F+V+ R +Y QGEIG+DDD EDWE E KLE F I +CYQVMR WLV Sbjct: 370 RRMEDGYGALNFFVEFREKYRQGEIGKDDDGEDWEKECLKLEKFTIRVCYQVMRCWLVSR 429 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 +N S +VL+ L +MDN + R + ERL WACTRE+H +V KELY RIRE D+IS+SV Sbjct: 430 DNLSKNVLKFLVDMDNVGIPLPRADLERLAWACTREDHYIVVKELYNRIRERYDKISLSV 489 Query: 721 C 723 C Sbjct: 490 C 490 >ref|XP_007140836.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris] gi|561013969|gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris] Length = 752 Score = 288 bits (738), Expect = 1e-75 Identities = 140/241 (58%), Positives = 178/241 (73%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+STII FGKEK+++SA+ LFEW+K++ ET G PNLFIYN LLG VK++ +F +E Sbjct: 194 VFSTIINSFGKEKRMDSALILFEWMKKRKIETNGSFGPNLFIYNGLLGVVKQSGQFAQME 253 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 I+N+MA +G+ NVVTYNTLM IYIE+G+ +AL + EE+ G TPSP S+S L Y Sbjct: 254 TILNEMAKDGISYNVVTYNTLMAIYIEKGEFDRALNVLEEIHGNGFTPSPVSYSQALLAY 313 Query: 361 
RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+ED GAL F+V+ R Y +GEIG DDD EDWE E KLE F I +CYQVMR WLV S Sbjct: 314 RRMEDCNGALNFFVELRENYHRGEIGEDDDGEDWEEELMKLEKFTIRICYQVMRCWLVSS 373 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 +N S +VL+ L +MDNA + R + ERL+WACTRE+H +V KELYTRIRE D+IS+SV Sbjct: 374 DNLSKNVLKFLVDMDNAGIPLTRADLERLVWACTREDHYIVVKELYTRIRERYDKISLSV 433 Query: 721 C 723 C Sbjct: 434 C 434 >ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223533700|gb|EEF35435.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 671 Score = 288 bits (738), Expect = 1e-75 Identities = 141/241 (58%), Positives = 186/241 (77%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VYS++I+ FG + K+ESA+AL EWLKR+ +E G I PNLFIYNSLL AVK+++ F+ E Sbjct: 115 VYSSMIKAFGWDNKMESALALVEWLKRR-KEIGSSIGPNLFIYNSLLSAVKKSKLFEEAE 173 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI+NDM G+ PNVVTYNTLMGIY+E+G+ +KAL + E+M KG P+ AS+S L Y Sbjct: 174 KILNDMTQEGIAPNVVTYNTLMGIYVEKGQATKALNILEQMHEKGFIPTAASYSTALLAY 233 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 R +EDG GALAF+V +++Y +G+IG++ D E+WE+EF KLE FII +CYQVMRRWLV+ Sbjct: 234 RGMEDGHGALAFFVDIKDKYLKGKIGKNSD-ENWENEFVKLETFIIRICYQVMRRWLVRH 292 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 +N S DVL+LL +MD A L+ + E+ERL+WACTRE+H V KELY RIRE +IS+SV Sbjct: 293 DNFSTDVLKLLTDMDKAGLQPSQAEYERLVWACTREDHYAVGKELYIRIRERHSKISLSV 352 Query: 721 C 723 C Sbjct: 353 C 353 >ref|XP_007220233.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica] gi|462416695|gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica] Length = 734 Score = 287 bits (734), Expect = 3e-75 Identities = 138/241 (57%), Positives = 182/241 (75%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+S++IRGFG+++ ++SA 
A+ EWLKRKSEET G I PNLFIYNSLLGAVK++++F ++ Sbjct: 204 VFSSMIRGFGRDRLMDSAFAVVEWLKRKSEETNGSITPNLFIYNSLLGAVKQSKQFGEMD 263 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 K+++ M GV NVVTYNT M IYIE+G +KAL + E++ KG+ PS S+S L Y Sbjct: 264 KVLSAMTEEGVELNVVTYNTKMAIYIEQGLSTKALDVLEDIEKKGLIPSSVSYSTALLAY 323 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 +R+EDG GAL F+++ R +Y +G+I + + EDWEHEF +LENF +CYQVMRRWLVK Sbjct: 324 QRMEDGNGALQFFIEFREKYHKGDISK-ESVEDWEHEFIQLENFTKRVCYQVMRRWLVKD 382 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 +N S +VL+LL +MD A + R EHERL+WACTREEH VAKELY RIRE EI +SV Sbjct: 383 DNLSTNVLKLLAQMDIAGVPLSRAEHERLLWACTREEHYTVAKELYNRIRERHTEIGISV 442 Query: 721 C 723 C Sbjct: 443 C 443 >ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella] gi|482561642|gb|EOA25833.1| hypothetical protein CARUB_v10019206mg [Capsella rubella] Length = 673 Score = 286 bits (732), Expect = 5e-75 Identities = 142/241 (58%), Positives = 179/241 (74%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+ +I GFGK+K+LE A+A+ +WLKRK E+G +I PNLFIYNSLLGA+K+ F E Sbjct: 153 VFCAMISGFGKDKRLEPAVAVVDWLKRKKSESGSVIGPNLFIYNSLLGAMKQLSAFGEAE 212 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 K+++DM G+ PN+VTYNTLM IY+EEG+ KAL + + + KG P+P ++S L Y Sbjct: 213 KVLSDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLVKEKGFEPNPITYSTALLVY 272 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+V+ R +Y + EIG D D DW+ EF KLENFI +CYQVMRRWLVK+ Sbjct: 273 RRMEDGMGALEFFVELREKYSKREIGNDPD-YDWKFEFFKLENFIGRICYQVMRRWLVKN 331 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN + VL+LL MD+A LK REEHERLIWACTREEH +V KELY RIRE EIS+SV Sbjct: 332 ENWTTRVLKLLNAMDSAGLKPSREEHERLIWACTREEHYIVGKELYKRIRERFPEISLSV 391 Query: 721 C 723 C Sbjct: 392 C 392 Score = 57.8 bits (138), Expect = 4e-06 Identities = 27/116 (23%), Positives 
= 59/116 (50%) Frame = +1 Query: 4 YSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLEK 183 Y ++ K K + A ++ + + I+PNL+ Y ++ + +KF+ L+ Sbjct: 504 YGALLSALEKGKLYDEAFRVWNHMVKVG------IEPNLYAYTTMASVLTGQQKFNLLDT 557 Query: 184 IMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVL 351 ++ +MA G+ P+VVTYN ++ + G A + F M S+ + P+ ++ +++ Sbjct: 558 LLKEMASKGIEPSVVTYNAVISGCAKNGLSGVAYEWFHRMKSENVEPNEITYEMLI 613 >ref|XP_002324000.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222867002|gb|EEF04133.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 709 Score = 285 bits (729), Expect = 1e-74 Identities = 142/241 (58%), Positives = 184/241 (76%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VY ++I+GFG +KK+E A+AL +WLK K +ET G I PNLFIYNSLL AVK++ +++ E Sbjct: 155 VYLSMIKGFGWDKKMEPAIALVDWLKIK-KETDGTIVPNLFIYNSLLSAVKQSEQYEETE 213 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI+ M GV PNVVTYN LM IY+++G+ KAL + EEM G TPS AS+S L Y Sbjct: 214 KILERMTQEGVAPNVVTYNILMVIYVKQGQAKKALDVLEEMRRNGFTPSAASYSSALLAY 273 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 R++EDG GAL F+V+ +++Y +GEIG+D D EDWE E+ KLENF I +CYQVMRRWLV+ Sbjct: 274 RKMEDGDGALKFFVEIKDKYMKGEIGKDAD-EDWEREYVKLENFTIRVCYQVMRRWLVRL 332 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN + +VL+LL +MD A L+ GR ++ERL+WACTREEH VVAKELY RIRE +IS+SV Sbjct: 333 ENLNTNVLKLLTDMDKAELQPGRSDYERLVWACTREEHYVVAKELYIRIRERCSDISLSV 392 Query: 721 C 723 C Sbjct: 393 C 393 >ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297319497|gb|EFH49919.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 674 Score = 283 bits (725), Expect = 3e-74 Identities = 143/241 (59%), Positives = 180/241 (74%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VY +IRGFGK+K+L+ A+A+ +WL+RK E+GG+I PNLFIYNSLLGA+K++ + E Sbjct: 155 VYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGVIGPNLFIYNSLLGAMKQSSVGE-AE 213 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI++DM G+ PN+VTYNTLM IY+E+G+ KAL + + + KG P+P ++S L Y Sbjct: 214 KILSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKALGILDLVKEKGFEPNPITYSTALLVY 273 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+V+ R +Y + EIG D D DWE EF KLENFI +CYQVMRRWLVK Sbjct: 274 RRMEDGMGALEFFVELREKYSKREIGNDADY-DWEFEFVKLENFIGRICYQVMRRWLVKD 332 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 EN + VL+LL MDNA K REEHERLIWACTREEH +V KELY RIRE EIS+SV Sbjct: 333 ENWTTRVLKLLNAMDNAGPKPSREEHERLIWACTREEHYIVGKELYKRIRERFPEISLSV 392 Query: 721 C 723 C Sbjct: 393 C 393 >ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Fragaria vesca subsp. 
vesca] Length = 657 Score = 277 bits (709), Expect = 2e-72 Identities = 133/241 (55%), Positives = 179/241 (74%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+S++IRGFG++K ++SA A+ EWLKR+ EET G++ PNLFI+NSLLGAVK+ ++F ++ Sbjct: 131 VFSSMIRGFGRDKLMDSAFAVVEWLKRRGEETNGMVAPNLFIFNSLLGAVKQCKQFGEMD 190 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 K++ DM GV PN+VTYNT M IY+E+G +KAL + EE+ KG+ SP ++S L Y Sbjct: 191 KVLADMTQEGVEPNIVTYNTKMAIYVEQGLSTKALDVLEEIQKKGMIASPVTYSTALQAY 250 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 +R++DG GAL F+V+ R +Y G+I + EDWE EF KLE+F +CYQVMR WLV Sbjct: 251 QRMQDGIGALEFFVEFREKYRNGDICNVSE-EDWESEFLKLESFTKRVCYQVMRWWLVMD 309 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 ++ S +VL+LL MDNA + GR EHERL+WACTRE+H VAKELY RIRE EIS+SV Sbjct: 310 DDLSINVLKLLVNMDNAGIPLGRAEHERLLWACTREDHYNVAKELYCRIRERHSEISLSV 369 Query: 721 C 723 C Sbjct: 370 C 370 >ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 665 Score = 276 bits (705), Expect = 7e-72 Identities = 140/241 (58%), Positives = 176/241 (73%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 V+ +I+GFGK+K+L+ A+A+ +WLKRK E+GG+I PNLFIYNSLLGA+ R F E Sbjct: 148 VFCAMIKGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAM---RGFGEAE 204 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI+ DM G+ PN+VTYNTLM IY+EEG+ KAL + + KG P+P ++S L Y Sbjct: 205 KILKDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVY 264 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 RR+EDG GAL F+V+ R +Y + EIG D DWE EF KLENFI 
+CYQVMRRWLVK Sbjct: 265 RRMEDGMGALEFFVELREKYAKREIGNDVGY-DWEFEFVKLENFIGRICYQVMRRWLVKD 323 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVSV 720 +N + VL+LL MD+A ++ REEHERLIWACTREEH +V KELY RIRE EIS+SV Sbjct: 324 DNWTTRVLKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSV 383 Query: 721 C 723 C Sbjct: 384 C 384 >ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda] gi|548855838|gb|ERN13701.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda] Length = 754 Score = 267 bits (683), Expect = 2e-69 Identities = 134/242 (55%), Positives = 178/242 (73%), Gaps = 1/242 (0%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VYS++IRGFG ++L+ A+AL EWLKR + T G NL+IYNSLLGA K + ++ + Sbjct: 191 VYSSMIRGFGMAERLKPAIALVEWLKRGKKSTNGGAILNLYIYNSLLGAAKASHSYEKVG 250 Query: 181 KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 KI+ DM G+ PN+VT NTLM +Y+E+GK +A +F E+P G++PSP ++S VL Y Sbjct: 251 KIIEDMEKQGILPNIVTLNTLMSVYLEQGKTQEARDIFSEIPRNGLSPSPVTYSTVLQIY 310 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 R++ED GAL F+V++R +Y++GEI +D EDWE+EF+KLENF I +CYQVMR WLVK Sbjct: 311 RKMEDAKGALEFFVESREKYKKGEI-ENDSCEDWENEFAKLENFTIRICYQVMRGWLVKG 369 Query: 541 ENR-SNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIREVDDEISVS 717 R + DVL+LL E+D A LK GR +ERLIWACT E H +VAKELY RIRE + EIS+S Sbjct: 370 GGREATDVLKLLIELDKAGLKPGRAIYERLIWACTNEGHYIVAKELYQRIRENNTEISLS 429 Query: 718 VC 723 VC Sbjct: 430 VC 431 >gb|EAY82798.1| hypothetical protein OsI_38004 [Oryza sativa Indica Group] Length = 669 Score = 222 bits (565), Expect = 1e-55 Identities = 112/242 (46%), Positives = 159/242 (65%), Gaps = 1/242 (0%) Frame = +1 Query: 1 VYSTIIRGFGKEKKLESAMALFEWLKRKSEETGGLIQPNLFIYNSLLGAVKEARKFDFLE 180 VY+++IRG GKE++L++A A+ E LKR S GG+ N F+YN LLGAVK + +F + Sbjct: 111 VYTSVIRGLGKERRLDAAFAVVEHLKRGSGSGGGV---NQFVYNCLLGAVKNSGEFGRIH 167 Query: 181 
KIMNDMAINGVHPNVVTYNTLMGIYIEEGKESKALQLFEEMPSKGITPSPASFSIVLFGY 360 ++ DM G+ PNVVT+NTLM IY+E+GK + ++F+ + G+ P+ A++S V+ Y Sbjct: 168 DVLADMEAQGIPPNVVTFNTLMSIYVEQGKIDEVFRVFDTIEGSGLVPTAATYSTVMSSY 227 Query: 361 RRLEDGFGALAFYVQTRNRYEQGEIGRDDDREDWEHEFSKLENFIITLCYQVMRRWLVKS 540 ++ D F AL F + R Y +GE+ +REDW+ EF K E + +CY MRR LV Sbjct: 228 KKAGDAFAALKFLTKLREMYNKGELA--GNREDWDREFVKFEKLTVRVCYMAMRRSLVGG 285 Query: 541 ENRSNDVLRLLQEMDNARLKHGREEHERLIWACTREEHCVVAKELYTRIRE-VDDEISVS 717 EN +VL++L MD A +K R ++ERL+WACT EEH +AKELY RIRE D IS+S Sbjct: 286 ENPVGEVLKVLLGMDEAGVKPDRRDYERLVWACTGEEHYTIAKELYQRIRERGDGVISLS 345 Query: 718 VC 723 VC Sbjct: 346 VC 347