BLASTX nr result
ID: Mentha24_contig00044778
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00044778
         (406 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU18728.1| hypothetical protein MIMGU_mgv1a003317mg [Mimulus...   198   6e-49
ref|XP_002278530.1| PREDICTED: pentatricopeptide repeat-containi...   196   3e-48
ref|XP_006340743.1| PREDICTED: pentatricopeptide repeat-containi...   191   1e-46
ref|XP_004233739.1| PREDICTED: pentatricopeptide repeat-containi...   188   8e-46
ref|XP_004295543.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   187   1e-45
ref|XP_004154607.1| PREDICTED: pentatricopeptide repeat-containi...   187   1e-45
ref|XP_004140023.1| PREDICTED: pentatricopeptide repeat-containi...   187   1e-45
ref|XP_002529510.1| pentatricopeptide repeat-containing protein,...   183   2e-44
gb|EPS65333.1| hypothetical protein M569_09443, partial [Genlise...   181   7e-44
ref|XP_006579327.1| PREDICTED: pentatricopeptide repeat-containi...   176   3e-42
gb|EXB51207.1| hypothetical protein L484_019198 [Morus notabilis]     174   9e-42
ref|XP_006848380.1| hypothetical protein AMTR_s00013p00202120 [A...   174   1e-41
ref|XP_006845841.1| hypothetical protein AMTR_s00154p00028930 [A...   173   3e-41
ref|XP_002308024.2| pentatricopeptide repeat-containing family p...   172   4e-41
ref|XP_007014350.1| Pentatricopeptide repeat (PPR) superfamily p...   172   6e-41
ref|XP_006421323.1| hypothetical protein CICLE_v10004347mg [Citr...   167   1e-39
ref|XP_006492928.1| PREDICTED: pentatricopeptide repeat-containi...   166   2e-39
ref|XP_007137613.1| hypothetical protein PHAVU_009G141200g [Phas...   166   4e-39
ref|XP_004491150.1| PREDICTED: pentatricopeptide repeat-containi...   159   5e-37
emb|CBI29825.3| unnamed protein product [Vitis vinifera]              158   9e-37

>gb|EYU18728.1| hypothetical protein MIMGU_mgv1a003317mg [Mimulus guttatus]
          Length = 592

 Score = 198 bits (504), Expect = 6e-49
 Identities = 98/135 (72%), Positives = 120/135 (88%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL+L+ISQH QFP++ TYTILICG CRNGLL EA++IFN MEKL CSPSVVTFNALI
Sbjct: 185 EARSLELEISQHNQFPNSCTYTILICGLCRNGLLGEAQEIFNGMEKLNCSPSVVTFNALI 244

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKA K++EA+LML+KME+ ++PSLFLRLSQG DR+LD+ASL +KVE ++ESG I KA
Sbjct: 245 DGLCKAAKVDEARLMLHKMEIGRNPSLFLRLSQGTDRVLDSASLHKKVETLVESGLIHKA 304

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L DSGVVP+
Sbjct: 305 YKLLIQLADSGVVPN 319

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 41/135 (30%), Positives = 74/135 (54%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A +L +++Q P+ TYT++I G CR +A ++F M+ GC P T+NAL+
Sbjct: 29  DALNLYDEMTQRRILPTKITYTVVISGMCRAKRTHDAHRMFELMKTRGCQPDSATYNALL 88

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DG CK G+++EA L+K ++ +R G ++D ++++ I A
Sbjct: 89  DGFCKCGQIDEA-FKLFKSFRDDGYNVGIR---GFGCLID---------GLIKAKRISGA 135

Query: 362 YELLKKLTDSGVVPD 406
           +L +++ D+G+VPD
Sbjct: 136 EKLFQQVLDAGLVPD 150

>ref|XP_002278530.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Vitis vinifera]
          Length = 798

 Score = 196 bits (498), Expect = 3e-48
 Identities = 97/135 (71%), Positives = 118/135 (87%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+L+IS+++ FP++ TYTILICG CRNGLL EA QIFN ME LGCSPS++TFNALI
Sbjct: 394 KARSLQLEISKNDCFPTSCTYTILICGMCRNGLLDEARQIFNQMENLGCSPSIMTFNALI 453

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG+LEEA+ + YKME+ K+PSLFLRLSQGADR++DTASLQ VE + ESG ILKA
Sbjct: 454 DGLCKAGELEEARHLFYKMEIGKNPSLFLRLSQGADRVMDTASLQTMVERLCESGLILKA 513

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L DSGVVPD
Sbjct: 514 YKLLMQLADSGVVPD 528

 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 31/120 (25%), Positives = 57/120 (47%)
 Frame = +2

Query: 47  PSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAGKLEEAQLM 226
           P+T YTI++ G C+ + ++ N M+ GC P +T NAL+DG CK G+++EA +
Sbjct: 234 PNTMIYTIILSGLCQAKRTDDVHRLLNTMKVSGCCPDSITCNALLDGFCKLGQIDEAFAL 293

Query: 227 LYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKAYELLKKLTDSGVVPD 406
           L+L + +L ++ + + + E +K+ +G+ PD
Sbjct: 294 -------------LQLFEKEGYVLGIKGYSSLIDGLFRAKRYDEVQEWCRKMFKAGIEPD 340

>ref|XP_006340743.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Solanum tuberosum]
          Length = 775

 Score = 191 bits (484), Expect = 1e-46
 Identities = 91/135 (67%), Positives = 115/135 (85%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+L+IS+++ FP T+TY+I+ICG CRNGL+ EA IFN+MEKLGC PSVVTFN LI
Sbjct: 383 QARSLQLEISENDCFPDTYTYSIVICGMCRNGLVEEARHIFNEMEKLGCFPSVVTFNTLI 442

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG+LEEA LM YKME+ K+PSLFLRLSQGADR+LD+ SLQ+ +E + E+G ILKA
Sbjct: 443 DGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSVSLQKMIEKLCETGKILKA 502

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L D G VP+
Sbjct: 503 YKLLMQLADCGFVPN 517

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 37/135 (27%), Positives = 68/135 (50%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A +L ++++ PS TYT+++ G C+ +A ++ N M+ GC P VT+NAL+
Sbjct: 208 DALALFDEMTERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKTRGCRPDFVTYNALL 267

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +G CK G+++E + LR + ++D ++ + + I +A
Sbjct: 268 NGFCKLGRVDETHAL-------------LRSFENEGYLMDIKGYTCLIDGFVRTKRIDEA 314

Query: 362 YELLKKLTDSGVVPD 406
           + KKL + VVPD
Sbjct: 315 QSVFKKLFEKNVVPD 329

>ref|XP_004233739.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Solanum lycopersicum]
          Length = 753

 Score = 188 bits (477), Expect = 8e-46
 Identities = 90/135 (66%), Positives = 114/135 (84%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+L+IS+++ FP T+TY+I+ICG CRNGL+ EA IFN+MEKLGC PSVVTFN LI
Sbjct: 361 QARSLQLEISENDCFPDTYTYSIVICGMCRNGLVEEARHIFNEMEKLGCFPSVVTFNTLI 420

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG+LEEA LM YKME+ K+PSLFLRLSQGADR+LD+ SLQ+ +E + E+G I KA
Sbjct: 421 DGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSVSLQKMIEKLCETGKIHKA 480

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L D G VP+
Sbjct: 481 YKLLMQLADCGFVPN 495

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 37/135 (27%), Positives = 69/135 (51%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A +L ++++ PS TYT+++ G C+ +A ++ N M+ GC P VT+NAL+
Sbjct: 186 DALALFDEMTERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKTRGCKPDFVTYNALL 245

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +G CK G+++EA ++ LR + ++D ++ + + I +A
Sbjct: 246 NGFCKLGRVDEAHVL-------------LRSFENEGYLMDIKGYTCLIDGFVRTKRIDEA 292

Query: 362 YELLKKLTDSGVVPD 406
           + K L + VVPD
Sbjct: 293 QSVFKNLFEKNVVPD 307

>ref|XP_004295543.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g79540-like [Fragaria vesca subsp. vesca]
          Length = 768

 Score = 187 bits (476), Expect = 1e-45
 Identities = 92/135 (68%), Positives = 114/135 (84%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL L+IS+ + FP+ TYTILICG CRNGL+ EAEQIFN+MEKLGC P VVTFNALI
Sbjct: 377 EARSLHLEISKQDCFPNACTYTILICGMCRNGLVGEAEQIFNEMEKLGCVPCVVTFNALI 436

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKA KL++A ++ YKME+ + PSLFLRLSQG+DRI+D+ASLQ+KVE + +SG IL+A
Sbjct: 437 DGLCKASKLKDAHMLFYKMEIGRKPSLFLRLSQGSDRIIDSASLQKKVEQLCDSGLILQA 496

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L SGV PD
Sbjct: 497 YKLLIQLASSGVAPD 511

 Score = 65.5 bits (158), Expect = 8e-09
 Identities = 42/135 (31%), Positives = 71/135 (52%), Gaps = 7/135 (5%)
 Frame = +2

Query: 23  DISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAG 202
           +++Q P T TYTI++ G C+ EA ++ + M + GC P++VT++AL+DG CK G
Sbjct: 209 EMAQRGIAPDTVTYTIIVSGLCQAKRAHEAHRLVDKMRETGCVPNIVTYHALLDGYCKLG 268

Query: 203 KLEEAQLML-------YKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +L+EA ++ Y + + SL L + A R + L K+ ++
Sbjct: 269 RLDEAYALVRSFQRIGYVLGVEGYSSLIFGLFR-ARRFDEALGLYGKLLGEGIEPDVILC 327

Query: 362 YELLKKLTDSGVVPD 406
           L+K L+D+G V D
Sbjct: 328 TILIKGLSDAGRVKD 342

 Score = 56.2 bits (134), Expect = 5e-06
 Identities = 28/77 (36%), Positives = 47/77 (61%)
 Frame = +2

Query: 8   QSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDG 187
           Q LK ++S P+ TY+ILI G+C+ +A Q+F++M + G +P VT+ ++ G
Sbjct: 174 QMLKCNLS-----PTRSTYSILINGFCKTRKTQDALQMFDEMAQRGIAPDTVTYTIIVSG 228

Query: 188 LCKAGKLEEAQLMLYKM 238
           LC+A + EA ++ KM
Sbjct: 229 LCQAKRAHEAHRLVDKM 245

>ref|XP_004154607.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Cucumis sativus]
          Length = 950

 Score = 187 bits (476), Expect = 1e-45
 Identities = 88/135 (65%), Positives = 115/135 (85%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL+L+IS+H+ FP+ HTY+ILICG C+NGL+ +A+ IF +MEKLGC PSVVTFN+LI
Sbjct: 391 EAESLRLEISKHDCFPNNHTYSILICGMCKNGLINKAQHIFKEMEKLGCLPSVVTFNSLI 450

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +GLCKA +LEEA+L+ Y+ME+V+ PSLFLRLSQG D++ D ASLQ +E + ESG ILKA
Sbjct: 451 NGLCKANRLEEARLLFYQMEIVRKPSLFLRLSQGTDKVFDIASLQVMMERLCESGMILKA 510

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L DSGV+PD
Sbjct: 511 YKLLMQLVDSGVLPD 525

>ref|XP_004140023.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Cucumis sativus]
          Length = 783

 Score = 187 bits (476), Expect = 1e-45
 Identities = 88/135 (65%), Positives = 115/135 (85%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL+L+IS+H+ FP+ HTY+ILICG C+NGL+ +A+ IF +MEKLGC PSVVTFN+LI
Sbjct: 391 EAESLRLEISKHDCFPNNHTYSILICGMCKNGLINKAQHIFKEMEKLGCLPSVVTFNSLI 450

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +GLCKA +LEEA+L+ Y+ME+V+ PSLFLRLSQG D++ D ASLQ +E + ESG ILKA
Sbjct: 451 NGLCKANRLEEARLLFYQMEIVRKPSLFLRLSQGTDKVFDIASLQVMMERLCESGMILKA 510

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L DSGV+PD
Sbjct: 511 YKLLMQLVDSGVLPD 525

>ref|XP_002529510.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223531026|gb|EEF32879.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 804

 Score = 183 bits (464), Expect = 2e-44
 Identities = 90/135 (66%), Positives = 113/135 (83%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL L+IS+++ F S TYTILICG CR+GL+ +A+QIFN+MEK GC PSVVTFNALI
Sbjct: 389 EAKSLHLEISKNDCFSSACTYTILICGMCRSGLVGDAQQIFNEMEKHGCYPSVVTFNALI 448

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DG CKAG +E+AQL+ YKME+ ++PSLFLRLSQGA+R+LDTASLQ VE + +SG ILKA
Sbjct: 449 DGFCKAGNIEKAQLLFYKMEIGRNPSLFLRLSQGANRVLDTASLQTMVEQLCDSGLILKA 508

Query: 362 YELLKKLTDSGVVPD 406
           Y +L +LTDSG P+
Sbjct: 509 YNILMQLTDSGFAPN 523

 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 26/154 (16%)
 Frame = +2

Query: 23  DISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAG 202
           +++Q P+ TYTI+I G C+ A ++F M+ GC P VT+NAL+ G CK G
Sbjct: 221 EMTQRRILPNKITYTIIISGLCQAQKADVAYRLFIAMKDHGCIPDSVTYNALLHGFCKLG 280

Query: 203 KLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVED----------------- 331
           +++EA +L E + ++ QG ++D ++ ED
Sbjct: 281 RVDEALGLLKYFEKDR----YVLDKQGYSCLIDGLFRARRFEDAQVWYRKMTEHNIKPDV 336

Query: 332 ---------MMESGSILKAYELLKKLTDSGVVPD 406
           + ++G A LL ++T+ G+VPD
Sbjct: 337 ILYTIMMKGLSKAGKFKDALRLLNEMTERGLVPD 370

>gb|EPS65333.1| hypothetical protein M569_09443, partial [Genlisea aurea]
          Length = 564

 Score = 181 bits (460), Expect = 7e-44
 Identities = 91/135 (67%), Positives = 113/135 (83%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL+L+IS+H Q P T TYTILI G CRNGLL EA ++F+DME GCSPS TFNALI
Sbjct: 276 EAKSLELEISKHGQLPDTCTYTILISGLCRNGLLGEAGKMFSDMESRGCSPSAATFNALI 335

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG L EAQL L+KME+ ++PSLFLRL+QG++R+LD SL++ VE+M+ SGSILKA
Sbjct: 336 DGLCKAGDLSEAQLTLFKMEIGRNPSLFLRLTQGSERVLDRDSLRKMVENMVTSGSILKA 395

Query: 362 YELLKKLTDSGVVPD 406
           Y+LL +L+D GVVPD
Sbjct: 396 YKLLIQLSDCGVVPD 410

>ref|XP_006579327.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Glycine max]
          Length = 557

 Score = 176 bits (446), Expect = 3e-42
 Identities = 83/134 (61%), Positives = 112/134 (83%)
 Frame = +2

Query: 5   AQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALID 184
           A+SL+L+IS+H+ F + T+TI+IC C+ G+ +A++IFN MEKLGC PS+VTFNAL+D
Sbjct: 174 ARSLQLEISEHQGFHNVCTHTIIICDLCKRGMAEKAQEIFNKMEKLGCFPSIVTFNALMD 233

Query: 185 GLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKAY 364
           GLCKAGKLEEA L+LYKME+ +SPSLF RLSQG+D++LD+ +LQ+KVE M E+G +L AY
Sbjct: 234 GLCKAGKLEEAHLLLYKMEIGRSPSLFFRLSQGSDQVLDSVALQKKVEQMCEAGQLLDAY 293

Query: 365 ELLKKLTDSGVVPD 406
           +LL +L SGV+PD
Sbjct: 294 KLLIQLAGSGVMPD 307

>gb|EXB51207.1| hypothetical protein L484_019198 [Morus notabilis]
          Length = 759

 Score = 174 bits (442), Expect = 9e-42
 Identities = 85/135 (62%), Positives = 108/135 (80%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL L+IS + FP+ TYTILICG CRNGL+ EA+QIF +M+K+GC PSVVTFN+LI
Sbjct: 354 EARSLHLEISNRDCFPNACTYTILICGMCRNGLVKEAQQIFEEMDKVGCFPSVVTFNSLI 413

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           GLCKAG+L +A L+ Y+ME+ ++PSLFLRLSQG R+LD SLQ VE + ESG +LKA
Sbjct: 414 HGLCKAGELGKAHLLFYRMEIGRNPSLFLRLSQGGGRVLDGGSLQAVVEKLCESGLVLKA 473

Query: 362 YELLKKLTDSGVVPD 406
           Y +L +L DSGV+PD
Sbjct: 474 YRILTQLADSGVMPD 488

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 35/135 (25%), Positives = 69/135 (51%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +AQ + ++++ P TYTI+I G C+ + EA ++ ME+ GC P V +NAL+
Sbjct: 179 DAQKMFDEMAERGLAPDERTYTIIISGLCQAKRVDEARRLLITMEESGCCPDTVAYNALL 238

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +G C+ G+++EA F+R S+ ++ ++ + ++ ++A
Sbjct: 239 NGYCQLGRIDEAY-------------AFMRWSEKEGYVVGLKGYSCLIDGLFKAKRYVEA 285

Query: 362 YELLKKLTDSGVVPD 406
           + +K+ +GV PD
Sbjct: 286 HGWFRKMIKAGVKPD 300

 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 2/122 (1%)
 Frame = +2

Query: 47  PSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAGKLEEAQLM 226
           P TY +++C R + A ++N+M + C+P +VTFN LI G CK+G++++AQ M
Sbjct: 124 PDVFTYNVILCLMLRKQVFSLALALYNEMLESNCTPDLVTFNILIHGFCKSGQIQDAQKM 183

Query: 227 LYKMEMVKSPSLFLRLSQGADRIL--DTASLQQKVEDMMESGSILKAYELLKKLTDSGVV 400
           +M A+R L D + + + ++ + +A LL + +SG
Sbjct: 184 FDEM---------------AERGLAPDERTYTIIISGLCQAKRVDEARRLLITMEESGCC 228

Query: 401 PD 406
           PD
Sbjct: 229 PD 230

>ref|XP_006848380.1| hypothetical protein AMTR_s00013p00202120 [Amborella trichopoda]
 gi|548851686|gb|ERN09961.1| hypothetical protein AMTR_s00013p00202120 [Amborella trichopoda]
          Length = 789

 Score = 174 bits (441), Expect = 1e-41
 Identities = 84/135 (62%), Positives = 111/135 (82%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+L+IS+ + FP + TYTILICG C+ GL+ EAE+IF +M++LGCSP+V+TFN+LI
Sbjct: 391 KARSLRLEISKEDCFPDSTTYTILICGLCKEGLVNEAEEIFEEMKRLGCSPTVMTFNSLI 450

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +GLCKAG +E+A ++ YKMEM +PSLFLRLSQG+D LD+ASLQ VE + SG ILKA
Sbjct: 451 NGLCKAGAVEKAHILFYKMEMGSNPSLFLRLSQGSDPALDSASLQSMVERLCNSGLILKA 510

Query: 362 YELLKKLTDSGVVPD 406
           Y+LLK+L SG VPD
Sbjct: 511 YKLLKELVKSGAVPD 525

>ref|XP_006845841.1| hypothetical protein AMTR_s00154p00028930 [Amborella trichopoda]
 gi|548848485|gb|ERN07516.1| hypothetical protein AMTR_s00154p00028930 [Amborella trichopoda]
          Length = 275

 Score = 173 bits (438), Expect = 3e-41
 Identities = 83/135 (61%), Positives = 112/135 (82%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL+L+IS+ + FP + TYTILICG C+ GL+ +AE+IF +M++LGCSP+V+TFN+LI
Sbjct: 28  EARSLRLEISKKDCFPDSATYTILICGLCKEGLVNKAEEIFEEMKRLGCSPTVMTFNSLI 87

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           +G CKAG +E+A ++ KMEM ++PSLFLRLSQG+D +LD+ASLQ VE + SG ILKA
Sbjct: 88  NGFCKAGAMEKAHILFNKMEMGRNPSLFLRLSQGSDPVLDSASLQSMVERLCSSGLILKA 147

Query: 362 YELLKKLTDSGVVPD 406
           Y+LLK+L SGVVPD
Sbjct: 148 YKLLKELVKSGVVPD 162

>ref|XP_002308024.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa]
 gi|550335473|gb|EEE91547.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa]
          Length = 838

 Score = 172 bits (436), Expect = 4e-41
 Identities = 83/134 (61%), Positives = 108/134 (80%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           EA+SL+L+IS+H+ FP+ TY+ILI G CRNGL +A++IFN+MEKLGC PS VTFN+LI
Sbjct: 389 EARSLQLEISRHDCFPNVKTYSILISGMCRNGLTRDAQEIFNEMEKLGCYPSAVTFNSLI 448

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCK G+LE+A L+ YKME+ ++PSLFLRLSQG +LD+ASLQ+ VE + +SG I KA
Sbjct: 449 DGLCKTGQLEKAHLLFYKMEIGRNPSLFLRLSQGPSHVLDSASLQKMVEQLCDSGLIHKA 508

Query: 362 YELLKKLTDSGVVP 403
           Y +L +L DSG P
Sbjct: 509 YRILMQLADSGDAP 522

 Score = 68.9 bits (167), Expect = 7e-10
 Identities = 47/150 (31%), Positives = 75/150 (50%), Gaps = 22/150 (14%)
 Frame = +2

Query: 23  DISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAG 202
           +++Q P TY ++I G CR+ + +A ++F+ M+ G P VT NAL++G C
Sbjct: 221 EMTQRGILPDAFTYCVVISGLCRSKRVDDAYRLFDKMKDSGVGPDFVTCNALLNGFCMLD 280

Query: 203 KLEEAQLMLYKMEM------VKSPSLFLRLSQGADRILDTASLQQK-VED---------- 331
           +++EA +L E V+ S +R A R D L +K +ED
Sbjct: 281 RVDEAFSLLRLFEKDGYVLDVRGYSCLIRGLFRAKRYEDVQLLYRKMIEDNVKPDVYLYT 340

Query: 332 -----MMESGSILKAYELLKKLTDSGVVPD 406
           + E+G + A ELL ++T+SGVVPD
Sbjct: 341 IMMKGLAEAGKVRDALELLNEMTESGVVPD 370

 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 37/120 (30%), Positives = 59/120 (49%)
 Frame = +2

Query: 47  PSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAGKLEEAQLM 226
           P +TY +++ + L+ A ++ M KL C P+V TF+ LIDGLCK+G +++A
Sbjct: 159 PDVYTYNMILDVLIQKNFLLLALTVYTRMMKLNCLPNVATFSILIDGLCKSGNVKDAL-- 216

Query: 227 LYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKAYELLKKLTDSGVVPD 406
           LF ++Q + D + + + S + AY L K+ DSGV PD
Sbjct: 217 ----------HLFDEMTQ-RGILPDAFTYCVVISGLCRSKRVDDAYRLFDKMKDSGVGPD 265

>ref|XP_007014350.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao]
 gi|508784713|gb|EOY31969.1| Pentatricopeptide repeat (PPR) superfamily protein, putative [Theobroma cacao]
          Length = 800

 Score = 172 bits (435), Expect = 6e-41
 Identities = 85/135 (62%), Positives = 110/135 (81%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+L+IS ++ FP+ TYTILI G C+NGL+ EA+QIF++MEKLGC PSVVTFNALI
Sbjct: 395 QARSLQLEISSYDCFPNACTYTILISGMCQNGLVGEAQQIFDEMEKLGCFPSVVTFNALI 454

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGL KAG+LE+A L+ YKME+ ++PSLFLRLS G+ +LD++SLQ VE + ESG ILKA
Sbjct: 455 DGLSKAGQLEKAHLLFYKMEIGRNPSLFLRLSHGSSGVLDSSSLQTMVEQLYESGRILKA 514

Query: 362 YELLKKLTDSGVVPD 406
           Y +L +L D G VPD
Sbjct: 515 YRILMQLADGGNVPD 529

 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 38/150 (25%), Positives = 74/150 (49%), Gaps = 22/150 (14%)
 Frame = +2

Query: 23  DISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAG 202
           +++Q P+ +YTI++ G C+ +A ++ N M++ GCSP V +NAL++G C+ G
Sbjct: 227 EMTQRGIEPNRCSYTIIVSGLCQADRADDACRLLNKMKESGCSPDFVAYNALLNGFCQLG 286

Query: 203 KLEEAQLMLYKMEM------VKSPSLFLRLSQGADRILDTASLQQK-------------- 322
           +++EA +L + ++ S F+ A R + + K
Sbjct: 287 RVDEAFALLQSFQKDGFVLGLRGYSSFINGLFRARRFEEAYAWYTKMFEENVKPDVVLYA 346

Query: 323 --VEDMMESGSILKAYELLKKLTDSGVVPD 406
           + + +G + A +LL ++T+ G+VPD
Sbjct: 347 IMLRGLSVAGKVEDAMKLLSEMTERGLVPD 376

>ref|XP_006421323.1| hypothetical protein CICLE_v10004347mg [Citrus clementina]
 gi|557523196|gb|ESR34563.1| hypothetical protein CICLE_v10004347mg [Citrus clementina]
          Length = 801

 Score = 167 bits (423), Expect = 1e-39
 Identities = 80/135 (59%), Positives = 108/135 (80%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+++I + + P+THT+TILICG CRNG++ +A+++FN MEK GC PSV TFNALI
Sbjct: 395 QARSLQVEIWKRDSLPNTHTFTILICGMCRNGMVDDAQKLFNKMEKAGCFPSVGTFNALI 454

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG+LE+A L+ YKME+ K+P+LFLRLSQG +R+ D ASLQ VE SG I KA
Sbjct: 455 DGLCKAGELEKANLLFYKMEIGKNPTLFLRLSQGGNRVHDKASLQTMVEQYCTSGLIHKA 514

Query: 362 YELLKKLTDSGVVPD 406
           Y++L +L +SG +PD
Sbjct: 515 YKILMQLAESGNLPD 529

 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 46/150 (30%), Positives = 72/150 (48%), Gaps = 22/150 (14%)
 Frame = +2

Query: 23  DISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAG 202
           +++Q P+ TYTI+I G C+ EA ++F M+ GCSP V +NAL++G CK
Sbjct: 227 EMTQRGILPNKFTYTIVISGLCQINRADEAYRLFLKMKDSGCSPDFVAYNALLNGFCKLR 286

Query: 203 KLEEAQLMLYKME---MVKSPSLFLRLSQGADRI--LDTA------SLQQKVE------- 328
           ++EA +L E V + L G R D A ++K+E
Sbjct: 287 GVDEALALLRSFEKDGFVPGLGSYSCLIDGLFRAKRYDEAYAWYRKMFEEKIEPDVVLYG 346

Query: 329 ----DMMESGSILKAYELLKKLTDSGVVPD 406
           + E+G + A +LL ++D G+VPD
Sbjct: 347 IIIRGLSEAGKVKDAMKLLSDMSDRGIVPD 376

>ref|XP_006492928.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Citrus sinensis]
          Length = 869

 Score = 166 bits (421), Expect = 2e-39
 Identities = 80/135 (59%), Positives = 107/135 (79%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+++I + + P+THT+TILICG CRNG++ +A+++FN MEK GC PSV TFNALI
Sbjct: 463 QARSLQVEIWKRDSLPNTHTFTILICGMCRNGMVDDAQKLFNKMEKAGCFPSVGTFNALI 522

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG+LE+A L+ YKME+ K+P LFLRLSQG +R+ D ASLQ VE SG I KA
Sbjct: 523 DGLCKAGELEKANLLFYKMEIGKNPMLFLRLSQGGNRVHDKASLQTMVEQYCTSGLIHKA 582

Query: 362 YELLKKLTDSGVVPD 406
           Y++L +L +SG +PD
Sbjct: 583 YKILMQLAESGNLPD 597

 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 46/150 (30%), Positives = 72/150 (48%), Gaps = 22/150 (14%)
 Frame = +2

Query: 23  DISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAG 202
           +++Q P+ TYTI+I G C+ EA ++F M+ GCSP V +NAL++G CK
Sbjct: 295 EMTQRGILPNKFTYTIVISGLCQINRADEAYRLFLKMKDSGCSPDFVAYNALLNGFCKLR 354

Query: 203 KLEEAQLMLYKME---MVKSPSLFLRLSQGADRI--LDTA------SLQQKVE------- 328
           ++EA +L E V + L G R D A ++K+E
Sbjct: 355 GVDEALALLRSFEKDGFVPGLGSYSCLIDGLFRAKRYDEAYAWYRKMFEEKIEPDVVLYG 414

Query: 329 ----DMMESGSILKAYELLKKLTDSGVVPD 406
           + E+G + A +LL ++D G+VPD
Sbjct: 415 IIIRGLSEAGKVKDAMKLLSDMSDRGIVPD 444

>ref|XP_007137613.1| hypothetical protein PHAVU_009G141200g [Phaseolus vulgaris]
 gi|561010700|gb|ESW09607.1| hypothetical protein PHAVU_009G141200g [Phaseolus vulgaris]
          Length = 719

 Score = 166 bits (419), Expect = 4e-39
 Identities = 82/134 (61%), Positives = 106/134 (79%)
 Frame = +2

Query: 5   AQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALID 184
           A+SL+L IS+HE F + T+TILIC C+ G++ EA++IFN MEK GC PS+VTFN LI
Sbjct: 361 ARSLQLQISEHEGFHNVCTHTILICDLCKRGMVDEAQEIFNRMEKSGCFPSLVTFNTLIY 420

Query: 185 GLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKAY 364
           GLCKAGKLEEA LM YKMEM +SPSLF RLS+G++++LD SL++KVE M E+G +L AY
Sbjct: 421 GLCKAGKLEEAHLMWYKMEMGRSPSLFFRLSRGSNQVLDRVSLRKKVEQMCETGQLLDAY 480

Query: 365 ELLKKLTDSGVVPD 406
           + L +L DSGV+ D
Sbjct: 481 KFLIQLADSGVMSD 494

>ref|XP_004491150.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like [Cicer arietinum]
          Length = 747

 Score = 159 bits (401), Expect = 5e-37
 Identities = 77/134 (57%), Positives = 106/134 (79%)
 Frame = +2

Query: 5   AQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALID 184
           A SL L++S+ + +T+TILIC C+ G++ +A+++FN MEKLGC PSVVTFN LI+
Sbjct: 346 AMSLYLEMSER----NAYTHTILICEMCKRGMVEDAQEVFNQMEKLGCIPSVVTFNVLIN 401

Query: 185 GLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKAY 364
           GLCKA KLEEA+L+ YKME+ ++PSLFL LSQG+ ++LD+ SLQ+K+E M E+G L+AY
Sbjct: 402 GLCKADKLEEARLLFYKMEIGRNPSLFLSLSQGSAQVLDSTSLQKKIEQMCEAGQFLEAY 461

Query: 365 ELLKKLTDSGVVPD 406
           + L +L DSGVVPD
Sbjct: 462 KFLIQLADSGVVPD 475

>emb|CBI29825.3| unnamed protein product [Vitis vinifera]
          Length = 722

 Score = 158 bits (399), Expect = 9e-37
 Identities = 79/134 (58%), Positives = 106/134 (79%)
 Frame = +2

Query: 2   EAQSLKLDISQHEQFPSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALI 181
           +A+SL+L+IS+++ FP++ TYTILICG CRNGLL EA QIFN ME LGCSPS++TFNALI
Sbjct: 394 KARSLQLEISKNDCFPTSCTYTILICGMCRNGLLDEARQIFNQMENLGCSPSIMTFNALI 453

Query: 182 DGLCKAGKLEEAQLMLYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKA 361
           DGLCKAG+LEEA+ + YKME+ K+PSLFLRLSQGADR++DTA+ +V+ + A
Sbjct: 454 DGLCKAGELEEARHLFYKMEIGKNPSLFLRLSQGADRVMDTANGFHRVDREED------A 507

Query: 362 YELLKKLTDSGVVP 403
           + +L ++ +G P
Sbjct: 508 FRVLDQMVKNGCTP 521

 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 31/120 (25%), Positives = 57/120 (47%)
 Frame = +2

Query: 47  PSTHTYTILICGYCRNGLLVEAEQIFNDMEKLGCSPSVVTFNALIDGLCKAGKLEEAQLM 226
           P+T YTI++ G C+ + ++ N M+ GC P +T NAL+DG CK G+++EA +
Sbjct: 234 PNTMIYTIILSGLCQAKRTDDVHRLLNTMKVSGCCPDSITCNALLDGFCKLGQIDEAFAL 293

Query: 227 LYKMEMVKSPSLFLRLSQGADRILDTASLQQKVEDMMESGSILKAYELLKKLTDSGVVPD 406
           L+L + +L ++ + + + E +K+ +G+ PD
Sbjct: 294 -------------LQLFEKEGYVLGIKGYSSLIDGLFRAKRYDEVQEWCRKMFKAGIEPD 340