BLASTX nr result
ID: Mentha29_contig00026153
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00026153 (1152 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU20314.1| hypothetical protein MIMGU_mgv1a003025mg [Mimulus...   131   7e-56
emb|CAN62945.1| hypothetical protein VITISV_002230 [Vitis vinifera]   119   5e-41
ref|XP_002279628.2| PREDICTED: pentatricopeptide repeat-containi...   119   5e-41
ref|XP_006474468.1| PREDICTED: pentatricopeptide repeat-containi...   115   3e-36
ref|XP_002308534.1| hypothetical protein POPTR_0006s23980g [Popu...    97   9e-36
ref|XP_006453029.1| hypothetical protein CICLE_v10007870mg [Citr...   112   1e-34
ref|XP_002516112.1| pentatricopeptide repeat-containing protein,...    96   4e-34
ref|XP_007136868.1| hypothetical protein PHAVU_009G080600g [Phas...    96   2e-33
gb|EXB48288.1| hypothetical protein L484_003771 [Morus notabilis]      98   3e-32
ref|XP_007224434.1| hypothetical protein PRUPE_ppa021491mg [Prun...    96   9e-32
ref|XP_006578089.1| PREDICTED: pentatricopeptide repeat-containi...    91   3e-31
ref|XP_006849607.1| hypothetical protein AMTR_s00024p00204830 [A...    91   1e-25
ref|XP_007012378.1| Pentatricopeptide repeat superfamily protein...    69   2e-23
ref|XP_006606631.1| PREDICTED: pentatricopeptide repeat-containi...    57   3e-15
ref|XP_004985596.1| PREDICTED: pentatricopeptide repeat-containi...    54   7e-13
ref|XP_004498166.1| PREDICTED: pentatricopeptide repeat-containi...    50   1e-12
ref|XP_007049304.1| Pentatricopeptide repeat-containing protein,...    59   3e-12
ref|XP_007049305.1| Pentatricopeptide repeat-containing protein,...    59   3e-12
ref|XP_002319601.2| hypothetical protein POPTR_0013s03290g [Popu...
55 3e-12 ref|XP_004139858.1| PREDICTED: pentatricopeptide repeat-containi... 62 4e-12 >gb|EYU20314.1| hypothetical protein MIMGU_mgv1a003025mg [Mimulus guttatus] Length = 614 Score = 131 bits (330), Expect(2) = 7e-56 Identities = 68/132 (51%), Positives = 92/132 (69%) Frame = -2 Query: 902 TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLI 723 TI LKQM QWGC PNF+SYSEVI GLV + R+Q +R+G LD++LYS LI Sbjct: 482 TIILLKQMPQWGCSPNFISYSEVIIGLVGVKERIQDIDMIVNDMIRDGLGLDTTLYSFLI 541 Query: 722 QANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCST 543 +A CV+G+V +A+ LF++MI I+K+ F+VFV+E+ +RGLM EV+ LFDQM S Sbjct: 542 RAYCVSGDVRRAVCLFKEMIDQSLTIRKDCFEVFVEEMHSRGLMHEVQDLFDQMTPS--- 598 Query: 542 FDVVTYQSILDE 507 FD+ Y S+LD+ Sbjct: 599 FDLKEYLSVLDD 610 Score = 114 bits (285), Expect(2) = 7e-56 Identities = 56/90 (62%), Positives = 70/90 (77%), Gaps = 4/90 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 THTSLLKGY +AG+S EAI HFKEMVSLGM+LD KS+++I E+CKLGRPDEA +L++M Sbjct: 395 THTSLLKGYCVAGRSNEAITHFKEMVSLGMNLDKKSFAVIVNEYCKLGRPDEAVVLLREM 454 Query: 970 RGR----VITCCNAVLRSYTKLQEFDKPFV 893 + R ++ NAV RSY KLQEFDK + Sbjct: 455 KARGIRPIVASFNAVFRSYVKLQEFDKTII 484 >emb|CAN62945.1| hypothetical protein VITISV_002230 [Vitis vinifera] Length = 912 Score = 119 bits (297), Expect(2) = 5e-41 Identities = 63/133 (47%), Positives = 82/133 (61%) Frame = -2 Query: 890 LKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQANC 711 L+QM Q GC PNFLSYS VI GL A+GRM RNGH LD+S+YS L++ C Sbjct: 484 LRQMKQLGCTPNFLSYSTVIDGLCKAKGRMHEVEEFVDDMCRNGHHLDASMYSWLVKGYC 543 Query: 710 VNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCSTFDVV 531 +GN + A+ LF +M+ +VI ESF FVK LSA+ EVE F++M C D+ Sbjct: 544 EDGNADMAMRLFCEMLDMGYVINLESFLAFVKGLSAKEKAFEVEKFFEEMSRRCPGIDIH 603 Query: 530 TYQSILDEYVCGN 492 Y+ ILDE++C N Sbjct: 604 KYRRILDEHLCKN 616 Score = 77.4 bits (189), Expect(2) = 5e-41 Identities = 37/87 (42%), Positives = 56/87 (64%), Gaps = 4/87 (4%) Frame = -3 Query: 1150 
THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 THTS+LKG + GK ++A +H KEMV LGM D K+Y ++ E+CK+G+ D+A +LK+M Sbjct: 393 THTSILKGLCVVGKLDDAARHLKEMVGLGMEADAKAYGVVVNEYCKIGKADDAISLLKEM 452 Query: 970 RGR----VITCCNAVLRSYTKLQEFDK 902 + R ++ NAV R + + DK Sbjct: 453 KSRGINPSVSSFNAVFRILVESGKTDK 479 Score = 44.7 bits (104), Expect(2) = 7e-06 Identities = 29/128 (22%), Positives = 62/128 (48%), Gaps = 5/128 (3%) Frame = -2 Query: 899 IRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQ 720 ++++++MV C PN L+Y+ +I+GL G + NG + + ++ +++ Sbjct: 341 MKYMEEMVSRNCEPNVLTYNAIIYGL-CLNGNVDEAKRMMTRMRLNGLKDNVATHTSILK 399 Query: 719 ANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRN----- 555 CV G ++ A ++M+G +++ V V E G + +L +M++ Sbjct: 400 GLCVVGKLDDAARHLKEMVGLGMEADAKAYGVVVNEYCKIGKADDAISLLKEMKSRGINP 459 Query: 554 SCSTFDVV 531 S S+F+ V Sbjct: 460 SVSSFNAV 467 Score = 33.1 bits (74), Expect(2) = 7e-06 Identities = 19/60 (31%), Positives = 32/60 (53%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+T++++GY G E A F EM G +L +Y+ + FCK G + A ++ +M Sbjct: 255 TYTTIIRGYCKMGMIENAKNVFDEM---GCKPNLVTYNTMINGFCKKGLMESAMKIVDQM 311 >ref|XP_002279628.2| PREDICTED: pentatricopeptide repeat-containing protein At1g63330-like [Vitis vinifera] Length = 563 Score = 119 bits (297), Expect(2) = 5e-41 Identities = 63/133 (47%), Positives = 82/133 (61%) Frame = -2 Query: 890 LKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQANC 711 L+QM Q GC PNFLSYS VI GL A+GRM RNGH LD+S+YS L++ C Sbjct: 400 LRQMKQLGCTPNFLSYSTVIDGLCKAKGRMHEVEEFVDDMCRNGHHLDASMYSWLVKGYC 459 Query: 710 VNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCSTFDVV 531 +GN + A+ LF +M+ +VI ESF FVK LSA+ EVE F++M C D+ Sbjct: 460 EDGNADMAMRLFCEMLDMGYVINLESFLAFVKGLSAKEKAFEVEKFFEEMSRRCPGIDIH 519 Query: 530 TYQSILDEYVCGN 492 Y+ ILDE++C N Sbjct: 520 KYRRILDEHLCKN 532 Score = 77.4 bits (189), Expect(2) = 5e-41 Identities = 37/87 (42%), Positives = 56/87 (64%), Gaps = 4/87 (4%) Frame = -3 
Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 THTS+LKG + GK ++A +H KEMV LGM D K+Y ++ E+CK+G+ D+A +LK+M Sbjct: 309 THTSILKGLCVVGKLDDAARHLKEMVGLGMEADAKAYGVVVNEYCKIGKADDAISLLKEM 368 Query: 970 RGR----VITCCNAVLRSYTKLQEFDK 902 + R ++ NAV R + + DK Sbjct: 369 KSRGINPSVSSFNAVFRILVESGKTDK 395 >ref|XP_006474468.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Citrus sinensis] gi|343887304|dbj|BAK61850.1| PPR containing protein [Citrus unshiu] Length = 567 Score = 115 bits (287), Expect(2) = 3e-36 Identities = 60/144 (41%), Positives = 86/144 (59%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L E + I LKQM Q CLPNF+SY+ +I GL A+GRMQ +R+GH L Sbjct: 412 LVENGELDRAILLLKQMPQMDCLPNFVSYNTIICGLCMAKGRMQDVEDLVDRMIRSGHNL 471 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D ++YS L++ C GNVE + + +M+ ++VI ESF V VK+L A+G + E E LF Sbjct: 472 DFTMYSCLLKGYCEEGNVENVMQIAHEMVTKKYVIGLESFSVLVKQLCAKGKVTEAEKLF 531 Query: 569 DQMRNSCSTFDVVTYQSILDEYVC 498 D + C DV +Y+ +LD+ +C Sbjct: 532 DTC-SRCPAVDVDSYRRVLDQQIC 554 Score = 65.1 bits (157), Expect(2) = 3e-36 Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 4/87 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 TH S+LKG + GK ++A+ + + ++ M+ D+KSY ++ FCK+G+ DEA +LK+M Sbjct: 334 THKSMLKGLCVVGKFDQAVGYLRNVMEANMNPDVKSYEVVINGFCKIGKSDEAISLLKEM 393 Query: 970 RGR----VITCCNAVLRSYTKLQEFDK 902 R R + NAV R + E D+ Sbjct: 394 RARGLKPTVFSFNAVFRILVENGELDR 420 >ref|XP_002308534.1| hypothetical protein POPTR_0006s23980g [Populus trichocarpa] gi|222854510|gb|EEE92057.1| hypothetical protein POPTR_0006s23980g [Populus trichocarpa] Length = 567 Score = 97.4 bits (241), Expect(2) = 9e-36 Identities = 53/141 (37%), Positives = 75/141 (53%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L E + + LKQ+ GCLPN +SYS VI GL + GRMQ +++G + Sbjct: 418 
LVEIGELDKAVLLLKQVKNMGCLPNLVSYSTVICGLCRSHGRMQEVAGLVDDMLQDGFEM 477 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D++LYS L+ C GN E A+ F I +VI +SF FV + +G + E E +F Sbjct: 478 DATLYSCLVGGFCEAGNEEMAMRAFYDSINKNYVINLQSFSFFVNLMCGKGKVIEAEQIF 537 Query: 569 DQMRNSCSTFDVVTYQSILDE 507 M CS DV +YQ +LD+ Sbjct: 538 KDMCRRCSLVDVDSYQRVLDD 558 Score = 81.3 bits (199), Expect(2) = 9e-36 Identities = 41/87 (47%), Positives = 57/87 (65%), Gaps = 4/87 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 TH S+LKG S+AGKSEEAI +F EM+ GM LD K + ++ +CK+ +PDEA +LK+M Sbjct: 340 THLSILKGLSVAGKSEEAIGYFSEMIRKGMKLDAKEHEVVITAYCKMRKPDEAISLLKEM 399 Query: 970 R----GRVITCCNAVLRSYTKLQEFDK 902 + R + NAVLR ++ E DK Sbjct: 400 QAKGISRSVGSFNAVLRILVEIGELDK 426 >ref|XP_006453029.1| hypothetical protein CICLE_v10007870mg [Citrus clementina] gi|557556255|gb|ESR66269.1| hypothetical protein CICLE_v10007870mg [Citrus clementina] Length = 567 Score = 112 bits (279), Expect(2) = 1e-34 Identities = 59/144 (40%), Positives = 85/144 (59%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L E + I LKQM Q CLPNF+SY+ +I GL A+GRMQ +R+GH Sbjct: 412 LVENGELDRAILLLKQMPQMDCLPNFMSYNTIICGLCMAKGRMQDVEDLVDCMIRSGHNF 471 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D ++YS L++ C GNVE + + +M+ ++VI ESF V VK+L A+G + E E LF Sbjct: 472 DFTMYSCLLKGYCEEGNVENVMRIAHEMVTKKYVIGLESFSVLVKQLCAKGKVTEAEKLF 531 Query: 569 DQMRNSCSTFDVVTYQSILDEYVC 498 M + C DV +Y+ +LD+ +C Sbjct: 532 G-MCSRCPAVDVDSYRRVLDQQIC 554 Score = 63.2 bits (152), Expect(2) = 1e-34 Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 4/87 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 TH S+LKG + GK ++A+ + + M+ M+ D+KSY ++ F K+G+ DEA +LK+M Sbjct: 334 THKSMLKGLCVVGKFDQAVGYLRNMIEANMNPDVKSYEVVVNGFGKIGKSDEAISLLKEM 393 Query: 970 RGR----VITCCNAVLRSYTKLQEFDK 902 R R + NAV R + E D+ Sbjct: 394 
RARGLKPTVFSFNAVFRILVENGELDR 420 >ref|XP_002516112.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223544598|gb|EEF46114.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 461 Score = 95.9 bits (237), Expect(2) = 4e-34 Identities = 53/131 (40%), Positives = 74/131 (56%) Frame = -2 Query: 899 IRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQ 720 I LKQM GC PNF+SY+ VI GL A+GRMQ + +G A+D+++YS L++ Sbjct: 321 IFLLKQMQGMGCRPNFISYNIVIGGLCGAKGRMQNVKELLHNMLCSGLAVDATMYSSLVK 380 Query: 719 ANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCSTF 540 C +GN E A + + I + +VI ESF VF ++ +G VE + +M CS Sbjct: 381 GYCEDGNEEMAKQVLYEAIDNNYVIDSESFSVFANKMCEKGKAVGVENILKEMCKRCSVV 440 Query: 539 DVVTYQSILDE 507 DV Y ILDE Sbjct: 441 DVGNYWRILDE 451 Score = 77.4 bits (189), Expect(2) = 4e-34 Identities = 37/86 (43%), Positives = 59/86 (68%), Gaps = 4/86 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 TH S+LKG AGKS+EA+ +FKEM+ GM D+K+Y+++ E+CK+ +P+EA +LK+M Sbjct: 233 THMSILKGLCYAGKSDEAVNYFKEMIRKGMKCDVKAYAVVINEYCKMKKPNEAIALLKEM 292 Query: 970 RGR----VITCCNAVLRSYTKLQEFD 905 + + ++ NAV++ KL E D Sbjct: 293 KAKGINPSVSSFNAVIQILMKLGEPD 318 >ref|XP_007136868.1| hypothetical protein PHAVU_009G080600g [Phaseolus vulgaris] gi|561009955|gb|ESW08862.1| hypothetical protein PHAVU_009G080600g [Phaseolus vulgaris] Length = 550 Score = 95.9 bits (237), Expect(2) = 2e-33 Identities = 51/123 (41%), Positives = 74/123 (60%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L + +I + LKQM Q C PNFLSY +I GL +GRMQ ++NGH L Sbjct: 417 LVDEGKIDEGVLLLKQMSQMRCSPNFLSYCILICGLCKVKGRMQAVEELVSDMLQNGHNL 476 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D+++YS L++ C +G+ E A+ F ++ FVIK++ F FVK L A+G +KE ET+F Sbjct: 477 DATMYSCLLEGYCEDGDEEMALKTFYDLMNKNFVIKQDIFCTFVKVLCAKGKLKEGETVF 536 Query: 569 DQM 561 + M Sbjct: 537 EGM 539 Score = 74.7 bits 
(182), Expect(2) = 2e-33 Identities = 37/77 (48%), Positives = 53/77 (68%), Gaps = 4/77 (5%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+TSLLKG + GKS+EA KH +EMVS GM D+K+Y ++ E+CK+G+P EA +L++M Sbjct: 339 TNTSLLKGLCIVGKSDEAFKHLREMVSRGMKPDVKAYGVVVNEYCKIGKPREAVSLLREM 398 Query: 970 RGR----VITCCNAVLR 932 R ++ NAV R Sbjct: 399 VTRGVKPSVSSFNAVFR 415 >gb|EXB48288.1| hypothetical protein L484_003771 [Morus notabilis] Length = 563 Score = 97.8 bits (242), Expect(2) = 3e-32 Identities = 48/131 (36%), Positives = 73/131 (55%) Frame = -2 Query: 890 LKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQANC 711 LKQM + GC PNF+SY VI L +G MQ + +GH D +LY L+++ C Sbjct: 421 LKQMPKMGCSPNFVSYCTVICSLCGMKGMMQEVEELVHYMIHDGHIPDLALYCCLVKSYC 480 Query: 710 VNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCSTFDVV 531 +GNV+KA+ +F + + ++I +SF + L A G KE T+F + CS D Sbjct: 481 EDGNVDKAMRVFNETLDRNYIINLDSFSALINALCAAGRHKEAITIFKDVSRRCSKLDRN 540 Query: 530 TYQSILDEYVC 498 TY+ ++DE +C Sbjct: 541 TYKKVVDELLC 551 Score = 68.9 bits (167), Expect(2) = 3e-32 Identities = 36/87 (41%), Positives = 56/87 (64%), Gaps = 4/87 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 THT++LKG + GK +EA +EMV LGM+LD+K+Y ++ E C+ +P+EA +L++M Sbjct: 330 THTTILKGLCITGKLDEAFAVLREMVFLGMNLDVKAYGVVLNECCRRRKPEEAISLLREM 389 Query: 970 R----GRVITCCNAVLRSYTKLQEFDK 902 R ++ NAVLRS + E D+ Sbjct: 390 RVQGLNPHVSSFNAVLRSLGENGELDR 416 >ref|XP_007224434.1| hypothetical protein PRUPE_ppa021491mg [Prunus persica] gi|462421370|gb|EMJ25633.1| hypothetical protein PRUPE_ppa021491mg [Prunus persica] Length = 557 Score = 95.9 bits (237), Expect(2) = 9e-32 Identities = 53/132 (40%), Positives = 70/132 (53%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L E + I L +M Q GC PNF SY+ VI L GRM + NGH L Sbjct: 376 LVENGDLERAIILLNKMNQMGCSPNFFSYNTVICSLCNLRGRMGEVEEFVGDMLWNGHKL 435 Query: 749 
DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D+ LYS +I C +GNV AI F + + +I ESF + VKEL A+G++ E E +F Sbjct: 436 DTVLYSCVIMGYCEDGNVNMAIQAFCGALDNNHIISLESFSILVKELCAKGMVLEAERIF 495 Query: 569 DQMRNSCSTFDV 534 + M N C+ DV Sbjct: 496 EDMCNRCTVVDV 507 Score = 69.3 bits (168), Expect(2) = 9e-32 Identities = 28/63 (44%), Positives = 49/63 (77%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 THT++LKG + GK++EA+KH +++V+LGM D+++Y ++ E+CK+ +PD A +L++M Sbjct: 298 THTAILKGLCIVGKADEAVKHLQDIVNLGMKPDVEAYGVVFNEYCKMRKPDGAMSILREM 357 Query: 970 RGR 962 R R Sbjct: 358 RMR 360 >ref|XP_006578089.1| PREDICTED: pentatricopeptide repeat-containing protein At4g11690-like [Glycine max] Length = 551 Score = 90.9 bits (224), Expect(2) = 3e-31 Identities = 49/127 (38%), Positives = 72/127 (56%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L + +I + LKQM + GC PNFLSY VI GL +GRMQ ++NGH L Sbjct: 414 LVDEGKIDEGLHLLKQMPKMGCSPNFLSYCTVICGLCEVKGRMQQVEELVSNMLQNGHNL 473 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D+++Y+ L+ C + + E A ++ FVI ++ F FVK L A+G +KE ET+ Sbjct: 474 DATMYNCLLLGYCEDRDEEMAQKTVYDIMDKNFVINQDIFCTFVKLLCAKGKLKEAETVS 533 Query: 569 DQMRNSC 549 ++MR C Sbjct: 534 EEMRRRC 540 Score = 72.8 bits (177), Expect(2) = 3e-31 Identities = 36/77 (46%), Positives = 54/77 (70%), Gaps = 4/77 (5%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+TSLLKG+ + GKS+EA+KH +EMVS GM D+K+Y ++ E+CK+ +P EA +L++M Sbjct: 336 TNTSLLKGFCIVGKSDEAVKHLREMVSRGMKPDVKAYGVVVNEYCKIRKPSEAVLLLREM 395 Query: 970 RGR----VITCCNAVLR 932 R ++ NAV R Sbjct: 396 VVRGVKPNVSSFNAVFR 412 >ref|XP_006849607.1| hypothetical protein AMTR_s00024p00204830 [Amborella trichopoda] gi|548853182|gb|ERN11188.1| hypothetical protein AMTR_s00024p00204830 [Amborella trichopoda] Length = 550 Score = 91.3 bits (225), Expect(2) = 1e-25 Identities = 52/139 (37%), Positives = 68/139 (48%) Frame = 
-2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L E I I L QM + GC PNF+SYS VI L +GRMQ +RNGH Sbjct: 407 LCEGGEINRAIDILNQMPKKGCSPNFVSYSTVICSLCKCKGRMQKVEDLVVVMLRNGHKP 466 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLF 570 D S+YS ++ C N ++ A+ +F +MI I + F VKEL G E E +F Sbjct: 467 DISMYSEMVMGYCENDDLAMALGVFHEMIEKGLAINVQGFSAIVKELRQNGRTSEAENVF 526 Query: 569 DQMRNSCSTFDVVTYQSIL 513 +M C D +Y IL Sbjct: 527 KEMFRRCHVPDEESYARIL 545 Score = 53.1 bits (126), Expect(2) = 1e-25 Identities = 26/66 (39%), Positives = 43/66 (65%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+TSLLKG+ + K ++A K +EMVS G+ D K+Y ++ +CK+GR +A +L++M Sbjct: 329 TYTSLLKGFCILEKLDDAAKVLEEMVSKGLKPDSKAYGVLLNAYCKVGRAVDAIHVLERM 388 Query: 970 RGRVIT 953 +T Sbjct: 389 VNAGVT 394 >ref|XP_007012378.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] gi|508782741|gb|EOY29997.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 722 Score = 68.9 bits (167), Expect(2) = 2e-23 Identities = 36/87 (41%), Positives = 56/87 (64%), Gaps = 4/87 (4%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 TH S+LKG + G+S+EAI++F+ MV M LD K+Y ++ +CKL + DEA +LK+M Sbjct: 335 THMSILKGLCVVGRSKEAIEYFRWMVRCNMDLDAKAYGIVVNVYCKLRKLDEAILLLKEM 394 Query: 970 RGRVI----TCCNAVLRSYTKLQEFDK 902 GR I + N+V R+ + +E D+ Sbjct: 395 SGRGIYPNVSSFNSVFRTLVESRELDR 421 Score = 68.2 bits (165), Expect(2) = 2e-23 Identities = 35/89 (39%), Positives = 51/89 (57%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L E+ + I LKQM Q GC PN LSYS VI L AEGRMQ ++NG + Sbjct: 413 LVESRELDRAIMLLKQMPQLGCSPNLLSYSTVICSLCRAEGRMQEVRYLVDDMLQNGIVI 472 Query: 749 DSSLYSLLIQANCVNGNVEKAIDLFEKMI 663 D+++Y +++ + +GN E A+ + +MI Sbjct: 473 DATMYGCIVEGHSEDGNEEMAVQVLNEMI 501 >ref|XP_006606631.1| PREDICTED: pentatricopeptide repeat-containing protein At1g62720-like 
[Glycine max] Length = 280 Score = 57.4 bits (137), Expect(2) = 3e-15 Identities = 28/80 (35%), Positives = 45/80 (56%) Frame = -2 Query: 929 LYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHAL 750 L + ++ + LKQM + GC PNFL Y +I GL +GRMQ ++NGH L Sbjct: 194 LVDEGKLNKRVILLKQMPKMGCSPNFLFYCTMICGLCKVKGRMQQVEELILDMLQNGHNL 253 Query: 749 DSSLYSLLIQANCVNGNVEK 690 D+++Y+ L+ C + + E+ Sbjct: 254 DATMYNCLLAGYCQDRDEER 273 Score = 52.4 bits (124), Expect(2) = 3e-15 Identities = 30/78 (38%), Positives = 49/78 (62%), Gaps = 5/78 (6%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAI-EFCKLGRPDEAAGMLKK 974 T+T+++ G G E A K F+EMVS GM+ D+K+Y ++ + E+CK+ +P EA +L++ Sbjct: 115 TYTTMICGLCKVGMVESARKVFEEMVSHGMNPDVKAYRVVVVNEYCKIRKPSEAVLLLRE 174 Query: 973 MRGRVI----TCCNAVLR 932 M R + + NAV R Sbjct: 175 MVVRRVKPSMSSFNAVFR 192 >ref|XP_004985596.1| PREDICTED: pentatricopeptide repeat-containing protein At5g01110-like [Setaria italica] Length = 650 Score = 53.5 bits (127), Expect(2) = 7e-13 Identities = 36/147 (24%), Positives = 67/147 (45%) Frame = -2 Query: 944 CCVEELYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMR 765 C + EA RIR +MV GCLP+ ++Y+ ++ GL + ++ Sbjct: 344 CRAGSMSEALRIR------DEMVGCGCLPDVVTYNTLLSGL-CKQRKLLDAEELLNEMKE 396 Query: 764 NGHALDSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKE 585 G D ++ LI C GN+EKA+ LF+ ++ R +++ + + +G + + Sbjct: 397 RGVTPDLCTFTTLIHGYCREGNIEKALQLFDTLLHQRLRPDVVTYNSLIDGMCRKGDLTK 456 Query: 584 VETLFDQMRNSCSTFDVVTYQSILDEY 504 L+D M + VTY ++D + Sbjct: 457 ANELWDDMHALEIFPNHVTYSILIDSH 483 Score = 48.1 bits (113), Expect(2) = 7e-13 Identities = 24/58 (41%), Positives = 36/58 (62%) Frame = -3 Query: 1138 LLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKMRG 965 L+ G+ G+ EEA+K +KEM G++ D+ S+S + F + G+ D AA L KMRG Sbjct: 269 LIGGFCRVGEVEEAVKFYKEMQQRGVTPDMVSFSCLIGLFSRRGKMDRAAEYLSKMRG 326 >ref|XP_004498166.1| PREDICTED: pentatricopeptide repeat-containing protein At5g39710-like isoform X1 [Cicer arietinum] 
gi|502123561|ref|XP_004498167.1| PREDICTED: pentatricopeptide repeat-containing protein At5g39710-like isoform X2 [Cicer arietinum] Length = 749 Score = 50.4 bits (119), Expect(2) = 1e-12 Identities = 31/125 (24%), Positives = 64/125 (51%) Frame = -2 Query: 884 QMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQANCVN 705 +MV+ G LP++++YS +I GL + ++ + G D Y+ L+ CV Sbjct: 468 EMVEKGILPDYVTYSSLIQGL-CRQRKLSEAFDLFREMVLVGLLPDEVTYTSLMNGYCVE 526 Query: 704 GNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCSTFDVVTY 525 G + KA+DL ++M+ F+ ++ V + L+ + +E + L ++ S + VTY Sbjct: 527 GELSKALDLHDEMMKKGFLPDVVTYSVLINGLNKKARTREAKKLLLKLFYDESVPNDVTY 586 Query: 524 QSILD 510 ++++ Sbjct: 587 NTLIE 591 Score = 50.1 bits (118), Expect(2) = 1e-12 Identities = 26/60 (43%), Positives = 35/60 (58%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+T+L+ G+ G EA K EM+ G S + +Y+ I FC LGR DEA G+LK M Sbjct: 375 TYTTLVDGFCRLGLMNEAYKVLSEMIDSGFSPSVVTYNAIIHGFCCLGRVDEAVGVLKGM 434 Score = 44.3 bits (103), Expect(2) = 4e-06 Identities = 32/150 (21%), Positives = 64/150 (42%) Frame = -2 Query: 938 VEELYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNG 759 + L R++ T+ +++M G P+ ++Y+ ++ G EG G Sbjct: 275 INGLCSQGRMKETMEVIREMNLKGLSPDCVTYNTLVNGF-CKEGNFHQGFVLLHEMAGKG 333 Query: 758 HALDSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVE 579 + + Y+ LI C N+ +A+++ + M + ++ V GLM E Sbjct: 334 LSPNVVTYTTLINGMCKVKNLSRALEILDHMRVRGLSPNERTYTTLVDGFCRLGLMNEAY 393 Query: 578 TLFDQMRNSCSTFDVVTYQSILDEYVCGNR 489 + +M +S + VVTY +I+ + C R Sbjct: 394 KVLSEMIDSGFSPSVVTYNAIIHGFCCLGR 423 Score = 34.3 bits (77), Expect(2) = 4e-06 Identities = 15/60 (25%), Positives = 34/60 (56%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+ +++G AG + ++ +EM + G ++ +Y+ + +CK + ++A G+LK M Sbjct: 200 TYNVMIRGMVSAGNLDSGLRLIREMETRGCLPNVVTYNTMITAYCKENKIEDAFGLLKIM 259 >ref|XP_007049304.1| Pentatricopeptide repeat-containing protein, putative isoform 1 
[Theobroma cacao] gi|508701565|gb|EOX93461.1| Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao] Length = 909 Score = 58.9 bits (141), Expect(2) = 3e-12 Identities = 34/146 (23%), Positives = 68/146 (46%) Frame = -2 Query: 938 VEELYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNG 759 + L E R+ I + M WGC PN +Y+ +I GL + +++ ++NG Sbjct: 300 ISSLCEFGRVDEAIEIVGSMRTWGCYPNVQTYTALISGLFRVQ-KLEMAVGFYHKMVKNG 358 Query: 758 HALDSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVE 579 + Y++LI C G A+D+F M+ + ++++ +K L G ++ Sbjct: 359 LVPSTVTYNVLINELCAEGRFAIALDIFNWMLRHSTLPNTQTYNEIIKALCLMGDTEKAM 418 Query: 578 TLFDQMRNSCSTFDVVTYQSILDEYV 501 LF +M + ++TY +++ Y+ Sbjct: 419 ALFHKMLRIGPSPTLITYNTLIGGYL 444 Score = 40.4 bits (93), Expect(2) = 3e-12 Identities = 22/60 (36%), Positives = 33/60 (55%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+TSL+ GY + A + F +MV G + +YS + C +GR DEA GM ++M Sbjct: 225 TYTSLILGYCRNQNLDLAFEVFYKMVKEGCDPNSVTYSNLINGLCNVGRVDEALGMFEEM 284 >ref|XP_007049305.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] gi|590712142|ref|XP_007049306.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] gi|508701566|gb|EOX93462.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] gi|508701567|gb|EOX93463.1| Pentatricopeptide repeat-containing protein, putative isoform 2 [Theobroma cacao] Length = 716 Score = 58.9 bits (141), Expect(2) = 3e-12 Identities = 34/146 (23%), Positives = 68/146 (46%) Frame = -2 Query: 938 VEELYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNG 759 + L E R+ I + M WGC PN +Y+ +I GL + +++ ++NG Sbjct: 107 ISSLCEFGRVDEAIEIVGSMRTWGCYPNVQTYTALISGLFRVQ-KLEMAVGFYHKMVKNG 165 Query: 758 HALDSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVE 579 + Y++LI C G A+D+F M+ + ++++ +K L G ++ Sbjct: 166 LVPSTVTYNVLINELCAEGRFAIALDIFNWMLRHSTLPNTQTYNEIIKALCLMGDTEKAM 225 Query: 578 
TLFDQMRNSCSTFDVVTYQSILDEYV 501 LF +M + ++TY +++ Y+ Sbjct: 226 ALFHKMLRIGPSPTLITYNTLIGGYL 251 Score = 40.4 bits (93), Expect(2) = 3e-12 Identities = 22/60 (36%), Positives = 33/60 (55%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+TSL+ GY + A + F +MV G + +YS + C +GR DEA GM ++M Sbjct: 32 TYTSLILGYCRNQNLDLAFEVFYKMVKEGCDPNSVTYSNLINGLCNVGRVDEALGMFEEM 91 >ref|XP_002319601.2| hypothetical protein POPTR_0013s03290g [Populus trichocarpa] gi|550324834|gb|EEE95524.2| hypothetical protein POPTR_0013s03290g [Populus trichocarpa] Length = 497 Score = 55.5 bits (132), Expect(2) = 3e-12 Identities = 33/132 (25%), Positives = 63/132 (47%) Frame = -2 Query: 899 IRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQ 720 ++ LK+M + GC PN ++Y+ +I L + + ++ G D YS ++ Sbjct: 102 LQLLKKMEEKGCKPNVVAYNTIIDSL-CKDRLVTEAMDFFSEMVKEGIPPDVFTYSSILH 160 Query: 719 ANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMRNSCSTF 540 C G V +A LF++M+ + K +F + + L + ++ E +F+ M Sbjct: 161 GFCNLGRVNEATSLFKQMVERNVIPNKVTFTILIDGLCKKRMISEAWLVFETMTEKGLEP 220 Query: 539 DVVTYQSILDEY 504 DV TY +++D Y Sbjct: 221 DVYTYNALVDGY 232 Score = 43.9 bits (102), Expect(2) = 3e-12 Identities = 24/60 (40%), Positives = 32/60 (53%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T +LL G K +A+K F EMV +G D+ +YS I CK+G A +LKKM Sbjct: 49 TFNTLLSGLCSKAKIMDAVKLFDEMVKMGHEPDVITYSTIINGLCKMGNTTMALQLLKKM 108 Score = 46.6 bits (109), Expect(2) = 8e-10 Identities = 35/126 (27%), Positives = 57/126 (45%), Gaps = 6/126 (4%) Frame = -2 Query: 890 LKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNGHALDSSLYSLLIQANC 711 LK+M +G LPN ++YS V+ GL G + + + +Y++LI+ C Sbjct: 315 LKEMCSYGLLPNLITYSIVLDGL-CKHGHLDEAFELLKAMQESKIEPNIFIYTILIEGMC 373 Query: 710 VNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVETLFDQMR------NSC 549 G +E A +LF + ++ V + L GL E LF +M NSC Sbjct: 374 TFGKLEAARELFSNLFVKGIQPTVVTYTVMISGLLKGGLSNEACELFREMAVNGCLPNSC 433 Query: 548 STFDVV 531 T++V+ Sbjct: 434 -TYNVI 
438 Score = 44.7 bits (104), Expect(2) = 8e-10 Identities = 23/66 (34%), Positives = 39/66 (59%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+ +L+ GY + +EA K F M G + +++SY+++ CK GR DEA G+L +M Sbjct: 224 TYNALVDGYCSRSQMDEAQKLFNIMDRKGCAPNVRSYNILINGHCKSGRIDEAKGLLAEM 283 Query: 970 RGRVIT 953 + +T Sbjct: 284 SHKSLT 289 Score = 47.4 bits (111), Expect(2) = 4e-07 Identities = 23/61 (37%), Positives = 39/61 (63%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+++L++G+ G+ +EA + KEM S G+ +L +YS++ CK G DEA +LK M Sbjct: 294 TYSTLMRGFCQVGRPQEAQELLKEMCSYGLLPNLITYSIVLDGLCKHGHLDEAFELLKAM 353 Query: 970 R 968 + Sbjct: 354 Q 354 Score = 34.7 bits (78), Expect(2) = 4e-07 Identities = 29/104 (27%), Positives = 45/104 (43%) Frame = -2 Query: 941 CVEELYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRN 762 C EAAR + F+K G P ++Y+ +I GL+ G N Sbjct: 373 CTFGKLEAARELFSNLFVK-----GIQPTVVTYTVMISGLLKG-GLSNEACELFREMAVN 426 Query: 761 GHALDSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESF 630 G +S Y+++IQ NG+ A+ L E+M+G F +F Sbjct: 427 GCLPNSCTYNVIIQGFLRNGDTPNAVRLIEEMVGKGFSADSSTF 470 >ref|XP_004139858.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] gi|449530677|ref|XP_004172320.1| PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like [Cucumis sativus] Length = 839 Score = 62.0 bits (149), Expect(2) = 4e-12 Identities = 39/145 (26%), Positives = 73/145 (50%) Frame = -2 Query: 938 VEELYEAARIR*TIRFLKQMVQWGCLPNFLSYSEVIFGLVAAEGRMQXXXXXXXXXMRNG 759 + L +A R ++ L +M + GC PN +Y+ +I GL + +G+ + + +G Sbjct: 314 IVSLCDAGRSCEAVKLLGKMKKRGCGPNVQTYTALISGL-SRDGKFEVAIGVYHKMLADG 372 Query: 758 HALDSSLYSLLIQANCVNGNVEKAIDLFEKMIGDRFVIKKESFDVFVKELSARGLMKEVE 579 + YS LI V G E A+ +FE M+ + E+++V +K + G +++ Sbjct: 373 LVPTAVTYSALINQLYVEGRFETALTIFEWMLSHDSLPNTETYNVIIKGFCSIGYIQKAT 432 Query: 578 TLFDQMRNSCSTFDVVTYQSILDEY 504 +FDQM + + +V+TY I+ Y Sbjct: 433 AIFDQMLKAGPSPNVITYNIIIHIY 457 
Score = 37.0 bits (84), Expect(2) = 4e-12 Identities = 21/60 (35%), Positives = 32/60 (53%) Frame = -3 Query: 1150 THTSLLKGYSMAGKSEEAIKHFKEMVSLGMSLDLKSYSMIAIEFCKLGRPDEAAGMLKKM 971 T+TSL+ G+ G + A + F MV G + +YS + C GR +EA ML++M Sbjct: 239 TYTSLIIGHCKNGNLDLAFEMFDRMVKDGCDPNSVTYSALINGLCSEGRLEEAMDMLEEM 298