BLASTX nr result

ID: Mentha26_contig00035995 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00035995
         (508 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU19588.1| hypothetical protein MIMGU_mgv1a018605mg [Mimulus...   214   1e-53
ref|XP_002274427.1| PREDICTED: pentatricopeptide repeat-containi...   192   5e-47
ref|XP_007038851.1| Pentatricopeptide repeat-containing protein,...   188   8e-46
ref|XP_002513633.1| pentatricopeptide repeat-containing protein,...   179   4e-43
ref|XP_007220608.1| hypothetical protein PRUPE_ppa001496mg [Prun...   174   9e-42
ref|XP_002305943.2| hypothetical protein POPTR_0004s07030g [Popu...   172   4e-41
ref|XP_007143480.1| hypothetical protein PHAVU_007G075500g [Phas...   169   4e-40
ref|XP_004496516.1| PREDICTED: pentatricopeptide repeat-containi...   166   4e-39
gb|EXC05947.1| hypothetical protein L484_014215 [Morus notabilis]     165   5e-39
ref|XP_003592182.1| Pentatricopeptide repeat-containing protein ...   164   9e-39
ref|XP_006281960.1| hypothetical protein CARUB_v10028179mg [Caps...   163   3e-38
ref|XP_006589520.1| PREDICTED: pentatricopeptide repeat-containi...   160   1e-37
ref|NP_200097.1| pentatricopeptide repeat-containing protein [Ar...   160   2e-37
ref|XP_006401775.1| hypothetical protein EUTSA_v10012630mg [Eutr...   159   3e-37
ref|XP_002865930.1| pentatricopeptide repeat-containing protein ...   159   5e-37
gb|EMT17957.1| hypothetical protein F775_08872 [Aegilops tauschii]    147   2e-33
ref|XP_002466053.1| hypothetical protein SORBIDRAFT_01g000260 [S...   145   4e-33
ref|XP_006357376.1| PREDICTED: putative pentatricopeptide repeat...   141   1e-31
ref|XP_003559087.1| PREDICTED: pentatricopeptide repeat-containi...   140   1e-31
tpg|DAA52614.1| TPA: hypothetical protein ZEAMMB73_283558 [Zea m...   140   2e-31

>gb|EYU19588.1| hypothetical protein MIMGU_mgv1a018605mg [Mimulus guttatus]
          Length = 737

 Score =  214 bits (544), Expect = 1e-53
 Identities = 111/168 (66%), Positives = 129/168 (76%), Gaps = 1/168 (0%)
 Frame = -3

Query: 503  SGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALST 324
            SGF  W SVLNGL+DFYGK  C+ +AQ+AF+E+ +PD  SWN LI+ FA N  T SALS 
Sbjct: 519  SGFVGWKSVLNGLIDFYGKCGCVSDAQKAFDEIPEPDIFSWNGLIYGFAHNRLTTSALSA 578

Query: 323  LEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLLG 144
            LEDMRLAGA+P SFTL TVL  C+Q GL D+  EYF  LR LYD+KP+L H  LLVDLLG
Sbjct: 579  LEDMRLAGARPDSFTLSTVLFACSQDGLADLGVEYFHSLRELYDIKPQLSHCNLLVDLLG 638

Query: 143  RAGRLEEAVSLLKSIPF-RPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
             AGRLEEAVSL++SIPF R NA IYKRLL ACKLH  +LL EE+A +G
Sbjct: 639  WAGRLEEAVSLVESIPFIRNNASIYKRLLCACKLHGKLLLGEEIARRG 686


>ref|XP_002274427.1| PREDICTED: pentatricopeptide repeat-containing protein At5g52850,
            chloroplastic [Vitis vinifera]
            gi|302143764|emb|CBI22625.3| unnamed protein product
            [Vitis vinifera]
          Length = 880

 Score =  192 bits (487), Expect = 5e-47
 Identities = 93/168 (55%), Positives = 121/168 (72%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSG G W+SV NGLVD YGK  C+ +A R+F E+ +PD VSWN LI   A N   +SALS
Sbjct: 548  KSGLGSWISVSNGLVDLYGKCGCIHDAHRSFLEITEPDAVSWNGLIFGLASNGHVSSALS 607

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              EDMRLAG +P   T + VL  C+ GGLVD+  +YF  +R  + ++P+L HY  LVDLL
Sbjct: 608  AFEDMRLAGVEPDQITCLLVLYACSHGGLVDMGLDYFQSMREKHGIRPQLDHYVCLVDLL 667

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GRAGRLEEA+++++++PF+P+ALIYK LL ACKLH N+ L E MA +G
Sbjct: 668  GRAGRLEEAMNVIETMPFKPDALIYKTLLGACKLHGNIPLGEHMARQG 715



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 31/84 (36%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
 Frame = -3

Query: 503 SGFGQWMSVLNGLVDFYGKSRCMVE-AQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
           +G    +SV N LVD Y K   M+E A RAF  +  P+ +SW +LI  F+ +     ++ 
Sbjct: 346 AGLENDVSVGNSLVDMYMKCSNMIEDAVRAFRGIASPNVISWTSLIAGFSEHGLEEESIK 405

Query: 326 TLEDMRLAGAKPFSFTLMTVLNIC 255
               M+  G +P SFTL T+L  C
Sbjct: 406 VFGAMQGVGVRPNSFTLSTILGAC 429


>ref|XP_007038851.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao] gi|508776096|gb|EOY23352.1| Pentatricopeptide
            repeat-containing protein, putative [Theobroma cacao]
          Length = 884

 Score =  188 bits (477), Expect = 8e-46
 Identities = 91/168 (54%), Positives = 120/168 (71%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSG G+W+SV NGLVD YGK  C+ +AQRAF E+  PD  SWN LI   A     +SALS
Sbjct: 551  KSGLGRWVSVANGLVDLYGKCGCICDAQRAFGEITVPDIFSWNGLISGLASIGSISSALS 610

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              +DMRLAG +P S T + +L+ CN G LVD+  EYF  +R ++D+ P+L HY  LVD+L
Sbjct: 611  AFDDMRLAGVRPDSVTFLLLLSACNNGKLVDLGLEYFQSMREVHDIVPQLDHYVHLVDIL 670

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GR GRLEEA+ +++++PFR +A IYK LL ACK H+N+ LAE+MA +G
Sbjct: 671  GRGGRLEEAMEVVQTMPFRADASIYKTLLRACKAHRNIPLAEDMARRG 718



 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 46/132 (34%), Positives = 66/132 (50%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
           K GF Q   +++GL+DFY K     EA + F  V   DTVSW  +I SF   +  + AL 
Sbjct: 147 KQGFEQNPILVSGLLDFYSKFNFTGEAYKLFIYVGNHDTVSWTTMISSFVQAQRWSKALL 206

Query: 326 TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              DM  AG  P  FT + +L +C+  GL      +  +L  L  VK  +     LVD+ 
Sbjct: 207 LYVDMVEAGVPPNEFTFVKLLGVCSVLGLKYGKLVHAHML--LRGVKLNVVVKTALVDMY 264

Query: 146 GRAGRLEEAVSL 111
            R  R+E+A+ +
Sbjct: 265 ARCQRMEDAIKV 276


>ref|XP_002513633.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223547541|gb|EEF49036.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 777

 Score =  179 bits (454), Expect = 4e-43
 Identities = 86/168 (51%), Positives = 119/168 (70%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSG    +SV NGL+D YGK   + EA+RAF E+ +PD VSWN LI   A N   +SALS
Sbjct: 552  KSGLSCCLSVANGLIDLYGKYGLVHEARRAFTEITEPDVVSWNGLISGLASNGHISSALS 611

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              +DMRL G +P S T + VL+ C+ GGLVD+  +YF  +R ++DV+P+  HY  LVD+L
Sbjct: 612  AFDDMRLRGIQPDSITFLLVLSTCSHGGLVDMGLQYFHSMREMHDVEPQSDHYVCLVDIL 671

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GRAGRLEEA+++++++P  P+A IYK LL AC +H+NM L E++A +G
Sbjct: 672  GRAGRLEEAMNIIETMPLEPDASIYKTLLAACSIHRNMNLGEDVARRG 719



 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 42/141 (29%), Positives = 71/141 (50%), Gaps = 5/141 (3%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVE-AQRAFEEVCKPDTVSWNALIHSFALNECTASAL 330
           ++G    + V N LVD Y K  C+VE   R F  +  P+ +SW +LI  FA +     +L
Sbjct: 349 RTGLEDDVPVGNALVDMYMKCSCIVEHGLRMFRGIKSPNVISWTSLIAGFAEHGFQQDSL 408

Query: 329 STLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYAL---- 162
           +   +MR  G +P SFTL  VL +C+      +   Y  L  + + +K +  +  +    
Sbjct: 409 NLFMEMRTVGVQPNSFTLSIVLRVCSA-----IKSPYQTLKLHGHIIKTKADYDVVVGNA 463

Query: 161 LVDLLGRAGRLEEAVSLLKSI 99
           LVD    +GR+++A  ++K +
Sbjct: 464 LVDAYAGSGRVDDAWRVVKDM 484


>ref|XP_007220608.1| hypothetical protein PRUPE_ppa001496mg [Prunus persica]
           gi|462417070|gb|EMJ21807.1| hypothetical protein
           PRUPE_ppa001496mg [Prunus persica]
          Length = 814

 Score =  174 bits (442), Expect = 9e-42
 Identities = 87/168 (51%), Positives = 117/168 (69%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
           K+G    +SV N LVD YGK  C  +A RAF+ + +PD VSWN LI   A     +SALS
Sbjct: 473 KAGLASGISVSNALVDLYGKCGCTDDAYRAFKGISEPDIVSWNGLISGLASTGHISSALS 532

Query: 326 TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
           T +DMRLAG KP S T + VL  C+ GGLV++  E+F  +R  +++ P+L HYA LVDLL
Sbjct: 533 TFDDMRLAGFKPDSITFLLVLFACSHGGLVELGLEHFQSMREKHEIAPQLDHYACLVDLL 592

Query: 146 GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
           GRAGRLE+A+ ++ ++PF+P+ALIYK LL ACK H+N+ L E +A +G
Sbjct: 593 GRAGRLEDAMEVIMTMPFKPDALIYKTLLGACKSHRNIALGEYVARQG 640


>ref|XP_002305943.2| hypothetical protein POPTR_0004s07030g [Populus trichocarpa]
            gi|550340500|gb|EEE86454.2| hypothetical protein
            POPTR_0004s07030g [Populus trichocarpa]
          Length = 771

 Score =  172 bits (436), Expect = 4e-41
 Identities = 87/168 (51%), Positives = 115/168 (68%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSG G  +SV NGLV FYGK     +A+RAF E+ +PD VSWN LI   A     +SALS
Sbjct: 551  KSGLGSSISVSNGLVSFYGKCGLTRDAERAFAEIREPDIVSWNGLISVLASYGHISSALS 610

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              +DMRL G KP S T + VL  C   GLVD+  EYF+ ++ ++ ++P+L HY  L DLL
Sbjct: 611  AFDDMRLTGVKPDSVTFLLVLFTCTHCGLVDMGLEYFNSMKEMHGIEPQLDHYVCLFDLL 670

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GRAGRLEEA+ +L+++P RPNA IYK LL ACK+H+ + L E++A +G
Sbjct: 671  GRAGRLEEAMEILETMPIRPNASIYKTLLAACKVHRIVPLGEDIASRG 718


>ref|XP_007143480.1| hypothetical protein PHAVU_007G075500g [Phaseolus vulgaris]
            gi|561016670|gb|ESW15474.1| hypothetical protein
            PHAVU_007G075500g [Phaseolus vulgaris]
          Length = 882

 Score =  169 bits (428), Expect = 4e-40
 Identities = 83/165 (50%), Positives = 114/165 (69%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF    SV N LV  YGK   M +A RAF+++ +PDTVSWN LI   A N   + ALS
Sbjct: 552  KSGFEICNSVSNSLVHLYGKCGSMHDAYRAFKDIKEPDTVSWNGLISGLASNGHISDALS 611

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              +DMRLAG KP SFT +++++ C+QG L++   +YF  +   YD+ P+L HY  L+DLL
Sbjct: 612  AFDDMRLAGVKPDSFTFLSLISACSQGSLLNQGLDYFYSMEKTYDITPKLDHYVCLMDLL 671

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMA 12
            GR GRLEEA+ +++++PF P+++ YK LL ACKLH N+ L E+MA
Sbjct: 672  GRGGRLEEALGVIETMPFMPDSVTYKTLLNACKLHGNVPLGEDMA 716


>ref|XP_004496516.1| PREDICTED: pentatricopeptide repeat-containing protein At5g52850,
            chloroplastic-like [Cicer arietinum]
          Length = 885

 Score =  166 bits (419), Expect = 4e-39
 Identities = 82/165 (49%), Positives = 110/165 (66%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF ++ SV N LV  Y K      A RAF+++ KPD  SWN LI   ALN   + ALS
Sbjct: 555  KSGFQRFNSVSNSLVHLYSKCGSTHHAHRAFKDMNKPDQFSWNGLISGLALNGYISQALS 614

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              +DMRLAG KP S TL+++++ C+ GGL+D+  EYF  +   Y + P+L HY  LVDLL
Sbjct: 615  AFDDMRLAGVKPDSVTLLSLISACSHGGLLDLGLEYFYSMEKAYHITPKLDHYVCLVDLL 674

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMA 12
            GR GRLEEA  +++++PF P+++I K LL AC LH N+ L E+MA
Sbjct: 675  GRGGRLEEATGVIETMPFEPDSMICKTLLNACNLHGNVALGEDMA 719


>gb|EXC05947.1| hypothetical protein L484_014215 [Morus notabilis]
          Length = 805

 Score =  165 bits (418), Expect = 5e-39
 Identities = 82/168 (48%), Positives = 114/168 (67%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
           KSGFG   SV N LVD Y K     +A +AF E+  PD VSWN LI   A N   + ALS
Sbjct: 473 KSGFGGCTSVSNALVDLYWKCGYGNDAYKAFAEISDPDVVSWNGLISGLASNGYISGALS 532

Query: 326 TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
             +DMRLAG KP S + ++VL  C++G LV++  EYF  +++++++ P + HY  LVDLL
Sbjct: 533 AFDDMRLAGLKPDSVSFLSVLFACSRGNLVNLGIEYFHSMKSMHNMTPEIDHYICLVDLL 592

Query: 146 GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
           GRAG  E+A+ ++ S+PF+P+ L+YK LL ACKLH+N+ L E+MA +G
Sbjct: 593 GRAGLFEDAMEVIDSMPFKPDTLVYKTLLGACKLHRNIPLGEDMARRG 640


>ref|XP_003592182.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355481230|gb|AES62433.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 912

 Score =  164 bits (416), Expect = 9e-39
 Identities = 80/165 (48%), Positives = 113/165 (68%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF +  SV N LV  Y K   + +A RAF+++ +PD  SWN LI  F+ N   + ALS
Sbjct: 549  KSGFQRCHSVSNSLVHLYSKCGSIHDANRAFKDISEPDAFSWNGLISGFSWNGLISHALS 608

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
            T +DMRLAG KP S TL+++++ C+ GGL+++  EYF  ++  Y + P+L HY  LVDLL
Sbjct: 609  TFDDMRLAGVKPDSITLLSLISACSHGGLLELGLEYFHSMQKEYHITPKLDHYMCLVDLL 668

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMA 12
            GR GRLEEA+ +++ + F+P++LI K LL AC LH N+ L E+MA
Sbjct: 669  GRGGRLEEAMGVIEKMSFKPDSLICKTLLNACNLHGNVALGEDMA 713


>ref|XP_006281960.1| hypothetical protein CARUB_v10028179mg [Capsella rubella]
            gi|482550664|gb|EOA14858.1| hypothetical protein
            CARUB_v10028179mg [Capsella rubella]
          Length = 895

 Score =  163 bits (412), Expect = 3e-38
 Identities = 77/168 (45%), Positives = 113/168 (67%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF   +SV N L+D YGK   +  A++ FEE+  PD VSWN +I + A N C +SALS
Sbjct: 556  KSGFSSSVSVSNSLLDMYGKCGLLEHAKKVFEEIAIPDVVSWNGVISALASNGCISSALS 615

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              E+MR+ G +P S T + +L+ C+ G L ++  EYF  +  +++++P++ HY  LV +L
Sbjct: 616  AFEEMRMKGFEPDSVTFLILLSACSNGRLTEMGLEYFQSMTEIHNIEPQIDHYVHLVGIL 675

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GRA RLEEA  +++++  RPNALI+K LL AC+ H N+ L E+MA KG
Sbjct: 676  GRAARLEEATGVVETMQLRPNALIFKTLLRACRYHGNLALGEDMANKG 723


>ref|XP_006589520.1| PREDICTED: pentatricopeptide repeat-containing protein At5g52850,
            chloroplastic-like [Glycine max]
          Length = 881

 Score =  160 bits (406), Expect = 1e-37
 Identities = 79/165 (47%), Positives = 110/165 (66%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF +  SV N LV  Y K   M +A R F+++ +PD VSWN LI   A N   + ALS
Sbjct: 551  KSGFERCNSVSNSLVHSYSKCGSMRDAYRVFKDITEPDRVSWNGLISGLASNGLISDALS 610

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              +DMRLAG KP S T ++++  C+QG L++   +YF  +   Y + P+L HY  LVDLL
Sbjct: 611  AFDDMRLAGVKPDSVTFLSLIFACSQGSLLNQGLDYFYSMEKTYHITPKLDHYVCLVDLL 670

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMA 12
            GR GRLEEA+ +++++PF+P+++IYK LL AC LH N+ L E+MA
Sbjct: 671  GRGGRLEEAMGVIETMPFKPDSVIYKTLLNACNLHGNVPLGEDMA 715


>ref|NP_200097.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75171558|sp|Q9FLX6.1|PP430_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g52850, chloroplastic; Flags: Precursor
            gi|10177099|dbj|BAB10433.1| selenium-binding protein-like
            [Arabidopsis thaliana] gi|332008885|gb|AED96268.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 893

 Score =  160 bits (405), Expect = 2e-37
 Identities = 76/168 (45%), Positives = 112/168 (66%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF    SVLN LVD Y K   + +A++ FEE+  PD VSWN L+   A N   +SALS
Sbjct: 556  KSGFSGAASVLNSLVDMYSKCGSLEDAKKVFEEIATPDVVSWNGLVSGLASNGFISSALS 615

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              E+MR+   +P S T + +L+ C+ G L D+  EYF +++ +Y+++P++ HY  LV +L
Sbjct: 616  AFEEMRMKETEPDSVTFLILLSACSNGRLTDLGLEYFQVMKKIYNIEPQVEHYVHLVGIL 675

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GRAGRLEEA  +++++  +PNA+I+K LL AC+   N+ L E+MA KG
Sbjct: 676  GRAGRLEEATGVVETMHLKPNAMIFKTLLRACRYRGNLSLGEDMANKG 723


>ref|XP_006401775.1| hypothetical protein EUTSA_v10012630mg [Eutrema salsugineum]
            gi|557102865|gb|ESQ43228.1| hypothetical protein
            EUTSA_v10012630mg [Eutrema salsugineum]
          Length = 897

 Score =  159 bits (403), Expect = 3e-37
 Identities = 75/170 (44%), Positives = 114/170 (67%), Gaps = 2/170 (1%)
 Frame = -3

Query: 506  KSGFGQWMSVL--NGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASA 333
            KSG+   +SV   N L+D YGK   + +A++ FEE   PD V+WN L+   A N C +SA
Sbjct: 555  KSGYSSSVSVSVSNSLIDMYGKCGILEDAKKVFEETANPDVVTWNGLVSGLASNGCISSA 614

Query: 332  LSTLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVD 153
            LS  E+M++ G +P S T + +L+ C+ G L ++  EYF  ++ ++D++P++ HY  LV 
Sbjct: 615  LSAFEEMKMKGTEPDSVTFLILLSACSYGRLTEMGLEYFHSMKKIHDIEPQIEHYVHLVG 674

Query: 152  LLGRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            +LGRAGRLEEA  +++++P  PNALI+K LL AC+ H ++ L E+MA KG
Sbjct: 675  ILGRAGRLEEARGIVETMPLGPNALIFKTLLRACRYHGDLSLGEDMANKG 724


>ref|XP_002865930.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297311765|gb|EFH42189.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 878

 Score =  159 bits (401), Expect = 5e-37
 Identities = 75/168 (44%), Positives = 114/168 (67%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            KSGF   +SVLN LVD Y K   + +A++ FEE+  PD VSWN L+   A     +SALS
Sbjct: 555  KSGFSGAVSVLNSLVDMYSKCGSLEDAKKVFEEIAMPDVVSWNGLVSGLASIGRISSALS 614

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              E+MR+ G +P S T + +L+ C++G L ++  EYF  ++ +++++P++ HY  LV +L
Sbjct: 615  AFEEMRMKGTEPDSVTFLILLSACSKGRLTEMGLEYFQSMKTIHNMEPQIEHYVHLVGIL 674

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVKG 3
            GRAGRLEEA  +++++  +PNA+I+K LL AC+ H N+ L E+MA KG
Sbjct: 675  GRAGRLEEATGVVETMHLKPNAMIFKTLLRACRYHGNLSLGEDMANKG 722


>gb|EMT17957.1| hypothetical protein F775_08872 [Aegilops tauschii]
          Length = 597

 Score =  147 bits (370), Expect = 2e-33
 Identities = 72/168 (42%), Positives = 100/168 (59%), Gaps = 1/168 (0%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNE-CTASAL 330
           K G    +SV N L++ Y K +C+ +A+  F  + +P  VSWN LI   A N  C   AL
Sbjct: 386 KLGLSSQVSVSNSLINMYSKHKCVEDAKSVFHSIREPSVVSWNTLISGLAYNNGCYYEAL 445

Query: 329 STLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDL 150
           S  EDM LAGA+P S T   VL  C  G LVD+   +F+ +RN + V P   HY L +D+
Sbjct: 446 SVFEDMTLAGAQPDSITFSAVLYACTHGALVDIGINHFNSMRNSFGVSPERSHYTLFLDM 505

Query: 149 LGRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVK 6
           LGRAGRL EA   ++++P RP+  +YK LL  C+LH ++ +AE +  K
Sbjct: 506 LGRAGRLTEAACTIEAMPIRPDVSMYKNLLAFCELHNDLSVAENITRK 553



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 43/147 (29%), Positives = 69/147 (46%), Gaps = 6/147 (4%)
 Frame = -3

Query: 485 MSVLNGLVDFYGKSRC-MVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALSTLEDMR 309
           +SV N LVDFY KS   +++   AF    +P+ VSW ALI   A +     A +   +MR
Sbjct: 185 ISVCNALVDFYSKSSARLLDLLHAFSATDRPNVVSWTALIAGLARHGRDKDAFAAFAEMR 244

Query: 308 LAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYAL-----LVDLLG 144
            +  +P SFT+ T+L  C+     D       +  + Y +K  L    +     LV L  
Sbjct: 245 ASEVQPNSFTVSTLLKGCSSSSESDSFLHATKI--HAYVLKTSLGSLDVSVGNSLVHLYS 302

Query: 143 RAGRLEEAVSLLKSIPFRPNALIYKRL 63
           R  R+++A ++  ++    + L Y  L
Sbjct: 303 RFARMDDAWAVATTMACARDNLTYTSL 329


>ref|XP_002466053.1| hypothetical protein SORBIDRAFT_01g000260 [Sorghum bicolor]
           gi|241919907|gb|EER93051.1| hypothetical protein
           SORBIDRAFT_01g000260 [Sorghum bicolor]
          Length = 681

 Score =  145 bits (367), Expect = 4e-33
 Identities = 68/167 (40%), Positives = 104/167 (62%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
           K G    +S+ N L++ Y + +C+ +A+RAF+ + +P   SWNA+I   A N     ALS
Sbjct: 490 KLGLSGQVSLSNSLINMYSRCKCLEDAKRAFQSIREPSVGSWNAIISGMAFNASYTEALS 549

Query: 326 TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
             EDM LAGA+P   T   VL+ C++GGLVD+  ++F+ + NL+DV P+  HY   +D+L
Sbjct: 550 VFEDMILAGAQPDGVTFTVVLSTCSRGGLVDIGIKHFNSMTNLFDVSPQKSHYTWFLDML 609

Query: 146 GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVK 6
           GRAGR  E    ++++P +P+  IY+ LL  CKLH   ++ E +A K
Sbjct: 610 GRAGRFTEVAHTIEAMPVQPDISIYRTLLAYCKLHNAQVVGEYIAKK 656


>ref|XP_006357376.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g17630-like isoform X1 [Solanum tuberosum]
           gi|565382079|ref|XP_006357377.1| PREDICTED: putative
           pentatricopeptide repeat-containing protein
           At1g17630-like isoform X2 [Solanum tuberosum]
           gi|565382081|ref|XP_006357378.1| PREDICTED: putative
           pentatricopeptide repeat-containing protein
           At1g17630-like isoform X3 [Solanum tuberosum]
          Length = 727

 Score =  141 bits (355), Expect = 1e-31
 Identities = 68/156 (43%), Positives = 100/156 (64%)
 Frame = -3

Query: 473 NGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALSTLEDMRLAGAK 294
           NGLV+ Y K   + +    FE V K D +SWN +I  F ++   A+AL T E M  AG K
Sbjct: 466 NGLVNMYMKCGSLWKGNIVFEGVGKKDLISWNTMISGFGMHGLGATALETFEQMTSAGTK 525

Query: 293 PFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLLGRAGRLEEAVS 114
           P   T + VL+ C+  GLVD  ++ FD ++ ++ VKP++ HYA +VDLLGRAG L+ A  
Sbjct: 526 PDGITFVAVLSACSHAGLVDEGYKIFDQMKKVFGVKPQMEHYACMVDLLGRAGLLQRASE 585

Query: 113 LLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVK 6
           +++++P RPNA ++  LL +CK+HKN  +AEE A +
Sbjct: 586 MVQNMPMRPNACVWGALLNSCKMHKNTEVAEETAAQ 621


>ref|XP_003559087.1| PREDICTED: pentatricopeptide repeat-containing protein At5g52850,
            chloroplastic-like [Brachypodium distachyon]
          Length = 719

 Score =  140 bits (354), Expect = 1e-31
 Identities = 64/167 (38%), Positives = 99/167 (59%)
 Frame = -3

Query: 506  KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
            K G    +SV N L++ Y + +C+ +A   F+ + +P  VSWNALI   A N C   ALS
Sbjct: 509  KLGLNSQLSVSNSLINMYSQCKCLEDATCVFQSIKEPSVVSWNALISGLASNGCYYEALS 568

Query: 326  TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
              EDM L G +P   T   VL  C+ GG +D+   +F  ++ L+ + P+  HY L +D+L
Sbjct: 569  AFEDMALVGVQPDGVTFSIVLYACSHGGFIDIGISHFSSMKTLFGISPQRSHYTLFLDML 628

Query: 146  GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVK 6
            GRAGRL EA   + ++P +P+  +Y+ LL  C+LH ++++ E +A K
Sbjct: 629  GRAGRLAEAACTIDTMPVQPDLSMYRNLLAFCELHNDLVVGETIARK 675



 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 1/142 (0%)
 Frame = -3

Query: 485 MSVLNGLVDFYGKSR-CMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALSTLEDMR 309
           +SV N LVDFY KS  C+++    F  V +P+ VSW A I   A +     A +   +MR
Sbjct: 311 ISVCNALVDFYSKSSTCLLDLLHTFNAVDRPNVVSWTAFIAGLARHGRDEDAFAAFAEMR 370

Query: 308 LAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLLGRAGRL 129
             G +P SFT+ T+L  C+       A +    +         +     LV+L  R  R+
Sbjct: 371 AGGVQPNSFTISTLLKGCSSSQSFLHATKIHAYVLKTSSESLNVAVGNSLVNLYSRFARM 430

Query: 128 EEAVSLLKSIPFRPNALIYKRL 63
           ++A ++  ++ F  ++  Y  L
Sbjct: 431 DDAWAVATTMAFVRDSFTYTSL 452


>tpg|DAA52614.1| TPA: hypothetical protein ZEAMMB73_283558 [Zea mays]
          Length = 706

 Score =  140 bits (352), Expect = 2e-31
 Identities = 64/167 (38%), Positives = 102/167 (61%)
 Frame = -3

Query: 506 KSGFGQWMSVLNGLVDFYGKSRCMVEAQRAFEEVCKPDTVSWNALIHSFALNECTASALS 327
           K G    +S+ N L++ Y + + + +A+  F+ + +P  VSWNA+I   A N     ALS
Sbjct: 490 KLGLSGQVSLSNSLINMYSRCKSLEDAKSVFQSIREPSVVSWNAIIFGMAFNGSYTEALS 549

Query: 326 TLEDMRLAGAKPFSFTLMTVLNICNQGGLVDVAFEYFDLLRNLYDVKPRLYHYALLVDLL 147
             EDM LAGA+P   T   VL+ C+ GGLVD+  ++F+ + N++DV P+  HY L +D+L
Sbjct: 550 VFEDMILAGAQPDGVTFTVVLSACSHGGLVDIGIKHFNSMANMFDVPPQKSHYTLFLDML 609

Query: 146 GRAGRLEEAVSLLKSIPFRPNALIYKRLLIACKLHKNMLLAEEMAVK 6
           GR+GR  E    ++++P +P+  IY+ LL+ CK H   ++ E +A K
Sbjct: 610 GRSGRFTEVAHTIEAMPVQPDLSIYRTLLVYCKFHNEPVVGEYIAKK 656

