BLASTX nr result

ID: Akebia27_contig00014880 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00014880
         (2243 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]   346   2e-92
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...   345   7e-92
ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi...   338   5e-90
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...   338   5e-90
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...   338   6e-90
gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]     336   2e-89
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...   332   6e-88
ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein...   329   4e-87
ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   323   2e-85
ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi...   319   3e-84
ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi...   319   4e-84
ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi...   319   4e-84
ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi...   319   4e-84
gb|AFK36371.1| unknown [Lotus japonicus]                              314   1e-82
gb|EYU32378.1| hypothetical protein MIMGU_mgv1a026042mg, partial...   310   2e-81
ref|XP_007156913.1| hypothetical protein PHAVU_002G027800g [Phas...   310   2e-81
ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containi...   303   2e-79
ref|XP_006384881.1| hypothetical protein POPTR_0004s21920g [Popu...   302   5e-79
ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containi...   301   8e-79
ref|XP_002514391.1| pentatricopeptide repeat-containing protein,...   297   1e-77

>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score =  346 bits (888), Expect = 2e-92
 Identities = 214/428 (50%), Positives = 260/428 (60%), Gaps = 25/428 (5%)
 Frame = +3

Query: 816  MPGKVSKLLCSNFCRR------------STIPL---LNHFSTIRNGPIRRG-ERRYDASE 947
            + GKVSK++ S+  +             S +PL   L  FS+I      RG  RR D + 
Sbjct: 5    LQGKVSKVVFSDCLKDLLHSSHSSPSNPSPLPLPLLLRRFSSIDASSSTRGASRREDLAN 64

Query: 948  DDVLRKLDSGYEENDQVDERKSEAHF-------DPPSPIPNRPLRGERRTN-PFKHEKYD 1103
            +  L       E +D    RKS +         +PP+PIPNRPLRGE+R N P  H    
Sbjct: 65   NSDL--FSPSTEPDDDTYGRKSSSSCGGGGSSSNPPNPIPNRPLRGEQRMNRPPPH---- 118

Query: 1104 DNDEKMKPSLFQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRK 1283
                        +P+ +                                  +  DE   +
Sbjct: 119  ------------IPQRKL--------------------------------GLPKDEGVDR 134

Query: 1284 QSYKNPTTFHIPNKEEKKDDQSVDSFLEKFKLGDEKKESPLKDTPIQISKP-DSPTPTDS 1460
             S  +P  F+ P+  EK      D FLE+FKLG +KKE P +    Q S+  D+    + 
Sbjct: 135  ASQASP--FNQPSPAEKVGATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQ 192

Query: 1461 LPQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV 1640
             PQ+ +EIF+KMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV
Sbjct: 193  PPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV 252

Query: 1641 EGFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHS 1820
            EGFC+A++LDDA RIFRKMQNNGISPNAFSYTVLI+G+ KG  L+ AVDFCVEMLEAGHS
Sbjct: 253  EGFCKARQLDDAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHS 312

Query: 1821 PNVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIF 2000
            PNVAT   L+  FC+EKG+ EA +VI  L++KG FV++KAVREYLDKKGP SPLVWEA F
Sbjct: 313  PNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFF 372

Query: 2001 GKKSSQRS 2024
            GKKS QRS
Sbjct: 373  GKKSPQRS 380


>ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Vitis vinifera]
          Length = 380

 Score =  345 bits (884), Expect = 7e-92
 Identities = 213/427 (49%), Positives = 260/427 (60%), Gaps = 24/427 (5%)
 Frame = +3

Query: 816  MPGKVSKLLCSNFCRR------------STIPL---LNHFSTIRNGPIRRG-ERRYDASE 947
            + GKVSK++ S+  +             S +PL   L  FS+I      RG  RR D + 
Sbjct: 5    LQGKVSKVVFSDCLKDLLHSSHSSPSNPSPLPLPLLLRRFSSIDASSSTRGASRREDLAN 64

Query: 948  DDVLRKLDSGYEENDQVDERKSEAHF------DPPSPIPNRPLRGERRTN-PFKHEKYDD 1106
            +  L       E +D    RKS +        +PP+PIPNRPLRGE+R N P  H     
Sbjct: 65   NSDL--FSPSTEPDDDTYGRKSSSSCGGGSSSNPPNPIPNRPLRGEQRMNRPPPH----- 117

Query: 1107 NDEKMKPSLFQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQ 1286
                       +P+ +                                  +  DE   + 
Sbjct: 118  -----------IPQRKL--------------------------------GLPKDEGVDRA 134

Query: 1287 SYKNPTTFHIPNKEEKKDDQSVDSFLEKFKLGDEKKESPLKDTPIQISKP-DSPTPTDSL 1463
            S  +P  F+ P+  EK      D FLE+FKLG +KKE P +    Q S+  D+    +  
Sbjct: 135  SQASP--FNQPSPAEKVGATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQP 192

Query: 1464 PQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 1643
            PQ+ +EIF+KMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE
Sbjct: 193  PQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 252

Query: 1644 GFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSP 1823
            GFC+A++L+DA RIFRKMQNNGISPNAFSYTVLI+G+ KG  L+ AVDFCVEMLEAGHSP
Sbjct: 253  GFCKARQLNDAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSP 312

Query: 1824 NVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFG 2003
            NVAT   L+  FC+EKG+ EA +VI  L++KG FV++KAVREYLDKKGP SPLVWEA FG
Sbjct: 313  NVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFG 372

Query: 2004 KKSSQRS 2024
            KKS QRS
Sbjct: 373  KKSPQRS 379


>ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X2 [Glycine max]
          Length = 395

 Score =  338 bits (868), Expect = 5e-90
 Identities = 203/413 (49%), Positives = 249/413 (60%), Gaps = 14/413 (3%)
 Frame = +3

Query: 825  KVSKLLCSNFCRRSTIPLLN---HFSTIRNGPIRRGERRYDASEDDVLRKLDSGYEENDQ 995
            ++ KL+    C++   PLL    HFS   +    R ++    S+D  L++ DS +++N +
Sbjct: 26   QIEKLVSFVHCKQYLPPLLETVRHFS-FTDDCSGRSKQPVGESDDFFLQQSDSSFKDNGE 84

Query: 996  VDERKSEAHFDPPSPIPNRPLRGERRTN--PFKHEKYDDNDEKMKPSLFQVPRTQFQNRP 1169
             D+  SE       PIP+RPLR  +  N  P + ++YD       P  +           
Sbjct: 85   SDQSLSE-------PIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYD---------- 127

Query: 1170 MXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFHIPN--KEEKKDD 1343
                                    N GG  + D+  +  S K    F   N  K  +   
Sbjct: 128  ------------------------NHGGPDELDQTNK--SSKIDLAFQNTNVAKTNRDAG 161

Query: 1344 QSVDSFLEKFKLGDEKKESPLKDT-------PIQISKPDSPTPTDSLPQDTEEIFKKMKE 1502
            QS DSFL KFKLG + K   L +          + S P+ P   +S+PQD +EIFKKMKE
Sbjct: 162  QSGDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPA-QESMPQDADEIFKKMKE 220

Query: 1503 TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQKLDDAKR 1682
            TGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ +A K DDAKR
Sbjct: 221  TGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKR 280

Query: 1683 IFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFASLVDCFC 1862
            IFRKMQ++G+SPNAFSY VLIQGL K   L DA +FCVEMLEAGHSPNV TF  LVD FC
Sbjct: 281  IFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFC 340

Query: 1863 REKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
             EKG+ EA S I  L +KGF VNEKAVR++LDKK PFSP VWEAIFGKK+ QR
Sbjct: 341  NEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGKKAPQR 393


>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Glycine max]
          Length = 388

 Score =  338 bits (868), Expect = 5e-90
 Identities = 203/413 (49%), Positives = 249/413 (60%), Gaps = 14/413 (3%)
 Frame = +3

Query: 825  KVSKLLCSNFCRRSTIPLLN---HFSTIRNGPIRRGERRYDASEDDVLRKLDSGYEENDQ 995
            ++ KL+    C++   PLL    HFS   +    R ++    S+D  L++ DS +++N +
Sbjct: 19   QIEKLVSFVHCKQYLPPLLETVRHFS-FTDDCSGRSKQPVGESDDFFLQQSDSSFKDNGE 77

Query: 996  VDERKSEAHFDPPSPIPNRPLRGERRTN--PFKHEKYDDNDEKMKPSLFQVPRTQFQNRP 1169
             D+  SE       PIP+RPLR  +  N  P + ++YD       P  +           
Sbjct: 78   SDQSLSE-------PIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYD---------- 120

Query: 1170 MXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFHIPN--KEEKKDD 1343
                                    N GG  + D+  +  S K    F   N  K  +   
Sbjct: 121  ------------------------NHGGPDELDQTNK--SSKIDLAFQNTNVAKTNRDAG 154

Query: 1344 QSVDSFLEKFKLGDEKKESPLKDT-------PIQISKPDSPTPTDSLPQDTEEIFKKMKE 1502
            QS DSFL KFKLG + K   L +          + S P+ P   +S+PQD +EIFKKMKE
Sbjct: 155  QSGDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPA-QESMPQDADEIFKKMKE 213

Query: 1503 TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQKLDDAKR 1682
            TGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ +A K DDAKR
Sbjct: 214  TGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKR 273

Query: 1683 IFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFASLVDCFC 1862
            IFRKMQ++G+SPNAFSY VLIQGL K   L DA +FCVEMLEAGHSPNV TF  LVD FC
Sbjct: 274  IFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFC 333

Query: 1863 REKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
             EKG+ EA S I  L +KGF VNEKAVR++LDKK PFSP VWEAIFGKK+ QR
Sbjct: 334  NEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGKKAPQR 386


>ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina]
            gi|557524309|gb|ESR35615.1| hypothetical protein
            CICLE_v10028759mg [Citrus clementina]
          Length = 344

 Score =  338 bits (867), Expect = 6e-90
 Identities = 189/360 (52%), Positives = 239/360 (66%), Gaps = 4/360 (1%)
 Frame = +3

Query: 954  VLRKLDSGYEENDQVDERKSEAHFDPPSPIPNRPLRGERRTNPFKHEKYDDNDEKMKPSL 1133
            +LR+  S  + N +  +  +  + +PP PIP+RPLRGER   PF ++  +          
Sbjct: 30   LLRRFCSIRDFNTKNCDNDNRNYENPPEPIPDRPLRGER---PFTNQNQN---------- 76

Query: 1134 FQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFH 1313
                R  FQ R                        FN   +    +  ++QS+++P    
Sbjct: 77   ----RRSFQPR------------------------FN---NYQQQQRPQQQSFQSPNR-- 103

Query: 1314 IPNKEEKKDDQSVDSFLEKFKLG-DEKKESPLKDTPI---QISKPDSPTPTDSLPQDTEE 1481
             P  +     QS ++FL++FKL  D+K ++P ++  +   Q  KP+   P    PQ+ +E
Sbjct: 104  -PRPKSPDGVQSDENFLDQFKLAIDKKPDNPQQNESLGERQEQKPNRNEPISEPPQEADE 162

Query: 1482 IFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQ 1661
            IFKKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVIYTAVV+GFC+AQ
Sbjct: 163  IFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQ 222

Query: 1662 KLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFA 1841
            K DDAKRIFRKMQ+NGI+PNAFSY +LIQGL K   LE+AV++C+EMLEAGHSPNV TF 
Sbjct: 223  KFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFV 282

Query: 1842 SLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
             LVD  CREKG+ +A SVI  L+EKGF VN+KAVRE+LDKK PFS  VWEAIFGKK+SQ+
Sbjct: 283  GLVDGLCREKGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGKKTSQK 342


>gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]
          Length = 306

 Score =  336 bits (862), Expect = 2e-89
 Identities = 174/267 (65%), Positives = 204/267 (76%), Gaps = 6/267 (2%)
 Frame = +3

Query: 1239 FNLGGSIDNDENRRKQSYKNPTTFHIPNKEEKKDDQSV---DSFLEKFKLGDEKKESPLK 1409
            F   G+ ++DE       +NP     PN+  +         DSFLEKFKLG +  +  ++
Sbjct: 39   FGSAGNGESDETTGPSFSQNPRERSRPNRPPRGRGPLTSEDDSFLEKFKLGLDSSKDGMQ 98

Query: 1410 DTPIQIS---KPDSPTPTDSLPQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKL 1580
            + P + +   KP  P P    P+D +EIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKL
Sbjct: 99   EKPRREAARPKPPLPQPPPP-PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKL 157

Query: 1581 FGLMREKGTIPEVVIYTAVVEGFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCK 1760
            FGLM+EKGTIPEVVIYTAVV+GFC+AQKLDDA RIFRKMQ+NGI PNAFSY+VL+QGLC 
Sbjct: 158  FGLMKEKGTIPEVVIYTAVVDGFCKAQKLDDAVRIFRKMQSNGIEPNAFSYSVLVQGLCG 217

Query: 1761 GRSLEDAVDFCVEMLEAGHSPNVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKA 1940
            G+ LED ++FCVEMLEAGHSPNVATF  LVD  C EKG+ EA  VIG+LR+KGF +NEKA
Sbjct: 218  GKRLEDGLEFCVEMLEAGHSPNVATFVGLVDGLCEEKGVEEAQGVIGKLRDKGFLLNEKA 277

Query: 1941 VREYLDKKGPFSPLVWEAIFGKKSSQR 2021
            VRE+LDKK  FSP VWEAIFGKK+SQR
Sbjct: 278  VREFLDKKASFSPSVWEAIFGKKASQR 304


>ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Citrus sinensis]
          Length = 387

 Score =  332 bits (850), Expect = 6e-88
 Identities = 187/360 (51%), Positives = 236/360 (65%), Gaps = 4/360 (1%)
 Frame = +3

Query: 954  VLRKLDSGYEENDQVDERKSEAHFDPPSPIPNRPLRGERRTNPFKHEKYDDNDEKMKPSL 1133
            +LR+  S  + N +  +  +    +PP PIP+RPLRGER   PF ++  +          
Sbjct: 73   LLRRFCSIRDFNTKNCDNDNRNDQNPPEPIPDRPLRGER---PFTNQNQN---------- 119

Query: 1134 FQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFH 1313
                R  FQ R                        FN   +    +  ++QS+++P    
Sbjct: 120  ----RRSFQPR------------------------FN---NYQQQQRPQQQSFQSPNG-- 146

Query: 1314 IPNKEEKKDDQSVDSFLEKFKLG-DEKKESPLKDTPI---QISKPDSPTPTDSLPQDTEE 1481
             P  +     QS ++FL++FKL  D+K  +P ++  +   Q  KP+   P    PQ+ +E
Sbjct: 147  -PRPKSPDGVQSDENFLDQFKLAIDKKPGNPQQNESLGQRQEQKPNRNEPISEPPQEADE 205

Query: 1482 IFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQ 1661
            IFKKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVIYTAVV+GFC+AQ
Sbjct: 206  IFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQ 265

Query: 1662 KLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFA 1841
            K DDAKRIFRKMQ+NGI+PNAFSY +LIQGL K   LE+AV++C+EMLEAGHSPNV TF 
Sbjct: 266  KFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFV 325

Query: 1842 SLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
             LVD  CRE+G+ +A SVI  L+EKGF VN+KAVRE+LDKK PFS  VWEAIFGKK+ Q+
Sbjct: 326  GLVDGLCRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGKKTLQK 385


>ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508707058|gb|EOX98954.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 345

 Score =  329 bits (843), Expect = 4e-87
 Identities = 181/352 (51%), Positives = 224/352 (63%), Gaps = 3/352 (0%)
 Frame = +3

Query: 978  YEENDQVDERKSEAHFDPPSPIPNRPLRGERRTNP-FKHEKYDDNDEKMKPSLFQVPRTQ 1154
            + +ND +    +     PP PIPNR L G+R  NP F+  K                   
Sbjct: 46   FRDNDPISFNSNGDGDKPPEPIPNRSLEGQRPFNPSFRETK------------------- 86

Query: 1155 FQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFHIPNKEEK 1334
                             ++N+N     +FN      +D NR+++                
Sbjct: 87   ---------------GATLNSNGSSFQSFNT--KFASDPNRKRE---------------- 113

Query: 1335 KDDQSVDSFLEKFKLGDEKK--ESPLKDTPIQISKPDSPTPTDSLPQDTEEIFKKMKETG 1508
             D QS ++FLEKFKLG + K  + P       + +        S PQD +EIFKKMKETG
Sbjct: 114  -DSQSDENFLEKFKLGLDNKRGKQPSDSEAAALLRRKEQEEKPSPPQDADEIFKKMKETG 172

Query: 1509 LIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQKLDDAKRIF 1688
            LIPNAVAMLDGLCKDGL+QEAMKLFG MREKGTIPEVVIYTAVV+GFC+A KLDDAKRIF
Sbjct: 173  LIPNAVAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIF 232

Query: 1689 RKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFASLVDCFCRE 1868
            RKMQ+ G++PN+FSY VLIQGL +   L+DA++FC+EMLEAGHSPNV TF  LVD  C+E
Sbjct: 233  RKMQSKGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTFVGLVDGLCKE 292

Query: 1869 KGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQRS 2024
            KG+ EA SVIG L++KGF +N+KAVR++LDKK PFSPLVWEAIFGKK SQ++
Sbjct: 293  KGVEEAQSVIGTLKQKGFVLNDKAVRQFLDKKAPFSPLVWEAIFGKKPSQKT 344


>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform 1 [Solanum lycopersicum]
            gi|460415472|ref|XP_004253082.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g38150-like isoform 2 [Solanum lycopersicum]
          Length = 340

 Score =  323 bits (829), Expect = 2e-85
 Identities = 184/357 (51%), Positives = 227/357 (63%), Gaps = 2/357 (0%)
 Frame = +3

Query: 957  LRKLDSGYEENDQVDERKSEAHFDPPSPIPNRPLRGERRTNPFKHEKYDDNDEKMKPSLF 1136
            LR   S  + +D  DE     +  PP PIPNRPLR + R  PF             PS  
Sbjct: 35   LRSFSSSNKFSDYSDESAESNYPPPPEPIPNRPLRADSR-RPFN------------PSQR 81

Query: 1137 QVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFHI 1316
            Q P                  + S N++  F                R+ S         
Sbjct: 82   QHPSN----------------RSSPNHSTTF----------------RRSS--------- 100

Query: 1317 PNKEEKKDDQSVDSFLEKFKLGDEKKESPLKDTPIQISK--PDSPTPTDSLPQDTEEIFK 1490
             N E +   Q  + FL++F+LG ++KE      P   S+  P S  P  + P+D +EIFK
Sbjct: 101  ENNESQMKSQDSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSEAPP-APPEDADEIFK 159

Query: 1491 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQKLD 1670
            KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV+GFC+AQK D
Sbjct: 160  KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFD 219

Query: 1671 DAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFASLV 1850
            DA RIFRKMQ NGI PNAFSY ++I+GL +G+ L+DA++FC+EMLEAGHSPNV TF +LV
Sbjct: 220  DAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVVTFVTLV 279

Query: 1851 DCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
            D FC+EK + +A ++I  +R+KGF V++KAVRE+LDKKGPF P+VWEAI GKK+SQR
Sbjct: 280  DGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLDKKGPFLPVVWEAILGKKASQR 336


>ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Solanum tuberosum]
          Length = 354

 Score =  319 bits (818), Expect = 3e-84
 Identities = 182/366 (49%), Positives = 229/366 (62%), Gaps = 8/366 (2%)
 Frame = +3

Query: 948  DDVL--RKLDSGYEENDQVDERKSEAHFDPP-SPIPNRPLRGERRTNPFKHEKYDDNDEK 1118
            D++L   K+ S    N    +  +++++ PP  PIPNRPLRG+ +               
Sbjct: 29   DEILPSTKIRSFSSSNSNYSDEFTQSNYPPPPDPIPNRPLRGDSKR-------------- 74

Query: 1119 MKPSLFQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKN 1298
                    P      RP+          DS NN    T +  L  S +N+  + K     
Sbjct: 75   --------PLRDDSRRPLRDDFRRPLRADSSNNP---THSTTLRRSGENNGGQMK----- 118

Query: 1299 PTTFHIPNKEEKKDDQSVDSFLEKFKLG-DEKKESPLKDTPIQISKPDSPTPTDSL---- 1463
                           Q  + FL++F+LG D K+E+P  +  +      S +P        
Sbjct: 119  --------------SQDSEDFLKRFQLGFDRKEENPNTNPALHPKGESSDSPVSEAPPAP 164

Query: 1464 PQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 1643
            P+D +EIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV+
Sbjct: 165  PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVD 224

Query: 1644 GFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSP 1823
            GF +AQK DDA RIFRKMQ NGI PNAFSY +LI+GL +G  L+DA +FC+EMLEAGHSP
Sbjct: 225  GFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGHSP 284

Query: 1824 NVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFG 2003
            NV TF +LVD FC+EK + +A ++I  +R+KGF V++KAVREYLDKKGPF P+VWEAI G
Sbjct: 285  NVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREYLDKKGPFLPVVWEAILG 344

Query: 2004 KKSSQR 2021
            KK+SQR
Sbjct: 345  KKASQR 350


>ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X3 [Glycine max] gi|571435834|ref|XP_006573590.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like isoform X4 [Glycine max]
          Length = 403

 Score =  319 bits (817), Expect = 4e-84
 Identities = 200/422 (47%), Positives = 249/422 (59%), Gaps = 19/422 (4%)
 Frame = +3

Query: 813  VMPGKVSKLLCSNFCRRSTIPLLN---HFSTI--RNGPIRRGERRYDASEDDVLRKLDSG 977
            V   ++ KL+   + ++   P L    HFS    R+G   R ++    S+D    + DS 
Sbjct: 21   VSSSRIEKLVSLLYSKQYLPPWLETVRHFSFTDDRSG---RSKQPVGESDDFFREQSDSS 77

Query: 978  YEENDQVDERKS-EAHFDPPSPIPNRPLRGERRTN--PFKHEKYDDNDEKMKPSLFQVPR 1148
            +++N     ++S         PIP+RPLRG++  N  P +  +YD       P       
Sbjct: 78   FKDNGSNRTQESYNVEQSLSEPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRF----- 132

Query: 1149 TQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDE-NRRKQ---SYKNPTTFHI 1316
                                         + N GG  + D+ N+  Q   +++  T    
Sbjct: 133  -----------------------------DDNHGGPDELDKINKSSQIDLAFQGTTNVAE 163

Query: 1317 PNKEEKKDDQSVDSFLEKFKLGDEKKESPLKDTPI-------QISKPDSPTPTDSLPQDT 1475
             N++  K   S  SFL+KFKLG + K   L +          + S P+ P   +S+PQD 
Sbjct: 164  TNRDVGK---SGGSFLDKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQ-ESMPQDA 219

Query: 1476 EEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCR 1655
             EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ +
Sbjct: 220  NEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTK 279

Query: 1656 AQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVAT 1835
            A K DDAKRIFRKMQ++GISPNAFSYTVLIQGL K   L DA +FCVEMLEAGHSPNV  
Sbjct: 280  AHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTA 339

Query: 1836 FASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSS 2015
            F  LVD FC EKG+ EA S I  L EKGF VNEKAV ++LDKK PFSP VWEAIFGKK+ 
Sbjct: 340  FVGLVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGKKAP 399

Query: 2016 QR 2021
            QR
Sbjct: 400  QR 401


>ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X2 [Glycine max]
          Length = 431

 Score =  319 bits (817), Expect = 4e-84
 Identities = 200/422 (47%), Positives = 249/422 (59%), Gaps = 19/422 (4%)
 Frame = +3

Query: 813  VMPGKVSKLLCSNFCRRSTIPLLN---HFSTI--RNGPIRRGERRYDASEDDVLRKLDSG 977
            V   ++ KL+   + ++   P L    HFS    R+G   R ++    S+D    + DS 
Sbjct: 49   VSSSRIEKLVSLLYSKQYLPPWLETVRHFSFTDDRSG---RSKQPVGESDDFFREQSDSS 105

Query: 978  YEENDQVDERKS-EAHFDPPSPIPNRPLRGERRTN--PFKHEKYDDNDEKMKPSLFQVPR 1148
            +++N     ++S         PIP+RPLRG++  N  P +  +YD       P       
Sbjct: 106  FKDNGSNRTQESYNVEQSLSEPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRF----- 160

Query: 1149 TQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDE-NRRKQ---SYKNPTTFHI 1316
                                         + N GG  + D+ N+  Q   +++  T    
Sbjct: 161  -----------------------------DDNHGGPDELDKINKSSQIDLAFQGTTNVAE 191

Query: 1317 PNKEEKKDDQSVDSFLEKFKLGDEKKESPLKDTPI-------QISKPDSPTPTDSLPQDT 1475
             N++  K   S  SFL+KFKLG + K   L +          + S P+ P   +S+PQD 
Sbjct: 192  TNRDVGK---SGGSFLDKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQ-ESMPQDA 247

Query: 1476 EEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCR 1655
             EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ +
Sbjct: 248  NEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTK 307

Query: 1656 AQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVAT 1835
            A K DDAKRIFRKMQ++GISPNAFSYTVLIQGL K   L DA +FCVEMLEAGHSPNV  
Sbjct: 308  AHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTA 367

Query: 1836 FASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSS 2015
            F  LVD FC EKG+ EA S I  L EKGF VNEKAV ++LDKK PFSP VWEAIFGKK+ 
Sbjct: 368  FVGLVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGKKAP 427

Query: 2016 QR 2021
            QR
Sbjct: 428  QR 429


>ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Glycine max]
          Length = 457

 Score =  319 bits (817), Expect = 4e-84
 Identities = 200/422 (47%), Positives = 249/422 (59%), Gaps = 19/422 (4%)
 Frame = +3

Query: 813  VMPGKVSKLLCSNFCRRSTIPLLN---HFSTI--RNGPIRRGERRYDASEDDVLRKLDSG 977
            V   ++ KL+   + ++   P L    HFS    R+G   R ++    S+D    + DS 
Sbjct: 75   VSSSRIEKLVSLLYSKQYLPPWLETVRHFSFTDDRSG---RSKQPVGESDDFFREQSDSS 131

Query: 978  YEENDQVDERKS-EAHFDPPSPIPNRPLRGERRTN--PFKHEKYDDNDEKMKPSLFQVPR 1148
            +++N     ++S         PIP+RPLRG++  N  P +  +YD       P       
Sbjct: 132  FKDNGSNRTQESYNVEQSLSEPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRF----- 186

Query: 1149 TQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDE-NRRKQ---SYKNPTTFHI 1316
                                         + N GG  + D+ N+  Q   +++  T    
Sbjct: 187  -----------------------------DDNHGGPDELDKINKSSQIDLAFQGTTNVAE 217

Query: 1317 PNKEEKKDDQSVDSFLEKFKLGDEKKESPLKDTPI-------QISKPDSPTPTDSLPQDT 1475
             N++  K   S  SFL+KFKLG + K   L +          + S P+ P   +S+PQD 
Sbjct: 218  TNRDVGK---SGGSFLDKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQ-ESMPQDA 273

Query: 1476 EEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCR 1655
             EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ +
Sbjct: 274  NEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTK 333

Query: 1656 AQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVAT 1835
            A K DDAKRIFRKMQ++GISPNAFSYTVLIQGL K   L DA +FCVEMLEAGHSPNV  
Sbjct: 334  AHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTA 393

Query: 1836 FASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSS 2015
            F  LVD FC EKG+ EA S I  L EKGF VNEKAV ++LDKK PFSP VWEAIFGKK+ 
Sbjct: 394  FVGLVDGFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGKKAP 453

Query: 2016 QR 2021
            QR
Sbjct: 454  QR 455


>gb|AFK36371.1| unknown [Lotus japonicus]
          Length = 372

 Score =  314 bits (805), Expect = 1e-82
 Identities = 187/373 (50%), Positives = 226/373 (60%), Gaps = 10/373 (2%)
 Frame = +3

Query: 930  RYDASEDDVLRKLDSGYEENDQVDERKSEAHFDPPS-----PIPNRPLRGERRTNPFKHE 1094
            R+ +  DD    +     E+D V   K + HF   S     PIPNR LRG +  NP   E
Sbjct: 34   RHFSFTDDCSGDIKQLMGESDDVSIHKLDPHFSDSSREGSEPIPNRALRGTQPVNPHSRE 93

Query: 1095 KYDDNDEKMKPSLFQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDEN 1274
             Y+      +P                                 F GN    G  D+ E 
Sbjct: 94   -YNRGSRSSRPR--------------------------------FDGN---RGRPDDVEM 117

Query: 1275 RRKQSYKNPTTFHIPNKEE--KKDDQSVDSFLEKFKLGDEKK---ESPLKDTPIQISKPD 1439
              K S +    F   N  +  K  ++  DSFL+KFKLG + K    S +  + +      
Sbjct: 118  TNKSS-QTDIGFQGRNMSDTNKVVNKLGDSFLDKFKLGFDNKAGNSSEVAASNLSEEAKS 176

Query: 1440 SPTPTDSLPQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEV 1619
            + +   ++P+D +EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+
Sbjct: 177  ANSNQPAMPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEI 236

Query: 1620 VIYTAVVEGFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVE 1799
            VIYTAVVEG+ +A K DDAKRIFRKMQ+NGISPNAFSYTVL+QGLCK   L+DA +FCVE
Sbjct: 237  VIYTAVVEGYTKAHKADDAKRIFRKMQSNGISPNAFSYTVLVQGLCKCSRLQDAFEFCVE 296

Query: 1800 MLEAGHSPNVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSP 1979
            MLEAGHSPN+ TF  LVD F +E+G+ EA   I  L EKGF VNEKAV+ +LD K PFSP
Sbjct: 297  MLEAGHSPNMTTFVDLVDGFVKEQGVAEAKGAIRTLIEKGFVVNEKAVKGFLDMKKPFSP 356

Query: 1980 LVWEAIFGKKSSQ 2018
             VWEAIFGKK+ Q
Sbjct: 357  SVWEAIFGKKAPQ 369


>gb|EYU32378.1| hypothetical protein MIMGU_mgv1a026042mg, partial [Mimulus guttatus]
          Length = 269

 Score =  310 bits (794), Expect = 2e-81
 Identities = 164/255 (64%), Positives = 192/255 (75%), Gaps = 4/255 (1%)
 Frame = +3

Query: 1272 NRRKQSYKNPTTFHIPNKEEKKDDQSVDSFLEKFKLGDEKKESPLK----DTPIQISKPD 1439
            +R  +  +NP +F    K E   D     FLEKFKLG ++K   L     +  IQ  K +
Sbjct: 23   SRGVRENENPNSF----KAETDAD-----FLEKFKLGFDRKSETLTTDSINKSIQPEKKE 73

Query: 1440 SPTPTDSLPQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEV 1619
            +  P  S P+D +EIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEV
Sbjct: 74   NVEPI-SPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEV 132

Query: 1620 VIYTAVVEGFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVE 1799
            V+YTAVV+GFC+A KL+DA RIF+KMQ+NGI PNAFSY VLI+GLC G  L+D   F +E
Sbjct: 133  VVYTAVVDGFCKAHKLEDAVRIFKKMQSNGIVPNAFSYQVLIRGLCSGNRLDDVYGFTIE 192

Query: 1800 MLEAGHSPNVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSP 1979
            MLEAGHSPN+ATF  LVD +CREK + EA +VI  +R KGFF  EKAVRE+LDKKGPF P
Sbjct: 193  MLEAGHSPNLATFTGLVDVYCREKDLEEAQNVIKAMRHKGFFFEEKAVREHLDKKGPFLP 252

Query: 1980 LVWEAIFGKKSSQRS 2024
            LVWEAI G K+S+RS
Sbjct: 253  LVWEAILGNKASKRS 267


>ref|XP_007156913.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris]
            gi|593787750|ref|XP_007156914.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
            gi|561030328|gb|ESW28907.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
            gi|561030329|gb|ESW28908.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
          Length = 451

 Score =  310 bits (794), Expect = 2e-81
 Identities = 197/411 (47%), Positives = 242/411 (58%), Gaps = 12/411 (2%)
 Frame = +3

Query: 825  KVSKLLCSNFCRRSTIPLLNHFSTIRNGPIRRGERRYDASEDDVLRKL-DSGYEENDQVD 1001
            K++ LL S       +  + HFS   +   R   ++Y    DD LR+  DS +E+N   +
Sbjct: 79   KLASLLRSKQHLPPWVETVRHFSFADDFSGR--SKQYAWERDDFLRQQSDSSFEDNGS-N 135

Query: 1002 ERKSEAHFDPPSP--IPNRPLRGERRTNPFKHEKYDDNDEKMKPSLFQVPRTQFQNRPMX 1175
                E + +  S   IP+RPLRG +  N            +  P   +  R  F      
Sbjct: 136  RTHEEYNVEQGSSESIPSRPLRGRKPIN------------QPPPRFRESGRGSFP----- 178

Query: 1176 XXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTTFHIPNKEEKKDD--QS 1349
                             F  N     ++D    R  +S K    F   N  +   D  QS
Sbjct: 179  ---------------PTFDDNHRGPDALD----RTNKSSKIDLAFQGMNVADTNRDFEQS 219

Query: 1350 VDSFLEKFKLGDEKKESPLKDTPI-------QISKPDSPTPTDSLPQDTEEIFKKMKETG 1508
             DSFL+KFKL  + K   L +          + S PD     + +PQD +EIFKKMKETG
Sbjct: 220  GDSFLDKFKLAFDDKTVNLSEVAASKQSEEAKRSNPDQQAQ-EPVPQDADEIFKKMKETG 278

Query: 1509 LIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQKLDDAKRIF 1688
            LIPNAVAMLDGLCKDGLVQEA+KLF LMREKGTIPE+VIYTAVVEG+ +A K DDAKRIF
Sbjct: 279  LIPNAVAMLDGLCKDGLVQEALKLFALMREKGTIPEIVIYTAVVEGYTKADKADDAKRIF 338

Query: 1689 RKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFASLVDCFCRE 1868
            RKMQ++GISPNAFSYTV++QGL K R L+DA +FCVEMLEAGHSPNV TF SLVD FC+E
Sbjct: 339  RKMQSSGISPNAFSYTVIVQGLYKCRRLQDAFEFCVEMLEAGHSPNVTTFVSLVDGFCKE 398

Query: 1869 KGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
            KG+ EA   +  L  KGF  +EKAVR++LDKK PFSP VWEAIFGKK+ QR
Sbjct: 399  KGVEEAKDAVKTLTGKGFAFDEKAVRQFLDKKTPFSPSVWEAIFGKKAPQR 449


>ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Cicer arietinum]
            gi|502161087|ref|XP_004512019.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g38150-like isoform X2 [Cicer arietinum]
          Length = 371

 Score =  303 bits (776), Expect = 2e-79
 Identities = 172/324 (53%), Positives = 213/324 (65%), Gaps = 12/324 (3%)
 Frame = +3

Query: 1086 KHEKYDDNDEKMKPSLFQVPRTQFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDN 1265
            ++  ++DN        F + ++  ++ P          +  IN   L +  ++ GG    
Sbjct: 51   EYPSFNDNGSNRSDKAFDIQQSSSESGPSRSFRG----QKQINQTPLNSQEYSRGGRSVR 106

Query: 1266 ---DENRRKQSYKNPTTFHIPNKEEKKDD--QSVDSFLEKFKLGDEKK-------ESPLK 1409
               D+ R  +S +    F   N  E   D  Q  DSFL+KFKLG + K       ES  +
Sbjct: 107  PRFDDRRGSKSSQIDLGFQGRNVAEVSRDAGQLGDSFLDKFKLGFDDKVGNHSEVESNGQ 166

Query: 1410 DTPIQISKPDSPTPTDSLPQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGL 1589
                + S  D P   + +PQD +EIFKKMKETGLIPNAVAMLDGLCKDG VQEA+KLFGL
Sbjct: 167  TEGSRASDTDQPAQ-EPMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGNVQEALKLFGL 225

Query: 1590 MREKGTIPEVVIYTAVVEGFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRS 1769
            MREKGTIPE+VIYTAVVEG+ +A K DDA RIFRKMQ+NGISPNA+S+TVLIQGL K   
Sbjct: 226  MREKGTIPEIVIYTAVVEGYTKAHKADDAIRIFRKMQSNGISPNAYSFTVLIQGLYKCSR 285

Query: 1770 LEDAVDFCVEMLEAGHSPNVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVRE 1949
            L+DA++FCVEMLEAG+S NV TF  +VD FC+E G+ EA  VI  L EKGF  +EKAVRE
Sbjct: 286  LQDALEFCVEMLEAGYSLNVTTFVGVVDGFCKEDGVEEAKGVIKTLTEKGFAYDEKAVRE 345

Query: 1950 YLDKKGPFSPLVWEAIFGKKSSQR 2021
            +LDKK PFSP +WEA+FGKK SQR
Sbjct: 346  FLDKKAPFSPSIWEAVFGKKVSQR 369


>ref|XP_006384881.1| hypothetical protein POPTR_0004s21920g [Populus trichocarpa]
            gi|550341649|gb|ERP62678.1| hypothetical protein
            POPTR_0004s21920g [Populus trichocarpa]
          Length = 380

 Score =  302 bits (773), Expect = 5e-79
 Identities = 176/366 (48%), Positives = 215/366 (58%), Gaps = 16/366 (4%)
 Frame = +3

Query: 972  SGYEENDQVDERKSEAHFDPPSPIPNRPLRGERRTNPFKHEKYDDNDEKMKPSLFQVPRT 1151
            +G+  +D+ + R    +  PP PIPNRPLRG                             
Sbjct: 73   AGFNFDDEKERRLQNQN--PPEPIPNRPLRG----------------------------- 101

Query: 1152 QFQNRPMXXXXXXXXXKDSINNNDLFTGNFNLGGSIDNDENRRKQSYKNPTT---FHIPN 1322
                            K + NNN               +   R Q   +P+T   F++  
Sbjct: 102  ---------------PKPNFNNN--------------TNRPARPQPSHHPSTTSPFNLQP 132

Query: 1323 KEEKKDDQSV--DSFLEKFKL-----GDEKKESPLKDTPIQISKPD------SPTPTDSL 1463
            + +  D   +  D+FL+KFKL      +  K++   DT    + P       S   T   
Sbjct: 133  QTQTHDFNRISDDAFLDKFKLHPDHNNNVNKDAAAADTKAAAAPPPPKNEQASSASTSEP 192

Query: 1464 PQDTEEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 1643
             QD E+IF KMKETGLIPNAVAMLDGLCKDGLVQEA+KLFG MREKGTIPEVVIYTAVV+
Sbjct: 193  SQDAEQIFNKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGTMREKGTIPEVVIYTAVVD 252

Query: 1644 GFCRAQKLDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSP 1823
            GFC+A KLDDAKRIFRKMQ+NGI+PNAFSY VLIQGL K    +DA+DFC EMLE GHSP
Sbjct: 253  GFCKAHKLDDAKRIFRKMQSNGITPNAFSYAVLIQGLSKCNLFDDAIDFCFEMLELGHSP 312

Query: 1824 NVATFASLVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFG 2003
            NV TF  L+D  CREKG+ EA +VIG LR+KGF V++KAVR++LDK  P S  VW+AIFG
Sbjct: 313  NVTTFVGLIDGLCREKGVEEARTVIGTLRQKGFHVHDKAVRDFLDKNKPLSSSVWDAIFG 372

Query: 2004 KKSSQR 2021
            KK S +
Sbjct: 373  KKPSHK 378


>ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Fragaria vesca subsp. vesca]
          Length = 309

 Score =  301 bits (771), Expect = 8e-79
 Identities = 149/239 (62%), Positives = 187/239 (78%), Gaps = 4/239 (1%)
 Frame = +3

Query: 1317 PNKEEKKDDQSV----DSFLEKFKLGDEKKESPLKDTPIQISKPDSPTPTDSLPQDTEEI 1484
            PN E ++++ +      SFLEK K+G EK +   ++ P + ++P  P P  +  ++  EI
Sbjct: 74   PNLERRRENPNPPLQDSSFLEKLKMGLEKSK---REKPQEAAEPPPPQPQPT--EEANEI 128

Query: 1485 FKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQK 1664
            FKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFG MREKGTIPEVVIYTAVVEGFC+ +K
Sbjct: 129  FKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGSMREKGTIPEVVIYTAVVEGFCKGRK 188

Query: 1665 LDDAKRIFRKMQNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFAS 1844
             +DAKR+FRKMQ+NGI PNAFSY V++QGLC+   ++DA +FC EMLEAGHSPNV TF  
Sbjct: 189  PEDAKRVFRKMQSNGIVPNAFSYNVMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVG 248

Query: 1845 LVDCFCREKGMGEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQR 2021
            LVD  C+E G+    SVIG+L+++G+ VNEKAVRE+LDK+  FSP+VWEAIFGK  S++
Sbjct: 249  LVDGVCKENGVEGGESVIGKLKQRGYVVNEKAVREFLDKRASFSPMVWEAIFGKNHSKK 307


>ref|XP_002514391.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223546488|gb|EEF47987.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 313

 Score =  297 bits (761), Expect = 1e-77
 Identities = 163/287 (56%), Positives = 196/287 (68%), Gaps = 25/287 (8%)
 Frame = +3

Query: 1233 GNFNLGGSIDNDENRRK--------QSYKNPTTFH--------IPNKEEKKDDQSVDSFL 1364
            G+F+L G  D+  N           +  +  T+F+        IP +   ++  S D FL
Sbjct: 30   GSFSLNGERDDASNVDNSPPHPIPNRPLRGQTSFNQSQSQSPRIPRRNTNQNHLSSDDFL 89

Query: 1365 EKFKLG--DEKKESP-------LKDTPIQISKPDSPTPTDSLPQDTEEIFKKMKETGLIP 1517
            EKFKL   + K E P        KD  I  S P  P P      D  +IF KMKETGLIP
Sbjct: 90   EKFKLNKRNHKDEIPHQINNHTSKDENINKSSPPPPPP------DANDIFNKMKETGLIP 143

Query: 1518 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCRAQKLDDAKRIFRKM 1697
            NAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVV+YTAVV+GFC+A K DDAKRIF+KM
Sbjct: 144  NAVAMLDGLCKDGLVQEAMKLFGLMRQKGTIPEVVVYTAVVDGFCKAHKTDDAKRIFKKM 203

Query: 1698 QNNGISPNAFSYTVLIQGLCKGRSLEDAVDFCVEMLEAGHSPNVATFASLVDCFCREKGM 1877
             +NGI+PNAFSYTV IQGLCK  +++DAVDFC +ML+AGHSPNV TF  LVD  CREKG+
Sbjct: 204  IDNGITPNAFSYTVTIQGLCKCNAVDDAVDFCFQMLDAGHSPNVTTFVGLVDGLCREKGV 263

Query: 1878 GEANSVIGRLREKGFFVNEKAVREYLDKKGPFSPLVWEAIFGKKSSQ 2018
             EA +VI  LR+KGF++N KA+RE+LDK  P S  + +AIFGKK SQ
Sbjct: 264  DEAQNVIEDLRKKGFYINGKAIREFLDKNAPLSSDLSQAIFGKKPSQ 310


Top