BLASTX nr result

ID: Ziziphus21_contig00006570 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00006570
         (1405 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008235841.1| PREDICTED: pentatricopeptide repeat-containi...   375   e-101
ref|XP_008382522.1| PREDICTED: pentatricopeptide repeat-containi...   374   e-101
ref|XP_010090734.1| hypothetical protein L484_013756 [Morus nota...   367   2e-98
ref|XP_009358909.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   357   1e-95
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...   346   3e-92
ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containi...   343   3e-91
gb|KDO68436.1| hypothetical protein CISIN_1g047934mg [Citrus sin...   342   5e-91
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...   342   5e-91
ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein...   341   1e-90
ref|XP_011458113.1| PREDICTED: pentatricopeptide repeat-containi...   341   1e-90
ref|XP_008356289.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   339   4e-90
ref|XP_012076504.1| PREDICTED: pentatricopeptide repeat-containi...   328   7e-87
gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus g...   326   3e-86
ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containi...   325   6e-86
ref|XP_006384881.1| hypothetical protein POPTR_0004s21920g [Popu...   321   1e-84
ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...   318   9e-84
ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   317   2e-83
ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi...   311   7e-82
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...   311   7e-82
ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar...   311   9e-82

>ref|XP_008235841.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Prunus mume]
          Length = 317

 Score =  375 bits (963), Expect = e-101
 Identities = 208/337 (61%), Positives = 247/337 (73%), Gaps = 3/337 (0%)
 Frame = -1

Query: 1270 MPSFQTRISKLIFNGISNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQ 1091
            MPSF  R+SK I + ISNPL +SS+  I T +         +LRR SS  D   GEM+  
Sbjct: 1    MPSFHGRVSKHILSIISNPLRHSSEPPISTRLSI-------KLRRSSSTAD-RSGEMEAP 52

Query: 1090 NSPQQPLEPVPHRPLRGGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDA---NSFLEKFK 920
               QQP EP+P+RPLRG +P  +  T   QF  KNS   EK R++PS     +SFLEK K
Sbjct: 53   E--QQPPEPIPNRPLRGQRP-SNPPTSPLQFLNKNSPISEKRRENPSPPLQDSSFLEKLK 109

Query: 919  LGADKFSDNKRENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDG 740
            LG DK   +KRE P                E+ADEIFKKMK+TGLIPNAVAMLDGLCKDG
Sbjct: 110  LGLDK---SKREKPQEVDEPPQPP------EEADEIFKKMKETGLIPNAVAMLDGLCKDG 160

Query: 739  LVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYS 560
            LVQ+AMKLFG MREKGTIPEVVIYTAVV+GFCKAQKL++AKRIFRKM++NGI PNAFSY+
Sbjct: 161  LVQDAMKLFGSMREKGTIPEVVIYTAVVDGFCKAQKLEDAKRIFRKMQSNGIIPNAFSYT 220

Query: 559  VLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQK 380
            VLIQGLY    LEDAVEFC EMLE GHSPNV+TFVGL+D +C+E  +EEA++V+GKL QK
Sbjct: 221  VLIQGLYRSNKLEDAVEFCAEMLEAGHSPNVATFVGLVDTICKENDLEEAESVVGKLKQK 280

Query: 379  GFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKTSQKLF 269
            G+++N+KAV++FL+KK   SP VWEAIFGKK SQK F
Sbjct: 281  GYLVNEKAVREFLDKKAPFSPTVWEAIFGKKKSQKFF 317


>ref|XP_008382522.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Malus domestica]
          Length = 313

 Score =  374 bits (961), Expect = e-101
 Identities = 209/338 (61%), Positives = 249/338 (73%), Gaps = 4/338 (1%)
 Frame = -1

Query: 1270 MPSFQTRISKLIFNGISNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQ 1091
            MPSFQ ++SK I + ISNPL  S+             +L  +LRRFS+  D  GGEM+  
Sbjct: 1    MPSFQGQVSKHILSTISNPLRQST-------------SLSTKLRRFSTTAD-RGGEMEAP 46

Query: 1090 NSPQQPLEPVPHRPLRGGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDA----NSFLEKF 923
             S QQP EP+P+RPLRG +P  + +T   QF  KNS   EK R+ PS      +SFLEK 
Sbjct: 47   -SEQQPPEPIPNRPLRGQRP-SNPQTSNLQFLNKNSPISEKRRERPSSPPLQDSSFLEKL 104

Query: 922  KLGADKFSDNKRENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKD 743
            K+G DK   +KRE P                E+AD+IFKKMK+TGLIPNAVAMLDGLCKD
Sbjct: 105  KMGLDK---SKREEPQEVPEPPQPP------EEADQIFKKMKETGLIPNAVAMLDGLCKD 155

Query: 742  GLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSY 563
            GLVQEAMKLFG MREKGTIPEVVIYTAVV+GFCKAQKL++AKRIFRKM++NGI PNAFSY
Sbjct: 156  GLVQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAQKLEDAKRIFRKMQSNGIVPNAFSY 215

Query: 562  SVLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQ 383
            +VLIQGLY    LEDAVEFC EMLE GHSPNV+TFVGLID +C+EK +EEA++VIGKL Q
Sbjct: 216  TVLIQGLYRANMLEDAVEFCSEMLEAGHSPNVTTFVGLIDMVCKEKDMEEAESVIGKLKQ 275

Query: 382  KGFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKTSQKLF 269
            KG+++N+KAVK+FL+KK   SP VWEAIFGK  SQ++F
Sbjct: 276  KGYLVNEKAVKEFLDKKAPFSPRVWEAIFGKNKSQRVF 313


>ref|XP_010090734.1| hypothetical protein L484_013756 [Morus notabilis]
            gi|587850267|gb|EXB40453.1| hypothetical protein
            L484_013756 [Morus notabilis]
          Length = 306

 Score =  367 bits (941), Expect = 2e-98
 Identities = 205/337 (60%), Positives = 232/337 (68%), Gaps = 3/337 (0%)
 Frame = -1

Query: 1270 MPSFQTRISKLIFNGISNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQ 1091
            MP F   +SKLIF G  N LS S QSSI         NL K+LR F SA +GE  E    
Sbjct: 1    MPQFGGNLSKLIFTGTRNSLSRSYQSSIRN-------NLPKKLRFFGSAGNGESDETTGP 53

Query: 1090 NSPQQPLEPV-PHRPLRGGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDANSFLEKFKLG 914
            +  Q P E   P+RP RG  P                          S+ +SFLEKFKLG
Sbjct: 54   SFSQNPRERSRPNRPPRGRGPLT------------------------SEDDSFLEKFKLG 89

Query: 913  ADKFSDNKRENP--DXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDG 740
             D   D  +E P  +              PEDADEIFKKMK+TGLIPNAVAMLDGLCKDG
Sbjct: 90   LDSSKDGMQEKPRREAARPKPPLPQPPPPPEDADEIFKKMKETGLIPNAVAMLDGLCKDG 149

Query: 739  LVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYS 560
            LVQEAMKLFGLM+EKGTIPEVVIYTAVV+GFCKAQKLD+A RIFRKM++NGI PNAFSYS
Sbjct: 150  LVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKLDDAVRIFRKMQSNGIEPNAFSYS 209

Query: 559  VLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQK 380
            VL+QGL   K LED +EFC+EMLE GHSPNV+TFVGL+DGLC EKGVEEAQ VIGKL  K
Sbjct: 210  VLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATFVGLVDGLCEEKGVEEAQGVIGKLRDK 269

Query: 379  GFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKTSQKLF 269
            GF+LN+KAV++FL+KK + SP VWEAIFGKK SQ+LF
Sbjct: 270  GFLLNEKAVREFLDKKASFSPSVWEAIFGKKASQRLF 306


>ref|XP_009358909.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g38150 [Pyrus x bretschneideri]
          Length = 309

 Score =  357 bits (916), Expect = 1e-95
 Identities = 200/338 (59%), Positives = 241/338 (71%), Gaps = 4/338 (1%)
 Frame = -1

Query: 1270 MPSFQTRISKLIFNGISNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQ 1091
            MPSFQ R SK IF+ ISNP   S++                 LRRFS + D  GG+M+  
Sbjct: 1    MPSFQGRASKHIFSAISNPFRQSTK-----------------LRRFSPSAD-RGGKMEAA 42

Query: 1090 NSPQQPLEPVPHRPLRGGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDA----NSFLEKF 923
             + QQP +P+P+RPLRG +   + +T   QF  KNS    +  + PS      +SFLEK 
Sbjct: 43   PA-QQPPDPIPNRPLRG-QSLSNPQTSNLQFLNKNSPISARRGESPSSPPLQDSSFLEKL 100

Query: 922  KLGADKFSDNKRENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKD 743
            KLG DK   +KRE P                E+AD+I KKMK+TGLIPNAVAMLDGLCKD
Sbjct: 101  KLGLDK---SKREEPQEVPEPAEXA------EEADQIIKKMKETGLIPNAVAMLDGLCKD 151

Query: 742  GLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSY 563
            GLV+EAMKLFG MREKGTIPEVVIYTAVV+GFCKAQK ++ KRIFRKM++NGI PNAFSY
Sbjct: 152  GLVREAMKLFGSMREKGTIPEVVIYTAVVDGFCKAQKFEDTKRIFRKMQSNGIVPNAFSY 211

Query: 562  SVLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQ 383
            +VLIQGLY   NLEDA EFC EMLE GHSPNV+TFVGLID +C+EK +EEA++VIGKL Q
Sbjct: 212  TVLIQGLYRANNLEDAAEFCSEMLEAGHSPNVATFVGLIDVICKEKDMEEAESVIGKLKQ 271

Query: 382  KGFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKTSQKLF 269
            KG+++N+KAVK+FL+KK   SP VWEAIFGK  SQ++F
Sbjct: 272  KGYLVNEKAVKEFLDKKAPFSPRVWEAIFGKNKSQRMF 309


>ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina]
            gi|557524309|gb|ESR35615.1| hypothetical protein
            CICLE_v10028759mg [Citrus clementina]
          Length = 344

 Score =  346 bits (887), Expect = 3e-92
 Identities = 186/314 (59%), Positives = 223/314 (71%), Gaps = 22/314 (7%)
 Frame = -1

Query: 1144 LRRFSSANDGEGGEMKNQN-SPQQPLEPVPHRPLRGGKPF----DHHRTRTPQF------ 998
            LRRF S  D       N N + + P EP+P RPLRG +PF     + R+  P+F      
Sbjct: 31   LRRFCSIRDFNTKNCDNDNRNYENPPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQQ 90

Query: 997  --PEKNS-----AAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXXXXXXXXXX 839
              P++ S       R K  D      +FL++FKL  DK  DN ++N              
Sbjct: 91   QRPQQQSFQSPNRPRPKSPDGVQSDENFLDQFKLAIDKKPDNPQQNESLGERQEQKPNRN 150

Query: 838  XXP----EDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 671
                   ++ADEIFKKMK+TGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVI
Sbjct: 151  EPISEPPQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVI 210

Query: 670  YTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEML 491
            YTAVV+GFCKAQK D+AKRIFRKM++NGI PNAFSY++LIQGLY C  LE+AVE+C+EML
Sbjct: 211  YTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEML 270

Query: 490  EDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLV 311
            E GHSPNV+TFVGL+DGLCREKGVE+AQ+VI  L +KGF++NDKAV++FL+KK   S  V
Sbjct: 271  EAGHSPNVTTFVGLVDGLCREKGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSV 330

Query: 310  WEAIFGKKTSQKLF 269
            WEAIFGKKTSQK F
Sbjct: 331  WEAIFGKKTSQKPF 344


>ref|XP_012462310.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            isoform X1 [Gossypium raimondii]
            gi|763812905|gb|KJB79757.1| hypothetical protein
            B456_013G065500 [Gossypium raimondii]
          Length = 341

 Score =  343 bits (879), Expect = 3e-91
 Identities = 188/324 (58%), Positives = 225/324 (69%), Gaps = 18/324 (5%)
 Frame = -1

Query: 1186 HTTIKAPSFNLLKELRRFSSANDG--EGGEMKNQNSPQQPLEPVPHRPLRGGKPFD--HH 1019
            H   +A   + + + R FS       E   +++        E +P RPLRG +PF+    
Sbjct: 23   HPVSRAAPSSCVLQTRFFSDIKRPITENESIRSNEDDDGATEHIPKRPLRGRRPFNPSFR 82

Query: 1018 RTRTPQFPEKNSAAR-----------EKIRDDPSDANSFLEKFKLGADKFSDNKRE---N 881
             T    F    S+ +           +K  D  SD N FLEKFKLG +    NKRE   +
Sbjct: 83   ETEGASFDRNRSSFQSPNAKFASDPTKKREDSQSDVN-FLEKFKLGLE----NKRERVPS 137

Query: 880  PDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMR 701
                            PEDADEIFKKMK+TGLIPNAVAMLDGLCKDGL+QEAMKLFGLMR
Sbjct: 138  ESEAMHRKEHEEKLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMR 197

Query: 700  EKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLE 521
            EKGTIPEVVIYTAVV+GFCKA KL++AKRIFRKM++ G+ PNAFSY+VLIQGLY CK+L+
Sbjct: 198  EKGTIPEVVIYTAVVDGFCKAHKLEDAKRIFRKMQSKGVIPNAFSYTVLIQGLYKCKHLD 257

Query: 520  DAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFL 341
            DA+EFC+EM+E GHSPNV+TFVGL+DGLC+EKGVEEA NVIG L QKGF++NDKAV+ FL
Sbjct: 258  DAIEFCLEMVEAGHSPNVTTFVGLVDGLCKEKGVEEAVNVIGTLKQKGFLVNDKAVRQFL 317

Query: 340  NKKVAVSPLVWEAIFGKKTSQKLF 269
            +K+   SPLVWEAIFGKKTSQK F
Sbjct: 318  DKRAPFSPLVWEAIFGKKTSQKAF 341


>gb|KDO68436.1| hypothetical protein CISIN_1g047934mg [Citrus sinensis]
          Length = 344

 Score =  342 bits (877), Expect = 5e-91
 Identities = 184/314 (58%), Positives = 221/314 (70%), Gaps = 22/314 (7%)
 Frame = -1

Query: 1144 LRRFSSANDGEGGEMKNQN-SPQQPLEPVPHRPLRGGKPF----DHHRTRTPQF------ 998
            LRRF S  D       N N + Q P EP+P RPLRG +PF     + R+  P+F      
Sbjct: 31   LRRFCSIRDFNTKNCDNDNRNDQNPPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQQ 90

Query: 997  --PEKNS-----AAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXXXXXXXXXX 839
              P++ S       R K  D      +FL++FKL  DK   N ++N              
Sbjct: 91   QRPQQQSFQSPNGPRPKSPDGVQSDENFLDQFKLAIDKKPGNPQQNESLGQRQEQKPNRN 150

Query: 838  XXP----EDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 671
                   ++ADEIFKKMK+TGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVI
Sbjct: 151  EPISEPPQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVI 210

Query: 670  YTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEML 491
            YTAVV+GFCKAQK D+AKRIFRKM++NGI PNAFSY++LIQGLY C  LE+AVE+C+EML
Sbjct: 211  YTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEML 270

Query: 490  EDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLV 311
            E GHSPNV+TFVGL+DGLCRE+GVE+AQ+VI  L +KGF++NDKAV++FL+KK   S  V
Sbjct: 271  EAGHSPNVTTFVGLVDGLCRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSV 330

Query: 310  WEAIFGKKTSQKLF 269
            WEAIFGKKT QK F
Sbjct: 331  WEAIFGKKTLQKPF 344


>ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Citrus sinensis]
          Length = 387

 Score =  342 bits (877), Expect = 5e-91
 Identities = 184/314 (58%), Positives = 221/314 (70%), Gaps = 22/314 (7%)
 Frame = -1

Query: 1144 LRRFSSANDGEGGEMKNQN-SPQQPLEPVPHRPLRGGKPF----DHHRTRTPQF------ 998
            LRRF S  D       N N + Q P EP+P RPLRG +PF     + R+  P+F      
Sbjct: 74   LRRFCSIRDFNTKNCDNDNRNDQNPPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQQ 133

Query: 997  --PEKNS-----AAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXXXXXXXXXX 839
              P++ S       R K  D      +FL++FKL  DK   N ++N              
Sbjct: 134  QRPQQQSFQSPNGPRPKSPDGVQSDENFLDQFKLAIDKKPGNPQQNESLGQRQEQKPNRN 193

Query: 838  XXP----EDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 671
                   ++ADEIFKKMK+TGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVI
Sbjct: 194  EPISEPPQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVI 253

Query: 670  YTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEML 491
            YTAVV+GFCKAQK D+AKRIFRKM++NGI PNAFSY++LIQGLY C  LE+AVE+C+EML
Sbjct: 254  YTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEML 313

Query: 490  EDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLV 311
            E GHSPNV+TFVGL+DGLCRE+GVE+AQ+VI  L +KGF++NDKAV++FL+KK   S  V
Sbjct: 314  EAGHSPNVTTFVGLVDGLCRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSV 373

Query: 310  WEAIFGKKTSQKLF 269
            WEAIFGKKT QK F
Sbjct: 374  WEAIFGKKTLQKPF 387


>ref|XP_007043123.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508707058|gb|EOX98954.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 345

 Score =  341 bits (874), Expect = 1e-90
 Identities = 183/318 (57%), Positives = 218/318 (68%), Gaps = 18/318 (5%)
 Frame = -1

Query: 1168 PSFNLLKELRRFSSAN----DGEGGEMKNQNSPQQPLEPVPHRPLRGGKPFDHHRTRTP- 1004
            PS +LL + R FS       D +     +     +P EP+P+R L G +PF+     T  
Sbjct: 29   PSLSLL-QTRLFSDMRGPFRDNDPISFNSNGDGDKPPEPIPNRSLEGQRPFNPSFRETKG 87

Query: 1003 -----------QFPEKNSAAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXXXX 857
                        F  K ++   + R+D     +FLEKFKLG D     +  + +      
Sbjct: 88   ATLNSNGSSFQSFNTKFASDPNRKREDSQSDENFLEKFKLGLDNKRGKQPSDSEAAALLR 147

Query: 856  XXXXXXXXP--EDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP 683
                       +DADEIFKKMK+TGLIPNAVAMLDGLCKDGL+QEAMKLFG MREKGTIP
Sbjct: 148  RKEQEEKPSPPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGSMREKGTIP 207

Query: 682  EVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFC 503
            EVVIYTAVV+GFCKA KLD+AKRIFRKM++ G+ PN+FSY VLIQGLY C  L+DA+EFC
Sbjct: 208  EVVIYTAVVDGFCKAHKLDDAKRIFRKMQSKGVTPNSFSYIVLIQGLYRCNKLDDAIEFC 267

Query: 502  MEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAV 323
            +EMLE GHSPNV+TFVGL+DGLC+EKGVEEAQ+VIG L QKGFVLNDKAV+ FL+KK   
Sbjct: 268  LEMLEAGHSPNVTTFVGLVDGLCKEKGVEEAQSVIGTLKQKGFVLNDKAVRQFLDKKAPF 327

Query: 322  SPLVWEAIFGKKTSQKLF 269
            SPLVWEAIFGKK SQK F
Sbjct: 328  SPLVWEAIFGKKPSQKTF 345


>ref|XP_011458113.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Fragaria vesca subsp. vesca]
          Length = 309

 Score =  341 bits (874), Expect = 1e-90
 Identities = 187/339 (55%), Positives = 236/339 (69%), Gaps = 5/339 (1%)
 Frame = -1

Query: 1270 MPSFQTRISKLIFNGISNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQ 1091
            MPSF  R+ K IF+ +SN L +S                  +LRRFSS  D  G EM  +
Sbjct: 1    MPSFHARVPKHIFSTVSNSLGHS------------------KLRRFSSGTD-RGREM--E 39

Query: 1090 NSPQQPLEPVPHRPLRGGK-----PFDHHRTRTPQFPEKNSAAREKIRDDPSDANSFLEK 926
               +QP EP+P+RPLRG +     P    R  +P     N   R +  + P   +SFLEK
Sbjct: 40   APAKQPPEPIPNRPLRGQRASNPQPNLERRRESPP----NLERRRENPNPPLQDSSFLEK 95

Query: 925  FKLGADKFSDNKRENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCK 746
             K+G +K   +KRE P                E+A+EIFKKMK+TGLIPNAVAMLDGLCK
Sbjct: 96   LKMGLEK---SKREKPQEAAEPPPPQPQPT--EEANEIFKKMKETGLIPNAVAMLDGLCK 150

Query: 745  DGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFS 566
            DGLVQEAMKLFG MREKGTIPEVVIYTAVVEGFCK +K ++AKR+FRKM++NGI PNAFS
Sbjct: 151  DGLVQEAMKLFGSMREKGTIPEVVIYTAVVEGFCKGRKPEDAKRVFRKMQSNGIVPNAFS 210

Query: 565  YSVLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLM 386
            Y+V++QGL  C+ ++DA EFC EMLE GHSPNV+TFVGL+DG+C+E GVE  ++VIGKL 
Sbjct: 211  YNVMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVGLVDGVCKENGVEGGESVIGKLK 270

Query: 385  QKGFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKTSQKLF 269
            Q+G+V+N+KAV++FL+K+ + SP+VWEAIFGK  S+KLF
Sbjct: 271  QRGYVVNEKAVREFLDKRASFSPMVWEAIFGKNHSKKLF 309


>ref|XP_008356289.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At4g38150-like [Malus domestica]
          Length = 298

 Score =  339 bits (869), Expect = 4e-90
 Identities = 189/322 (58%), Positives = 230/322 (71%), Gaps = 4/322 (1%)
 Frame = -1

Query: 1270 MPSFQTRISKLIFNGISNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQ 1091
            MPSFQ R+SK IF+ +SNP   S+             +L  +L RFS+  D  GGEM+  
Sbjct: 1    MPSFQDRVSKXIFSAVSNPFRQST-------------SLSTKLHRFSTTTD-RGGEMEXA 46

Query: 1090 NSPQQPLEPVPHRPLRGGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDA----NSFLEKF 923
             S QQP +P+P+RPLRG +   + +T   QF  K+S    +  + PS      +SFLEK 
Sbjct: 47   -SEQQPPDPIPNRPLRGQR-LSNRQTSNLQFLNKDSPISARRGESPSSPPLQDSSFLEKL 104

Query: 922  KLGADKFSDNKRENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKD 743
            KLG DK   +KRE P                E+AD+IFKKMK+TGLIPNAVAMLDGLCKD
Sbjct: 105  KLGLDK---SKREEPQEVPEPPQXA------EEADQIFKKMKETGLIPNAVAMLDGLCKD 155

Query: 742  GLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSY 563
            GLVQEAMKLFG MREKG+IPEVVIYTAV +GFCKAQK ++AKRIFRKM++NGI PNAFSY
Sbjct: 156  GLVQEAMKLFGSMREKGSIPEVVIYTAVXDGFCKAQKXEDAKRIFRKMQSNGIVPNAFSY 215

Query: 562  SVLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQ 383
            +VLIQGLY   NLEDA EFC EMLE GHSPNV+TFVGLID +C+E  +EEA++VIGKL Q
Sbjct: 216  TVLIQGLYRANNLEDAAEFCSEMLEAGHSPNVATFVGLIDVICKEXDMEEAESVIGKLKQ 275

Query: 382  KGFVLNDKAVKDFLNKKVAVSP 317
            KG+++N+KAVK+FL+KK   SP
Sbjct: 276  KGYLVNEKAVKEFLDKKAPFSP 297


>ref|XP_012076504.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Jatropha curcas] gi|643741632|gb|KDP47047.1|
            hypothetical protein JCGZ_10774 [Jatropha curcas]
          Length = 328

 Score =  328 bits (841), Expect = 7e-87
 Identities = 180/325 (55%), Positives = 220/325 (67%), Gaps = 20/325 (6%)
 Frame = -1

Query: 1183 TTIKAPSFNLLKELRRFSSANDGEGG---------EMKNQNSPQQPLEPVPHRPLRGGK- 1034
            +T+K  S + +  LRRFSS  D   G         E+ N    Q P  P+P+RPLRG + 
Sbjct: 12   STVKPHSLSQI--LRRFSSMRDPSTGFNASFNSNNEVNNGREVQSPPHPIPNRPLRGERG 69

Query: 1033 --PFDHHRTRTPQFPEKNSAAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXXX 860
              P  H R+++      +S     ++    DA  FL+KFKL  D+  DN+   P+     
Sbjct: 70   ERPLQHPRSQS----SPSSGGPRNVKHQSDDA--FLDKFKLRLDRKKDNEIPLPNRPPPP 123

Query: 859  XXXXXXXXXPE--------DADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 704
                      E        DAD+IF+KMK+TGLIPNAVAMLDGLCKDGLVQEAMKLFGLM
Sbjct: 124  PSPSGNDIKQEENVNSPPPDADDIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM 183

Query: 703  REKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNL 524
            REKGTIPEVV+YTAVV+G+ KA K D+AKRIFRKM +NGI PNAFSY VLIQGLY C  L
Sbjct: 184  REKGTIPEVVVYTAVVDGYSKAHKPDDAKRIFRKMLDNGITPNAFSYGVLIQGLYKCNLL 243

Query: 523  EDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDF 344
            +DA++F  +MLE GHSPN++TFVGL+DGLC+EKGVEEAQ+VIG L QKGF +NDKAV++F
Sbjct: 244  DDAIDFTFQMLEAGHSPNITTFVGLVDGLCKEKGVEEAQSVIGSLRQKGFFINDKAVREF 303

Query: 343  LNKKVAVSPLVWEAIFGKKTSQKLF 269
            L+K   +S  VWEAIFGKK S K F
Sbjct: 304  LDKNAPLSSSVWEAIFGKKPSNKPF 328


>gb|KCW55702.1| hypothetical protein EUGRSUZ_I01549 [Eucalyptus grandis]
            gi|629089450|gb|KCW55703.1| hypothetical protein
            EUGRSUZ_I01549 [Eucalyptus grandis]
            gi|629089451|gb|KCW55704.1| hypothetical protein
            EUGRSUZ_I01549 [Eucalyptus grandis]
          Length = 349

 Score =  326 bits (835), Expect = 3e-86
 Identities = 178/308 (57%), Positives = 213/308 (69%), Gaps = 13/308 (4%)
 Frame = -1

Query: 1153 LKELRRFSSANDGEGGEMKNQNSPQQPL-EPVPHRPLRGGKPFDHH-RTRTPQFPEKNSA 980
            L++ R   S +  +     N N  + P  +P+P+RPLRG +          P F      
Sbjct: 49   LEDARTDQSQSRSQSPGYNNDNGNETPPPDPIPNRPLRGLQQSQRIIGNNGPNF------ 102

Query: 979  AREKIRDDPSDANSFLEKFKLGADK-----------FSDNKRENPDXXXXXXXXXXXXXX 833
              E +R DPSD +SFLEKFKL  DK            +   +E                 
Sbjct: 103  RGEGVRRDPSD-DSFLEKFKLSFDKRDKPEGDVASATTQPSQEENKVNSNQMANEGQPPL 161

Query: 832  PEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 653
            PEDADEIFKKMK+TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKG+IPEVVIYTAVVE
Sbjct: 162  PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGSIPEVVIYTAVVE 221

Query: 652  GFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEMLEDGHSP 473
            GFCKAQK D+AKRIFRKM+NNGI PNAFS++VLIQGLY C  LEDA+EFC EM++ GHSP
Sbjct: 222  GFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRLEDALEFCQEMIDAGHSP 281

Query: 472  NVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLVWEAIFG 293
            NV TFVGL++G+C++KGVEEAQ VI +L +KG+ +N+KAV++FL KK   S +VWEAIFG
Sbjct: 282  NVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREFLEKKAPFSSMVWEAIFG 341

Query: 292  KKTSQKLF 269
            KK S  LF
Sbjct: 342  KKQSHSLF 349


>ref|XP_010030710.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910
            [Eucalyptus grandis]
          Length = 1024

 Score =  325 bits (833), Expect = 6e-86
 Identities = 180/315 (57%), Positives = 215/315 (68%), Gaps = 13/315 (4%)
 Frame = -1

Query: 1153 LKELRRFSSANDGEGGEMKNQNSPQQPL-EPVPHRPLRGGKPFDHH-RTRTPQFPEKNSA 980
            L++ R   S +  +     N N  + P  +P+P+RPLRG +          P F      
Sbjct: 49   LEDARTDQSQSRSQSPGYNNDNGNETPPPDPIPNRPLRGLQQSQRIIGNNGPNF------ 102

Query: 979  AREKIRDDPSDANSFLEKFKLGADK-----------FSDNKRENPDXXXXXXXXXXXXXX 833
              E +R DPSD +SFLEKFKL  DK            +   +E                 
Sbjct: 103  RGEGVRRDPSD-DSFLEKFKLSFDKRDKPEGDVASATTQPSQEENKVNSNQMANEGQPPL 161

Query: 832  PEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVE 653
            PEDADEIFKKMK+TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKG+IPEVVIYTAVVE
Sbjct: 162  PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGSIPEVVIYTAVVE 221

Query: 652  GFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEMLEDGHSP 473
            GFCKAQK D+AKRIFRKM+NNGI PNAFS++VLIQGLY C  LEDA+EFC EM++ GHSP
Sbjct: 222  GFCKAQKFDDAKRIFRKMQNNGITPNAFSFTVLIQGLYRCDRLEDALEFCQEMIDAGHSP 281

Query: 472  NVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLVWEAIFG 293
            NV TFVGL++G+C++KGVEEAQ VI +L +KG+ +N+KAV++FL KK   S +VWEAIFG
Sbjct: 282  NVMTFVGLVNGVCKQKGVEEAQTVINRLREKGYFINEKAVREFLEKKAPFSSMVWEAIFG 341

Query: 292  KKTSQKLF*GPKMEW 248
            KK S   F  PK  W
Sbjct: 342  KKQSHS-FSAPKNLW 355


>ref|XP_006384881.1| hypothetical protein POPTR_0004s21920g [Populus trichocarpa]
            gi|550341649|gb|ERP62678.1| hypothetical protein
            POPTR_0004s21920g [Populus trichocarpa]
          Length = 380

 Score =  321 bits (822), Expect = 1e-84
 Identities = 186/350 (53%), Positives = 225/350 (64%), Gaps = 38/350 (10%)
 Frame = -1

Query: 1204 SSQSSIHTTIKAPS---FNLLKELRRFSSANDGEGG--------EMKNQNSPQQPLEPVP 1058
            SS SS    +K  S    +L + LRRFSS+  G           E + +   Q P EP+P
Sbjct: 36   SSSSSQFRVLKLHSHSRISLSQILRRFSSSIKGSTAGAGFNFDDEKERRLQNQNPPEPIP 95

Query: 1057 HRPLRGGKPF-------------DHHRTRTPQF---PEKNSAAREKIRDDPSDANSFLEK 926
            +RPLRG KP               HH + T  F   P+  +    +I DD     +FL+K
Sbjct: 96   NRPLRGPKPNFNNNTNRPARPQPSHHPSTTSPFNLQPQTQTHDFNRISDD-----AFLDK 150

Query: 925  FKLGADKFSDNKREN-----------PDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIP 779
            FKL  D  ++  ++            P                +DA++IF KMK+TGLIP
Sbjct: 151  FKLHPDHNNNVNKDAAAADTKAAAAPPPPKNEQASSASTSEPSQDAEQIFNKMKETGLIP 210

Query: 778  NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKM 599
            NAVAMLDGLCKDGLVQEA+KLFG MREKGTIPEVVIYTAVV+GFCKA KLD+AKRIFRKM
Sbjct: 211  NAVAMLDGLCKDGLVQEALKLFGTMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKM 270

Query: 598  KNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGV 419
            ++NGI PNAFSY+VLIQGL  C   +DA++FC EMLE GHSPNV+TFVGLIDGLCREKGV
Sbjct: 271  QSNGITPNAFSYAVLIQGLSKCNLFDDAIDFCFEMLELGHSPNVTTFVGLIDGLCREKGV 330

Query: 418  EEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKTSQKLF 269
            EEA+ VIG L QKGF ++DKAV+DFL+K   +S  VW+AIFGKK S K F
Sbjct: 331  EEARTVIGTLRQKGFHVHDKAVRDFLDKNKPLSSSVWDAIFGKKPSHKPF 380


>ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297314671|gb|EFH45094.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 301

 Score =  318 bits (814), Expect = 9e-84
 Identities = 175/316 (55%), Positives = 217/316 (68%)
 Frame = -1

Query: 1222 SNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQNSPQQPLEPVPHRPLR 1043
            S  + ++ Q +    +  PS +  + L      + G+ G+ K QN P    EP+P+RPLR
Sbjct: 5    SKAVVFARQMAKQIRVTTPSISATRFL------STGDKGQEKQQNPP----EPLPNRPLR 54

Query: 1042 GGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXX 863
            G +  + HR    + P + +    KI +  SD + FLE+FKLG ++ S   +E P     
Sbjct: 55   GERSSNSHR----EPPARQAHDLGKIDNTLSD-DGFLEQFKLGVNQDS---QETPKPEQY 106

Query: 862  XXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP 683
                       ED+DEIFKKMK+ GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIP
Sbjct: 107  PQDPLLPP---EDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIP 163

Query: 682  EVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFC 503
            EVVIYTAVVEGFCKA K+++AKRIFRKM+ NGI PNAFSY VL+QGLY+C  L+DAV FC
Sbjct: 164  EVVIYTAVVEGFCKAHKIEDAKRIFRKMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFC 223

Query: 502  MEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAV 323
             EMLE GHSPN+ TFVGL+D LCREKGVE+AQ+ I  L QKGF LN KAVK+F++K+   
Sbjct: 224  CEMLESGHSPNIPTFVGLVDALCREKGVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPF 283

Query: 322  SPLVWEAIFGKKTSQK 275
              L WEAIF KK + K
Sbjct: 284  PSLAWEAIFKKKPTDK 299


>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150
            [Solanum lycopersicum]
          Length = 340

 Score =  317 bits (811), Expect = 2e-83
 Identities = 172/303 (56%), Positives = 213/303 (70%), Gaps = 13/303 (4%)
 Frame = -1

Query: 1144 LRRFSSAN--DGEGGEMKNQNSPQQPLEPVPHRPLRGG--KPFDHHRTRTPQ---FPEKN 986
            LR FSS+N       E    N P  P EP+P+RPLR    +PF+  + + P     P  +
Sbjct: 35   LRSFSSSNKFSDYSDESAESNYPPPP-EPIPNRPLRADSRRPFNPSQRQHPSNRSSPNHS 93

Query: 985  SAAREKIRDDPS-----DANSFLEKFKLGADKFSDNKRENPDXXXXXXXXXXXXXXP-ED 824
            +  R    ++ S     D+  FL++F+LG D+  +N   NP               P ED
Sbjct: 94   TTFRRSSENNESQMKSQDSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSEAPPAPPED 153

Query: 823  ADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFC 644
            ADEIFKKMK+TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV+GFC
Sbjct: 154  ADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFC 213

Query: 643  KAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFCMEMLEDGHSPNVS 464
            KAQK D+A RIFRKM+ NGI PNAFSY ++I+GL   K L+DA+EFC+EMLE GHSPNV 
Sbjct: 214  KAQKFDDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVV 273

Query: 463  TFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAVSPLVWEAIFGKKT 284
            TFV L+DG C+EK +E+AQN+I  + QKGF+++DKAV++FL+KK    P+VWEAI GKK 
Sbjct: 274  TFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLDKKGPFLPVVWEAILGKKA 333

Query: 283  SQK 275
            SQ+
Sbjct: 334  SQR 336


>ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X2 [Glycine max] gi|734423081|gb|KHN42002.1|
            Pentatricopeptide repeat-containing protein [Glycine
            soja] gi|947079849|gb|KRH28638.1| hypothetical protein
            GLYMA_11G065700 [Glycine max]
          Length = 395

 Score =  311 bits (798), Expect = 7e-82
 Identities = 183/386 (47%), Positives = 224/386 (58%), Gaps = 66/386 (17%)
 Frame = -1

Query: 1228 GISNPLSYSSQSSIHTTIKAPSF--NLLKELRRFSSANDGEGGEMK-------------- 1097
            G+   +S+S    + + +    +   LL+ +R FS  +D  G   +              
Sbjct: 17   GVHKLVSFSQIEKLVSFVHCKQYLPPLLETVRHFSFTDDCSGRSKQPVGESDDFFLQQSD 76

Query: 1096 -----NQNSPQQPLEPVPHRPLRGGKP-------FDHHRTRTPQFPEK------------ 989
                 N  S Q   EP+P RPLR  KP       F  +   +  FP +            
Sbjct: 77   SSFKDNGESDQSLSEPIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGPDELD 136

Query: 988  -------------NSAAREKIRDDPSDANSFLEKFKLGADKFSDN-------------KR 887
                         N+   +  RD     +SFL KFKLG D  + N             KR
Sbjct: 137  QTNKSSKIDLAFQNTNVAKTNRDAGQSGDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKR 196

Query: 886  ENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGL 707
             NP+               +DADEIFKKMK+TGLIPNAVAMLDGLCKDGLVQEA+KLFGL
Sbjct: 197  SNPNQPAQESMP-------QDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGL 249

Query: 706  MREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKN 527
            MREKGTIPE+VIYTAVVEG+ KA K D+AKRIFRKM+++G++PNAFSY VLIQGLY C  
Sbjct: 250  MREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSR 309

Query: 526  LEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKD 347
            L DA EFC+EMLE GHSPNV+TFVGL+DG C EKGVEEA++ I  L  KGFV+N+KAV+ 
Sbjct: 310  LHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQ 369

Query: 346  FLNKKVAVSPLVWEAIFGKKTSQKLF 269
            FL+KK   SP VWEAIFGKK  Q+ F
Sbjct: 370  FLDKKAPFSPSVWEAIFGKKAPQRPF 395


>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Glycine max] gi|947079847|gb|KRH28636.1|
            hypothetical protein GLYMA_11G065700 [Glycine max]
            gi|947079848|gb|KRH28637.1| hypothetical protein
            GLYMA_11G065700 [Glycine max]
          Length = 388

 Score =  311 bits (798), Expect = 7e-82
 Identities = 183/386 (47%), Positives = 224/386 (58%), Gaps = 66/386 (17%)
 Frame = -1

Query: 1228 GISNPLSYSSQSSIHTTIKAPSF--NLLKELRRFSSANDGEGGEMK-------------- 1097
            G+   +S+S    + + +    +   LL+ +R FS  +D  G   +              
Sbjct: 10   GVHKLVSFSQIEKLVSFVHCKQYLPPLLETVRHFSFTDDCSGRSKQPVGESDDFFLQQSD 69

Query: 1096 -----NQNSPQQPLEPVPHRPLRGGKP-------FDHHRTRTPQFPEK------------ 989
                 N  S Q   EP+P RPLR  KP       F  +   +  FP +            
Sbjct: 70   SSFKDNGESDQSLSEPIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGPDELD 129

Query: 988  -------------NSAAREKIRDDPSDANSFLEKFKLGADKFSDN-------------KR 887
                         N+   +  RD     +SFL KFKLG D  + N             KR
Sbjct: 130  QTNKSSKIDLAFQNTNVAKTNRDAGQSGDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKR 189

Query: 886  ENPDXXXXXXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGL 707
             NP+               +DADEIFKKMK+TGLIPNAVAMLDGLCKDGLVQEA+KLFGL
Sbjct: 190  SNPNQPAQESMP-------QDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGL 242

Query: 706  MREKGTIPEVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKN 527
            MREKGTIPE+VIYTAVVEG+ KA K D+AKRIFRKM+++G++PNAFSY VLIQGLY C  
Sbjct: 243  MREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSR 302

Query: 526  LEDAVEFCMEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKD 347
            L DA EFC+EMLE GHSPNV+TFVGL+DG C EKGVEEA++ I  L  KGFV+N+KAV+ 
Sbjct: 303  LHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQ 362

Query: 346  FLNKKVAVSPLVWEAIFGKKTSQKLF 269
            FL+KK   SP VWEAIFGKK  Q+ F
Sbjct: 363  FLDKKAPFSPSVWEAIFGKKAPQRPF 388


>ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|79326453|ref|NP_001031806.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g38150 gi|4467121|emb|CAB37555.1| putative protein
            [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1|
            putative protein [Arabidopsis thaliana]
            gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis
            thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332661485|gb|AEE86885.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 302

 Score =  311 bits (797), Expect = 9e-82
 Identities = 171/318 (53%), Positives = 212/318 (66%)
 Frame = -1

Query: 1222 SNPLSYSSQSSIHTTIKAPSFNLLKELRRFSSANDGEGGEMKNQNSPQQPLEPVPHRPLR 1043
            S  + ++ Q +    +  PS +  + L      + G+ G++  Q   Q P EP+P+RPLR
Sbjct: 5    SKAVVFARQMAKQIRVTTPSMSATRFL------STGDNGQVDEQ---QNPPEPLPNRPLR 55

Query: 1042 GGKPFDHHRTRTPQFPEKNSAAREKIRDDPSDANSFLEKFKLGADKFSDNKRENPDXXXX 863
            G +  + HR    +       +   + DD      FLE+FKLG ++ S   RE P     
Sbjct: 56   GERSSNSHREPPARQAHNLGKSDTTLSDD-----GFLEQFKLGVNQDS---RETPKPEQY 107

Query: 862  XXXXXXXXXXPEDADEIFKKMKKTGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP 683
                       ED+DEIFKKMK+ GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIP
Sbjct: 108  PQEPLPPP---EDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIP 164

Query: 682  EVVIYTAVVEGFCKAQKLDNAKRIFRKMKNNGINPNAFSYSVLIQGLYSCKNLEDAVEFC 503
            EVVIYTAVVE FCKA K+++AKRIFRKM+NNGI PNAFSY VL+QGLY+C  L+DAV FC
Sbjct: 165  EVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNGIAPNAFSYGVLVQGLYNCNMLDDAVAFC 224

Query: 502  MEMLEDGHSPNVSTFVGLIDGLCREKGVEEAQNVIGKLMQKGFVLNDKAVKDFLNKKVAV 323
             EMLE GHSPNV TFV L+D LCR KGVE+AQ+ I  L QKGF +N KAVK+F++K+   
Sbjct: 225  SEMLESGHSPNVPTFVELVDALCRVKGVEQAQSAIDTLNQKGFAVNVKAVKEFMDKRAPF 284

Query: 322  SPLVWEAIFGKKTSQKLF 269
              L WEAIF KK ++K F
Sbjct: 285  PSLAWEAIFKKKPTEKPF 302

