BLASTX nr result

ID: Catharanthus23_contig00006307 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00006307
         (2909 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   332   4e-88
gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]     327   2e-86
ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi...   326   4e-86
gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, put...   312   6e-82
emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]   310   3e-81
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...   308   1e-80
ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containi...   306   4e-80
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...   305   1e-79
ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutr...   305   1e-79
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...   303   2e-79
gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlise...   303   4e-79
ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi...   301   1e-78
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...   301   1e-78
ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containi...   299   4e-78
gb|ESW28907.1| hypothetical protein PHAVU_002G027800g [Phaseolus...   299   5e-78
ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...   298   1e-77
ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi...   295   8e-77
ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi...   295   8e-77
ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi...   295   8e-77
ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar...   294   2e-76

>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform 1 [Solanum lycopersicum]
            gi|460415472|ref|XP_004253082.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g38150-like isoform 2 [Solanum lycopersicum]
          Length = 340

 Score =  332 bits (852), Expect = 4e-88
 Identities = 166/233 (71%), Positives = 195/233 (83%), Gaps = 2/233 (0%)
 Frame = +2

Query: 2000 DNESQGRGRGVVEDSDFLERFKLGFDRTKR-VNSDSK-ESPDQTADTTAEPTPEDADEIF 2173
            +NESQ + +   +  DFL+RF+LGFDR +   N++ K ES D          PEDADEIF
Sbjct: 102  NNESQMKSQ---DSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSEAPPAPPEDADEIF 158

Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353
            KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVV+GFCKAQK 
Sbjct: 159  KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 218

Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533
            DDA+RIFRKMQ NGI PNAFSYGI+++ L +GKRLDDA EFC EMLEAGH+PNV TF+ L
Sbjct: 219  DDAVRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVVTFVTL 278

Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            VD FC+EK LE+ +++I T+RQKGF++D+KAVRE+LDKKGPF+P+VWEAI GK
Sbjct: 279  VDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREFLDKKGPFLPVVWEAILGK 331


>gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]
          Length = 306

 Score =  327 bits (838), Expect = 2e-86
 Identities = 167/233 (71%), Positives = 185/233 (79%), Gaps = 6/233 (2%)
 Frame = +2

Query: 2012 QGRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTAEPTP------EDADEIF 2173
            +GRG    ED  FLE+FKLG D +K      +E P + A     P P      EDADEIF
Sbjct: 70   RGRGPLTSEDDSFLEKFKLGLDSSK---DGMQEKPRREAARPKPPLPQPPPPPEDADEIF 126

Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353
            KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLM+EKGTIPEVVIYTAVV+GFCKAQKL
Sbjct: 127  KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKL 186

Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533
            DDA+RIFRKMQSNGI PNAFSY +LVQ LC GKRL+D  EFC EMLEAGH+PNVATF+GL
Sbjct: 187  DDAVRIFRKMQSNGIEPNAFSYSVLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATFVGL 246

Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            VD  C EKG+EE + +I  LR KGF+L+EKAVRE+LDKK  F P VWEAIFGK
Sbjct: 247  VDGLCEEKGVEEAQGVIGKLRDKGFLLNEKAVREFLDKKASFSPSVWEAIFGK 299


>ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Solanum tuberosum]
          Length = 354

 Score =  326 bits (835), Expect = 4e-86
 Identities = 167/247 (67%), Positives = 193/247 (78%), Gaps = 7/247 (2%)
 Frame = +2

Query: 1973 RKSAQFGYGDNESQGRGRGVVEDSDFLERFKLGFDRTKRVNSDSK-------ESPDQTAD 2131
            R+S +   G  +SQ       +  DFL+RF+LGFDR K  N ++        ES D    
Sbjct: 107  RRSGENNGGQMKSQ-------DSEDFLKRFQLGFDR-KEENPNTNPALHPKGESSDSPVS 158

Query: 2132 TTAEPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 2311
                  PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI
Sbjct: 159  EAPPAPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVI 218

Query: 2312 YTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEML 2491
            YTAVV+GF KAQK DDA+RIFRKMQ NGI PNAFSYGIL++ L +G RLDDA EFC EML
Sbjct: 219  YTAVVDGFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEML 278

Query: 2492 EAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLV 2671
            EAGH+PNV TF+ LVD FC+EK LE+ +++I T+RQKGF++D+KAVREYLDKKGPF+P+V
Sbjct: 279  EAGHSPNVVTFVTLVDGFCKEKSLEDAQNMIKTVRQKGFIVDDKAVREYLDKKGPFLPVV 338

Query: 2672 WEAIFGK 2692
            WEAI GK
Sbjct: 339  WEAILGK 345


>gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao]
          Length = 345

 Score =  312 bits (799), Expect = 6e-82
 Identities = 156/221 (70%), Positives = 184/221 (83%), Gaps = 3/221 (1%)
 Frame = +2

Query: 2039 DSDFLERFKLGFD--RTKRVNSDSKESPDQTADTTAEPTP-EDADEIFKKMKETGLIPNA 2209
            D +FLE+FKLG D  R K+ +     +  +  +   +P+P +DADEIFKKMKETGLIPNA
Sbjct: 118  DENFLEKFKLGLDNKRGKQPSDSEAAALLRRKEQEEKPSPPQDADEIFKKMKETGLIPNA 177

Query: 2210 VAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQS 2389
            VAMLDGLCKDGL+QEAMKLFG MREKGTIPEVVIYTAVV+GFCKA KLDDA RIFRKMQS
Sbjct: 178  VAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRKMQS 237

Query: 2390 NGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEE 2569
             G+TPN+FSY +L+Q L R  +LDDA EFC EMLEAGH+PNV TF+GLVD  C+EKG+EE
Sbjct: 238  KGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTFVGLVDGLCKEKGVEE 297

Query: 2570 TESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
             +S+I TL+QKGFVL++KAVR++LDKK PF PLVWEAIFGK
Sbjct: 298  AQSVIGTLKQKGFVLNDKAVRQFLDKKAPFSPLVWEAIFGK 338


>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score =  310 bits (793), Expect = 3e-81
 Identities = 151/224 (67%), Positives = 183/224 (81%), Gaps = 2/224 (0%)
 Frame = +2

Query: 2027 GVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKETGLI 2200
            G   +  FLERFKLG  + +R    +   P  +Q A+   E  P++ADEIF+KMKE+GLI
Sbjct: 151  GATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLI 210

Query: 2201 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 2380
            PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKA++LDDA+RIFRK
Sbjct: 211  PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLDDAVRIFRK 270

Query: 2381 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 2560
            MQ+NGI+PNAFSY +L++ + +G RLD A +FC EMLEAGH+PNVAT + L+  FC+EKG
Sbjct: 271  MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 330

Query: 2561 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            +EE +++I TL+QKG  +D+KAVREYLDKKGP  PLVWEA FGK
Sbjct: 331  VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 374


>ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Vitis vinifera]
          Length = 380

 Score =  308 bits (788), Expect = 1e-80
 Identities = 150/224 (66%), Positives = 183/224 (81%), Gaps = 2/224 (0%)
 Frame = +2

Query: 2027 GVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKETGLI 2200
            G   +  FLERFKLG  + +R    +   P  +Q A+   E  P++ADEIF+KMKE+GLI
Sbjct: 150  GATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGKEQPPQNADEIFRKMKESGLI 209

Query: 2201 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 2380
            PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKA++L+DA+RIFRK
Sbjct: 210  PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLNDAVRIFRK 269

Query: 2381 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 2560
            MQ+NGI+PNAFSY +L++ + +G RLD A +FC EMLEAGH+PNVAT + L+  FC+EKG
Sbjct: 270  MQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVATLVDLIHEFCKEKG 329

Query: 2561 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            +EE +++I TL+QKG  +D+KAVREYLDKKGP  PLVWEA FGK
Sbjct: 330  VEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSPLVWEAFFGK 373


>ref|XP_004290096.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Fragaria vesca subsp. vesca]
          Length = 309

 Score =  306 bits (783), Expect = 4e-80
 Identities = 151/222 (68%), Positives = 184/222 (82%), Gaps = 2/222 (0%)
 Frame = +2

Query: 2033 VEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTA-EPTP-EDADEIFKKMKETGLIPN 2206
            ++DS FLE+ K+G +++KR      E P + A+    +P P E+A+EIFKKMKETGLIPN
Sbjct: 87   LQDSSFLEKLKMGLEKSKR------EKPQEAAEPPPPQPQPTEEANEIFKKMKETGLIPN 140

Query: 2207 AVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQ 2386
            AVAMLDGLCKDGLVQEAMKLFG MREKGTIPEVVIYTAVVEGFCK +K +DA R+FRKMQ
Sbjct: 141  AVAMLDGLCKDGLVQEAMKLFGSMREKGTIPEVVIYTAVVEGFCKGRKPEDAKRVFRKMQ 200

Query: 2387 SNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLE 2566
            SNGI PNAFSY ++VQ LCR +++ DAAEFCGEMLEAGH+PNV TF+GLVD  C+E G+E
Sbjct: 201  SNGIVPNAFSYNVMVQGLCRCEKMKDAAEFCGEMLEAGHSPNVTTFVGLVDGVCKENGVE 260

Query: 2567 ETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
              ES+I  L+Q+G+V++EKAVRE+LDK+  F P+VWEAIFGK
Sbjct: 261  GGESVIGKLKQRGYVVNEKAVREFLDKRASFSPMVWEAIFGK 302


>ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina]
            gi|557524309|gb|ESR35615.1| hypothetical protein
            CICLE_v10028759mg [Citrus clementina]
          Length = 344

 Score =  305 bits (780), Expect = 1e-79
 Identities = 155/256 (60%), Positives = 195/256 (76%), Gaps = 8/256 (3%)
 Frame = +2

Query: 1949 FSPASQKNRKSAQFGYGDNESQGRGR-GVVEDSDFLERFKLGFDR-------TKRVNSDS 2104
            F+   Q+ R   Q     N  + +   GV  D +FL++FKL  D+        + +    
Sbjct: 84   FNNYQQQQRPQQQSFQSPNRPRPKSPDGVQSDENFLDQFKLAIDKKPDNPQQNESLGERQ 143

Query: 2105 KESPDQTADTTAEPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE 2284
            ++ P++  +  +EP P++ADEIFKKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMRE
Sbjct: 144  EQKPNRN-EPISEP-PQEADEIFKKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMRE 201

Query: 2285 KGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDD 2464
            KGTIPEVVIYTAVV+GFCKAQK DDA RIFRKMQSNGI PNAFSY +L+Q L +  +L++
Sbjct: 202  KGTIPEVVIYTAVVDGFCKAQKFDDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEE 261

Query: 2465 AAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLD 2644
            A E+C EMLEAGH+PNV TF+GLVD  CREKG+E+ +S+I TL++KGF++++KAVRE+LD
Sbjct: 262  AVEYCIEMLEAGHSPNVTTFVGLVDGLCREKGVEKAQSVIATLKEKGFLVNDKAVREFLD 321

Query: 2645 KKGPFMPLVWEAIFGK 2692
            KK PF   VWEAIFGK
Sbjct: 322  KKAPFSSSVWEAIFGK 337


>ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum]
            gi|557091098|gb|ESQ31745.1| hypothetical protein
            EUTSA_v10005467mg [Eutrema salsugineum]
          Length = 295

 Score =  305 bits (780), Expect = 1e-79
 Identities = 150/232 (64%), Positives = 178/232 (76%)
 Frame = +2

Query: 1997 GDNESQGRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTTAEPTPEDADEIFK 2176
            G N ++      + D DFLE+FKLG  +     ++ K  P Q       P PED++EIFK
Sbjct: 59   GSNSARPSQPAKLSDHDFLEQFKLGVKQDDSRKTEQK--PQQETSPEPLPAPEDSEEIFK 116

Query: 2177 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLD 2356
             MKE GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVEGFCKA K++
Sbjct: 117  NMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIE 176

Query: 2357 DAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLV 2536
            DA RIFRKMQ+NGI PNAFSYG+LVQ LC    LDDA +FCGEMLE+GH+PNV+TF+GLV
Sbjct: 177  DAKRIFRKMQTNGIVPNAFSYGVLVQGLCNCNMLDDAVDFCGEMLESGHSPNVSTFVGLV 236

Query: 2537 DCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            D  CREKG+E+ +S I TL QKGF ++ KAV+E+++KK  F  L WEAIF K
Sbjct: 237  DALCREKGVEQAQSAIDTLNQKGFAVNLKAVKEFMEKKASFPSLAWEAIFKK 288


>ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Citrus sinensis]
          Length = 387

 Score =  303 bits (777), Expect = 2e-79
 Identities = 153/229 (66%), Positives = 184/229 (80%), Gaps = 7/229 (3%)
 Frame = +2

Query: 2027 GVVEDSDFLERFKLGFDRTKRVNSDSKESPDQTADTT-------AEPTPEDADEIFKKMK 2185
            GV  D +FL++FKL  D+ K  N    ES  Q  +         +EP P++ADEIFKKMK
Sbjct: 154  GVQSDENFLDQFKLAIDK-KPGNPQQNESLGQRQEQKPNRNEPISEP-PQEADEIFKKMK 211

Query: 2186 ETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAI 2365
            ETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVVIYTAVV+GFCKAQK DDA 
Sbjct: 212  ETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAK 271

Query: 2366 RIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCF 2545
            RIFRKMQSNGI PNAFSY +L+Q L +  +L++A E+C EMLEAGH+PNV TF+GLVD  
Sbjct: 272  RIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTFVGLVDGL 331

Query: 2546 CREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            CRE+G+E+ +S+I TL++KGF++++KAVRE+LDKK PF   VWEAIFGK
Sbjct: 332  CRERGVEKAQSVIATLKEKGFLVNDKAVREFLDKKAPFSSSVWEAIFGK 380


>gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlisea aurea]
          Length = 272

 Score =  303 bits (775), Expect = 4e-79
 Identities = 150/223 (67%), Positives = 181/223 (81%), Gaps = 6/223 (2%)
 Frame = +2

Query: 2039 DSDFLERFKLGFDRT------KRVNSDSKESPDQTADTTAEPTPEDADEIFKKMKETGLI 2200
            DSDFLERFKLGFDR       + V S+     ++  +      PE+ADEIF+KMKETGLI
Sbjct: 41   DSDFLERFKLGFDRKTTTPPGRVVESEKAGGEEEKEEQQPLSPPENADEIFRKMKETGLI 100

Query: 2201 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRK 2380
            PNAVAMLDGLCKDGLVQ+A+KLFG MREKG+IP+VV+YTAVVEGFCKAQK DDAIRIF+K
Sbjct: 101  PNAVAMLDGLCKDGLVQDALKLFGTMREKGSIPDVVVYTAVVEGFCKAQKHDDAIRIFKK 160

Query: 2381 MQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKG 2560
            M+SNGI PNAFSY IL++ LC GKRL+DA+ F  EMLE G++PN+ATF GLV+ +C+EKG
Sbjct: 161  MKSNGIAPNAFSYQILIRGLCDGKRLEDASGFTAEMLETGYSPNLATFTGLVNGWCQEKG 220

Query: 2561 LEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFG 2689
            LEE ++L+  ++QKGF ++EKAVREYLDKKGPF   VWEAI G
Sbjct: 221  LEEAKTLVGAMKQKGFSVEEKAVREYLDKKGPFSSPVWEAILG 263


>ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X2 [Glycine max]
          Length = 395

 Score =  301 bits (771), Expect = 1e-78
 Identities = 159/233 (68%), Positives = 182/233 (78%), Gaps = 8/233 (3%)
 Frame = +2

Query: 2018 RGRGVVEDSDFLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIF 2173
            R  G   DS FL +FKLGFD  K VN    + SK+S +       +P     P+DADEIF
Sbjct: 158  RDAGQSGDS-FLNKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIF 215

Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353
            KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ KA K 
Sbjct: 216  KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 275

Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533
            DDA RIFRKMQS+G++PNAFSY +L+Q L +  RL DA EFC EMLEAGH+PNV TF+GL
Sbjct: 276  DDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGL 335

Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            VD FC EKG+EE +S I TL  KGFV++EKAVR++LDKK PF P VWEAIFGK
Sbjct: 336  VDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 388


>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Glycine max]
          Length = 388

 Score =  301 bits (771), Expect = 1e-78
 Identities = 159/233 (68%), Positives = 182/233 (78%), Gaps = 8/233 (3%)
 Frame = +2

Query: 2018 RGRGVVEDSDFLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIF 2173
            R  G   DS FL +FKLGFD  K VN    + SK+S +       +P     P+DADEIF
Sbjct: 151  RDAGQSGDS-FLNKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIF 208

Query: 2174 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKL 2353
            KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+VIYTAVVEG+ KA K 
Sbjct: 209  KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 268

Query: 2354 DDAIRIFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGL 2533
            DDA RIFRKMQS+G++PNAFSY +L+Q L +  RL DA EFC EMLEAGH+PNV TF+GL
Sbjct: 269  DDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGL 328

Query: 2534 VDCFCREKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            VD FC EKG+EE +S I TL  KGFV++EKAVR++LDKK PF P VWEAIFGK
Sbjct: 329  VDGFCNEKGVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 381


>ref|XP_004512018.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Cicer arietinum]
            gi|502161087|ref|XP_004512019.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g38150-like isoform X2 [Cicer arietinum]
          Length = 371

 Score =  299 bits (766), Expect = 4e-78
 Identities = 160/259 (61%), Positives = 189/259 (72%), Gaps = 16/259 (6%)
 Frame = +2

Query: 1964 QKNRKSAQFGYGDNESQGRGRGVVEDS--------DFLERFKLGFDRTKRVNSDSKESPD 2119
            ++  KS+Q   G      +GR V E S         FL++FKLGFD  K  N    ES  
Sbjct: 112  RRGSKSSQIDLGF-----QGRNVAEVSRDAGQLGDSFLDKFKLGFD-DKVGNHSEVESNG 165

Query: 2120 QTADTTA--------EPTPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGL 2275
            QT  + A        EP P+DADEIFKKMKETGLIPNAVAMLDGLCKDG VQEA+KLFGL
Sbjct: 166  QTEGSRASDTDQPAQEPMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGNVQEALKLFGL 225

Query: 2276 MREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKMQSNGITPNAFSYGILVQALCRGKR 2455
            MREKGTIPE+VIYTAVVEG+ KA K DDAIRIFRKMQSNGI+PNA+S+ +L+Q L +  R
Sbjct: 226  MREKGTIPEIVIYTAVVEGYTKAHKADDAIRIFRKMQSNGISPNAYSFTVLIQGLYKCSR 285

Query: 2456 LDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGLEETESLITTLRQKGFVLDEKAVRE 2635
            L DA EFC EMLEAG++ NV TF+G+VD FC+E G+EE + +I TL +KGF  DEKAVRE
Sbjct: 286  LQDALEFCVEMLEAGYSLNVTTFVGVVDGFCKEDGVEEAKGVIKTLTEKGFAYDEKAVRE 345

Query: 2636 YLDKKGPFMPLVWEAIFGK 2692
            +LDKK PF P +WEA+FGK
Sbjct: 346  FLDKKAPFSPSIWEAVFGK 364


>gb|ESW28907.1| hypothetical protein PHAVU_002G027800g [Phaseolus vulgaris]
            gi|561030329|gb|ESW28908.1| hypothetical protein
            PHAVU_002G027800g [Phaseolus vulgaris]
          Length = 451

 Score =  299 bits (765), Expect = 5e-78
 Identities = 151/225 (67%), Positives = 175/225 (77%), Gaps = 10/225 (4%)
 Frame = +2

Query: 2048 FLERFKLGFD----------RTKRVNSDSKESPDQTADTTAEPTPEDADEIFKKMKETGL 2197
            FL++FKL FD           +K+     + +PDQ A    EP P+DADEIFKKMKETGL
Sbjct: 223  FLDKFKLAFDDKTVNLSEVAASKQSEEAKRSNPDQQAQ---EPVPQDADEIFKKMKETGL 279

Query: 2198 IPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFR 2377
            IPNAVAMLDGLCKDGLVQEA+KLF LMREKGTIPE+VIYTAVVEG+ KA K DDA RIFR
Sbjct: 280  IPNAVAMLDGLCKDGLVQEALKLFALMREKGTIPEIVIYTAVVEGYTKADKADDAKRIFR 339

Query: 2378 KMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREK 2557
            KMQS+GI+PNAFSY ++VQ L + +RL DA EFC EMLEAGH+PNV TF+ LVD FC+EK
Sbjct: 340  KMQSSGISPNAFSYTVIVQGLYKCRRLQDAFEFCVEMLEAGHSPNVTTFVSLVDGFCKEK 399

Query: 2558 GLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            G+EE +  + TL  KGF  DEKAVR++LDKK PF P VWEAIFGK
Sbjct: 400  GVEEAKDAVKTLTGKGFAFDEKAVRQFLDKKTPFSPSVWEAIFGK 444


>ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297314671|gb|EFH45094.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 301

 Score =  298 bits (762), Expect = 1e-77
 Identities = 153/228 (67%), Positives = 177/228 (77%), Gaps = 2/228 (0%)
 Frame = +2

Query: 2015 GRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESP--DQTADTTAEPTPEDADEIFKKMKE 2188
            G+    + D  FLE+FKLG      VN DS+E+P  +Q       P PED+DEIFKKMKE
Sbjct: 74   GKIDNTLSDDGFLEQFKLG------VNQDSQETPKPEQYPQDPLLP-PEDSDEIFKKMKE 126

Query: 2189 TGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIR 2368
             GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVEGFCKA K++DA R
Sbjct: 127  GGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKR 186

Query: 2369 IFRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFC 2548
            IFRKMQ+NGITPNAFSYG+LVQ L     LDDA  FC EMLE+GH+PN+ TF+GLVD  C
Sbjct: 187  IFRKMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALC 246

Query: 2549 REKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            REKG+E+ +S I  L QKGF L+ KAV+E++DK+ PF  L WEAIF K
Sbjct: 247  REKGVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAWEAIFKK 294


>ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X3 [Glycine max] gi|571435834|ref|XP_006573590.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g38150-like isoform X4 [Glycine max]
          Length = 403

 Score =  295 bits (755), Expect = 8e-77
 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%)
 Frame = +2

Query: 2048 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 2203
            FL++FKLGFD  K VN    + SK+S +       +P     P+DA+EIFKKMKETGLIP
Sbjct: 175  FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 233

Query: 2204 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 2383
            NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM
Sbjct: 234  NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 293

Query: 2384 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 2563
            QS+GI+PNAFSY +L+Q L +  RL DA EFC EMLEAGH+PNV  F+GLVD FC EKG+
Sbjct: 294  QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 353

Query: 2564 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK
Sbjct: 354  EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 396


>ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X2 [Glycine max]
          Length = 431

 Score =  295 bits (755), Expect = 8e-77
 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%)
 Frame = +2

Query: 2048 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 2203
            FL++FKLGFD  K VN    + SK+S +       +P     P+DA+EIFKKMKETGLIP
Sbjct: 203  FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 261

Query: 2204 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 2383
            NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM
Sbjct: 262  NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 321

Query: 2384 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 2563
            QS+GI+PNAFSY +L+Q L +  RL DA EFC EMLEAGH+PNV  F+GLVD FC EKG+
Sbjct: 322  QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 381

Query: 2564 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK
Sbjct: 382  EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 424


>ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            isoform X1 [Glycine max]
          Length = 457

 Score =  295 bits (755), Expect = 8e-77
 Identities = 152/223 (68%), Positives = 178/223 (79%), Gaps = 8/223 (3%)
 Frame = +2

Query: 2048 FLERFKLGFDRTKRVN----SDSKESPDQTADTTAEPT----PEDADEIFKKMKETGLIP 2203
            FL++FKLGFD  K VN    + SK+S +       +P     P+DA+EIFKKMKETGLIP
Sbjct: 229  FLDKFKLGFD-DKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKKMKETGLIP 287

Query: 2204 NAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRIFRKM 2383
            NAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+VIYTAVVEG+ KA K DDA RIFRKM
Sbjct: 288  NAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKM 347

Query: 2384 QSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCREKGL 2563
            QS+GI+PNAFSY +L+Q L +  RL DA EFC EMLEAGH+PNV  F+GLVD FC EKG+
Sbjct: 348  QSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVDGFCNEKGV 407

Query: 2564 EETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
            EE +S I TL +KGFV++EKAV ++LDKK PF P VWEAIFGK
Sbjct: 408  EEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 450


>ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|79326453|ref|NP_001031806.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g38150 gi|4467121|emb|CAB37555.1| putative protein
            [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1|
            putative protein [Arabidopsis thaliana]
            gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis
            thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332661485|gb|AEE86885.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 302

 Score =  294 bits (752), Expect = 2e-76
 Identities = 149/227 (65%), Positives = 173/227 (76%), Gaps = 1/227 (0%)
 Frame = +2

Query: 2015 GRGRGVVEDSDFLERFKLGFDRTKRVNSDSKESPD-QTADTTAEPTPEDADEIFKKMKET 2191
            G+    + D  FLE+FKLG      VN DS+E+P  +       P PED+DEIFKKMKE 
Sbjct: 75   GKSDTTLSDDGFLEQFKLG------VNQDSRETPKPEQYPQEPLPPPEDSDEIFKKMKEG 128

Query: 2192 GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKAQKLDDAIRI 2371
            GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR+KGTIPEVVIYTAVVE FCKA K++DA RI
Sbjct: 129  GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRI 188

Query: 2372 FRKMQSNGITPNAFSYGILVQALCRGKRLDDAAEFCGEMLEAGHAPNVATFIGLVDCFCR 2551
            FRKMQ+NGI PNAFSYG+LVQ L     LDDA  FC EMLE+GH+PNV TF+ LVD  CR
Sbjct: 189  FRKMQNNGIAPNAFSYGVLVQGLYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCR 248

Query: 2552 EKGLEETESLITTLRQKGFVLDEKAVREYLDKKGPFMPLVWEAIFGK 2692
             KG+E+ +S I TL QKGF ++ KAV+E++DK+ PF  L WEAIF K
Sbjct: 249  VKGVEQAQSAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKK 295


Top