BLASTX nr result

ID: Rehmannia22_contig00022617 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00022617
         (715 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006357522.1| PREDICTED: pentatricopeptide repeat-containi...   369   e-100
ref|XP_004243803.1| PREDICTED: pentatricopeptide repeat-containi...   365   6e-99
gb|EMJ18275.1| hypothetical protein PRUPE_ppa000834mg [Prunus pe...   332   7e-89
emb|CAN75708.1| hypothetical protein VITISV_031421 [Vitis vinifera]   332   1e-88
ref|XP_002272784.1| PREDICTED: pentatricopeptide repeat-containi...   330   3e-88
ref|XP_004306009.1| PREDICTED: pentatricopeptide repeat-containi...   329   6e-88
ref|XP_002517971.1| pentatricopeptide repeat-containing protein,...   322   6e-86
gb|EXB62281.1| hypothetical protein L484_022169 [Morus notabilis]     320   2e-85
ref|XP_006491629.1| PREDICTED: pentatricopeptide repeat-containi...   318   1e-84
ref|XP_006447317.1| hypothetical protein CICLE_v10017547mg [Citr...   318   1e-84
gb|EOX99345.1| Pentatricopeptide repeat (PPR) superfamily protei...   315   7e-84
gb|EPS73099.1| hypothetical protein M569_01654 [Genlisea aurea]       310   2e-82
ref|XP_004141647.1| PREDICTED: pentatricopeptide repeat-containi...   293   4e-77
ref|XP_004169587.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   289   5e-76
ref|XP_002887500.1| pentatricopeptide repeat-containing protein ...   280   4e-73
ref|XP_002319373.2| hypothetical protein POPTR_0013s14110g [Popu...   279   6e-73
ref|XP_006300678.1| hypothetical protein CARUB_v10019718mg [Caps...   278   1e-72
ref|XP_006390515.1| hypothetical protein EUTSA_v10019624mg, part...   277   3e-72
ref|XP_006585437.1| PREDICTED: pentatricopeptide repeat-containi...   272   9e-71
ref|NP_177512.1| pentatricopeptide repeat-containing protein [Ar...   269   8e-70

>ref|XP_006357522.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like isoform X1 [Solanum tuberosum]
           gi|565382385|ref|XP_006357523.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g73710-like isoform X2 [Solanum tuberosum]
          Length = 1012

 Score =  369 bits (948), Expect = e-100
 Identities = 180/234 (76%), Positives = 196/234 (83%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNPK 180
           VF+GFKLQCHSK  A P              ILPS+LR+L +E D+EKTL LY+GKL+PK
Sbjct: 87  VFIGFKLQCHSKAEALPSRTVINGKRKGYGGILPSILRSLRTESDVEKTLNLYYGKLSPK 146

Query: 181 EQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAK 360
           EQTVILKEQS W K LRVFEW KSQKDYVPNVIHYNV+LRALGRAKKWDELRLCWIEMAK
Sbjct: 147 EQTVILKEQSNWGKALRVFEWMKSQKDYVPNVIHYNVILRALGRAKKWDELRLCWIEMAK 206

Query: 361 KGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDK 540
            GV PTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTM+TV+KVLKDAGEYD+
Sbjct: 207 NGVFPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMNTVVKVLKDAGEYDR 266

Query: 541 ADRFYKDWSVGKIELDDLELDSMGDQQSISLKQFLLSELFRTGGRSHSFTNFNE 702
           ADRFYKDW  GKIELDD +LDS+ D +  SLKQFLL+ELFRTGGR+ S    NE
Sbjct: 267 ADRFYKDWCTGKIELDDFDLDSIDDSEPFSLKQFLLTELFRTGGRNPSRVLDNE 320


>ref|XP_004243803.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like [Solanum lycopersicum]
          Length = 1014

 Score =  365 bits (938), Expect = 6e-99
 Identities = 176/228 (77%), Positives = 194/228 (85%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNPK 180
           V +GFKLQCHSK  A P              ILPS+LR+L +E D+EKTL LY+GKL+PK
Sbjct: 87  VLIGFKLQCHSKAEALPSRTVINGKKKGYGGILPSILRSLRTESDVEKTLNLYYGKLSPK 146

Query: 181 EQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAK 360
           EQTVILKEQS W+K LRVFEW KSQKDYVPNVIHYNV+LRALGRAKKWDELRLCWIEMAK
Sbjct: 147 EQTVILKEQSNWEKALRVFEWMKSQKDYVPNVIHYNVILRALGRAKKWDELRLCWIEMAK 206

Query: 361 KGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDK 540
            GV PTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTM+TV+KVLKDAGEYD+
Sbjct: 207 NGVFPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMNTVVKVLKDAGEYDR 266

Query: 541 ADRFYKDWSVGKIELDDLELDSMGDQQSISLKQFLLSELFRTGGRSHS 684
           ADRFYKDW  GKIELDD +LDS+ + +  SLKQFLL+ELFRTGGR+ S
Sbjct: 267 ADRFYKDWCTGKIELDDFDLDSIDNSEPFSLKQFLLTELFRTGGRNPS 314


>gb|EMJ18275.1| hypothetical protein PRUPE_ppa000834mg [Prunus persica]
          Length = 987

 Score =  332 bits (851), Expect = 7e-89
 Identities = 163/231 (70%), Positives = 188/231 (81%), Gaps = 7/231 (3%)
 Frame = +1

Query: 4   FLGFKLQCHSKIIASPXXXXXXXXXXXXXX-ILPSVLRALESEKDLEKTLELYHGKLNPK 180
           F+GFKLQC SK +  P               +LPS+LR+L+SE D+EKTL      LNPK
Sbjct: 92  FVGFKLQCDSKTLVLPTKGSSINGKKKAYGGVLPSILRSLQSENDVEKTLNSCGENLNPK 151

Query: 181 EQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAK 360
           EQTVILKEQ +W++V+RVFEWFKSQK+YVPNVIHYNVVLR LGRA+KWDELRLCWIEMAK
Sbjct: 152 EQTVILKEQKRWERVVRVFEWFKSQKEYVPNVIHYNVVLRKLGRAQKWDELRLCWIEMAK 211

Query: 361 KGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDK 540
           +GVLPTNNTY MLVDVYGKAGLVKEALLWIKHMKLRGIFPD+VTM+TV+K LKDAGE+D+
Sbjct: 212 RGVLPTNNTYAMLVDVYGKAGLVKEALLWIKHMKLRGIFPDDVTMNTVVKALKDAGEFDR 271

Query: 541 ADRFYKDWSVGKIELDDLELDSMGDQ------QSISLKQFLLSELFRTGGR 675
           AD+FYKDW  GKIELD+L+LDSMGD       + IS K FL +ELF+TGGR
Sbjct: 272 ADKFYKDWCDGKIELDELDLDSMGDSVNDSGLEPISFKHFLSTELFKTGGR 322


>emb|CAN75708.1| hypothetical protein VITISV_031421 [Vitis vinifera]
          Length = 1313

 Score =  332 bits (850), Expect = 1e-88
 Identities = 163/231 (70%), Positives = 187/231 (80%), Gaps = 6/231 (2%)
 Frame = +1

Query: 1    VFLGFKLQCHSKIIASPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNPK 180
            VF GFKLQCHS+ +A P              +LPS+LRALESE ++E TL    GKL+PK
Sbjct: 399  VFPGFKLQCHSRTVALPTKTSISRRKKKYSGVLPSILRALESEXNIEDTLSSC-GKLSPK 457

Query: 181  EQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAK 360
            EQTVILKEQS W++VLRVFEW KSQ+DYVPNVIHYNVVLR LGRA+KWDELRLCWIEMAK
Sbjct: 458  EQTVILKEQSSWERVLRVFEWIKSQEDYVPNVIHYNVVLRVLGRAQKWDELRLCWIEMAK 517

Query: 361  KGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDK 540
             GVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRG+FPDEVTM+TV++VLKDAGE+D 
Sbjct: 518  NGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGVFPDEVTMNTVVRVLKDAGEFDW 577

Query: 541  ADRFYKDWSVGKIELDDLELDSMGDQQS------ISLKQFLLSELFRTGGR 675
            ADRFY+DW VGK+EL D +L+S+ D         +SLK FL +ELF+ GGR
Sbjct: 578  ADRFYRDWCVGKVELGDFDLESVADSDDEIGSAPVSLKHFLSTELFKIGGR 628


>ref|XP_002272784.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like [Vitis vinifera]
          Length = 1008

 Score =  330 bits (846), Expect = 3e-88
 Identities = 162/231 (70%), Positives = 186/231 (80%), Gaps = 6/231 (2%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNPK 180
           VF GFKLQCHS+ +A P              +LPS+LRALESE ++E TL    GKL+PK
Sbjct: 94  VFPGFKLQCHSRTVALPTKTSISRRKKKYSGVLPSILRALESENNIEDTLSSC-GKLSPK 152

Query: 181 EQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAK 360
           EQTVILKEQS W++VLRVFEW KSQ+DYVPNVIHYNVVLR LGRA+KWDELRLCWIEMAK
Sbjct: 153 EQTVILKEQSSWERVLRVFEWIKSQEDYVPNVIHYNVVLRVLGRAQKWDELRLCWIEMAK 212

Query: 361 KGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDK 540
            GVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRG+FPDEV M+TV++VLKDAGE+D 
Sbjct: 213 NGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGVFPDEVAMNTVVRVLKDAGEFDW 272

Query: 541 ADRFYKDWSVGKIELDDLELDSMGDQQS------ISLKQFLLSELFRTGGR 675
           ADRFY+DW VGK+EL D +L+S+ D         +SLK FL +ELF+ GGR
Sbjct: 273 ADRFYRDWCVGKVELGDFDLESVADSDDEIGSAPVSLKHFLSTELFKIGGR 323


>ref|XP_004306009.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like [Fragaria vesca subsp. vesca]
          Length = 1000

 Score =  329 bits (843), Expect = 6e-88
 Identities = 165/248 (66%), Positives = 197/248 (79%), Gaps = 10/248 (4%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXX-ILPSVLRALESEKDLEKTLELYHGKLNP 177
           V++GFKLQCHSK +  P               +LPS+LR+LE+E D+EKTLE +   L+ 
Sbjct: 67  VYVGFKLQCHSKALVLPTKVSLVNGKKKRYGGVLPSILRSLENENDVEKTLESFGESLSA 126

Query: 178 KEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMA 357
           KEQTVILKEQ  W++VLRVFEWFKSQK+Y+PNVIHYNVVLR LGRA++WDELRLCWIEMA
Sbjct: 127 KEQTVILKEQRSWERVLRVFEWFKSQKEYLPNVIHYNVVLRVLGRAQRWDELRLCWIEMA 186

Query: 358 KKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYD 537
           KKGVLPTNNTY MLVDVYGKAGLVKEALLWIKHMKLRG+FPDEVTM+TV++ LK+A E+D
Sbjct: 187 KKGVLPTNNTYSMLVDVYGKAGLVKEALLWIKHMKLRGMFPDEVTMNTVVRALKNAEEFD 246

Query: 538 KADRFYKDWSVGKIELDDLELDSMGD------QQSISLKQFLLSELFRTGGR---SHSFT 690
           +AD+FYKDW  G+IELDDL+LD+MGD       + IS K FL +ELF+TGGR   S   T
Sbjct: 247 RADKFYKDWCTGRIELDDLDLDTMGDSVVGSVSEPISFKHFLSTELFKTGGRVPTSKIMT 306

Query: 691 NFNETESS 714
           + N TE+S
Sbjct: 307 SMN-TENS 313



 Score = 57.8 bits (138), Expect = 4e-06
 Identities = 50/224 (22%), Positives = 95/224 (42%), Gaps = 28/224 (12%)
 Frame = +1

Query: 97   LPSVLRALESEKDLEKTLELY-----HGKLNPKEQTVILK---EQSKWDKVLRVFEWFKS 252
            LP +++   +E  L++   LY     +  ++ K    I+    E+  W +   VF     
Sbjct: 463  LPGIIKLYINEGRLDQAKLLYEKCQLNRGISSKTCAAIIDAYAEKGLWTEAEVVFSRKGD 522

Query: 253  QKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVK 432
                + +++ YNV+++A G+AK +D+    +  M K G  P   TY  L+ ++    LV 
Sbjct: 523  LGGQMKDIVEYNVMIKAYGKAKLYDKAFSLFRGMKKHGTWPDECTYNSLIQMFSGGDLVD 582

Query: 433  EALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDKADRFYKDW---------------- 564
             A   +  M+  G+ P  +T S +I      G+   A   Y+D                 
Sbjct: 583  RARDLLTEMQETGLKPQSLTFSALIACYARLGQLSDAVDVYQDMVKSGTKPNEFVYGSLI 642

Query: 565  ----SVGKIELDDLELDSMGDQQSISLKQFLLSELFRTGGRSHS 684
                  G++E + L+   + ++  IS  Q +L+ L +  G++ S
Sbjct: 643  NGFAETGRVE-EALKYFHLMEESGISANQIVLTSLIKAYGKAGS 685


>ref|XP_002517971.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223542953|gb|EEF44489.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 1029

 Score =  322 bits (826), Expect = 6e-86
 Identities = 158/232 (68%), Positives = 185/232 (79%), Gaps = 7/232 (3%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXX-ILPSVLRALESEKDLEKTLELYHGKLNP 177
           V LGFKL CHSK +  P               +LPS+LR+L S+ D+EKTL  +   LNP
Sbjct: 89  VSLGFKLHCHSKTLTLPTRNSSFNGKKKRYGGVLPSILRSLNSDNDIEKTLNSFGDNLNP 148

Query: 178 KEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMA 357
           KEQTVILKEQ  W++++RVFE+FKS+KDYVPNVIHYN+VLRALGRA+KWD+LR CWIEMA
Sbjct: 149 KEQTVILKEQRNWERMVRVFEFFKSRKDYVPNVIHYNIVLRALGRAQKWDDLRRCWIEMA 208

Query: 358 KKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYD 537
           K GVLPTNNTYGMLVDVYGKAGLV EALLWIKHMKLRG+FPDEVTM+TV+KVLKDAGE+D
Sbjct: 209 KSGVLPTNNTYGMLVDVYGKAGLVTEALLWIKHMKLRGLFPDEVTMNTVVKVLKDAGEFD 268

Query: 538 KADRFYKDWSVGKIELDDLELDSMGDQQ------SISLKQFLLSELFRTGGR 675
           +A  FYKDW +GKIELDDLEL+SMGD +       +S K FL +ELF+ GGR
Sbjct: 269 RAHSFYKDWCIGKIELDDLELNSMGDIEHGSGSGPVSFKHFLSTELFKIGGR 320


>gb|EXB62281.1| hypothetical protein L484_022169 [Morus notabilis]
          Length = 1018

 Score =  320 bits (821), Expect = 2e-85
 Identities = 166/246 (67%), Positives = 192/246 (78%), Gaps = 8/246 (3%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXX--ILPSVLRALESEKDLEKTLELYHGKLN 174
           VF GFK+Q HSK +A P                +LPS+LR+LES  D+EK L  +   L+
Sbjct: 88  VFAGFKVQSHSKTLAFPTKVSSLNGNKKKRYGGVLPSILRSLESNDDVEKILVEFGANLS 147

Query: 175 PKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEM 354
           PKEQTVILKEQ  W++V+RVFEWFKSQK+YVPNVIHYNVVLRALGRA+KWDELRL WIEM
Sbjct: 148 PKEQTVILKEQRNWERVVRVFEWFKSQKEYVPNVIHYNVVLRALGRAQKWDELRLQWIEM 207

Query: 355 AKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEY 534
           AK GV PTNNTYGMLVDVYGKAGLVKEA+LWIKHM++RGIFPDEVTMSTV++VLKD GEY
Sbjct: 208 AKTGVFPTNNTYGMLVDVYGKAGLVKEAVLWIKHMRVRGIFPDEVTMSTVVRVLKDGGEY 267

Query: 535 DKADRFYKDWSVGKIELDDLELDSMGD---QQSISLKQFLLSELFRTGGR---SHSFTNF 696
           D+ADRFYKDW +G+IELD   LDSM D    + +S K FL +ELFRTGGR   S S T+ 
Sbjct: 268 DRADRFYKDWCMGRIELD---LDSMVDGSGSEPVSFKHFLSTELFRTGGRIPGSRSLTSS 324

Query: 697 NETESS 714
            E+ESS
Sbjct: 325 LESESS 330


>ref|XP_006491629.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like [Citrus sinensis]
          Length = 1004

 Score =  318 bits (814), Expect = 1e-84
 Identities = 157/226 (69%), Positives = 183/226 (80%), Gaps = 3/226 (1%)
 Frame = +1

Query: 10  GFKLQCHSKIIASPXXXXXXXXXXXXXX-ILPSVLRALESEKDLEKTLELYHGKLNPKEQ 186
           GFKLQC+SK   SP               ILPS+LR+ ES  D++ TL  +   L+PKEQ
Sbjct: 80  GFKLQCNSKSTISPTKSSLVNSRRKKYGGILPSLLRSFESNDDIDNTLNSFCENLSPKEQ 139

Query: 187 TVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAKKG 366
           TV+LKEQ  W++V+RVFE+FKSQKDYVPNVIHYN+VLRALGRA+KWDELRL WIEMAK G
Sbjct: 140 TVVLKEQKSWERVIRVFEFFKSQKDYVPNVIHYNIVLRALGRAQKWDELRLRWIEMAKNG 199

Query: 367 VLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDKAD 546
           VLPTNNTYGMLVDVYGKAGL+KEALLWIKHMKLRGIFPDEVTM+TV++VLK+ GE+D AD
Sbjct: 200 VLPTNNTYGMLVDVYGKAGLIKEALLWIKHMKLRGIFPDEVTMNTVVRVLKEVGEFDSAD 259

Query: 547 RFYKDWSVGKIELDDLELDSMGDQQS--ISLKQFLLSELFRTGGRS 678
           RFYKDW +G++ELDDLELDS  D  S  +S K FL +ELFRTGGR+
Sbjct: 260 RFYKDWCLGRLELDDLELDSTDDLGSTPVSFKHFLSTELFRTGGRN 305


>ref|XP_006447317.1| hypothetical protein CICLE_v10017547mg [Citrus clementina]
           gi|557549928|gb|ESR60557.1| hypothetical protein
           CICLE_v10017547mg [Citrus clementina]
          Length = 962

 Score =  318 bits (814), Expect = 1e-84
 Identities = 157/226 (69%), Positives = 183/226 (80%), Gaps = 3/226 (1%)
 Frame = +1

Query: 10  GFKLQCHSKIIASPXXXXXXXXXXXXXX-ILPSVLRALESEKDLEKTLELYHGKLNPKEQ 186
           GFKLQC+SK   SP               ILPS+LR+ ES  D++ TL  +   L+PKEQ
Sbjct: 80  GFKLQCNSKSTISPTKSSLVNSRRKKYGGILPSLLRSFESNDDIDNTLNSFCENLSPKEQ 139

Query: 187 TVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAKKG 366
           TV+LKEQ  W++V+RVFE+FKSQKDYVPNVIHYN+VLRALGRA+KWDELRL WIEMAK G
Sbjct: 140 TVVLKEQKSWERVIRVFEFFKSQKDYVPNVIHYNIVLRALGRAQKWDELRLRWIEMAKNG 199

Query: 367 VLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDKAD 546
           VLPTNNTYGMLVDVYGKAGL+KEALLWIKHMKLRGIFPDEVTM+TV++VLK+ GE+D AD
Sbjct: 200 VLPTNNTYGMLVDVYGKAGLIKEALLWIKHMKLRGIFPDEVTMNTVVRVLKEVGEFDSAD 259

Query: 547 RFYKDWSVGKIELDDLELDSMGDQQS--ISLKQFLLSELFRTGGRS 678
           RFYKDW +G++ELDDLELDS  D  S  +S K FL +ELFRTGGR+
Sbjct: 260 RFYKDWCLGRLELDDLELDSTDDLGSTPVSFKHFLSTELFRTGGRN 305


>gb|EOX99345.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma
           cacao]
          Length = 1007

 Score =  315 bits (808), Expect = 7e-84
 Identities = 164/244 (67%), Positives = 190/244 (77%), Gaps = 9/244 (3%)
 Frame = +1

Query: 10  GFKLQCHSKIIASPXXXXXXXXXXXXXX-ILPSVLRALESEKDLEKTLELYHGKLNPKEQ 186
           GFKLQC SK + SP               ILPS+LRALE + D+EKTL      L+PKEQ
Sbjct: 79  GFKLQCLSKTLFSPTKSSSSNVKKKRYKGILPSILRALECDTDVEKTLSSVCENLSPKEQ 138

Query: 187 TVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAKKG 366
           TVILKEQS  ++V RVF +FKS KDYVPNVIHYN+VLRALGRA+KWDELRLCWIEMAK G
Sbjct: 139 TVILKEQSNCERVTRVFGFFKSLKDYVPNVIHYNIVLRALGRAQKWDELRLCWIEMAKNG 198

Query: 367 VLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDKAD 546
           VLPTNNTYGMLVDVYGKAGLVKEALLWIKHM+LRG++PDEVTM+TV+KVLKDA E+D+AD
Sbjct: 199 VLPTNNTYGMLVDVYGKAGLVKEALLWIKHMRLRGLYPDEVTMNTVVKVLKDAMEFDRAD 258

Query: 547 RFYKDWSVGKIELDDLELDSMGDQQS------ISLKQFLLSELFRTGGRSHSFTNFN--E 702
           RFYKDW +GK++L+DLELDSM D ++      +S K FL +ELFRTGGRS         +
Sbjct: 259 RFYKDWCIGKVDLNDLELDSMIDFENGSGSAPVSFKHFLSTELFRTGGRSPVLETLGSPD 318

Query: 703 TESS 714
           TESS
Sbjct: 319 TESS 322


>gb|EPS73099.1| hypothetical protein M569_01654 [Genlisea aurea]
          Length = 1119

 Score =  310 bits (795), Expect = 2e-82
 Identities = 155/229 (67%), Positives = 187/229 (81%), Gaps = 1/229 (0%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIASPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNPK 180
           VFLGFKL+CHS  +                 +LPS+L  L   +DLEK+L ++  KL+PK
Sbjct: 224 VFLGFKLRCHSNAVEF-HGKKKRKKKVYGGELLPSIL--LSDGEDLEKSLAIHFDKLSPK 280

Query: 181 EQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAK 360
           EQTVILKEQ  W+KVLR+FEWFK Q+ Y PNVIHYNVVLRALG+A++WDELRLCWI+MA+
Sbjct: 281 EQTVILKEQRGWEKVLRIFEWFKRQESYTPNVIHYNVVLRALGKARRWDELRLCWIDMAE 340

Query: 361 KGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYDK 540
            GVLPTNNTYGMLVDVYGK+GLVKEALLWIKHMKLRG+FPDEVTMSTV+KVLKDA E+D+
Sbjct: 341 NGVLPTNNTYGMLVDVYGKSGLVKEALLWIKHMKLRGVFPDEVTMSTVVKVLKDAREFDR 400

Query: 541 ADRFYKDWSVGKIELDDLELDSMGDQQSISLKQFLLSELFRTGGR-SHS 684
           A RFY+DW  G+I L+D +LD++ DQQ+ISLKQFL +ELFR+GG+ SHS
Sbjct: 401 AHRFYEDWCRGRIGLED-DLDALEDQQAISLKQFLSTELFRSGGKLSHS 448


>ref|XP_004141647.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like [Cucumis sativus]
          Length = 1020

 Score =  293 bits (750), Expect = 4e-77
 Identities = 144/233 (61%), Positives = 179/233 (76%), Gaps = 8/233 (3%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIA-SPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNP 177
           V LGFKLQCHS+ ++ +               ILPS+LR+L+S  D+   L      L+P
Sbjct: 69  VSLGFKLQCHSRTLSMASQRLSTNGKKKSYGGILPSILRSLKSASDIGNILSSSCQNLSP 128

Query: 178 KEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMA 357
           KEQTVILKEQS+W++V++VF+WFKSQKDYVPNVIHYN+VLR LG+A+KWDELRLCW EMA
Sbjct: 129 KEQTVILKEQSRWERVIQVFQWFKSQKDYVPNVIHYNIVLRTLGQAQKWDELRLCWNEMA 188

Query: 358 KKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYD 537
           + GV+PTNNTYGML+DVYGK GLVKEALLWIKHM +RGIFPDEVTM+TV++VLKDAGE+D
Sbjct: 189 ENGVVPTNNTYGMLIDVYGKVGLVKEALLWIKHMTVRGIFPDEVTMNTVVRVLKDAGEFD 248

Query: 538 KADRFYKDWSVGKIELDDLELDSMGDQ-------QSISLKQFLLSELFRTGGR 675
            AD+FYKDW  G +EL+D +L+S  +        + I+ K FLL+ELFR G R
Sbjct: 249 SADKFYKDWCRGLVELNDFDLNSRVEDFGVNSAVEPITPKHFLLTELFRIGTR 301



 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 8/145 (5%)
 Frame = +1

Query: 97  LPSVLRALESEKDLEKT---LELYH--GKLNPKEQTVILK---EQSKWDKVLRVFEWFKS 252
           LP V++   +E  L++    LE Y    +L+P+    I+    E+  W +   +F W + 
Sbjct: 464 LPRVIKMYINEGLLDRAKILLEKYRLDTELSPRISAAIIDAYAEKGLWFEAESIFLWKRD 523

Query: 253 QKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVK 432
                 +V+ YNV+++A G+A+ +++  L +  M  +G  P   TY  L+ ++    LV 
Sbjct: 524 LSGKKMDVMEYNVMIKAYGKAELYEKAFLLFKSMKNRGTWPDECTYNSLIQMFSGGDLVD 583

Query: 433 EALLWIKHMKLRGIFPDEVTMSTVI 507
           EA   +  M+  G  P   T S VI
Sbjct: 584 EARRLLTEMQRMGFKPTCQTFSAVI 608


>ref|XP_004169587.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At1g73710-like [Cucumis sativus]
          Length = 1026

 Score =  289 bits (740), Expect = 5e-76
 Identities = 142/233 (60%), Positives = 177/233 (75%), Gaps = 8/233 (3%)
 Frame = +1

Query: 1   VFLGFKLQCHSKIIA-SPXXXXXXXXXXXXXXILPSVLRALESEKDLEKTLELYHGKLNP 177
           V LGFKLQCHS+ ++ +               ILPS+LR+L+S  D+   L      L+P
Sbjct: 69  VSLGFKLQCHSRTLSMASQRLSTNGKKKSYGGILPSILRSLKSASDIGSILSSSCQNLSP 128

Query: 178 KEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMA 357
           KEQTVILKEQS+W++V++VF+WFKSQKDYVPNVIHYN+VLR LG+A+KWDELRLCW EMA
Sbjct: 129 KEQTVILKEQSRWERVIQVFQWFKSQKDYVPNVIHYNIVLRTLGQAQKWDELRLCWNEMA 188

Query: 358 KKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAGEYD 537
           + GV+PTNNTYGML+DVYGK GLVKEALLWIKHM +RGIFPDEVTM+TV++VLKDAGE+D
Sbjct: 189 ENGVVPTNNTYGMLIDVYGKVGLVKEALLWIKHMTVRGIFPDEVTMNTVVRVLKDAGEFD 248

Query: 538 KADRFYKDWSVGKIELDDLELDSMGDQ-------QSISLKQFLLSELFRTGGR 675
            AD+FYKDW  G +EL+D +L+S  +        + I+ K F  +ELFR G R
Sbjct: 249 SADKFYKDWCRGLVELNDFDLNSRVEDFGVNSAVEPITPKHFCXTELFRIGTR 301



 Score = 56.6 bits (135), Expect = 8e-06
 Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 8/145 (5%)
 Frame = +1

Query: 97  LPSVLRALESEKDLEKT---LELYH--GKLNPKEQTVILK---EQSKWDKVLRVFEWFKS 252
           LP V++   +E  L++    LE Y    +L+P+    I+    E+  W +   +F W + 
Sbjct: 464 LPRVIKMYINEGLLDRAKILLEKYRLDTELSPRISAAIIDAYAEKGLWFEAESIFLWKRD 523

Query: 253 QKDYVPNVIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVK 432
                 +V+ YNV+++A G+A+ +++  L +  M  +G  P   TY  L+ ++    LV 
Sbjct: 524 LAGKKXDVMEYNVMIKAYGKAELYEKAFLLFKSMKNRGTWPDECTYNSLIQMFSGGDLVD 583

Query: 433 EALLWIKHMKLRGIFPDEVTMSTVI 507
           EA   +  M+  G  P   T S VI
Sbjct: 584 EARRLLTEMQRMGFKPTCQTFSAVI 608


>ref|XP_002887500.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297333341|gb|EFH63759.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 989

 Score =  280 bits (715), Expect = 4e-73
 Identities = 134/202 (66%), Positives = 164/202 (81%), Gaps = 7/202 (3%)
 Frame = +1

Query: 94  ILPSVLRALESEKDLEKTLELYHGKLNPKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPN 273
           ++PS+LR+L+S  D+E TL      L+PKEQTV+LKEQ++WD+VLRVF +F+S + YVPN
Sbjct: 79  VIPSILRSLDSSTDIETTLASLCLNLSPKEQTVLLKEQTRWDRVLRVFRFFQSHQSYVPN 138

Query: 274 VIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 453
           VIHYN+VLRALGRA KWDELRLCWIEMA  GVLPTNNTYGMLVDVYGKAGLVKEALLWIK
Sbjct: 139 VIHYNIVLRALGRAGKWDELRLCWIEMAHNGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 198

Query: 454 HMKLRGIFPDEVTMSTVIKVLKDAGEYDKADRFYKDWSVGKIELDDLELDSMGD------ 615
           HM  R  FPDEVTM+TV++V K++GE+D+ADRF+K W  GK+ LDDL+LDS+ D      
Sbjct: 199 HMGQRMHFPDEVTMATVVRVFKNSGEFDRADRFFKGWCAGKVNLDDLDLDSIDDFPKNGS 258

Query: 616 -QQSISLKQFLLSELFRTGGRS 678
            Q  ++LKQFL  ELF+ G R+
Sbjct: 259 AQSPVNLKQFLSMELFKVGARN 280


>ref|XP_002319373.2| hypothetical protein POPTR_0013s14110g [Populus trichocarpa]
           gi|550325820|gb|EEE95296.2| hypothetical protein
           POPTR_0013s14110g [Populus trichocarpa]
          Length = 965

 Score =  279 bits (714), Expect = 6e-73
 Identities = 132/175 (75%), Positives = 155/175 (88%), Gaps = 6/175 (3%)
 Frame = +1

Query: 169 LNPKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNVIHYNVVLRALGRAKKWDELRLCWI 348
           L+PKEQTV+LKEQ  W++V+RVFE+FKSQKDYVPNVIHYN+VLR LGRAK+WDELRLCW+
Sbjct: 95  LSPKEQTVVLKEQRNWERVVRVFEFFKSQKDYVPNVIHYNIVLRVLGRAKRWDELRLCWM 154

Query: 349 EMAKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKHMKLRGIFPDEVTMSTVIKVLKDAG 528
           +MAK GVLPTNNTYGMLVDVY KAGLV EALLWIKHM+LRG+FPDEVTM+TV+KVLKD G
Sbjct: 155 DMAKNGVLPTNNTYGMLVDVYAKAGLV-EALLWIKHMRLRGLFPDEVTMNTVVKVLKDVG 213

Query: 529 EYDKADRFYKDWSVGKIELDDLELDSMGDQQS------ISLKQFLLSELFRTGGR 675
           E+DKA+RFYKDW  G++ELD LELDSM D ++      +S K FLL+ELF+TGGR
Sbjct: 214 EFDKAERFYKDWCAGRVELDGLELDSMLDSENGSRSEPVSFKHFLLTELFKTGGR 268


>ref|XP_006300678.1| hypothetical protein CARUB_v10019718mg [Capsella rubella]
           gi|565486079|ref|XP_006300679.1| hypothetical protein
           CARUB_v10019718mg [Capsella rubella]
           gi|482569388|gb|EOA33576.1| hypothetical protein
           CARUB_v10019718mg [Capsella rubella]
           gi|482569389|gb|EOA33577.1| hypothetical protein
           CARUB_v10019718mg [Capsella rubella]
          Length = 986

 Score =  278 bits (711), Expect = 1e-72
 Identities = 135/214 (63%), Positives = 168/214 (78%), Gaps = 7/214 (3%)
 Frame = +1

Query: 94  ILPSVLRALESEKDLEKTLELYHGKLNPKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPN 273
           ++PS+LR+L+S  D+E TL      L+PKEQTV+LKEQ++WD+VLRVF +F+S + YVPN
Sbjct: 82  VIPSILRSLDSSTDIETTLASLCLNLSPKEQTVLLKEQTRWDRVLRVFRFFQSHQGYVPN 141

Query: 274 VIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 453
           VIHYN+VLRALGRA KWDELRLCWIEMA  GVLPTNNTYGMLVDVYGKAGLVKEALLWIK
Sbjct: 142 VIHYNIVLRALGRAGKWDELRLCWIEMAHNGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 201

Query: 454 HMKLRGIFPDEVTMSTVIKVLKDAGEYDKADRFYKDWSVGKIELDDLELDSMGD------ 615
           HM  R  FPDEVTM+TV++V K++GE+D+ADRF+K W  GK+ LDDL+LDS+ D      
Sbjct: 202 HMGQRMHFPDEVTMATVVRVFKNSGEFDRADRFFKGWCAGKVNLDDLDLDSIDDFPKNSS 261

Query: 616 -QQSISLKQFLLSELFRTGGRSHSFTNFNETESS 714
            +  ++LKQFL  ELF+ G R+    +F+    S
Sbjct: 262 ARSPVNLKQFLSMELFKVGARNPIEKSFHFASGS 295


>ref|XP_006390515.1| hypothetical protein EUTSA_v10019624mg, partial [Eutrema
           salsugineum] gi|557086949|gb|ESQ27801.1| hypothetical
           protein EUTSA_v10019624mg, partial [Eutrema salsugineum]
          Length = 967

 Score =  277 bits (708), Expect = 3e-72
 Identities = 132/202 (65%), Positives = 164/202 (81%), Gaps = 7/202 (3%)
 Frame = +1

Query: 94  ILPSVLRALESEKDLEKTLELYHGKLNPKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPN 273
           +LPS+LR+L+S  D+E TL      L+PKEQTV+LKEQ++WD+VLRVF +F+S + YVPN
Sbjct: 76  VLPSILRSLDSSTDIETTLASLCLNLSPKEQTVLLKEQTRWDRVLRVFRFFQSHQGYVPN 135

Query: 274 VIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 453
           VIHYN+VLRALGRA KWDELRLCWIEMA  GVLPTNNTYGMLVDVYGKAGLVKEALLWIK
Sbjct: 136 VIHYNIVLRALGRAGKWDELRLCWIEMAHNGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 195

Query: 454 HMKLRGIFPDEVTMSTVIKVLKDAGEYDKADRFYKDWSVGKIELDDLELDSMGD------ 615
           HM+ R  FPDEVTM+TV++V K++G++D+ADRF+K W  G++ LDDL+LDS+ D      
Sbjct: 196 HMEQRMHFPDEVTMATVVRVFKNSGDFDRADRFFKGWCAGRVNLDDLDLDSIDDSPKNGS 255

Query: 616 -QQSISLKQFLLSELFRTGGRS 678
               ++LKQFL  ELF+ G R+
Sbjct: 256 ASSPVNLKQFLSMELFKVGARN 277


>ref|XP_006585437.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g73710-like [Glycine max]
          Length = 989

 Score =  272 bits (695), Expect = 9e-71
 Identities = 135/215 (62%), Positives = 161/215 (74%), Gaps = 9/215 (4%)
 Frame = +1

Query: 97  LPSVLRALESEKDLEKTLELYHGKLNPKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPNV 276
           LPS+LR L +  DLE  L      L+PKE TV+LKEQS W +  R+FEWFKSQ  Y PN 
Sbjct: 72  LPSLLRTLSTAADLETALSTLPSPLSPKEITVLLKEQSTWQRAARIFEWFKSQTWYTPNA 131

Query: 277 IHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIKH 456
           IHYNVVLRALG+A++WD+LRLCW++MAK GVLPTNNTY MLVDVYGKAGLV+EALLWI+H
Sbjct: 132 IHYNVVLRALGKAQQWDQLRLCWLDMAKNGVLPTNNTYSMLVDVYGKAGLVQEALLWIRH 191

Query: 457 MKLRGIFPDEVTMSTVIKVLKDAGEYDKADRFYKDWSVGKIELDDLEL-DSMGDQQS--- 624
           M++RG FPDEVTM TV+KVLKD G++D+A RFYK W  GK+EL+DLEL DS+G   S   
Sbjct: 192 MRVRGFFPDEVTMCTVVKVLKDVGDFDRAHRFYKGWCEGKVELNDLELEDSLGINNSSNG 251

Query: 625 -----ISLKQFLLSELFRTGGRSHSFTNFNETESS 714
                IS KQFL +ELF+ GGR+        T SS
Sbjct: 252 SASMGISFKQFLSTELFKIGGRAPVSGEARSTNSS 286


>ref|NP_177512.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75169780|sp|Q9C9U0.1|PP118_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g73710 gi|12324197|gb|AAG52063.1|AC012679_1
           hypothetical protein; 49134-52109 [Arabidopsis thaliana]
           gi|332197379|gb|AEE35500.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 991

 Score =  269 bits (687), Expect = 8e-70
 Identities = 131/202 (64%), Positives = 162/202 (80%), Gaps = 7/202 (3%)
 Frame = +1

Query: 94  ILPSVLRALESEKDLEKTLELYHGKLNPKEQTVILKEQSKWDKVLRVFEWFKSQKDYVPN 273
           ++PS+LR+L+S  D+E TL      L+PKEQTV+LKEQ++W++VLRVF +F+S + YVPN
Sbjct: 85  VIPSILRSLDSSTDIETTLASLCLNLSPKEQTVLLKEQTRWERVLRVFRFFQSHQSYVPN 144

Query: 274 VIHYNVVLRALGRAKKWDELRLCWIEMAKKGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 453
           VIHYN+VLRALGRA KWDELRLCWIEMA  GVLPTNNTYGMLVDVYGKAGLVKEALLWIK
Sbjct: 145 VIHYNIVLRALGRAGKWDELRLCWIEMAHNGVLPTNNTYGMLVDVYGKAGLVKEALLWIK 204

Query: 454 HMKLRGIFPDEVTMSTVIKVLKDAGEYDKADRFYKDWSVGKIELDDLELDSMGD------ 615
           HM  R  FPDEVTM+TV++V K++GE+D+ADRF+K W  GK+   DL+LDS+ D      
Sbjct: 205 HMGQRMHFPDEVTMATVVRVFKNSGEFDRADRFFKGWCAGKV---DLDLDSIDDFPKNGS 261

Query: 616 -QQSISLKQFLLSELFRTGGRS 678
            Q  ++LKQFL  ELF+ G R+
Sbjct: 262 AQSPVNLKQFLSMELFKVGARN 283

