BLASTX nr result

ID: Mentha25_contig00012589 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00012589
         (1221 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU39532.1| hypothetical protein MIMGU_mgv1a001059mg [Mimulus...   641   0.0  
ref|XP_004237508.1| PREDICTED: uncharacterized protein LOC101246...   627   e-177
ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241...   627   e-177
ref|XP_006340475.1| PREDICTED: uncharacterized protein LOC102579...   625   e-176
ref|XP_002522027.1| pentatricopeptide repeat-containing protein,...   624   e-176
ref|XP_007208365.1| hypothetical protein PRUPE_ppa001139mg [Prun...   622   e-175
ref|XP_007208364.1| hypothetical protein PRUPE_ppa001139mg [Prun...   622   e-175
ref|XP_007030297.1| Plastid transcriptionally active 3 isoform 2...   616   e-174
ref|XP_007030296.1| Plastid transcriptionally active 3 isoform 1...   616   e-174
ref|XP_006478983.1| PREDICTED: uncharacterized protein LOC102630...   612   e-172
ref|XP_006443293.1| hypothetical protein CICLE_v10023441mg [Citr...   612   e-172
ref|XP_002325363.1| SAP domain-containing family protein [Populu...   610   e-172
gb|EXB93125.1| Pentatricopeptide repeat-containing protein [Moru...   604   e-170
ref|XP_002884436.1| hypothetical protein ARALYDRAFT_477686 [Arab...   599   e-169
ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807...   596   e-168
ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802...   593   e-167
gb|AAF26788.1|AC016829_12 hypothetical protein [Arabidopsis thal...   593   e-167
ref|XP_003590907.1| Pentatricopeptide repeat-containing protein ...   592   e-167
ref|NP_187076.2| plastid transcriptionally active 3 [Arabidopsis...   592   e-167
ref|XP_006854340.1| hypothetical protein AMTR_s00039p00135490 [A...   591   e-166

>gb|EYU39532.1| hypothetical protein MIMGU_mgv1a001059mg [Mimulus guttatus]
          Length = 900

 Score =  641 bits (1653), Expect = 0.0
 Identities = 319/407 (78%), Positives = 347/407 (85%), Gaps = 1/407 (0%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAFSTFENMEYG EA+MKPDTE+YNWVIQA+TRAESYDRVQDVAEL+GMMVE
Sbjct: 264  QATCGIPEIAFSTFENMEYG-EAFMKPDTESYNWVIQAFTRAESYDRVQDVAELLGMMVE 322

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            DYKRLQPNVRTYALLVECFTKYCV +EAIRHFR LKNFEGGT LLH +GQ+GDPLSLYLR
Sbjct: 323  DYKRLQPNVRTYALLVECFTKYCVTKEAIRHFRGLKNFEGGTVLLHNDGQHGDPLSLYLR 382

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVEL+DALETM +D QQIP RAMILSRKYRTLVSSWIEPLQEEAELGHE+DY+
Sbjct: 383  ALCREGRIVELIDALETMERDNQQIPARAMILSRKYRTLVSSWIEPLQEEAELGHEVDYV 442

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            AR++ EGGLTGERKRWVPRRGKTPLDPDA+G+ Y++PME SFKQRCLEEW+IHHRKLLRT
Sbjct: 443  ARFIAEGGLTGERKRWVPRRGKTPLDPDADGFIYNSPMENSFKQRCLEEWRIHHRKLLRT 502

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L NEGP +LGN+SESDY RV ERL+KIIKGPEQ+ LKPKAASKM+VSELKEELEAQGLPT
Sbjct: 503  LWNEGPAILGNVSESDYNRVVERLKKIIKGPEQSALKPKAASKMVVSELKEELEAQGLPT 562

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFWR+
Sbjct: 563  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLEEGNTEFWRR 622

Query: 1083 RFLGEGLNENHSKP-XXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL ENH+KP                   DVGD+VAK+GEDDE
Sbjct: 623  RFLGEGLTENHNKPLEVEDYDVLDVTDDADVGDDVGDDVAKEGEDDE 669


>ref|XP_004237508.1| PREDICTED: uncharacterized protein LOC101246046 [Solanum
            lycopersicum]
          Length = 891

 Score =  627 bits (1616), Expect = e-177
 Identities = 310/406 (76%), Positives = 344/406 (84%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG + +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 254  QASCGIPEIAFATFENMEYGDD-HMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 312

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNVRTYALLVECFTKYCV+REAIRHFR LKNFEGGT++L+ +G+YGDPLSLYLR
Sbjct: 313  DHKRLQPNVRTYALLVECFTKYCVVREAIRHFRGLKNFEGGTQVLYNDGKYGDPLSLYLR 372

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+ALE MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 373  ALCREGRIVELLEALEAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 432

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARYV EGGLTG+RKRWVPRRGKTPLDPDA+G+ YSNP ETSFKQRC EEW++HHRKLL+T
Sbjct: 433  ARYVAEGGLTGDRKRWVPRRGKTPLDPDAQGFIYSNPRETSFKQRCFEEWRLHHRKLLKT 492

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L NEGP +LG +SE DYIR+EERLRK+IKGPEQ+ LKPKAASKM+VSELKEELEAQGLPT
Sbjct: 493  LLNEGPSILGKVSEYDYIRIEERLRKVIKGPEQSALKPKAASKMVVSELKEELEAQGLPT 552

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKLHEGNTEFW++
Sbjct: 553  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKR 612

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL+EN+ +                  +D  D++ KD EDDE
Sbjct: 613  RFLGEGLSENYGQ---QSEIIDLEPTDVVDDNDAVDDITKDAEDDE 655


>ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241547 [Vitis vinifera]
            gi|296085161|emb|CBI28656.3| unnamed protein product
            [Vitis vinifera]
          Length = 884

 Score =  627 bits (1616), Expect = e-177
 Identities = 316/406 (77%), Positives = 341/406 (83%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 249  QATCGIPEIAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 307

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNV+TYALLVEC TKYCV+REAIRHFRALKNFEGGTK+LH EG +GDPLSLYLR
Sbjct: 308  DHKRLQPNVKTYALLVECLTKYCVVREAIRHFRALKNFEGGTKVLHDEGNFGDPLSLYLR 367

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELLDALE MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 368  ALCREGRIVELLDALEAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 427

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLTG+RKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK++HRKLL+T
Sbjct: 428  ARYIAEGGLTGDRKRWVPRRGKTPLDPDALGFIYSNPMETSFKQRCLEDWKMYHRKLLKT 487

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            LRNEG   LG +SESDYIRVEERLRKIIKGP+QN LKPKAASKMIVSELKEELEAQGLPT
Sbjct: 488  LRNEGLAALGEVSESDYIRVEERLRKIIKGPDQNALKPKAASKMIVSELKEELEAQGLPT 547

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 548  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLQEGNTEFWKR 607

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGE L     KP                 +D+G++ AK+ EDDE
Sbjct: 608  RFLGEDLTVGRGKP---MDKENSELPDVLDDADIGEDTAKEVEDDE 650


>ref|XP_006340475.1| PREDICTED: uncharacterized protein LOC102579691 [Solanum tuberosum]
          Length = 890

 Score =  625 bits (1611), Expect = e-176
 Identities = 311/406 (76%), Positives = 344/406 (84%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG + +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 253  QATCGIPEIAFATFENMEYGDD-HMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 311

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNVRTYALLVECFTKYCV+REAIRHFR LKNFEGGT++L+ +G+YGD LSLYLR
Sbjct: 312  DHKRLQPNVRTYALLVECFTKYCVVREAIRHFRGLKNFEGGTQVLYNDGKYGDSLSLYLR 371

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+ALE MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 372  ALCREGRIVELLEALEAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 431

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARYV EGGLTG+RKRWVPRRGKTPLDPDA+G+ YSNP ETSFKQRC EEW++HHRKLL+T
Sbjct: 432  ARYVAEGGLTGDRKRWVPRRGKTPLDPDAQGFIYSNPRETSFKQRCFEEWRLHHRKLLKT 491

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L NEGP +LG ISE DYIR+EERLRK+IKGPEQ+ LKPKAASKMIVSELKEELEAQGLPT
Sbjct: 492  LLNEGPSILGKISEYDYIRIEERLRKVIKGPEQSALKPKAASKMIVSELKEELEAQGLPT 551

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKLHEGNTEFW++
Sbjct: 552  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKR 611

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL+EN+ +                  +D  D++AK+ EDDE
Sbjct: 612  RFLGEGLSENYGQ---QSEIIDLEPTDVVDDNDAVDDIAKEAEDDE 654


>ref|XP_002522027.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538831|gb|EEF40431.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 889

 Score =  624 bits (1608), Expect = e-176
 Identities = 308/406 (75%), Positives = 338/406 (83%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 250  QATCGIPEIAFATFENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 309

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNVRTYALLVECFTKYCV+REAIRHFRAL+NFEGGTK+LHY+G +GDPLSLYLR
Sbjct: 310  DHKRLQPNVRTYALLVECFTKYCVVREAIRHFRALQNFEGGTKVLHYDGNFGDPLSLYLR 369

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+ALE M +D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDY+
Sbjct: 370  ALCREGRIVELLEALEAMGRDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYV 429

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARYV EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRC+E+WK+HHRKLLRT
Sbjct: 430  ARYVAEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCIEDWKVHHRKLLRT 489

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L NEG   LG  SESDY+RV ERL+KIIKGP+QN LKPKAASKM+VSELKEELEAQGLP 
Sbjct: 490  LLNEGLAALGEASESDYLRVVERLKKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPI 549

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 550  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDEIISRIKLEEGNTEFWKR 609

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGLN ++ +P                  +  D+  +D E D+
Sbjct: 610  RFLGEGLNGSNLQPMSVAKSELPDVLDDVDAIEDADKEVEDEEADD 655


>ref|XP_007208365.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica]
            gi|462404007|gb|EMJ09564.1| hypothetical protein
            PRUPE_ppa001139mg [Prunus persica]
          Length = 897

 Score =  622 bits (1603), Expect = e-175
 Identities = 305/373 (81%), Positives = 327/373 (87%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAFSTFENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 252  QATCGIPEIAFSTFENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 311

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPN++T+ALLVECFTKYCV+REAIRHFRALK FEGGTK LH EG +GDPLSLYLR
Sbjct: 312  DHKRLQPNMKTHALLVECFTKYCVVREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLR 371

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRI+ELL+ALE MA+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDY+
Sbjct: 372  ALCREGRILELLEALEAMAEDNQTIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYM 431

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLTGERKRWVPRRGKTPLDPD EG+ YSNPME SFKQRCLE+WKIHHRKLLRT
Sbjct: 432  ARYIAEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMENSFKQRCLEDWKIHHRKLLRT 491

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            LRNEG   LG+ SESDYIRVE RLRKIIKGP+QN LKPKAASKM+VSELKEELEAQGLPT
Sbjct: 492  LRNEGVAALGDASESDYIRVEMRLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPT 551

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 552  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEIDELISRIKLEEGNTEFWKR 611

Query: 1083 RFLGEGLNENHSK 1121
            RFLGEG + +  K
Sbjct: 612  RFLGEGFSSDQEK 624


>ref|XP_007208364.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica]
            gi|462404006|gb|EMJ09563.1| hypothetical protein
            PRUPE_ppa001139mg [Prunus persica]
          Length = 780

 Score =  622 bits (1603), Expect = e-175
 Identities = 305/373 (81%), Positives = 327/373 (87%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAFSTFENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 252  QATCGIPEIAFSTFENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 311

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPN++T+ALLVECFTKYCV+REAIRHFRALK FEGGTK LH EG +GDPLSLYLR
Sbjct: 312  DHKRLQPNMKTHALLVECFTKYCVVREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLR 371

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRI+ELL+ALE MA+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDY+
Sbjct: 372  ALCREGRILELLEALEAMAEDNQTIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYM 431

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLTGERKRWVPRRGKTPLDPD EG+ YSNPME SFKQRCLE+WKIHHRKLLRT
Sbjct: 432  ARYIAEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMENSFKQRCLEDWKIHHRKLLRT 491

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            LRNEG   LG+ SESDYIRVE RLRKIIKGP+QN LKPKAASKM+VSELKEELEAQGLPT
Sbjct: 492  LRNEGVAALGDASESDYIRVEMRLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPT 551

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 552  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEIDELISRIKLEEGNTEFWKR 611

Query: 1083 RFLGEGLNENHSK 1121
            RFLGEG + +  K
Sbjct: 612  RFLGEGFSSDQEK 624


>ref|XP_007030297.1| Plastid transcriptionally active 3 isoform 2 [Theobroma cacao]
            gi|508718902|gb|EOY10799.1| Plastid transcriptionally
            active 3 isoform 2 [Theobroma cacao]
          Length = 782

 Score =  616 bits (1588), Expect = e-174
 Identities = 310/406 (76%), Positives = 338/406 (83%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 131  QATCGIPEIAFATFENMEYG-EEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 189

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALK FEGGT++L  EG + DPLSLYLR
Sbjct: 190  DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLR 249

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+AL+ MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 250  ALCREGRIVELLEALQAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 309

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK+HHRKLL+T
Sbjct: 310  ARYIEEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKT 369

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L+NEG   LG  SESDY+RV ERL+KIIKGP+QN LKPKAASKMIVSELKEELEAQGLP 
Sbjct: 370  LQNEGLAALGGASESDYVRVSERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPI 429

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 430  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 489

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGE LN +H KP                  DV ++ AKD EDDE
Sbjct: 490  RFLGEHLNVDHVKP--IDEGESEPADDELDDGDVVEDAAKDIEDDE 533


>ref|XP_007030296.1| Plastid transcriptionally active 3 isoform 1 [Theobroma cacao]
            gi|508718901|gb|EOY10798.1| Plastid transcriptionally
            active 3 isoform 1 [Theobroma cacao]
          Length = 905

 Score =  616 bits (1588), Expect = e-174
 Identities = 310/406 (76%), Positives = 338/406 (83%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 254  QATCGIPEIAFATFENMEYG-EEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 312

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALK FEGGT++L  EG + DPLSLYLR
Sbjct: 313  DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLR 372

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+AL+ MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 373  ALCREGRIVELLEALQAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 432

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK+HHRKLL+T
Sbjct: 433  ARYIEEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKT 492

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L+NEG   LG  SESDY+RV ERL+KIIKGP+QN LKPKAASKMIVSELKEELEAQGLP 
Sbjct: 493  LQNEGLAALGGASESDYVRVSERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPI 552

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 553  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 612

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGE LN +H KP                  DV ++ AKD EDDE
Sbjct: 613  RFLGEHLNVDHVKP--IDEGESEPADDELDDGDVVEDAAKDIEDDE 656


>ref|XP_006478983.1| PREDICTED: uncharacterized protein LOC102630853 isoform X2 [Citrus
            sinensis]
          Length = 764

 Score =  612 bits (1578), Expect = e-172
 Identities = 310/413 (75%), Positives = 341/413 (82%), Gaps = 7/413 (1%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPE+AF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMM E
Sbjct: 130  QATCGIPEVAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMFE 188

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNV+TYALLVECFTKYC + EAIRHFRAL+N+EGGTK+LH EG +GDPLSLYLR
Sbjct: 189  DHKRLQPNVKTYALLVECFTKYCAVTEAIRHFRALQNYEGGTKVLHNEGNFGDPLSLYLR 248

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRI+ELL+ALE MAKD Q +PPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 249  ALCREGRIIELLEALEAMAKDNQPVPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 308

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+ K +HRKLLRT
Sbjct: 309  ARYISEGGLTGERKRWVPRRGKTPLDPDAVGFIYSNPMETSFKQRCLEDGKKYHRKLLRT 368

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L+NEGP VLG++SESDY+RVEERL+K+IKGPEQ+ LKPKAASKM+VSELKEEL+AQGLPT
Sbjct: 369  LQNEGPAVLGDVSESDYVRVEERLKKLIKGPEQHVLKPKAASKMVVSELKEELDAQGLPT 428

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 429  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 488

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDE-------VAKDGEDDE 1220
            RFLGEGLN  H K                  SDV D+       VAKD E DE
Sbjct: 489  RFLGEGLNGRHDK---------AVEMDESELSDVLDDDVTDVEYVAKDEEADE 532


>ref|XP_006443293.1| hypothetical protein CICLE_v10023441mg [Citrus clementina]
            gi|568850568|ref|XP_006478982.1| PREDICTED:
            uncharacterized protein LOC102630853 isoform X1 [Citrus
            sinensis] gi|557545555|gb|ESR56533.1| hypothetical
            protein CICLE_v10023441mg [Citrus clementina]
          Length = 887

 Score =  612 bits (1578), Expect = e-172
 Identities = 310/413 (75%), Positives = 341/413 (82%), Gaps = 7/413 (1%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPE+AF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMM E
Sbjct: 253  QATCGIPEVAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMFE 311

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNV+TYALLVECFTKYC + EAIRHFRAL+N+EGGTK+LH EG +GDPLSLYLR
Sbjct: 312  DHKRLQPNVKTYALLVECFTKYCAVTEAIRHFRALQNYEGGTKVLHNEGNFGDPLSLYLR 371

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRI+ELL+ALE MAKD Q +PPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 372  ALCREGRIIELLEALEAMAKDNQPVPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 431

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+ K +HRKLLRT
Sbjct: 432  ARYISEGGLTGERKRWVPRRGKTPLDPDAVGFIYSNPMETSFKQRCLEDGKKYHRKLLRT 491

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L+NEGP VLG++SESDY+RVEERL+K+IKGPEQ+ LKPKAASKM+VSELKEEL+AQGLPT
Sbjct: 492  LQNEGPAVLGDVSESDYVRVEERLKKLIKGPEQHVLKPKAASKMVVSELKEELDAQGLPT 551

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 552  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 611

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDE-------VAKDGEDDE 1220
            RFLGEGLN  H K                  SDV D+       VAKD E DE
Sbjct: 612  RFLGEGLNGRHDK---------AVEMDESELSDVLDDDVTDVEYVAKDEEADE 655


>ref|XP_002325363.1| SAP domain-containing family protein [Populus trichocarpa]
            gi|222862238|gb|EEE99744.1| SAP domain-containing family
            protein [Populus trichocarpa]
          Length = 887

 Score =  610 bits (1572), Expect = e-172
 Identities = 304/408 (74%), Positives = 339/408 (83%), Gaps = 2/408 (0%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEI+F+TFENMEYG E YMKPDTE+YNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 249  QATCGIPEISFATFENMEYG-EDYMKPDTESYNWVIQAYTRAESYDRVQDVAELLGMMVE 307

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+TYALLVECF+KYCV+REAIRHFRAL+ FEGGTK LH EG++GDPLSLYLR
Sbjct: 308  DHKRIQPNVKTYALLVECFSKYCVVREAIRHFRALRKFEGGTKALHNEGKFGDPLSLYLR 367

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIV+LL+ALE MA+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDY+
Sbjct: 368  ALCREGRIVDLLEALEAMAEDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYV 427

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARYV EGGLTGERKRWVPRRGKTPLDPD +G+ YSNPMETS KQRCLE+WK HHRKLL+ 
Sbjct: 428  ARYVAEGGLTGERKRWVPRRGKTPLDPDCDGFIYSNPMETSLKQRCLEDWKAHHRKLLKM 487

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            LRNEG   LG+ SESDY+RVEERLRKII+GP++N LKPKAASKMIVSELK+ELEAQGLP 
Sbjct: 488  LRNEGLAALGDASESDYLRVEERLRKIIRGPDRNVLKPKAASKMIVSELKDELEAQGLPI 547

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRI+LHEG+TEFW++
Sbjct: 548  DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIQLHEGDTEFWKR 607

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGD--EVAKDGEDDE 1220
            RFLGEG N NH KP                  D  D  +VAK+ ED+E
Sbjct: 608  RFLGEGFNGNHVKPVDMETSELPDELDEDEDDDDDDVEDVAKEVEDEE 655


>gb|EXB93125.1| Pentatricopeptide repeat-containing protein [Morus notabilis]
          Length = 895

 Score =  604 bits (1558), Expect = e-170
 Identities = 305/406 (75%), Positives = 339/406 (83%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAFSTFENM+YG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+G+MVE
Sbjct: 246  QATCGIPEIAFSTFENMQYG-EEFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGIMVE 304

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPN++T+ALLVECFTKYCVI EAIRHFRAL+NFEGGT +LH EG +GDPLSLYLR
Sbjct: 305  DHKRLQPNMKTHALLVECFTKYCVIGEAIRHFRALRNFEGGTIVLHNEGNFGDPLSLYLR 364

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+ALE M KD Q IPPRAM+LS+KYRTLVSSWIEPLQ+EAELG+EIDYI
Sbjct: 365  ALCREGRIVELLEALEAMVKDNQPIPPRAMLLSKKYRTLVSSWIEPLQDEAELGYEIDYI 424

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK ++RKLLRT
Sbjct: 425  ARYIAEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKTYNRKLLRT 484

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            LRNEG  VLG+ SESDYIRVEERL KI++GPEQN LKPKAASKMIVSELKEELEAQGLPT
Sbjct: 485  LRNEGIAVLGDASESDYIRVEERLLKIVRGPEQNVLKPKAASKMIVSELKEELEAQGLPT 544

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW++
Sbjct: 545  DGTRNVLYQRVQKARRINRSRGRPLWIPPVEEEEEEVDEDLDELISRIKLQEGNTEFWKR 604

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGLN ++                    +D+ ++ AK+ EDDE
Sbjct: 605  RFLGEGLNGDNGN---STSMGRAEFADVDVDADIVEDSAKEVEDDE 647


>ref|XP_002884436.1| hypothetical protein ARALYDRAFT_477686 [Arabidopsis lyrata subsp.
            lyrata] gi|297330276|gb|EFH60695.1| hypothetical protein
            ARALYDRAFT_477686 [Arabidopsis lyrata subsp. lyrata]
          Length = 914

 Score =  599 bits (1544), Expect = e-169
 Identities = 298/406 (73%), Positives = 332/406 (81%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPE+A++TFENMEYG   +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 257  QATCGIPEVAYATFENMEYGEGLFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 316

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALKNFEGGT +LH  G++ DPLSLYLR
Sbjct: 317  DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKNFEGGTTILHNAGKFEDPLSLYLR 376

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVEL+DAL+ M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+
Sbjct: 377  ALCREGRIVELIDALDAMRKDSQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYV 436

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WKIHHRKLLRT
Sbjct: 437  ARYIEEGGLTGERKRWVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKIHHRKLLRT 496

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L++EG  VLG+ SESDY+RV ERLR IIKGP QN LKPKAASKM+VSELKEELEAQGLP 
Sbjct: 497  LQSEGLPVLGDASESDYMRVMERLRNIIKGPAQNLLKPKAASKMVVSELKEELEAQGLPI 556

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRIN+SRGR                     I RIKLHEG+TEFW++
Sbjct: 557  DGTRNVLYQRVQKARRINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKR 616

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL E  S                    D+  E   D +DDE
Sbjct: 617  RFLGEGLIET-SVESKETTESVVTGESEKAIEDISKEADNDEDDDE 661


>ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807191 isoform X1 [Glycine
            max]
          Length = 887

 Score =  596 bits (1536), Expect = e-168
 Identities = 301/407 (73%), Positives = 336/407 (82%), Gaps = 1/407 (0%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 244  QATCGIPEIAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 302

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPN +T+ALLVECFTKYCV+REAIRHFRALKNFEGG K+LH EG +GDPLSLYLR
Sbjct: 303  DHKRIQPNAKTHALLVECFTKYCVVREAIRHFRALKNFEGGIKVLHNEGNHGDPLSLYLR 362

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVE+L+ALE MAKD Q IP RAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 363  ALCREGRIVEMLEALEAMAKDNQPIPSRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 422

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            +RY++EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLEE K+H++KLL+T
Sbjct: 423  SRYIDEGGLTGERKRWVPRRGKTPLDPDAHGFIYSNPMETSFKQRCLEELKLHNKKLLKT 482

Query: 723  LRNEGPIVLGN-ISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLP 899
            L+NEG   LG+ +SESDYIRV+ERL+K+IKGPEQN LKPKAASKM+VSELKEEL+AQGLP
Sbjct: 483  LQNEGLAALGDGVSESDYIRVQERLKKLIKGPEQNVLKPKAASKMLVSELKEELDAQGLP 542

Query: 900  TDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWR 1079
             DG RNVLYQRVQKARRINRSRGR                     IS IKL EGNTEFW+
Sbjct: 543  IDGNRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDALISHIKLEEGNTEFWK 602

Query: 1080 QRFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            +RFLGEGLN +   P                  D  ++ AK+ EDDE
Sbjct: 603  RRFLGEGLNGDQEMPTDAAESEVPEVLDDV---DAIEDAAKEVEDDE 646


>ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802355 isoform X1 [Glycine
            max]
          Length = 887

 Score =  593 bits (1529), Expect = e-167
 Identities = 298/407 (73%), Positives = 337/407 (82%), Gaps = 1/407 (0%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 244  QATCGIPEIAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 302

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPN +T+ALLVECFTKYCV+REAIRHFRALKNFEGG ++LH EG +GDPLSLYLR
Sbjct: 303  DHKRIQPNAKTHALLVECFTKYCVVREAIRHFRALKNFEGGIEVLHNEGNHGDPLSLYLR 362

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVE+L+ALE MAKD Q IP RAMILSRKYRTLVSSWIEPLQEEAE+G+EIDYI
Sbjct: 363  ALCREGRIVEMLEALEAMAKDNQPIPSRAMILSRKYRTLVSSWIEPLQEEAEIGYEIDYI 422

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            +RY++EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRC+EE K+H++KLL+T
Sbjct: 423  SRYIDEGGLTGERKRWVPRRGKTPLDPDAHGFIYSNPMETSFKQRCMEELKLHNKKLLKT 482

Query: 723  LRNEGPIVLG-NISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLP 899
            L+NEG   LG ++SE DYIRV+ERL+K++KGPEQN LKPKAASKM+VSELKEEL+AQGLP
Sbjct: 483  LQNEGLAALGDDVSEFDYIRVQERLKKLMKGPEQNVLKPKAASKMLVSELKEELDAQGLP 542

Query: 900  TDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWR 1079
             DGTRNVLYQRVQKARRINRSRGR                     ISRIKL EGNTEFW+
Sbjct: 543  IDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDALISRIKLEEGNTEFWK 602

Query: 1080 QRFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            +RFLGEGLN +   P                  D  ++ AK+ EDDE
Sbjct: 603  RRFLGEGLNGDQEMPTDAVQSDVPEVLDDV---DAIEDAAKEVEDDE 646


>gb|AAF26788.1|AC016829_12 hypothetical protein [Arabidopsis thaliana]
          Length = 913

 Score =  593 bits (1529), Expect = e-167
 Identities = 295/406 (72%), Positives = 330/406 (81%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPE+A++TFENMEYG   +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 257  QATCGIPEVAYATFENMEYGEGLFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 316

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALKNFEGGT +LH  G + DPLSLYLR
Sbjct: 317  DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLR 376

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVEL+DAL+ M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+
Sbjct: 377  ALCREGRIVELIDALDAMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYL 436

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WK+HHRKLLRT
Sbjct: 437  ARYIEEGGLTGERKRWVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRT 496

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L++EG  VLG+ SESDY+RV ERLR IIKGP  N LKPKAASKM+VSELKEELEAQGLP 
Sbjct: 497  LQSEGLPVLGDASESDYMRVVERLRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPI 556

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRIN+SRGR                     I RIKLHEG+TEFW++
Sbjct: 557  DGTRNVLYQRVQKARRINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKR 616

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL E  S                    D+  E   + +DDE
Sbjct: 617  RFLGEGLIET-SVESKETTESVVTGESEKAIEDISKEADNEEDDDE 661


>ref|XP_003590907.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355479955|gb|AES61158.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 2047

 Score =  592 bits (1527), Expect = e-167
 Identities = 300/407 (73%), Positives = 337/407 (82%), Gaps = 1/407 (0%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRA+SYDRVQDVAEL+GMMVE
Sbjct: 239  QATCGIPEIAFTTFENMEYG-EDYMKPDTETYNWVIQAYTRADSYDRVQDVAELLGMMVE 297

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+T+ALLVECFTKYCV+REAIRHFRALKNFEGGTK+LH +G +GDPLSLYLR
Sbjct: 298  DHKRVQPNVKTHALLVECFTKYCVVREAIRHFRALKNFEGGTKILHMDGNHGDPLSLYLR 357

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRI+++L+ALE MA D QQIPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI
Sbjct: 358  ALCREGRIIDMLEALEAMANDNQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 417

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARYVEEGGLTGERKRWVPR GKTPLDPDA+G+ YSNPMETSFKQRCLEE K++H+KLL+ 
Sbjct: 418  ARYVEEGGLTGERKRWVPRSGKTPLDPDADGFIYSNPMETSFKQRCLEEKKVYHKKLLKK 477

Query: 723  LRNEGPIVLGN-ISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLP 899
            LR EG + LG+  SESDY+RV E L+KIIKGPEQN LKPKAASKM+V+ELKEELEAQGLP
Sbjct: 478  LRYEGIVALGDGASESDYVRVIEWLKKIIKGPEQNALKPKAASKMLVNELKEELEAQGLP 537

Query: 900  TDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWR 1079
             DGTRNVLYQRVQKARRIN+SRGR                     ISRIKL EGNTE+W+
Sbjct: 538  IDGTRNVLYQRVQKARRINQSRGRPLWVPPIEVEEEEVDEELEALISRIKLEEGNTEYWK 597

Query: 1080 QRFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            +RFLGEGLN ++                     DV  + AK+ EDDE
Sbjct: 598  RRFLGEGLNGDNGNAMDEGESESPDVQDYI---DVVGDDAKEAEDDE 641


>ref|NP_187076.2| plastid transcriptionally active 3 [Arabidopsis thaliana]
            gi|332640537|gb|AEE74058.1| plastid transcriptionally
            active 3 [Arabidopsis thaliana]
          Length = 910

 Score =  592 bits (1527), Expect = e-167
 Identities = 296/406 (72%), Positives = 331/406 (81%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPE+A++TFENMEYG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 257  QATCGIPEVAYATFENMEYG-EVFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 315

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALKNFEGGT +LH  G + DPLSLYLR
Sbjct: 316  DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLR 375

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVEL+DAL+ M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+
Sbjct: 376  ALCREGRIVELIDALDAMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYL 435

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WK+HHRKLLRT
Sbjct: 436  ARYIEEGGLTGERKRWVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRT 495

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L++EG  VLG+ SESDY+RV ERLR IIKGP  N LKPKAASKM+VSELKEELEAQGLP 
Sbjct: 496  LQSEGLPVLGDASESDYMRVVERLRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPI 555

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTRNVLYQRVQKARRIN+SRGR                     I RIKLHEG+TEFW++
Sbjct: 556  DGTRNVLYQRVQKARRINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKR 615

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL E  S                    D+  E   + +DDE
Sbjct: 616  RFLGEGLIET-SVESKETTESVVTGESEKAIEDISKEADNEEDDDE 660


>ref|XP_006854340.1| hypothetical protein AMTR_s00039p00135490 [Amborella trichopoda]
            gi|548858016|gb|ERN15807.1| hypothetical protein
            AMTR_s00039p00135490 [Amborella trichopoda]
          Length = 870

 Score =  591 bits (1524), Expect = e-166
 Identities = 300/406 (73%), Positives = 330/406 (81%)
 Frame = +3

Query: 3    KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182
            +A  GIPEIAF+TFENMEYGGE +MKPDTE+YNWVIQAYTRAESYDRVQDVAEL+GMMVE
Sbjct: 240  QATCGIPEIAFATFENMEYGGEDFMKPDTESYNWVIQAYTRAESYDRVQDVAELLGMMVE 299

Query: 183  DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362
            D+KRLQPNVRTYALLVECFTKYCV++EAIRHFRALKNFEGGT++L  EG +GDPLSLYLR
Sbjct: 300  DHKRLQPNVRTYALLVECFTKYCVLKEAIRHFRALKNFEGGTRVLCNEGNFGDPLSLYLR 359

Query: 363  ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542
            ALCREGRIVELL+ALE MAKD Q I PRAMILS+KYRTLVSSWIEPLQEEAELG E+DYI
Sbjct: 360  ALCREGRIVELLEALEAMAKDNQPITPRAMILSKKYRTLVSSWIEPLQEEAELGFEVDYI 419

Query: 543  ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722
            ARY+ EGGLT ERKRWVPRRGKTPLDPDA G+AYSNPMETS+KQRCLE  K+H+RKLL+ 
Sbjct: 420  ARYIAEGGLTAERKRWVPRRGKTPLDPDAIGFAYSNPMETSYKQRCLENLKVHNRKLLKK 479

Query: 723  LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902
            L+ EG   LG++SE+DY RV ERL+K+IKGP+Q  LKPKAASKMIVSELKEELEAQGLPT
Sbjct: 480  LKYEGRAALGDVSEADYARVVERLKKVIKGPDQTALKPKAASKMIVSELKEELEAQGLPT 539

Query: 903  DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082
            DGTR VLYQRVQKARRINRSRGR                     ISRI+L EGNTEFWR+
Sbjct: 540  DGTRQVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDEWISRIRLEEGNTEFWRR 599

Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220
            RFLGEGL    S P                  D  D+  KD EDDE
Sbjct: 600  RFLGEGLG---SVPDKKIELEDLDTSNTLDDIDNTDDNPKDMEDDE 642

