BLASTX nr result
ID: Mentha25_contig00012589
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha25_contig00012589 (1221 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU39532.1| hypothetical protein MIMGU_mgv1a001059mg [Mimulus... 641 0.0 ref|XP_004237508.1| PREDICTED: uncharacterized protein LOC101246... 627 e-177 ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241... 627 e-177 ref|XP_006340475.1| PREDICTED: uncharacterized protein LOC102579... 625 e-176 ref|XP_002522027.1| pentatricopeptide repeat-containing protein,... 624 e-176 ref|XP_007208365.1| hypothetical protein PRUPE_ppa001139mg [Prun... 622 e-175 ref|XP_007208364.1| hypothetical protein PRUPE_ppa001139mg [Prun... 622 e-175 ref|XP_007030297.1| Plastid transcriptionally active 3 isoform 2... 616 e-174 ref|XP_007030296.1| Plastid transcriptionally active 3 isoform 1... 616 e-174 ref|XP_006478983.1| PREDICTED: uncharacterized protein LOC102630... 612 e-172 ref|XP_006443293.1| hypothetical protein CICLE_v10023441mg [Citr... 612 e-172 ref|XP_002325363.1| SAP domain-containing family protein [Populu... 610 e-172 gb|EXB93125.1| Pentatricopeptide repeat-containing protein [Moru... 604 e-170 ref|XP_002884436.1| hypothetical protein ARALYDRAFT_477686 [Arab... 599 e-169 ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807... 596 e-168 ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802... 593 e-167 gb|AAF26788.1|AC016829_12 hypothetical protein [Arabidopsis thal... 593 e-167 ref|XP_003590907.1| Pentatricopeptide repeat-containing protein ... 592 e-167 ref|NP_187076.2| plastid transcriptionally active 3 [Arabidopsis... 592 e-167 ref|XP_006854340.1| hypothetical protein AMTR_s00039p00135490 [A... 591 e-166 >gb|EYU39532.1| hypothetical protein MIMGU_mgv1a001059mg [Mimulus guttatus] Length = 900 Score = 641 bits (1653), Expect = 0.0 Identities = 319/407 (78%), Positives = 347/407 (85%), Gaps = 1/407 (0%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAFSTFENMEYG EA+MKPDTE+YNWVIQA+TRAESYDRVQDVAEL+GMMVE Sbjct: 264 QATCGIPEIAFSTFENMEYG-EAFMKPDTESYNWVIQAFTRAESYDRVQDVAELLGMMVE 322 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 DYKRLQPNVRTYALLVECFTKYCV +EAIRHFR LKNFEGGT LLH +GQ+GDPLSLYLR Sbjct: 323 DYKRLQPNVRTYALLVECFTKYCVTKEAIRHFRGLKNFEGGTVLLHNDGQHGDPLSLYLR 382 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVEL+DALETM +D QQIP RAMILSRKYRTLVSSWIEPLQEEAELGHE+DY+ Sbjct: 383 ALCREGRIVELIDALETMERDNQQIPARAMILSRKYRTLVSSWIEPLQEEAELGHEVDYV 442 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 AR++ EGGLTGERKRWVPRRGKTPLDPDA+G+ Y++PME SFKQRCLEEW+IHHRKLLRT Sbjct: 443 ARFIAEGGLTGERKRWVPRRGKTPLDPDADGFIYNSPMENSFKQRCLEEWRIHHRKLLRT 502 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L NEGP +LGN+SESDY RV ERL+KIIKGPEQ+ LKPKAASKM+VSELKEELEAQGLPT Sbjct: 503 LWNEGPAILGNVSESDYNRVVERLKKIIKGPEQSALKPKAASKMVVSELKEELEAQGLPT 562 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFWR+ Sbjct: 563 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLEEGNTEFWRR 622 Query: 1083 RFLGEGLNENHSKP-XXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL ENH+KP DVGD+VAK+GEDDE Sbjct: 623 RFLGEGLTENHNKPLEVEDYDVLDVTDDADVGDDVGDDVAKEGEDDE 669 >ref|XP_004237508.1| PREDICTED: uncharacterized protein LOC101246046 [Solanum lycopersicum] Length = 891 Score = 627 bits (1616), Expect = e-177 Identities = 310/406 (76%), Positives = 344/406 (84%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG + +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 254 QASCGIPEIAFATFENMEYGDD-HMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 312 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNVRTYALLVECFTKYCV+REAIRHFR LKNFEGGT++L+ +G+YGDPLSLYLR Sbjct: 313 DHKRLQPNVRTYALLVECFTKYCVVREAIRHFRGLKNFEGGTQVLYNDGKYGDPLSLYLR 372 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+ALE MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 373 ALCREGRIVELLEALEAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 432 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARYV EGGLTG+RKRWVPRRGKTPLDPDA+G+ YSNP ETSFKQRC EEW++HHRKLL+T Sbjct: 433 ARYVAEGGLTGDRKRWVPRRGKTPLDPDAQGFIYSNPRETSFKQRCFEEWRLHHRKLLKT 492 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L NEGP +LG +SE DYIR+EERLRK+IKGPEQ+ LKPKAASKM+VSELKEELEAQGLPT Sbjct: 493 LLNEGPSILGKVSEYDYIRIEERLRKVIKGPEQSALKPKAASKMVVSELKEELEAQGLPT 552 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKLHEGNTEFW++ Sbjct: 553 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKR 612 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL+EN+ + +D D++ KD EDDE Sbjct: 613 RFLGEGLSENYGQ---QSEIIDLEPTDVVDDNDAVDDITKDAEDDE 655 >ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241547 [Vitis vinifera] gi|296085161|emb|CBI28656.3| unnamed protein product [Vitis vinifera] Length = 884 Score = 627 bits (1616), Expect = e-177 Identities = 316/406 (77%), Positives = 341/406 (83%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 249 QATCGIPEIAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 307 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNV+TYALLVEC TKYCV+REAIRHFRALKNFEGGTK+LH EG +GDPLSLYLR Sbjct: 308 DHKRLQPNVKTYALLVECLTKYCVVREAIRHFRALKNFEGGTKVLHDEGNFGDPLSLYLR 367 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELLDALE MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 368 ALCREGRIVELLDALEAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 427 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLTG+RKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK++HRKLL+T Sbjct: 428 ARYIAEGGLTGDRKRWVPRRGKTPLDPDALGFIYSNPMETSFKQRCLEDWKMYHRKLLKT 487 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 LRNEG LG +SESDYIRVEERLRKIIKGP+QN LKPKAASKMIVSELKEELEAQGLPT Sbjct: 488 LRNEGLAALGEVSESDYIRVEERLRKIIKGPDQNALKPKAASKMIVSELKEELEAQGLPT 547 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 548 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLQEGNTEFWKR 607 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGE L KP +D+G++ AK+ EDDE Sbjct: 608 RFLGEDLTVGRGKP---MDKENSELPDVLDDADIGEDTAKEVEDDE 650 >ref|XP_006340475.1| PREDICTED: uncharacterized protein LOC102579691 [Solanum tuberosum] Length = 890 Score = 625 bits (1611), Expect = e-176 Identities = 311/406 (76%), Positives = 344/406 (84%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG + +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 253 QATCGIPEIAFATFENMEYGDD-HMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 311 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNVRTYALLVECFTKYCV+REAIRHFR LKNFEGGT++L+ +G+YGD LSLYLR Sbjct: 312 DHKRLQPNVRTYALLVECFTKYCVVREAIRHFRGLKNFEGGTQVLYNDGKYGDSLSLYLR 371 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+ALE MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 372 ALCREGRIVELLEALEAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 431 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARYV EGGLTG+RKRWVPRRGKTPLDPDA+G+ YSNP ETSFKQRC EEW++HHRKLL+T Sbjct: 432 ARYVAEGGLTGDRKRWVPRRGKTPLDPDAQGFIYSNPRETSFKQRCFEEWRLHHRKLLKT 491 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L NEGP +LG ISE DYIR+EERLRK+IKGPEQ+ LKPKAASKMIVSELKEELEAQGLPT Sbjct: 492 LLNEGPSILGKISEYDYIRIEERLRKVIKGPEQSALKPKAASKMIVSELKEELEAQGLPT 551 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKLHEGNTEFW++ Sbjct: 552 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKR 611 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL+EN+ + +D D++AK+ EDDE Sbjct: 612 RFLGEGLSENYGQ---QSEIIDLEPTDVVDDNDAVDDIAKEAEDDE 654 >ref|XP_002522027.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538831|gb|EEF40431.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 889 Score = 624 bits (1608), Expect = e-176 Identities = 308/406 (75%), Positives = 338/406 (83%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 250 QATCGIPEIAFATFENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 309 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNVRTYALLVECFTKYCV+REAIRHFRAL+NFEGGTK+LHY+G +GDPLSLYLR Sbjct: 310 DHKRLQPNVRTYALLVECFTKYCVVREAIRHFRALQNFEGGTKVLHYDGNFGDPLSLYLR 369 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+ALE M +D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDY+ Sbjct: 370 ALCREGRIVELLEALEAMGRDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYV 429 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARYV EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRC+E+WK+HHRKLLRT Sbjct: 430 ARYVAEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCIEDWKVHHRKLLRT 489 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L NEG LG SESDY+RV ERL+KIIKGP+QN LKPKAASKM+VSELKEELEAQGLP Sbjct: 490 LLNEGLAALGEASESDYLRVVERLKKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPI 549 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 550 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDEIISRIKLEEGNTEFWKR 609 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGLN ++ +P + D+ +D E D+ Sbjct: 610 RFLGEGLNGSNLQPMSVAKSELPDVLDDVDAIEDADKEVEDEEADD 655 >ref|XP_007208365.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica] gi|462404007|gb|EMJ09564.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica] Length = 897 Score = 622 bits (1603), Expect = e-175 Identities = 305/373 (81%), Positives = 327/373 (87%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAFSTFENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 252 QATCGIPEIAFSTFENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 311 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPN++T+ALLVECFTKYCV+REAIRHFRALK FEGGTK LH EG +GDPLSLYLR Sbjct: 312 DHKRLQPNMKTHALLVECFTKYCVVREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLR 371 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRI+ELL+ALE MA+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDY+ Sbjct: 372 ALCREGRILELLEALEAMAEDNQTIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYM 431 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLTGERKRWVPRRGKTPLDPD EG+ YSNPME SFKQRCLE+WKIHHRKLLRT Sbjct: 432 ARYIAEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMENSFKQRCLEDWKIHHRKLLRT 491 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 LRNEG LG+ SESDYIRVE RLRKIIKGP+QN LKPKAASKM+VSELKEELEAQGLPT Sbjct: 492 LRNEGVAALGDASESDYIRVEMRLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPT 551 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 552 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEIDELISRIKLEEGNTEFWKR 611 Query: 1083 RFLGEGLNENHSK 1121 RFLGEG + + K Sbjct: 612 RFLGEGFSSDQEK 624 >ref|XP_007208364.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica] gi|462404006|gb|EMJ09563.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica] Length = 780 Score = 622 bits (1603), Expect = e-175 Identities = 305/373 (81%), Positives = 327/373 (87%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAFSTFENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 252 QATCGIPEIAFSTFENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 311 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPN++T+ALLVECFTKYCV+REAIRHFRALK FEGGTK LH EG +GDPLSLYLR Sbjct: 312 DHKRLQPNMKTHALLVECFTKYCVVREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLR 371 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRI+ELL+ALE MA+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDY+ Sbjct: 372 ALCREGRILELLEALEAMAEDNQTIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYM 431 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLTGERKRWVPRRGKTPLDPD EG+ YSNPME SFKQRCLE+WKIHHRKLLRT Sbjct: 432 ARYIAEGGLTGERKRWVPRRGKTPLDPDVEGFIYSNPMENSFKQRCLEDWKIHHRKLLRT 491 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 LRNEG LG+ SESDYIRVE RLRKIIKGP+QN LKPKAASKM+VSELKEELEAQGLPT Sbjct: 492 LRNEGVAALGDASESDYIRVEMRLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPT 551 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 552 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEIDELISRIKLEEGNTEFWKR 611 Query: 1083 RFLGEGLNENHSK 1121 RFLGEG + + K Sbjct: 612 RFLGEGFSSDQEK 624 >ref|XP_007030297.1| Plastid transcriptionally active 3 isoform 2 [Theobroma cacao] gi|508718902|gb|EOY10799.1| Plastid transcriptionally active 3 isoform 2 [Theobroma cacao] Length = 782 Score = 616 bits (1588), Expect = e-174 Identities = 310/406 (76%), Positives = 338/406 (83%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 131 QATCGIPEIAFATFENMEYG-EEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 189 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALK FEGGT++L EG + DPLSLYLR Sbjct: 190 DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLR 249 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+AL+ MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 250 ALCREGRIVELLEALQAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 309 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK+HHRKLL+T Sbjct: 310 ARYIEEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKT 369 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L+NEG LG SESDY+RV ERL+KIIKGP+QN LKPKAASKMIVSELKEELEAQGLP Sbjct: 370 LQNEGLAALGGASESDYVRVSERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPI 429 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 430 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 489 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGE LN +H KP DV ++ AKD EDDE Sbjct: 490 RFLGEHLNVDHVKP--IDEGESEPADDELDDGDVVEDAAKDIEDDE 533 >ref|XP_007030296.1| Plastid transcriptionally active 3 isoform 1 [Theobroma cacao] gi|508718901|gb|EOY10798.1| Plastid transcriptionally active 3 isoform 1 [Theobroma cacao] Length = 905 Score = 616 bits (1588), Expect = e-174 Identities = 310/406 (76%), Positives = 338/406 (83%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 254 QATCGIPEIAFATFENMEYG-EEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 312 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALK FEGGT++L EG + DPLSLYLR Sbjct: 313 DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLR 372 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+AL+ MAKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 373 ALCREGRIVELLEALQAMAKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 432 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK+HHRKLL+T Sbjct: 433 ARYIEEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKT 492 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L+NEG LG SESDY+RV ERL+KIIKGP+QN LKPKAASKMIVSELKEELEAQGLP Sbjct: 493 LQNEGLAALGGASESDYVRVSERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPI 552 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 553 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 612 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGE LN +H KP DV ++ AKD EDDE Sbjct: 613 RFLGEHLNVDHVKP--IDEGESEPADDELDDGDVVEDAAKDIEDDE 656 >ref|XP_006478983.1| PREDICTED: uncharacterized protein LOC102630853 isoform X2 [Citrus sinensis] Length = 764 Score = 612 bits (1578), Expect = e-172 Identities = 310/413 (75%), Positives = 341/413 (82%), Gaps = 7/413 (1%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPE+AF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMM E Sbjct: 130 QATCGIPEVAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMFE 188 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNV+TYALLVECFTKYC + EAIRHFRAL+N+EGGTK+LH EG +GDPLSLYLR Sbjct: 189 DHKRLQPNVKTYALLVECFTKYCAVTEAIRHFRALQNYEGGTKVLHNEGNFGDPLSLYLR 248 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRI+ELL+ALE MAKD Q +PPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 249 ALCREGRIIELLEALEAMAKDNQPVPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 308 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+ K +HRKLLRT Sbjct: 309 ARYISEGGLTGERKRWVPRRGKTPLDPDAVGFIYSNPMETSFKQRCLEDGKKYHRKLLRT 368 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L+NEGP VLG++SESDY+RVEERL+K+IKGPEQ+ LKPKAASKM+VSELKEEL+AQGLPT Sbjct: 369 LQNEGPAVLGDVSESDYVRVEERLKKLIKGPEQHVLKPKAASKMVVSELKEELDAQGLPT 428 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 429 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 488 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDE-------VAKDGEDDE 1220 RFLGEGLN H K SDV D+ VAKD E DE Sbjct: 489 RFLGEGLNGRHDK---------AVEMDESELSDVLDDDVTDVEYVAKDEEADE 532 >ref|XP_006443293.1| hypothetical protein CICLE_v10023441mg [Citrus clementina] gi|568850568|ref|XP_006478982.1| PREDICTED: uncharacterized protein LOC102630853 isoform X1 [Citrus sinensis] gi|557545555|gb|ESR56533.1| hypothetical protein CICLE_v10023441mg [Citrus clementina] Length = 887 Score = 612 bits (1578), Expect = e-172 Identities = 310/413 (75%), Positives = 341/413 (82%), Gaps = 7/413 (1%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPE+AF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMM E Sbjct: 253 QATCGIPEVAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMFE 311 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNV+TYALLVECFTKYC + EAIRHFRAL+N+EGGTK+LH EG +GDPLSLYLR Sbjct: 312 DHKRLQPNVKTYALLVECFTKYCAVTEAIRHFRALQNYEGGTKVLHNEGNFGDPLSLYLR 371 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRI+ELL+ALE MAKD Q +PPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 372 ALCREGRIIELLEALEAMAKDNQPVPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 431 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+ K +HRKLLRT Sbjct: 432 ARYISEGGLTGERKRWVPRRGKTPLDPDAVGFIYSNPMETSFKQRCLEDGKKYHRKLLRT 491 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L+NEGP VLG++SESDY+RVEERL+K+IKGPEQ+ LKPKAASKM+VSELKEEL+AQGLPT Sbjct: 492 LQNEGPAVLGDVSESDYVRVEERLKKLIKGPEQHVLKPKAASKMVVSELKEELDAQGLPT 551 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 552 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKR 611 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDE-------VAKDGEDDE 1220 RFLGEGLN H K SDV D+ VAKD E DE Sbjct: 612 RFLGEGLNGRHDK---------AVEMDESELSDVLDDDVTDVEYVAKDEEADE 655 >ref|XP_002325363.1| SAP domain-containing family protein [Populus trichocarpa] gi|222862238|gb|EEE99744.1| SAP domain-containing family protein [Populus trichocarpa] Length = 887 Score = 610 bits (1572), Expect = e-172 Identities = 304/408 (74%), Positives = 339/408 (83%), Gaps = 2/408 (0%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEI+F+TFENMEYG E YMKPDTE+YNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 249 QATCGIPEISFATFENMEYG-EDYMKPDTESYNWVIQAYTRAESYDRVQDVAELLGMMVE 307 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+TYALLVECF+KYCV+REAIRHFRAL+ FEGGTK LH EG++GDPLSLYLR Sbjct: 308 DHKRIQPNVKTYALLVECFSKYCVVREAIRHFRALRKFEGGTKALHNEGKFGDPLSLYLR 367 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIV+LL+ALE MA+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDY+ Sbjct: 368 ALCREGRIVDLLEALEAMAEDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYV 427 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARYV EGGLTGERKRWVPRRGKTPLDPD +G+ YSNPMETS KQRCLE+WK HHRKLL+ Sbjct: 428 ARYVAEGGLTGERKRWVPRRGKTPLDPDCDGFIYSNPMETSLKQRCLEDWKAHHRKLLKM 487 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 LRNEG LG+ SESDY+RVEERLRKII+GP++N LKPKAASKMIVSELK+ELEAQGLP Sbjct: 488 LRNEGLAALGDASESDYLRVEERLRKIIRGPDRNVLKPKAASKMIVSELKDELEAQGLPI 547 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRI+LHEG+TEFW++ Sbjct: 548 DGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEEVDELISRIQLHEGDTEFWKR 607 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGD--EVAKDGEDDE 1220 RFLGEG N NH KP D D +VAK+ ED+E Sbjct: 608 RFLGEGFNGNHVKPVDMETSELPDELDEDEDDDDDDVEDVAKEVEDEE 655 >gb|EXB93125.1| Pentatricopeptide repeat-containing protein [Morus notabilis] Length = 895 Score = 604 bits (1558), Expect = e-170 Identities = 305/406 (75%), Positives = 339/406 (83%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAFSTFENM+YG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+G+MVE Sbjct: 246 QATCGIPEIAFSTFENMQYG-EEFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGIMVE 304 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPN++T+ALLVECFTKYCVI EAIRHFRAL+NFEGGT +LH EG +GDPLSLYLR Sbjct: 305 DHKRLQPNMKTHALLVECFTKYCVIGEAIRHFRALRNFEGGTIVLHNEGNFGDPLSLYLR 364 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+ALE M KD Q IPPRAM+LS+KYRTLVSSWIEPLQ+EAELG+EIDYI Sbjct: 365 ALCREGRIVELLEALEAMVKDNQPIPPRAMLLSKKYRTLVSSWIEPLQDEAELGYEIDYI 424 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK ++RKLLRT Sbjct: 425 ARYIAEGGLTGERKRWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKTYNRKLLRT 484 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 LRNEG VLG+ SESDYIRVEERL KI++GPEQN LKPKAASKMIVSELKEELEAQGLPT Sbjct: 485 LRNEGIAVLGDASESDYIRVEERLLKIVRGPEQNVLKPKAASKMIVSELKEELEAQGLPT 544 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW++ Sbjct: 545 DGTRNVLYQRVQKARRINRSRGRPLWIPPVEEEEEEVDEDLDELISRIKLQEGNTEFWKR 604 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGLN ++ +D+ ++ AK+ EDDE Sbjct: 605 RFLGEGLNGDNGN---STSMGRAEFADVDVDADIVEDSAKEVEDDE 647 >ref|XP_002884436.1| hypothetical protein ARALYDRAFT_477686 [Arabidopsis lyrata subsp. lyrata] gi|297330276|gb|EFH60695.1| hypothetical protein ARALYDRAFT_477686 [Arabidopsis lyrata subsp. lyrata] Length = 914 Score = 599 bits (1544), Expect = e-169 Identities = 298/406 (73%), Positives = 332/406 (81%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPE+A++TFENMEYG +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 257 QATCGIPEVAYATFENMEYGEGLFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 316 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALKNFEGGT +LH G++ DPLSLYLR Sbjct: 317 DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKNFEGGTTILHNAGKFEDPLSLYLR 376 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVEL+DAL+ M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ Sbjct: 377 ALCREGRIVELIDALDAMRKDSQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYV 436 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WKIHHRKLLRT Sbjct: 437 ARYIEEGGLTGERKRWVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKIHHRKLLRT 496 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L++EG VLG+ SESDY+RV ERLR IIKGP QN LKPKAASKM+VSELKEELEAQGLP Sbjct: 497 LQSEGLPVLGDASESDYMRVMERLRNIIKGPAQNLLKPKAASKMVVSELKEELEAQGLPI 556 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRIN+SRGR I RIKLHEG+TEFW++ Sbjct: 557 DGTRNVLYQRVQKARRINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKR 616 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL E S D+ E D +DDE Sbjct: 617 RFLGEGLIET-SVESKETTESVVTGESEKAIEDISKEADNDEDDDE 661 >ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807191 isoform X1 [Glycine max] Length = 887 Score = 596 bits (1536), Expect = e-168 Identities = 301/407 (73%), Positives = 336/407 (82%), Gaps = 1/407 (0%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 244 QATCGIPEIAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 302 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPN +T+ALLVECFTKYCV+REAIRHFRALKNFEGG K+LH EG +GDPLSLYLR Sbjct: 303 DHKRIQPNAKTHALLVECFTKYCVVREAIRHFRALKNFEGGIKVLHNEGNHGDPLSLYLR 362 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVE+L+ALE MAKD Q IP RAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 363 ALCREGRIVEMLEALEAMAKDNQPIPSRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 422 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 +RY++EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRCLEE K+H++KLL+T Sbjct: 423 SRYIDEGGLTGERKRWVPRRGKTPLDPDAHGFIYSNPMETSFKQRCLEELKLHNKKLLKT 482 Query: 723 LRNEGPIVLGN-ISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLP 899 L+NEG LG+ +SESDYIRV+ERL+K+IKGPEQN LKPKAASKM+VSELKEEL+AQGLP Sbjct: 483 LQNEGLAALGDGVSESDYIRVQERLKKLIKGPEQNVLKPKAASKMLVSELKEELDAQGLP 542 Query: 900 TDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWR 1079 DG RNVLYQRVQKARRINRSRGR IS IKL EGNTEFW+ Sbjct: 543 IDGNRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDALISHIKLEEGNTEFWK 602 Query: 1080 QRFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 +RFLGEGLN + P D ++ AK+ EDDE Sbjct: 603 RRFLGEGLNGDQEMPTDAAESEVPEVLDDV---DAIEDAAKEVEDDE 646 >ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802355 isoform X1 [Glycine max] Length = 887 Score = 593 bits (1529), Expect = e-167 Identities = 298/407 (73%), Positives = 337/407 (82%), Gaps = 1/407 (0%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 244 QATCGIPEIAFATFENMEYG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 302 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPN +T+ALLVECFTKYCV+REAIRHFRALKNFEGG ++LH EG +GDPLSLYLR Sbjct: 303 DHKRIQPNAKTHALLVECFTKYCVVREAIRHFRALKNFEGGIEVLHNEGNHGDPLSLYLR 362 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVE+L+ALE MAKD Q IP RAMILSRKYRTLVSSWIEPLQEEAE+G+EIDYI Sbjct: 363 ALCREGRIVEMLEALEAMAKDNQPIPSRAMILSRKYRTLVSSWIEPLQEEAEIGYEIDYI 422 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 +RY++EGGLTGERKRWVPRRGKTPLDPDA G+ YSNPMETSFKQRC+EE K+H++KLL+T Sbjct: 423 SRYIDEGGLTGERKRWVPRRGKTPLDPDAHGFIYSNPMETSFKQRCMEELKLHNKKLLKT 482 Query: 723 LRNEGPIVLG-NISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLP 899 L+NEG LG ++SE DYIRV+ERL+K++KGPEQN LKPKAASKM+VSELKEEL+AQGLP Sbjct: 483 LQNEGLAALGDDVSEFDYIRVQERLKKLMKGPEQNVLKPKAASKMLVSELKEELDAQGLP 542 Query: 900 TDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWR 1079 DGTRNVLYQRVQKARRINRSRGR ISRIKL EGNTEFW+ Sbjct: 543 IDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDALISRIKLEEGNTEFWK 602 Query: 1080 QRFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 +RFLGEGLN + P D ++ AK+ EDDE Sbjct: 603 RRFLGEGLNGDQEMPTDAVQSDVPEVLDDV---DAIEDAAKEVEDDE 646 >gb|AAF26788.1|AC016829_12 hypothetical protein [Arabidopsis thaliana] Length = 913 Score = 593 bits (1529), Expect = e-167 Identities = 295/406 (72%), Positives = 330/406 (81%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPE+A++TFENMEYG +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 257 QATCGIPEVAYATFENMEYGEGLFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 316 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALKNFEGGT +LH G + DPLSLYLR Sbjct: 317 DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLR 376 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVEL+DAL+ M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ Sbjct: 377 ALCREGRIVELIDALDAMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYL 436 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WK+HHRKLLRT Sbjct: 437 ARYIEEGGLTGERKRWVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRT 496 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L++EG VLG+ SESDY+RV ERLR IIKGP N LKPKAASKM+VSELKEELEAQGLP Sbjct: 497 LQSEGLPVLGDASESDYMRVVERLRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPI 556 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRIN+SRGR I RIKLHEG+TEFW++ Sbjct: 557 DGTRNVLYQRVQKARRINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKR 616 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL E S D+ E + +DDE Sbjct: 617 RFLGEGLIET-SVESKETTESVVTGESEKAIEDISKEADNEEDDDE 661 >ref|XP_003590907.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355479955|gb|AES61158.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 2047 Score = 592 bits (1527), Expect = e-167 Identities = 300/407 (73%), Positives = 337/407 (82%), Gaps = 1/407 (0%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYG E YMKPDTETYNWVIQAYTRA+SYDRVQDVAEL+GMMVE Sbjct: 239 QATCGIPEIAFTTFENMEYG-EDYMKPDTETYNWVIQAYTRADSYDRVQDVAELLGMMVE 297 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+T+ALLVECFTKYCV+REAIRHFRALKNFEGGTK+LH +G +GDPLSLYLR Sbjct: 298 DHKRVQPNVKTHALLVECFTKYCVVREAIRHFRALKNFEGGTKILHMDGNHGDPLSLYLR 357 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRI+++L+ALE MA D QQIPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI Sbjct: 358 ALCREGRIIDMLEALEAMANDNQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYI 417 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARYVEEGGLTGERKRWVPR GKTPLDPDA+G+ YSNPMETSFKQRCLEE K++H+KLL+ Sbjct: 418 ARYVEEGGLTGERKRWVPRSGKTPLDPDADGFIYSNPMETSFKQRCLEEKKVYHKKLLKK 477 Query: 723 LRNEGPIVLGN-ISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLP 899 LR EG + LG+ SESDY+RV E L+KIIKGPEQN LKPKAASKM+V+ELKEELEAQGLP Sbjct: 478 LRYEGIVALGDGASESDYVRVIEWLKKIIKGPEQNALKPKAASKMLVNELKEELEAQGLP 537 Query: 900 TDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWR 1079 DGTRNVLYQRVQKARRIN+SRGR ISRIKL EGNTE+W+ Sbjct: 538 IDGTRNVLYQRVQKARRINQSRGRPLWVPPIEVEEEEVDEELEALISRIKLEEGNTEYWK 597 Query: 1080 QRFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 +RFLGEGLN ++ DV + AK+ EDDE Sbjct: 598 RRFLGEGLNGDNGNAMDEGESESPDVQDYI---DVVGDDAKEAEDDE 641 >ref|NP_187076.2| plastid transcriptionally active 3 [Arabidopsis thaliana] gi|332640537|gb|AEE74058.1| plastid transcriptionally active 3 [Arabidopsis thaliana] Length = 910 Score = 592 bits (1527), Expect = e-167 Identities = 296/406 (72%), Positives = 331/406 (81%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPE+A++TFENMEYG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 257 QATCGIPEVAYATFENMEYG-EVFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVE 315 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KR+QPNV+TYALLVECFTKYCV++EAIRHFRALKNFEGGT +LH G + DPLSLYLR Sbjct: 316 DHKRVQPNVKTYALLVECFTKYCVVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLR 375 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVEL+DAL+ M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ Sbjct: 376 ALCREGRIVELIDALDAMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYL 435 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+EEGGLTGERKRWVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WK+HHRKLLRT Sbjct: 436 ARYIEEGGLTGERKRWVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRT 495 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L++EG VLG+ SESDY+RV ERLR IIKGP N LKPKAASKM+VSELKEELEAQGLP Sbjct: 496 LQSEGLPVLGDASESDYMRVVERLRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPI 555 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTRNVLYQRVQKARRIN+SRGR I RIKLHEG+TEFW++ Sbjct: 556 DGTRNVLYQRVQKARRINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKR 615 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL E S D+ E + +DDE Sbjct: 616 RFLGEGLIET-SVESKETTESVVTGESEKAIEDISKEADNEEDDDE 660 >ref|XP_006854340.1| hypothetical protein AMTR_s00039p00135490 [Amborella trichopoda] gi|548858016|gb|ERN15807.1| hypothetical protein AMTR_s00039p00135490 [Amborella trichopoda] Length = 870 Score = 591 bits (1524), Expect = e-166 Identities = 300/406 (73%), Positives = 330/406 (81%) Frame = +3 Query: 3 KAISGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVE 182 +A GIPEIAF+TFENMEYGGE +MKPDTE+YNWVIQAYTRAESYDRVQDVAEL+GMMVE Sbjct: 240 QATCGIPEIAFATFENMEYGGEDFMKPDTESYNWVIQAYTRAESYDRVQDVAELLGMMVE 299 Query: 183 DYKRLQPNVRTYALLVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLR 362 D+KRLQPNVRTYALLVECFTKYCV++EAIRHFRALKNFEGGT++L EG +GDPLSLYLR Sbjct: 300 DHKRLQPNVRTYALLVECFTKYCVLKEAIRHFRALKNFEGGTRVLCNEGNFGDPLSLYLR 359 Query: 363 ALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYI 542 ALCREGRIVELL+ALE MAKD Q I PRAMILS+KYRTLVSSWIEPLQEEAELG E+DYI Sbjct: 360 ALCREGRIVELLEALEAMAKDNQPITPRAMILSKKYRTLVSSWIEPLQEEAELGFEVDYI 419 Query: 543 ARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRT 722 ARY+ EGGLT ERKRWVPRRGKTPLDPDA G+AYSNPMETS+KQRCLE K+H+RKLL+ Sbjct: 420 ARYIAEGGLTAERKRWVPRRGKTPLDPDAIGFAYSNPMETSYKQRCLENLKVHNRKLLKK 479 Query: 723 LRNEGPIVLGNISESDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPT 902 L+ EG LG++SE+DY RV ERL+K+IKGP+Q LKPKAASKMIVSELKEELEAQGLPT Sbjct: 480 LKYEGRAALGDVSEADYARVVERLKKVIKGPDQTALKPKAASKMIVSELKEELEAQGLPT 539 Query: 903 DGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXXXXXXXXXISRIKLHEGNTEFWRQ 1082 DGTR VLYQRVQKARRINRSRGR ISRI+L EGNTEFWR+ Sbjct: 540 DGTRQVLYQRVQKARRINRSRGRPLWVPPVEEEEEEVDEELDEWISRIRLEEGNTEFWRR 599 Query: 1083 RFLGEGLNENHSKPXXXXXXXXXXXXXXXXXSDVGDEVAKDGEDDE 1220 RFLGEGL S P D D+ KD EDDE Sbjct: 600 RFLGEGLG---SVPDKKIELEDLDTSNTLDDIDNTDDNPKDMEDDE 642