BLASTX nr result
ID: Mentha29_contig00009474
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha29_contig00009474 (2918 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU39532.1| hypothetical protein MIMGU_mgv1a001059mg [Mimulus... 1335 0.0 ref|XP_004237508.1| PREDICTED: uncharacterized protein LOC101246... 1234 0.0 ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241... 1233 0.0 ref|XP_006340475.1| PREDICTED: uncharacterized protein LOC102579... 1233 0.0 ref|XP_007030296.1| Plastid transcriptionally active 3 isoform 1... 1226 0.0 ref|XP_007208365.1| hypothetical protein PRUPE_ppa001139mg [Prun... 1211 0.0 ref|XP_002522027.1| pentatricopeptide repeat-containing protein,... 1194 0.0 gb|EXB93125.1| Pentatricopeptide repeat-containing protein [Moru... 1181 0.0 ref|XP_006443293.1| hypothetical protein CICLE_v10023441mg [Citr... 1179 0.0 ref|XP_002325363.1| SAP domain-containing family protein [Populu... 1170 0.0 ref|XP_006408205.1| hypothetical protein EUTSA_v10020015mg [Eutr... 1160 0.0 ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807... 1159 0.0 ref|XP_006296937.1| hypothetical protein CARUB_v10012929mg [Caps... 1158 0.0 ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802... 1156 0.0 ref|XP_002884436.1| hypothetical protein ARALYDRAFT_477686 [Arab... 1156 0.0 ref|NP_187076.2| plastid transcriptionally active 3 [Arabidopsis... 1154 0.0 gb|AAF26788.1|AC016829_12 hypothetical protein [Arabidopsis thal... 1150 0.0 ref|XP_003590907.1| Pentatricopeptide repeat-containing protein ... 1128 0.0 ref|XP_007143992.1| hypothetical protein PHAVU_007G119900g [Phas... 1127 0.0 ref|XP_006854340.1| hypothetical protein AMTR_s00039p00135490 [A... 1127 0.0 >gb|EYU39532.1| hypothetical protein MIMGU_mgv1a001059mg [Mimulus guttatus] Length = 900 Score = 1335 bits (3456), Expect = 0.0 Identities = 695/906 (76%), Positives = 745/906 (82%), Gaps = 9/906 (0%) Frame = -3 Query: 2877 MAFSLLTQTPFSLSLIXXXXXXXXXXXXXXXXXXXXXSQRKPRRKKQQS---DLKSSDDN 2707 MA SLLT PFSLS + QRK RRKKQQ+ DLKSSDD+ Sbjct: 1 MASSLLTYAPFSLSFVSVSAPRNSVAHAAAQFASISSPQRKSRRKKQQNQPNDLKSSDDS 60 Query: 2706 GYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSDVIYDMIAAGLTPGPRSFHG 2527 GY GG G+FTPSSAEKLLRLVFMEELMERARSGS GVSDVIYDMIAAGLTPGPRSFHG Sbjct: 61 GYSGGSDGNFTPSSAEKLLRLVFMEELMERARSGSVLGVSDVIYDMIAAGLTPGPRSFHG 120 Query: 2526 LVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFGSKGRATRGLEILAAMEKLN 2347 LVVSHVLNRD EG+MHALRRQLSEG+RPLHETFLALVRLFGSKG ATRGLEILAAMEKLN Sbjct: 121 LVVSHVLNRDAEGAMHALRRQLSEGIRPLHETFLALVRLFGSKGHATRGLEILAAMEKLN 180 Query: 2346 YDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRLYDLLIEEDCKVGDHSNALT 2167 YDIRQAWLLL+EELV+S HLEDANRVFLKGAEGGLRAT+ LYDLLIEEDCKVGDHSNALT Sbjct: 181 YDIRQAWLLLIEELVKSSHLEDANRVFLKGAEGGLRATDELYDLLIEEDCKVGDHSNALT 240 Query: 2166 IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGGEAYMKPDTETYNWVIQ 1987 IAYEMEAAGRMATT HFNCLLS QATCGIPEIAFSTFENMEYG EA+MKPDTE+YNWVIQ Sbjct: 241 IAYEMEAAGRMATTSHFNCLLSCQATCGIPEIAFSTFENMEYG-EAFMKPDTESYNWVIQ 299 Query: 1986 AYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVECFTKYCVIREAIRHFRALKN 1819 A+TRAESYDRVQDVAEL+GMMVEDYKRLQPNV LLVECFTKYCV +EAIRHFR LKN Sbjct: 300 AFTRAESYDRVQDVAELLGMMVEDYKRLQPNVRTYALLVECFTKYCVTKEAIRHFRGLKN 359 Query: 1818 FEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETMAKDKQQIPPRAMILSRKYR 1639 FEGGT LLH +GQ+GDPLSLYLRALCREGRIVEL+DALETM +D QQIP RAMILSRKYR Sbjct: 360 FEGGTVLLHNDGQHGDPLSLYLRALCREGRIVELIDALETMERDNQQIPARAMILSRKYR 419 Query: 1638 TLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVPRRGKTPLDPDAEGYAYSNP 1459 TLVSSWIEPLQEEAELGHE+DY+AR++ EGGLTGERKRWVPRRGKTPLDPDA+G+ Y++P Sbjct: 420 TLVSSWIEPLQEEAELGHEVDYVARFIAEGGLTGERKRWVPRRGKTPLDPDADGFIYNSP 479 Query: 1458 METSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYIRVEERLRKIIKGPEQNTLK 1279 ME SFKQRCLEEW+IHHRKLLRTL NEGP +LGN+SESDY RV ERL+KIIKGPEQ+ LK Sbjct: 480 MENSFKQRCLEEWRIHHRKLLRTLWNEGPAILGNVSESDYNRVVERLKKIIKGPEQSALK 539 Query: 1278 PKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSRGRXXXXXXXXXXXXXX 1099 PKAASKM+VSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSRGR Sbjct: 540 PKAASKMVVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSRGRPLWVPPVEEEEEEV 599 Query: 1098 XXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEEXXXXXXXXXXXXXXXXXXX 919 LISRIKL EG+TEFWR+RFLGEGL ENH+KPLE E Sbjct: 600 DEELDELISRIKLEEGNTEFWRRRFLGEGLTENHNKPLEVE------DYDVLDVTDDADV 653 Query: 918 XXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAEAAKPLQMIGVQLLKDSDQX 739 EQTEI+V DR KDKE + AKPLQMIGVQLLKDSDQ Sbjct: 654 GDDVGDDVAKEGEDDEVDEEDEEVEQTEIRVGDRAKDKEIDTAKPLQMIGVQLLKDSDQ- 712 Query: 738 XXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKVFDVQDMYTLADAWGWTWDK 559 SME DWFPED+ EAFKE+R RKVFDV+DMYT+ADAWGWTW+K Sbjct: 713 TTRSSRKKRRSSRVSMEDDDDDDWFPEDLQEAFKELRERKVFDVEDMYTIADAWGWTWEK 772 Query: 558 EFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMVLRAAIRAPMPSAFLQILQM 379 +FKNKAPRRWSQEWEVELA+K+M VIELGGTPTIGDCA+VLRAAIRAPMPSAFLQILQ Sbjct: 773 DFKNKAPRRWSQEWEVELAIKLMTKVIELGGTPTIGDCAVVLRAAIRAPMPSAFLQILQT 832 Query: 378 THRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGIEVADETLDRVISARQ--DT 205 THRLGYVFGSPLYDE+ISLCLDLGELDASIAIVADLETSGI+V DETLDRVISARQ D Sbjct: 833 THRLGYVFGSPLYDEIISLCLDLGELDASIAIVADLETSGIKVPDETLDRVISARQANDI 892 Query: 204 TASDAS 187 A++AS Sbjct: 893 PANNAS 898 >ref|XP_004237508.1| PREDICTED: uncharacterized protein LOC101246046 [Solanum lycopersicum] Length = 891 Score = 1234 bits (3193), Expect = 0.0 Identities = 639/857 (74%), Positives = 702/857 (81%), Gaps = 7/857 (0%) Frame = -3 Query: 2760 RKPRRKKQQS-DLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 RK RRKKQQ LK+ DD+G S EK+LRLVFMEELMERAR+ +AGVS Sbjct: 40 RKTRRKKQQQHQLKTVDDDGSMA--------SGTEKVLRLVFMEELMERARNADSAGVSQ 91 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 VIYDMIAAGL+PGPRSFHGLVV+HVL+RD +G+MHALRR+LSEGLRPLHETFLALVRLFG Sbjct: 92 VIYDMIAAGLSPGPRSFHGLVVAHVLHRDNDGAMHALRRELSEGLRPLHETFLALVRLFG 151 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 + G ATRGLEILAAMEKLNYDIRQAWL+LVEELVRS HLEDAN+VFLKGAEGGLRAT+ + Sbjct: 152 ANGLATRGLEILAAMEKLNYDIRQAWLVLVEELVRSNHLEDANKVFLKGAEGGLRATDEI 211 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATT HFNCLLSVQA+CGIPEIAF+TFENME Sbjct: 212 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTSHFNCLLSVQASCGIPEIAFATFENME 271 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 YG + +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KRLQPNV LLVEC Sbjct: 272 YGDD-HMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVRTYALLVEC 330 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYCV+REAIRHFR LKNFEGGT++L+ +G+YGDPLSLYLRALCREGRIVELL+ALE M Sbjct: 331 FTKYCVVREAIRHFRGLKNFEGGTQVLYNDGKYGDPLSLYLRALCREGRIVELLEALEAM 390 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 AKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARYV EGGLTG+RKRWVP Sbjct: 391 AKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYVAEGGLTGDRKRWVP 450 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYI 1336 RRGKTPLDPDA+G+ YSNP ETSFKQRC EEW++HHRKLL+TL NEGP +LG +SE DYI Sbjct: 451 RRGKTPLDPDAQGFIYSNPRETSFKQRCFEEWRLHHRKLLKTLLNEGPSILGKVSEYDYI 510 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 R+EERLRK+IKGPEQ+ LKPKAASKM+VSELKEELEAQGLPTDGTRNVLYQRVQKARRIN Sbjct: 511 RIEERLRKVIKGPEQSALKPKAASKMVVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 570 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEE 976 RSRGR LISRIKLHEG+TEFW++RFLGEGL+EN+ + E Sbjct: 571 RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLSENYGQQSE-- 628 Query: 975 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQ--VSDRVKDKE 802 EQTE Q +SDR KDKE Sbjct: 629 ---IIDLEPTDVVDDNDAVDDITKDAEDDEAEDDEAQDEEEEVEQTESQPEISDR-KDKE 684 Query: 801 AEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNR 622 EAAKPLQMIGVQLLKDSD + DWFP DI+EAF E+R R Sbjct: 685 VEAAKPLQMIGVQLLKDSDLTASSSKKSRRRLSRVAAVDDDDDDWFPLDIHEAFVELRKR 744 Query: 621 KVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCA 442 KVFDV DMYT+ DAWGWTW+KE KNKAPRRWSQEWEVELA+K+M VIELGGTPTIGDCA Sbjct: 745 KVFDVSDMYTITDAWGWTWEKEIKNKAPRRWSQEWEVELAIKVMTKVIELGGTPTIGDCA 804 Query: 441 MVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETS 262 M+LR+A+RAPMPSAFL+ILQ TH LGYVFGSPLYDE+I LCLDLGELDA+IAIVADLETS Sbjct: 805 MILRSAVRAPMPSAFLKILQTTHSLGYVFGSPLYDEIIILCLDLGELDAAIAIVADLETS 864 Query: 261 GIEVADETLDRVISARQ 211 GI+V DETLDRVISARQ Sbjct: 865 GIKVPDETLDRVISARQ 881 >ref|XP_002268094.2| PREDICTED: uncharacterized protein LOC100241547 [Vitis vinifera] gi|296085161|emb|CBI28656.3| unnamed protein product [Vitis vinifera] Length = 884 Score = 1233 bits (3191), Expect = 0.0 Identities = 638/863 (73%), Positives = 697/863 (80%), Gaps = 6/863 (0%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 +++PRRKK+ K G EK LRL FMEELMERARS AGVS+ Sbjct: 34 EKRPRRKKKTKQPKEDSFVAVTAVSAG-------EKALRLTFMEELMERARSADTAGVSE 86 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 V YDM+AAGL+PGPRSFHGL+VS VLN D+EG+M +LRR+LS GLRPLHETF+AL+RLFG Sbjct: 87 VFYDMVAAGLSPGPRSFHGLIVSTVLNGDDEGAMQSLRRELSAGLRPLHETFVALIRLFG 146 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 SKG ATRGLEILAAMEKLN+DIR+AWL+LVEELVR HLEDAN+VFLKGA+GGLRATN L Sbjct: 147 SKGYATRGLEILAAMEKLNFDIRKAWLVLVEELVRHNHLEDANKVFLKGAKGGLRATNEL 206 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATT+HFNCLLSVQATCGIPEIAF+TFENME Sbjct: 207 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTYHFNCLLSVQATCGIPEIAFATFENME 266 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 YG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KRLQPNV LLVEC Sbjct: 267 YG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVKTYALLVEC 325 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 TKYCV+REAIRHFRALKNFEGGTK+LH EG +GDPLSLYLRALCREGRIVELLDALE M Sbjct: 326 LTKYCVVREAIRHFRALKNFEGGTKVLHDEGNFGDPLSLYLRALCREGRIVELLDALEAM 385 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 AKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARY+ EGGLTG+RKRWVP Sbjct: 386 AKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIAEGGLTGDRKRWVP 445 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYI 1336 RRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK++HRKLL+TLRNEG LG +SESDYI Sbjct: 446 RRGKTPLDPDALGFIYSNPMETSFKQRCLEDWKMYHRKLLKTLRNEGLAALGEVSESDYI 505 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 RVEERLRKIIKGP+QN LKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN Sbjct: 506 RVEERLRKIIKGPDQNALKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 565 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEE 976 RSRGR LISRIKL EG+TEFW++RFLGE L KP+++E Sbjct: 566 RSRGRPLWVPPVEEEEEEVDEELDELISRIKLQEGNTEFWKRRFLGEDLTVGRGKPMDKE 625 Query: 975 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAE 796 E TE QV+DRVKDKE E Sbjct: 626 ----------NSELPDVLDDADIGEDTAKEVEDDEADEEEEEVEPTESQVADRVKDKEVE 675 Query: 795 AAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKV 616 AAKPLQMIGVQLLKDSDQ SME DWFP DI+EAFKEMR RK+ Sbjct: 676 AAKPLQMIGVQLLKDSDQTTPATRKSRRKLSRASMEDSDDDDWFPLDIHEAFKEMRERKI 735 Query: 615 FDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMV 436 FDV DMYT+AD WGWTW+KE KNK PR W+QEWEVELA+K+M VIELGGTPTIGDCAM+ Sbjct: 736 FDVSDMYTIADVWGWTWEKELKNKPPRSWTQEWEVELAIKVMLKVIELGGTPTIGDCAMI 795 Query: 435 LRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGI 256 LRAAIRAP+PSAFL++LQ TH+LGYVFGSPLY+EVI LCLDLGELDA+IAIVAD+ETSGI Sbjct: 796 LRAAIRAPLPSAFLKVLQTTHKLGYVFGSPLYNEVIILCLDLGELDAAIAIVADMETSGI 855 Query: 255 EVADETLDRVISARQ--DTTASD 193 V DETLDRVISARQ DT A+D Sbjct: 856 AVPDETLDRVISARQMIDTAATD 878 >ref|XP_006340475.1| PREDICTED: uncharacterized protein LOC102579691 [Solanum tuberosum] Length = 890 Score = 1233 bits (3189), Expect = 0.0 Identities = 641/864 (74%), Positives = 703/864 (81%), Gaps = 8/864 (0%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 +R R+K+QQ LK+ DD+G S EK+LRLVFMEELMERAR+ +AGVS Sbjct: 39 RRTRRKKQQQHQLKTVDDDGSMA--------SGTEKVLRLVFMEELMERARNADSAGVSQ 90 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 VIYDMIAAGL+PGPRSFHGLVVSHVL+RD +G+MHALRR+LSEGLRPLHETFLALVRLFG Sbjct: 91 VIYDMIAAGLSPGPRSFHGLVVSHVLHRDNDGAMHALRRELSEGLRPLHETFLALVRLFG 150 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 + G ATRGLEILAAMEKLNYDIRQAWL+LVEELVRS HLEDAN+VFLKGAEGGLRAT+ + Sbjct: 151 ANGLATRGLEILAAMEKLNYDIRQAWLVLVEELVRSNHLEDANKVFLKGAEGGLRATDEI 210 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATT HFNCLLSVQATCGIPEIAF+TFENME Sbjct: 211 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTSHFNCLLSVQATCGIPEIAFATFENME 270 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 YG + +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KRLQPNV LLVEC Sbjct: 271 YGDD-HMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVRTYALLVEC 329 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYCV+REAIRHFR LKNFEGGT++L+ +G+YGD LSLYLRALCREGRIVELL+ALE M Sbjct: 330 FTKYCVVREAIRHFRGLKNFEGGTQVLYNDGKYGDSLSLYLRALCREGRIVELLEALEAM 389 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 AKD Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARYV EGGLTG+RKRWVP Sbjct: 390 AKDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYVAEGGLTGDRKRWVP 449 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYI 1336 RRGKTPLDPDA+G+ YSNP ETSFKQRC EEW++HHRKLL+TL NEGP +LG ISE DYI Sbjct: 450 RRGKTPLDPDAQGFIYSNPRETSFKQRCFEEWRLHHRKLLKTLLNEGPSILGKISEYDYI 509 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 R+EERLRK+IKGPEQ+ LKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN Sbjct: 510 RIEERLRKVIKGPEQSALKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 569 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEE 976 RSRGR LISRIKLHEG+TEFW++RFLGEGL+EN+ + E Sbjct: 570 RSRGRPLWVPPVEEEEEEVDEELDELISRIKLHEGNTEFWKRRFLGEGLSENYGQQSE-- 627 Query: 975 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQ--VSDRVKDKE 802 EQTE Q + DR KDKE Sbjct: 628 ---IIDLEPTDVVDDNDAVDDIAKEAEDDEAEDDEAQDEEEEVEQTESQPEIGDR-KDKE 683 Query: 801 AEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNR 622 EAAKPLQMIGVQLLKDSD + DWFP DI+EAF E+R R Sbjct: 684 VEAAKPLQMIGVQLLKDSDLTASSSKKSRRRLSRVAAVDDDDDDWFPLDIHEAFVELRKR 743 Query: 621 KVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCA 442 KVFDV DMYT+ DAWGWTW+KE KNKAPRRWSQEWEVEL +K+M VIELGGTPTIGDCA Sbjct: 744 KVFDVSDMYTITDAWGWTWEKEIKNKAPRRWSQEWEVELGIKVMTKVIELGGTPTIGDCA 803 Query: 441 MVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETS 262 M+LRAA+RAPMPSAFL+ILQ TH LGYVFGSPLYDE+I LCLDLGELDA+IAIVADLETS Sbjct: 804 MILRAAVRAPMPSAFLRILQTTHSLGYVFGSPLYDEIIILCLDLGELDAAIAIVADLETS 863 Query: 261 GIEVADETLDRVISARQ--DTTAS 196 GI+V DETLDRVISARQ DT A+ Sbjct: 864 GIKVPDETLDRVISARQGSDTPAN 887 >ref|XP_007030296.1| Plastid transcriptionally active 3 isoform 1 [Theobroma cacao] gi|508718901|gb|EOY10798.1| Plastid transcriptionally active 3 isoform 1 [Theobroma cacao] Length = 905 Score = 1226 bits (3173), Expect = 0.0 Identities = 626/862 (72%), Positives = 697/862 (80%), Gaps = 6/862 (0%) Frame = -3 Query: 2754 PRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSDVIY 2575 PRRKK+QS K DDN + S+ EK LRL FMEELM++ARS AGVSDVIY Sbjct: 37 PRRKKRQSQQKKDDDNATLSSSNAAV--SALEKSLRLTFMEELMQKARSRDVAGVSDVIY 94 Query: 2574 DMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFGSKG 2395 DMIAAGLTPGPRSFHGLVV+HVLN D EG+M ALRR+L G+RPLHET ++++RLFGSKG Sbjct: 95 DMIAAGLTPGPRSFHGLVVAHVLNGDVEGAMQALRRELGVGVRPLHETLVSMIRLFGSKG 154 Query: 2394 RATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRLYDL 2215 AT+GLE+LAAMEKLNYDIRQAW++LVEELVR+K++EDAN VFLKGA+GGLRATN LYDL Sbjct: 155 LATKGLEVLAAMEKLNYDIRQAWIILVEELVRNKYMEDANNVFLKGAKGGLRATNELYDL 214 Query: 2214 LIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGG 2035 +IEEDCKVGDHSNAL IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TFENMEYG Sbjct: 215 MIEEDCKVGDHSNALEIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFATFENMEYG- 273 Query: 2034 EAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVECFTK 1867 E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LLVECFTK Sbjct: 274 EEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALLVECFTK 333 Query: 1866 YCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETMAKD 1687 YCV++EAIRHFRALK FEGGT++L EG + DPLSLYLRALCREGRIVELL+AL+ MAKD Sbjct: 334 YCVVKEAIRHFRALKKFEGGTRVLQNEGNFDDPLSLYLRALCREGRIVELLEALQAMAKD 393 Query: 1686 KQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVPRRG 1507 Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARY+EEGGLTGERKRWVPRRG Sbjct: 394 NQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVPRRG 453 Query: 1506 KTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYIRVE 1327 KTPLDPDA G+ YSNPMETSFKQRCLE+WK+HHRKLL+TL+NEG LG SESDY+RV Sbjct: 454 KTPLDPDAAGFIYSNPMETSFKQRCLEDWKLHHRKLLKTLQNEGLAALGGASESDYVRVS 513 Query: 1326 ERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSR 1147 ERL+KIIKGP+QN LKPKAASKMIVSELKEELEAQGLP DGTRNVLYQRVQKARRINRSR Sbjct: 514 ERLKKIIKGPDQNVLKPKAASKMIVSELKEELEAQGLPIDGTRNVLYQRVQKARRINRSR 573 Query: 1146 GRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEEXXX 967 GR LISRIKL EG+TEFW++RFLGE LN +H KP++E Sbjct: 574 GRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKRRFLGEHLNVDHVKPIDE---- 629 Query: 966 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAEAAK 787 EQ E Q DR+KDKE EA K Sbjct: 630 ------GESEPADDELDDGDVVEDAAKDIEDDEADEEEEGEQAESQEGDRIKDKEVEAKK 683 Query: 786 PLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKVFDV 607 PLQMIGVQLLKDSDQ S+E DWFPEDI+EAF+E+R RKVFDV Sbjct: 684 PLQMIGVQLLKDSDQTTTRSKKSRRRSSRVSVEDDDDDDWFPEDIFEAFQELRERKVFDV 743 Query: 606 QDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMVLRA 427 +DMYT+ADAWGWTW+KE KNK PR+WSQEWEVELA+++M VIELGGTPT+GDCAM+LRA Sbjct: 744 EDMYTIADAWGWTWEKELKNKPPRKWSQEWEVELAIQVMQKVIELGGTPTVGDCAMILRA 803 Query: 426 AIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGIEVA 247 AI+APMPSAFL+ILQ H LG+VFGSPLYDEVIS+C+DLGELDA+IAIVADLET+GI V Sbjct: 804 AIKAPMPSAFLKILQTAHSLGFVFGSPLYDEVISICVDLGELDAAIAIVADLETAGIAVP 863 Query: 246 DETLDRVISARQ--DTTASDAS 187 D+TLDRVISARQ DT D S Sbjct: 864 DQTLDRVISARQTVDTAGGDVS 885 >ref|XP_007208365.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica] gi|462404007|gb|EMJ09564.1| hypothetical protein PRUPE_ppa001139mg [Prunus persica] Length = 897 Score = 1211 bits (3133), Expect = 0.0 Identities = 623/869 (71%), Positives = 694/869 (79%), Gaps = 16/869 (1%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 +++ RRK++Q+ K +D+ P S+AEK LR FMEELM RAR+ A GVSD Sbjct: 39 EKRTRRKRRQT--KGDNDSSSPSS-------SAAEKSLRFTFMEELMGRARNRDANGVSD 89 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 VIYDM+AAGLTPGPRSFHGL+V+H LN D E +M +LRR+LS GLRPLHETF+AL+RLFG Sbjct: 90 VIYDMVAAGLTPGPRSFHGLIVAHALNGDTEAAMQSLRRELSSGLRPLHETFIALIRLFG 149 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 SKGRATRGLEILAAMEKL+YDIR+AWLLLVEELVR++HLEDAN+VFLKGA+GGLRAT+ + Sbjct: 150 SKGRATRGLEILAAMEKLHYDIRRAWLLLVEELVRTRHLEDANKVFLKGAKGGLRATDEV 209 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDLLI EDCKVGDHSNAL IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME Sbjct: 210 YDLLIVEDCKVGDHSNALDIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 269 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 YGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KRLQPN+ LLVEC Sbjct: 270 YGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNMKTHALLVEC 329 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYCV+REAIRHFRALK FEGGTK LH EG +GDPLSLYLRALCREGRI+ELL+ALE M Sbjct: 330 FTKYCVVREAIRHFRALKTFEGGTKALHNEGNFGDPLSLYLRALCREGRILELLEALEAM 389 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 A+D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDY+ARY+ EGGLTGERKRWVP Sbjct: 390 AEDNQTIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYMARYIAEGGLTGERKRWVP 449 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYI 1336 RRGKTPLDPD EG+ YSNPME SFKQRCLE+WKIHHRKLLRTLRNEG LG+ SESDYI Sbjct: 450 RRGKTPLDPDVEGFIYSNPMENSFKQRCLEDWKIHHRKLLRTLRNEGVAALGDASESDYI 509 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 RVE RLRKIIKGP+QN LKPKAASKM+VSELKEELEAQGLPTDGTRNVLYQRVQKARRIN Sbjct: 510 RVEMRLRKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 569 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLE-- 982 RSRGR LISRIKL EG+TEFW++RFLGEG + + K ++ Sbjct: 570 RSRGRPLWVPPVEEEEEEVDEEIDELISRIKLEEGNTEFWKRRFLGEGFSSDQEKAVDVS 629 Query: 981 ----------EEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEI 832 E EQ E Sbjct: 630 DSASVVDVAKEVENGEAEADDDDDGDNDDDDDNDDDDDDEEEEEEEEEVEVEVEVEQAER 689 Query: 831 QVSDRVKDKEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDI 652 Q +RVK+KE EA KPLQMIGVQLLKDSDQ S E DWFP DI Sbjct: 690 QDVERVKEKEIEAKKPLQMIGVQLLKDSDQTSTTSKKSRRRRSRVSAEDDNDDDWFPLDI 749 Query: 651 YEAFKEMRNRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIEL 472 +EAFKE+RNRKVFDV DMYTLADAWGWTW++E KN+ PRRWSQ+WEV+LA+K+M L +L Sbjct: 750 FEAFKELRNRKVFDVSDMYTLADAWGWTWERELKNRPPRRWSQDWEVQLAIKVM-LKAKL 808 Query: 471 GGTPTIGDCAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDAS 292 GGTPTIGDCA++LRAAIRAP+PSAFL+ILQ TH LGYVFGSPLYDE+ISLCLDLGE+DA+ Sbjct: 809 GGTPTIGDCAVILRAAIRAPLPSAFLKILQTTHTLGYVFGSPLYDEIISLCLDLGEVDAA 868 Query: 291 IAIVADLETSGIEVADETLDRVISARQDT 205 +AIVAD+ET+GI V DETLDRVISAR+ T Sbjct: 869 VAIVADMETTGITVPDETLDRVISARRTT 897 >ref|XP_002522027.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538831|gb|EEF40431.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 889 Score = 1194 bits (3088), Expect = 0.0 Identities = 611/860 (71%), Positives = 693/860 (80%), Gaps = 9/860 (1%) Frame = -3 Query: 2763 QRKPRRKKQ----QSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAA 2596 ++K RRKKQ + L+ +D++ P ++AEK LR FMEELM+RAR+ A Sbjct: 30 EKKSRRKKQPHQQKQQLEKNDNSILPA------VITAAEKTLRFNFMEELMDRARNRDAV 83 Query: 2595 GVSDVIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALV 2416 GVSDVIYDM+AAGL+PGPRSFHGL+V++ LN D EG+M +LRR+LS+G+RPLHETFLAL+ Sbjct: 84 GVSDVIYDMVAAGLSPGPRSFHGLIVAYCLNGDIEGAMQSLRRELSQGIRPLHETFLALI 143 Query: 2415 RLFGSKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRA 2236 RLFGSKG A+RGLEILAAMEKL YDIR AW++LVEELV++K++EDAN+VFLKGA+GGLRA Sbjct: 144 RLFGSKGHASRGLEILAAMEKLKYDIRLAWIVLVEELVKNKYMEDANKVFLKGAKGGLRA 203 Query: 2235 TNRLYDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTF 2056 T+ LYD +IEEDCKVGDHSNAL IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TF Sbjct: 204 TDELYDYMIEEDCKVGDHSNALEIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFATF 263 Query: 2055 ENMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----L 1888 ENMEYGGE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KRLQPNV L Sbjct: 264 ENMEYGGEEYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVRTYAL 323 Query: 1887 LVECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDA 1708 LVECFTKYCV+REAIRHFRAL+NFEGGTK+LHY+G +GDPLSLYLRALCREGRIVELL+A Sbjct: 324 LVECFTKYCVVREAIRHFRALQNFEGGTKVLHYDGNFGDPLSLYLRALCREGRIVELLEA 383 Query: 1707 LETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERK 1528 LE M +D Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDY+ARYV EGGLTGERK Sbjct: 384 LEAMGRDNQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYVARYVAEGGLTGERK 443 Query: 1527 RWVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISE 1348 RWVPRRGKTPLDPDA G+ YSNPMETSFKQRC+E+WK+HHRKLLRTL NEG LG SE Sbjct: 444 RWVPRRGKTPLDPDAAGFIYSNPMETSFKQRCIEDWKVHHRKLLRTLLNEGLAALGEASE 503 Query: 1347 SDYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKA 1168 SDY+RV ERL+KIIKGP+QN LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKA Sbjct: 504 SDYLRVVERLKKIIKGPDQNVLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKA 563 Query: 1167 RRINRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKP 988 RRINRSRGR +ISRIKL EG+TEFW++RFLGEGLN ++ +P Sbjct: 564 RRINRSRGRPLWVPPVEEEEEEVDEELDEIISRIKLEEGNTEFWKRRFLGEGLNGSNLQP 623 Query: 987 LEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDR-VK 811 + EQTE Q DR VK Sbjct: 624 MS-----VAKSELPDVLDDVDAIEDADKEVEDEEADDEEEAEAEVEVEQTENQDVDRVVK 678 Query: 810 DKEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEM 631 +KE EA KPLQMIGVQLLKDSD S+E DWFPED +EAFKE+ Sbjct: 679 EKEVEAKKPLQMIGVQLLKDSDHLTTRSKKSKRRSARASVEDDADDDWFPEDPFEAFKEL 738 Query: 630 RNRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIG 451 R RKVFDV+DMYT+AD WGWTW++E KN+ P++WSQEWEVELA+K+M L +L GTPTIG Sbjct: 739 RERKVFDVEDMYTIADVWGWTWEREIKNRPPQKWSQEWEVELAIKLM-LKAQLSGTPTIG 797 Query: 450 DCAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADL 271 DCAM+LRAAIRAPMPSAFL+ILQ TH LGY FGSPLYDEVISLCLD+GELDA+IAIVADL Sbjct: 798 DCAMILRAAIRAPMPSAFLKILQTTHSLGYTFGSPLYDEVISLCLDIGELDAAIAIVADL 857 Query: 270 ETSGIEVADETLDRVISARQ 211 E++GI V D+TLDRVISARQ Sbjct: 858 ESTGITVPDQTLDRVISARQ 877 >gb|EXB93125.1| Pentatricopeptide repeat-containing protein [Morus notabilis] Length = 895 Score = 1181 bits (3054), Expect = 0.0 Identities = 614/869 (70%), Positives = 694/869 (79%), Gaps = 10/869 (1%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 +++ RRK++Q+ DD+ S+AEK LR FMEELMERAR+ AAGVSD Sbjct: 40 EKRTRRKRRQT----KDDD------------SAAEKGLRFTFMEELMERARNRDAAGVSD 83 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 VIYDM+AAGLTPGPRSFHGL+V+H L+ D E +M +LRR+LS GLRPL ETF+AL+R+FG Sbjct: 84 VIYDMVAAGLTPGPRSFHGLIVAHALSGDAEAAMQSLRRELSAGLRPLQETFVALIRMFG 143 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 SKGRAT+G+EILAAMEKLNYDIR AWL+LVEELVRS HLEDAN+VFL+GA+GGLRAT+ + Sbjct: 144 SKGRATKGMEILAAMEKLNYDIRGAWLILVEELVRSNHLEDANKVFLRGAKGGLRATDEV 203 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDL+I EDCK GDHSNAL IAYEMEAAGRMATTFHFN LLSVQATCGIPEIAFSTFENM+ Sbjct: 204 YDLMIVEDCKAGDHSNALEIAYEMEAAGRMATTFHFNWLLSVQATCGIPEIAFSTFENMQ 263 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 YG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+G+MVED+KRLQPN+ LLVEC Sbjct: 264 YG-EEFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGIMVEDHKRLQPNMKTHALLVEC 322 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYCVI EAIRHFRAL+NFEGGT +LH EG +GDPLSLYLRALCREGRIVELL+ALE M Sbjct: 323 FTKYCVIGEAIRHFRALRNFEGGTIVLHNEGNFGDPLSLYLRALCREGRIVELLEALEAM 382 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 KD Q IPPRAM+LS+KYRTLVSSWIEPLQ+EAELG+EIDYIARY+ EGGLTGERKRWVP Sbjct: 383 VKDNQPIPPRAMLLSKKYRTLVSSWIEPLQDEAELGYEIDYIARYIAEGGLTGERKRWVP 442 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYI 1336 RRGKTPLDPDA G+ YSNPMETSFKQRCLE+WK ++RKLLRTLRNEG VLG+ SESDYI Sbjct: 443 RRGKTPLDPDAAGFIYSNPMETSFKQRCLEDWKTYNRKLLRTLRNEGIAVLGDASESDYI 502 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 RVEERL KI++GPEQN LKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN Sbjct: 503 RVEERLLKIVRGPEQNVLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 562 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENH----SKP 988 RSRGR LISRIKL EG+TEFW++RFLGEGLN ++ S Sbjct: 563 RSRGRPLWIPPVEEEEEEVDEDLDELISRIKLQEGNTEFWKRRFLGEGLNGDNGNSTSMG 622 Query: 987 LEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKD 808 E EQTE Q ++RVK+ Sbjct: 623 RAEFADVDVDADIVEDSAKEVEDDEADADDNDEEEEEEEEVEEVDVVEQTESQDAERVKE 682 Query: 807 KEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMR 628 K+ A KPLQMIGVQLLKDSD+ +E DWFPEDI+EAFKE+R Sbjct: 683 KQVAAKKPLQMIGVQLLKDSDE-TTPSSKKSRRRASRVVEDDADDDWFPEDIFEAFKELR 741 Query: 627 NRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGD 448 RKVFDV DMYTLADAWGWTW+K+ N+ PRRWSQEWEVELA+K+M +IELGGTPTIGD Sbjct: 742 KRKVFDVDDMYTLADAWGWTWEKDLDNRPPRRWSQEWEVELAIKVMLKIIELGGTPTIGD 801 Query: 447 CAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLE 268 CAM+LRAAIRAP+PSAFL+ILQ TH LGYVFGSPLYDE+ISLCLDLGELDA+IAIVADLE Sbjct: 802 CAMILRAAIRAPLPSAFLKILQTTHSLGYVFGSPLYDEIISLCLDLGELDAAIAIVADLE 861 Query: 267 TSGIEVADETLDRVISARQ--DTTASDAS 187 T+ I V DETLDRVI+ARQ +++A D+S Sbjct: 862 TTSIAVPDETLDRVIAARQMNESSAGDSS 890 >ref|XP_006443293.1| hypothetical protein CICLE_v10023441mg [Citrus clementina] gi|568850568|ref|XP_006478982.1| PREDICTED: uncharacterized protein LOC102630853 isoform X1 [Citrus sinensis] gi|557545555|gb|ESR56533.1| hypothetical protein CICLE_v10023441mg [Citrus clementina] Length = 887 Score = 1179 bits (3051), Expect = 0.0 Identities = 606/865 (70%), Positives = 693/865 (80%), Gaps = 6/865 (0%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 ++K RRKKQQ + D+ GS S+AE+ LRL+FMEELM+ AR+ A V+D Sbjct: 34 EKKTRRKKQQRRQQKHGDSLL--STNGSVV-SAAEQGLRLIFMEELMQHARNRDAPRVND 90 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 VIYDMIAAGL+PGPRSFHGLVV++ LN D EG+MH+L+R+LS G+RPLHET +AL RLFG Sbjct: 91 VIYDMIAAGLSPGPRSFHGLVVAYTLNGDHEGAMHSLKRELSTGVRPLHETLIALARLFG 150 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 SKG AT+GLEILAAMEK+NYDIRQAWL+LVEELVR+K+LEDAN+VFL+GA+GGLRAT+ + Sbjct: 151 SKGLATKGLEILAAMEKINYDIRQAWLILVEELVRNKYLEDANKVFLRGAKGGLRATDEI 210 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDL+I EDCK GDHSNAL IAYEMEAAGRMATTFHFN LLS QATCGIPE+AF+TFENME Sbjct: 211 YDLMIAEDCKAGDHSNALEIAYEMEAAGRMATTFHFNHLLSCQATCGIPEVAFATFENME 270 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 YG E YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMM ED+KRLQPNV LLVEC Sbjct: 271 YG-EDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMFEDHKRLQPNVKTYALLVEC 329 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYC + EAIRHFRAL+N+EGGTK+LH EG +GDPLSLYLRALCREGRI+ELL+ALE M Sbjct: 330 FTKYCAVTEAIRHFRALQNYEGGTKVLHNEGNFGDPLSLYLRALCREGRIIELLEALEAM 389 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 AKD Q +PPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARY+ EGGLTGERKRWVP Sbjct: 390 AKDNQPVPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYISEGGLTGERKRWVP 449 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYI 1336 RRGKTPLDPDA G+ YSNPMETSFKQRCLE+ K +HRKLLRTL+NEGP VLG++SESDY+ Sbjct: 450 RRGKTPLDPDAVGFIYSNPMETSFKQRCLEDGKKYHRKLLRTLQNEGPAVLGDVSESDYV 509 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 RVEERL+K+IKGPEQ+ LKPKAASKM+VSELKEEL+AQGLPTDGTRNVLYQRVQKARRIN Sbjct: 510 RVEERLKKLIKGPEQHVLKPKAASKMVVSELKEELDAQGLPTDGTRNVLYQRVQKARRIN 569 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEE 976 RSRGR LISRIKL EG+TEFW++RFLGEGLN H K +E + Sbjct: 570 RSRGRPLWVPPVEEEEEEVDEEVDELISRIKLEEGNTEFWKRRFLGEGLNGRHDKAVEMD 629 Query: 975 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAE 796 + E Q DRVK+K E Sbjct: 630 -------ESELSDVLDDDVTDVEYVAKDEEADEEADEEEEVEQAEPESQDVDRVKEKLVE 682 Query: 795 AAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKV 616 A KPLQMIGVQLLKDSDQ +E DWFPED +EAFKEMR RKV Sbjct: 683 AKKPLQMIGVQLLKDSDQ-TTTTSKRSMKRSSRMVEDDDDEDWFPEDPFEAFKEMRKRKV 741 Query: 615 FDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMV 436 FDV DMYT+ADAWGWTW++E KN+ P++WSQEWEVELA++IM VIELGG PTIGDCA++ Sbjct: 742 FDVSDMYTIADAWGWTWEREIKNRPPQKWSQEWEVELAIQIMLKVIELGGMPTIGDCAVI 801 Query: 435 LRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGI 256 + AAIRAP+PSAFL+ILQ TH LGYVFGSPLYDE+ISLCLDLGELDA++AIVAD+ET+GI Sbjct: 802 IHAAIRAPLPSAFLKILQKTHSLGYVFGSPLYDEIISLCLDLGELDAAVAIVADMETTGI 861 Query: 255 EVADETLDRVISARQ--DTTASDAS 187 V D+TLDRVI+ARQ +T+ DAS Sbjct: 862 AVPDQTLDRVITARQTGETSVDDAS 886 >ref|XP_002325363.1| SAP domain-containing family protein [Populus trichocarpa] gi|222862238|gb|EEE99744.1| SAP domain-containing family protein [Populus trichocarpa] Length = 887 Score = 1170 bits (3026), Expect = 0.0 Identities = 604/853 (70%), Positives = 681/853 (79%), Gaps = 5/853 (0%) Frame = -3 Query: 2754 PRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSDVIY 2575 P++ +++ K +DNG P S S+ EK LR FMEELM RAR+ + GVSDVIY Sbjct: 34 PKKSRRKKPPKQKNDNGSPL----SVVVSAEEKNLRFAFMEELMHRARNRDSNGVSDVIY 89 Query: 2574 DMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFGSKG 2395 DMIAAGL+PGPRSFHGL+V+H LN D EG+M +LRR+LS G RPLHET +AL+RLFGSKG Sbjct: 90 DMIAAGLSPGPRSFHGLIVAHTLNGDHEGAMQSLRRELSAGHRPLHETCIALIRLFGSKG 149 Query: 2394 RATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRLYDL 2215 TRGLE+LAAMEKLNYDIR+AW+LLVEELV+ + +EDANRVFLKGA GGLRAT+ LYDL Sbjct: 150 FGTRGLELLAAMEKLNYDIRRAWILLVEELVKGRFMEDANRVFLKGANGGLRATDELYDL 209 Query: 2214 LIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGG 2035 +IEEDCKVGDHSNAL IAY ME AGRMATTFHFNCLLSVQATCGIPEI+F+TFENMEYG Sbjct: 210 MIEEDCKVGDHSNALDIAYAMEEAGRMATTFHFNCLLSVQATCGIPEISFATFENMEYG- 268 Query: 2034 EAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVECFTK 1867 E YMKPDTE+YNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LLVECF+K Sbjct: 269 EDYMKPDTESYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRIQPNVKTYALLVECFSK 328 Query: 1866 YCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETMAKD 1687 YCV+REAIRHFRAL+ FEGGTK LH EG++GDPLSLYLRALCREGRIV+LL+ALE MA+D Sbjct: 329 YCVVREAIRHFRALRKFEGGTKALHNEGKFGDPLSLYLRALCREGRIVDLLEALEAMAED 388 Query: 1686 KQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVPRRG 1507 Q IPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDY+ARYV EGGLTGERKRWVPRRG Sbjct: 389 NQPIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYVARYVAEGGLTGERKRWVPRRG 448 Query: 1506 KTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYIRVE 1327 KTPLDPD +G+ YSNPMETS KQRCLE+WK HHRKLL+ LRNEG LG+ SESDY+RVE Sbjct: 449 KTPLDPDCDGFIYSNPMETSLKQRCLEDWKAHHRKLLKMLRNEGLAALGDASESDYLRVE 508 Query: 1326 ERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSR 1147 ERLRKII+GP++N LKPKAASKMIVSELK+ELEAQGLP DGTRNVLYQRVQKARRINRSR Sbjct: 509 ERLRKIIRGPDRNVLKPKAASKMIVSELKDELEAQGLPIDGTRNVLYQRVQKARRINRSR 568 Query: 1146 GRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEEXXX 967 GR LISRI+LHEGDTEFW++RFLGEG N NH KP++ E Sbjct: 569 GRPLWVPPVEEEEEEVDEEVDELISRIQLHEGDTEFWKRRFLGEGFNGNHVKPVDME--T 626 Query: 966 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDR-VKDKEAEAA 790 EQTE Q ++R VK KEAEA Sbjct: 627 SELPDELDEDEDDDDDDVEDVAKEVEDEEADEEGEVEVEVEQTESQDAERIVKAKEAEAK 686 Query: 789 KPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKVFD 610 KPLQMIGVQLLKDSDQ + DWFPEDI EAFKEMRNRKVFD Sbjct: 687 KPLQMIGVQLLKDSDQ--TTRMSKKSRRRAARLADDDDDDWFPEDILEAFKEMRNRKVFD 744 Query: 609 VQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMVLR 430 V+DMY +ADAWGWTW++E K + +RWSQEWEVELA+++M L +LGGTPTIGDCAM+LR Sbjct: 745 VEDMYLIADAWGWTWEREIKKRPLQRWSQEWEVELAIQLM-LKAKLGGTPTIGDCAMILR 803 Query: 429 AAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGIEV 250 AAIRAPMPSAFL+ILQ TH LGY FGS LYDE+ISLC+DLGELDA+IAIVADLET+GI V Sbjct: 804 AAIRAPMPSAFLKILQTTHSLGYQFGSSLYDEIISLCVDLGELDAAIAIVADLETAGIAV 863 Query: 249 ADETLDRVISARQ 211 D+TLDRVISA+Q Sbjct: 864 PDQTLDRVISAKQ 876 >ref|XP_006408205.1| hypothetical protein EUTSA_v10020015mg [Eutrema salsugineum] gi|557109351|gb|ESQ49658.1| hypothetical protein EUTSA_v10020015mg [Eutrema salsugineum] Length = 912 Score = 1160 bits (3002), Expect = 0.0 Identities = 594/859 (69%), Positives = 681/859 (79%), Gaps = 8/859 (0%) Frame = -3 Query: 2763 QRKPRRK-KQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVS 2587 ++KPRR+ KQQ +N G S+ E+ LRL FM+ELMERAR+ +GVS Sbjct: 36 EKKPRRRRKQQQKRGDGAENDDSSSFGSREAVSALERSLRLTFMDELMERARNRDTSGVS 95 Query: 2586 DVIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLF 2407 +VIYDMIAAGL+PGPRSFHGLVV+H LN DE+G+MH+LR++L G RPL ET +ALVRL Sbjct: 96 EVIYDMIAAGLSPGPRSFHGLVVAHALNGDEQGAMHSLRKELGAGQRPLPETMIALVRLS 155 Query: 2406 GSKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNR 2227 GSKG ATRGLE+LAAMEKLNYDIRQAWL+LVEEL+R+ HLEDAN+VFLKGA GG+RAT++ Sbjct: 156 GSKGNATRGLELLAAMEKLNYDIRQAWLILVEELMRTNHLEDANKVFLKGARGGMRATDQ 215 Query: 2226 LYDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENM 2047 LYDL+IEEDCK GDHSNAL I+YEMEAAGRMATTFHFNCLLSVQATCGIPE+A+STFENM Sbjct: 216 LYDLMIEEDCKAGDHSNALDISYEMEAAGRMATTFHFNCLLSVQATCGIPEVAYSTFENM 275 Query: 2046 EYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVE 1879 EYG E +MKPDTETYNWVIQAYTRA+SYDRVQDVAEL+GMMVED+KR+QPNV LLVE Sbjct: 276 EYG-EDFMKPDTETYNWVIQAYTRADSYDRVQDVAELLGMMVEDHKRVQPNVKTHALLVE 334 Query: 1878 CFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALET 1699 CFTKYCV++EAIRHFRALKNFEGGT +LH G + DPLSLYLRALCREGRIVEL++AL+ Sbjct: 335 CFTKYCVVKEAIRHFRALKNFEGGTIVLHNAGNFEDPLSLYLRALCREGRIVELIEALDA 394 Query: 1698 MAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWV 1519 M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ARY+EEGGLTGERKRWV Sbjct: 395 MRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYLARYIEEGGLTGERKRWV 454 Query: 1518 PRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDY 1339 PRRGKTPLDPDA G+ YSNP+ETSFKQRCL++WKIHHRKLLRTL++EG VLG+ SESDY Sbjct: 455 PRRGKTPLDPDASGFIYSNPIETSFKQRCLDDWKIHHRKLLRTLQSEGLPVLGDASESDY 514 Query: 1338 IRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRI 1159 IRV ERLR IIKGP QN LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKARRI Sbjct: 515 IRVMERLRNIIKGPAQNLLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKARRI 574 Query: 1158 NRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLE- 982 N+SRGR LI RIKLHEGDTEFW++RFLGEGL E ++ E Sbjct: 575 NKSRGRPLWVPPIEEEEEEVDEEVDELIRRIKLHEGDTEFWKRRFLGEGLIETSAESKET 634 Query: 981 EEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTE--IQVSDRVKD 808 E +TE + D VK+ Sbjct: 635 AESVATGESENTIEDVVKEAEDDDDEEEEEQEGDEEDEEEEEVVVAETENRAEGEDLVKN 694 Query: 807 KEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMR 628 K A+A K LQMIGVQLLK+SD+ ++E DWFPE+ +EAFKEMR Sbjct: 695 KAADAKKHLQMIGVQLLKESDE-ANRTKKRGKRASRMTLEDDADEDWFPEEPFEAFKEMR 753 Query: 627 NRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGD 448 RKVFDV DMYT+AD WGWTW+K++KNK PR+WSQEWEVELA+ +M VIELGG PTIGD Sbjct: 754 ERKVFDVSDMYTIADVWGWTWEKDYKNKTPRKWSQEWEVELAIVLMTKVIELGGIPTIGD 813 Query: 447 CAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLE 268 CA++LRAA+RAPMPSAFL+ILQ TH LGY FGSPLYDE+I+LCLDLGELDA+IAIVAD+E Sbjct: 814 CAVILRAAVRAPMPSAFLKILQTTHSLGYSFGSPLYDEIITLCLDLGELDAAIAIVADME 873 Query: 267 TSGIEVADETLDRVISARQ 211 T+GI V D+TLD+VISARQ Sbjct: 874 TTGITVPDQTLDKVISARQ 892 >ref|XP_003555560.1| PREDICTED: uncharacterized protein LOC100807191 isoform X1 [Glycine max] Length = 887 Score = 1159 bits (2999), Expect = 0.0 Identities = 597/855 (69%), Positives = 676/855 (79%), Gaps = 5/855 (0%) Frame = -3 Query: 2760 RKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSDV 2581 ++ R+KKQ D S+ +NG LR FMEELM+RAR+ + GVS+V Sbjct: 40 KRGRKKKQAKDDDSAVENG-----------------LRFSFMEELMDRARNRDSNGVSEV 82 Query: 2580 IYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFGS 2401 +YDMIAAGL+PGPRSFHGLVVSH LN DEE +M +LRR+L+ GLRP+HETFLAL+RLFGS Sbjct: 83 MYDMIAAGLSPGPRSFHGLVVSHALNGDEEAAMESLRRELAAGLRPVHETFLALIRLFGS 142 Query: 2400 KGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRLY 2221 KGRATRGLEILAAMEKLNYDIRQAWL+L+EELV +KHLEDAN VFLKGA+GGL+AT+ +Y Sbjct: 143 KGRATRGLEILAAMEKLNYDIRQAWLILIEELVWNKHLEDANEVFLKGAKGGLKATDEVY 202 Query: 2220 DLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEY 2041 DLLIEEDCK GDHSNAL IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TFENMEY Sbjct: 203 DLLIEEDCKAGDHSNALDIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFATFENMEY 262 Query: 2040 GGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPN----VLLVECF 1873 GE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPN LLVECF Sbjct: 263 -GEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRIQPNAKTHALLVECF 321 Query: 1872 TKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETMA 1693 TKYCV+REAIRHFRALKNFEGG K+LH EG +GDPLSLYLRALCREGRIVE+L+ALE MA Sbjct: 322 TKYCVVREAIRHFRALKNFEGGIKVLHNEGNHGDPLSLYLRALCREGRIVEMLEALEAMA 381 Query: 1692 KDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVPR 1513 KD Q IP RAMILSRKYRTLVSSWIEPLQEEAELG+EIDYI+RY++EGGLTGERKRWVPR Sbjct: 382 KDNQPIPSRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYISRYIDEGGLTGERKRWVPR 441 Query: 1512 RGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGN-ISESDYI 1336 RGKTPLDPDA G+ YSNPMETSFKQRCLEE K+H++KLL+TL+NEG LG+ +SESDYI Sbjct: 442 RGKTPLDPDAHGFIYSNPMETSFKQRCLEELKLHNKKLLKTLQNEGLAALGDGVSESDYI 501 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 RV+ERL+K+IKGPEQN LKPKAASKM+VSELKEEL+AQGLP DG RNVLYQRVQKARRIN Sbjct: 502 RVQERLKKLIKGPEQNVLKPKAASKMLVSELKEELDAQGLPIDGNRNVLYQRVQKARRIN 561 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEE 976 RSRGR LIS IKL EG+TEFW++RFLGEGLN + P + Sbjct: 562 RSRGRPLWVPPVEEEEEEVDEELDALISHIKLEEGNTEFWKRRFLGEGLNGDQEMPTD-- 619 Query: 975 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAE 796 E E Q +R+K+KE E Sbjct: 620 -AAESEVPEVLDDVDAIEDAAKEVEDDEADDDEEEAEQAEEEVEPAENQDVNRIKEKEVE 678 Query: 795 AAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKV 616 A +PLQMIGVQLLKD DQ +E DW P D++EAF+EMR RK+ Sbjct: 679 AKRPLQMIGVQLLKDIDQ-PTATSKKFKRSRKVQVEDDDDDDWLPLDLFEAFEEMRKRKI 737 Query: 615 FDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMV 436 FDV DMYTLADAWGWTW++E K K PRRWSQEWEVELA+K+M VIELGG PTIGDCAM+ Sbjct: 738 FDVSDMYTLADAWGWTWERELKKKPPRRWSQEWEVELAIKVMQKVIELGGRPTIGDCAMI 797 Query: 435 LRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGI 256 LRAAIRAP+PSAFL ILQ TH LG+ FGSPLYDE+ISLC+DLGELDA++A+VADLET+GI Sbjct: 798 LRAAIRAPLPSAFLTILQTTHSLGFKFGSPLYDEIISLCVDLGELDAAVAVVADLETTGI 857 Query: 255 EVADETLDRVISARQ 211 V+D TLDRVISA+Q Sbjct: 858 SVSDLTLDRVISAKQ 872 >ref|XP_006296937.1| hypothetical protein CARUB_v10012929mg [Capsella rubella] gi|482565646|gb|EOA29835.1| hypothetical protein CARUB_v10012929mg [Capsella rubella] Length = 911 Score = 1158 bits (2995), Expect = 0.0 Identities = 595/866 (68%), Positives = 685/866 (79%), Gaps = 15/866 (1%) Frame = -3 Query: 2763 QRKPRRKKQQS--DLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGV 2590 ++KPRR+++Q D +DD+ G S+ E+ LRL FM+ELMERAR+ ++GV Sbjct: 36 EKKPRRRRKQKRGDAADNDDSS---SFGSGEAVSALERSLRLTFMDELMERARNRDSSGV 92 Query: 2589 SDVIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRL 2410 S+VIYDMIAAGL PGPRSFHGLVV+H LN DE+G+MH+LR++L G RPL ET +ALVRL Sbjct: 93 SEVIYDMIAAGLAPGPRSFHGLVVAHALNGDEQGAMHSLRKELGAGQRPLPETMIALVRL 152 Query: 2409 FGSKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATN 2230 GSKG ATRGLEILAAMEKLNYDIRQAWL+LVEEL+R+ HLEDAN+VFLKGA GG+RAT+ Sbjct: 153 SGSKGNATRGLEILAAMEKLNYDIRQAWLILVEELMRTNHLEDANKVFLKGARGGMRATD 212 Query: 2229 RLYDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFEN 2050 +LYDL+IEEDCK GDHSNAL I+YEMEAAGRMATTFHFNCLLSVQATCGIPE+A+STFEN Sbjct: 213 QLYDLMIEEDCKAGDHSNALDISYEMEAAGRMATTFHFNCLLSVQATCGIPEVAYSTFEN 272 Query: 2049 MEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLV 1882 MEYG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LLV Sbjct: 273 MEYG-EDFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALLV 331 Query: 1881 ECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALE 1702 ECFTKYCV++EAIRHFRALKNFEGGT +L+ G++ DPLSLYLRALCREGRIVEL+DAL+ Sbjct: 332 ECFTKYCVVKEAIRHFRALKNFEGGTIVLYNAGKFEDPLSLYLRALCREGRIVELIDALD 391 Query: 1701 TMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRW 1522 M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ARY+EEGGLTG+RKRW Sbjct: 392 AMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYLARYIEEGGLTGDRKRW 451 Query: 1521 VPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESD 1342 VPR+GKTPLDPDA G+ YSNP+ETSFKQRCLE+WKIHHRKLLRTL++EG VLG+ SESD Sbjct: 452 VPRKGKTPLDPDASGFIYSNPIETSFKQRCLEDWKIHHRKLLRTLQSEGLPVLGDASESD 511 Query: 1341 YIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARR 1162 Y+RV ERLR II+GP QN LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKARR Sbjct: 512 YMRVMERLRNIIRGPAQNLLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKARR 571 Query: 1161 INRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLE 982 IN+SRGR LI RIKLHEGDTEFW++RFLGEGL E +E Sbjct: 572 INKSRGRPLWVPPIEEEEEEVDEEVDELICRIKLHEGDTEFWKRRFLGEGLIET---SVE 628 Query: 981 EEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVS------- 823 E+ E+ V+ Sbjct: 629 SSETAETVATGESEITIKDAAKEADDDEDDDEEEEQEGDEDDDENEEEEVVVAETENRAE 688 Query: 822 --DRVKDKEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIY 649 D VK+K AEA K LQMIGVQLLK+SD+ ++E DWFPED + Sbjct: 689 GEDLVKNKAAEAKKHLQMIGVQLLKESDE-ANRTKKRGKRASRMTLEDDADEDWFPEDPF 747 Query: 648 EAFKEMRNRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELG 469 EAFKEMR RKVFDV DMYT+AD WGWTW+K+FKNK PR+WSQEWEVELA+ +M VIELG Sbjct: 748 EAFKEMRERKVFDVSDMYTIADVWGWTWEKDFKNKTPRKWSQEWEVELAMVLMTKVIELG 807 Query: 468 GTPTIGDCAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASI 289 G PTIGDCA++LRAA+RAPMPSAFL+ILQ TH LGY FGSPLYDE+I+LCLDLGELDA+I Sbjct: 808 GIPTIGDCAVILRAAVRAPMPSAFLKILQTTHSLGYSFGSPLYDEIITLCLDLGELDAAI 867 Query: 288 AIVADLETSGIEVADETLDRVISARQ 211 AIVAD+ET+GI V D+T+D+VISARQ Sbjct: 868 AIVADMETTGITVPDQTIDKVISARQ 893 >ref|XP_003535382.1| PREDICTED: uncharacterized protein LOC100802355 isoform X1 [Glycine max] Length = 887 Score = 1156 bits (2990), Expect = 0.0 Identities = 594/855 (69%), Positives = 679/855 (79%), Gaps = 5/855 (0%) Frame = -3 Query: 2760 RKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSDV 2581 ++ R+KKQ D +S+ +NG LR FMEELM+RAR+ + GVS+V Sbjct: 40 KRGRKKKQSKDDESAVENG-----------------LRFSFMEELMDRARNRDSNGVSEV 82 Query: 2580 IYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFGS 2401 +YDMIAAGL+PGPRSFHGLVVSH LN DEE +M +LRR+L+ GLRP+HETFLAL+RLFGS Sbjct: 83 MYDMIAAGLSPGPRSFHGLVVSHALNGDEEAAMESLRRELAAGLRPVHETFLALIRLFGS 142 Query: 2400 KGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRLY 2221 KGRATRGLEILAAMEKLNYDIRQAWL+L+EELVR+ HLEDAN VFLKGA+GGL+AT+ +Y Sbjct: 143 KGRATRGLEILAAMEKLNYDIRQAWLILIEELVRNMHLEDANEVFLKGAKGGLKATDEVY 202 Query: 2220 DLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEY 2041 DLLI+EDCKVGDHSNAL IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TFENMEY Sbjct: 203 DLLIQEDCKVGDHSNALDIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFATFENMEY 262 Query: 2040 GGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPN----VLLVECF 1873 GE YMKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPN LLVECF Sbjct: 263 -GEDYMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRIQPNAKTHALLVECF 321 Query: 1872 TKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETMA 1693 TKYCV+REAIRHFRALKNFEGG ++LH EG +GDPLSLYLRALCREGRIVE+L+ALE MA Sbjct: 322 TKYCVVREAIRHFRALKNFEGGIEVLHNEGNHGDPLSLYLRALCREGRIVEMLEALEAMA 381 Query: 1692 KDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVPR 1513 KD Q IP RAMILSRKYRTLVSSWIEPLQEEAE+G+EIDYI+RY++EGGLTGERKRWVPR Sbjct: 382 KDNQPIPSRAMILSRKYRTLVSSWIEPLQEEAEIGYEIDYISRYIDEGGLTGERKRWVPR 441 Query: 1512 RGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLG-NISESDYI 1336 RGKTPLDPDA G+ YSNPMETSFKQRC+EE K+H++KLL+TL+NEG LG ++SE DYI Sbjct: 442 RGKTPLDPDAHGFIYSNPMETSFKQRCMEELKLHNKKLLKTLQNEGLAALGDDVSEFDYI 501 Query: 1335 RVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRIN 1156 RV+ERL+K++KGPEQN LKPKAASKM+VSELKEEL+AQGLP DGTRNVLYQRVQKARRIN Sbjct: 502 RVQERLKKLMKGPEQNVLKPKAASKMLVSELKEELDAQGLPIDGTRNVLYQRVQKARRIN 561 Query: 1155 RSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEE 976 RSRGR LISRIKL EG+TEFW++RFLGEGLN + P + Sbjct: 562 RSRGRPLWVPPVEEEEEEVDEELDALISRIKLEEGNTEFWKRRFLGEGLNGDQEMPTD-- 619 Query: 975 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAE 796 E E Q +R+K+KE E Sbjct: 620 -AVQSDVPEVLDDVDAIEDAAKEVEDDEADDEEEEAEQAEEEVEPAENQDVNRIKEKEVE 678 Query: 795 AAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKV 616 A +PLQMIGVQLLKD DQ +E DW P +++EAFKEMR RK+ Sbjct: 679 AKRPLQMIGVQLLKDIDQ-PTATSKKFKRSRRVQVEDDDDDDWLPLNLFEAFKEMRKRKI 737 Query: 615 FDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMV 436 FDV DMYTLADAWGWTW++E KNK PRRWSQE EVELA+K+M+ VIELGG PTIGDCAM+ Sbjct: 738 FDVSDMYTLADAWGWTWERELKNKPPRRWSQEREVELAIKVMHKVIELGGRPTIGDCAMI 797 Query: 435 LRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGI 256 LRAAIRAP+PSAFL ILQ TH LG+ FGSPLYDE ISLC+DLGELDA++A+VADLET+GI Sbjct: 798 LRAAIRAPLPSAFLTILQTTHALGFKFGSPLYDETISLCVDLGELDAAVAVVADLETTGI 857 Query: 255 EVADETLDRVISARQ 211 V+D TLDRVISA+Q Sbjct: 858 SVSDHTLDRVISAKQ 872 >ref|XP_002884436.1| hypothetical protein ARALYDRAFT_477686 [Arabidopsis lyrata subsp. lyrata] gi|297330276|gb|EFH60695.1| hypothetical protein ARALYDRAFT_477686 [Arabidopsis lyrata subsp. lyrata] Length = 914 Score = 1156 bits (2990), Expect = 0.0 Identities = 595/867 (68%), Positives = 683/867 (78%), Gaps = 16/867 (1%) Frame = -3 Query: 2763 QRKPRRK-KQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVS 2587 ++KPRR+ KQ+ + +D+ G G + S+ E+ LRL FM+ELMERAR+ +GVS Sbjct: 36 EKKPRRRRKQKRGDGAENDSSLSFGSGDAV--SALERSLRLTFMDELMERARNRDTSGVS 93 Query: 2586 DVIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLF 2407 +VIYDMIAAGL+PGPRSFHGLVV+H LN DE G+MH+LR++L G RPL ET +ALVRL Sbjct: 94 EVIYDMIAAGLSPGPRSFHGLVVAHALNGDEHGAMHSLRKELGAGQRPLPETMIALVRLS 153 Query: 2406 GSKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNR 2227 GSKG ATRGLEILAAMEKLNYDIRQAWL+LVEEL+R HLEDAN+VFLKGA GG+RATN Sbjct: 154 GSKGNATRGLEILAAMEKLNYDIRQAWLILVEELMRINHLEDANKVFLKGARGGMRATNH 213 Query: 2226 LYDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENM 2047 LYDL+IEEDCK GDHSNAL I+YEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFENM Sbjct: 214 LYDLMIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEVAYATFENM 273 Query: 2046 EYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVE 1879 EYG +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LLVE Sbjct: 274 EYGEGLFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALLVE 333 Query: 1878 CFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALET 1699 CFTKYCV++EAIRHFRALKNFEGGT +LH G++ DPLSLYLRALCREGRIVEL+DAL+ Sbjct: 334 CFTKYCVVKEAIRHFRALKNFEGGTTILHNAGKFEDPLSLYLRALCREGRIVELIDALDA 393 Query: 1698 MAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWV 1519 M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ARY+EEGGLTGERKRWV Sbjct: 394 MRKDSQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYVARYIEEGGLTGERKRWV 453 Query: 1518 PRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDY 1339 PRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WKIHHRKLLRTL++EG VLG+ SESDY Sbjct: 454 PRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKIHHRKLLRTLQSEGLPVLGDASESDY 513 Query: 1338 IRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRI 1159 +RV ERLR IIKGP QN LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKARRI Sbjct: 514 MRVMERLRNIIKGPAQNLLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKARRI 573 Query: 1158 NRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEE 979 N+SRGR LI RIKLHEGDTEFW++RFLGEGL E +E Sbjct: 574 NKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKRRFLGEGLIET---SVES 630 Query: 978 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDR------ 817 + E+ E+ V++ Sbjct: 631 KETTESVVTGESEKAIEDISKEADNDEDDDEEEQEGDDDDDETEEEEEVVVAETENRAEG 690 Query: 816 ---VKDKEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYE 646 VK+K A+A K LQMIGVQLLK+SD+ ++E DWFPED +E Sbjct: 691 EELVKNKAADAKKHLQMIGVQLLKESDE-ANRTKKRGKRASRMTLEDDADEDWFPEDPFE 749 Query: 645 AFKEMRNRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMN--LVIEL 472 AFKEMR RKVFDV DMYT+AD WGWTW+K+FKN+ PR+WSQEWEVELA+ +M VIEL Sbjct: 750 AFKEMRERKVFDVSDMYTIADVWGWTWEKDFKNRTPRKWSQEWEVELAIVLMTKARVIEL 809 Query: 471 GGTPTIGDCAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDAS 292 GG PTIGDCA++LRAA+RAPMPSAFL+ILQ TH LGY FGSPLYDE+I+LCLD+GELDA+ Sbjct: 810 GGIPTIGDCAVILRAALRAPMPSAFLKILQTTHSLGYSFGSPLYDEIITLCLDIGELDAA 869 Query: 291 IAIVADLETSGIEVADETLDRVISARQ 211 IAIVAD+ET+GI V D+TLD+VISARQ Sbjct: 870 IAIVADMETTGITVPDQTLDKVISARQ 896 >ref|NP_187076.2| plastid transcriptionally active 3 [Arabidopsis thaliana] gi|332640537|gb|AEE74058.1| plastid transcriptionally active 3 [Arabidopsis thaliana] Length = 910 Score = 1154 bits (2985), Expect = 0.0 Identities = 594/863 (68%), Positives = 680/863 (78%), Gaps = 12/863 (1%) Frame = -3 Query: 2763 QRKPRRKKQQS--DLKSSDDN-GYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAG 2593 ++KPRR+++Q D +DD+ + GE S+ E+ LRL FM+ELMERAR+ +G Sbjct: 36 EKKPRRRRKQKRGDGAENDDSLSFGSGEA----VSALERSLRLTFMDELMERARNRDTSG 91 Query: 2592 VSDVIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVR 2413 VS+VIYDMIAAGL+PGPRSFHGLVV+H LN DE+G+MH+LR++L G RPL ET +ALVR Sbjct: 92 VSEVIYDMIAAGLSPGPRSFHGLVVAHALNGDEQGAMHSLRKELGAGQRPLPETMIALVR 151 Query: 2412 LFGSKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRAT 2233 L GSKG ATRGLEILAAMEKL YDIRQAWL+LVEEL+R HLEDAN+VFLKGA GG+RAT Sbjct: 152 LSGSKGNATRGLEILAAMEKLKYDIRQAWLILVEELMRINHLEDANKVFLKGARGGMRAT 211 Query: 2232 NRLYDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE 2053 ++LYDL+IEEDCK GDHSNAL I+YEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFE Sbjct: 212 DQLYDLMIEEDCKAGDHSNALDISYEMEAAGRMATTFHFNCLLSVQATCGIPEVAYATFE 271 Query: 2052 NMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LL 1885 NMEYG E +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LL Sbjct: 272 NMEYG-EVFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALL 330 Query: 1884 VECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDAL 1705 VECFTKYCV++EAIRHFRALKNFEGGT +LH G + DPLSLYLRALCREGRIVEL+DAL Sbjct: 331 VECFTKYCVVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLRALCREGRIVELIDAL 390 Query: 1704 ETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKR 1525 + M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ARY+EEGGLTGERKR Sbjct: 391 DAMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYLARYIEEGGLTGERKR 450 Query: 1524 WVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISES 1345 WVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WK+HHRKLLRTL++EG VLG+ SES Sbjct: 451 WVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRTLQSEGLPVLGDASES 510 Query: 1344 DYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKAR 1165 DY+RV ERLR IIKGP N LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKAR Sbjct: 511 DYMRVVERLRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKAR 570 Query: 1164 RINRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNEN--HSK 991 RIN+SRGR LI RIKLHEGDTEFW++RFLGEGL E SK Sbjct: 571 RINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKRRFLGEGLIETSVESK 630 Query: 990 PLEEE---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSD 820 E + + D Sbjct: 631 ETTESVVTGESEKAIEDISKEADNEEDDDEEEQEGDEDDDENEEEEVVVPETENRAEGED 690 Query: 819 RVKDKEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAF 640 VK+K A+A K LQMIGVQLLK+SD+ ++E DWFPE+ +EAF Sbjct: 691 LVKNKAADAKKHLQMIGVQLLKESDE-ANRTKKRGKRASRMTLEDDADEDWFPEEPFEAF 749 Query: 639 KEMRNRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTP 460 KEMR RKVFDV DMYT+AD WGWTW+K+FKNK PR+WSQEWEVELA+ +M VIELGG P Sbjct: 750 KEMRERKVFDVADMYTIADVWGWTWEKDFKNKTPRKWSQEWEVELAIVLMTKVIELGGIP 809 Query: 459 TIGDCAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIV 280 TIGDCA++LRAA+RAPMPSAFL+ILQ TH LGY FGSPLYDE+I+LCLDLGELDA+IAIV Sbjct: 810 TIGDCAVILRAALRAPMPSAFLKILQTTHSLGYSFGSPLYDEIITLCLDLGELDAAIAIV 869 Query: 279 ADLETSGIEVADETLDRVISARQ 211 AD+ET+GI V D+TLD+VISARQ Sbjct: 870 ADMETTGITVPDQTLDKVISARQ 892 >gb|AAF26788.1|AC016829_12 hypothetical protein [Arabidopsis thaliana] Length = 913 Score = 1150 bits (2974), Expect = 0.0 Identities = 593/865 (68%), Positives = 679/865 (78%), Gaps = 14/865 (1%) Frame = -3 Query: 2763 QRKPRRKKQQS--DLKSSDDN-GYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAG 2593 ++KPRR+++Q D +DD+ + GE S+ E+ LRL FM+ELMERAR+ +G Sbjct: 36 EKKPRRRRKQKRGDGAENDDSLSFGSGEA----VSALERSLRLTFMDELMERARNRDTSG 91 Query: 2592 VSDVIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVR 2413 VS+VIYDMIAAGL+PGPRSFHGLVV+H LN DE+G+MH+LR++L G RPL ET +ALVR Sbjct: 92 VSEVIYDMIAAGLSPGPRSFHGLVVAHALNGDEQGAMHSLRKELGAGQRPLPETMIALVR 151 Query: 2412 LFGSKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRAT 2233 L GSKG ATRGLEILAAMEKL YDIRQAWL+LVEEL+R HLEDAN+VFLKGA GG+RAT Sbjct: 152 LSGSKGNATRGLEILAAMEKLKYDIRQAWLILVEELMRINHLEDANKVFLKGARGGMRAT 211 Query: 2232 NRLYDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFE 2053 ++LYDL+IEEDCK GDHSNAL I+YEMEAAGRMATTFHFNCLLSVQATCGIPE+A++TFE Sbjct: 212 DQLYDLMIEEDCKAGDHSNALDISYEMEAAGRMATTFHFNCLLSVQATCGIPEVAYATFE 271 Query: 2052 NMEYGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LL 1885 NMEYG +MKPDTETYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LL Sbjct: 272 NMEYGEGLFMKPDTETYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRVQPNVKTYALL 331 Query: 1884 VECFTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDAL 1705 VECFTKYCV++EAIRHFRALKNFEGGT +LH G + DPLSLYLRALCREGRIVEL+DAL Sbjct: 332 VECFTKYCVVKEAIRHFRALKNFEGGTVILHNAGNFEDPLSLYLRALCREGRIVELIDAL 391 Query: 1704 ETMAKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKR 1525 + M KD Q IPPRAMI+SRKYRTLVSSWIEPLQEEAELG+EIDY+ARY+EEGGLTGERKR Sbjct: 392 DAMRKDNQPIPPRAMIMSRKYRTLVSSWIEPLQEEAELGYEIDYLARYIEEGGLTGERKR 451 Query: 1524 WVPRRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISES 1345 WVPRRGKTPLDPDA G+ YSNP+ETSFKQRCLE+WK+HHRKLLRTL++EG VLG+ SES Sbjct: 452 WVPRRGKTPLDPDASGFIYSNPIETSFKQRCLEDWKVHHRKLLRTLQSEGLPVLGDASES 511 Query: 1344 DYIRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKAR 1165 DY+RV ERLR IIKGP N LKPKAASKM+VSELKEELEAQGLP DGTRNVLYQRVQKAR Sbjct: 512 DYMRVVERLRNIIKGPALNLLKPKAASKMVVSELKEELEAQGLPIDGTRNVLYQRVQKAR 571 Query: 1164 RINRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNEN--HSK 991 RIN+SRGR LI RIKLHEGDTEFW++RFLGEGL E SK Sbjct: 572 RINKSRGRPLWVPPIEEEEEEVDEEVDDLICRIKLHEGDTEFWKRRFLGEGLIETSVESK 631 Query: 990 PLEEE---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSD 820 E + + D Sbjct: 632 ETTESVVTGESEKAIEDISKEADNEEDDDEEEQEGDEDDDENEEEEVVVPETENRAEGED 691 Query: 819 RVKDKEAEAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAF 640 VK+K A+A K LQMIGVQLLK+SD+ ++E DWFPE+ +EAF Sbjct: 692 LVKNKAADAKKHLQMIGVQLLKESDE-ANRTKKRGKRASRMTLEDDADEDWFPEEPFEAF 750 Query: 639 KEMRNRKVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNL--VIELGG 466 KEMR RKVFDV DMYT+AD WGWTW+K+FKNK PR+WSQEWEVELA+ +M VIELGG Sbjct: 751 KEMRERKVFDVADMYTIADVWGWTWEKDFKNKTPRKWSQEWEVELAIVLMTKAGVIELGG 810 Query: 465 TPTIGDCAMVLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIA 286 PTIGDCA++LRAA+RAPMPSAFL+ILQ TH LGY FGSPLYDE+I+LCLDLGELDA+IA Sbjct: 811 IPTIGDCAVILRAALRAPMPSAFLKILQTTHSLGYSFGSPLYDEIITLCLDLGELDAAIA 870 Query: 285 IVADLETSGIEVADETLDRVISARQ 211 IVAD+ET+GI V D+TLD+VISARQ Sbjct: 871 IVADMETTGITVPDQTLDKVISARQ 895 >ref|XP_003590907.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355479955|gb|AES61158.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 2047 Score = 1128 bits (2917), Expect = 0.0 Identities = 591/900 (65%), Positives = 677/900 (75%), Gaps = 49/900 (5%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 +++ R+KKQ D DD + E LR FMEELM RAR+ + GVS Sbjct: 34 EKRTRKKKQVKD----DD-------------TLLENSLRFSFMEELMNRARNRDSTGVSQ 76 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 V+YDMIAAGL+PGPRSFHGLVVS+ LN +E+ +M +LRR+L GLRP+HETF+ALVRLFG Sbjct: 77 VMYDMIAAGLSPGPRSFHGLVVSYALNGNEQAAMDSLRRELGAGLRPIHETFVALVRLFG 136 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 SKG +TRGLEIL AME LNYDIR AW++L+EELVR+KHLEDAN+VFLKGA+GGLRAT+ L Sbjct: 137 SKGHSTRGLEILGAMENLNYDIRHAWIILIEELVRNKHLEDANKVFLKGAKGGLRATDEL 196 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDLLIEEDCK GDHSNAL I+YEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TFENME Sbjct: 197 YDLLIEEDCKAGDHSNALEISYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFTTFENME 256 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 Y GE YMKPDTETYNWVIQAYTRA+SYDRVQDVAEL+GMMVED+KR+QPNV LLVEC Sbjct: 257 Y-GEDYMKPDTETYNWVIQAYTRADSYDRVQDVAELLGMMVEDHKRVQPNVKTHALLVEC 315 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYCV+REAIRHFRALKNFEGGTK+LH +G +GDPLSLYLRALCREGRI+++L+ALE M Sbjct: 316 FTKYCVVREAIRHFRALKNFEGGTKILHMDGNHGDPLSLYLRALCREGRIIDMLEALEAM 375 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 A D QQIPPRAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARYVEEGGLTGERKRWVP Sbjct: 376 ANDNQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYVEEGGLTGERKRWVP 435 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGN-ISESDY 1339 R GKTPLDPDA+G+ YSNPMETSFKQRCLEE K++H+KLL+ LR EG V LG+ SESDY Sbjct: 436 RSGKTPLDPDADGFIYSNPMETSFKQRCLEEKKVYHKKLLKKLRYEGIVALGDGASESDY 495 Query: 1338 IRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRI 1159 +RV E L+KIIKGPEQN LKPKAASKM+V+ELKEELEAQGLP DGTRNVLYQRVQKARRI Sbjct: 496 VRVIEWLKKIIKGPEQNALKPKAASKMLVNELKEELEAQGLPIDGTRNVLYQRVQKARRI 555 Query: 1158 NRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEE 979 N+SRGR LISRIKL EG+TE+W++RFLGEGLN ++ ++E Sbjct: 556 NQSRGRPLWVPPIEVEEEEVDEELEALISRIKLEEGNTEYWKRRFLGEGLNGDNGNAMDE 615 Query: 978 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEA 799 Q E Q +R+K+KE Sbjct: 616 --GESESPDVQDYIDVVGDDAKEAEDDEADEDEEEEVEQIEEEIAQVENQDVERIKEKEV 673 Query: 798 EAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSM-EXXXXXDWFPEDIYEAFKEMRNR 622 E+ KPLQMIGVQLLKD ++ +M + DWFP DI+EAFKEMRNR Sbjct: 674 ESKKPLQMIGVQLLKDFNEPSATFKKSSRRRSRRNMVDDDADDDWFPLDIFEAFKEMRNR 733 Query: 621 KVFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMN--------------- 487 +VFDV DMYTLADAWGWTW+KE KN+ P RWSQEWEV+LA+K+M Sbjct: 734 RVFDVSDMYTLADAWGWTWEKELKNRPPHRWSQEWEVDLAIKVMQKATVANTPLDKLNKK 793 Query: 486 ----------------------------LVIELGGTPTIGDCAMVLRAAIRAPMPSAFLQ 391 VI+LGGTPTIGDCA++LRAAI AP+PSAFL Sbjct: 794 EIVRAVILSMCKELKVGYVVRIKYGDNAAVIQLGGTPTIGDCAVILRAAISAPLPSAFLT 853 Query: 390 ILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGIEVADETLDRVISARQ 211 ILQ TH LGY FG PLYDEVISLCLDLGELDA++A+VADLET+GI V+D+TLDRVISA+Q Sbjct: 854 ILQTTHGLGYKFGRPLYDEVISLCLDLGELDAAVAVVADLETTGILVSDQTLDRVISAKQ 913 >ref|XP_007143992.1| hypothetical protein PHAVU_007G119900g [Phaseolus vulgaris] gi|561017182|gb|ESW15986.1| hypothetical protein PHAVU_007G119900g [Phaseolus vulgaris] Length = 887 Score = 1127 bits (2915), Expect = 0.0 Identities = 583/856 (68%), Positives = 672/856 (78%), Gaps = 5/856 (0%) Frame = -3 Query: 2763 QRKPRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSD 2584 +++ R+KKQ D +S+ +NG LR FMEELM+RAR + GVS+ Sbjct: 39 EKRGRKKKQAKDDQSAVENG-----------------LRFSFMEELMDRARLRDSNGVSE 81 Query: 2583 VIYDMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFG 2404 VIYDMIAAG++PGPRSFHGLVVS+ LN EE +M +LRR+L+ GLRP+HETF+ALVRLFG Sbjct: 82 VIYDMIAAGISPGPRSFHGLVVSNALNGHEEAAMESLRRELAAGLRPVHETFMALVRLFG 141 Query: 2403 SKGRATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRL 2224 SKGRA RGL+IL M+ LNYDIRQAW++L+EEL+RSKHLE AN+VF KGA+ GL+AT+ + Sbjct: 142 SKGRANRGLQILGDMQDLNYDIRQAWIVLIEELIRSKHLEGANQVFFKGADIGLKATDEV 201 Query: 2223 YDLLIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENME 2044 YDLLI+EDCK GDHSNAL IAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TFENME Sbjct: 202 YDLLIKEDCKAGDHSNALDIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFATFENME 261 Query: 2043 YGGEAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVEC 1876 Y GE YMKPDT+TYNWVIQAYTRAESYDRVQDVAEL+GMMVED+KR+QPNV LLVEC Sbjct: 262 Y-GEDYMKPDTDTYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRIQPNVKTHALLVEC 320 Query: 1875 FTKYCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETM 1696 FTKYCV+REAIRHFRALK+FE GTK+LH EG +GDPLSLYLRALCREGRIVE+L+ALE M Sbjct: 321 FTKYCVVREAIRHFRALKHFEEGTKVLHDEGNHGDPLSLYLRALCREGRIVEMLEALEVM 380 Query: 1695 AKDKQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVP 1516 AKD Q IP RAMILSRKYRTLVSSWIEPLQEEAELG+EIDYIARY+EEGGLTGERKRWVP Sbjct: 381 AKDNQLIPSRAMILSRKYRTLVSSWIEPLQEEAELGYEIDYIARYIEEGGLTGERKRWVP 440 Query: 1515 RRGKTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGN-ISESDY 1339 RRGKTPLDPDA+G+ YSNPMETSFKQRCLEE + +++KLL+TL+ EG VLG+ +SE DY Sbjct: 441 RRGKTPLDPDAQGFIYSNPMETSFKQRCLEELRDYNKKLLKTLQIEGLAVLGDGVSEYDY 500 Query: 1338 IRVEERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRI 1159 IRV+ERL+K+IKGPEQN LKPKAASKM+V ELKEELEAQGLP DGTRNVLYQRVQKARRI Sbjct: 501 IRVKERLKKLIKGPEQNVLKPKAASKMLVFELKEELEAQGLPIDGTRNVLYQRVQKARRI 560 Query: 1158 NRSRGRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEE 979 NRSRGR LISRIKL EG+TEFW++RFLGEGL + ++ Sbjct: 561 NRSRGRPLWIPPVEEEEEEVDEEVDALISRIKLQEGNTEFWKRRFLGEGLTGDQEMTMD- 619 Query: 978 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEA 799 E E Q DR+K KE Sbjct: 620 --AGKSDVSEVPDDIDVIEDAAKDIEDDEVDEEEEEAEQVEEEVEPAENQDVDRIKVKEV 677 Query: 798 EAAKPLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRK 619 ++ KPLQMIGVQL KDSDQ ++ DWFP D++EAFKEMR RK Sbjct: 678 KSNKPLQMIGVQLFKDSDQPITRSKKFKKSARMQAVN-DDDDDWFPLDVFEAFKEMRKRK 736 Query: 618 VFDVQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAM 439 +FDV DMYTLADAWGWTW++E KNK PRRWSQEWEVELA+K+M VIELGGTPTIGDCA+ Sbjct: 737 IFDVSDMYTLADAWGWTWERELKNKPPRRWSQEWEVELAIKVMQKVIELGGTPTIGDCAV 796 Query: 438 VLRAAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSG 259 +LRAA+RAP+PSAFL ILQ TH LGY FGS LYDE+I LC+DLGELDA++A+VADLET+G Sbjct: 797 ILRAAVRAPLPSAFLTILQTTHGLGYKFGSSLYDEIICLCVDLGELDAAVAVVADLETTG 856 Query: 258 IEVADETLDRVISARQ 211 I V+D+TLDRVISA+Q Sbjct: 857 ILVSDQTLDRVISAKQ 872 >ref|XP_006854340.1| hypothetical protein AMTR_s00039p00135490 [Amborella trichopoda] gi|548858016|gb|ERN15807.1| hypothetical protein AMTR_s00039p00135490 [Amborella trichopoda] Length = 870 Score = 1127 bits (2914), Expect = 0.0 Identities = 579/853 (67%), Positives = 662/853 (77%), Gaps = 5/853 (0%) Frame = -3 Query: 2754 PRRKKQQSDLKSSDDNGYPGGEGGSFTPSSAEKLLRLVFMEELMERARSGSAAGVSDVIY 2575 P R + + +SS E S SS E+ LR FME LM+RAR+G AAG ++V+ Sbjct: 21 PTRGSKVEEKRSSRRKKSQFKEDDSSVSSSVERGLRFAFMEALMDRARAGDAAGTAEVLR 80 Query: 2574 DMIAAGLTPGPRSFHGLVVSHVLNRDEEGSMHALRRQLSEGLRPLHETFLALVRLFGSKG 2395 DM+AAGL+PGPRSFHGL+VS VLN DEEG+M +LR +LS G RPLHETF+AL+ LFGSKG Sbjct: 81 DMVAAGLSPGPRSFHGLIVSQVLNGDEEGAMQSLRMELSAGHRPLHETFIALIHLFGSKG 140 Query: 2394 RATRGLEILAAMEKLNYDIRQAWLLLVEELVRSKHLEDANRVFLKGAEGGLRATNRLYDL 2215 + +G EILAAMEKLNYDIR+AWL L+EEL+R+ HL++AN+VFL+GAEGGLRAT+ LYDL Sbjct: 141 LSVKGQEILAAMEKLNYDIRKAWLTLIEELIRNGHLDNANKVFLRGAEGGLRATDELYDL 200 Query: 2214 LIEEDCKVGDHSNALTIAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFSTFENMEYGG 2035 LIEEDCK GDHSNALT+AYEMEAAGRMATTFHFNCLLSVQATCGIPEIAF+TFENMEYGG Sbjct: 201 LIEEDCKAGDHSNALTVAYEMEAAGRMATTFHFNCLLSVQATCGIPEIAFATFENMEYGG 260 Query: 2034 EAYMKPDTETYNWVIQAYTRAESYDRVQDVAELVGMMVEDYKRLQPNV----LLVECFTK 1867 E +MKPDTE+YNWVIQAYTRAESYDRVQDVAEL+GMMVED+KRLQPNV LLVECFTK Sbjct: 261 EDFMKPDTESYNWVIQAYTRAESYDRVQDVAELLGMMVEDHKRLQPNVRTYALLVECFTK 320 Query: 1866 YCVIREAIRHFRALKNFEGGTKLLHYEGQYGDPLSLYLRALCREGRIVELLDALETMAKD 1687 YCV++EAIRHFRALKNFEGGT++L EG +GDPLSLYLRALCREGRIVELL+ALE MAKD Sbjct: 321 YCVLKEAIRHFRALKNFEGGTRVLCNEGNFGDPLSLYLRALCREGRIVELLEALEAMAKD 380 Query: 1686 KQQIPPRAMILSRKYRTLVSSWIEPLQEEAELGHEIDYIARYVEEGGLTGERKRWVPRRG 1507 Q I PRAMILS+KYRTLVSSWIEPLQEEAELG E+DYIARY+ EGGLT ERKRWVPRRG Sbjct: 381 NQPITPRAMILSKKYRTLVSSWIEPLQEEAELGFEVDYIARYIAEGGLTAERKRWVPRRG 440 Query: 1506 KTPLDPDAEGYAYSNPMETSFKQRCLEEWKIHHRKLLRTLRNEGPVVLGNISESDYIRVE 1327 KTPLDPDA G+AYSNPMETS+KQRCLE K+H+RKLL+ L+ EG LG++SE+DY RV Sbjct: 441 KTPLDPDAIGFAYSNPMETSYKQRCLENLKVHNRKLLKKLKYEGRAALGDVSEADYARVV 500 Query: 1326 ERLRKIIKGPEQNTLKPKAASKMIVSELKEELEAQGLPTDGTRNVLYQRVQKARRINRSR 1147 ERL+K+IKGP+Q LKPKAASKMIVSELKEELEAQGLPTDGTR VLYQRVQKARRINRSR Sbjct: 501 ERLKKVIKGPDQTALKPKAASKMIVSELKEELEAQGLPTDGTRQVLYQRVQKARRINRSR 560 Query: 1146 GRXXXXXXXXXXXXXXXXXXXXLISRIKLHEGDTEFWRQRFLGEGLNENHSKPLEEEXXX 967 GR ISRI+L EG+TEFWR+RFLGEGL K +E E Sbjct: 561 GRPLWVPPVEEEEEEVDEELDEWISRIRLEEGNTEFWRRRFLGEGLGSVPDKKIELEDLD 620 Query: 966 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEQTEIQVSDRVKDKEAEAAK 787 TE Q D VK+KE E K Sbjct: 621 TSNTLDDIDNTDDNPKDMEDDEVDEEEEEI------------TESQEEDGVKEKEVEVVK 668 Query: 786 -PLQMIGVQLLKDSDQXXXXXXXXXXXXXXXSMEXXXXXDWFPEDIYEAFKEMRNRKVFD 610 PLQMIGVQLLKDS +E DWFPED+ EAFKE+R R++FD Sbjct: 669 PPLQMIGVQLLKDSQ---LPTSRRSRRRVRPMVEDDDDDDWFPEDLQEAFKELRERRIFD 725 Query: 609 VQDMYTLADAWGWTWDKEFKNKAPRRWSQEWEVELAVKIMNLVIELGGTPTIGDCAMVLR 430 V DMYT+AD WGWTW++E K K P RWSQE EVELA+KIM+ VIELGG PTIGDCAM+LR Sbjct: 726 VSDMYTIADVWGWTWERELKAKFPERWSQEREVELAIKIMHKVIELGGKPTIGDCAMILR 785 Query: 429 AAIRAPMPSAFLQILQMTHRLGYVFGSPLYDEVISLCLDLGELDASIAIVADLETSGIEV 250 AAIRAP+P+AFL ILQ TH L YVFGSPLYDEVI+ CLDLGELDA++AI+ADLET+GI+V Sbjct: 786 AAIRAPLPAAFLTILQTTHSLDYVFGSPLYDEVITHCLDLGELDAAVAIIADLETTGIKV 845 Query: 249 ADETLDRVISARQ 211 DETLD+V++A+Q Sbjct: 846 PDETLDKVLAAQQ 858