BLASTX nr result

ID: Paeonia25_contig00010659 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00010659
         (1526 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prun...   650   0.0  
ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   644   0.0  
ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citr...   615   e-173
ref|XP_002305605.1| pentatricopeptide repeat-containing family p...   597   e-168
ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   585   e-164
ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily pr...   576   e-162
ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containi...   548   e-153
ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containi...   543   e-152
ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containi...   536   e-150
ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Caps...   436   e-119
ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutr...   426   e-116
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   419   e-114
gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus...   414   e-113
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   360   7e-97
ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp....   341   4e-91
emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|72689...   330   1e-87
ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [A...   279   2e-72
ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [A...   203   1e-49
gb|EPS66849.1| hypothetical protein M569_07924, partial [Genlise...   197   1e-47
ref|XP_006849567.1| hypothetical protein AMTR_s00024p00183850 [A...   195   4e-47

>ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prunus persica]
            gi|462408583|gb|EMJ13917.1| hypothetical protein
            PRUPE_ppa018797mg [Prunus persica]
          Length = 584

 Score =  650 bits (1678), Expect = 0.0
 Identities = 322/498 (64%), Positives = 395/498 (79%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FFNWAK +L F+P+LKS C++I++S+ SG  +P KPILD+LIQTHP S LV+ +  AC
Sbjct: 75   LEFFNWAKVNLRFEPDLKSNCQIIRVSLGSGLVRPVKPILDSLIQTHPVSELVQCITLAC 134

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G+DSQ+  LS VL CYS KG+F EGL+VF++    G +PSV ACNALL+A+Q   E +L
Sbjct: 135  KGTDSQSTTLSFVLGCYSRKGLFREGLEVFRKMNVLGCVPSVVACNALLNAIQRENEIRL 194

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            AWCFYG +IRNGVLPDR TWS++A+ILCK GK ERI+ +L + IYNS++YNL++D   K 
Sbjct: 195  AWCFYGLMIRNGVLPDRFTWSLVAQILCKDGKFERILRLLDLNIYNSMMYNLLVDGCSKS 254

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F AAF  L+EMC+RK+DP FSTYSSILDGACK GNVEVVE++   MVEKKL+     S
Sbjct: 255  GNFDAAFSHLNEMCDRKVDPDFSTYSSILDGACKLGNVEVVERVTSVMVEKKLLPNCPLS 314

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 627
            EYD IV+KLCDLGKT+AAEMFFK+A D+KIGLQD TYG  L+AL+ E R KEA+ +Y  I
Sbjct: 315  EYDSIVEKLCDLGKTHAAEMFFKKACDEKIGLQDGTYGLMLKALTNEVRTKEAISVYRLI 374

Query: 626  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 447
            SERG  V+G SY+AFA+VLCKE+  EE  ELL D+I RG SP ASELS FI+  C++G+W
Sbjct: 375  SERGIVVDGSSYHAFADVLCKEERYEEGFELLMDVISRGCSPSASELSCFISFLCRRGRW 434

Query: 446  KEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 267
            +EAE LLN++L+KGLLPD  CC  LVG YCS +QIDSAI LHNK+EKL GSLDVTTYNVL
Sbjct: 435  REAEYLLNVVLDKGLLPDLICCSPLVGRYCSGRQIDSAIALHNKMEKLNGSLDVTTYNVL 494

Query: 266  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 87
            L+GLF  RR+EEA+RVF+YMR   L+SS SF+IMIRGLC  KELRKAMK+HDEMLK+ LK
Sbjct: 495  LSGLFAARRIEEAMRVFDYMRRHNLMSSASFTIMIRGLCGVKELRKAMKIHDEMLKMRLK 554

Query: 86   PERTAYKRLIWGFKTRLS 33
            P+   YKRLI GF+  LS
Sbjct: 555  PDAATYKRLISGFQVTLS 572


>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Vitis vinifera]
          Length = 569

 Score =  644 bits (1661), Expect = 0.0
 Identities = 318/495 (64%), Positives = 398/495 (80%), Gaps = 1/495 (0%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            LSFFNW +++LGFQP+L +  ++I+ISIQSG FQPAK ILD+LI+T   SVLV+S+IQAC
Sbjct: 78   LSFFNWVRTNLGFQPDLAAHSQIIRISIQSGLFQPAKGILDSLIETQKVSVLVDSVIQAC 137

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G DS++ +L  VLECYS KG+FIE L+VF+     GY+PSV +CNALL +LQ   E KL
Sbjct: 138  RGKDSESPVLGFVLECYSSKGLFIEALEVFRRITIHGYVPSVRSCNALLDSLQRENEIKL 197

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIY-NSVIYNLVIDSYCK 990
            AWC  GA+IRNGVLPD   +  +A ILCK+GKLER+V +L M I  N++IY LVID YC+
Sbjct: 198  AWCVCGALIRNGVLPD---YVRIALILCKNGKLERVVRLLDMSIVCNALIYKLVIDCYCE 254

Query: 989  RGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLP 810
            RG+F AAF  L+EMCNRK DPGF  Y+SILDGACKY N EV++ ++ SMVEK L+   L 
Sbjct: 255  RGNFSAAFHYLNEMCNRKFDPGFCAYNSILDGACKYENDEVIQIVMGSMVEKGLLPKLLL 314

Query: 809  SEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
            SEYD I+QK+C+LGKT+AA+MFFKRA ++KI L +ATYGC LRAL+K+GRVKEA+ +Y  
Sbjct: 315  SEYDSIIQKICNLGKTHAAQMFFKRARNEKIELDNATYGCMLRALAKDGRVKEAIGVYLV 374

Query: 629  ISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 450
            I E G TV    Y+AF NVLC+EDPS+EVS+L+ ++IG+GFSPC S+LSKFIT  CK G+
Sbjct: 375  ILESGVTVKDGCYHAFVNVLCEEDPSQEVSKLMGEIIGKGFSPCGSKLSKFITSLCKNGR 434

Query: 449  WKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 270
            W EA+DLLN+ +EKGLLPDSFCC +LV +YC  +QIDS+I LH KI+K+KGSLDV TYNV
Sbjct: 435  WTEADDLLNVTIEKGLLPDSFCCSALVEHYCRSRQIDSSIALHEKIKKVKGSLDVATYNV 494

Query: 269  LLNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGL 90
            LLNGLF+E+R+E+AV VF+ MR Q L+SS SF+IM+ GLCRE+ELRKAMK HDEMLK+GL
Sbjct: 495  LLNGLFMEKRIEDAVSVFDCMRSQNLLSSTSFTIMVSGLCRERELRKAMKFHDEMLKMGL 554

Query: 89   KPERTAYKRLIWGFK 45
            KP+R  YKRLI GFK
Sbjct: 555  KPDRATYKRLISGFK 569


>ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citrus clementina]
            gi|557551699|gb|ESR62328.1| hypothetical protein
            CICLE_v10018367mg [Citrus clementina]
          Length = 578

 Score =  615 bits (1585), Expect = e-173
 Identities = 303/494 (61%), Positives = 385/494 (77%), Gaps = 1/494 (0%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L+FF W K+SL F+P+L S+C +I++ + SG  +   PILD+LIQTH A+VL  SMIQ+C
Sbjct: 84   LNFFYWIKTSLHFEPDLISQCHIIRLLLGSGQTERINPILDSLIQTHTATVLTHSMIQSC 143

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G DSQ+  LS VL+CYSHKG+F++GL+V++  R  G++P+V ACNALL AL    E +L
Sbjct: 144  EGRDSQSDALSLVLDCYSHKGLFMDGLEVYRMMRVYGFVPAVSACNALLDALYRQNEIRL 203

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C YGA+IR+GV P++ TWS++A+ILC+SGK E ++G+L  GIY+SV+YNLVID Y K+
Sbjct: 204  ASCLYGAMIRDGVSPNKFTWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKK 263

Query: 986  GDFRAAFDQLDEMCN-RKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLP 810
            GDF AAFD+L+EMCN R L PGFSTYSSILDG C+Y   EV ++I+  MVEKKL+  +  
Sbjct: 264  GDFGAAFDRLNEMCNGRNLTPGFSTYSSILDGGCRYEKTEVSDRIVGLMVEKKLLPKNFL 323

Query: 809  SEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
            S  D ++QKL D+GKTYAAEM FKRA D+KI LQD TYGC L+ALSKEGRVKE ++IYH 
Sbjct: 324  SGNDSVIQKLSDMGKTYAAEMIFKRACDEKIELQDDTYGCMLKALSKEGRVKEVIQIYHL 383

Query: 629  ISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 450
            ISERG TV    YYAF NVLCKE   EEV  LLRD++ RG+ PCA ELS+F+  QC KGK
Sbjct: 384  ISERGITVKDSDYYAFVNVLCKEHQPEEVCGLLRDVVERGYIPCAMELSRFVASQCGKGK 443

Query: 449  WKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 270
            WKE E+LL+ +L++GLL DSFCC SL+ YYCS +QID AI LH KIEKLKGSLDV TY+V
Sbjct: 444  WKEVEELLSAVLDQGLLLDSFCCSSLMEYYCSNRQIDKAIALHIKIEKLKGSLDVATYDV 503

Query: 269  LLNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGL 90
            LL+GLF + R+EEAV++F+YM+  K+VSS SF I++  LC  KELRKAMK+HDEMLK+G 
Sbjct: 504  LLDGLFKDGRMEEAVQIFDYMKELKVVSSSSFVIVVSRLCHLKELRKAMKIHDEMLKMGH 563

Query: 89   KPERTAYKRLIWGF 48
            KP+   YK++I GF
Sbjct: 564  KPDEATYKQVISGF 577


>ref|XP_002305605.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222848569|gb|EEE86116.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 564

 Score =  597 bits (1539), Expect = e-168
 Identities = 289/493 (58%), Positives = 376/493 (76%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FFNW +++L  +P+LKS+C +I I + SG   P +PI+D+L++TH  SVL E+M+ +C
Sbjct: 70   LRFFNWVQTNLKLKPDLKSQCHIINICVNSGLTLPVRPIMDSLVKTHHVSVLGEAMVDSC 129

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G   ++   S VLECYSHKG+F+E L++F++ RG G++ S  ACN++L  LQ   E KL
Sbjct: 130  RGKSLKSDAFSFVLECYSHKGLFMESLEMFRKMRGNGFIASGTACNSVLDVLQRENEIKL 189

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            AWCFY A+I++GVLPD+ TWS++A+ILCK G  ERIV  L MG+YNSV+YN VID   KR
Sbjct: 190  AWCFYCAMIKDGVLPDKLTWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKR 249

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            GDF AAF++L++MC RKLDPGFSTYS+ILDGACK+GN EV+E+++  M EK L+     S
Sbjct: 250  GDFEAAFERLNQMCERKLDPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLS 309

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 627
            + D ++QK  DL K   A MFF+RA D+KIGLQDATYGC L+ALSKE RVKEA+ +Y  I
Sbjct: 310  QCDSVIQKFSDLCKMNVATMFFRRACDEKIGLQDATYGCMLKALSKEARVKEAIGLYSLI 369

Query: 626  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 447
            SE+G  V   +Y+AF ++L +ED  EE  E+L D++ RGF P    LSKFI L  +K +W
Sbjct: 370  SEKGIRVKDSTYHAFLDLLSEEDQYEEGYEILGDMMRRGFRPGTVGLSKFILLLSRKRRW 429

Query: 446  KEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 267
            +E EDLL+L+LEKGLLPDS CCCSLV +YCSR+QID A+ LHNK+EKL+ SLDV TYN+L
Sbjct: 430  REVEDLLDLVLEKGLLPDSLCCCSLVEHYCSRRQIDKAVALHNKMEKLQASLDVATYNIL 489

Query: 266  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 87
            L+GL    R+EE VRVF+YM+  KLV+SESF+I IRGLCR KE+RKAMK+HDEML +GLK
Sbjct: 490  LDGLVKNGRIEEVVRVFDYMKGLKLVNSESFTITIRGLCRAKEMRKAMKLHDEMLDMGLK 549

Query: 86   PERTAYKRLIWGF 48
            P++ AYKRLI  F
Sbjct: 550  PDKAAYKRLILEF 562


>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223542372|gb|EEF43914.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score =  585 bits (1509), Expect = e-164
 Identities = 288/495 (58%), Positives = 379/495 (76%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L+FFNWAK++L F P+LKS+C +IQ+S+ S   + AK ILD+LI+T+P+++ +E+M+QAC
Sbjct: 99   LNFFNWAKTNLNFNPDLKSQCHVIQLSLGSDLPRAAKKILDSLIKTYPSNLFLETMVQAC 158

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S    L+ VLE YSHKG F+EGL+V+K+ R  G  PSV ACN LL ALQ  +E +L
Sbjct: 159  RGKSSLLCTLNFVLEFYSHKGSFLEGLEVYKKMRVIGCTPSVHACNVLLDALQRESEIRL 218

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            AWCFY A+IR GVLPD+ TWS++A ILCK G  ERIV +L MGI NSV+YN V+D Y K 
Sbjct: 219  AWCFYCAMIRVGVLPDKFTWSLVAHILCKDGNFERIVKLLDMGICNSVMYNAVVDYYSKN 278

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            GDF+AAF +L+EM +RK++PGFSTYSSILDGACK  N++V+E+++  MV K+L+     S
Sbjct: 279  GDFKAAFCRLNEMYDRKVEPGFSTYSSILDGACKCRNLQVIERVVAIMVGKQLLSKCPSS 338

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 627
            +YD I+QKLCDLGK  AA +FFKRA D++IGLQDATYG  LRA S EG ++EA+ +Y  I
Sbjct: 339  DYDSIIQKLCDLGKVSAATLFFKRACDERIGLQDATYGRMLRAFSIEGILEEAIGLYQVI 398

Query: 626  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 447
             ERG T+   +  AF ++L ++D   E  E++RD++ RGFSPC S LSK+ITL CKK +W
Sbjct: 399  LERGLTIKDNASDAFVDLLSEKDQYAEGYEIVRDIMRRGFSPCTSSLSKYITLLCKKRRW 458

Query: 446  KEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 267
            KEAE+LL ++LEKGLLPD+   CSLV +YCS KQ D A+ LHN +EKL+ SLD+T YN+L
Sbjct: 459  KEAEELLYMVLEKGLLPDTLSFCSLVKHYCSSKQTDKALALHNTLEKLQASLDITAYNLL 518

Query: 266  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 87
            L GL  E RVEE+++VF+YM+  KL +S SF+++IRGLCR KELRKAMK+HDEML +GLK
Sbjct: 519  LGGLVKEGRVEESIKVFDYMKGLKLANSASFTVIIRGLCRAKELRKAMKLHDEMLNMGLK 578

Query: 86   PERTAYKRLIWGFKT 42
            P++  YKRLI  F +
Sbjct: 579  PDKPTYKRLILEFNS 593


>ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily protein, putative
            [Theobroma cacao] gi|508781360|gb|EOY28616.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative [Theobroma cacao]
          Length = 578

 Score =  576 bits (1485), Expect = e-162
 Identities = 283/494 (57%), Positives = 377/494 (76%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L+FFNW K+ LGF+P+LKS+C +IQI I S   +  +P +++LIQ+HPA ++ +SMIQAC
Sbjct: 85   LTFFNWVKTHLGFKPDLKSQCHIIQIVIGSDLCRCVEPAVNSLIQSHPAPIVADSMIQAC 144

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G + Q+  LSSV++CYS  G+F+EGL+VF++ R  G+ PSVCACN LL ALQ   E KL
Sbjct: 145  KGKNFQSSALSSVIKCYSKHGLFMEGLEVFRKMRIHGFTPSVCACNELLDALQRGNEVKL 204

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            AW F GA++R G+ PD+ +WS++A+ILCK+GKL ++VG+L  GIYNS IY+LVID Y K 
Sbjct: 205  AWGFLGAMLRVGIEPDQFSWSLVAQILCKNGKLGKVVGLLEKGIYNSEIYDLVIDFYSKS 264

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            GDF AAF++L+EM NRK+D  F TYSSILDGACKY + EV+ +I+R MVEK+LV     S
Sbjct: 265  GDFGAAFNRLNEMYNRKVDTSFCTYSSILDGACKYNDGEVIGRILRMMVEKELVPRHQFS 324

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 627
            + DLI+ KLCDL KT+AAEM FK+A D+ I L++ TYG  L+ALS+E R+ EA+ +   I
Sbjct: 325  KKDLIIPKLCDLRKTHAAEMLFKKACDENIRLRNDTYGSMLKALSQEARIDEAIEVCRMI 384

Query: 626  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 447
             +R   VN   Y AF N LCKED S++  ELL D+I RG +PCAS+LSK+I+ QC +  W
Sbjct: 385  LKRRIIVNESCYSAFINALCKEDQSDDGYELLVDIIKRGHNPCASKLSKYISSQCSQMNW 444

Query: 446  KEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 267
            ++AE+LL+L+LEKGLLPDSF CC L+ YYC  +Q+D  + LH+K+EK+KG LDVTTYN++
Sbjct: 445  RKAEELLDLMLEKGLLPDSFGCCLLIQYYCFNRQVDKIVALHDKMEKVKGCLDVTTYNMI 504

Query: 266  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 87
            L+ L+ ER+ EEAVRV++YM    LV S SF+IMIR LC  KE++KAMK+HDEML +GLK
Sbjct: 505  LDVLWGERKAEEAVRVYDYMTGLNLVDSASFTIMIRELCHMKEMKKAMKIHDEMLNMGLK 564

Query: 86   PERTAYKRLIWGFK 45
            P++  YKRLI GFK
Sbjct: 565  PDKGTYKRLISGFK 578


>ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Citrus sinensis]
          Length = 538

 Score =  548 bits (1411), Expect = e-153
 Identities = 284/494 (57%), Positives = 352/494 (71%), Gaps = 1/494 (0%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L+FF W K+SL F+P+L S+C +I++ + SG  +  KP LD+LIQTH A+VL  SMIQ+C
Sbjct: 84   LNFFYWIKTSLHFEPDLISQCHIIRLLLGSGQTERIKPSLDSLIQTHTATVLTHSMIQSC 143

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
                                                     V ACNALL AL    E +L
Sbjct: 144  E----------------------------------------VSACNALLDALYRQNEIRL 163

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C YGA++R+GV P++ TWS++A+ILC+SGK E ++G+L  GIY+SV+YNLVID Y K+
Sbjct: 164  ASCLYGAMVRDGVSPNKFTWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKK 223

Query: 986  GDFRAAFDQLDEMCN-RKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLP 810
            GDF AAFD+L+EMCN R L PGFSTYSSILDGA +Y   EV ++I+  MVEKKL+     
Sbjct: 224  GDFGAAFDRLNEMCNGRNLTPGFSTYSSILDGARRYEKTEVSDRIVGLMVEKKLLPKHFL 283

Query: 809  SEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
            S  D ++QKL D+GKTYAAEM FKRA D+KI LQD TYGC L+ALSKEGRVKEA++IYH 
Sbjct: 284  SGNDYVIQKLSDMGKTYAAEMIFKRACDEKIELQDDTYGCMLKALSKEGRVKEAIQIYHL 343

Query: 629  ISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 450
            ISERG TV    YYAF NVLCKE   EEV  LLRD++ RG+ PCA ELS+F+  QC KGK
Sbjct: 344  ISERGITVRDSDYYAFVNVLCKEHQPEEVCGLLRDVVERGYIPCAMELSRFVASQCGKGK 403

Query: 449  WKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 270
            WKE E+LL+ +L+KGLL DSFCC SL+ YYCS +QID AI LH KIEKLKGSLDV TY+V
Sbjct: 404  WKEVEELLSAVLDKGLLLDSFCCSSLMEYYCSNRQIDKAIALHIKIEKLKGSLDVATYDV 463

Query: 269  LLNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGL 90
            LL+GLF + R+EEAVR+F+YM+  K+VSS SF I++  LC  KELRKAMK HDEMLK+G 
Sbjct: 464  LLDGLFKDGRMEEAVRIFDYMKELKVVSSSSFVIVVSRLCHLKELRKAMKNHDEMLKMGH 523

Query: 89   KPERTAYKRLIWGF 48
            KP+   YK++I GF
Sbjct: 524  KPDEATYKQVISGF 537


>ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Solanum lycopersicum]
          Length = 584

 Score =  543 bits (1399), Expect = e-152
 Identities = 267/495 (53%), Positives = 365/495 (73%), Gaps = 2/495 (0%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++AK++LGFQP+ K  C L+ I + SG  +PAKPILD LIQT+P + +V  +IQ+ 
Sbjct: 89   LRFFHYAKNNLGFQPDAKVLCTLVYILLGSGLSRPAKPILDTLIQTYPPAQIVGFLIQSL 148

Query: 1346 SGSDS--QALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAET 1173
               +   Q+ +LSSVLECY +KG+F+E LQV++  R  GY  SV  CN LL+ L    + 
Sbjct: 149  KAGEIHIQSSVLSSVLECYCNKGLFLEALQVYQIVREYGYFVSVNCCNTLLNLLLSKNDL 208

Query: 1172 KLAWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYC 993
            +L WC+YG++IRNGV  +  TWS++A++LCK GK E+IV +L  G+ + +IYN++ID Y 
Sbjct: 209  RLGWCYYGSIIRNGVQENVVTWSLIAQMLCKDGKFEKIVAILDKGVCSPLIYNILIDCYS 268

Query: 992  KRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDL 813
            +RG F AAF  L++M + ++DP FST+SSILDGACKY N +V+E ++ SMVEK  +   +
Sbjct: 269  ERGKFDAAFGYLNDMYSERIDPTFSTFSSILDGACKYQNAQVIESVMSSMVEKGHLPKVV 328

Query: 812  PSEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYH 633
              +YD ++QK   +GK YAAE+FF+ A +  I LQD TYG  LRA SKEG+ ++A+ +Y+
Sbjct: 329  TPDYDSVIQKFSGIGKAYAAELFFREAYEKSIKLQDKTYGSMLRAFSKEGKAEDAIWMYN 388

Query: 632  AISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKG 453
             I ER   +NGK Y AF +VLC E PS EVS LL+DLIGRGF P  S++SKFI  QC+K 
Sbjct: 389  IIVERKIFINGKCYSAFMSVLCNEIPSVEVSSLLKDLIGRGFVPPVSQVSKFIVSQCEKH 448

Query: 452  KWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYN 273
            +WKEAE+LLN+I +KGL  +SFCCCSLV +YC  ++IDSAI LH ++E+L  +LDV TY 
Sbjct: 449  QWKEAEELLNVIFQKGLQFESFCCCSLVRHYCFSRRIDSAISLHTELERLGVALDVETYG 508

Query: 272  VLLNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLG 93
            +LL+ LF  RR EEA+++F+YMR   ++SS SFSIMIRGLC+E+E RKAM++HD+MLKLG
Sbjct: 509  LLLDRLFKSRRHEEALKIFDYMRTHDMLSSGSFSIMIRGLCQEEEFRKAMRLHDDMLKLG 568

Query: 92   LKPERTAYKRLIWGF 48
             KP++ AYKRLI GF
Sbjct: 569  FKPDKKAYKRLISGF 583


>ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            isoform X1 [Solanum tuberosum]
            gi|565362693|ref|XP_006348080.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g21170-like isoform X2 [Solanum tuberosum]
            gi|565362695|ref|XP_006348081.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g21170-like isoform X3 [Solanum tuberosum]
          Length = 584

 Score =  536 bits (1381), Expect = e-150
 Identities = 263/495 (53%), Positives = 368/495 (74%), Gaps = 2/495 (0%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++AK++LGFQP+ K  C L+ I + SG  +PAKPILD LIQT+P + +V  +IQ+ 
Sbjct: 89   LRFFDYAKNNLGFQPDAKVLCTLVYILLGSGLSKPAKPILDTLIQTYPPAQIVGFLIQSL 148

Query: 1346 SGSDS--QALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAET 1173
               +   Q+ +LSSVLECY +KG+F+E LQV++  R  GY  SV  CN LL+ L    E 
Sbjct: 149  KVGEIHIQSSVLSSVLECYCNKGLFLEALQVYQIVREYGYFVSVNCCNTLLNLLLSKNEL 208

Query: 1172 KLAWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYC 993
            +L WC++G++IRNGV  +  TWS++A++LCK GK E+IV +L  G+ + V+YN++ID Y 
Sbjct: 209  RLGWCYFGSIIRNGVQENVVTWSLIAQMLCKDGKFEQIVPILDKGVCSPVMYNILIDCYS 268

Query: 992  KRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDL 813
            +RG+F AAF  L++M ++ +DP F+T+SSILDGACKY N EV+E ++ SMVEK  +   +
Sbjct: 269  ERGNFEAAFGYLNDMYSKCIDPTFNTFSSILDGACKYQNAEVIESVMSSMVEKGHLPKVV 328

Query: 812  PSEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYH 633
              +YD ++++  D+GK YAAE+FF+ A + +I LQD TYG  LRA SKEG+ ++A+ +Y+
Sbjct: 329  LPDYDSVIRRFSDMGKAYAAELFFREAYEKRIKLQDNTYGSMLRAFSKEGKAEDAIWMYN 388

Query: 632  AISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKG 453
             I ER   ++ K Y AF +VLC E+PS EVS LL+DLIGRGF P  S++SKFI  QC+K 
Sbjct: 389  IIVERKIFISDKCYSAFMSVLCNENPSLEVSSLLKDLIGRGFVPPVSQVSKFIVSQCEKR 448

Query: 452  KWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYN 273
            +WKEAE+LLN+I ++ L  +SFCCCSLV +YC  ++IDSAI LH ++E+L  +LDV TY 
Sbjct: 449  QWKEAEELLNVIFQRRLQFESFCCCSLVRHYCFSRRIDSAISLHTELERLGVALDVETYG 508

Query: 272  VLLNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLG 93
            +LL+ LF  RR EEA+++F+YMR   ++SSESFSIMIRGLC+E+E RKAM++HD+MLKLG
Sbjct: 509  LLLDSLFKSRRREEALKIFDYMRTHDMLSSESFSIMIRGLCQEQEFRKAMRLHDDMLKLG 568

Query: 92   LKPERTAYKRLIWGF 48
             KP++ AYKRLI GF
Sbjct: 569  FKPDKKAYKRLISGF 583


>ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Capsella rubella]
            gi|482553811|gb|EOA18004.1| hypothetical protein
            CARUB_v10006439mg [Capsella rubella]
          Length = 585

 Score =  436 bits (1121), Expect = e-119
 Identities = 220/499 (44%), Positives = 341/499 (68%), Gaps = 5/499 (1%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++A++ L F P++KS+C++I+++ +SG  + A+ +L  L++T+  S++V S+ + C
Sbjct: 87   LDFFDFAQTHLHFDPDVKSQCRVIEVATESGLLERAETLLRPLVETNSVSLVVGSLQKCC 146

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S ++ LS VLECY+ KG +  GL+VF   R     PS+ A N+LL +L    + ++
Sbjct: 147  EGEVSLSISLSLVLECYALKGCYQNGLEVFGFMRRLRLSPSLRAYNSLLDSLIKEGQFRV 206

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C Y A++RN V+ D  TW ++A+ILC+ G+ + +V ++  G+ +  IY  +++ Y + 
Sbjct: 207  ALCLYSAMVRNQVVSDGFTWDLVAQILCEQGRSKSVVKLMETGVESCKIYTNLVECYSRN 266

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F A F+ + EM N+KL+  FS+YS +LD  C+ G+ E++ K++  MVEKK + +D  +
Sbjct: 267  GEFDAVFNVIHEMDNKKLELSFSSYSCVLDDVCRLGDAELMGKVLGLMVEKKFLAVDASA 326

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
              D I+++LCD+GKT+A+EM F++A + + + L+D TYGC L+ALS++GR KEAV +Y  
Sbjct: 327  VNDEIIERLCDMGKTFASEMLFRKACNGETVRLRDGTYGCMLKALSRKGRTKEAVDVYRL 386

Query: 629  ISERGATVNGKSYYA-FANVLCKEDPS-EEVSELLRDLIGRGFSPCASELSKFITLQCKK 456
            I  +G TV  +S Y  FAN LC++D S EE  ELL D+I RGF PC   LS+ +   C+K
Sbjct: 387  ICRKGITVLDESCYTEFANALCRDDNSPEEELELLVDVIKRGFVPCTRRLSEVLASLCRK 446

Query: 455  GKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTY 276
             +W+ AE LL+ ++E  +  DSF C  L+  YC   ++D A+ LH +I+K+KGSLDV  Y
Sbjct: 447  RRWRHAEKLLDSVMEMEVYFDSFSCGILMERYCRSGKLDKAMELHERIKKMKGSLDVNAY 506

Query: 275  NVLLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEML 102
            N +L+ L + +R  VEEAVRVF YM+  K V+S+SF+IMI+GLC  KE++KA + HDEML
Sbjct: 507  NAVLDRLMMRQREMVEEAVRVFEYMKEMKSVNSKSFTIMIQGLCHVKEMKKAKQSHDEML 566

Query: 101  KLGLKPERTAYKRLIWGFK 45
            KLG+KP+   YKR+I+GFK
Sbjct: 567  KLGMKPDLATYKRVIYGFK 585


>ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutrema salsugineum]
            gi|557114982|gb|ESQ55265.1| hypothetical protein
            EUTSA_v10024760mg [Eutrema salsugineum]
          Length = 584

 Score =  426 bits (1096), Expect = e-116
 Identities = 223/499 (44%), Positives = 329/499 (65%), Gaps = 5/499 (1%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF+WAK+ L F+P+LKS C++IQ++ ++G  + A+  +  LI+TH   V+V SM +  
Sbjct: 87   LDFFDWAKTHLRFEPDLKSCCRVIQVATETGLLERAEAFVRPLIETHSVCVIVGSMHRWF 146

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S +  LS VLECY+ KG +  GL+VF   R     PS+ A N+LL +L    + +L
Sbjct: 147  EGEVSLSTSLSLVLECYALKGSYQNGLEVFGSMRRLRLSPSLRAYNSLLDSLVKEKQFRL 206

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C Y A++RN V+ D  TW ++A++LC+ GK + +V ++  G+ +  IY  +++ Y + 
Sbjct: 207  ALCLYSAMVRNRVVSDGLTWDLVAQVLCEQGKFKSVVKLMETGVESCKIYTNLVECYSRN 266

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F A F  + EM  +KL+  F +Y  +LD AC+ G+ E+++K++  MVEK+ + +D  +
Sbjct: 267  GEFDAVFSVIQEMDAKKLELSFCSYGYVLDDACRLGDSELIDKVLGLMVEKEFLTLDDST 326

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 627
              D I+++LCD+GKT+A+EM F RA +    ++D TYGC L++LS  GR KEAV +Y  I
Sbjct: 327  VNDQIIERLCDMGKTFASEMLFHRACNGGT-VRDRTYGCMLKSLSVIGRTKEAVDVYRLI 385

Query: 626  SERGATVNGKS-YYAFANVLCKED--PSEEVSELLRDLIGRGFSPCASELSKFITLQCKK 456
              +G TV  +S Y  FAN LC++D   SEE  ELL D+I RGF PC  +LS+ +   C+K
Sbjct: 386  CRKGITVLDESCYKEFANALCRDDDNSSEEEGELLIDVIKRGFVPCTLKLSEVLASLCRK 445

Query: 455  GKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTY 276
             +W  AE LL+ ++E  +  DSF C  L+  YC   +++ A+ LH KI+K+KGSLDV  Y
Sbjct: 446  RRWNRAEKLLDSVMEMEVHFDSFSCGLLMERYCRSGKLEKAMVLHEKIKKMKGSLDVNAY 505

Query: 275  NVLLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEML 102
            N +L+ L + +R  VEEAV+VF YM+    V+S+SF+IMI GLCR KE++KAMK HDEML
Sbjct: 506  NAVLDRLMMRQRTMVEEAVQVFEYMKEMNTVNSKSFTIMIHGLCRVKEMKKAMKSHDEML 565

Query: 101  KLGLKPERTAYKRLIWGFK 45
            KLGLKP+   YKRLI GF+
Sbjct: 566  KLGLKPDLVTYKRLISGFR 584


>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score =  419 bits (1077), Expect = e-114
 Identities = 217/499 (43%), Positives = 332/499 (66%), Gaps = 5/499 (1%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++AK+ L F+P+LKS C++I+++ +SG  + A+ +L  L++T+  S++V  M +  
Sbjct: 87   LDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWF 146

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S ++ LS VLE Y+ KG    GL+VF   R     PS  A N+LL +L    + ++
Sbjct: 147  EGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRV 206

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C Y A++RNG++ D  TW ++A+ILC+ G+ + +  ++  G+ +  IY  +++ Y + 
Sbjct: 207  ALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRN 266

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F A F  + EM ++KL+  F +Y  +LD AC+ G+ E ++K++  MVEKK V +   +
Sbjct: 267  GEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSA 326

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
              D I+++LCD+GKT+A+EM F++A + + + L D+TYGC L+ALS++ R KEAV +Y  
Sbjct: 327  VNDKIIERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRM 386

Query: 629  ISERGATVNGKS-YYAFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKK 456
            I  +G TV  +S Y  FAN LC++D  SEE  ELL D+I RGF PC  +LS+ +   C+K
Sbjct: 387  ICRKGITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGFVPCTHKLSEVLASMCRK 446

Query: 455  GKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTY 276
             +WK AE LL+ ++E  +  DSF C  L+  YC   +++ A+ LH KI+K+KGSLDV  Y
Sbjct: 447  RRWKSAEKLLDSVMEMEVYFDSFACGLLMERYCRSGKLEKALVLHEKIKKMKGSLDVNAY 506

Query: 275  NVLLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEML 102
            N +L+ L + ++  VEEAV VF YM+    V+S+SF+IMI+GLCR KE++KAM+ HDEML
Sbjct: 507  NAVLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEML 566

Query: 101  KLGLKPERTAYKRLIWGFK 45
            +LGLKP+   YKRLI GFK
Sbjct: 567  RLGLKPDLVTYKRLILGFK 585


>gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus guttatus]
          Length = 426

 Score =  414 bits (1065), Expect = e-113
 Identities = 206/425 (48%), Positives = 292/425 (68%), Gaps = 1/425 (0%)
 Frame = -1

Query: 1319 LSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAWCFYGAVI 1140
            ++SV+ECY  K M+++ L+V+   +      SV +CN LL+ L    E KLAWC+Y ++I
Sbjct: 1    MNSVVECYCSKQMYLQSLEVYHMAKDYRIGLSVDSCNILLNLLGDKNELKLAWCYYASII 60

Query: 1139 RNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGDFRAAFDQ 960
            RNGV  +R TWS +ARIL K GK ERI  +  +GI+   +++L+ID + KRGDF AAFD 
Sbjct: 61   RNGVSGNRFTWSSIARILHKDGKFERISKVFDVGIFTPEMFDLIIDGHSKRGDFEAAFDY 120

Query: 959  LDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEYDLIVQKL 780
            L+ MC++++ P FSTYSSIL+GACK+ + E++E ++  MVEK  +      +YD IV++L
Sbjct: 121  LNRMCSKEIGPSFSTYSSILNGACKHQDGEIIENMLSLMVEKGHIAETPVCDYDSIVKEL 180

Query: 779  CDLGKTYAAEMFFKRASDDKIGLQDATYGCALRAL-SKEGRVKEAVRIYHAISERGATVN 603
            CD GKT+A ++F +RA + KI LQ  TY C L AL S+E R+++A+++Y  + E+   ++
Sbjct: 181  CDEGKTFAVDLFSERAYEAKIELQHGTYECMLMALLSEEARLEDAIKLYKIVREKNILLS 240

Query: 602  GKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLN 423
               Y  F  +LCKE+PS E++ LL D+  +GF     ELS +I+ QC +G+W+EAE++ N
Sbjct: 241  ESCYSEFVVILCKENPSREITNLLVDITKQGFFFQPKELSGYISKQCAEGRWREAEEIFN 300

Query: 422  LILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLER 243
             +L KG L DS CC S+V  +CS  QI  AI +HNK+E+LKGSLD+  YN  +  LF + 
Sbjct: 301  AVLNKGFLLDSTCCGSIVKRHCSSGQIGKAIVVHNKLEELKGSLDIAAYNKFIAALFRDN 360

Query: 242  RVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKR 63
            R EE ++VF+YM+  K+   ESFS MI GLCR KE RKAM+ HDEML+LGLKP+R  YKR
Sbjct: 361  RAEETIKVFDYMKACKIFDGESFSHMICGLCRVKEFRKAMRFHDEMLELGLKPDRRTYKR 420

Query: 62   LIWGF 48
            LI GF
Sbjct: 421  LISGF 425


>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332659015|gb|AEE84415.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score =  360 bits (925), Expect = 7e-97
 Identities = 204/499 (40%), Positives = 309/499 (61%), Gaps = 5/499 (1%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++AK+ L F+P+LKS C++I+++ +SG  + A+ +L  L++T+  S++V  M +  
Sbjct: 87   LDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWF 146

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S ++ LS VLE Y+ KG    GL+VF   R     PS  A N+LL +L    + ++
Sbjct: 147  EGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRV 206

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C Y A++RNG++ D  TW ++A+ILC+ G+ + +  ++  G+ +  IY  +++ Y + 
Sbjct: 207  ALCLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRN 266

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F A F  + EM ++KL+  F +Y  +LD AC+ G+ E ++K++  MVEKK V +   +
Sbjct: 267  GEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSA 326

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
              D I+++LCD+GKT+A+EM F++A + + + L D+TYGC L+ALS++ R KEAV +Y  
Sbjct: 327  VNDKIIERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRM 386

Query: 629  ISERGATVNGKS-YYAFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKK 456
            I  +G TV  +S Y  FAN LC++D  SEE  ELL D+I RG      + S  I L    
Sbjct: 387  ICRKGITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGKEDGNPQRSFLIRL---- 442

Query: 455  GKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTY 276
             KW+  +      LEK L+                        LH KI+K+KGSLDV  Y
Sbjct: 443  WKWRSGK------LEKALV------------------------LHEKIKKMKGSLDVNAY 472

Query: 275  NVLLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEML 102
            N +L+ L + ++  VEEAV VF YM+    V+S+SF+IMI+GLCR KE++KAM+ HDEML
Sbjct: 473  NAVLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEML 532

Query: 101  KLGLKPERTAYKRLIWGFK 45
            +LGLKP+   YKRLI GFK
Sbjct: 533  RLGLKPDLVTYKRLILGFK 551


>ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297313697|gb|EFH44120.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 534

 Score =  341 bits (875), Expect = 4e-91
 Identities = 194/499 (38%), Positives = 302/499 (60%), Gaps = 5/499 (1%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++AK+ L F+P+LKS C++I+++ +SG  + A+ +L  L++TH  S++V SM +  
Sbjct: 87   LDFFDFAKTHLRFEPDLKSHCRVIEVATESGLLERAETLLRPLVETHSVSLVVGSMHRWF 146

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S ++ LS V+ECY+ KG +  GL+VF   R     PS  A N+LL +L    + ++
Sbjct: 147  EGDVSLSISLSLVIECYALKGCYQNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRV 206

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C Y A+I                 LC+ G+ + +V ++  G+ +  IY  +++ Y + 
Sbjct: 207  ALCLYSAMI-----------------LCEHGRSKSVVKLMETGVESCKIYTNLVECYSRN 249

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F A F  + EM  +KL+  FS+Y  +LD AC+ G+ E+++K++ SMVEKK + +   +
Sbjct: 250  GEFDATFSLIHEMDGKKLELSFSSYGCVLDNACRLGDAELIDKVLGSMVEKKFLTLGDSA 309

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRASD-DKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
              D ++++LCD+GKT+A+EM F++A + + + L+++TYGC L+ALS++ R KEAV +Y  
Sbjct: 310  LNDQMIERLCDMGKTFASEMLFRKACNGETVRLRESTYGCMLKALSRKERTKEAVDVYRM 369

Query: 629  ISERGATVNGKSYY-AFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKK 456
            I  +G  V  +S Y  FAN LC++D  SEE  ELL D+I RG      + S  I L    
Sbjct: 370  ICRKGINVLDESCYNEFANALCRDDNSSEEGEELLVDVIKRGKEDGNPQRSFLIRLW--- 426

Query: 455  GKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTY 276
             KW+  +                              ++ A+ LH KI+K+KGSLDV  Y
Sbjct: 427  -KWRSGK------------------------------LEKALELHEKIKKMKGSLDVNAY 455

Query: 275  NVLLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEML 102
            N +L+ L + ++  VEEAV VF YM+  K V+S+SF+IMI+GLCR KE++KAM+ HDEML
Sbjct: 456  NAVLDRLMMRQKEMVEEAVGVFEYMKEMKSVNSKSFTIMIQGLCRVKEMKKAMRSHDEML 515

Query: 101  KLGLKPERTAYKRLIWGFK 45
            +L +KP+  +YKRLI GFK
Sbjct: 516  RLDMKPDLVSYKRLILGFK 534


>emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|7268914|emb|CAB79117.1|
            putative protein [Arabidopsis thaliana]
          Length = 534

 Score =  330 bits (845), Expect = 1e-87
 Identities = 197/499 (39%), Positives = 296/499 (59%), Gaps = 5/499 (1%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQAC 1347
            L FF++AK+ L F+P+LKS C++I+++ +SG  + A+ +L  L++T+  S++V  M +  
Sbjct: 87   LDFFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWF 146

Query: 1346 SGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1167
             G  S ++ LS VLE Y+ KG    GL+VF   R     PS  A N+LL +L    + ++
Sbjct: 147  EGEVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRV 206

Query: 1166 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 987
            A C Y A+I                 LC+ G+ + +  ++  G+ +  IY  +++ Y + 
Sbjct: 207  ALCLYSAMI-----------------LCEQGRSKSVFKLMETGVESCKIYTNLVECYSRN 249

Query: 986  GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 807
            G+F A F  + EM ++KL+  F +Y  +LD AC+ G+ E ++K++  MVEKK V +   +
Sbjct: 250  GEFDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSA 309

Query: 806  EYDLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHA 630
              D I+++LCD+GKT+A+EM F++A + + + L D+TYGC L+ALS++ R KEAV +Y  
Sbjct: 310  VNDKIIERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRM 369

Query: 629  ISERGATVNGKS-YYAFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKK 456
            I  +G TV  +S Y  FAN LC++D  SEE  ELL D+I RG      + S  I L    
Sbjct: 370  ICRKGITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGKEDGNPQRSFLIRL---- 425

Query: 455  GKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTY 276
             KW+  +      LEK L+                        LH KI+K+KGSLDV  Y
Sbjct: 426  WKWRSGK------LEKALV------------------------LHEKIKKMKGSLDVNAY 455

Query: 275  NVLLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEML 102
            N +L+ L + ++  VEEAV VF YM+    V+S+SF+IMI+GLCR KE++KAM+ HDEML
Sbjct: 456  NAVLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEML 515

Query: 101  KLGLKPERTAYKRLIWGFK 45
            +LGLKP+   YKRLI GFK
Sbjct: 516  RLGLKPDLVTYKRLILGFK 534


>ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [Amborella trichopoda]
            gi|548830797|gb|ERM93720.1| hypothetical protein
            AMTR_s00004p00243870 [Amborella trichopoda]
          Length = 359

 Score =  279 bits (714), Expect = 2e-72
 Identities = 143/348 (41%), Positives = 215/348 (61%), Gaps = 22/348 (6%)
 Frame = -1

Query: 1025 VIYNLVIDSYCKRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRS 846
            V+YNL++D YC+ GDF  AF+ ++ +  + L+P F++Y SILDG+C++GN+    +++R 
Sbjct: 11   VVYNLILDGYCRNGDFVIAFEVIERIYGKGLEPDFASYGSILDGSCRFGNMGTAVRVLRI 70

Query: 845  MVEKKLVQMD----LPSE------------------YDLIVQKLCDLGKTYAAEMFFKRA 732
            M+EK+LV        P++                  YD  ++KLC LG T+AAE+ F  A
Sbjct: 71   MLEKRLVPTVGGEFSPNDCFTLNDNNCIVAAISYLHYDAFIRKLCKLGMTHAAELVFGIA 130

Query: 731  SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGATVNGKSYYAFANVLCKEDPS 552
                + LQ+A Y   L+A S++ R+KEAVR+Y  + +R   +N        N L KE+PS
Sbjct: 131  RSALVPLQNACYIALLKAFSRDRRIKEAVRMYFLLLQRDIAMNISECNVLLNALFKEEPS 190

Query: 551  EEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLNLILEKGLLPDSFCCCSL 372
            EEV+++++ +I +GF P    +S +I+ QC KG W+EA +LL + LE+G++PD F   S 
Sbjct: 191  EEVNKVIKSVIEKGFYPDPLAISSYISAQCSKGGWQEANELLWVTLERGVMPDGFVWGSF 250

Query: 371  VGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLERRVEEAVRVFNYMRIQKL 192
            + +YC    +D A+ LH K  K    L+  +YN+LLN L+ E ++EEA  +F+YMR + +
Sbjct: 251  IRHYCEDGHLDYALSLHEKFAKSGNVLNAPSYNILLNRLYNEGKLEEASGMFDYMRNKDV 310

Query: 191  VSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKRLIWGF 48
             SS SF  MI   CREK+  +A K+HDEMLK GLKP+   YKRLI GF
Sbjct: 311  TSSASFMTMISWFCREKKFSEARKMHDEMLKKGLKPDEATYKRLISGF 358


>ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [Amborella trichopoda]
           gi|548861770|gb|ERN19141.1| hypothetical protein
           AMTR_s00061p00160470 [Amborella trichopoda]
          Length = 372

 Score =  203 bits (517), Expect = 1e-49
 Identities = 103/250 (41%), Positives = 160/250 (64%)
 Frame = -1

Query: 806 EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 627
           +Y + +++LC LG T AAE+ F  A +  + LQ+A+Y   L+  S++ R+KEAVR+Y  +
Sbjct: 119 DYGVFIRRLCKLGMTDAAELVFGIAHNALVFLQNASYIALLKGFSRDKRIKEAVRMYFLL 178

Query: 626 SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 447
            +R   +N        N L KE+ SEEV+++++ +I +GF P    +S  I+ QC KG W
Sbjct: 179 LQRDIALNICECNVLLNALFKEEQSEEVNKVIKSVIRKGFYPDPLAISSHISSQCSKGGW 238

Query: 446 KEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 267
           +EA +LL ++LE+G++P+ F C S + +YC    +D A+ LH K+ KL   L+  +YN+L
Sbjct: 239 QEANELLWVMLERGVMPNGFACGSFIRHYCEDGGLDYALSLHEKLVKLGNVLNAPSYNIL 298

Query: 266 LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 87
           L+ L+   ++EEA  +F++MR + + SS SF  MI   C EK+  +A K+HDEMLK GLK
Sbjct: 299 LDQLYNGGKLEEASEMFDHMRNKNVTSSASFITMISWFCWEKKFSEARKMHDEMLKKGLK 358

Query: 86  PERTAYKRLI 57
           P+   YKRLI
Sbjct: 359 PDEATYKRLI 368


>gb|EPS66849.1| hypothetical protein M569_07924, partial [Genlisea aurea]
          Length = 729

 Score =  197 bits (501), Expect = 1e-47
 Identities = 127/509 (24%), Positives = 247/509 (48%), Gaps = 17/509 (3%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMI--- 1356
            L F +WA+    F+ +L+  C  I I  +   ++ A+ + + +    P     +S+    
Sbjct: 55   LKFLDWARGLPFFRDHLQCYCLSIHILTRFKLYKTAQSLAEEVALRFPQDEHGDSVFSCL 114

Query: 1355 ----QACSGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQ 1188
                QAC  S     +   V++ +S+  +    L +    +  G++PSV + NA+L A+ 
Sbjct: 115  RDTYQACESSSG---VFDLVVKAFSNLKLTDRALNMIYSAKCCGFMPSVLSYNAVLEAIF 171

Query: 1187 VSAETK---LAWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGM----GIY- 1032
             ++  +    A C +  ++ NG+ P+  T+++L R LC + ++ + + +       G+  
Sbjct: 172  RNSSCRNVDSARCVFHEMMENGISPNVYTYNVLIRGLCANKEMNQGLSLFEQMEKRGVLP 231

Query: 1031 NSVIYNLVIDSYCKRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKII 852
            N V +N VID+YCK  +   A+  L +M  R L+P   TY+ I++G CK G ++  + ++
Sbjct: 232  NVVTFNTVIDAYCKSRNIDQAYGLLKQMWERNLEPNVITYNVIINGLCKEGRIKETDDVL 291

Query: 851  RSMVEKKLVQMDLPSEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALS 672
              M  K L   ++   Y+ +V   C  G  + A         + +     TY C + +L 
Sbjct: 292  VDMKAKGLAPNEIT--YNTLVDGYCKEGNFHQALALHAEMVKNGLSPNVVTYTCLINSLC 349

Query: 671  KEGRVKEAVRIYHAISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCAS 492
            K G ++ A+  ++ ++ RG   N K+Y    +   ++   +E   L+ ++I +GFSP   
Sbjct: 350  KAGNLQRAMDYFNQMAVRGLKPNEKTYTTLIDGFSQQGFMDEAYGLVEEMISKGFSPSIV 409

Query: 491  ELSKFITLQCKKGKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKI 312
              +  I   C+ G+  E  +++  +  +G+ PD     S++  YC    +D A  +   +
Sbjct: 410  TYNALINGHCQLGRVDEGLNVIQTMTSRGVFPDVVSYSSIINGYCRNLDLDKAFSVKEDM 469

Query: 311  EKLKGSLDVTTYNVLLNGLFLERRVEEAVRVFNYM--RIQKLVSSESFSIMIRGLCREKE 138
             +     D  TY+ L+ GL   RR++EA ++F  M  ++  L    +++ +I   C E +
Sbjct: 470  SQKGIFPDTITYSSLIQGLCELRRLDEACKLFTEMSSKLNLLPDKCTYTCLINAYCAEND 529

Query: 137  LRKAMKVHDEMLKLGLKPERTAYKRLIWG 51
            + KA+ +HDEM++ GL P+  +Y  L+ G
Sbjct: 530  IPKAIHLHDEMIRRGLFPDVISYNVLVNG 558



 Score = 77.4 bits (189), Expect = 2e-11
 Identities = 57/237 (24%), Positives = 93/237 (39%)
 Frame = -1

Query: 758 AAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGATVNGKSYYAFA 579
           +A   F    ++ I     TY   +R L     + + + ++  + +RG   N  ++    
Sbjct: 181 SARCVFHEMMENGISPNVYTYNVLIRGLCANKEMNQGLSLFEQMEKRGVLPNVVTFNTVI 240

Query: 578 NVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLNLILEKGLL 399
           +  CK    ++   LL+ +  R   P     +  I   CK+G+ KE +D+L  +  KGL 
Sbjct: 241 DAYCKSRNIDQAYGLLKQMWERNLEPNVITYNVIINGLCKEGRIKETDDVLVDMKAKGLA 300

Query: 398 PDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLERRVEEAVRV 219
           P+     +LV  YC       A+ LH ++ K   S +V TY  L+N L            
Sbjct: 301 PNEITYNTLVDGYCKEGNFHQALALHAEMVKNGLSPNVVTYTCLINSL------------ 348

Query: 218 FNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKRLIWGF 48
                                 C+   L++AM   ++M   GLKP    Y  LI GF
Sbjct: 349 ----------------------CKAGNLQRAMDYFNQMAVRGLKPNEKTYTTLIDGF 383


>ref|XP_006849567.1| hypothetical protein AMTR_s00024p00183850 [Amborella trichopoda]
            gi|548853142|gb|ERN11148.1| hypothetical protein
            AMTR_s00024p00183850 [Amborella trichopoda]
          Length = 633

 Score =  195 bits (496), Expect = 4e-47
 Identities = 140/504 (27%), Positives = 242/504 (48%), Gaps = 12/504 (2%)
 Frame = -1

Query: 1526 LSFFNWAKSSLGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTH--PASVLVESMIQ 1353
            L F    +S L F  +L+  C  I I       QPA  +L  ++     P +++ +++++
Sbjct: 90   LGFVKHLESDLVFHLDLRCLCIAIHIIAGLENPQPALQLLQRIVNGGFGPNTLIFDALMK 149

Query: 1352 ACSGSDSQ-ALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAE 1176
            A    +++  L+ + +++   H     E +Q+F   +G    PS+ +CN LLS L    +
Sbjct: 150  AKEVCETKNTLVFNLLIKACCHLQKSDEAVQIFYLMKGHKLSPSIESCNFLLSTLSKQNK 209

Query: 1175 TKLAWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGML----GMGIYNSVI-YNL 1011
            T+ AW  Y  + R  +     T++I+  ILCK GKL +    L    G+G   +V+ YN 
Sbjct: 210  TETAWVIYAEIFRLKIPSSIVTFNIMINILCKEGKLNKAKEFLSYMEGLGFKPTVVTYNT 269

Query: 1010 VIDSYCKRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKK 831
            V++ YC +G  + A +  D M NR + P   TY+S++ G CK G +E   + +  M E  
Sbjct: 270  VLNGYCNKGKVQIALEIFDTMKNRGVSPDSFTYASLISGLCKEGRLEESAQFLAKMEESG 329

Query: 830  LVQMDLPSEYDLIVQKLCDLGKTYAAEMFFKRASDD-KIGLQDA--TYGCALRALSKEGR 660
            LV   +   Y+ ++   C+ G+    EM FK  ++  K G++    TY   +  L   G+
Sbjct: 330  LVPTVVA--YNAMIDGFCNNGRL---EMAFKYRNEMIKRGIEPTICTYNPLIHGLFMAGK 384

Query: 659  VKEAVRIYHAISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSK 480
             KE   +   +  R    +  +Y    N  CKE  + +  EL  +++ +G  P     + 
Sbjct: 385  NKEVDDMIKEMVSRNVGPDVFTYNILINGYCKEGNASKAFELHAEMLHKGIEPTKVTYTS 444

Query: 479  FITLQCKKGKWKEAEDLLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLK 300
             I   CK+ K +EA+ L   ++ KG+ PD     +L+  +C+   +D A  L  +++  K
Sbjct: 445  LIYGLCKQNKMEEADRLFKEVMTKGISPDVVLYNALIDGHCAIGNVDDAFMLLKEMDDKK 504

Query: 299  GSLDVTTYNVLLNGLFLERRVEEAVRVFNYMRIQKLVSSE-SFSIMIRGLCREKELRKAM 123
               D  TYN L+ GL +  + +EA  + + M+ + +     S++ +I G  R+ E+  A 
Sbjct: 505  LFPDEITYNTLMRGLCIVGKADEARGLIDKMKERGIKPDYISYNTLISGYSRKGEMNNAF 564

Query: 122  KVHDEMLKLGLKPERTAYKRLIWG 51
            K+ DEML  G  P    Y  LI G
Sbjct: 565  KIRDEMLSTGFNPTILTYNALIKG 588



 Score =  122 bits (305), Expect = 6e-25
 Identities = 95/404 (23%), Positives = 178/404 (44%), Gaps = 5/404 (1%)
 Frame = -1

Query: 1496 LGFQPNLKSRCKLIQISIQSGFFQPAKPILDNLIQTHPASVLVESMIQACSGSDSQALIL 1317
            LGF+P + +   ++      G  Q A  I D +                  G    +   
Sbjct: 258  LGFKPTVVTYNTVLNGYCNKGKVQIALEIFDTMKNR---------------GVSPDSFTY 302

Query: 1316 SSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAWCFYGAVIR 1137
            +S++     +G   E  Q   +    G +P+V A NA++     +   ++A+ +   +I+
Sbjct: 303  ASLISGLCKEGRLEESAQFLAKMEESGLVPTVVAYNAMIDGFCNNGRLEMAFKYRNEMIK 362

Query: 1136 NGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSV-----IYNLVIDSYCKRGDFRA 972
             G+ P   T++ L   L  +GK + +  M+   +  +V      YN++I+ YCK G+   
Sbjct: 363  RGIEPTICTYNPLIHGLFMAGKNKEVDDMIKEMVSRNVGPDVFTYNILINGYCKEGNASK 422

Query: 971  AFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEYDLI 792
            AF+   EM ++ ++P   TY+S++ G CK   +E  +++ + ++ K  +  D+   Y+ +
Sbjct: 423  AFELHAEMLHKGIEPTKVTYTSLIYGLCKQNKMEEADRLFKEVMTKG-ISPDVVL-YNAL 480

Query: 791  VQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGA 612
            +   C +G    A M  K   D K+   + TY   +R L   G+  EA  +   + ERG 
Sbjct: 481  IDGHCAIGNVDDAFMLLKEMDDKKLFPDEITYNTLMRGLCIVGKADEARGLIDKMKERGI 540

Query: 611  TVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAED 432
              +  SY    +   ++       ++  +++  GF+P     +  I   CK  +  +AE+
Sbjct: 541  KPDYISYNTLISGYSRKGEMNNAFKIRDEMLSTGFNPTILTYNALIKGLCKAREGGQAEE 600

Query: 431  LLNLILEKGLLPDSFCCCSLVGYYCSRKQIDSAIGLHNKIEKLK 300
            LL  ++ +GL+PD        G Y S  +     GL  K+EK K
Sbjct: 601  LLKEMVSRGLMPDD-------GTYISMIE-----GLSEKVEKSK 632


Top