BLASTX nr result

ID: Paeonia23_contig00015524 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00015524
         (1585 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prun...   645   0.0  
ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containi...   643   0.0  
ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citr...   610   e-172
ref|XP_002305605.1| pentatricopeptide repeat-containing family p...   593   e-167
ref|XP_002518527.1| pentatricopeptide repeat-containing protein,...   585   e-164
ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily pr...   572   e-160
ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containi...   543   e-152
ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containi...   540   e-151
ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containi...   533   e-148
ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Caps...   435   e-119
ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutr...   426   e-116
sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-c...   422   e-115
gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus...   415   e-113
ref|NP_193849.2| pentatricopeptide repeat-containing protein [Ar...   362   2e-97
ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp....   343   1e-91
emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|72689...   332   4e-88
ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [A...   281   4e-73
ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [A...   206   3e-50
gb|EPS66849.1| hypothetical protein M569_07924, partial [Genlise...   197   1e-47
ref|XP_006287148.1| hypothetical protein CARUB_v10000318mg [Caps...   191   6e-46

>ref|XP_007212718.1| hypothetical protein PRUPE_ppa018797mg [Prunus persica]
            gi|462408583|gb|EMJ13917.1| hypothetical protein
            PRUPE_ppa018797mg [Prunus persica]
          Length = 584

 Score =  645 bits (1665), Expect = 0.0
 Identities = 319/496 (64%), Positives = 393/496 (79%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FFNWAK +L F+P+LKS+C++I++S+ SG  +  KPILD+LIQTHP S LV+ +   C G
Sbjct: 77   FFNWAKVNLRFEPDLKSNCQIIRVSLGSGLVRPVKPILDSLIQTHPVSELVQCITLACKG 136

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
            +DSQ+  LS VL CYS KG+F EGL+VF++    G +PSV ACNALL+A+Q   E +LAW
Sbjct: 137  TDSQSTTLSFVLGCYSRKGLFREGLEVFRKMNVLGCVPSVVACNALLNAIQRENEIRLAW 196

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            CFYG +IRNGVLPDR TWS++A+ILCK GK ERI+ +L + IYNS++YNL++D   K G+
Sbjct: 197  CFYGLMIRNGVLPDRFTWSLVAQILCKDGKFERILRLLDLNIYNSMMYNLLVDGCSKSGN 256

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F AAF  L+EMC+RK+DP FSTYSSILDGACK GNVEVVE++   MVEKKL+     SEY
Sbjct: 257  FDAAFSHLNEMCDRKVDPDFSTYSSILDGACKLGNVEVVERVTSVMVEKKLLPNCPLSEY 316

Query: 860  DLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISE 681
            D IV+KLCDLGKT+AAEMFFK+A D+KIGLQD TYG  L+AL+ E R KEA+ +Y  ISE
Sbjct: 317  DSIVEKLCDLGKTHAAEMFFKKACDEKIGLQDGTYGLMLKALTNEVRTKEAISVYRLISE 376

Query: 680  RGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKE 501
            RG  V+G SY+AFA+VLCKE+  EE  ELL D+I RG SP ASELS FI+  C++G+W+E
Sbjct: 377  RGIVVDGSSYHAFADVLCKEERYEEGFELLMDVISRGCSPSASELSCFISFLCRRGRWRE 436

Query: 500  AEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLN 321
            AE LLN++L+KGLLPD  CC  LVG YCS +QIDSAI LHNK+EKL GSLDVTTYNVLL+
Sbjct: 437  AEYLLNVVLDKGLLPDLICCSPLVGRYCSGRQIDSAIALHNKMEKLNGSLDVTTYNVLLS 496

Query: 320  GLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPE 141
            GLF  RR+EEA+RVF+YMR   L+SS SF+IMIRGLC  KELRKAMK+HDEMLK+ LKP+
Sbjct: 497  GLFAARRIEEAMRVFDYMRRHNLMSSASFTIMIRGLCGVKELRKAMKIHDEMLKMRLKPD 556

Query: 140  RTAYKRLIWGFKTRLS 93
               YKRLI GF+  LS
Sbjct: 557  AATYKRLISGFQVTLS 572


>ref|XP_002264956.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Vitis vinifera]
          Length = 569

 Score =  643 bits (1659), Expect = 0.0
 Identities = 317/494 (64%), Positives = 396/494 (80%), Gaps = 1/494 (0%)
 Frame = -3

Query: 1583 SFFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCS 1404
            SFFNW +++LGFQP+L +H ++I+ISIQSG FQ AK ILD+LI+T   SVLV+S+IQ C 
Sbjct: 79   SFFNWVRTNLGFQPDLAAHSQIIRISIQSGLFQPAKGILDSLIETQKVSVLVDSVIQACR 138

Query: 1403 GSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLA 1224
            G DS++ +L  VLECYS KG+FIE L+VF+     GY+PSV +CNALL +LQ   E KLA
Sbjct: 139  GKDSESPVLGFVLECYSSKGLFIEALEVFRRITIHGYVPSVRSCNALLDSLQRENEIKLA 198

Query: 1223 WCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIY-NSVIYNLVIDSYCKR 1047
            WC  GA+IRNGVLPD   +  +A ILCK+GKLER+V +L M I  N++IY LVID YC+R
Sbjct: 199  WCVCGALIRNGVLPD---YVRIALILCKNGKLERVVRLLDMSIVCNALIYKLVIDCYCER 255

Query: 1046 GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 867
            G+F AAF  L+EMCNRK DPGF  Y+SILDGACKY N EV++ ++ SMVEK L+   L S
Sbjct: 256  GNFSAAFHYLNEMCNRKFDPGFCAYNSILDGACKYENDEVIQIVMGSMVEKGLLPKLLLS 315

Query: 866  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 687
            EYD I+QK+C+LGKT+AA+MFFKRA ++KI L +ATYGC LRAL+K+GRVKEA+ +Y  I
Sbjct: 316  EYDSIIQKICNLGKTHAAQMFFKRARNEKIELDNATYGCMLRALAKDGRVKEAIGVYLVI 375

Query: 686  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 507
             E G TV    Y+AF NVLC+EDPS+EVS+L+ ++IG+GFSPC S+LSKFIT  CK G+W
Sbjct: 376  LESGVTVKDGCYHAFVNVLCEEDPSQEVSKLMGEIIGKGFSPCGSKLSKFITSLCKNGRW 435

Query: 506  KEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 327
             EA+DLLN+ +EKGLLPDSFCC +LV HYC  +QIDS+I LH KI+K+KGSLDV TYNVL
Sbjct: 436  TEADDLLNVTIEKGLLPDSFCCSALVEHYCRSRQIDSSIALHEKIKKVKGSLDVATYNVL 495

Query: 326  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 147
            LNGLF+E+R+E+AV VF+ MR Q L+SS SF+IM+ GLCRE+ELRKAMK HDEMLK+GLK
Sbjct: 496  LNGLFMEKRIEDAVSVFDCMRSQNLLSSTSFTIMVSGLCRERELRKAMKFHDEMLKMGLK 555

Query: 146  PERTAYKRLIWGFK 105
            P+R  YKRLI GFK
Sbjct: 556  PDRATYKRLISGFK 569


>ref|XP_006449088.1| hypothetical protein CICLE_v10018367mg [Citrus clementina]
            gi|557551699|gb|ESR62328.1| hypothetical protein
            CICLE_v10018367mg [Citrus clementina]
          Length = 578

 Score =  610 bits (1573), Expect = e-172
 Identities = 301/493 (61%), Positives = 382/493 (77%), Gaps = 1/493 (0%)
 Frame = -3

Query: 1583 SFFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCS 1404
            +FF W K+SL F+P+L S C +I++ + SG  +   PILD+LIQTH A+VL  SMIQ C 
Sbjct: 85   NFFYWIKTSLHFEPDLISQCHIIRLLLGSGQTERINPILDSLIQTHTATVLTHSMIQSCE 144

Query: 1403 GSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLA 1224
            G DSQ+  LS VL+CYSHKG+F++GL+V++  R  G++P+V ACNALL AL    E +LA
Sbjct: 145  GRDSQSDALSLVLDCYSHKGLFMDGLEVYRMMRVYGFVPAVSACNALLDALYRQNEIRLA 204

Query: 1223 WCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRG 1044
             C YGA+IR+GV P++ TWS++A+ILC+SGK E ++G+L  GIY+SV+YNLVID Y K+G
Sbjct: 205  SCLYGAMIRDGVSPNKFTWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKKG 264

Query: 1043 DFRAAFDQLDEMCN-RKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 867
            DF AAFD+L+EMCN R L PGFSTYSSILDG C+Y   EV ++I+  MVEKKL+  +  S
Sbjct: 265  DFGAAFDRLNEMCNGRNLTPGFSTYSSILDGGCRYEKTEVSDRIVGLMVEKKLLPKNFLS 324

Query: 866  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 687
              D ++QKL D+GKTYAAEM FKRA D+KI LQD TYGC L+ALSKEGRVKE ++IYH I
Sbjct: 325  GNDSVIQKLSDMGKTYAAEMIFKRACDEKIELQDDTYGCMLKALSKEGRVKEVIQIYHLI 384

Query: 686  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 507
            SERG TV    YYAF NVLCKE   EEV  LLRD++ RG+ PCA ELS+F+  QC KGKW
Sbjct: 385  SERGITVKDSDYYAFVNVLCKEHQPEEVCGLLRDVVERGYIPCAMELSRFVASQCGKGKW 444

Query: 506  KEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 327
            KE E+LL+ +L++GLL DSFCC SL+ +YCS +QID AI LH KIEKLKGSLDV TY+VL
Sbjct: 445  KEVEELLSAVLDQGLLLDSFCCSSLMEYYCSNRQIDKAIALHIKIEKLKGSLDVATYDVL 504

Query: 326  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 147
            L+GLF + R+EEAV++F+YM+  K+VSS SF I++  LC  KELRKAMK+HDEMLK+G K
Sbjct: 505  LDGLFKDGRMEEAVQIFDYMKELKVVSSSSFVIVVSRLCHLKELRKAMKIHDEMLKMGHK 564

Query: 146  PERTAYKRLIWGF 108
            P+   YK++I GF
Sbjct: 565  PDEATYKQVISGF 577


>ref|XP_002305605.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222848569|gb|EEE86116.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 564

 Score =  593 bits (1530), Expect = e-167
 Identities = 288/491 (58%), Positives = 372/491 (75%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FFNW +++L  +P+LKS C +I I + SG     +PI+D+L++TH  SVL E+M+  C G
Sbjct: 72   FFNWVQTNLKLKPDLKSQCHIINICVNSGLTLPVRPIMDSLVKTHHVSVLGEAMVDSCRG 131

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
               ++   S VLECYSHKG+F+E L++F++ RG G++ S  ACN++L  LQ   E KLAW
Sbjct: 132  KSLKSDAFSFVLECYSHKGLFMESLEMFRKMRGNGFIASGTACNSVLDVLQRENEIKLAW 191

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            CFY A+I++GVLPD+ TWS++A+ILCK G  ERIV  L MG+YNSV+YN VID   KRGD
Sbjct: 192  CFYCAMIKDGVLPDKLTWSLIAQILCKDGNFERIVKFLDMGVYNSVLYNGVIDCCSKRGD 251

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F AAF++L++MC RKLDPGFSTYS+ILDGACK+GN EV+E+++  M EK L+     S+ 
Sbjct: 252  FEAAFERLNQMCERKLDPGFSTYSAILDGACKHGNEEVIERVMDIMAEKGLLPKCPLSQC 311

Query: 860  DLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISE 681
            D ++QK  DL K   A MFF+RA D+KIGLQDATYGC L+ALSKE RVKEA+ +Y  ISE
Sbjct: 312  DSVIQKFSDLCKMNVATMFFRRACDEKIGLQDATYGCMLKALSKEARVKEAIGLYSLISE 371

Query: 680  RGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKE 501
            +G  V   +Y+AF ++L +ED  EE  E+L D++ RGF P    LSKFI L  +K +W+E
Sbjct: 372  KGIRVKDSTYHAFLDLLSEEDQYEEGYEILGDMMRRGFRPGTVGLSKFILLLSRKRRWRE 431

Query: 500  AEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLN 321
             EDLL+L+LEKGLLPDS CCCSLV HYCSR+QID A+ LHNK+EKL+ SLDV TYN+LL+
Sbjct: 432  VEDLLDLVLEKGLLPDSLCCCSLVEHYCSRRQIDKAVALHNKMEKLQASLDVATYNILLD 491

Query: 320  GLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPE 141
            GL    R+EE VRVF+YM+  KLV+SESF+I IRGLCR KE+RKAMK+HDEML +GLKP+
Sbjct: 492  GLVKNGRIEEVVRVFDYMKGLKLVNSESFTITIRGLCRAKEMRKAMKLHDEMLDMGLKPD 551

Query: 140  RTAYKRLIWGF 108
            + AYKRLI  F
Sbjct: 552  KAAYKRLILEF 562


>ref|XP_002518527.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223542372|gb|EEF43914.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 599

 Score =  585 bits (1508), Expect = e-164
 Identities = 287/494 (58%), Positives = 377/494 (76%)
 Frame = -3

Query: 1583 SFFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCS 1404
            +FFNWAK++L F P+LKS C +IQ+S+ S   ++AK ILD+LI+T+P+++ +E+M+Q C 
Sbjct: 100  NFFNWAKTNLNFNPDLKSQCHVIQLSLGSDLPRAAKKILDSLIKTYPSNLFLETMVQACR 159

Query: 1403 GSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLA 1224
            G  S    L+ VLE YSHKG F+EGL+V+K+ R  G  PSV ACN LL ALQ  +E +LA
Sbjct: 160  GKSSLLCTLNFVLEFYSHKGSFLEGLEVYKKMRVIGCTPSVHACNVLLDALQRESEIRLA 219

Query: 1223 WCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRG 1044
            WCFY A+IR GVLPD+ TWS++A ILCK G  ERIV +L MGI NSV+YN V+D Y K G
Sbjct: 220  WCFYCAMIRVGVLPDKFTWSLVAHILCKDGNFERIVKLLDMGICNSVMYNAVVDYYSKNG 279

Query: 1043 DFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSE 864
            DF+AAF +L+EM +RK++PGFSTYSSILDGACK  N++V+E+++  MV K+L+     S+
Sbjct: 280  DFKAAFCRLNEMYDRKVEPGFSTYSSILDGACKCRNLQVIERVVAIMVGKQLLSKCPSSD 339

Query: 863  YDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
            YD I+QKLCDLGK  AA +FFKRA D++IGLQDATYG  LRA S EG ++EA+ +Y  I 
Sbjct: 340  YDSIIQKLCDLGKVSAATLFFKRACDERIGLQDATYGRMLRAFSIEGILEEAIGLYQVIL 399

Query: 683  ERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWK 504
            ERG T+   +  AF ++L ++D   E  E++RD++ RGFSPC S LSK+ITL CKK +WK
Sbjct: 400  ERGLTIKDNASDAFVDLLSEKDQYAEGYEIVRDIMRRGFSPCTSSLSKYITLLCKKRRWK 459

Query: 503  EAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLL 324
            EAE+LL ++LEKGLLPD+   CSLV HYCS KQ D A+ LHN +EKL+ SLD+T YN+LL
Sbjct: 460  EAEELLYMVLEKGLLPDTLSFCSLVKHYCSSKQTDKALALHNTLEKLQASLDITAYNLLL 519

Query: 323  NGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKP 144
             GL  E RVEE+++VF+YM+  KL +S SF+++IRGLCR KELRKAMK+HDEML +GLKP
Sbjct: 520  GGLVKEGRVEESIKVFDYMKGLKLANSASFTVIIRGLCRAKELRKAMKLHDEMLNMGLKP 579

Query: 143  ERTAYKRLIWGFKT 102
            ++  YKRLI  F +
Sbjct: 580  DKPTYKRLILEFNS 593


>ref|XP_007025994.1| Tetratricopeptide repeat-like superfamily protein, putative
            [Theobroma cacao] gi|508781360|gb|EOY28616.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative [Theobroma cacao]
          Length = 578

 Score =  572 bits (1473), Expect = e-160
 Identities = 280/493 (56%), Positives = 374/493 (75%)
 Frame = -3

Query: 1583 SFFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCS 1404
            +FFNW K+ LGF+P+LKS C +IQI I S   +  +P +++LIQ+HPA ++ +SMIQ C 
Sbjct: 86   TFFNWVKTHLGFKPDLKSQCHIIQIVIGSDLCRCVEPAVNSLIQSHPAPIVADSMIQACK 145

Query: 1403 GSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLA 1224
            G + Q+  LSSV++CYS  G+F+EGL+VF++ R  G+ PSVCACN LL ALQ   E KLA
Sbjct: 146  GKNFQSSALSSVIKCYSKHGLFMEGLEVFRKMRIHGFTPSVCACNELLDALQRGNEVKLA 205

Query: 1223 WCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRG 1044
            W F GA++R G+ PD+ +WS++A+ILCK+GKL ++VG+L  GIYNS IY+LVID Y K G
Sbjct: 206  WGFLGAMLRVGIEPDQFSWSLVAQILCKNGKLGKVVGLLEKGIYNSEIYDLVIDFYSKSG 265

Query: 1043 DFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSE 864
            DF AAF++L+EM NRK+D  F TYSSILDGACKY + EV+ +I+R MVEK+LV     S+
Sbjct: 266  DFGAAFNRLNEMYNRKVDTSFCTYSSILDGACKYNDGEVIGRILRMMVEKELVPRHQFSK 325

Query: 863  YDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
             DLI+ KLCDL KT+AAEM FK+A D+ I L++ TYG  L+ALS+E R+ EA+ +   I 
Sbjct: 326  KDLIIPKLCDLRKTHAAEMLFKKACDENIRLRNDTYGSMLKALSQEARIDEAIEVCRMIL 385

Query: 683  ERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWK 504
            +R   VN   Y AF N LCKED S++  ELL D+I RG +PCAS+LSK+I+ QC +  W+
Sbjct: 386  KRRIIVNESCYSAFINALCKEDQSDDGYELLVDIIKRGHNPCASKLSKYISSQCSQMNWR 445

Query: 503  EAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLL 324
            +AE+LL+L+LEKGLLPDSF CC L+ +YC  +Q+D  + LH+K+EK+KG LDVTTYN++L
Sbjct: 446  KAEELLDLMLEKGLLPDSFGCCLLIQYYCFNRQVDKIVALHDKMEKVKGCLDVTTYNMIL 505

Query: 323  NGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKP 144
            + L+ ER+ EEAVRV++YM    LV S SF+IMIR LC  KE++KAMK+HDEML +GLKP
Sbjct: 506  DVLWGERKAEEAVRVYDYMTGLNLVDSASFTIMIRELCHMKEMKKAMKIHDEMLNMGLKP 565

Query: 143  ERTAYKRLIWGFK 105
            ++  YKRLI GFK
Sbjct: 566  DKGTYKRLISGFK 578


>ref|XP_006467990.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Citrus sinensis]
          Length = 538

 Score =  543 bits (1399), Expect = e-152
 Identities = 282/493 (57%), Positives = 349/493 (70%), Gaps = 1/493 (0%)
 Frame = -3

Query: 1583 SFFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCS 1404
            +FF W K+SL F+P+L S C +I++ + SG  +  KP LD+LIQTH A+VL  SMIQ C 
Sbjct: 85   NFFYWIKTSLHFEPDLISQCHIIRLLLGSGQTERIKPSLDSLIQTHTATVLTHSMIQSCE 144

Query: 1403 GSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLA 1224
                                                    V ACNALL AL    E +LA
Sbjct: 145  ----------------------------------------VSACNALLDALYRQNEIRLA 164

Query: 1223 WCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRG 1044
             C YGA++R+GV P++ TWS++A+ILC+SGK E ++G+L  GIY+SV+YNLVID Y K+G
Sbjct: 165  SCLYGAMVRDGVSPNKFTWSLVAQILCRSGKFEVVLGLLDSGIYSSVMYNLVIDFYSKKG 224

Query: 1043 DFRAAFDQLDEMCN-RKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 867
            DF AAFD+L+EMCN R L PGFSTYSSILDGA +Y   EV ++I+  MVEKKL+     S
Sbjct: 225  DFGAAFDRLNEMCNGRNLTPGFSTYSSILDGARRYEKTEVSDRIVGLMVEKKLLPKHFLS 284

Query: 866  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 687
              D ++QKL D+GKTYAAEM FKRA D+KI LQD TYGC L+ALSKEGRVKEA++IYH I
Sbjct: 285  GNDYVIQKLSDMGKTYAAEMIFKRACDEKIELQDDTYGCMLKALSKEGRVKEAIQIYHLI 344

Query: 686  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 507
            SERG TV    YYAF NVLCKE   EEV  LLRD++ RG+ PCA ELS+F+  QC KGKW
Sbjct: 345  SERGITVRDSDYYAFVNVLCKEHQPEEVCGLLRDVVERGYIPCAMELSRFVASQCGKGKW 404

Query: 506  KEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 327
            KE E+LL+ +L+KGLL DSFCC SL+ +YCS +QID AI LH KIEKLKGSLDV TY+VL
Sbjct: 405  KEVEELLSAVLDKGLLLDSFCCSSLMEYYCSNRQIDKAIALHIKIEKLKGSLDVATYDVL 464

Query: 326  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 147
            L+GLF + R+EEAVR+F+YM+  K+VSS SF I++  LC  KELRKAMK HDEMLK+G K
Sbjct: 465  LDGLFKDGRMEEAVRIFDYMKELKVVSSSSFVIVVSRLCHLKELRKAMKNHDEMLKMGHK 524

Query: 146  PERTAYKRLIWGF 108
            P+   YK++I GF
Sbjct: 525  PDEATYKQVISGF 537


>ref|XP_004233795.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            [Solanum lycopersicum]
          Length = 584

 Score =  540 bits (1390), Expect = e-151
 Identities = 266/493 (53%), Positives = 362/493 (73%), Gaps = 2/493 (0%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++AK++LGFQP+ K  C L+ I + SG  + AKPILD LIQT+P + +V  +IQ    
Sbjct: 91   FFHYAKNNLGFQPDAKVLCTLVYILLGSGLSRPAKPILDTLIQTYPPAQIVGFLIQSLKA 150

Query: 1400 SDS--QALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1227
             +   Q+ +LSSVLECY +KG+F+E LQV++  R  GY  SV  CN LL+ L    + +L
Sbjct: 151  GEIHIQSSVLSSVLECYCNKGLFLEALQVYQIVREYGYFVSVNCCNTLLNLLLSKNDLRL 210

Query: 1226 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 1047
             WC+YG++IRNGV  +  TWS++A++LCK GK E+IV +L  G+ + +IYN++ID Y +R
Sbjct: 211  GWCYYGSIIRNGVQENVVTWSLIAQMLCKDGKFEKIVAILDKGVCSPLIYNILIDCYSER 270

Query: 1046 GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 867
            G F AAF  L++M + ++DP FST+SSILDGACKY N +V+E ++ SMVEK  +   +  
Sbjct: 271  GKFDAAFGYLNDMYSERIDPTFSTFSSILDGACKYQNAQVIESVMSSMVEKGHLPKVVTP 330

Query: 866  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 687
            +YD ++QK   +GK YAAE+FF+ A +  I LQD TYG  LRA SKEG+ ++A+ +Y+ I
Sbjct: 331  DYDSVIQKFSGIGKAYAAELFFREAYEKSIKLQDKTYGSMLRAFSKEGKAEDAIWMYNII 390

Query: 686  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 507
             ER   +NGK Y AF +VLC E PS EVS LL+DLIGRGF P  S++SKFI  QC+K +W
Sbjct: 391  VERKIFINGKCYSAFMSVLCNEIPSVEVSSLLKDLIGRGFVPPVSQVSKFIVSQCEKHQW 450

Query: 506  KEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 327
            KEAE+LLN+I +KGL  +SFCCCSLV HYC  ++IDSAI LH ++E+L  +LDV TY +L
Sbjct: 451  KEAEELLNVIFQKGLQFESFCCCSLVRHYCFSRRIDSAISLHTELERLGVALDVETYGLL 510

Query: 326  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 147
            L+ LF  RR EEA+++F+YMR   ++SS SFSIMIRGLC+E+E RKAM++HD+MLKLG K
Sbjct: 511  LDRLFKSRRHEEALKIFDYMRTHDMLSSGSFSIMIRGLCQEEEFRKAMRLHDDMLKLGFK 570

Query: 146  PERTAYKRLIWGF 108
            P++ AYKRLI GF
Sbjct: 571  PDKKAYKRLISGF 583


>ref|XP_006348079.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21170-like
            isoform X1 [Solanum tuberosum]
            gi|565362693|ref|XP_006348080.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g21170-like isoform X2 [Solanum tuberosum]
            gi|565362695|ref|XP_006348081.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At4g21170-like isoform X3 [Solanum tuberosum]
          Length = 584

 Score =  533 bits (1372), Expect = e-148
 Identities = 262/493 (53%), Positives = 365/493 (74%), Gaps = 2/493 (0%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++AK++LGFQP+ K  C L+ I + SG  + AKPILD LIQT+P + +V  +IQ    
Sbjct: 91   FFDYAKNNLGFQPDAKVLCTLVYILLGSGLSKPAKPILDTLIQTYPPAQIVGFLIQSLKV 150

Query: 1400 SDS--QALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKL 1227
             +   Q+ +LSSVLECY +KG+F+E LQV++  R  GY  SV  CN LL+ L    E +L
Sbjct: 151  GEIHIQSSVLSSVLECYCNKGLFLEALQVYQIVREYGYFVSVNCCNTLLNLLLSKNELRL 210

Query: 1226 AWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKR 1047
             WC++G++IRNGV  +  TWS++A++LCK GK E+IV +L  G+ + V+YN++ID Y +R
Sbjct: 211  GWCYFGSIIRNGVQENVVTWSLIAQMLCKDGKFEQIVPILDKGVCSPVMYNILIDCYSER 270

Query: 1046 GDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPS 867
            G+F AAF  L++M ++ +DP F+T+SSILDGACKY N EV+E ++ SMVEK  +   +  
Sbjct: 271  GNFEAAFGYLNDMYSKCIDPTFNTFSSILDGACKYQNAEVIESVMSSMVEKGHLPKVVLP 330

Query: 866  EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 687
            +YD ++++  D+GK YAAE+FF+ A + +I LQD TYG  LRA SKEG+ ++A+ +Y+ I
Sbjct: 331  DYDSVIRRFSDMGKAYAAELFFREAYEKRIKLQDNTYGSMLRAFSKEGKAEDAIWMYNII 390

Query: 686  SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 507
             ER   ++ K Y AF +VLC E+PS EVS LL+DLIGRGF P  S++SKFI  QC+K +W
Sbjct: 391  VERKIFISDKCYSAFMSVLCNENPSLEVSSLLKDLIGRGFVPPVSQVSKFIVSQCEKRQW 450

Query: 506  KEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 327
            KEAE+LLN+I ++ L  +SFCCCSLV HYC  ++IDSAI LH ++E+L  +LDV TY +L
Sbjct: 451  KEAEELLNVIFQRRLQFESFCCCSLVRHYCFSRRIDSAISLHTELERLGVALDVETYGLL 510

Query: 326  LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 147
            L+ LF  RR EEA+++F+YMR   ++SSESFSIMIRGLC+E+E RKAM++HD+MLKLG K
Sbjct: 511  LDSLFKSRRREEALKIFDYMRTHDMLSSESFSIMIRGLCQEQEFRKAMRLHDDMLKLGFK 570

Query: 146  PERTAYKRLIWGF 108
            P++ AYKRLI GF
Sbjct: 571  PDKKAYKRLISGF 583


>ref|XP_006285106.1| hypothetical protein CARUB_v10006439mg [Capsella rubella]
            gi|482553811|gb|EOA18004.1| hypothetical protein
            CARUB_v10006439mg [Capsella rubella]
          Length = 585

 Score =  435 bits (1118), Expect = e-119
 Identities = 219/497 (44%), Positives = 339/497 (68%), Gaps = 5/497 (1%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++A++ L F P++KS C++I+++ +SG  + A+ +L  L++T+  S++V S+ + C G
Sbjct: 89   FFDFAQTHLHFDPDVKSQCRVIEVATESGLLERAETLLRPLVETNSVSLVVGSLQKCCEG 148

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
              S ++ LS VLECY+ KG +  GL+VF   R     PS+ A N+LL +L    + ++A 
Sbjct: 149  EVSLSISLSLVLECYALKGCYQNGLEVFGFMRRLRLSPSLRAYNSLLDSLIKEGQFRVAL 208

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            C Y A++RN V+ D  TW ++A+ILC+ G+ + +V ++  G+ +  IY  +++ Y + G+
Sbjct: 209  CLYSAMVRNQVVSDGFTWDLVAQILCEQGRSKSVVKLMETGVESCKIYTNLVECYSRNGE 268

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F A F+ + EM N+KL+  FS+YS +LD  C+ G+ E++ K++  MVEKK + +D  +  
Sbjct: 269  FDAVFNVIHEMDNKKLELSFSSYSCVLDDVCRLGDAELMGKVLGLMVEKKFLAVDASAVN 328

Query: 860  DLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
            D I+++LCD+GKT+A+EM F++A + + + L+D TYGC L+ALS++GR KEAV +Y  I 
Sbjct: 329  DEIIERLCDMGKTFASEMLFRKACNGETVRLRDGTYGCMLKALSRKGRTKEAVDVYRLIC 388

Query: 683  ERGATVNGKSYYA-FANVLCKEDPS-EEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 510
             +G TV  +S Y  FAN LC++D S EE  ELL D+I RGF PC   LS+ +   C+K +
Sbjct: 389  RKGITVLDESCYTEFANALCRDDNSPEEELELLVDVIKRGFVPCTRRLSEVLASLCRKRR 448

Query: 509  WKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 330
            W+ AE LL+ ++E  +  DSF C  L+  YC   ++D A+ LH +I+K+KGSLDV  YN 
Sbjct: 449  WRHAEKLLDSVMEMEVYFDSFSCGILMERYCRSGKLDKAMELHERIKKMKGSLDVNAYNA 508

Query: 329  LLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKL 156
            +L+ L + +R  VEEAVRVF YM+  K V+S+SF+IMI+GLC  KE++KA + HDEMLKL
Sbjct: 509  VLDRLMMRQREMVEEAVRVFEYMKEMKSVNSKSFTIMIQGLCHVKEMKKAKQSHDEMLKL 568

Query: 155  GLKPERTAYKRLIWGFK 105
            G+KP+   YKR+I+GFK
Sbjct: 569  GMKPDLATYKRVIYGFK 585


>ref|XP_006413812.1| hypothetical protein EUTSA_v10024760mg [Eutrema salsugineum]
            gi|557114982|gb|ESQ55265.1| hypothetical protein
            EUTSA_v10024760mg [Eutrema salsugineum]
          Length = 584

 Score =  426 bits (1095), Expect = e-116
 Identities = 222/497 (44%), Positives = 328/497 (65%), Gaps = 5/497 (1%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF+WAK+ L F+P+LKS C++IQ++ ++G  + A+  +  LI+TH   V+V SM +   G
Sbjct: 89   FFDWAKTHLRFEPDLKSCCRVIQVATETGLLERAEAFVRPLIETHSVCVIVGSMHRWFEG 148

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
              S +  LS VLECY+ KG +  GL+VF   R     PS+ A N+LL +L    + +LA 
Sbjct: 149  EVSLSTSLSLVLECYALKGSYQNGLEVFGSMRRLRLSPSLRAYNSLLDSLVKEKQFRLAL 208

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            C Y A++RN V+ D  TW ++A++LC+ GK + +V ++  G+ +  IY  +++ Y + G+
Sbjct: 209  CLYSAMVRNRVVSDGLTWDLVAQVLCEQGKFKSVVKLMETGVESCKIYTNLVECYSRNGE 268

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F A F  + EM  +KL+  F +Y  +LD AC+ G+ E+++K++  MVEK+ + +D  +  
Sbjct: 269  FDAVFSVIQEMDAKKLELSFCSYGYVLDDACRLGDSELIDKVLGLMVEKEFLTLDDSTVN 328

Query: 860  DLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISE 681
            D I+++LCD+GKT+A+EM F RA +    ++D TYGC L++LS  GR KEAV +Y  I  
Sbjct: 329  DQIIERLCDMGKTFASEMLFHRACNGGT-VRDRTYGCMLKSLSVIGRTKEAVDVYRLICR 387

Query: 680  RGATVNGKS-YYAFANVLCKED--PSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 510
            +G TV  +S Y  FAN LC++D   SEE  ELL D+I RGF PC  +LS+ +   C+K +
Sbjct: 388  KGITVLDESCYKEFANALCRDDDNSSEEEGELLIDVIKRGFVPCTLKLSEVLASLCRKRR 447

Query: 509  WKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 330
            W  AE LL+ ++E  +  DSF C  L+  YC   +++ A+ LH KI+K+KGSLDV  YN 
Sbjct: 448  WNRAEKLLDSVMEMEVHFDSFSCGLLMERYCRSGKLEKAMVLHEKIKKMKGSLDVNAYNA 507

Query: 329  LLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKL 156
            +L+ L + +R  VEEAV+VF YM+    V+S+SF+IMI GLCR KE++KAMK HDEMLKL
Sbjct: 508  VLDRLMMRQRTMVEEAVQVFEYMKEMNTVNSKSFTIMIHGLCRVKEMKKAMKSHDEMLKL 567

Query: 155  GLKPERTAYKRLIWGFK 105
            GLKP+   YKRLI GF+
Sbjct: 568  GLKPDLVTYKRLISGFR 584


>sp|O49558.2|PP331_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At4g21170
          Length = 585

 Score =  422 bits (1084), Expect = e-115
 Identities = 217/497 (43%), Positives = 332/497 (66%), Gaps = 5/497 (1%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++AK+ L F+P+LKSHC++I+++ +SG  + A+ +L  L++T+  S++V  M +   G
Sbjct: 89   FFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEG 148

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
              S ++ LS VLE Y+ KG    GL+VF   R     PS  A N+LL +L    + ++A 
Sbjct: 149  EVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVAL 208

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            C Y A++RNG++ D  TW ++A+ILC+ G+ + +  ++  G+ +  IY  +++ Y + G+
Sbjct: 209  CLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGE 268

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F A F  + EM ++KL+  F +Y  +LD AC+ G+ E ++K++  MVEKK V +   +  
Sbjct: 269  FDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVN 328

Query: 860  DLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
            D I+++LCD+GKT+A+EM F++A + + + L D+TYGC L+ALS++ R KEAV +Y  I 
Sbjct: 329  DKIIERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMIC 388

Query: 683  ERGATVNGKS-YYAFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 510
             +G TV  +S Y  FAN LC++D  SEE  ELL D+I RGF PC  +LS+ +   C+K +
Sbjct: 389  RKGITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGFVPCTHKLSEVLASMCRKRR 448

Query: 509  WKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 330
            WK AE LL+ ++E  +  DSF C  L+  YC   +++ A+ LH KI+K+KGSLDV  YN 
Sbjct: 449  WKSAEKLLDSVMEMEVYFDSFACGLLMERYCRSGKLEKALVLHEKIKKMKGSLDVNAYNA 508

Query: 329  LLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKL 156
            +L+ L + ++  VEEAV VF YM+    V+S+SF+IMI+GLCR KE++KAM+ HDEML+L
Sbjct: 509  VLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRL 568

Query: 155  GLKPERTAYKRLIWGFK 105
            GLKP+   YKRLI GFK
Sbjct: 569  GLKPDLVTYKRLILGFK 585


>gb|EYU27821.1| hypothetical protein MIMGU_mgv1a006926mg [Mimulus guttatus]
          Length = 426

 Score =  415 bits (1067), Expect = e-113
 Identities = 206/425 (48%), Positives = 292/425 (68%), Gaps = 1/425 (0%)
 Frame = -3

Query: 1379 LSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAWCFYGAVI 1200
            ++SV+ECY  K M+++ L+V+   +      SV +CN LL+ L    E KLAWC+Y ++I
Sbjct: 1    MNSVVECYCSKQMYLQSLEVYHMAKDYRIGLSVDSCNILLNLLGDKNELKLAWCYYASII 60

Query: 1199 RNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGDFRAAFDQ 1020
            RNGV  +R TWS +ARIL K GK ERI  +  +GI+   +++L+ID + KRGDF AAFD 
Sbjct: 61   RNGVSGNRFTWSSIARILHKDGKFERISKVFDVGIFTPEMFDLIIDGHSKRGDFEAAFDY 120

Query: 1019 LDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEYDLIVQKL 840
            L+ MC++++ P FSTYSSIL+GACK+ + E++E ++  MVEK  +      +YD IV++L
Sbjct: 121  LNRMCSKEIGPSFSTYSSILNGACKHQDGEIIENMLSLMVEKGHIAETPVCDYDSIVKEL 180

Query: 839  CDLGKTYAAEMFFKRASDDKIGLQDATYGCALRAL-SKEGRVKEAVRIYHAISERGATVN 663
            CD GKT+A ++F +RA + KI LQ  TY C L AL S+E R+++A+++Y  + E+   ++
Sbjct: 181  CDEGKTFAVDLFSERAYEAKIELQHGTYECMLMALLSEEARLEDAIKLYKIVREKNILLS 240

Query: 662  GKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLN 483
               Y  F  +LCKE+PS E++ LL D+  +GF     ELS +I+ QC +G+W+EAE++ N
Sbjct: 241  ESCYSEFVVILCKENPSREITNLLVDITKQGFFFQPKELSGYISKQCAEGRWREAEEIFN 300

Query: 482  LILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLER 303
             +L KG L DS CC S+V  +CS  QI  AI +HNK+E+LKGSLD+  YN  +  LF + 
Sbjct: 301  AVLNKGFLLDSTCCGSIVKRHCSSGQIGKAIVVHNKLEELKGSLDIAAYNKFIAALFRDN 360

Query: 302  RVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKR 123
            R EE ++VF+YM+  K+   ESFS MI GLCR KE RKAM+ HDEML+LGLKP+R  YKR
Sbjct: 361  RAEETIKVFDYMKACKIFDGESFSHMICGLCRVKEFRKAMRFHDEMLELGLKPDRRTYKR 420

Query: 122  LIWGF 108
            LI GF
Sbjct: 421  LISGF 425


>ref|NP_193849.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332659015|gb|AEE84415.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 551

 Score =  362 bits (930), Expect = 2e-97
 Identities = 204/497 (41%), Positives = 309/497 (62%), Gaps = 5/497 (1%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++AK+ L F+P+LKSHC++I+++ +SG  + A+ +L  L++T+  S++V  M +   G
Sbjct: 89   FFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEG 148

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
              S ++ LS VLE Y+ KG    GL+VF   R     PS  A N+LL +L    + ++A 
Sbjct: 149  EVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVAL 208

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            C Y A++RNG++ D  TW ++A+ILC+ G+ + +  ++  G+ +  IY  +++ Y + G+
Sbjct: 209  CLYSAMVRNGIVSDELTWDLIAQILCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGE 268

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F A F  + EM ++KL+  F +Y  +LD AC+ G+ E ++K++  MVEKK V +   +  
Sbjct: 269  FDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVN 328

Query: 860  DLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
            D I+++LCD+GKT+A+EM F++A + + + L D+TYGC L+ALS++ R KEAV +Y  I 
Sbjct: 329  DKIIERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMIC 388

Query: 683  ERGATVNGKS-YYAFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 510
             +G TV  +S Y  FAN LC++D  SEE  ELL D+I RG      + S  I L     K
Sbjct: 389  RKGITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGKEDGNPQRSFLIRL----WK 444

Query: 509  WKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 330
            W+  +      LEK L+                        LH KI+K+KGSLDV  YN 
Sbjct: 445  WRSGK------LEKALV------------------------LHEKIKKMKGSLDVNAYNA 474

Query: 329  LLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKL 156
            +L+ L + ++  VEEAV VF YM+    V+S+SF+IMI+GLCR KE++KAM+ HDEML+L
Sbjct: 475  VLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRL 534

Query: 155  GLKPERTAYKRLIWGFK 105
            GLKP+   YKRLI GFK
Sbjct: 535  GLKPDLVTYKRLILGFK 551


>ref|XP_002867861.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297313697|gb|EFH44120.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 534

 Score =  343 bits (880), Expect = 1e-91
 Identities = 194/497 (39%), Positives = 302/497 (60%), Gaps = 5/497 (1%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++AK+ L F+P+LKSHC++I+++ +SG  + A+ +L  L++TH  S++V SM +   G
Sbjct: 89   FFDFAKTHLRFEPDLKSHCRVIEVATESGLLERAETLLRPLVETHSVSLVVGSMHRWFEG 148

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
              S ++ LS V+ECY+ KG +  GL+VF   R     PS  A N+LL +L    + ++A 
Sbjct: 149  DVSLSISLSLVIECYALKGCYQNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVAL 208

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            C Y A+I                 LC+ G+ + +V ++  G+ +  IY  +++ Y + G+
Sbjct: 209  CLYSAMI-----------------LCEHGRSKSVVKLMETGVESCKIYTNLVECYSRNGE 251

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F A F  + EM  +KL+  FS+Y  +LD AC+ G+ E+++K++ SMVEKK + +   +  
Sbjct: 252  FDATFSLIHEMDGKKLELSFSSYGCVLDNACRLGDAELIDKVLGSMVEKKFLTLGDSALN 311

Query: 860  DLIVQKLCDLGKTYAAEMFFKRASD-DKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
            D ++++LCD+GKT+A+EM F++A + + + L+++TYGC L+ALS++ R KEAV +Y  I 
Sbjct: 312  DQMIERLCDMGKTFASEMLFRKACNGETVRLRESTYGCMLKALSRKERTKEAVDVYRMIC 371

Query: 683  ERGATVNGKSYY-AFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 510
             +G  V  +S Y  FAN LC++D  SEE  ELL D+I RG      + S  I L     K
Sbjct: 372  RKGINVLDESCYNEFANALCRDDNSSEEGEELLVDVIKRGKEDGNPQRSFLIRLW----K 427

Query: 509  WKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 330
            W+  +                              ++ A+ LH KI+K+KGSLDV  YN 
Sbjct: 428  WRSGK------------------------------LEKALELHEKIKKMKGSLDVNAYNA 457

Query: 329  LLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKL 156
            +L+ L + ++  VEEAV VF YM+  K V+S+SF+IMI+GLCR KE++KAM+ HDEML+L
Sbjct: 458  VLDRLMMRQKEMVEEAVGVFEYMKEMKSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRL 517

Query: 155  GLKPERTAYKRLIWGFK 105
             +KP+  +YKRLI GFK
Sbjct: 518  DMKPDLVSYKRLILGFK 534


>emb|CAA17536.1| putative protein [Arabidopsis thaliana] gi|7268914|emb|CAB79117.1|
            putative protein [Arabidopsis thaliana]
          Length = 534

 Score =  332 bits (850), Expect = 4e-88
 Identities = 197/497 (39%), Positives = 296/497 (59%), Gaps = 5/497 (1%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMIQVCSG 1401
            FF++AK+ L F+P+LKSHC++I+++ +SG  + A+ +L  L++T+  S++V  M +   G
Sbjct: 89   FFDFAKTHLRFEPDLKSHCRVIEVAAESGLLERAEMLLRPLVETNSVSLVVGEMHRWFEG 148

Query: 1400 SDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAW 1221
              S ++ LS VLE Y+ KG    GL+VF   R     PS  A N+LL +L    + ++A 
Sbjct: 149  EVSLSVSLSLVLEYYALKGSHHNGLEVFGFMRRLRLSPSQSAYNSLLGSLVKENQFRVAL 208

Query: 1220 CFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGMGIYNSVIYNLVIDSYCKRGD 1041
            C Y A+I                 LC+ G+ + +  ++  G+ +  IY  +++ Y + G+
Sbjct: 209  CLYSAMI-----------------LCEQGRSKSVFKLMETGVESCKIYTNLVECYSRNGE 251

Query: 1040 FRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEY 861
            F A F  + EM ++KL+  F +Y  +LD AC+ G+ E ++K++  MVEKK V +   +  
Sbjct: 252  FDAVFSLIHEMDDKKLELSFCSYGCVLDDACRLGDAEFIDKVLCLMVEKKFVTLGDSAVN 311

Query: 860  DLIVQKLCDLGKTYAAEMFFKRA-SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAIS 684
            D I+++LCD+GKT+A+EM F++A + + + L D+TYGC L+ALS++ R KEAV +Y  I 
Sbjct: 312  DKIIERLCDMGKTFASEMLFRKACNGETVRLWDSTYGCMLKALSRKKRTKEAVDVYRMIC 371

Query: 683  ERGATVNGKS-YYAFANVLCKED-PSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGK 510
             +G TV  +S Y  FAN LC++D  SEE  ELL D+I RG      + S  I L     K
Sbjct: 372  RKGITVLDESCYIEFANALCRDDNSSEEEEELLVDVIKRGKEDGNPQRSFLIRL----WK 427

Query: 509  WKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNV 330
            W+  +      LEK L+                        LH KI+K+KGSLDV  YN 
Sbjct: 428  WRSGK------LEKALV------------------------LHEKIKKMKGSLDVNAYNA 457

Query: 329  LLNGLFLERR--VEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKL 156
            +L+ L + ++  VEEAV VF YM+    V+S+SF+IMI+GLCR KE++KAM+ HDEML+L
Sbjct: 458  VLDRLMMRQKEMVEEAVVVFEYMKEINSVNSKSFTIMIQGLCRVKEMKKAMRSHDEMLRL 517

Query: 155  GLKPERTAYKRLIWGFK 105
            GLKP+   YKRLI GFK
Sbjct: 518  GLKPDLVTYKRLILGFK 534


>ref|XP_006826483.1| hypothetical protein AMTR_s00004p00243870 [Amborella trichopoda]
            gi|548830797|gb|ERM93720.1| hypothetical protein
            AMTR_s00004p00243870 [Amborella trichopoda]
          Length = 359

 Score =  281 bits (720), Expect = 4e-73
 Identities = 144/348 (41%), Positives = 215/348 (61%), Gaps = 22/348 (6%)
 Frame = -3

Query: 1085 VIYNLVIDSYCKRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRS 906
            V+YNL++D YC+ GDF  AF+ ++ +  + L+P F++Y SILDG+C++GN+    +++R 
Sbjct: 11   VVYNLILDGYCRNGDFVIAFEVIERIYGKGLEPDFASYGSILDGSCRFGNMGTAVRVLRI 70

Query: 905  MVEKKLVQMD----LPSE------------------YDLIVQKLCDLGKTYAAEMFFKRA 792
            M+EK+LV        P++                  YD  ++KLC LG T+AAE+ F  A
Sbjct: 71   MLEKRLVPTVGGEFSPNDCFTLNDNNCIVAAISYLHYDAFIRKLCKLGMTHAAELVFGIA 130

Query: 791  SDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGATVNGKSYYAFANVLCKEDPS 612
                + LQ+A Y   L+A S++ R+KEAVR+Y  + +R   +N        N L KE+PS
Sbjct: 131  RSALVPLQNACYIALLKAFSRDRRIKEAVRMYFLLLQRDIAMNISECNVLLNALFKEEPS 190

Query: 611  EEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLNLILEKGLLPDSFCCCSL 432
            EEV+++++ +I +GF P    +S +I+ QC KG W+EA +LL + LE+G++PD F   S 
Sbjct: 191  EEVNKVIKSVIEKGFYPDPLAISSYISAQCSKGGWQEANELLWVTLERGVMPDGFVWGSF 250

Query: 431  VGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLERRVEEAVRVFNYMRIQKL 252
            + HYC    +D A+ LH K  K    L+  +YN+LLN L+ E ++EEA  +F+YMR + +
Sbjct: 251  IRHYCEDGHLDYALSLHEKFAKSGNVLNAPSYNILLNRLYNEGKLEEASGMFDYMRNKDV 310

Query: 251  VSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKRLIWGF 108
             SS SF  MI   CREK+  +A K+HDEMLK GLKP+   YKRLI GF
Sbjct: 311  TSSASFMTMISWFCREKKFSEARKMHDEMLKKGLKPDEATYKRLISGF 358


>ref|XP_006857674.1| hypothetical protein AMTR_s00061p00160470 [Amborella trichopoda]
           gi|548861770|gb|ERN19141.1| hypothetical protein
           AMTR_s00061p00160470 [Amborella trichopoda]
          Length = 372

 Score =  206 bits (523), Expect = 3e-50
 Identities = 104/250 (41%), Positives = 160/250 (64%)
 Frame = -3

Query: 866 EYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAI 687
           +Y + +++LC LG T AAE+ F  A +  + LQ+A+Y   L+  S++ R+KEAVR+Y  +
Sbjct: 119 DYGVFIRRLCKLGMTDAAELVFGIAHNALVFLQNASYIALLKGFSRDKRIKEAVRMYFLL 178

Query: 686 SERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKW 507
            +R   +N        N L KE+ SEEV+++++ +I +GF P    +S  I+ QC KG W
Sbjct: 179 LQRDIALNICECNVLLNALFKEEQSEEVNKVIKSVIRKGFYPDPLAISSHISSQCSKGGW 238

Query: 506 KEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVL 327
           +EA +LL ++LE+G++P+ F C S + HYC    +D A+ LH K+ KL   L+  +YN+L
Sbjct: 239 QEANELLWVMLERGVMPNGFACGSFIRHYCEDGGLDYALSLHEKLVKLGNVLNAPSYNIL 298

Query: 326 LNGLFLERRVEEAVRVFNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLK 147
           L+ L+   ++EEA  +F++MR + + SS SF  MI   C EK+  +A K+HDEMLK GLK
Sbjct: 299 LDQLYNGGKLEEASEMFDHMRNKNVTSSASFITMISWFCWEKKFSEARKMHDEMLKKGLK 358

Query: 146 PERTAYKRLI 117
           P+   YKRLI
Sbjct: 359 PDEATYKRLI 368


>gb|EPS66849.1| hypothetical protein M569_07924, partial [Genlisea aurea]
          Length = 729

 Score =  197 bits (500), Expect = 1e-47
 Identities = 125/507 (24%), Positives = 247/507 (48%), Gaps = 17/507 (3%)
 Frame = -3

Query: 1580 FFNWAKSSLGFQPNLKSHCKLIQISIQSGFFQSAKPILDNLIQTHPASVLVESMI----- 1416
            F +WA+    F+ +L+ +C  I I  +   +++A+ + + +    P     +S+      
Sbjct: 57   FLDWARGLPFFRDHLQCYCLSIHILTRFKLYKTAQSLAEEVALRFPQDEHGDSVFSCLRD 116

Query: 1415 --QVCSGSDSQALILSSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVS 1242
              Q C  S     +   V++ +S+  +    L +    +  G++PSV + NA+L A+  +
Sbjct: 117  TYQACESSSG---VFDLVVKAFSNLKLTDRALNMIYSAKCCGFMPSVLSYNAVLEAIFRN 173

Query: 1241 AETK---LAWCFYGAVIRNGVLPDRSTWSILARILCKSGKLERIVGMLGM----GIY-NS 1086
            +  +    A C +  ++ NG+ P+  T+++L R LC + ++ + + +       G+  N 
Sbjct: 174  SSCRNVDSARCVFHEMMENGISPNVYTYNVLIRGLCANKEMNQGLSLFEQMEKRGVLPNV 233

Query: 1085 VIYNLVIDSYCKRGDFRAAFDQLDEMCNRKLDPGFSTYSSILDGACKYGNVEVVEKIIRS 906
            V +N VID+YCK  +   A+  L +M  R L+P   TY+ I++G CK G ++  + ++  
Sbjct: 234  VTFNTVIDAYCKSRNIDQAYGLLKQMWERNLEPNVITYNVIINGLCKEGRIKETDDVLVD 293

Query: 905  MVEKKLVQMDLPSEYDLIVQKLCDLGKTYAAEMFFKRASDDKIGLQDATYGCALRALSKE 726
            M  K L   ++   Y+ +V   C  G  + A         + +     TY C + +L K 
Sbjct: 294  MKAKGLAPNEIT--YNTLVDGYCKEGNFHQALALHAEMVKNGLSPNVVTYTCLINSLCKA 351

Query: 725  GRVKEAVRIYHAISERGATVNGKSYYAFANVLCKEDPSEEVSELLRDLIGRGFSPCASEL 546
            G ++ A+  ++ ++ RG   N K+Y    +   ++   +E   L+ ++I +GFSP     
Sbjct: 352  GNLQRAMDYFNQMAVRGLKPNEKTYTTLIDGFSQQGFMDEAYGLVEEMISKGFSPSIVTY 411

Query: 545  SKFITLQCKKGKWKEAEDLLNLILEKGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEK 366
            +  I   C+ G+  E  +++  +  +G+ PD     S++  YC    +D A  +   + +
Sbjct: 412  NALINGHCQLGRVDEGLNVIQTMTSRGVFPDVVSYSSIINGYCRNLDLDKAFSVKEDMSQ 471

Query: 365  LKGSLDVTTYNVLLNGLFLERRVEEAVRVFNYM--RIQKLVSSESFSIMIRGLCREKELR 192
                 D  TY+ L+ GL   RR++EA ++F  M  ++  L    +++ +I   C E ++ 
Sbjct: 472  KGIFPDTITYSSLIQGLCELRRLDEACKLFTEMSSKLNLLPDKCTYTCLINAYCAENDIP 531

Query: 191  KAMKVHDEMLKLGLKPERTAYKRLIWG 111
            KA+ +HDEM++ GL P+  +Y  L+ G
Sbjct: 532  KAIHLHDEMIRRGLFPDVISYNVLVNG 558



 Score = 77.8 bits (190), Expect = 1e-11
 Identities = 57/237 (24%), Positives = 93/237 (39%)
 Frame = -3

Query: 818 AAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGATVNGKSYYAFA 639
           +A   F    ++ I     TY   +R L     + + + ++  + +RG   N  ++    
Sbjct: 181 SARCVFHEMMENGISPNVYTYNVLIRGLCANKEMNQGLSLFEQMEKRGVLPNVVTFNTVI 240

Query: 638 NVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLNLILEKGLL 459
           +  CK    ++   LL+ +  R   P     +  I   CK+G+ KE +D+L  +  KGL 
Sbjct: 241 DAYCKSRNIDQAYGLLKQMWERNLEPNVITYNVIINGLCKEGRIKETDDVLVDMKAKGLA 300

Query: 458 PDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLERRVEEAVRV 279
           P+     +LV  YC       A+ LH ++ K   S +V TY  L+N L            
Sbjct: 301 PNEITYNTLVDGYCKEGNFHQALALHAEMVKNGLSPNVVTYTCLINSL------------ 348

Query: 278 FNYMRIQKLVSSESFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKRLIWGF 108
                                 C+   L++AM   ++M   GLKP    Y  LI GF
Sbjct: 349 ----------------------CKAGNLQRAMDYFNQMAVRGLKPNEKTYTTLIDGF 383


>ref|XP_006287148.1| hypothetical protein CARUB_v10000318mg [Capsella rubella]
            gi|482555854|gb|EOA20046.1| hypothetical protein
            CARUB_v10000318mg [Capsella rubella]
          Length = 729

 Score =  191 bits (486), Expect = 6e-46
 Identities = 125/482 (25%), Positives = 231/482 (47%), Gaps = 11/482 (2%)
 Frame = -3

Query: 1520 LIQISIQSGFFQSAKPILDNLIQTHPAS--VLVESMIQVCSGSDSQALILSSVLECYSHK 1347
            +I I ++SG    A+  +  +I+    S   +V +++   S   S   +   ++  Y   
Sbjct: 119  MIHILVRSGRLSDAQGCVLRMIRRSGVSRVEIVNALVSTYSNCGSNDSVFDLLIRTYVQA 178

Query: 1346 GMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAWCFYGAVIRNGVLPDRSTW 1167
                E  + F   R +GY  S+ ACNAL+ +L      +LAW  Y  + R+GV  +  T 
Sbjct: 179  RKLREAHEAFSLLRSKGYTVSIDACNALIGSLVRIGWVELAWGVYQEISRSGVGINVFTL 238

Query: 1166 SILARILCKSGKLERIVGMLGM----GIYNSVI-YNLVIDSYCKRGDFRAAFDQLDEMCN 1002
            +I+   LCK GK+E+I   L      G+Y  ++ YN +I +Y  +G    AF+ +D M +
Sbjct: 239  NIMVNALCKDGKMEKIGTFLSQVKEKGVYPDIVTYNTLISAYSSKGLMEEAFELMDAMPS 298

Query: 1001 RKLDPGFSTYSSILDGACKYGNVEVVEKIIRSMVEKKLVQMDLPSEYDLIVQKLCDLGKT 822
            +   PG  T++++++G CK+G  E  +++   M+   L      + Y  ++ + C  G  
Sbjct: 299  KGFSPGVYTFNTVINGLCKHGRYERAKEVFAEMLRSGLSPDS--TTYRSLLMEACKKGDA 356

Query: 821  YAAEMFFKRASDDKIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGATVNGKSYYAF 642
               E  F       I      +   +   ++ G + +A+  +H++ + G + +   Y   
Sbjct: 357  VETEKIFSDMRCRDIVPDLVCFSSVMSLSARSGNLDKALVYFHSVKDAGLSPDNVIYTIL 416

Query: 641  ANVLCKEDPSEEVSELLRDLIGRGFSPCASELSKFITL---QCKKGKWKEAEDLLNLILE 471
                CK+    E   L  D++ +G   CA ++  + T+    CK+   +EA+ L N + E
Sbjct: 417  IQGYCKKGMISEAMNLRNDMLRQG---CAMDVVTYNTILHGLCKQKMLREADKLFNEMTE 473

Query: 470  KGLLPDSFCCCSLVGHYCSRKQIDSAIGLHNKIEKLKGSLDVTTYNVLLNGLFLERRVEE 291
            +GL PDS+    L+  +C    + +A+ L  K+++ +  LDV TYN LL+G      ++ 
Sbjct: 474  RGLFPDSYTLTILIDGHCKLGNLQNAMELFKKMKEKRIKLDVVTYNTLLDGFGKVGDIDT 533

Query: 290  AVRVFNYMRIQKLVSSE-SFSIMIRGLCREKELRKAMKVHDEMLKLGLKPERTAYKRLIW 114
            A  ++  M  ++++ +  S+SIM+  LC +  L +A +V DEM    +KP       +I 
Sbjct: 534  AKEIWADMVSREILPTPISYSIMVNALCSKGHLSEAFRVWDEMTSKSIKPTVMICNSMIK 593

Query: 113  GF 108
            G+
Sbjct: 594  GY 595



 Score =  148 bits (374), Expect = 6e-33
 Identities = 110/458 (24%), Positives = 195/458 (42%), Gaps = 46/458 (10%)
 Frame = -3

Query: 1376 SSVLECYSHKGMFIEGLQVFKETRGRGYLPSVCACNALLSALQVSAETKLAWCFYGAVIR 1197
            ++++  YS KG+  E  ++      +G+ P V   N +++ L      + A   +  ++R
Sbjct: 274  NTLISAYSSKGLMEEAFELMDAMPSKGFSPGVYTFNTVINGLCKHGRYERAKEVFAEMLR 333

Query: 1196 NGVLPDRSTWSILARILCKSG---KLERIV-------------------------GMLGM 1101
            +G+ PD +T+  L    CK G   + E+I                          G L  
Sbjct: 334  SGLSPDSTTYRSLLMEACKKGDAVETEKIFSDMRCRDIVPDLVCFSSVMSLSARSGNLDK 393

Query: 1100 GIY------------NSVIYNLVIDSYCKRGDFRAAFDQLDEMCNRKLDPGFSTYSSILD 957
             +             ++VIY ++I  YCK+G    A +  ++M  +       TY++IL 
Sbjct: 394  ALVYFHSVKDAGLSPDNVIYTILIQGYCKKGMISEAMNLRNDMLRQGCAMDVVTYNTILH 453

Query: 956  GACKYGNVEVVEKIIRSMVEKKLVQMDLPSEYDL--IVQKLCDLGKTYAAEMFFKRASDD 783
            G CK   +   +K+   M E+ L     P  Y L  ++   C LG    A   FK+  + 
Sbjct: 454  GLCKQKMLREADKLFNEMTERGL----FPDSYTLTILIDGHCKLGNLQNAMELFKKMKEK 509

Query: 782  KIGLQDATYGCALRALSKEGRVKEAVRIYHAISERGATVNGKSYYAFANVLCKEDPSEEV 603
            +I L   TY   L    K G +  A  I+  +  R       SY    N LC +    E 
Sbjct: 510  RIKLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSREILPTPISYSIMVNALCSKGHLSEA 569

Query: 602  SELLRDLIGRGFSPCASELSKFITLQCKKGKWKEAEDLLNLILEKGLLPDSFCCCSLVGH 423
              +  ++  +   P     +  I   C+ G   + E  L  ++ +G +PD     +L+  
Sbjct: 570  FRVWDEMTSKSIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYG 629

Query: 422  YCSRKQIDSAIGLHNKIEKLKGSL--DVTTYNVLLNGLFLERRVEEAVRVFNYMRIQKLV 249
            +   + +  A GL  K+E+ +G L  DV TYN +L+G   + +++EA  V   M I++ +
Sbjct: 630  FVKEENMSKAFGLVKKMEEKQGGLVPDVFTYNTILHGFCRQNQMKEAEVVLRKM-IERGI 688

Query: 248  SSE--SFSIMIRGLCREKELRKAMKVHDEMLKLGLKPE 141
              +  +++ +I G   +  L +A + HDEML+ G  P+
Sbjct: 689  EPDRSTYTSLINGFVSQDNLTEAFRFHDEMLQRGFSPD 726


Top