BLASTX nr result

ID: Mentha22_contig00030221 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00030221
         (1092 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007223002.1| hypothetical protein PRUPE_ppa006785mg [Prun...   484   e-134
gb|EXB55995.1| hypothetical protein L484_018781 [Morus notabilis]     483   e-134
ref|XP_006340302.1| PREDICTED: putative pentatricopeptide repeat...   483   e-134
ref|XP_004251431.1| PREDICTED: putative pentatricopeptide repeat...   476   e-132
ref|XP_002515231.1| pentatricopeptide repeat-containing protein,...   470   e-130
ref|XP_002279193.1| PREDICTED: putative pentatricopeptide repeat...   469   e-130
ref|XP_006492377.1| PREDICTED: putative pentatricopeptide repeat...   468   e-129
ref|XP_006444562.1| hypothetical protein CICLE_v10019795mg [Citr...   466   e-129
ref|XP_007051214.1| Pentatricopeptide repeat superfamily protein...   465   e-128
ref|XP_003518677.1| PREDICTED: putative pentatricopeptide repeat...   463   e-128
ref|XP_007132128.1| hypothetical protein PHAVU_011G069000g [Phas...   460   e-127
ref|XP_004495883.1| PREDICTED: putative pentatricopeptide repeat...   460   e-127
ref|XP_004295891.1| PREDICTED: putative pentatricopeptide repeat...   458   e-126
ref|XP_002310039.2| hypothetical protein POPTR_0007s06780g [Popu...   457   e-126
ref|XP_003591356.1| Pentatricopeptide repeat-containing protein ...   450   e-124
ref|XP_004163923.1| PREDICTED: putative pentatricopeptide repeat...   450   e-124
ref|XP_004150840.1| PREDICTED: putative pentatricopeptide repeat...   450   e-124
ref|XP_006841601.1| hypothetical protein AMTR_s00003p00209480 [A...   436   e-120
ref|XP_002889402.1| hypothetical protein ARALYDRAFT_470202 [Arab...   436   e-120
gb|EPS57811.1| hypothetical protein M569_17007, partial [Genlise...   432   e-119

>ref|XP_007223002.1| hypothetical protein PRUPE_ppa006785mg [Prunus persica]
           gi|462419938|gb|EMJ24201.1| hypothetical protein
           PRUPE_ppa006785mg [Prunus persica]
          Length = 395

 Score =  484 bits (1245), Expect = e-134
 Identities = 232/302 (76%), Positives = 260/302 (86%)
 Frame = +1

Query: 1   RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
           RTL QEKSM+DARNVYHSLKH F PNL TFNILL+GWKS EEAE FF EM++MGVEPDIV
Sbjct: 74  RTLCQEKSMTDARNVYHSLKHNFTPNLQTFNILLSGWKSSEEAEGFFKEMREMGVEPDIV 133

Query: 181 SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
           SYNCLVDV+CK  E+ KAY ++ +MRDE I  DV TYTS+IGGLGL GQPDKAR+VLKEM
Sbjct: 134 SYNCLVDVYCKSIEIDKAYKVVEQMRDENISPDVFTYTSIIGGLGLVGQPDKARDVLKEM 193

Query: 361 REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
           +E+GCYPDVAAYNA IRNFCIAKRLGDAY +M+ MM KGL PNATT+N+  R  FW NDL
Sbjct: 194 KEFGCYPDVAAYNAAIRNFCIAKRLGDAYGLMDAMMSKGLSPNATTYNLFFRVFFWSNDL 253

Query: 541 KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
           +SSW LY RM   GCLPNTQSCMFLIRL KRQEKVD+AL+LWNDMVEKGFG++ LVSDVL
Sbjct: 254 QSSWGLYGRMMHTGCLPNTQSCMFLIRLFKRQEKVDMALQLWNDMVEKGFGSYILVSDVL 313

Query: 721 FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
           FDLLCD+GKL EAERCFLQM+EKG KPS VSFRRIKVLMELA K ++L NL+EKM  FGS
Sbjct: 314 FDLLCDLGKLMEAERCFLQMMEKGHKPSNVSFRRIKVLMELANKHEALKNLTEKMAVFGS 373

Query: 901 AI 906
           +I
Sbjct: 374 SI 375


>gb|EXB55995.1| hypothetical protein L484_018781 [Morus notabilis]
          Length = 486

 Score =  483 bits (1243), Expect = e-134
 Identities = 231/299 (77%), Positives = 262/299 (87%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLKH F+PNL TFNILL+GWKSCEEAE FF EM++MGV+PD+V
Sbjct: 183  RTLCQEKSMADARNVYHSLKHSFRPNLQTFNILLSGWKSCEEAEGFFEEMREMGVKPDVV 242

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCLVDV+CKGRE++KA+ ++ KMRDE I  DVITYTS+IGGLGL GQPDKAR+VLKEM
Sbjct: 243  SYNCLVDVYCKGREIEKAFKVVAKMRDEDIQPDVITYTSIIGGLGLVGQPDKARDVLKEM 302

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +E GCYPDVAAYNA IRNFCIAKRLG AY++M+EM+ KGL  NATT+N+  R  +W NDL
Sbjct: 303  KEDGCYPDVAAYNAAIRNFCIAKRLGVAYSLMDEMVSKGLNANATTYNLFFRVFYWSNDL 362

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
             SSW+LY RM E GCLPNTQSCMFLIRL +RQEKV++AL+LWNDMVEKGFG++ LVSDVL
Sbjct: 363  TSSWNLYGRMMETGCLPNTQSCMFLIRLFRRQEKVEMALQLWNDMVEKGFGSYVLVSDVL 422

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFG 897
            FDLLCD GKL EAERCFLQMVEKGQKPS VS+RRIKVLMELA K DSL  LSEKM  FG
Sbjct: 423  FDLLCDAGKLMEAERCFLQMVEKGQKPSNVSYRRIKVLMELANKQDSLHILSEKMALFG 481


>ref|XP_006340302.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Solanum tuberosum]
          Length = 505

 Score =  483 bits (1243), Expect = e-134
 Identities = 229/303 (75%), Positives = 269/303 (88%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            R L QEKSMSDARNVYH LK++F+PN  TFNILL+GWKS E+AE FF EM+D+GVEPD+V
Sbjct: 185  RALCQEKSMSDARNVYHRLKYKFRPNNQTFNILLSGWKSSEDAEVFFKEMRDLGVEPDVV 244

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            S+NCLVDV+CKGRE++KA+ ++ +MR++ I  DVITYTSLIGGLGLAGQPDKAR +LKEM
Sbjct: 245  SFNCLVDVYCKGREMEKAFTVVEEMREKDITPDVITYTSLIGGLGLAGQPDKARHILKEM 304

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            REYGCYPDVAAYNA IRNFCIAKR+GDAY++M+EM+  GL PNATT+NV LRS FW+NDL
Sbjct: 305  REYGCYPDVAAYNAAIRNFCIAKRIGDAYSLMDEMVRNGLSPNATTYNVFLRSFFWINDL 364

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            KSSW+LY RMKE GCLP+TQSCMFLIRL +R EKV++ALELW+DM+E+GFG++ LVSDVL
Sbjct: 365  KSSWTLYQRMKETGCLPSTQSCMFLIRLSRRHEKVEMALELWDDMMERGFGSYILVSDVL 424

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKL EAERCFLQMV KGQKPS VSFRRIKVLMELA K ++L  LSEKM AFGS
Sbjct: 425  FDLLCDLGKLAEAERCFLQMVNKGQKPSNVSFRRIKVLMELANKQETLKLLSEKMAAFGS 484

Query: 901  AIQ 909
            + Q
Sbjct: 485  STQ 487


>ref|XP_004251431.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Solanum lycopersicum]
          Length = 500

 Score =  476 bits (1226), Expect = e-132
 Identities = 225/303 (74%), Positives = 266/303 (87%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            R L QEKSMSDARNVYH LK++F+PN  TFNILL+GWKS E+AE FF EM+D+GVEPD+V
Sbjct: 185  RALCQEKSMSDARNVYHRLKYKFRPNNQTFNILLSGWKSSEDAEVFFKEMRDLGVEPDVV 244

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            S+NCLVDV+CKGRE++KA+ ++ +MR++ I  DVITYTSLIGGLGL GQPDKAR +LKEM
Sbjct: 245  SFNCLVDVYCKGREMEKAFRVVEEMREKDITPDVITYTSLIGGLGLVGQPDKARHILKEM 304

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            REYGCYPD AAYNA +RNFCIAKR+GDAY++M+EM+  GL PNATT+NV LRS FW+NDL
Sbjct: 305  REYGCYPDAAAYNAAVRNFCIAKRIGDAYSLMDEMVRNGLSPNATTYNVFLRSFFWINDL 364

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            KSSW+LY RMKE GCLP+TQSCMFLIRL +R EKV++ALELW+DM+E+GFG++ LVSDVL
Sbjct: 365  KSSWTLYQRMKETGCLPSTQSCMFLIRLSRRHEKVEMALELWDDMMERGFGSYILVSDVL 424

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKL EAERCFLQMV KGQKPS VSFRRIKVLMELA K ++L  LSEKM AF S
Sbjct: 425  FDLLCDLGKLAEAERCFLQMVNKGQKPSNVSFRRIKVLMELANKQEALKLLSEKMAAFRS 484

Query: 901  AIQ 909
            + Q
Sbjct: 485  STQ 487


>ref|XP_002515231.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545711|gb|EEF47215.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 505

 Score =  470 bits (1210), Expect = e-130
 Identities = 221/303 (72%), Positives = 260/303 (85%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYH LK EFKPNL TFNILL+GWK  EEAE FF EM+++G++PD+V
Sbjct: 184  RTLCQEKSMTDARNVYHRLKKEFKPNLQTFNILLSGWKQSEEAELFFEEMRELGIKPDVV 243

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYN L+DV+CK RE++KAY ++ KMR+E I  DVITYTS+IGGLGL GQPDKAR++L EM
Sbjct: 244  SYNSLIDVYCKDREMEKAYKVVEKMREEDISPDVITYTSIIGGLGLVGQPDKARDILNEM 303

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPDVAAYNAVIRN+CIAKRLGDA N+M+EM  KGL PNATT+N+  R  +W NDL
Sbjct: 304  KEYGCYPDVAAYNAVIRNYCIAKRLGDASNLMDEMASKGLSPNATTYNLFFRVFYWSNDL 363

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            ++SWSLY RM E GCLPNTQSCMFLIRL ++ EKV++AL LWNDMVEKGFG++ LVSDVL
Sbjct: 364  RNSWSLYRRMMESGCLPNTQSCMFLIRLFRKHEKVEMALTLWNDMVEKGFGSYILVSDVL 423

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKL EAE+CFLQM+EKG KPS VSFRRIKVLMEL  K D+L NL +KM  FGS
Sbjct: 424  FDLLCDMGKLVEAEKCFLQMIEKGHKPSNVSFRRIKVLMELVNKHDALLNLQKKMAIFGS 483

Query: 901  AIQ 909
            +IQ
Sbjct: 484  SIQ 486


>ref|XP_002279193.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Vitis vinifera]
          Length = 526

 Score =  469 bits (1208), Expect = e-130
 Identities = 222/306 (72%), Positives = 261/306 (85%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM DARNVYHSLKH+F+P+L TFNILL+GWKS EEAE FF EM++MGVEPD+V
Sbjct: 205  RTLCQEKSMRDARNVYHSLKHDFRPDLRTFNILLSGWKSAEEAEGFFDEMREMGVEPDVV 264

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCL+DV+CKGRE+++AY +I KMRDE I  DVI+YTS+IGGLGL GQPDKAR+VLKEM
Sbjct: 265  SYNCLIDVYCKGREIERAYKVIDKMRDEQISPDVISYTSIIGGLGLVGQPDKARDVLKEM 324

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPDVAAYNA IRNFCIA RLGDA  +M+EM+ KGL PNATT+N+  R  +W NDL
Sbjct: 325  KEYGCYPDVAAYNAAIRNFCIANRLGDADGLMDEMVGKGLSPNATTYNLFFRCFYWSNDL 384

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
              S  LY RMK+ GCLPNTQSCMFL RL +RQEKV++ALELWNDMVEKGFG++ LVSDVL
Sbjct: 385  GRSCGLYQRMKKTGCLPNTQSCMFLTRLFRRQEKVEMALELWNDMVEKGFGSYILVSDVL 444

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FD+LCD+GKL E E+C LQM+EKG KPS VSFRRIKVLMELA K ++L NL+EKM  FG 
Sbjct: 445  FDMLCDMGKLVEVEKCCLQMIEKGHKPSNVSFRRIKVLMELANKHEALQNLTEKMAMFGP 504

Query: 901  AIQARK 918
            + Q ++
Sbjct: 505  STQVQE 510


>ref|XP_006492377.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Citrus sinensis]
          Length = 506

 Score =  468 bits (1203), Expect = e-129
 Identities = 222/313 (70%), Positives = 267/313 (85%), Gaps = 1/313 (0%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLK++F+PNL TFNILL+GWKS +EAE F  EM++MGV+PDIV
Sbjct: 186  RTLCQEKSMTDARNVYHSLKYDFRPNLQTFNILLSGWKSVDEAEGFLEEMREMGVKPDIV 245

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCL+DV+CK R+++KAY ++ KMRDE I  DVI+YTS+IGGLGL GQPDKAR+VLKEM
Sbjct: 246  SYNCLIDVYCKDRQVEKAYKIVEKMRDEDISPDVISYTSIIGGLGLVGQPDKARDVLKEM 305

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPD AAYNA IRN+CIAKRL DA  +M+EM+ KGL PNATT+N+  R  +W NDL
Sbjct: 306  KEYGCYPDAAAYNAAIRNYCIAKRLRDASGLMDEMVEKGLSPNATTYNLFFRVFYWSNDL 365

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +SSW+LY RM   GCLPNTQSCMFL++L KRQEKV++AL+LWNDMVEKGFG++ LVSDVL
Sbjct: 366  RSSWNLYCRMMGTGCLPNTQSCMFLVKLCKRQEKVEIALQLWNDMVEKGFGSYILVSDVL 425

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFG- 897
            FDLLCD+GKL EAE+ FL+M+EKG KPSQVSFRRIKVLMELA K ++L NLS KM  FG 
Sbjct: 426  FDLLCDMGKLVEAEKSFLEMIEKGHKPSQVSFRRIKVLMELANKQEALQNLSNKMALFGP 485

Query: 898  SAIQARKTYVDEL 936
            S I  R+ Y+ E+
Sbjct: 486  SMIPKREEYLAEM 498


>ref|XP_006444562.1| hypothetical protein CICLE_v10019795mg [Citrus clementina]
            gi|557546824|gb|ESR57802.1| hypothetical protein
            CICLE_v10019795mg [Citrus clementina]
          Length = 506

 Score =  466 bits (1199), Expect = e-129
 Identities = 221/313 (70%), Positives = 266/313 (84%), Gaps = 1/313 (0%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLK++F+PNL TFNILL+GWKS +EAE F  EM++MGV+PDIV
Sbjct: 186  RTLCQEKSMTDARNVYHSLKYDFRPNLQTFNILLSGWKSVDEAEGFLEEMREMGVKPDIV 245

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCL+DV+CK R+++KAY ++ KMRDE I  DVI+YTS+IGGLGL GQPDKAR+VLKEM
Sbjct: 246  SYNCLIDVYCKDRQVEKAYKIVEKMRDEDISPDVISYTSIIGGLGLVGQPDKARDVLKEM 305

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPD AAYNA IRN+CIAKRL DA  +M+EM+ KGL PNATT+N+  R  +W NDL
Sbjct: 306  KEYGCYPDAAAYNAAIRNYCIAKRLRDASGLMDEMVEKGLSPNATTYNLFFRVFYWSNDL 365

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +SSW+LY RM   GCLPNTQSCMFL++L KRQEKV++AL+LWNDMVEKGFG++ LVSDVL
Sbjct: 366  RSSWNLYCRMMGTGCLPNTQSCMFLVKLCKRQEKVEIALQLWNDMVEKGFGSYILVSDVL 425

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFG- 897
            FDLLCD+GKL EAE+ FL+M+EKG KPSQVSFRRIK LMELA K ++L NLS KM  FG 
Sbjct: 426  FDLLCDMGKLVEAEKSFLEMIEKGHKPSQVSFRRIKALMELANKQEALQNLSNKMALFGP 485

Query: 898  SAIQARKTYVDEL 936
            S I  R+ Y+ E+
Sbjct: 486  SMIPKREEYLAEM 498


>ref|XP_007051214.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508703475|gb|EOX95371.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 511

 Score =  465 bits (1196), Expect = e-128
 Identities = 219/303 (72%), Positives = 259/303 (85%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEK M DARNVYHSLKH+F+PNL TFNILL+GWKS EEAE FF EM+ +GV+PD+V
Sbjct: 184  RTLCQEKCMKDARNVYHSLKHDFRPNLQTFNILLSGWKSSEEAEGFFNEMRGLGVKPDVV 243

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCL+DV+CK R++ KAY ++ +M DE I  DVITYTS+IGGLGL GQPDKA++VLKEM
Sbjct: 244  SYNCLIDVYCKNRDIDKAYRVVERMTDEEIWPDVITYTSIIGGLGLVGQPDKAKDVLKEM 303

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +E+GCYPDVAAYNA IRNFCIAKRLGDAYN+M+EM+ KGL PNATT+N+  R  +W NDL
Sbjct: 304  KEHGCYPDVAAYNAAIRNFCIAKRLGDAYNLMDEMVGKGLSPNATTYNLFFRVFYWSNDL 363

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +SS SLY RM + GCLPNTQSCMFLIRL +R EKV +AL+LWNDMVEKGFG++ LVSDVL
Sbjct: 364  RSSCSLYQRMMDSGCLPNTQSCMFLIRLFRRHEKVGMALQLWNDMVEKGFGSYVLVSDVL 423

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKL EAE+CF +M+EK  KPS VSFRRIKVLMELA K +++ NL EKM  FGS
Sbjct: 424  FDLLCDMGKLVEAEKCFSEMIEKRHKPSNVSFRRIKVLMELANKHEAVKNLKEKMAVFGS 483

Query: 901  AIQ 909
            +IQ
Sbjct: 484  SIQ 486


>ref|XP_003518677.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Glycine max]
          Length = 500

 Score =  463 bits (1191), Expect = e-128
 Identities = 216/310 (69%), Positives = 263/310 (84%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLKH F+PNL TFNILL+GWK+ E+A+ FF EMK+MGV PD+V
Sbjct: 179  RTLCQEKSMADARNVYHSLKHRFRPNLQTFNILLSGWKTPEDADLFFKEMKEMGVTPDVV 238

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YN L+DV+CKGRE++KAY ++ +MRD+    DVITYT +IGGLGL GQPDKAR VLKEM
Sbjct: 239  TYNSLMDVYCKGREIEKAYKMLDEMRDQDFSPDVITYTCIIGGLGLIGQPDKARNVLKEM 298

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPD AAYNA IRNFCIAKRLGDA+ ++EEM+ KGL PNATT+N+  R  +W NDL
Sbjct: 299  KEYGCYPDAAAYNAAIRNFCIAKRLGDAHGLVEEMVTKGLSPNATTYNLFFRVFYWSNDL 358

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +SSW++Y RM   GCLPNTQSCMFLIRL +R EKV++AL+ W DMVEKGFG++TLVSDVL
Sbjct: 359  QSSWNMYQRMMVEGCLPNTQSCMFLIRLFRRHEKVEMALQFWGDMVEKGFGSYTLVSDVL 418

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKLEEAE+CFL+MVEKGQKPS VSFRRIKVLMELA + ++L +L +KM  FG 
Sbjct: 419  FDLLCDMGKLEEAEKCFLEMVEKGQKPSHVSFRRIKVLMELANRHEALQSLMQKMAMFGR 478

Query: 901  AIQARKTYVD 930
             +Q  ++ V+
Sbjct: 479  PLQVDQSTVN 488


>ref|XP_007132128.1| hypothetical protein PHAVU_011G069000g [Phaseolus vulgaris]
            gi|561005128|gb|ESW04122.1| hypothetical protein
            PHAVU_011G069000g [Phaseolus vulgaris]
          Length = 500

 Score =  460 bits (1184), Expect = e-127
 Identities = 215/317 (67%), Positives = 266/317 (83%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLKH F+PNL TFNILL+GWK+ E+A+ FF EMK+MGV PD+V
Sbjct: 179  RTLCQEKSMTDARNVYHSLKHRFRPNLQTFNILLSGWKTPEDADGFFKEMKEMGVTPDVV 238

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YN LVDV+CKGRE++KAY ++ +MRD  +  DVITYT +IGGLGL GQPDKAR VLKEM
Sbjct: 239  TYNSLVDVYCKGREIEKAYKVLDEMRDRDLSPDVITYTCIIGGLGLIGQPDKARGVLKEM 298

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPD AAYNA IRNFCIAKRLGDA+ +++EM+  GL PNATT+N+  R  +W NDL
Sbjct: 299  KEYGCYPDAAAYNAAIRNFCIAKRLGDAHGLVKEMVSMGLCPNATTYNLFFRVFYWSNDL 358

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
             SSW +Y RM   GCLPNTQSCMFLIRL ++ EKV++AL+LW +MVEKGFG++TLVSDVL
Sbjct: 359  HSSWIMYKRMMVEGCLPNTQSCMFLIRLFRKHEKVEMALQLWENMVEKGFGSYTLVSDVL 418

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKLEEAE+CFL+M+EKGQKPS VSFRRIKVLMELA + ++L +L++KM+ FG 
Sbjct: 419  FDLLCDMGKLEEAEKCFLEMIEKGQKPSNVSFRRIKVLMELANRHEALESLTQKMSIFGR 478

Query: 901  AIQARKTYVDELDKPIS 951
             +Q  ++ V + + P S
Sbjct: 479  PLQLHQSSVSQTETPDS 495


>ref|XP_004495883.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Cicer arietinum]
          Length = 502

 Score =  460 bits (1183), Expect = e-127
 Identities = 217/306 (70%), Positives = 260/306 (84%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLKH F PNL TFNILL+GWK+ E+AE+FF EMK+MGVEPD+V
Sbjct: 179  RTLCQEKSMTDARNVYHSLKHSFHPNLQTFNILLSGWKTPEDAESFFKEMKEMGVEPDVV 238

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YN LVDV+CKGRE+ KAY +  +MR+  +  DVITYT +IGGLGL GQPDKAR+VLKEM
Sbjct: 239  TYNSLVDVYCKGREIDKAYKVFDEMRERDLSPDVITYTCIIGGLGLIGQPDKARDVLKEM 298

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +E+G YPDV AYNA IRNFCIAKRLGDAY++++EM  KGL PNATT+N+  R  +W NDL
Sbjct: 299  KEFGIYPDVPAYNAAIRNFCIAKRLGDAYDLVDEMTNKGLSPNATTYNLFFRIYYWSNDL 358

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
             SSWSLY RM   GCLPNTQSCMFLIRL+K+ EK ++AL+LW DMVEKGFG++TLVSDVL
Sbjct: 359  PSSWSLYKRMMVEGCLPNTQSCMFLIRLLKKHEKAEMALQLWGDMVEKGFGSYTLVSDVL 418

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKL EAE+CFL+MVEKGQKPS VSFRRIKVLMELA + +++ NL++KM  FG 
Sbjct: 419  FDLLCDMGKLLEAEKCFLEMVEKGQKPSNVSFRRIKVLMELANRHEAIQNLTQKMGVFGQ 478

Query: 901  AIQARK 918
             +Q R+
Sbjct: 479  TLQVRE 484


>ref|XP_004295891.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Fragaria vesca subsp. vesca]
          Length = 504

 Score =  458 bits (1179), Expect = e-126
 Identities = 221/320 (69%), Positives = 262/320 (81%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYH LKH F+PNL TFNILL+GWKS EEAE FF EM+++G++PD+V
Sbjct: 184  RTLCQEKSMTDARNVYHKLKHSFEPNLQTFNILLSGWKSSEEAEGFFEEMRELGLKPDVV 243

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCL+DV+ K RE++K + ++ KMRDE I  D ITYT +IGG GL GQPDKAR+VLKEM
Sbjct: 244  SYNCLIDVYSKNREMEKVFKVMEKMRDEEIWPDKITYTCVIGGFGLVGQPDKARDVLKEM 303

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +E GCYPDVAAYNA IRNFCIAKRLGDA  +MEEMM  GL PNATT+N+  R  FW +DL
Sbjct: 304  KELGCYPDVAAYNAAIRNFCIAKRLGDANGLMEEMMSNGLSPNATTYNLFFRVFFWSSDL 363

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            ++SWSLY RM  +GCLPNTQSCMFLIRL ++ EKVDLAL+LWNDM+E+GFG++ LVSDVL
Sbjct: 364  QNSWSLYGRMMYMGCLPNTQSCMFLIRLFRKLEKVDLALQLWNDMIERGFGSYILVSDVL 423

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FDLLCD+GKL EAE CFLQMVEKG KPS VSFRRIKVLMELA K ++L NL+EKM  FGS
Sbjct: 424  FDLLCDMGKLTEAETCFLQMVEKGHKPSNVSFRRIKVLMELANKHEALKNLTEKMALFGS 483

Query: 901  AIQARKTYVDELDKPISEIM 960
            +I   ++       P SE +
Sbjct: 484  SIHLPESMDRSTVVPCSEAL 503


>ref|XP_002310039.2| hypothetical protein POPTR_0007s06780g [Populus trichocarpa]
            gi|550334290|gb|EEE90489.2| hypothetical protein
            POPTR_0007s06780g [Populus trichocarpa]
          Length = 509

 Score =  457 bits (1175), Expect = e-126
 Identities = 218/304 (71%), Positives = 259/304 (85%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSMSDARNVYH LK  F+PNL TFNILL+GWKS EEAE F+ EMK++GV+PDIV
Sbjct: 190  RTLCQEKSMSDARNVYHHLKKGFRPNLQTFNILLSGWKSSEEAELFYEEMKELGVKPDIV 249

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YN L+DVFCKGREL+KAY ++ +MR+E I  DVITYTS+IGGLGL GQPDKAR++LKEM
Sbjct: 250  TYNSLIDVFCKGRELEKAYGVVARMREEDILPDVITYTSIIGGLGLVGQPDKARDMLKEM 309

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +E+GCYPDVAAYNAVIRN+CIAKRL  AY++M EM  KG+ PNAT++N+  R   W NDL
Sbjct: 310  KEHGCYPDVAAYNAVIRNYCIAKRLDAAYSLMAEMESKGMSPNATSYNLFFRVFSWSNDL 369

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            ++SW  Y RM + GCLPNTQSCMFLI+L KR EKV++AL+LWNDMVEKGFG++ LVSDVL
Sbjct: 370  RNSWDFYGRMMDAGCLPNTQSCMFLIKLFKRHEKVEMALQLWNDMVEKGFGSYILVSDVL 429

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
              +LCD+GKL EAE+CFLQMVEKG KPS VSFRRIKVLMELA K D++ NLSEKM  FGS
Sbjct: 430  LGMLCDMGKLVEAEKCFLQMVEKGHKPSNVSFRRIKVLMELANKHDAIRNLSEKMAIFGS 489

Query: 901  AIQA 912
            +I+A
Sbjct: 490  SIRA 493


>ref|XP_003591356.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355480404|gb|AES61607.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 518

 Score =  450 bits (1158), Expect = e-124
 Identities = 212/319 (66%), Positives = 265/319 (83%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLKH F+PNL TFNILL+GWK+ E+AE F  EMK+MGVEPD+V
Sbjct: 173  RTLCQEKSMTDARNVYHSLKHNFRPNLQTFNILLSGWKNVEDAELFVNEMKEMGVEPDVV 232

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YN LVDV+CKGRE++KAY +  +MR++ +  DVITYTS+IGGLGL GQPDKAR+VLKEM
Sbjct: 233  TYNSLVDVYCKGREIEKAYKVFDEMREKDLSPDVITYTSVIGGLGLVGQPDKARDVLKEM 292

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYG YPDV AYNA IRN+CIAKRLG A+ +++EM+ KGL PNATT+N+  R  +W NDL
Sbjct: 293  KEYGVYPDVPAYNAAIRNYCIAKRLGIAFELVDEMVNKGLSPNATTYNLFFRVFYWSNDL 352

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +SSW+LY RM   GCLP TQSCMFLIRL KR EK+++AL+LW +MVEKGFG++TLVSDVL
Sbjct: 353  QSSWNLYKRMMGEGCLPYTQSCMFLIRLFKRHEKMEMALQLWGEMVEKGFGSYTLVSDVL 412

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FD+LCD+GKL EAE+CFL+M+EKGQ+PS VSF+RIKVLMELA K +++ NL++KM  FG 
Sbjct: 413  FDMLCDMGKLMEAEKCFLEMIEKGQRPSNVSFKRIKVLMELANKHEAIQNLTQKMAIFGR 472

Query: 901  AIQARKTYVDELDKPISEI 957
             +Q      + +  PI E+
Sbjct: 473  PLQVH----ERVATPIGEM 487


>ref|XP_004163923.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Cucumis sativus]
          Length = 495

 Score =  450 bits (1157), Expect = e-124
 Identities = 210/295 (71%), Positives = 252/295 (85%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM DARNVYH LK  F+PNL TFNILL+GWKS EEAE FF EM +MGV+PD+V
Sbjct: 187  RTLCQEKSMMDARNVYHGLKSMFRPNLQTFNILLSGWKSSEEAEGFFDEMIEMGVKPDVV 246

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCLVDV+CK RE+ KA+ ++GKMRDE I  DVITYTS+IGGLGL GQPDKAR +LKEM
Sbjct: 247  SYNCLVDVYCKNREMDKAFKVVGKMRDEDIPADVITYTSIIGGLGLVGQPDKARNILKEM 306

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPDVAAYNA IRNFCIAKRL +A+++++EM+ KGL PNATT+N+  R  FW NDL
Sbjct: 307  KEYGCYPDVAAYNATIRNFCIAKRLHEAFDLLDEMVNKGLSPNATTYNLFFRIFFWSNDL 366

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +S+W+LY RM + GCLPNTQSC+FL+RL K+ EK ++ALELWNDM++KGFG++ LVS+ L
Sbjct: 367  QSAWNLYRRMMDTGCLPNTQSCLFLVRLFKKYEKEEMALELWNDMIQKGFGSYILVSEEL 426

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKM 885
            FDLLCD+GKL EAE CFLQMV+KG KPS  SF+RIKVLMELA K ++L NLS+KM
Sbjct: 427  FDLLCDLGKLIEAESCFLQMVDKGHKPSYTSFKRIKVLMELANKHEALQNLSKKM 481



 Score = 72.8 bits (177), Expect = 2e-10
 Identities = 60/268 (22%), Positives = 117/268 (43%), Gaps = 2/268 (0%)
 Frame = +1

Query: 139 FGEMKDMGVEPDIVSYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGL 318
           F + K    E D+  +N L+   C+ + +  A N+   ++      ++ T+  L+ G   
Sbjct: 167 FRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHGLKSM-FRPNLQTFNILLSGWKS 225

Query: 319 AGQPDKAREVLKEMREYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATT 498
           +   ++A     EM E G  PDV +YN ++  +C  + +  A+ V+ +M  + +  +  T
Sbjct: 226 S---EEAEGFFDEMIEMGVKPDVVSYNCLVDVYCKNREMDKAFKVVGKMRDEDIPADVIT 282

Query: 499 FNVILRSLFWMNDLKSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMV 678
           +  I+  L  +     + ++   MKE GC P+  +    IR     +++  A +L ++MV
Sbjct: 283 YTSIIGGLGLVGQPDKARNILKEMKEYGCYPDVAAYNATIRNFCIAKRLHEAFDLLDEMV 342

Query: 679 EKGFGAFTLVSDVLFDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDD 858
            KG        ++ F +      L+ A   + +M++ G  P+  S   +  L +   K++
Sbjct: 343 NKGLSPNATTYNLFFRIFFWSNDLQSAWNLYRRMMDTGCLPNTQSCLFLVRLFKKYEKEE 402

Query: 859 SLANLSEKM--TAFGSAIQARKTYVDEL 936
               L   M    FGS I   +   D L
Sbjct: 403 MALELWNDMIQKGFGSYILVSEELFDLL 430


>ref|XP_004150840.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Cucumis sativus]
          Length = 495

 Score =  450 bits (1157), Expect = e-124
 Identities = 210/295 (71%), Positives = 252/295 (85%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM DARNVYH LK  F+PNL TFNILL+GWKS EEAE FF EM +MGV+PD+V
Sbjct: 187  RTLCQEKSMMDARNVYHGLKSMFRPNLQTFNILLSGWKSSEEAEGFFDEMIEMGVKPDVV 246

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCLVDV+CK RE+ KA+ ++GKMRDE I  DVITYTS+IGGLGL GQPDKAR +LKEM
Sbjct: 247  SYNCLVDVYCKNREMDKAFKVVGKMRDEDIPADVITYTSIIGGLGLVGQPDKARNILKEM 306

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPDVAAYNA IRNFCIAKRL +A+++++EM+ KGL PNATT+N+  R  FW NDL
Sbjct: 307  KEYGCYPDVAAYNATIRNFCIAKRLHEAFDLLDEMVNKGLSPNATTYNLFFRIFFWSNDL 366

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
            +S+W+LY RM + GCLPNTQSC+FL+RL K+ EK ++ALELWNDM++KGFG++ LVS+ L
Sbjct: 367  QSAWNLYRRMMDTGCLPNTQSCLFLVRLFKKYEKEEMALELWNDMIQKGFGSYILVSEEL 426

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKM 885
            FDLLCD+GKL EAE CFLQMV+KG KPS  SF+RIKVLMELA K ++L NLS+KM
Sbjct: 427  FDLLCDLGKLIEAESCFLQMVDKGHKPSYTSFKRIKVLMELANKHEALQNLSKKM 481



 Score = 72.8 bits (177), Expect = 2e-10
 Identities = 60/268 (22%), Positives = 117/268 (43%), Gaps = 2/268 (0%)
 Frame = +1

Query: 139 FGEMKDMGVEPDIVSYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGL 318
           F + K    E D+  +N L+   C+ + +  A N+   ++      ++ T+  L+ G   
Sbjct: 167 FRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHGLKSM-FRPNLQTFNILLSGWKS 225

Query: 319 AGQPDKAREVLKEMREYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATT 498
           +   ++A     EM E G  PDV +YN ++  +C  + +  A+ V+ +M  + +  +  T
Sbjct: 226 S---EEAEGFFDEMIEMGVKPDVVSYNCLVDVYCKNREMDKAFKVVGKMRDEDIPADVIT 282

Query: 499 FNVILRSLFWMNDLKSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMV 678
           +  I+  L  +     + ++   MKE GC P+  +    IR     +++  A +L ++MV
Sbjct: 283 YTSIIGGLGLVGQPDKARNILKEMKEYGCYPDVAAYNATIRNFCIAKRLHEAFDLLDEMV 342

Query: 679 EKGFGAFTLVSDVLFDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDD 858
            KG        ++ F +      L+ A   + +M++ G  P+  S   +  L +   K++
Sbjct: 343 NKGLSPNATTYNLFFRIFFWSNDLQSAWNLYRRMMDTGCLPNTQSCLFLVRLFKKYEKEE 402

Query: 859 SLANLSEKM--TAFGSAIQARKTYVDEL 936
               L   M    FGS I   +   D L
Sbjct: 403 MALELWNDMIQKGFGSYILVSEELFDLL 430


>ref|XP_006841601.1| hypothetical protein AMTR_s00003p00209480 [Amborella trichopoda]
            gi|548843622|gb|ERN03276.1| hypothetical protein
            AMTR_s00003p00209480 [Amborella trichopoda]
          Length = 436

 Score =  436 bits (1122), Expect = e-120
 Identities = 203/306 (66%), Positives = 254/306 (83%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM DARNVYHSLK  F+PN+ TFNILL+GWK+ +EAE+FFGEM ++G +PD+V
Sbjct: 131  RTLCQEKSMGDARNVYHSLKRSFRPNILTFNILLSGWKTPQEAESFFGEMIELGCKPDLV 190

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            SYNCLVD  CKGREL KA  ++  MR++ I  DV+TYTS+IGGLGL GQPDKA EVLKEM
Sbjct: 191  SYNCLVDALCKGRELDKALKIVQMMREKEIYPDVMTYTSIIGGLGLMGQPDKACEVLKEM 250

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            RE+GCYPD  AYNA IRNFCIA+RLGDAY  M+EM++KGL PN TT+N+  R  +  NDL
Sbjct: 251  REHGCYPDTPAYNAAIRNFCIARRLGDAYRSMDEMVMKGLSPNPTTYNLFFRCFYQSNDL 310

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
             S+WS+Y RM E GCLPNTQ+CMFLI+L KRQEK++++L LWNDMVE+GFG++TLVSD+L
Sbjct: 311  SSAWSMYQRMMETGCLPNTQTCMFLIKLFKRQEKLEMSLRLWNDMVERGFGSYTLVSDIL 370

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
            FD+LCD+GKL E E+CFLQM++KGQKPS  +F+RIKVLMELA + +++  LS+KM AF  
Sbjct: 371  FDMLCDLGKLIEVEKCFLQMIDKGQKPSNAAFKRIKVLMELANRQEAITYLSKKMEAFAF 430

Query: 901  AIQARK 918
            + Q ++
Sbjct: 431  SPQLKE 436



 Score = 58.5 bits (140), Expect = 5e-06
 Identities = 52/216 (24%), Positives = 99/216 (45%), Gaps = 4/216 (1%)
 Frame = +1

Query: 181 SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
           SY+ ++ +  + R+ +K + L+ +MR +  D  +IT  +L   L    +    R+ ++  
Sbjct: 53  SYDTMLYILGRERKFEKVWGLLREMRIK--DQSLITPRTLQIVLARIAKACSVRQTVESF 110

Query: 361 REYGCYP----DVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFW 528
                Y        ++NA++R  C  K +GDA NV   +  +   PN  TFN++L    W
Sbjct: 111 TRILRYSKHLDSTDSFNALLRTLCQEKSMGDARNVYHSLK-RSFRPNILTFNILLSG--W 167

Query: 529 MNDLKSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLV 708
               ++  S +  M E+GC P+  S   L+  + +  ++D AL++   M EK      + 
Sbjct: 168 KTPQEAE-SFFGEMIELGCKPDLVSYNCLVDALCKGRELDKALKIVQMMREKEIYPDVMT 226

Query: 709 SDVLFDLLCDVGKLEEAERCFLQMVEKGQKPSQVSF 816
              +   L  +G+ ++A     +M E G  P   ++
Sbjct: 227 YTSIIGGLGLMGQPDKACEVLKEMREHGCYPDTPAY 262


>ref|XP_002889402.1| hypothetical protein ARALYDRAFT_470202 [Arabidopsis lyrata subsp.
            lyrata] gi|297335244|gb|EFH65661.1| hypothetical protein
            ARALYDRAFT_470202 [Arabidopsis lyrata subsp. lyrata]
          Length = 490

 Score =  436 bits (1121), Expect = e-120
 Identities = 209/302 (69%), Positives = 251/302 (83%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            RTL QEKSM+DARNVYHSLKH+F+P+L TFNILL+GWKS EEAEAFF EMK  G++PD+V
Sbjct: 187  RTLCQEKSMTDARNVYHSLKHQFQPDLQTFNILLSGWKSSEEAEAFFEEMKGKGLKPDVV 246

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YN L+DV+CK RE++KAY LI KMR+E    DVITYT++IGGLGL GQPDKAREVLKEM
Sbjct: 247  TYNSLIDVYCKDREIEKAYKLIDKMREEDETPDVITYTTIIGGLGLIGQPDKAREVLKEM 306

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGCYPDVAAYNA IRN+CIA+RLGDA  +++EM+ KGL PNATT+N+  R L   NDL
Sbjct: 307  KEYGCYPDVAAYNAAIRNYCIARRLGDADKLVDEMVKKGLSPNATTYNLFFRVLSLANDL 366

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
              SW LY RM   GCLPNTQSCMFLI++ KR EKVD+A+ LW DMV KGFG+++LVSDVL
Sbjct: 367  GRSWELYERMLGNGCLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSYSLVSDVL 426

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFGS 900
             DLLCD+ K+EEAE+C L+MVEKG +PS VSF+RIK+LMELA K D + NL +KM  FG+
Sbjct: 427  LDLLCDLAKVEEAEKCLLEMVEKGHRPSNVSFKRIKLLMELANKHDEVNNLIQKMAIFGT 486

Query: 901  AI 906
             I
Sbjct: 487  EI 488


>gb|EPS57811.1| hypothetical protein M569_17007, partial [Genlisea aurea]
          Length = 449

 Score =  432 bits (1112), Expect = e-119
 Identities = 205/298 (68%), Positives = 250/298 (83%)
 Frame = +1

Query: 1    RTLSQEKSMSDARNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIV 180
            R +SQEKSM+DAR VY   K EF+PNL TFNILL+GWKS  +AEAF  EMKD+G++PD+V
Sbjct: 152  RAVSQEKSMADARRVYRDTKREFRPNLQTFNILLSGWKSAVDAEAFLAEMKDVGIDPDVV 211

Query: 181  SYNCLVDVFCKGRELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEM 360
            +YNCLVDV CKGR++ +AY ++ +MR+ GI+ DV+TYTSLIGGLGL GQPDKAR VL+EM
Sbjct: 212  TYNCLVDVHCKGRDVGRAYGIVEEMREGGIEPDVVTYTSLIGGLGLVGQPDKARGVLEEM 271

Query: 361  REYGCYPDVAAYNAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDL 540
            +EYGC PD AAYNA IRNFCIA+RL +AY ++ EM   G  PNATT+NVILRSL+W NDL
Sbjct: 272  KEYGCRPDAAAYNAAIRNFCIARRLKEAYALVAEMEGDGTGPNATTYNVILRSLYWANDL 331

Query: 541  KSSWSLYLRMKEIGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVL 720
              SW LY RM+  GCLPNTQSCMFLIRLM+RQEK  LA+ELW DMVE GFG++TLVSDVL
Sbjct: 332  CGSWELYRRMRATGCLPNTQSCMFLIRLMRRQEKASLAVELWEDMVELGFGSYTLVSDVL 391

Query: 721  FDLLCDVGKLEEAERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAF 894
              LLCD+G +++AERCFLQMV+KGQ+PS+VSFRRIKVLMEL+ + ++L+NL EKM +F
Sbjct: 392  IGLLCDLGMVDDAERCFLQMVDKGQRPSRVSFRRIKVLMELSGRFEALSNLREKMASF 449



 Score = 69.3 bits (168), Expect = 3e-09
 Identities = 66/287 (22%), Positives = 114/287 (39%)
 Frame = +1

Query: 37  RNVYHSLKHEFKPNLHTFNILLAGWKSCEEAEAFFGEMKDMGVEPDIVSYNCLVDVFCKG 216
           RN++H  +      +      +A   S  +  + F + K +  E D+  YN L+    + 
Sbjct: 98  RNIHHREQLITPRTIQIVVARIAKVSSVSQTVSSFNKFKKLVPELDVSCYNALLRAVSQE 157

Query: 217 RELQKAYNLIGKMRDEGIDLDVITYTSLIGGLGLAGQPDKAREVLKEMREYGCYPDVAAY 396
           + +  A  +    + E    ++ T+  L+ G   A     A   L EM++ G  PDV  Y
Sbjct: 158 KSMADARRVYRDTKRE-FRPNLQTFNILLSGWKSAVD---AEAFLAEMKDVGIDPDVVTY 213

Query: 397 NAVIRNFCIAKRLGDAYNVMEEMMVKGLIPNATTFNVILRSLFWMNDLKSSWSLYLRMKE 576
           N ++   C  + +G AY ++EEM   G+ P+  T+  ++  L  +     +  +   MKE
Sbjct: 214 NCLVDVHCKGRDVGRAYGIVEEMREGGIEPDVVTYTSLIGGLGLVGQPDKARGVLEEMKE 273

Query: 577 IGCLPNTQSCMFLIRLMKRQEKVDLALELWNDMVEKGFGAFTLVSDVLFDLLCDVGKLEE 756
            GC P+  +    IR      ++  A  L  +M   G G      +V+   L     L  
Sbjct: 274 YGCRPDAAAYNAAIRNFCIARRLKEAYALVAEMEGDGTGPNATTYNVILRSLYWANDLCG 333

Query: 757 AERCFLQMVEKGQKPSQVSFRRIKVLMELARKDDSLANLSEKMTAFG 897
           +   + +M   G  P+  S   +  LM    K      L E M   G
Sbjct: 334 SWELYRRMRATGCLPNTQSCMFLIRLMRRQEKASLAVELWEDMVELG 380

