BLASTX nr result

ID: Rehmannia24_contig00012551 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00012551
         (1301 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004241392.1| PREDICTED: uncharacterized protein LOC101247...   400   e-109
gb|EOY04957.1| Pentatricopeptide repeat (PPR) superfamily protei...   390   e-106
ref|XP_002302882.1| hypothetical protein POPTR_0002s23140g [Popu...   390   e-106
emb|CBI39163.3| unnamed protein product [Vitis vinifera]              389   e-105
ref|XP_002268699.1| PREDICTED: uncharacterized protein LOC100253...   389   e-105
ref|XP_006478864.1| PREDICTED: uncharacterized protein At3g49140...   389   e-105
ref|XP_006478863.1| PREDICTED: uncharacterized protein At3g49140...   389   e-105
ref|XP_006347461.1| PREDICTED: uncharacterized protein At3g49140...   387   e-105
ref|XP_002530542.1| conserved hypothetical protein [Ricinus comm...   386   e-104
gb|EXB68700.1| hypothetical protein L484_024718 [Morus notabilis]     383   e-104
gb|EOY04956.1| Pentatricopeptide repeat superfamily protein isof...   379   e-102
gb|EMJ24237.1| hypothetical protein PRUPE_ppa005374mg [Prunus pe...   379   e-102
ref|XP_004296717.1| PREDICTED: uncharacterized protein At3g49140...   372   e-100
ref|XP_004152092.1| PREDICTED: uncharacterized protein At3g49140...   362   2e-97
ref|XP_004172166.1| PREDICTED: uncharacterized protein At3g49140...   352   3e-94
ref|XP_006402701.1| hypothetical protein EUTSA_v10005960mg [Eutr...   350   1e-93
ref|XP_002876496.1| hypothetical protein ARALYDRAFT_486397 [Arab...   348   3e-93
ref|NP_567080.1| pentatricopeptide repeat-containing protein-lik...   345   3e-92
emb|CAB91600.1| putative protein [Arabidopsis thaliana]               345   3e-92
ref|XP_006291096.1| hypothetical protein CARUB_v10017210mg [Caps...   343   1e-91

>ref|XP_004241392.1| PREDICTED: uncharacterized protein LOC101247332 [Solanum
            lycopersicum]
          Length = 467

 Score =  400 bits (1028), Expect = e-109
 Identities = 203/335 (60%), Positives = 247/335 (73%), Gaps = 1/335 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            ANN ALLIFPG VHCEPH Q+SWAEFQYVID++GDI+FE+YD +NIL++  ASN V ALI
Sbjct: 132  ANNNALLIFPGTVHCEPHEQVSWAEFQYVIDEYGDIFFEIYDDKNILRNRDASNSVNALI 191

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GM+ S YE R+++                F DDY + E     D  VDWGMP+SS+  HP
Sbjct: 192  GMEFSQYEKRRVESPDDINLAGDSVDDSNFFDDYFEGESSEMYDYQVDWGMPDSSSPLHP 251

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFN-DEESDGYTSDD 539
            VYFAK +TKA+ ++H KMMDHPSNG+ +WG L+P FL+EE Y+RRLF+ DE SDG T D 
Sbjct: 252  VYFAKCLTKAVHMKHAKMMDHPSNGISIWGRLKPAFLEEEYYVRRLFSGDEVSDGSTLDW 311

Query: 540  KDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
            KDG+I+S SSR D S + S+IYRLEI +++LF            DF  AEPD LV+S P+
Sbjct: 312  KDGEILSFSSRYDKSRTLSSIYRLEIMRVDLFSVYGAQLAVNLYDFHDAEPDSLVYSAPA 371

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            ILE F  +G RC  ALKALC+KKGLHVE ANLIGVDSLG+DVRV SGTEV THRF FKVR
Sbjct: 372  ILEWFRQQGIRCKYALKALCRKKGLHVERANLIGVDSLGMDVRVLSGTEVWTHRFPFKVR 431

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRTLDRPREMDS 1004
            A+SE AA+KQI+QLL+PRSRRKK RT +R  ++DS
Sbjct: 432  AHSEIAAEKQIRQLLFPRSRRKKFRTAERSGDLDS 466


>gb|EOY04957.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao]
          Length = 467

 Score =  390 bits (1003), Expect = e-106
 Identities = 199/327 (60%), Positives = 239/327 (73%), Gaps = 2/327 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ ALL+FPG VH EPH QISWAEF YVIDD+GDI+FE++D +NILQD GASN V ALI
Sbjct: 131  ANSTALLVFPGTVHSEPHEQISWAEFHYVIDDYGDIFFEIFDDENILQDRGASNLVNALI 190

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESST--WT 356
            GMDI  +E+ ++               + F DDY ++ D   ++  VDWGMP+++T  W 
Sbjct: 191  GMDIPMHENNRV-AGEYNISDIGNDDEIPFDDDYFEVMDSEMSEAPVDWGMPDTATATWV 249

Query: 357  HPVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSD 536
            HP+YFAK +TKA+ +EH + MDHPSNGV + G LRP F DEE YLRRLF+ E++DGYTSD
Sbjct: 250  HPIYFAKCLTKAVHMEHDRKMDHPSNGVSIVGCLRPAFYDEESYLRRLFHFEDNDGYTSD 309

Query: 537  DKDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIP 716
             KDG+    SS+  GS S ST+YR+EI ++ELF            DFQ AEPDVLVHS  
Sbjct: 310  WKDGETSRSSSKYGGSKSDSTLYRMEIMRMELFSIYGVQSLISLQDFQDAEPDVLVHSTS 369

Query: 717  SILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKV 896
            +ILERF   G RCNVALKALCKKKGL +EGANLIGVDSLGIDVR+ SG EVRTHRF FKV
Sbjct: 370  AILERFSQNGIRCNVALKALCKKKGLQIEGANLIGVDSLGIDVRIFSGVEVRTHRFPFKV 429

Query: 897  RANSECAADKQIQQLLYPRSRRKKLRT 977
            RA SE AA+KQI +LL+PRS RKK RT
Sbjct: 430  RAMSETAAEKQILKLLFPRSHRKKFRT 456


>ref|XP_002302882.1| hypothetical protein POPTR_0002s23140g [Populus trichocarpa]
            gi|222844608|gb|EEE82155.1| hypothetical protein
            POPTR_0002s23140g [Populus trichocarpa]
          Length = 469

 Score =  390 bits (1003), Expect = e-106
 Identities = 193/325 (59%), Positives = 238/325 (73%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN  ALL+FPG VHCEPH QISWAEFQY+IDD+GDI+FE++D  NILQD GASNPV  LI
Sbjct: 135  ANTSALLVFPGSVHCEPHGQISWAEFQYIIDDYGDIFFEIFDNSNILQDRGASNPVNVLI 194

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMDI  YE++K+ +             + F +DY ++ D   +++ VDWGMP +S+  HP
Sbjct: 195  GMDIPMYENKKV-VNEYNIFNVGSEDDIPFDEDYFEVMDSEDSEVPVDWGMPYTSSLVHP 253

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK MTKAI++E+ + MDHPSNGV + G LRP F DEE+YLR  F+  +SDGY SD K
Sbjct: 254  IYFAKCMTKAINMEYYRKMDHPSNGVSIVGCLRPAFSDEELYLRTSFHCGDSDGYNSDRK 313

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            D +I+S +S+ D S S ST++ LEI +IELF            DFQ AEPDVL HS P+I
Sbjct: 314  DTEILSFNSKSDVSSSGSTLHCLEIMRIELFSLYGSQSAVSLQDFQEAEPDVLAHSTPAI 373

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            LE F  +G+RCN+ALKALCKKKGLHVE ANLIGVDSLG+DVR+ SG E RTHRF FKVRA
Sbjct: 374  LEHFSEKGSRCNIALKALCKKKGLHVERANLIGVDSLGMDVRIFSGVEARTHRFPFKVRA 433

Query: 903  NSECAADKQIQQLLYPRSRRKKLRT 977
              + AA KQI QLL+PR+RRKK +T
Sbjct: 434  TCKTAAQKQIHQLLFPRARRKKFKT 458


>emb|CBI39163.3| unnamed protein product [Vitis vinifera]
          Length = 470

 Score =  389 bits (999), Expect = e-105
 Identities = 196/324 (60%), Positives = 240/324 (74%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN  ALL+ P +VH EPH  ISWAEFQY+IDDFGDI+F+++D QNILQD GASNPV ALI
Sbjct: 142  ANGSALLLLPRIVHSEPHDHISWAEFQYIIDDFGDIFFQIFDDQNILQDPGASNPVNALI 201

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMD+S Y++R++               +   DDY ++ED   +DI VDWG+P++S+  HP
Sbjct: 202  GMDLSLYKNRRV-AGEYNISESGSTDDISLDDDYFEVEDSEMSDIPVDWGIPDTSSLVHP 260

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK +TKA+++E+ K MDHPSNG+ + G LRP F+DEE YLRRLF+ E+SDGYTSD K
Sbjct: 261  IYFAKCLTKAVNMEYNKEMDHPSNGISMVGCLRPAFIDEEPYLRRLFSCEDSDGYTSDWK 320

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            D +I   SS+ DG   RST YRLEI +IELF            DFQ AEPDVLVHS  +I
Sbjct: 321  DEEITGFSSKGDGHNPRSTFYRLEIMRIELFSVYGIQALISLQDFQDAEPDVLVHSTKAI 380

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            +E F   GT  NVALKALCKKKG HVEGANLIGVDSLG+DVRV +G E++THRFSFKVRA
Sbjct: 381  VEHFTENGTWFNVALKALCKKKGFHVEGANLIGVDSLGMDVRVFTGVEIQTHRFSFKVRA 440

Query: 903  NSECAADKQIQQLLYPRSRRKKLR 974
             S  AA+KQIQQLL+P SRRKK++
Sbjct: 441  TSAAAAEKQIQQLLFPPSRRKKVQ 464


>ref|XP_002268699.1| PREDICTED: uncharacterized protein LOC100253226 [Vitis vinifera]
          Length = 518

 Score =  389 bits (999), Expect = e-105
 Identities = 196/324 (60%), Positives = 240/324 (74%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN  ALL+ P +VH EPH  ISWAEFQY+IDDFGDI+F+++D QNILQD GASNPV ALI
Sbjct: 190  ANGSALLLLPRIVHSEPHDHISWAEFQYIIDDFGDIFFQIFDDQNILQDPGASNPVNALI 249

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMD+S Y++R++               +   DDY ++ED   +DI VDWG+P++S+  HP
Sbjct: 250  GMDLSLYKNRRV-AGEYNISESGSTDDISLDDDYFEVEDSEMSDIPVDWGIPDTSSLVHP 308

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK +TKA+++E+ K MDHPSNG+ + G LRP F+DEE YLRRLF+ E+SDGYTSD K
Sbjct: 309  IYFAKCLTKAVNMEYNKEMDHPSNGISMVGCLRPAFIDEEPYLRRLFSCEDSDGYTSDWK 368

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            D +I   SS+ DG   RST YRLEI +IELF            DFQ AEPDVLVHS  +I
Sbjct: 369  DEEITGFSSKGDGHNPRSTFYRLEIMRIELFSVYGIQALISLQDFQDAEPDVLVHSTKAI 428

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            +E F   GT  NVALKALCKKKG HVEGANLIGVDSLG+DVRV +G E++THRFSFKVRA
Sbjct: 429  VEHFTENGTWFNVALKALCKKKGFHVEGANLIGVDSLGMDVRVFTGVEIQTHRFSFKVRA 488

Query: 903  NSECAADKQIQQLLYPRSRRKKLR 974
             S  AA+KQIQQLL+P SRRKK++
Sbjct: 489  TSAAAAEKQIQQLLFPPSRRKKVQ 512


>ref|XP_006478864.1| PREDICTED: uncharacterized protein At3g49140-like isoform X2 [Citrus
            sinensis]
          Length = 458

 Score =  389 bits (998), Expect = e-105
 Identities = 195/335 (58%), Positives = 249/335 (74%), Gaps = 2/335 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            ANN +LL+FPG VHCEPH QISWAEFQYVIDD+GDI+FE++D +NIL D GA+N VTA I
Sbjct: 122  ANNSSLLVFPGTVHCEPHEQISWAEFQYVIDDYGDIFFEIFDDENILHDPGANNLVTAFI 181

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVD-DYSDIEDPSKADIWVDWGMPESSTWTH 359
            GMDI  Y+++++                I +D DY ++ D   +D  VDWGMP++S+W H
Sbjct: 182  GMDIPKYDNQRVAAGAGYNYSDFVTSDDIPLDEDYFEVVDSEVSDGSVDWGMPDTSSWVH 241

Query: 360  PVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDD 539
            P+YF+K +TKA+++E+ + MDHPSNG+ + G+LRP F DEE YLRR F+ E+SDG  SD 
Sbjct: 242  PIYFSKCLTKAVNMEYDRKMDHPSNGLSIVGYLRPAFADEESYLRRQFHSEDSDGDNSDW 301

Query: 540  KDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
            +DG+  + SS++  S + ST+YRLEI +IELF            DFQ AEPD+LVHS  +
Sbjct: 302  QDGETPNFSSKNGRSNTGSTLYRLEIMRIELFSVYGIRSPVSLQDFQDAEPDILVHSTSA 361

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            I+E F ++G RCN ALKALCKKKGL+VE ANLIGVDSLG+DVRV SG EVRTHRF FK+R
Sbjct: 362  IIEHFSLKGIRCNGALKALCKKKGLNVEEANLIGVDSLGMDVRVFSGVEVRTHRFPFKIR 421

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRT-LDRPREMD 1001
            A SE AA+KQIQQLL+PRSRRKKLR+  D  +E+D
Sbjct: 422  ATSEVAAEKQIQQLLFPRSRRKKLRSQRDVLKELD 456


>ref|XP_006478863.1| PREDICTED: uncharacterized protein At3g49140-like isoform X1 [Citrus
            sinensis]
          Length = 468

 Score =  389 bits (998), Expect = e-105
 Identities = 195/335 (58%), Positives = 249/335 (74%), Gaps = 2/335 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            ANN +LL+FPG VHCEPH QISWAEFQYVIDD+GDI+FE++D +NIL D GA+N VTA I
Sbjct: 132  ANNSSLLVFPGTVHCEPHEQISWAEFQYVIDDYGDIFFEIFDDENILHDPGANNLVTAFI 191

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVD-DYSDIEDPSKADIWVDWGMPESSTWTH 359
            GMDI  Y+++++                I +D DY ++ D   +D  VDWGMP++S+W H
Sbjct: 192  GMDIPKYDNQRVAAGAGYNYSDFVTSDDIPLDEDYFEVVDSEVSDGSVDWGMPDTSSWVH 251

Query: 360  PVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDD 539
            P+YF+K +TKA+++E+ + MDHPSNG+ + G+LRP F DEE YLRR F+ E+SDG  SD 
Sbjct: 252  PIYFSKCLTKAVNMEYDRKMDHPSNGLSIVGYLRPAFADEESYLRRQFHSEDSDGDNSDW 311

Query: 540  KDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
            +DG+  + SS++  S + ST+YRLEI +IELF            DFQ AEPD+LVHS  +
Sbjct: 312  QDGETPNFSSKNGRSNTGSTLYRLEIMRIELFSVYGIRSPVSLQDFQDAEPDILVHSTSA 371

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            I+E F ++G RCN ALKALCKKKGL+VE ANLIGVDSLG+DVRV SG EVRTHRF FK+R
Sbjct: 372  IIEHFSLKGIRCNGALKALCKKKGLNVEEANLIGVDSLGMDVRVFSGVEVRTHRFPFKIR 431

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRT-LDRPREMD 1001
            A SE AA+KQIQQLL+PRSRRKKLR+  D  +E+D
Sbjct: 432  ATSEVAAEKQIQQLLFPRSRRKKLRSQRDVLKELD 466


>ref|XP_006347461.1| PREDICTED: uncharacterized protein At3g49140-like [Solanum tuberosum]
          Length = 485

 Score =  387 bits (993), Expect = e-105
 Identities = 202/355 (56%), Positives = 246/355 (69%), Gaps = 21/355 (5%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ ALLIFPG VHCEPH Q+SWAEFQYVID++GDI+ E+YD +NIL++  ASN V ALI
Sbjct: 132  ANSNALLIFPGTVHCEPHEQVSWAEFQYVIDEYGDIFLEIYDDKNILRNRDASNSVNALI 191

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDI--------------------EDP 302
            GMD S YE R+ +                F DDY ++                    E+ 
Sbjct: 192  GMDFSQYEKRRAESPDDINLAGDSIDDSNFFDDYFEVSKQPLQFHYIIRPISLILQGENS 251

Query: 303  SKADIWVDWGMPESSTWTHPVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEE 482
               D  VDWGMP+SS+  HPVYFAK +TKA+ ++H KMMDHPSNG+ +WG L+P FL+EE
Sbjct: 252  EMYDYQVDWGMPDSSSTLHPVYFAKCLTKAVHMKHAKMMDHPSNGISIWGRLKPAFLEEE 311

Query: 483  IYLRRLFN-DEESDGYTSDDKDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXX 659
             Y+RRLF+ DE SDG T D KDG+I+S SSR D SC+ S+IYRLEI +++LF        
Sbjct: 312  YYVRRLFSGDEVSDGSTLDWKDGEILSFSSRYDKSCTLSSIYRLEIIRVDLFSVYGAQLA 371

Query: 660  XXXXDFQYAEPDVLVHSIPSILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGI 839
                DF  AEPD LVHS P+ILE F  +G RC  ALKALC+KKGLHVE ANLIGVDSLG+
Sbjct: 372  VNLYDFHDAEPDSLVHSAPAILEWFRQQGIRCKYALKALCRKKGLHVERANLIGVDSLGM 431

Query: 840  DVRVSSGTEVRTHRFSFKVRANSECAADKQIQQLLYPRSRRKKLRTLDRPREMDS 1004
            DVRV SGTEV THRF FKVRA+SE AA+KQI+QLL+PRSRRKK    +R  ++DS
Sbjct: 432  DVRVLSGTEVWTHRFPFKVRAHSEIAAEKQIRQLLFPRSRRKKF--TERAGDLDS 484


>ref|XP_002530542.1| conserved hypothetical protein [Ricinus communis]
            gi|223529904|gb|EEF31833.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 409

 Score =  386 bits (991), Expect = e-104
 Identities = 198/335 (59%), Positives = 240/335 (71%), Gaps = 1/335 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ ALL+FPG VHCEPH QISWAEFQYV+DD+GDI+FE++D  +ILQD GA+NP+ A I
Sbjct: 78   ANSSALLVFPGTVHCEPHEQISWAEFQYVVDDYGDIFFEIFDDISILQDPGATNPMNAFI 137

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMDI  YE++++               + F DDY ++ D   +D+ VDWGMP++STW HP
Sbjct: 138  GMDIPMYENKRI-ANEYNVFDIGSTDDIPFDDDYFEVMDSEVSDVPVDWGMPDTSTWVHP 196

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK +TKA D+E  + MDHPSNGV + G LRP F DEE YLRRLF+ ++SD Y SD  
Sbjct: 197  IYFAKCLTKATDMECDRKMDHPSNGVSILGCLRPAFADEESYLRRLFHCQDSDNYNSDWT 256

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            D +I+S SS+ DGS   ST+YRLEI +IELF             FQ AEPDVLVHS  +I
Sbjct: 257  DVEILSFSSKGDGSSRGSTLYRLEIMRIELFSVYGAQACTY---FQDAEPDVLVHSTSAI 313

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            L+ F   G RCN ALKALCKKKGLHVEGANLIG+DSLGIDVR  SG EV+T RF FKVRA
Sbjct: 314  LDHFSNNGIRCNAALKALCKKKGLHVEGANLIGIDSLGIDVRTFSGVEVQTQRFPFKVRA 373

Query: 903  NSECAADKQIQQLLYPRSRRKKLRTL-DRPREMDS 1004
              E AA+KQI QLL+P SRRKK R+  DR R+  S
Sbjct: 374  TCEAAAEKQIHQLLFPPSRRKKFRSHGDRLRDSKS 408


>gb|EXB68700.1| hypothetical protein L484_024718 [Morus notabilis]
          Length = 459

 Score =  383 bits (984), Expect = e-104
 Identities = 193/325 (59%), Positives = 240/325 (73%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            ANN ALLIFPG +HCEPH QISWAEFQYVIDD+GDIYFE+ D  NIL+D  ASNPV ALI
Sbjct: 129  ANNSALLIFPGTIHCEPHEQISWAEFQYVIDDYGDIYFEMLDDANILEDPSASNPVNALI 188

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMD+  YE++++               + F DDY ++ +   ++I  DWGMP +ST  HP
Sbjct: 189  GMDMPMYENKRVAGEYNISDNSGSIDEIPFDDDYFEVVESEVSEIPFDWGMPHASTLIHP 248

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK +TK +++E+ + MDHPSNGV + G LRP F DEE ++RRLF  E+ DGY S+  
Sbjct: 249  IYFAKCLTKVVNMEYDRKMDHPSNGVSILGCLRPAFADEESHIRRLFCYEDGDGYHSEWS 308

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            DG+ +S +SR D   S ST+YRLEI +IELF            DFQ AEPD LVHS  +I
Sbjct: 309  DGETLSSNSRRDRGNSGSTLYRLEILRIELFSSAISLQ-----DFQDAEPDFLVHSTSAI 363

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            +ERF  +G RC+VALKALCKKKGLHVEGA+LIGVDSLG+DVRVS G+EV+THRF FKVRA
Sbjct: 364  VERFSEKGIRCDVALKALCKKKGLHVEGAHLIGVDSLGMDVRVSVGSEVQTHRFPFKVRA 423

Query: 903  NSECAADKQIQQLLYPRSRRKKLRT 977
             SE AA+KQI+QL++PR+RRKKLR+
Sbjct: 424  TSEIAAEKQIRQLMFPRARRKKLRS 448


>gb|EOY04956.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao]
          Length = 486

 Score =  379 bits (973), Expect = e-102
 Identities = 199/346 (57%), Positives = 239/346 (69%), Gaps = 21/346 (6%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ ALL+FPG VH EPH QISWAEF YVIDD+GDI+FE++D +NILQD GASN V ALI
Sbjct: 131  ANSTALLVFPGTVHSEPHEQISWAEFHYVIDDYGDIFFEIFDDENILQDRGASNLVNALI 190

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESST--WT 356
            GMDI  +E+ ++               + F DDY ++ D   ++  VDWGMP+++T  W 
Sbjct: 191  GMDIPMHENNRV-AGEYNISDIGNDDEIPFDDDYFEVMDSEMSEAPVDWGMPDTATATWV 249

Query: 357  HPVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSD 536
            HP+YFAK +TKA+ +EH + MDHPSNGV + G LRP F DEE YLRRLF+ E++DGYTSD
Sbjct: 250  HPIYFAKCLTKAVHMEHDRKMDHPSNGVSIVGCLRPAFYDEESYLRRLFHFEDNDGYTSD 309

Query: 537  DKDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXX--------------- 671
             KDG+    SS+  GS S ST+YR+EI ++ELF                           
Sbjct: 310  WKDGETSRSSSKYGGSKSDSTLYRMEIMRMELFSIYGVQAFLMKRIMEERLSSCFLYLSL 369

Query: 672  ----DFQYAEPDVLVHSIPSILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGI 839
                DFQ AEPDVLVHS  +ILERF   G RCNVALKALCKKKGL +EGANLIGVDSLGI
Sbjct: 370  ISLQDFQDAEPDVLVHSTSAILERFSQNGIRCNVALKALCKKKGLQIEGANLIGVDSLGI 429

Query: 840  DVRVSSGTEVRTHRFSFKVRANSECAADKQIQQLLYPRSRRKKLRT 977
            DVR+ SG EVRTHRF FKVRA SE AA+KQI +LL+PRS RKK RT
Sbjct: 430  DVRIFSGVEVRTHRFPFKVRAMSETAAEKQILKLLFPRSHRKKFRT 475


>gb|EMJ24237.1| hypothetical protein PRUPE_ppa005374mg [Prunus persica]
          Length = 464

 Score =  379 bits (973), Expect = e-102
 Identities = 189/325 (58%), Positives = 242/325 (74%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN  ALL+FPG +HCEPH QISWA+F+YVIDD+GD+YFE++D  N+L+D  ASNPV AL 
Sbjct: 131  ANCSALLVFPGKIHCEPHEQISWADFEYVIDDYGDLYFEIFDDANLLEDPAASNPVNALF 190

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMDI  Y+  ++               + F DDY ++ +   +D+ +DWG+P++S+  HP
Sbjct: 191  GMDIPTYDDGRI-AGEFNILGGGNSDEIPFDDDYLEVVESEVSDV-LDWGLPDTSSSIHP 248

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK +TK I++E+ K MDHPSNGV + G LRP F DEE Y+RRLF+ E+SDGY SD K
Sbjct: 249  IYFAKCLTKVINIEYHKKMDHPSNGVSILGCLRPAFADEEFYVRRLFHYEDSDGYNSDWK 308

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            DG  +S+SS+ D   + ST+YRLEI +IELF            DFQ AEPDVLV++   I
Sbjct: 309  DGKSLSLSSKSDRIKTCSTLYRLEIMRIELFSVYGVQSTISLEDFQDAEPDVLVNATLEI 368

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            ++RF+ RG RC+VALKALCK+KGLHVEGA+LIGVDSLG+DVRV SG EV+THRF FKVRA
Sbjct: 369  VDRFNERGIRCDVALKALCKRKGLHVEGAHLIGVDSLGMDVRVFSGLEVQTHRFPFKVRA 428

Query: 903  NSECAADKQIQQLLYPRSRRKKLRT 977
             SE AA+KQIQQLL+PRSRRKKL++
Sbjct: 429  TSEVAAEKQIQQLLFPRSRRKKLKS 453


>ref|XP_004296717.1| PREDICTED: uncharacterized protein At3g49140-like [Fragaria vesca
            subsp. vesca]
          Length = 463

 Score =  372 bits (956), Expect = e-100
 Identities = 190/325 (58%), Positives = 241/325 (74%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ ALL+FPG +H EPH QISWAEFQYVIDD+GD+YFE++D  NIL+D  ASNPV AL 
Sbjct: 131  ANDNALLVFPGKIHSEPHEQISWAEFQYVIDDYGDLYFELFDDANILEDPTASNPVNALF 190

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMDI  + + ++               + F DDY ++ +P   D+ +DW +P++ST  HP
Sbjct: 191  GMDIPAHNNGRIT-GGFSILDDYNSDDMPFDDDYLEVVEPEAFDV-LDWEIPDASTVIHP 248

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK +TKAI++ H + MDHPSNGV + G L P F DEE Y+RRLF+ E+SD Y SD+K
Sbjct: 249  IYFAKCLTKAINIRHDRKMDHPSNGVSILGCLIPAFADEEFYVRRLFHHEDSD-YDSDEK 307

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            DG  +SISS+ D S +RST+YRLEI +IELF            DFQ AEPD L++SI  I
Sbjct: 308  DGKGVSISSKSDRSKTRSTLYRLEIMRIELFSVYGVQSAISLQDFQDAEPDFLINSISDI 367

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            +ERF+ RG RC+VALKALCK+KGL VEGA+LIGVDSLG+DVRV SG+EV+THRF F+VRA
Sbjct: 368  VERFNERGIRCDVALKALCKRKGLQVEGAHLIGVDSLGMDVRVFSGSEVQTHRFPFRVRA 427

Query: 903  NSECAADKQIQQLLYPRSRRKKLRT 977
             SE  A+KQI+QLL+PRSRRKKLR+
Sbjct: 428  KSELVAEKQIEQLLFPRSRRKKLRS 452


>ref|XP_004152092.1| PREDICTED: uncharacterized protein At3g49140-like [Cucumis sativus]
          Length = 446

 Score =  362 bits (928), Expect = 2e-97
 Identities = 183/326 (56%), Positives = 233/326 (71%), Gaps = 2/326 (0%)
 Frame = +3

Query: 6    NNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALIG 185
            N+ ALL+FPG VH EPH Q+SW EFQYV DD+GD+YFE++D  N+L+D  A NPV ALIG
Sbjct: 111  NSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIG 170

Query: 186  MDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHPV 365
            MD+  YESR++ +               F  DY ++ +   A+I VDWG+P+ S+  HPV
Sbjct: 171  MDMQMYESRRI-VGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPV 229

Query: 366  YFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK- 542
            YFAK + K I++E+ + M HPSNGV + G LRP + DEE Y+RRLF  EES+GY ++ K 
Sbjct: 230  YFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKG 289

Query: 543  -DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
             +G+  ++ S+ D S  RST+YRLEI +IELF            DFQ AEPD+L+HS   
Sbjct: 290  LEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAE 349

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            ILERF+ +G +CN+ALKALCKK+GLHVE A LIGVDSLG+DVRV  GTEVRT RF FK+R
Sbjct: 350  ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIR 409

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRT 977
            A SE AA+KQIQQLL+PRSRRKKLR+
Sbjct: 410  ATSEAAAEKQIQQLLFPRSRRKKLRS 435


>ref|XP_004172166.1| PREDICTED: uncharacterized protein At3g49140-like [Cucumis sativus]
          Length = 437

 Score =  352 bits (902), Expect = 3e-94
 Identities = 180/324 (55%), Positives = 227/324 (70%)
 Frame = +3

Query: 6    NNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALIG 185
            N+ ALL+FPG VH EPH Q+SW EFQYV DD+GD+YFE++D  N+L+D  A NPV ALIG
Sbjct: 111  NSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIG 170

Query: 186  MDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHPV 365
            MD+  YESR++ +               F  DY ++ +   A+I VDWG+P+ S+  HPV
Sbjct: 171  MDMQMYESRRI-VGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPV 229

Query: 366  YFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDKD 545
            YFAK + K I++E+ + M HPSNGV + G LRP + DEE Y+RRLF  EES        +
Sbjct: 230  YFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEES-------LE 282

Query: 546  GDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSIL 725
            G+  ++ S+ D S  RST+YRLEI +IELF            DFQ AEPD+L+HS   IL
Sbjct: 283  GETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEIL 342

Query: 726  ERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRAN 905
            ERF+ +G +CN+ALKALCKK+GLHVE A LIGVDSLG+DVRV  GTEVRT RF FK+RA 
Sbjct: 343  ERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT 402

Query: 906  SECAADKQIQQLLYPRSRRKKLRT 977
            SE AA+KQIQQLL+PRSRRKKLR+
Sbjct: 403  SEAAAEKQIQQLLFPRSRRKKLRS 426


>ref|XP_006402701.1| hypothetical protein EUTSA_v10005960mg [Eutrema salsugineum]
            gi|557103800|gb|ESQ44154.1| hypothetical protein
            EUTSA_v10005960mg [Eutrema salsugineum]
          Length = 459

 Score =  350 bits (897), Expect = 1e-93
 Identities = 175/325 (53%), Positives = 228/325 (70%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ A+L+FPG +HCEPH Q SW+EF+YVIDD+GDI+FE+ D +NIL+D GASNPV A  
Sbjct: 126  ANSSAVLVFPGAIHCEPHDQNSWSEFKYVIDDYGDIFFEIPDDENILEDPGASNPVKAFF 185

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMD+  YE+ ++               +IF D Y +I D    DI VDWGMP++S   HP
Sbjct: 186  GMDVPRYENARLH-EEYNMSDIGNLDQIIFDDHYFEIMDSEARDIPVDWGMPDTSNGVHP 244

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK ++KAI V++ + MD+PSNGV + G LRP FLDEE Y+RRLF  E+ D Y+ D +
Sbjct: 245  IYFAKHLSKAISVDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDRDDYSWDVQ 304

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
              D  S SSR + +   S++YRLEI  IEL             DFQ AEPD+LVHS  +I
Sbjct: 305  GDDNPSTSSRREENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILVHSTSAI 364

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            +ERF+ RG   ++ALKALCKKKGLH E ANLI VDSLG+DVRV +G +V+THRF FK RA
Sbjct: 365  IERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRFPFKTRA 424

Query: 903  NSECAADKQIQQLLYPRSRRKKLRT 977
             +E AA+K+I QLL+PRSRR+K+++
Sbjct: 425  MTEIAAEKKIHQLLFPRSRRRKMKS 449


>ref|XP_002876496.1| hypothetical protein ARALYDRAFT_486397 [Arabidopsis lyrata subsp.
            lyrata] gi|297322334|gb|EFH52755.1| hypothetical protein
            ARALYDRAFT_486397 [Arabidopsis lyrata subsp. lyrata]
          Length = 459

 Score =  348 bits (893), Expect = 3e-93
 Identities = 175/328 (53%), Positives = 230/328 (70%), Gaps = 1/328 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ A+L+FPG +HCEPH   SW+EF+YVIDD+GDI+FE+ D +NIL+D GASNPV A  
Sbjct: 126  ANSSAVLVFPGAIHCEPHDHNSWSEFKYVIDDYGDIFFEIPDDENILEDPGASNPVKAFF 185

Query: 183  GMDISHYES-RKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTH 359
            GMD+  YE+ R  + Y            +IF D Y +I D    DI +DWGMP++S   H
Sbjct: 186  GMDVPRYENTRHHEEYNISDIGNLDQ--IIFDDHYFEIMDSEARDIPIDWGMPDTSNGVH 243

Query: 360  PVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDD 539
            P+YFAK ++KAI +++ + MD+PSNGV + G LRP FLDEE Y+RRLF  E+ D Y+ + 
Sbjct: 244  PIYFAKHLSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDRDDYSWEV 303

Query: 540  KDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
            +  D  + SSR D +   S++YRLEI  IEL             DFQ AEPD+LVHS+ +
Sbjct: 304  QGDDNPNTSSRQDENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILVHSMSA 363

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            I+ERF+ RG   ++ALKALCKKKGLH E ANLI VDSLG+DVRV +G +V+THRF FK R
Sbjct: 364  IIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRFPFKTR 423

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRTLD 983
            A +E AA+K+I QLL+PRSRR+KL++ D
Sbjct: 424  ATTEMAAEKKIHQLLFPRSRRRKLKSHD 451


>ref|NP_567080.1| pentatricopeptide repeat-containing protein-like protein [Arabidopsis
            thaliana] gi|15292859|gb|AAK92800.1| unknown protein
            [Arabidopsis thaliana] gi|20258901|gb|AAM14144.1| unknown
            protein [Arabidopsis thaliana]
            gi|332646380|gb|AEE79901.1| pentatricopeptide
            repeat-containing protein-like protein [Arabidopsis
            thaliana]
          Length = 459

 Score =  345 bits (884), Expect = 3e-92
 Identities = 175/328 (53%), Positives = 227/328 (69%), Gaps = 1/328 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ A+L+FPG +HCEPH   SW+EF+YVIDD+GDI+FE+ D +NIL+D GASNPV A  
Sbjct: 126  ANSSAVLVFPGAIHCEPHDHNSWSEFKYVIDDYGDIFFEIPDDENILEDPGASNPVKAFF 185

Query: 183  GMDISHYES-RKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTH 359
            GMD+  YE+ R  + Y            +IF D Y +I D    DI +DWGMP++S   H
Sbjct: 186  GMDVPRYENTRHHEEYNISDIGNLDQ--IIFDDHYFEIMDSEARDIPIDWGMPDTSNGVH 243

Query: 360  PVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDD 539
            P+YFAK ++KAI +++ + MD+PSNGV + G LRP FLDEE Y+RRLF  E+ D Y+ + 
Sbjct: 244  PIYFAKHLSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDRDDYSWEV 303

Query: 540  KDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
            +  D    SSR D +   S++YRLEI  IEL             DFQ AEPD+LVHS  +
Sbjct: 304  QGDDNPITSSRRDENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILVHSTSA 363

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            I+ERF+ RG   ++ALKALCKKKGLH E ANLI VDSLG+DVRV +G +V+THRF FK R
Sbjct: 364  IIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRFPFKTR 423

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRTLD 983
            A +E AA+K+I QLL+PRSRR+KL+  D
Sbjct: 424  ATTEMAAEKKIHQLLFPRSRRRKLKCHD 451


>emb|CAB91600.1| putative protein [Arabidopsis thaliana]
          Length = 452

 Score =  345 bits (884), Expect = 3e-92
 Identities = 175/328 (53%), Positives = 227/328 (69%), Gaps = 1/328 (0%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ A+L+FPG +HCEPH   SW+EF+YVIDD+GDI+FE+ D +NIL+D GASNPV A  
Sbjct: 119  ANSSAVLVFPGAIHCEPHDHNSWSEFKYVIDDYGDIFFEIPDDENILEDPGASNPVKAFF 178

Query: 183  GMDISHYES-RKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTH 359
            GMD+  YE+ R  + Y            +IF D Y +I D    DI +DWGMP++S   H
Sbjct: 179  GMDVPRYENTRHHEEYNISDIGNLDQ--IIFDDHYFEIMDSEARDIPIDWGMPDTSNGVH 236

Query: 360  PVYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDD 539
            P+YFAK ++KAI +++ + MD+PSNGV + G LRP FLDEE Y+RRLF  E+ D Y+ + 
Sbjct: 237  PIYFAKHLSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDRDDYSWEV 296

Query: 540  KDGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPS 719
            +  D    SSR D +   S++YRLEI  IEL             DFQ AEPD+LVHS  +
Sbjct: 297  QGDDNPITSSRRDENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILVHSTSA 356

Query: 720  ILERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVR 899
            I+ERF+ RG   ++ALKALCKKKGLH E ANLI VDSLG+DVRV +G +V+THRF FK R
Sbjct: 357  IIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRFPFKTR 416

Query: 900  ANSECAADKQIQQLLYPRSRRKKLRTLD 983
            A +E AA+K+I QLL+PRSRR+KL+  D
Sbjct: 417  ATTEMAAEKKIHQLLFPRSRRRKLKCHD 444


>ref|XP_006291096.1| hypothetical protein CARUB_v10017210mg [Capsella rubella]
            gi|482559803|gb|EOA23994.1| hypothetical protein
            CARUB_v10017210mg [Capsella rubella]
          Length = 457

 Score =  343 bits (879), Expect = 1e-91
 Identities = 175/327 (53%), Positives = 225/327 (68%)
 Frame = +3

Query: 3    ANNRALLIFPGVVHCEPHVQISWAEFQYVIDDFGDIYFEVYDGQNILQDHGASNPVTALI 182
            AN+ A+LIFPG +HCEPH Q SW+EF+YVID++GDI+FE+ D  NIL+D  ASNPV A  
Sbjct: 126  ANSSAVLIFPGAIHCEPHDQTSWSEFKYVIDEYGDIFFEIPDDVNILEDPEASNPVKAFF 185

Query: 183  GMDISHYESRKMDIYXXXXXXXXXXXXVIFVDDYSDIEDPSKADIWVDWGMPESSTWTHP 362
            GMD+  YE+ ++               +IF D Y +I D    DI VDWGMP++S   HP
Sbjct: 186  GMDVPRYENTRLH-EEYNISDIGNLDQIIFDDHYFEIMDSEARDIPVDWGMPDTSNAVHP 244

Query: 363  VYFAKSMTKAIDVEHTKMMDHPSNGVVVWGFLRPVFLDEEIYLRRLFNDEESDGYTSDDK 542
            +YFAK M+KAI +++ + MD+PSNGV + G LRP FLDEE Y+RRLF  E+ D Y+ + +
Sbjct: 245  IYFAKHMSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFTSEDRDDYSWEAQ 304

Query: 543  DGDIMSISSRDDGSCSRSTIYRLEIAKIELFXXXXXXXXXXXXDFQYAEPDVLVHSIPSI 722
            D    S S R D     S++YRLEI  IEL             DFQ AEPD+LVHS  +I
Sbjct: 305  DNP--STSLRRDEKDISSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILVHSTSAI 362

Query: 723  LERFDVRGTRCNVALKALCKKKGLHVEGANLIGVDSLGIDVRVSSGTEVRTHRFSFKVRA 902
            +ERF+ RG   ++ALKALCKKKGLH E ANLI VDSLG+DVRV +G +V+THRF FK RA
Sbjct: 363  IERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRFPFKTRA 422

Query: 903  NSECAADKQIQQLLYPRSRRKKLRTLD 983
             +E AA+K+I QLL+PRSRR+KL++ D
Sbjct: 423  TTEMAAEKKIHQLLFPRSRRRKLKSHD 449

