BLASTX nr result

ID: Akebia23_contig00002892 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00002892
         (2695 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007204292.1| hypothetical protein PRUPE_ppa001096mg [Prun...   696   0.0  
emb|CBI27815.3| unnamed protein product [Vitis vinifera]              660   0.0  
ref|XP_002278147.1| PREDICTED: uncharacterized protein LOC100244...   651   0.0  
ref|XP_006453195.1| hypothetical protein CICLE_v10007604mg [Citr...   637   e-180
ref|XP_004288980.1| PREDICTED: uncharacterized protein LOC101312...   628   e-177
ref|XP_007012655.1| DNA repair protein rhp7, putative [Theobroma...   626   e-176
ref|XP_002516283.1| rad7, putative [Ricinus communis] gi|2235447...   612   e-172
ref|XP_003546506.2| PREDICTED: uncharacterized protein LOC100808...   597   e-168
ref|XP_004243936.1| PREDICTED: DNA repair protein RAD7-like [Sol...   585   e-164
ref|XP_007138661.1| hypothetical protein PHAVU_009G227500g, part...   577   e-161
ref|XP_006297020.1| hypothetical protein CARUB_v10013011mg [Caps...   566   e-158
gb|EXB38942.1| hypothetical protein L484_027377 [Morus notabilis]     564   e-158
ref|XP_007012421.1| Rad7, putative isoform 1 [Theobroma cacao] g...   560   e-156
dbj|BAD43070.1| hypothetical protein [Arabidopsis thaliana] gi|6...   559   e-156
ref|NP_178661.2| uncharacterized protein [Arabidopsis thaliana] ...   559   e-156
dbj|BAF01173.1| hypothetical protein [Arabidopsis thaliana]           557   e-156
ref|XP_002309465.2| hypothetical protein POPTR_0006s23720g [Popu...   556   e-155
gb|EYU28335.1| hypothetical protein MIMGU_mgv1a001385mg [Mimulus...   556   e-155
ref|XP_007012423.1| Rad7, putative isoform 3 [Theobroma cacao] g...   556   e-155
ref|XP_003551123.1| PREDICTED: DNA repair protein rhp7-like [Gly...   553   e-154

>ref|XP_007204292.1| hypothetical protein PRUPE_ppa001096mg [Prunus persica]
            gi|462399823|gb|EMJ05491.1| hypothetical protein
            PRUPE_ppa001096mg [Prunus persica]
          Length = 910

 Score =  696 bits (1795), Expect = 0.0
 Identities = 380/714 (53%), Positives = 489/714 (68%), Gaps = 25/714 (3%)
 Frame = -2

Query: 2400 SSTKGKGKRKLEFDLNSPPLDLGISDGNDKGFFNLRSGARISKRKIEG---------IRV 2248
            S  + K KRKLE D+N P L+    DG  KGF +LRSG ++SKR + G         +  
Sbjct: 208  SKEEVKKKRKLEIDINFPALEWEGEDGRSKGFLSLRSGKKVSKRGLGGGHNGALVIDLDA 267

Query: 2247 DSNGIDNVGEKL--------------AXXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVN 2110
            D NG   +GE                +               +               + 
Sbjct: 268  DENGKGKLGESGFAFNGVDVVELDSDSEEERSSENLVQSSSPRGKRKLSDAIEGVAEDLK 327

Query: 2109 PNTIADEMGPSNVKKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLI 1930
               +A E G  N ++RY+ EEKGKG+LI +  L  G D  EL LK E        V   +
Sbjct: 328  DEVMASENGIDNGRRRYSIEEKGKGKLIGEVVLMNGNDEAELGLKSE--------VLSSV 379

Query: 1929 PDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPE 1756
             + A+  ++ +E +A+  E Q+  +  R  A        RF+D AR++ASRFAHF  + E
Sbjct: 380  ENVAASPIRKRENAALPDESQLINSNTRENAASGNQYMERFRDIARRNASRFAHFASEEE 439

Query: 1755 DEDYIAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLN 1576
            +E+ + P  E  ++IEDWPGPFSTAMKII+DRA K    Q+ S    T P P +EW P +
Sbjct: 440  EENQLPPQVEVAQDIEDWPGPFSTAMKIIKDRAAKN--AQLPSKDQ-TKP-PFVEWVPKS 495

Query: 1575 NEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDL 1396
             + +  S+ L+PSL+DL ++ LAKNA+AI SL+ V D +R++L Q+LCDSR+M+ HF +L
Sbjct: 496  FQDRPLSKNLIPSLQDLCLSFLAKNADAIVSLEHVADALRHRLCQMLCDSRKMNSHFFEL 555

Query: 1395 LGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPS 1216
            L  G P+E+ ++DCSWMTEE+ TK F   DT+NLTVLQLDQCGRC+ DYIL +TLARS +
Sbjct: 556  LVQGLPTEVRLRDCSWMTEEQFTKSFQQWDTSNLTVLQLDQCGRCVADYILHSTLARSSN 615

Query: 1215 SLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRE 1036
             L AL ++SL G CR++DVGL ALV+SAPALRSLNL QCSLLTS  I  +ADSLG+VLRE
Sbjct: 616  CLPALTTLSLSGACRLSDVGLGALVSSAPALRSLNLSQCSLLTSSSIGTLADSLGSVLRE 675

Query: 1035 LYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCG 856
            LY++DCQ IDA+LILPALKKLEHLEVL +  ++ VCD FI  F+T  G ++KELVL +CG
Sbjct: 676  LYLNDCQGIDALLILPALKKLEHLEVLWLGGLENVCDDFIKEFVTARGQSLKELVLTDCG 735

Query: 855  KLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAA 676
            KLTD+S+KVIAE+C+GLCALDLVNL+KLTD+ +G+LANGC+ IQTLKLCRNAFSDE+IAA
Sbjct: 736  KLTDSSVKVIAETCTGLCALDLVNLYKLTDLTLGYLANGCREIQTLKLCRNAFSDEAIAA 795

Query: 675  FLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSS 496
            FLE SGE L ELSLNN+K+VG+NTAI+LA+ S+ L +LDLSWCRNL DEA+GLI D+C S
Sbjct: 796  FLETSGECLTELSLNNIKKVGYNTAIALAKRSRKLHTLDLSWCRNLTDEALGLIADSCLS 855

Query: 495  LRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 334
            LR+LKLFGCTQ+TNTF +GHSNP V+IIGLK++PILEH+ V D  E PLRY SV
Sbjct: 856  LRILKLFGCTQLTNTFLDGHSNPEVKIIGLKVSPILEHVKVSDPHEGPLRYSSV 909


>emb|CBI27815.3| unnamed protein product [Vitis vinifera]
          Length = 832

 Score =  660 bits (1702), Expect = 0.0
 Identities = 377/701 (53%), Positives = 470/701 (67%), Gaps = 10/701 (1%)
 Frame = -2

Query: 2421 RRSERLGSSTKGKGKRKLEFDLNSPPLDLGISDGNDKGFFNLRSGARISKRKIEGI-RVD 2245
            +RSE + +  + KGKRKL F+ +  PLD       DKGF  LRSG +I K  + G+ R++
Sbjct: 183  QRSEDMVTVKQSKGKRKLSFEAS--PLD-----DEDKGFLGLRSGKKIVKEIMCGVDRIE 235

Query: 2244 SNGIDNVGEKLAXXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVNPNTIADEMGPSNVKK 2065
            S+G   V E+                                 +  +  A+E G    ++
Sbjct: 236  SDGGKYVVEQ----------ERGGEDKGVKVQGHGNGEAAVEELQKDPSANENGSVRGRR 285

Query: 2064 RYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEISA 1885
            R+T EEKGKG+L++D      ID VELDL  E +N++++                  +SA
Sbjct: 286  RFTGEEKGKGKLVEDDEPQNRIDAVELDLNLELKNVIDN------------------MSA 327

Query: 1884 VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDE-----DYIAPLPEPN 1720
             E++   A+TR              F+D AR++ASRFAHF PE E        A +  P+
Sbjct: 328  DENDAVEARTR--------------FRDIARRNASRFAHFAPEQEMENHPSREAEIQRPS 373

Query: 1719 E----EIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQNRSR 1552
            E    E EDWPGPFSTAMKII+DR  KQN +Q NS+S    PA VI W P   +     +
Sbjct: 374  EGGEKENEDWPGPFSTAMKIIKDREKKQNTQQ-NSSSDRNRPAHVI-WSPRKVKSSECPK 431

Query: 1551 PLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPSE 1372
            PL PSL+++ + +LA+N +AITSL+ +PD +R+KLSQLLCDSRRM+ H ++LL  GSP E
Sbjct: 432  PLAPSLQEMCLEVLAQNGDAITSLESIPDALRHKLSQLLCDSRRMNSHILELLVSGSPFE 491

Query: 1371 ICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSI 1192
            +CV+DCSW+TEEE  +IF  CDTN+LTVLQLDQCGRCM DY+LRAT     + L AL ++
Sbjct: 492  VCVRDCSWLTEEEFARIFKRCDTNSLTVLQLDQCGRCMTDYVLRATFDMLSNGLPALTTV 551

Query: 1191 SLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQS 1012
            SL+G CR++D GL ALV+SAP LRS+NL QCSLLTS  I N+A++LG+VLRELYIDDCQ 
Sbjct: 552  SLKGACRLSDAGLRALVSSAPMLRSINLSQCSLLTSASIKNLAETLGSVLRELYIDDCQG 611

Query: 1011 IDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLK 832
            IDAMLIL AL+KLE LEVLSVA IQTVCD FI  FI+V G  MKELVL +C +LTD SLK
Sbjct: 612  IDAMLILSALEKLECLEVLSVAGIQTVCDDFIWEFISVHGPTMKELVLTDCSRLTDFSLK 671

Query: 831  VIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGES 652
             IAE+C  L ALDL NL KLTD A G+LA+GCQ++QTLKL  N+FSDE+IAAFLE SG S
Sbjct: 672  AIAETCPELRALDLGNLCKLTDSAFGYLASGCQAMQTLKLRCNSFSDEAIAAFLEISGGS 731

Query: 651  LKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFG 472
            LKELSLNNV ++GHNTAISLAR S+ L+ LDLSWCRNL D  +G IVD+C SLRVLKLFG
Sbjct: 732  LKELSLNNVSKIGHNTAISLARRSRELIRLDLSWCRNLTDGDLGFIVDSCLSLRVLKLFG 791

Query: 471  CTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPL 349
            CTQITN F +GHSNP V IIGLKLTPIL+HL + D    PL
Sbjct: 792  CTQITNMFVDGHSNPQVEIIGLKLTPILKHLKLTDPQSFPL 832


>ref|XP_002278147.1| PREDICTED: uncharacterized protein LOC100244043 [Vitis vinifera]
          Length = 905

 Score =  651 bits (1679), Expect = 0.0
 Identities = 353/595 (59%), Positives = 432/595 (72%), Gaps = 9/595 (1%)
 Frame = -2

Query: 2106 NTIADEMGPSNVKKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIP 1927
            N  ADE       +RY+REEKGKG LI D      ++PV+ +L+ E +N ++ AVS  I 
Sbjct: 324  NMSADENDAVEGGQRYSREEKGKGILINDDLAPNAVNPVDFNLESEVKNSVDTAVSESI- 382

Query: 1926 DAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDE- 1750
                   QL+    ++ + ++ +T V   A R R    RF+D AR++ASRFAHF PE E 
Sbjct: 383  -------QLEGNVGLQVQNEVIQTSVTGIASRART---RFRDIARRNASRFAHFAPEQEM 432

Query: 1749 ----DYIAPLPEPNE----EIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVI 1594
                   A +  P+E    E EDWPGPFSTAMKII+DR  KQN +Q NS+S    PA VI
Sbjct: 433  ENHPSREAEIQRPSEGGEKENEDWPGPFSTAMKIIKDREKKQNTQQ-NSSSDRNRPAHVI 491

Query: 1593 EWKPLNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMD 1414
             W P   +     +PL PSL+++ + +LA+N +AITSL+ +PD +R+KLSQLLCDSRRM+
Sbjct: 492  -WSPRKVKSSECPKPLAPSLQEMCLEVLAQNGDAITSLESIPDALRHKLSQLLCDSRRMN 550

Query: 1413 CHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRAT 1234
             H ++LL  GSP E+CV+DCSW+TEEE  +IF  CDTN+LTVLQLDQCGRCM DY+LRAT
Sbjct: 551  SHILELLVSGSPFEVCVRDCSWLTEEEFARIFKRCDTNSLTVLQLDQCGRCMTDYVLRAT 610

Query: 1233 LARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSL 1054
                 + L AL ++SL+G CR++D GL ALV+SAP LRS+NL QCSLLTS  I N+A++L
Sbjct: 611  FDMLSNGLPALTTVSLKGACRLSDAGLRALVSSAPMLRSINLSQCSLLTSASIKNLAETL 670

Query: 1053 GTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKEL 874
            G+VLRELYIDDCQ IDAMLIL AL+KLE LEVLSVA IQTVCD FI  FI+V G  MKEL
Sbjct: 671  GSVLRELYIDDCQGIDAMLILSALEKLECLEVLSVAGIQTVCDDFIWEFISVHGPTMKEL 730

Query: 873  VLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFS 694
            VL +C +LTD SLK IAE+C  L ALDL NL KLTD A G+LA+GCQ++QTLKL  N+FS
Sbjct: 731  VLTDCSRLTDFSLKAIAETCPELRALDLGNLCKLTDSAFGYLASGCQAMQTLKLRCNSFS 790

Query: 693  DESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLI 514
            DE+IAAFLE SG SLKELSLNNV ++GHNTAISLAR S+ L+ LDLSWCRNL D  +G I
Sbjct: 791  DEAIAAFLEISGGSLKELSLNNVSKIGHNTAISLARRSRELIRLDLSWCRNLTDGDLGFI 850

Query: 513  VDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPL 349
            VD+C SLRVLKLFGCTQITN F +GHSNP V IIGLKLTPIL+HL + D    PL
Sbjct: 851  VDSCLSLRVLKLFGCTQITNMFVDGHSNPQVEIIGLKLTPILKHLKLTDPQSFPL 905


>ref|XP_006453195.1| hypothetical protein CICLE_v10007604mg [Citrus clementina]
            gi|568840725|ref|XP_006474316.1| PREDICTED:
            uncharacterized protein LOC102618698 [Citrus sinensis]
            gi|557556421|gb|ESR66435.1| hypothetical protein
            CICLE_v10007604mg [Citrus clementina]
          Length = 715

 Score =  637 bits (1643), Expect = e-180
 Identities = 361/697 (51%), Positives = 463/697 (66%), Gaps = 15/697 (2%)
 Frame = -2

Query: 2379 KRKLEFDLNSPPLDLGISDGNDKGFFNLRSGARISKRKIE---GIRVDSNGIDNVGEKLA 2209
            KRKL+   N     LG+  G+ +GF NLRSG ++ KR  E   G  VD    +N  E + 
Sbjct: 54   KRKLDVSENL----LGLEGGDSEGFLNLRSGKKVIKRIGETDGGNSVDGKEKENGKETMD 109

Query: 2208 XXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVNPNTIADEMGPSNVKKRYTREEKGKGRL 2029
                                                 AD       ++R+ REEKGK +L
Sbjct: 110  FEEVRMLREVSKDGADVDKLDIKQN------------ADGSCSEKRRRRFGREEKGKAKL 157

Query: 2028 IKDTWLSIGIDPVELDL----KPEEENLLEDAVSGLIPDAASGLVQLQEISAVEHERQIA 1861
            I +     G + + LDL    K  EEN+      G + +  +             +R   
Sbjct: 158  IDEDSTVNGSEFINLDLELGTKHSEENV------GSVSEPRT------------EQRVDK 199

Query: 1860 KTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPED-------EDYIAPLPEPNEEIEDW 1702
            K+ VR+   R      +F+D ARQ+AS+FA+F  E+       E  +    E   EIEDW
Sbjct: 200  KSSVRLSESRME----QFRDIARQNASKFAYFNVEENHLSDDNERLVVADGEVGREIEDW 255

Query: 1701 PGPFSTAMKIIRDRATKQNGRQ-MNSTSGVTNPAPVIEWKPLNNEGQNRSRPLVPSLRDL 1525
            PGPFSTAMKI+RDR  K +G Q + S          I W P   + Q   + ++PSL++L
Sbjct: 256  PGPFSTAMKIVRDREKKLSGGQRIGSLDPKKKSNSSILWIPRKGQRQG-PKLIIPSLKEL 314

Query: 1524 SMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWM 1345
            SM IL +NA+AITSL+ VPD +R+KLS +LCDSR+M+ HF++LL  GSP+EI ++DCSW+
Sbjct: 315  SMKILVQNADAITSLEHVPDALRHKLSFMLCDSRQMNSHFLNLLFSGSPTEIRLRDCSWL 374

Query: 1344 TEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRIT 1165
            TE+E TK F +CDT NLTVLQLD+CGRCMPDYIL +TLA S +SL +L ++S+ G CRI+
Sbjct: 375  TEQEFTKAFVSCDTKNLTVLQLDRCGRCMPDYILLSTLASSLNSLPSLTTLSICGACRIS 434

Query: 1164 DVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPA 985
            DVG  ALV SAPALRS+NL QCSLLTS  ++ +AD LG+ ++ELYI+DCQS++AMLILPA
Sbjct: 435  DVGFKALVTSAPALRSINLSQCSLLTSTSMDILADKLGSFIQELYINDCQSLNAMLILPA 494

Query: 984  LKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGL 805
            L+KL+HLEVLSVA I+TV D F+ GF+  CG NMKEL+L +C KLTD SLKVIAE+C  L
Sbjct: 495  LRKLKHLEVLSVAGIETVTDEFVRGFVYACGHNMKELILTDCVKLTDFSLKVIAETCPRL 554

Query: 804  CALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNV 625
            C LDL NL+KLTD  IG+LANGCQ+IQTLKLCRNAFSDE+IAAFLE +GE LKELSLNNV
Sbjct: 555  CTLDLSNLYKLTDFGIGYLANGCQAIQTLKLCRNAFSDEAIAAFLETAGEPLKELSLNNV 614

Query: 624  KRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFF 445
            ++V  NTA+SLA+ S  L++LDLSWCRNL+DEA+GLIVD+C SLR+LKLFGC+QITN F 
Sbjct: 615  RKVADNTALSLAKRSNKLVNLDLSWCRNLSDEALGLIVDSCLSLRMLKLFGCSQITNAFL 674

Query: 444  NGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 334
            +GHSNP V+IIGLK++P+LEH+ V D  E PL Y SV
Sbjct: 675  DGHSNPDVQIIGLKMSPVLEHVKVPDFHEGPLHYSSV 711


>ref|XP_004288980.1| PREDICTED: uncharacterized protein LOC101312489 [Fragaria vesca
            subsp. vesca]
          Length = 903

 Score =  628 bits (1620), Expect = e-177
 Identities = 334/600 (55%), Positives = 435/600 (72%), Gaps = 21/600 (3%)
 Frame = -2

Query: 2070 KKRYTREEKGKGRLIKDTWLSIGIDPVELDLKP---------------EEENLLEDAVSG 1936
            +++++R+EKGK +LI    L    D VELD                   E +L+ + V  
Sbjct: 305  RRKFSRQEKGKEKLIGGALLPNDFDKVELDFLGIGALSELSSMPNVVLSELSLMPNVVLS 364

Query: 1935 ---LIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF 1765
               L+ +      Q+ E  A++ + Q   T  R +   R     RF+D ARQ+ASRFA F
Sbjct: 365  ELSLMSNVVPSPAQVGENVAMQEQVQARNTNAREEGRDRNQYMERFRDIARQNASRFARF 424

Query: 1764 QP--EDEDYIAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIE 1591
             P  E+E+ + P  +   E EDWPGPFSTAM+I+RD A K N ++ +++   T PA +++
Sbjct: 425  DPREEEENDMPPQVDVELEDEDWPGPFSTAMRIMRDGAEK-NMQEHSASKDKTKPA-LVK 482

Query: 1590 WKPLNNEGQNR-SRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMD 1414
            W P   E     S+ L+PSL++L +++LAKNA+ I SL+ VPD +R++LS LLCDSRRM+
Sbjct: 483  WVPKRQEQDLAISKNLIPSLQELCLSVLAKNADEIVSLESVPDALRHQLSHLLCDSRRMN 542

Query: 1413 CHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRAT 1234
             HF +LL  GSP+E+ ++DCSW+TEEE TK F  CD  NLTVLQLDQCGRC+PDYIL +T
Sbjct: 543  THFFELLVQGSPTEVRLRDCSWLTEEEFTKSFQLCDITNLTVLQLDQCGRCLPDYILNST 602

Query: 1233 LARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSL 1054
            LARS + L +LVS+SL G CR++DVGL ALV+S PALRSLNL QCSLLTS  I+ +A+SL
Sbjct: 603  LARSANCLPSLVSLSLSGACRLSDVGLGALVSSVPALRSLNLSQCSLLTSSSIDTLANSL 662

Query: 1053 GTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKEL 874
            G++L+ELY++DCQSIDAM ILPALKK EHLEVL +  I+ VCD FI  FI+  G N+KEL
Sbjct: 663  GSLLKELYLNDCQSIDAMQILPALKKFEHLEVLWLPGIENVCDDFIKEFISARGHNLKEL 722

Query: 873  VLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFS 694
             L +C  LTD+S+KV+AE+CSGLCALDL NLHKLTD ++G+LANGC++IQTLK CRN+FS
Sbjct: 723  SLTDCINLTDSSVKVLAETCSGLCALDLFNLHKLTDYSLGYLANGCRAIQTLKFCRNSFS 782

Query: 693  DESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLI 514
            DE++AAFLE SGE LKELSLNN+ +VG NTAISLAR S+NL  LDLSWCRNL DEA+GLI
Sbjct: 783  DEAVAAFLETSGECLKELSLNNITKVGDNTAISLARHSRNLHCLDLSWCRNLTDEALGLI 842

Query: 513  VDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 334
            VD+C SL++LKLFGCTQIT+ F +GHSNP V+IIG+++TPIL+ + V D    PL Y +V
Sbjct: 843  VDSCLSLKMLKLFGCTQITDLFLSGHSNPDVKIIGVRMTPILKDVRVPDPAAGPLHYSAV 902


>ref|XP_007012655.1| DNA repair protein rhp7, putative [Theobroma cacao]
            gi|508783018|gb|EOY30274.1| DNA repair protein rhp7,
            putative [Theobroma cacao]
          Length = 742

 Score =  626 bits (1615), Expect = e-176
 Identities = 336/591 (56%), Positives = 420/591 (71%), Gaps = 12/591 (2%)
 Frame = -2

Query: 2079 SNVKKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQL 1900
            +N ++R++ E KGKG+L+ +T                   +LE      +  + SG+   
Sbjct: 170  ANCRRRFSAEGKGKGKLVVET-------------------ILESKAKSSVDGSVSGVNLS 210

Query: 1899 QEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDED--------- 1747
             E   +  E++  K + R    R       F+D ARQ+ASR+AHF  ++ED         
Sbjct: 211  AEKVRLPDEKRTKKNKKRGYGGRTE----HFRDVARQNASRYAHFDAQEEDDNIFSVEAE 266

Query: 1746 -YIAPLPEPNEE--IEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLN 1576
              I+P  E  EE  +EDWPGPFSTAMKIIRDRA K N ++  S+SG      ++ W P  
Sbjct: 267  REISPENEQPEETGVEDWPGPFSTAMKIIRDRAEKLNLQRGRSSSGNVQSVQIM-WVPQK 325

Query: 1575 NEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDL 1396
             +G++RS+ L PSL D+   IL  NA+AI SL  VPD +R+KL Q+LCDSRRM+ +F+DL
Sbjct: 326  GKGKDRSKRLPPSLLDMCFRILVNNADAIASLDHVPDALRHKLCQMLCDSRRMNSNFLDL 385

Query: 1395 LGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPS 1216
            L  GSPSEI ++DCSW+TEE+ T+ F  CDT  LTVLQLDQCG C+PDYIL +TLA+S +
Sbjct: 386  LVSGSPSEIRLRDCSWLTEEQFTRCFDGCDTTKLTVLQLDQCGCCIPDYILLSTLAQSSN 445

Query: 1215 SLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRE 1036
            SL AL+++SL G  R++D GLNALV+SAPALRS+NL Q SLLT+   + +A+SL +VL E
Sbjct: 446  SLPALINLSLTGAFRLSDAGLNALVSSAPALRSINLSQSSLLTASAFDTLANSLASVLLE 505

Query: 1035 LYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCG 856
            LYI+DCQSIDA LILPALKKLEHLEVLSVA +++V D FI  FI   G  +KEL+L  C 
Sbjct: 506  LYINDCQSIDAKLILPALKKLEHLEVLSVAGLESVTDCFIKEFIIARGHGIKELILTGCR 565

Query: 855  KLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAA 676
            KL+D+SLK+IAE+C  L ALD+ NL KLTD  +G+LANGCQS+Q LK CRNAFSD++IAA
Sbjct: 566  KLSDSSLKIIAETCPNLRALDVGNLSKLTDSTLGYLANGCQSLQLLKFCRNAFSDDAIAA 625

Query: 675  FLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSS 496
            FLE SGE LKELSLNNV +VGHNTA+SLAR SKNLLSLDLSWCRNL DEAVGLIVD+C S
Sbjct: 626  FLETSGEVLKELSLNNVGKVGHNTALSLARRSKNLLSLDLSWCRNLTDEAVGLIVDSCLS 685

Query: 495  LRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY 343
            LRVLKLFGCTQITN F +GHSN  V IIGLK +P+LEH+ V D  E PLRY
Sbjct: 686  LRVLKLFGCTQITNVFLDGHSNSKVEIIGLKFSPLLEHIKVPDSQEGPLRY 736


>ref|XP_002516283.1| rad7, putative [Ricinus communis] gi|223544769|gb|EEF46285.1| rad7,
            putative [Ricinus communis]
          Length = 765

 Score =  612 bits (1577), Expect = e-172
 Identities = 366/691 (52%), Positives = 452/691 (65%), Gaps = 36/691 (5%)
 Frame = -2

Query: 2424 RRRSERLGSST-----KGKGKRKL--------EFDLNSPPLDLGISDGNDKGFF-NLRSG 2287
            RRRS RL S +      G  KRK+        E +  +    +  +D  D     +LRSG
Sbjct: 47   RRRSLRLASKSVPRDQNGSRKRKISSIEKEKEETEEQNSAFQVNDNDNVDSEMILSLRSG 106

Query: 2286 ARISKRKIE-----GIRVDSNGI-----DNVGEKLAXXXXXXXXXXXXXXXKFXXXXXXX 2137
             R+ KRK+E      + +++  +     +NV +K                 K        
Sbjct: 107  KRVVKRKVEYDSGENLVIEAKDLNVEEFENVSDK--------DKGKAKLTEKLMEKQSVV 158

Query: 2136 XXXXXXXVNPNTIADEMGPS-NVKKRYTREEKGKGRLIKDTWL-SIGIDPVELDLKPEE- 1966
                   +  N  + E   S   K+RY+REEKGK  L  D    SIG D +EL  K +E 
Sbjct: 159  EGNCSSRLEVNKFSHESSNSMRTKRRYSREEKGKANLDDDGLSNSIGKDELELQSKVKEL 218

Query: 1965 -ENLLEDAVSGLIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQ 1789
              +L E+ V  L+P                +ERQ        K    R+D+  F+D A +
Sbjct: 219  GHSLGENVV--LLPG---------------NERQTMNINTSNKN-ESRMDQ--FRDIATR 258

Query: 1788 SASRFAHF-QPEDEDYIAPLP-------EPNEEIEDWPGPFSTAMKIIRDRATKQNGRQM 1633
            +ASRFA F + EDE+  + +        E NE IEDWPGPFSTAMKIIRDRA  +N +Q 
Sbjct: 259  NASRFAQFDRQEDENLPSEVDNVEISSVEENERIEDWPGPFSTAMKIIRDRANMRNSQQG 318

Query: 1632 NSTSGVTNPAPVIEWKPLNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRN 1453
             ST       P I W P  N    +SR  VPSL++L M I+ KN +A+TSL  VPD +R+
Sbjct: 319  ASTLEKPQSVP-ITWVPTRNR---QSRTCVPSLQELCMRIIVKNVDAVTSLDHVPDALRH 374

Query: 1452 KLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQ 1273
            +L QLLCD R+M+  F+DLL  GSP+EI VKDCSWM+EEE  K F  CDTNNL+VLQLDQ
Sbjct: 375  RLCQLLCDCRKMNSSFLDLLVRGSPTEIRVKDCSWMSEEELVKCFEGCDTNNLSVLQLDQ 434

Query: 1272 CGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSL 1093
            CGRCMPDY++ ATLARS  SL AL+++SL G CR++D+GL+ LVASA +LRS+NL QCS 
Sbjct: 435  CGRCMPDYVIPATLARSSRSLPALITLSLCGACRLSDIGLSLLVASATSLRSINLSQCSH 494

Query: 1092 LTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIH 913
            LTS  I  +ADSLG+VLRELYIDDCQS+DAMLILP+LKKLEHLEVLS+A IQTVCD F+ 
Sbjct: 495  LTSTSIGTLADSLGSVLRELYIDDCQSLDAMLILPSLKKLEHLEVLSLAGIQTVCDDFVR 554

Query: 912  GFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQ 733
             F+  CG N+KE  LA+C KLTD+SLKVIAE+C GLCAL+LVNL KLTD  +G LANGC+
Sbjct: 555  EFVVACGHNIKEFGLADCTKLTDSSLKVIAETCPGLCALNLVNLRKLTDSTLGFLANGCR 614

Query: 732  SIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLS 553
             IQTLKLCRNAFSDE IAAFLE+SG+ LKELSLNNVK+VGH+TAISLAR S+NL+SLDLS
Sbjct: 615  EIQTLKLCRNAFSDEGIAAFLESSGDLLKELSLNNVKKVGHHTAISLARRSRNLISLDLS 674

Query: 552  WCRNLADEAVGLIVDNCSSLRVLKLFGCTQI 460
            WCRNL+DEAVGLIVD+CSSLRVLKLFGC Q+
Sbjct: 675  WCRNLSDEAVGLIVDSCSSLRVLKLFGCGQV 705


>ref|XP_003546506.2| PREDICTED: uncharacterized protein LOC100808150 [Glycine max]
          Length = 826

 Score =  597 bits (1540), Expect = e-168
 Identities = 296/495 (59%), Positives = 381/495 (76%), Gaps = 2/495 (0%)
 Frame = -2

Query: 1812 RFQDAARQSASRFAHFQPEDEDY--IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGR 1639
            RF D AR++ASRFA F PE ED+    P+    +EIEDWPGPFSTAMKIIRDR +K    
Sbjct: 330  RFHDIARENASRFAFFAPEGEDHDRSPPVEPERDEIEDWPGPFSTAMKIIRDRGSKLQNA 389

Query: 1638 QMNSTSGVTNPAPVIEWKPLNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGM 1459
            + +S + +      I+W P    G       VPSL+++ + IL KN +AI SL+ VPD +
Sbjct: 390  EASSQASLCES---IKWVPNAKRGNAGVNVSVPSLQEMCLKILVKNVDAIASLESVPDAL 446

Query: 1458 RNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQL 1279
            R++LSQLLCDSRR++ HF++LL  G+P+EI ++DCSW+TEE+ T+ F  CDT NL VLQL
Sbjct: 447  RHRLSQLLCDSRRINGHFLELLVRGTPTEIRLRDCSWLTEEQFTESFRTCDTENLVVLQL 506

Query: 1278 DQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQC 1099
            DQCGRC+PDY++ +TLA+SP  L++L ++SL G CR++D GL ALV+SAPALRS+NL QC
Sbjct: 507  DQCGRCLPDYVVVSTLAQSPRHLSSLSTLSLSGACRLSDGGLRALVSSAPALRSINLSQC 566

Query: 1098 SLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAF 919
            SLLTS  +  +A+SL ++L+ELY+DDCQ IDA LI+PAL +LEHLEVLSVA IQTVCD F
Sbjct: 567  SLLTSSSVYILAESLKSLLKELYLDDCQGIDAALIVPALIELEHLEVLSVAGIQTVCDEF 626

Query: 918  IHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANG 739
            +  +I   G NMKELVL +C  LTD S+K I E C GLC LDL+NLHKLTD++IGHLANG
Sbjct: 627  VKNYIVARGQNMKELVLKDCINLTDASIKAIVEHCPGLCVLDLMNLHKLTDLSIGHLANG 686

Query: 738  CQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLD 559
            C+++ TLKLCRN FSDE+IAAF+E +G SLKELSLNN+K+VG++T +SLA  +KNL SLD
Sbjct: 687  CRALHTLKLCRNPFSDEAIAAFVETTGGSLKELSLNNIKKVGYHTTLSLANHAKNLHSLD 746

Query: 558  LSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHL 379
            LSWCRNL D A+GLIVD+C +LR LKLFGC+Q+T+ F NGHSN  ++IIGLK++P+LEH+
Sbjct: 747  LSWCRNLTDNALGLIVDSCLALRSLKLFGCSQVTDAFLNGHSNLQIQIIGLKMSPVLEHV 806

Query: 378  NVLDRPEAPLRY*SV 334
             V D  +  L Y SV
Sbjct: 807  KVPDPHQGALNYSSV 821


>ref|XP_004243936.1| PREDICTED: DNA repair protein RAD7-like [Solanum lycopersicum]
          Length = 902

 Score =  585 bits (1507), Expect = e-164
 Identities = 317/601 (52%), Positives = 414/601 (68%), Gaps = 22/601 (3%)
 Frame = -2

Query: 2067 KRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEIS 1888
            +R +REEKGK  +  D  L  G+D +E   K   E   ++ VS  I              
Sbjct: 316  RRISREEKGKQVMAGDD-LCHGVDTLEGKSKNGAEKPADEIVSRAIN------------L 362

Query: 1887 AVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDYIAP-----LP 1729
             ++   Q+A       A RR + R RF+D AR++ASRFAHF  Q E E+ +A       P
Sbjct: 363  TIQDGEQVADADGSATATRR-VHRERFRDVARRNASRFAHFSSQAEHENDVADEAAEEFP 421

Query: 1728 EP---NEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQNR 1558
            +     EEIEDWPGPFSTAM IIRDR      +Q N +         + W P  ++    
Sbjct: 422  QEVAETEEIEDWPGPFSTAMNIIRDREMNMKHQQQNKSE---KSKIEVVWVPKTDQQGQS 478

Query: 1557 SRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSP 1378
             + +VPSL DL M+IL KNA+AITSL  +PD +R+K+ Q LCDSR M   F+ LL  GSP
Sbjct: 479  RKMVVPSLHDLCMDILVKNADAITSLDGLPDALRHKICQSLCDSREMTYQFLQLLISGSP 538

Query: 1377 SEICVKDCSWMTEEEATKIFGACDTNN-----------LTVLQLDQCGRCMPDYILRATL 1231
            +EI ++DCSW+ EE  T+ F  CDTNN           L VLQLDQCGRC+PDYIL  TL
Sbjct: 539  TEIRIRDCSWLNEENFTQSFKGCDTNNFESFKGCDTNNLVVLQLDQCGRCLPDYILLVTL 598

Query: 1230 ARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLG 1051
            AR P++L AL ++SL+G CR++D GL A++++AP LRS+NL QCSLLT  GI+++++SLG
Sbjct: 599  ARRPNNLPALTTLSLKGACRLSDAGLEAIISAAPNLRSINLSQCSLLTCDGISSLSNSLG 658

Query: 1050 TVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELV 871
            +VLRELY+D+C+++  +LILPAL KL+HLEVLSVA IQTVCDAFI  F+T  G +++E++
Sbjct: 659  SVLRELYLDNCEAVHPILILPALLKLQHLEVLSVAGIQTVCDAFIKEFVTNRGQSLREII 718

Query: 870  LANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSD 691
            L  C +LTD SLK I+++C  L A+DL +L KLTD AI HLA GC+ +  LKLCRN FSD
Sbjct: 719  LKGCMELTDRSLKDISQNCPKLRAIDLSDLCKLTDSAIEHLATGCREVDNLKLCRNPFSD 778

Query: 690  ESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIV 511
            E++AA++E SG SLKELSLN +K+V HNTA+SLA+CSKNL+SLDLSWCRNL +EA+GLIV
Sbjct: 779  EAVAAYVEISGVSLKELSLNRIKKVSHNTAMSLAKCSKNLISLDLSWCRNLTNEALGLIV 838

Query: 510  DNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDR-PEAPLRY*SV 334
            D+C SL VLKLFGC+Q+T+ F +GHSNP V+IIGLK+TPILEH+   D   + PLRY +V
Sbjct: 839  DSCLSLEVLKLFGCSQVTSVFLDGHSNPQVKIIGLKMTPILEHIEAPDSLQQGPLRYSAV 898

Query: 333  P 331
            P
Sbjct: 899  P 899


>ref|XP_007138661.1| hypothetical protein PHAVU_009G227500g, partial [Phaseolus vulgaris]
            gi|561011748|gb|ESW10655.1| hypothetical protein
            PHAVU_009G227500g, partial [Phaseolus vulgaris]
          Length = 771

 Score =  577 bits (1487), Expect = e-161
 Identities = 296/520 (56%), Positives = 385/520 (74%), Gaps = 3/520 (0%)
 Frame = -2

Query: 1884 VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDED--YIAPLPEP-NEE 1714
            V    + +  R R   +RR     RF D AR++ASRFA F PE+ED     P+PE  +EE
Sbjct: 251  VRERSRNSNARERRSGLRRNDYMERFHDIARENASRFAFFAPEEEDDGRSPPVPEAASEE 310

Query: 1713 IEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQNRSRPLVPSL 1534
            IEDWPGPFSTAMKIIRDR       Q   TS   N    I+W P  ++G       VPSL
Sbjct: 311  IEDWPGPFSTAMKIIRDRGMNLQNAQ---TSSQANLCESIKWVPKAHKGDVGVLS-VPSL 366

Query: 1533 RDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDC 1354
            +D+   IL  N +AI SL+ VPD +R++LSQLLCDSRR++ HF++LL  G+P+EI ++DC
Sbjct: 367  QDMCFRILVNNVDAIASLESVPDALRHRLSQLLCDSRRINGHFLELLVRGTPTEIRLRDC 426

Query: 1353 SWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGC 1174
            SW+TEE+ T+ F  C+T NL+VLQLDQCGRC+PD+++ ATLARSP +LA L ++SLRG C
Sbjct: 427  SWLTEEQFTECFRMCNTENLSVLQLDQCGRCLPDFVIVATLARSPRNLARLTTLSLRGAC 486

Query: 1173 RITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLI 994
            R++D GL ALV+SAPALRS+NL QCSLLTS  I  +A+SL  +L+EL++DDCQ IDA LI
Sbjct: 487  RLSDGGLRALVSSAPALRSINLSQCSLLTSASIYLLAESLSYLLKELFLDDCQGIDAALI 546

Query: 993  LPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESC 814
            +PAL +LEHLEVLSVA I TVCD F+  +I   G NMKELVL +C  LTD+S+KVI E C
Sbjct: 547  VPALIELEHLEVLSVAGIPTVCDEFVKNYIVARGQNMKELVLKDCINLTDSSIKVIVEHC 606

Query: 813  SGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSL 634
             GL  LD++NL++LTD+++G+L NGC+ + TLKLCRN FSDE+IAAF+E +G SLKEL L
Sbjct: 607  PGLRVLDIMNLNRLTDLSVGYLTNGCRVLHTLKLCRNPFSDEAIAAFVETTGGSLKELLL 666

Query: 633  NNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITN 454
            NN+K+VG++T +SLA  +K L  LDLSWCRNL D A+GLIVD+C +LR+L+LFGCTQ+T+
Sbjct: 667  NNIKKVGYHTTLSLANHAKKLHYLDLSWCRNLTDNALGLIVDSCLALRLLRLFGCTQVTD 726

Query: 453  TFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 334
             F NGHSN  ++IIGLK++P+L+ + V D  +  L Y SV
Sbjct: 727  AFLNGHSNLQIQIIGLKMSPVLQDVKVPDPHQGALNYSSV 766


>ref|XP_006297020.1| hypothetical protein CARUB_v10013011mg [Capsella rubella]
            gi|482565729|gb|EOA29918.1| hypothetical protein
            CARUB_v10013011mg [Capsella rubella]
          Length = 791

 Score =  566 bits (1458), Expect = e-158
 Identities = 303/572 (52%), Positives = 404/572 (70%), Gaps = 8/572 (1%)
 Frame = -2

Query: 2070 KKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEI 1891
            ++ YTREEKGKG  +++    I I+  E ++  E ENL+ D  +   PDA+        +
Sbjct: 232  RRIYTREEKGKGIQVENVASPITIEICEEEM--EMENLINDG-NPPTPDASVPESSAMTV 288

Query: 1890 SAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQP--EDEDYIAPLPEPNE 1717
            +A + + Q           R       F+D AR++ASRFA++    E+E+ ++   E  +
Sbjct: 289  NAEQTQNQNGNQIGNGGRSRH------FRDIARRNASRFAYYDARMEEEEDLSDR-EGEQ 341

Query: 1716 EIEDWPGPFSTAMKIIRDRATKQNGRQMNSTS----GVTNP--APVIEWKPLNNEGQNRS 1555
            ++EDWPGPFSTAMKII+DR       + NSTS    GV+N   + +  W P  N      
Sbjct: 342  QVEDWPGPFSTAMKIIKDR-------EENSTSYFGIGVSNKEKSSLTIWVPRINFSVAPR 394

Query: 1554 RPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPS 1375
            +   PSL++LS+ IL KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL  GSP+
Sbjct: 395  K--APSLQELSLQILVKNADAITSLDYVPDALRVKLCQLLCDSRRMDVHFLDLLVRGSPT 452

Query: 1374 EICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVS 1195
            EICV DCSW+TEE+ T+ F  CDT+NL VLQLDQCGRCMPDY+L +TLARSP  L  L S
Sbjct: 453  EICVPDCSWLTEEQFTECFKNCDTSNLMVLQLDQCGRCMPDYVLHSTLARSPKQLPMLSS 512

Query: 1194 ISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQ 1015
            +SL G CR++D GL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI++CQ
Sbjct: 513  LSLSGACRLSDAGLKTLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYINECQ 572

Query: 1014 SIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSL 835
            SID   IL ALKK E LE+LS+AD+ +V   F+  F+T  G  +K+L+L N G+LTD+S+
Sbjct: 573  SIDVKRILSALKKFEKLEILSLADLPSVKGQFLKEFVTARGQTLKQLILTNSGRLTDSSI 632

Query: 834  KVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGE 655
            KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E +G 
Sbjct: 633  KVISENCPNLSVLDLANICKLTDSSLGYLANGCQALEKLIFCRNTFSDEAVAAFIETAGG 692

Query: 654  SLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLF 475
             L ELSLNNVK+VGHNTA+++A+ S  L  LD+SWCR+++D+ +G IVDNCSSL+VLK+F
Sbjct: 693  CLNELSLNNVKKVGHNTAVAIAKHSTKLQILDVSWCRDMSDDLLGYIVDNCSSLKVLKVF 752

Query: 474  GCTQITNTFFNGHSNPLVRIIGLKLTPILEHL 379
            GCTQ+T+ F NGHSNP V+I+GLK+ P L HL
Sbjct: 753  GCTQVTDVFVNGHSNPTVKILGLKMVPFLGHL 784


>gb|EXB38942.1| hypothetical protein L484_027377 [Morus notabilis]
          Length = 775

 Score =  564 bits (1453), Expect = e-158
 Identities = 340/676 (50%), Positives = 438/676 (64%), Gaps = 7/676 (1%)
 Frame = -2

Query: 2391 KGKGKRKLEF-DLNSPPLDLGISDGNDKGFFNLRSGARISKRKIEGIRVDSNGIDNVGE- 2218
            + KGKRKL   D   P L+         G  +LRSG R+SKR  +GI     G   VGE 
Sbjct: 137  EAKGKRKLGVVDGYLPSLECSEDGEGGIGVLSLRSGKRVSKRGNDGIE----GGRQVGEF 192

Query: 2217 -KLAXXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVNPN-TIADEMGPSNVKKRYTREEK 2044
             K+                +F                    IADE G +       R+ K
Sbjct: 193  GKIGEDKGKAILDSEEASGEFRIPKISKGKRKISDSGEEEVIADENGDN-------RKRK 245

Query: 2043 GKGRLIKDTWLSIGID-PVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEISAVEHERQ 1867
            GKG L++D  L    +  VE+ L+ E EN   D V                   V +E Q
Sbjct: 246  GKGLLVEDDGLVSNSNLDVEIRLETEVENNSGDNV-------------------VSNEGQ 286

Query: 1866 IAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDEDYIAPLPEPNEE--IEDWPGP 1693
                 VR + M R      F+D AR++A RFAHF  E+ED   P  E ++E  IEDWPGP
Sbjct: 287  -----VRNEFMER------FRDIARRNAYRFAHFDGEEEDN-EPHSEVDDEPDIEDWPGP 334

Query: 1692 FSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQNRSRPLVPSLRDLSMNI 1513
            FSTA+KIIRDR  K+N +  NS+S    PA V+ W P +N+    S+ +VPSL++LS+  
Sbjct: 335  FSTALKIIRDRE-KKNQQPGNSSSREKKPADVV-WFPKSNQDCKWSKNVVPSLQELSLRC 392

Query: 1512 LAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEE 1333
            LA NA+ + SL   PD ++++LSQLLCDSRRM+ H   LL  GSP+E+CVKDCSW+TEEE
Sbjct: 393  LANNADKLVSLDYFPDCLKHRLSQLLCDSRRMNAHVFKLLLQGSPTEVCVKDCSWLTEEE 452

Query: 1332 ATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGL 1153
             TK F   D +NL VLQL  CGRC+PD++L +TLA + +SL  L ++S+RG CR++D+GL
Sbjct: 453  FTKCFQNFDPSNLMVLQLGFCGRCLPDFLLCSTLACAENSLPVLTTLSVRGACRLSDIGL 512

Query: 1152 NALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKL 973
             +LV+SAPALRSLNL +CSLLTS  I+ +A+SLG +LRELY+D C SID ML LPALKKL
Sbjct: 513  KSLVSSAPALRSLNLTECSLLTSSSIDTLANSLGLILRELYLDQCLSIDVMLTLPALKKL 572

Query: 972  EHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALD 793
            E LEVLS+A I TVCD FI  FI++ G NMKEL+LA+C  LTD+SLK+IAE C GL A+D
Sbjct: 573  EQLEVLSLAGIATVCDKFIREFISIRGHNMKELILADCVNLTDSSLKIIAEKCPGLRAVD 632

Query: 792  LVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVG 613
            L NL KLTD ++G+LAN C++IQ L L R+ FSD+SIAAFLE SGE L+ELSLN+V++VG
Sbjct: 633  LSNLRKLTDSSLGYLANCCRAIQRLILSRDLFSDKSIAAFLETSGECLEELSLNSVRKVG 692

Query: 612  HNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHS 433
             +TA+S+AR  + L SL+LS+CR L D A+G IVD+C SLRVLK+FGCTQ+T+ F NGHS
Sbjct: 693  CHTALSIARRLRVLRSLNLSFCRGLTDNALGFIVDSCLSLRVLKIFGCTQVTSVFVNGHS 752

Query: 432  NPLVRIIGLKLTPILE 385
            NP V+IIGL + P+LE
Sbjct: 753  NPDVKIIGLPMCPVLE 768


>ref|XP_007012421.1| Rad7, putative isoform 1 [Theobroma cacao]
            gi|508782784|gb|EOY30040.1| Rad7, putative isoform 1
            [Theobroma cacao]
          Length = 714

 Score =  560 bits (1443), Expect = e-156
 Identities = 314/614 (51%), Positives = 406/614 (66%), Gaps = 32/614 (5%)
 Frame = -2

Query: 2088 MGPSNVKKRYTREEKGKGRLIK------------DTWLS-IGIDPVELDLKP--EEENLL 1954
            +G  + K+R++ EEKGK +L              D  L+ IGID       P  E E   
Sbjct: 101  VGSPSKKRRFSVEEKGKAKLDGFDEEEEKLNLDLDLGLTQIGIDKAISSFGPPIEAEEQK 160

Query: 1953 EDAVSGLIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRF 1774
            +  V  L        + L  +  ++++R        +   R+R +  R  + AR+ A R 
Sbjct: 161  DTEVEFLGSTNTLNTIDLV-VGEIDYKRNDETEEFYVS--RKREESRRHHEIARKFAQRL 217

Query: 1773 AHFQPEDEDYIAPLPEPNEE----------------IEDWPGPFSTAMKIIRDRATKQNG 1642
            AH    + D +    + N++                 ED   PF  A+++I+ R +    
Sbjct: 218  AHEVDSEGDLLKSFSKTNKDGALKNVVVVVDDDDDKAEDSESPFGMALEMIKTRNSSSTD 277

Query: 1641 RQMNSTSGVTNPAPVIEWKPLNNEGQNRSRPL-VPSLRDLSMNILAKNAEAITSLKDVPD 1465
            ++  S  G+       +W P N +G + S    VPSL DLS+  LAKNAEA+ SL+ VPD
Sbjct: 278  KKKYSRGGLEAE---FKWVPKNYKGSSISMARDVPSLLDLSLRALAKNAEAMVSLEHVPD 334

Query: 1464 GMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVL 1285
             +R+KLSQL+CD+R+MD HF++LL  GSP+EI V DCS +TE+E TK+FG CDT NL VL
Sbjct: 335  VLRHKLSQLVCDNRKMDAHFLELLVRGSPTEIRVNDCSGVTEDEFTKMFGCCDTKNLIVL 394

Query: 1284 QLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLG 1105
            QLD CG C+PDY+L+ TLA S +SL ALV++SL G  R++D GLN L  SAPAL+S+NL 
Sbjct: 395  QLDLCGSCLPDYVLQGTLAHSSNSLPALVTLSLDGAYRLSDKGLNLLALSAPALQSINLS 454

Query: 1104 QCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCD 925
            QCSLLTS GINN+A    + LRELY+D+CQ+I AM++LPALKKL+ LEVLS+A IQTVCD
Sbjct: 455  QCSLLTSAGINNLASCFESTLRELYLDECQNIQAMVVLPALKKLKCLEVLSLAGIQTVCD 514

Query: 924  AFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLA 745
             F+ G +  CG NMKELVLANC +LTD SLK + ++CS LCALDL  LH LTD ++ +LA
Sbjct: 515  DFVVGMVEACGKNMKELVLANCVELTDISLKFVGKNCSRLCALDLSYLHNLTDSSMRYLA 574

Query: 744  NGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLS 565
            NGC+SI  LKLCRN FSDE+IAAFLEASG SL ELSLNN+  VG NTA+SL++CS+ L S
Sbjct: 575  NGCRSITKLKLCRNGFSDEAIAAFLEASGGSLTELSLNNIISVGLNTALSLSKCSRKLFS 634

Query: 564  LDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILE 385
            LDLSWCRNL DEA+GLIVD+C  LR+LKLFGCTQIT  F  GHSN  V+IIGLK+T IL+
Sbjct: 635  LDLSWCRNLTDEALGLIVDSCLLLRLLKLFGCTQITEVFLGGHSNAQVQIIGLKMTTILK 694

Query: 384  HLNVLDRPEAPLRY 343
            HLN+L+  EAPLRY
Sbjct: 695  HLNMLEPQEAPLRY 708


>dbj|BAD43070.1| hypothetical protein [Arabidopsis thaliana]
            gi|62318624|dbj|BAD95072.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 762

 Score =  559 bits (1440), Expect = e-156
 Identities = 300/576 (52%), Positives = 402/576 (69%), Gaps = 3/576 (0%)
 Frame = -2

Query: 2094 DEMGPSNVKKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAA 1918
            ++   S  ++RYTREEKGKG + ++D    + ++ VE+D   EEE  +E  V+   P   
Sbjct: 196  EKQSSSMGRRRYTREEKGKGIMQVEDVSSPVTVEIVEVD---EEEMEIESLVNSEKPPDV 252

Query: 1917 SGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDY 1744
            S    + E++A     + A+ R     +        F+D A + A RFAHF  Q E+E+ 
Sbjct: 253  S----VTELAATMANVEQAQNRENSNEIGNDSRTQHFRDIAERIAHRFAHFDAQVEEEED 308

Query: 1743 IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQ 1564
            ++   E  +++EDWPGPFSTAMKII+DR            S     +P I W P +N   
Sbjct: 309  LSD-KEGEQQVEDWPGPFSTAMKIIKDREEYTTPHVGIGVSNKERSSPTI-WVPRSNFSF 366

Query: 1563 NRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCG 1384
               +   PSL++LS+ +L KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL  G
Sbjct: 367  PPRK--APSLQELSLRVLVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDLHFLDLLVQG 424

Query: 1383 SPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAA 1204
            SP+EICV DCSW+TEEE T+ F  CDT+NL VLQLDQCGRCMPDYIL  TLARSP  L  
Sbjct: 425  SPTEICVPDCSWLTEEEFTECFKNCDTSNLMVLQLDQCGRCMPDYILPFTLARSPKVLPM 484

Query: 1203 LVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYID 1024
            L ++S+ G CR++DVGL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI+
Sbjct: 485  LSTLSISGACRLSDVGLRQLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYIN 544

Query: 1023 DCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTD 844
            +CQ+ID   IL ALKK E LEVLS+AD+ +V   F+  F+T  G  +K+L+L N  KL+D
Sbjct: 545  ECQNIDMKHILAALKKFEKLEVLSLADLPSVKGRFLKEFVTARGQTLKQLILTNSRKLSD 604

Query: 843  TSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEA 664
            +S+KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E 
Sbjct: 605  SSIKVISENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNPFSDEAVAAFVET 664

Query: 663  SGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVL 484
            +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G IVDN SSL+VL
Sbjct: 665  AGGSLKELSLNNVKKVGHNTALALAKHSDKLQILDISWCREMSNDLLGYIVDNSSSLKVL 724

Query: 483  KLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLN 376
            K+FGC+Q+T+ F  GHSNP V+I+G+K+ P L HL+
Sbjct: 725  KVFGCSQVTDVFVKGHSNPNVKILGVKMDPFLGHLS 760


>ref|NP_178661.2| uncharacterized protein [Arabidopsis thaliana]
            gi|330250903|gb|AEC05997.1| uncharacterized protein
            AT2G06040 [Arabidopsis thaliana]
          Length = 762

 Score =  559 bits (1440), Expect = e-156
 Identities = 300/576 (52%), Positives = 402/576 (69%), Gaps = 3/576 (0%)
 Frame = -2

Query: 2094 DEMGPSNVKKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAA 1918
            ++   S  ++RYTREEKGKG + ++D    + ++ VE+D   EEE  +E  V+   P   
Sbjct: 196  EKQSSSMGRRRYTREEKGKGIMQVEDVSSPVTVEIVEVD---EEEMEIESLVNSEKPPDV 252

Query: 1917 SGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDY 1744
            S    + E++A     + A+ R     +        F+D A + A RFAHF  Q E+E+ 
Sbjct: 253  S----VTELAATMANVEQAQNRENSNEIGNDSRTQHFRDIAERIAHRFAHFDAQVEEEED 308

Query: 1743 IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQ 1564
            ++   E  +++EDWPGPFSTAMKII+DR            S     +P I W P +N   
Sbjct: 309  LSD-KEGEQQVEDWPGPFSTAMKIIKDREEYTTPHVGIGVSNKERSSPTI-WVPRSNFSF 366

Query: 1563 NRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCG 1384
               +   PSL++LS+ +L KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL  G
Sbjct: 367  PPRK--APSLQELSLRVLVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDLHFLDLLVQG 424

Query: 1383 SPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAA 1204
            SP+EICV DCSW+TEEE T+ F  CDT+NL VLQLDQCGRCMPDYIL  TLARSP  L  
Sbjct: 425  SPTEICVPDCSWLTEEEFTECFKNCDTSNLMVLQLDQCGRCMPDYILPFTLARSPKVLPM 484

Query: 1203 LVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYID 1024
            L ++S+ G CR++DVGL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI+
Sbjct: 485  LSTLSISGACRLSDVGLRQLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYIN 544

Query: 1023 DCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTD 844
            +CQ+ID   IL ALKK E LEVLS+AD+ +V   F+  F+T  G  +K+L+L N  KL+D
Sbjct: 545  ECQNIDMKHILAALKKFEKLEVLSLADLPSVKGRFLKEFVTARGQTLKQLILTNSRKLSD 604

Query: 843  TSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEA 664
            +S+KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E 
Sbjct: 605  SSIKVISENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNPFSDEAVAAFVET 664

Query: 663  SGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVL 484
            +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G IVDN SSL+VL
Sbjct: 665  AGGSLKELSLNNVKKVGHNTALALAKHSDKLQILDISWCREMSNDLLGYIVDNSSSLKVL 724

Query: 483  KLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLN 376
            K+FGC+Q+T+ F  GHSNP V+I+G+K+ P L HL+
Sbjct: 725  KVFGCSQVTDVFVKGHSNPNVKILGVKMDPFLGHLS 760


>dbj|BAF01173.1| hypothetical protein [Arabidopsis thaliana]
          Length = 762

 Score =  557 bits (1436), Expect = e-156
 Identities = 299/576 (51%), Positives = 402/576 (69%), Gaps = 3/576 (0%)
 Frame = -2

Query: 2094 DEMGPSNVKKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAA 1918
            ++   S  ++RYTREEKGKG + ++D    + ++ VE+D   EEE  +E  V+   P   
Sbjct: 196  EKQSSSMGRRRYTREEKGKGIMQVEDVSSPVTVEIVEVD---EEEMEIESLVNSEKPPDV 252

Query: 1917 SGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDY 1744
            S    + E++A     + A+ R     +        F+D A + A RFAHF  Q E+E+ 
Sbjct: 253  S----VTELAATMANVEQAQNRENSNEIGNDSRTQHFRDIAERIAHRFAHFDAQVEEEED 308

Query: 1743 IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQ 1564
            ++   E  +++EDWPGPFSTAMKII+DR            S     +P I W P +N   
Sbjct: 309  LSD-KEGEQQVEDWPGPFSTAMKIIKDREEYTTPHVGIGVSNKERSSPTI-WVPRSNFSF 366

Query: 1563 NRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCG 1384
               +   PSL++LS+ +L KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL  G
Sbjct: 367  PPRK--APSLQELSLRVLVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDLHFLDLLVQG 424

Query: 1383 SPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAA 1204
            SP+EICV DCSW+TEEE T+ F  CDT+NL VLQLDQCGRCMPDYIL  TLARSP  L  
Sbjct: 425  SPTEICVPDCSWLTEEEFTECFKNCDTSNLMVLQLDQCGRCMPDYILPFTLARSPKVLPM 484

Query: 1203 LVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYID 1024
            L ++S+ G CR++DVGL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI+
Sbjct: 485  LSTLSISGACRLSDVGLRQLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYIN 544

Query: 1023 DCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTD 844
            +CQ+ID   IL AL+K E LEVLS+AD+ +V   F+  F+T  G  +K+L+L N  KL+D
Sbjct: 545  ECQNIDMKHILAALEKFEKLEVLSLADLPSVKGRFLKEFVTARGQTLKQLILTNSRKLSD 604

Query: 843  TSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEA 664
            +S+KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E 
Sbjct: 605  SSIKVISENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNPFSDEAVAAFVET 664

Query: 663  SGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVL 484
            +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G IVDN SSL+VL
Sbjct: 665  AGGSLKELSLNNVKKVGHNTALALAKHSDKLQILDISWCREMSNDLLGYIVDNSSSLKVL 724

Query: 483  KLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLN 376
            K+FGC+Q+T+ F  GHSNP V+I+G+K+ P L HL+
Sbjct: 725  KVFGCSQVTDVFVKGHSNPNVKILGVKMDPFLGHLS 760


>ref|XP_002309465.2| hypothetical protein POPTR_0006s23720g [Populus trichocarpa]
            gi|550336952|gb|EEE92988.2| hypothetical protein
            POPTR_0006s23720g [Populus trichocarpa]
          Length = 679

 Score =  556 bits (1433), Expect = e-155
 Identities = 309/608 (50%), Positives = 406/608 (66%), Gaps = 29/608 (4%)
 Frame = -2

Query: 2079 SNVKKRYTREEKGKGRLIKDTWLSIGI---------DPVE--LDLKPEEENLLEDA--VS 1939
            S+ + RYT EEKGK ++  +  L   +         DPVE  +D  P E  LL     + 
Sbjct: 71   SSKRLRYTTEEKGKAKVDCEVNLDFDLNLDLWGFEKDPVEGKMDTWPFEAGLLSSGPVMH 130

Query: 1938 GLIPDAASGLVQLQEISA------VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASR 1777
               PD+     Q++           E  ++IA + VR +  RR+  +   ++ AR  A R
Sbjct: 131  NFFPDSVERNTQVENYDVPRKDIVFEQRKEIALSSVRKRQSRRKEQKLMQREIARNVAPR 190

Query: 1776 FAHFQPEDE------DYIAPLPEPNEEIE----DWPGPFSTAMKIIRDRATKQNGRQMNS 1627
            FAH  P+++      +    L E + E+E    D   PFS A++ I+ R T + G    S
Sbjct: 191  FAHLGPQEQQMKQHKEKKVKLREVDLEMELDLDDSQSPFSLALEAIKMRQTVRKG----S 246

Query: 1626 TSGVTNPAPVIEWKPLNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKL 1447
             +G +    + +W P   +  +  +  VP+L DLS+N LAKNA+AI SL+ VPD +R++L
Sbjct: 247  LTGFSES--LFKWVPAKAKDCDALKRDVPTLLDLSLNALAKNADAIVSLEHVPDKLRHRL 304

Query: 1446 SQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCG 1267
            SQL+ D   +D HFV+LL  GSP+EI +++ S +TEEE +KIF  CDT +LTVLQLD CG
Sbjct: 305  SQLVSDCGVVDAHFVELLARGSPTEIRLRNISRLTEEEFSKIFSVCDTKDLTVLQLDLCG 364

Query: 1266 RCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLT 1087
            RCMPDYIL  TLARS   L +L +ISL+G  R++D+GL  L  SAPAL+S+NL QCSLLT
Sbjct: 365  RCMPDYILNGTLARSSHRLPSLATISLKGAHRLSDIGLTQLAVSAPALQSINLSQCSLLT 424

Query: 1086 SIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGF 907
            S GI++      + LRELYIDDCQ+IDA +ILPALKKL+ LEVLSVA I+TVCD F+ G 
Sbjct: 425  SQGISDFVSCFESTLRELYIDDCQNIDATIILPALKKLKCLEVLSVAGIETVCDNFVIGL 484

Query: 906  ITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSI 727
            +   G NMKEL  ANC +LTD SL+++ ++C  LCALDL  LH LTD A+ HLANGCQSI
Sbjct: 485  VKALGINMKELGFANCVQLTDISLRIVGKNCPNLCALDLSYLHNLTDSALKHLANGCQSI 544

Query: 726  QTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWC 547
            + LKL RN FSDE+I+AFLE SG+SL  LS+NN+ RV HNTA+S+A+CS+NL+SLDLSWC
Sbjct: 545  RRLKLHRNDFSDEAISAFLEVSGQSLDALSVNNIHRVAHNTALSIAKCSRNLVSLDLSWC 604

Query: 546  RNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLD 367
            R L DEA+G+IVD+C SL++LKLFGCTQIT  F NGHSNP+VRIIG K  P+LEHL+ L+
Sbjct: 605  RRLTDEALGMIVDSCLSLKLLKLFGCTQITEAFLNGHSNPMVRIIGCKTGPVLEHLDALE 664

Query: 366  RPEAPLRY 343
              E PLRY
Sbjct: 665  PQENPLRY 672


>gb|EYU28335.1| hypothetical protein MIMGU_mgv1a001385mg [Mimulus guttatus]
          Length = 827

 Score =  556 bits (1432), Expect = e-155
 Identities = 299/585 (51%), Positives = 410/585 (70%), Gaps = 8/585 (1%)
 Frame = -2

Query: 2064 RYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEISA 1885
            R ++EEKGK ++      S G +  EL ++   ++ +   +     ++  G  Q++E   
Sbjct: 262  RLSKEEKGKLKIEIKAASSSGTNTSELIIQNVSDSSVSATLHAAANESLPGGSQVREADV 321

Query: 1884 VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDE--------DYIAPLP 1729
              ++               R+ R RF++ AR++ASRFAHF P +E            P+P
Sbjct: 322  NGNDAG-------------RVHRERFRNFARRNASRFAHFSPHEEFGNNAPVGGIQIPVP 368

Query: 1728 EPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNNEGQNRSRP 1549
            E +  +EDWPGPFSTA+KII D   K+ G   + +  V      ++W P   E   +S+ 
Sbjct: 369  EADNGLEDWPGPFSTAIKIINDG--KRRGASTDKSGAVE-----LKWIPKMQE-LRKSQK 420

Query: 1548 LVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEI 1369
             VPSL++L ++ILAKNA+AITSL  VPD +R+K+   LCD+R+MD HF++LL  GSP+EI
Sbjct: 421  HVPSLQELCLSILAKNADAITSLDFVPDVLRHKICWFLCDNRKMDSHFLELLVHGSPTEI 480

Query: 1368 CVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSIS 1189
             V+DCSW++EE  TK F  C+ + LTV Q DQ G C+PDY L ATLARS +SL AL ++S
Sbjct: 481  RVRDCSWLSEELFTKTFEGCNASKLTVFQFDQGGACLPDYTLYATLARSTNSLPALTTVS 540

Query: 1188 LRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSI 1009
            L+G  R++D GLN LV++A +L+S+++ QC +LTS GI ++A+SL  VLRELYID+C  I
Sbjct: 541  LKGAYRLSDAGLNTLVSAAHSLKSIDISQCPMLTSDGICSLANSLQLVLRELYIDNCHGI 600

Query: 1008 DAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKV 829
            DAM ILPAL KLE+LEVLS+A IQTVCD F+  F+++ G  MKELVLA+C +LTD+S+KV
Sbjct: 601  DAMSILPALLKLENLEVLSLAGIQTVCDDFVSKFVSIHGCRMKELVLADCIELTDSSIKV 660

Query: 828  IAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESL 649
            I ++CS L A+DL NL KLTD++IGHLANGC++IQ LK CRNAFSDE+IAA+L+  G  L
Sbjct: 661  IGDTCSKLRAIDLSNLCKLTDISIGHLANGCRAIQMLKFCRNAFSDEAIAAYLDVRGALL 720

Query: 648  KELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGC 469
             +LSLNNV +V ++TA+SLAR  +NL SLDLSWCRNL +EA+GL+VD+CSSL VLKLFGC
Sbjct: 721  NDLSLNNVIQVSNHTALSLARNCRNLRSLDLSWCRNLTNEALGLVVDSCSSLEVLKLFGC 780

Query: 468  TQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 334
            TQ+TN F +GHSN  V++IGLK+TP+ +H++V D    PLRY S+
Sbjct: 781  TQVTNVFLDGHSNSEVKLIGLKMTPVFKHIDVPDFLLGPLRYSSI 825


>ref|XP_007012423.1| Rad7, putative isoform 3 [Theobroma cacao]
            gi|508782786|gb|EOY30042.1| Rad7, putative isoform 3
            [Theobroma cacao]
          Length = 715

 Score =  556 bits (1432), Expect = e-155
 Identities = 314/615 (51%), Positives = 406/615 (66%), Gaps = 33/615 (5%)
 Frame = -2

Query: 2088 MGPSNVKKRYTREEKGKGRLIK------------DTWLS-IGIDPVELDLKP--EEENLL 1954
            +G  + K+R++ EEKGK +L              D  L+ IGID       P  E E   
Sbjct: 101  VGSPSKKRRFSVEEKGKAKLDGFDEEEEKLNLDLDLGLTQIGIDKAISSFGPPIEAEEQK 160

Query: 1953 EDAVSGLIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRF 1774
            +  V  L        + L  +  ++++R        +   R+R +  R  + AR+ A R 
Sbjct: 161  DTEVEFLGSTNTLNTIDLV-VGEIDYKRNDETEEFYVS--RKREESRRHHEIARKFAQRL 217

Query: 1773 AHFQPEDEDYIAPLPEPNEE----------------IEDWPGPFSTAMKIIRDRATKQNG 1642
            AH    + D +    + N++                 ED   PF  A+++I+ R +    
Sbjct: 218  AHEVDSEGDLLKSFSKTNKDGALKNVVVVVDDDDDKAEDSESPFGMALEMIKTRNSSSTD 277

Query: 1641 RQMNSTSGVTNPAPVIEWKPLNNEGQNRSRPL-VPSLRDLSMNILAKNAEAITSLKDVPD 1465
            ++  S  G+       +W P N +G + S    VPSL DLS+  LAKNAEA+ SL+ VPD
Sbjct: 278  KKKYSRGGLEAE---FKWVPKNYKGSSISMARDVPSLLDLSLRALAKNAEAMVSLEHVPD 334

Query: 1464 GMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVL 1285
             +R+KLSQL+CD+R+MD HF++LL  GSP+EI V DCS +TE+E TK+FG CDT NL VL
Sbjct: 335  VLRHKLSQLVCDNRKMDAHFLELLVRGSPTEIRVNDCSGVTEDEFTKMFGCCDTKNLIVL 394

Query: 1284 QLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLG 1105
            QLD CG C+PDY+L+ TLA S +SL ALV++SL G  R++D GLN L  SAPAL+S+NL 
Sbjct: 395  QLDLCGSCLPDYVLQGTLAHSSNSLPALVTLSLDGAYRLSDKGLNLLALSAPALQSINLS 454

Query: 1104 QCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCD 925
            QCSLLTS GINN+A    + LRELY+D+CQ+I AM++LPALKKL+ LEVLS+A IQTVCD
Sbjct: 455  QCSLLTSAGINNLASCFESTLRELYLDECQNIQAMVVLPALKKLKCLEVLSLAGIQTVCD 514

Query: 924  AFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLA 745
             F+ G +  CG NMKELVLANC +LTD SLK + ++CS LCALDL  LH LTD ++ +LA
Sbjct: 515  DFVVGMVEACGKNMKELVLANCVELTDISLKFVGKNCSRLCALDLSYLHNLTDSSMRYLA 574

Query: 744  NGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNN-VKRVGHNTAISLARCSKNLL 568
            NGC+SI  LKLCRN FSDE+IAAFLEASG SL ELSLNN +  VG NTA+SL++CS+ L 
Sbjct: 575  NGCRSITKLKLCRNGFSDEAIAAFLEASGGSLTELSLNNIISVVGLNTALSLSKCSRKLF 634

Query: 567  SLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPIL 388
            SLDLSWCRNL DEA+GLIVD+C  LR+LKLFGCTQIT  F  GHSN  V+IIGLK+T IL
Sbjct: 635  SLDLSWCRNLTDEALGLIVDSCLLLRLLKLFGCTQITEVFLGGHSNAQVQIIGLKMTTIL 694

Query: 387  EHLNVLDRPEAPLRY 343
            +HLN+L+  EAPLRY
Sbjct: 695  KHLNMLEPQEAPLRY 709


>ref|XP_003551123.1| PREDICTED: DNA repair protein rhp7-like [Glycine max]
          Length = 675

 Score =  553 bits (1426), Expect = e-154
 Identities = 283/506 (55%), Positives = 369/506 (72%), Gaps = 3/506 (0%)
 Frame = -2

Query: 1851 VRMKAMRRRLDRGRFQDAARQSASRFAHFQPED--EDYIAPLPEPNEEIEDWPGPFSTAM 1678
            V M   RR  +  RF+  A+++A+ +A F   +  ++  +    P   I+D   PFS AM
Sbjct: 176  VPMSFYRRNRNMERFRVIAKRNATHYARFDDSEVGDEGTSLYLNPQGNIDDSETPFSIAM 235

Query: 1677 KIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPLNN-EGQNRSRPLVPSLRDLSMNILAKN 1501
            K I+DRA K+                   W P  N +G  +   LVPSL++L + ILA N
Sbjct: 236  KAIKDRAMKKKVCDA--------------WVPKRNPQGGEKRFFLVPSLQELCLEILANN 281

Query: 1500 AEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGCGSPSEICVKDCSWMTEEEATKI 1321
            A+A+ SL+ VPD +R KLS+LLCDSR+M+  F++LL  GSP+EI +KDCSW+TEE+  K 
Sbjct: 282  ADAMVSLEGVPDELRRKLSKLLCDSRKMNSRFLELLLSGSPTEIRIKDCSWLTEEQFAKS 341

Query: 1320 FGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALV 1141
            F  CDT  L VLQLDQCGRC+PDY L  TL +SP  L  L+++SL G CR++D GL+ LV
Sbjct: 342  FQTCDTTRLEVLQLDQCGRCIPDYALLGTLRQSPRWLPKLITLSLSGACRLSDKGLHVLV 401

Query: 1140 ASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLE 961
            +SAPALRS+NL QCSLL+S  IN +ADSLG++L+ELY+DDC  IDA  I+P LKKLEHLE
Sbjct: 402  SSAPALRSINLSQCSLLSSASINILADSLGSLLKELYLDDCLMIDAAQIVPGLKKLEHLE 461

Query: 960  VLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNL 781
            VLS+A IQTV D FI  +I  CG NMKEL+  +C KLTD S+KVIAE C GLCALDL+NL
Sbjct: 462  VLSLAGIQTVSDEFIKNYIIACGHNMKELIFKDCRKLTDASIKVIAEHCPGLCALDLMNL 521

Query: 780  HKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTA 601
             KLTD+++G+L N CQ+++TLKLCRN FSDE+IAAFLE +GESLKELSLNN+K+VGH+T 
Sbjct: 522  DKLTDLSLGYLTNSCQALRTLKLCRNLFSDEAIAAFLEITGESLKELSLNNIKKVGHHTT 581

Query: 600  ISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLV 421
            ISLAR +KNL +LDLSWCRNL D  +G IVD+C SLR+LKLFGC+ +T+ F NGHSNP +
Sbjct: 582  ISLARHAKNLHTLDLSWCRNLTDNELGFIVDSCFSLRLLKLFGCSLVTDVFLNGHSNPEI 641

Query: 420  RIIGLKLTPILEHLNVLDRPEAPLRY 343
            +I+GLK++P+L+++ V +  + PLRY
Sbjct: 642  QILGLKMSPLLQNVKVPEPYQGPLRY 667

