BLASTX nr result

ID: Akebia25_contig00010895 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00010895
         (2864 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007204292.1| hypothetical protein PRUPE_ppa001096mg [Prun...   697   0.0  
emb|CBI27815.3| unnamed protein product [Vitis vinifera]              660   0.0  
ref|XP_002278147.1| PREDICTED: uncharacterized protein LOC100244...   651   0.0  
ref|XP_006453195.1| hypothetical protein CICLE_v10007604mg [Citr...   637   e-180
ref|XP_004288980.1| PREDICTED: uncharacterized protein LOC101312...   630   e-178
ref|XP_007012655.1| DNA repair protein rhp7, putative [Theobroma...   627   e-177
ref|XP_002516283.1| rad7, putative [Ricinus communis] gi|2235447...   616   e-173
ref|XP_003546506.2| PREDICTED: uncharacterized protein LOC100808...   602   e-169
ref|XP_004243936.1| PREDICTED: DNA repair protein RAD7-like [Sol...   585   e-164
ref|XP_007138661.1| hypothetical protein PHAVU_009G227500g, part...   581   e-163
ref|XP_006297020.1| hypothetical protein CARUB_v10013011mg [Caps...   569   e-159
gb|EXB38942.1| hypothetical protein L484_027377 [Morus notabilis]     566   e-158
ref|XP_007012421.1| Rad7, putative isoform 1 [Theobroma cacao] g...   564   e-158
dbj|BAD43070.1| hypothetical protein [Arabidopsis thaliana] gi|6...   561   e-157
ref|NP_178661.2| uncharacterized protein [Arabidopsis thaliana] ...   561   e-157
ref|XP_002309465.2| hypothetical protein POPTR_0006s23720g [Popu...   560   e-156
ref|XP_007012423.1| Rad7, putative isoform 3 [Theobroma cacao] g...   560   e-156
dbj|BAF01173.1| hypothetical protein [Arabidopsis thaliana]           559   e-156
gb|EYU28335.1| hypothetical protein MIMGU_mgv1a001385mg [Mimulus...   558   e-156
ref|XP_002885812.1| hypothetical protein ARALYDRAFT_319345 [Arab...   556   e-155

>ref|XP_007204292.1| hypothetical protein PRUPE_ppa001096mg [Prunus persica]
            gi|462399823|gb|EMJ05491.1| hypothetical protein
            PRUPE_ppa001096mg [Prunus persica]
          Length = 910

 Score =  697 bits (1800), Expect = 0.0
 Identities = 380/714 (53%), Positives = 490/714 (68%), Gaps = 25/714 (3%)
 Frame = -3

Query: 2379 SSTKGKGKRKLEFDLNSPPLDLGISDGSDKGFFNLRSGARISKRKIEG---------IRV 2227
            S  + K KRKLE D+N P L+    DG  KGF +LRSG ++SKR + G         +  
Sbjct: 208  SKEEVKKKRKLEIDINFPALEWEGEDGRSKGFLSLRSGKKVSKRGLGGGHNGALVIDLDA 267

Query: 2226 DSNGIDNVGEKL--------------AXXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVN 2089
            D NG   +GE                +               +               + 
Sbjct: 268  DENGKGKLGESGFAFNGVDVVELDSDSEEERSSENLVQSSSPRGKRKLSDAIEGVAEDLK 327

Query: 2088 PNTIADEMGPSNVKKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLI 1909
               +A E G  N ++RY+ EEKGKG+LI +  L  G D  EL LK E        V   +
Sbjct: 328  DEVMASENGIDNGRRRYSIEEKGKGKLIGEVVLMNGNDEAELGLKSE--------VLSSV 379

Query: 1908 PDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPE 1735
             + A+  ++ +E +A+  E Q+  +  R  A        RF+D AR++ASRFAHF  + E
Sbjct: 380  ENVAASPIRKRENAALPDESQLINSNTRENAASGNQYMERFRDIARRNASRFAHFASEEE 439

Query: 1734 DEDYIAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSN 1555
            +E+ + P  E  ++IEDWPGPFSTAMKII+DRA K    Q+ S    T P P +EW P +
Sbjct: 440  EENQLPPQVEVAQDIEDWPGPFSTAMKIIKDRAAKN--AQLPSKDQ-TKP-PFVEWVPKS 495

Query: 1554 NEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDL 1375
             + +  S+ L+PSL+DL ++ LAKNA+AI SL+ V D +R++L Q+LCDSR+M+ HF +L
Sbjct: 496  FQDRPLSKNLIPSLQDLCLSFLAKNADAIVSLEHVADALRHRLCQMLCDSRKMNSHFFEL 555

Query: 1374 LGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPS 1195
            L +G P+E+ ++DCSWMTEE+ TK F   DT+NLTVLQLDQCGRC+ DYIL +TLARS +
Sbjct: 556  LVQGLPTEVRLRDCSWMTEEQFTKSFQQWDTSNLTVLQLDQCGRCVADYILHSTLARSSN 615

Query: 1194 SLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRE 1015
             L AL ++SL G CR++DVGL ALV+SAPALRSLNL QCSLLTS  I  +ADSLG+VLRE
Sbjct: 616  CLPALTTLSLSGACRLSDVGLGALVSSAPALRSLNLSQCSLLTSSSIGTLADSLGSVLRE 675

Query: 1014 LYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCG 835
            LY++DCQ IDA+LILPALKKLEHLEVL +  ++ VCD FI  F+T  G ++KELVL +CG
Sbjct: 676  LYLNDCQGIDALLILPALKKLEHLEVLWLGGLENVCDDFIKEFVTARGQSLKELVLTDCG 735

Query: 834  KLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAA 655
            KLTD+S+KVIAE+C+GLCALDLVNL+KLTD+ +G+LANGC+ IQTLKLCRNAFSDE+IAA
Sbjct: 736  KLTDSSVKVIAETCTGLCALDLVNLYKLTDLTLGYLANGCREIQTLKLCRNAFSDEAIAA 795

Query: 654  FLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSS 475
            FLE SGE L ELSLNN+K+VG+NTAI+LA+ S+ L +LDLSWCRNL DEA+GLI D+C S
Sbjct: 796  FLETSGECLTELSLNNIKKVGYNTAIALAKRSRKLHTLDLSWCRNLTDEALGLIADSCLS 855

Query: 474  LRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 313
            LR+LKLFGCTQ+TNTF +GHSNP V+IIGLK++PILEH+ V D  E PLRY SV
Sbjct: 856  LRILKLFGCTQLTNTFLDGHSNPEVKIIGLKVSPILEHVKVSDPHEGPLRYSSV 909


>emb|CBI27815.3| unnamed protein product [Vitis vinifera]
          Length = 832

 Score =  660 bits (1703), Expect = 0.0
 Identities = 377/701 (53%), Positives = 470/701 (67%), Gaps = 10/701 (1%)
 Frame = -3

Query: 2400 RRSERLGSSTKGKGKRKLEFDLNSPPLDLGISDGSDKGFFNLRSGARISKRKIEGI-RVD 2224
            +RSE + +  + KGKRKL F+ +  PLD       DKGF  LRSG +I K  + G+ R++
Sbjct: 183  QRSEDMVTVKQSKGKRKLSFEAS--PLD-----DEDKGFLGLRSGKKIVKEIMCGVDRIE 235

Query: 2223 SNGIDNVGEKLAXXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVNPNTIADEMGPSNVKK 2044
            S+G   V E+                                 +  +  A+E G    ++
Sbjct: 236  SDGGKYVVEQ----------ERGGEDKGVKVQGHGNGEAAVEELQKDPSANENGSVRGRR 285

Query: 2043 RYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEISA 1864
            R+T EEKGKG+L++D      ID VELDL  E +N++++                  +SA
Sbjct: 286  RFTGEEKGKGKLVEDDEPQNRIDAVELDLNLELKNVIDN------------------MSA 327

Query: 1863 VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDE-----DYIAPLPEPN 1699
             E++   A+TR              F+D AR++ASRFAHF PE E        A +  P+
Sbjct: 328  DENDAVEARTR--------------FRDIARRNASRFAHFAPEQEMENHPSREAEIQRPS 373

Query: 1698 E----EIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQNRSR 1531
            E    E EDWPGPFSTAMKII+DR  KQN +Q NS+S    PA VI W P   +     +
Sbjct: 374  EGGEKENEDWPGPFSTAMKIIKDREKKQNTQQ-NSSSDRNRPAHVI-WSPRKVKSSECPK 431

Query: 1530 PLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPSE 1351
            PL PSL+++ + +LA+N +AITSL+ +PD +R+KLSQLLCDSRRM+ H ++LL  GSP E
Sbjct: 432  PLAPSLQEMCLEVLAQNGDAITSLESIPDALRHKLSQLLCDSRRMNSHILELLVSGSPFE 491

Query: 1350 ICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSI 1171
            +CV+DCSW+TEEE  +IF  CDTN+LTVLQLDQCGRCM DY+LRAT     + L AL ++
Sbjct: 492  VCVRDCSWLTEEEFARIFKRCDTNSLTVLQLDQCGRCMTDYVLRATFDMLSNGLPALTTV 551

Query: 1170 SLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQS 991
            SL+G CR++D GL ALV+SAP LRS+NL QCSLLTS  I N+A++LG+VLRELYIDDCQ 
Sbjct: 552  SLKGACRLSDAGLRALVSSAPMLRSINLSQCSLLTSASIKNLAETLGSVLRELYIDDCQG 611

Query: 990  IDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLK 811
            IDAMLIL AL+KLE LEVLSVA IQTVCD FI  FI+V G  MKELVL +C +LTD SLK
Sbjct: 612  IDAMLILSALEKLECLEVLSVAGIQTVCDDFIWEFISVHGPTMKELVLTDCSRLTDFSLK 671

Query: 810  VIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGES 631
             IAE+C  L ALDL NL KLTD A G+LA+GCQ++QTLKL  N+FSDE+IAAFLE SG S
Sbjct: 672  AIAETCPELRALDLGNLCKLTDSAFGYLASGCQAMQTLKLRCNSFSDEAIAAFLEISGGS 731

Query: 630  LKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFG 451
            LKELSLNNV ++GHNTAISLAR S+ L+ LDLSWCRNL D  +G IVD+C SLRVLKLFG
Sbjct: 732  LKELSLNNVSKIGHNTAISLARRSRELIRLDLSWCRNLTDGDLGFIVDSCLSLRVLKLFG 791

Query: 450  CTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPL 328
            CTQITN F +GHSNP V IIGLKLTPIL+HL + D    PL
Sbjct: 792  CTQITNMFVDGHSNPQVEIIGLKLTPILKHLKLTDPQSFPL 832


>ref|XP_002278147.1| PREDICTED: uncharacterized protein LOC100244043 [Vitis vinifera]
          Length = 905

 Score =  651 bits (1680), Expect = 0.0
 Identities = 353/595 (59%), Positives = 432/595 (72%), Gaps = 9/595 (1%)
 Frame = -3

Query: 2085 NTIADEMGPSNVKKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIP 1906
            N  ADE       +RY+REEKGKG LI D      ++PV+ +L+ E +N ++ AVS  I 
Sbjct: 324  NMSADENDAVEGGQRYSREEKGKGILINDDLAPNAVNPVDFNLESEVKNSVDTAVSESI- 382

Query: 1905 DAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDE- 1729
                   QL+    ++ + ++ +T V   A R R    RF+D AR++ASRFAHF PE E 
Sbjct: 383  -------QLEGNVGLQVQNEVIQTSVTGIASRART---RFRDIARRNASRFAHFAPEQEM 432

Query: 1728 ----DYIAPLPEPNE----EIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVI 1573
                   A +  P+E    E EDWPGPFSTAMKII+DR  KQN +Q NS+S    PA VI
Sbjct: 433  ENHPSREAEIQRPSEGGEKENEDWPGPFSTAMKIIKDREKKQNTQQ-NSSSDRNRPAHVI 491

Query: 1572 EWKPSNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMD 1393
             W P   +     +PL PSL+++ + +LA+N +AITSL+ +PD +R+KLSQLLCDSRRM+
Sbjct: 492  -WSPRKVKSSECPKPLAPSLQEMCLEVLAQNGDAITSLESIPDALRHKLSQLLCDSRRMN 550

Query: 1392 CHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRAT 1213
             H ++LL  GSP E+CV+DCSW+TEEE  +IF  CDTN+LTVLQLDQCGRCM DY+LRAT
Sbjct: 551  SHILELLVSGSPFEVCVRDCSWLTEEEFARIFKRCDTNSLTVLQLDQCGRCMTDYVLRAT 610

Query: 1212 LARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSL 1033
                 + L AL ++SL+G CR++D GL ALV+SAP LRS+NL QCSLLTS  I N+A++L
Sbjct: 611  FDMLSNGLPALTTVSLKGACRLSDAGLRALVSSAPMLRSINLSQCSLLTSASIKNLAETL 670

Query: 1032 GTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKEL 853
            G+VLRELYIDDCQ IDAMLIL AL+KLE LEVLSVA IQTVCD FI  FI+V G  MKEL
Sbjct: 671  GSVLRELYIDDCQGIDAMLILSALEKLECLEVLSVAGIQTVCDDFIWEFISVHGPTMKEL 730

Query: 852  VLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFS 673
            VL +C +LTD SLK IAE+C  L ALDL NL KLTD A G+LA+GCQ++QTLKL  N+FS
Sbjct: 731  VLTDCSRLTDFSLKAIAETCPELRALDLGNLCKLTDSAFGYLASGCQAMQTLKLRCNSFS 790

Query: 672  DESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLI 493
            DE+IAAFLE SG SLKELSLNNV ++GHNTAISLAR S+ L+ LDLSWCRNL D  +G I
Sbjct: 791  DEAIAAFLEISGGSLKELSLNNVSKIGHNTAISLARRSRELIRLDLSWCRNLTDGDLGFI 850

Query: 492  VDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPL 328
            VD+C SLRVLKLFGCTQITN F +GHSNP V IIGLKLTPIL+HL + D    PL
Sbjct: 851  VDSCLSLRVLKLFGCTQITNMFVDGHSNPQVEIIGLKLTPILKHLKLTDPQSFPL 905


>ref|XP_006453195.1| hypothetical protein CICLE_v10007604mg [Citrus clementina]
            gi|568840725|ref|XP_006474316.1| PREDICTED:
            uncharacterized protein LOC102618698 [Citrus sinensis]
            gi|557556421|gb|ESR66435.1| hypothetical protein
            CICLE_v10007604mg [Citrus clementina]
          Length = 715

 Score =  637 bits (1643), Expect = e-180
 Identities = 361/697 (51%), Positives = 462/697 (66%), Gaps = 15/697 (2%)
 Frame = -3

Query: 2358 KRKLEFDLNSPPLDLGISDGSDKGFFNLRSGARISKRKIE---GIRVDSNGIDNVGEKLA 2188
            KRKL+   N     LG+  G  +GF NLRSG ++ KR  E   G  VD    +N  E + 
Sbjct: 54   KRKLDVSENL----LGLEGGDSEGFLNLRSGKKVIKRIGETDGGNSVDGKEKENGKETMD 109

Query: 2187 XXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVNPNTIADEMGPSNVKKRYTREEKGKGRL 2008
                                                 AD       ++R+ REEKGK +L
Sbjct: 110  FEEVRMLREVSKDGADVDKLDIKQN------------ADGSCSEKRRRRFGREEKGKAKL 157

Query: 2007 IKDTWLSIGIDPVELDL----KPEEENLLEDAVSGLIPDAASGLVQLQEISAVEHERQIA 1840
            I +     G + + LDL    K  EEN+      G + +  +             +R   
Sbjct: 158  IDEDSTVNGSEFINLDLELGTKHSEENV------GSVSEPRT------------EQRVDK 199

Query: 1839 KTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPED-------EDYIAPLPEPNEEIEDW 1681
            K+ VR+   R      +F+D ARQ+AS+FA+F  E+       E  +    E   EIEDW
Sbjct: 200  KSSVRLSESRME----QFRDIARQNASKFAYFNVEENHLSDDNERLVVADGEVGREIEDW 255

Query: 1680 PGPFSTAMKIIRDRATKQNGRQ-MNSTSGVTNPAPVIEWKPSNNEGQNRSRPLVPSLRDL 1504
            PGPFSTAMKI+RDR  K +G Q + S          I W P   + Q   + ++PSL++L
Sbjct: 256  PGPFSTAMKIVRDREKKLSGGQRIGSLDPKKKSNSSILWIPRKGQRQG-PKLIIPSLKEL 314

Query: 1503 SMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWM 1324
            SM IL +NA+AITSL+ VPD +R+KLS +LCDSR+M+ HF++LL  GSP+EI ++DCSW+
Sbjct: 315  SMKILVQNADAITSLEHVPDALRHKLSFMLCDSRQMNSHFLNLLFSGSPTEIRLRDCSWL 374

Query: 1323 TEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRIT 1144
            TE+E TK F +CDT NLTVLQLD+CGRCMPDYIL +TLA S +SL +L ++S+ G CRI+
Sbjct: 375  TEQEFTKAFVSCDTKNLTVLQLDRCGRCMPDYILLSTLASSLNSLPSLTTLSICGACRIS 434

Query: 1143 DVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPA 964
            DVG  ALV SAPALRS+NL QCSLLTS  ++ +AD LG+ ++ELYI+DCQS++AMLILPA
Sbjct: 435  DVGFKALVTSAPALRSINLSQCSLLTSTSMDILADKLGSFIQELYINDCQSLNAMLILPA 494

Query: 963  LKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGL 784
            L+KL+HLEVLSVA I+TV D F+ GF+  CG NMKEL+L +C KLTD SLKVIAE+C  L
Sbjct: 495  LRKLKHLEVLSVAGIETVTDEFVRGFVYACGHNMKELILTDCVKLTDFSLKVIAETCPRL 554

Query: 783  CALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNV 604
            C LDL NL+KLTD  IG+LANGCQ+IQTLKLCRNAFSDE+IAAFLE +GE LKELSLNNV
Sbjct: 555  CTLDLSNLYKLTDFGIGYLANGCQAIQTLKLCRNAFSDEAIAAFLETAGEPLKELSLNNV 614

Query: 603  KRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFF 424
            ++V  NTA+SLA+ S  L++LDLSWCRNL+DEA+GLIVD+C SLR+LKLFGC+QITN F 
Sbjct: 615  RKVADNTALSLAKRSNKLVNLDLSWCRNLSDEALGLIVDSCLSLRMLKLFGCSQITNAFL 674

Query: 423  NGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 313
            +GHSNP V+IIGLK++P+LEH+ V D  E PL Y SV
Sbjct: 675  DGHSNPDVQIIGLKMSPVLEHVKVPDFHEGPLHYSSV 711


>ref|XP_004288980.1| PREDICTED: uncharacterized protein LOC101312489 [Fragaria vesca
            subsp. vesca]
          Length = 903

 Score =  630 bits (1626), Expect = e-178
 Identities = 334/600 (55%), Positives = 436/600 (72%), Gaps = 21/600 (3%)
 Frame = -3

Query: 2049 KKRYTREEKGKGRLIKDTWLSIGIDPVELDLKP---------------EEENLLEDAVSG 1915
            +++++R+EKGK +LI    L    D VELD                   E +L+ + V  
Sbjct: 305  RRKFSRQEKGKEKLIGGALLPNDFDKVELDFLGIGALSELSSMPNVVLSELSLMPNVVLS 364

Query: 1914 ---LIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF 1744
               L+ +      Q+ E  A++ + Q   T  R +   R     RF+D ARQ+ASRFA F
Sbjct: 365  ELSLMSNVVPSPAQVGENVAMQEQVQARNTNAREEGRDRNQYMERFRDIARQNASRFARF 424

Query: 1743 QP--EDEDYIAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIE 1570
             P  E+E+ + P  +   E EDWPGPFSTAM+I+RD A K N ++ +++   T PA +++
Sbjct: 425  DPREEEENDMPPQVDVELEDEDWPGPFSTAMRIMRDGAEK-NMQEHSASKDKTKPA-LVK 482

Query: 1569 WKPSNNEGQNR-SRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMD 1393
            W P   E     S+ L+PSL++L +++LAKNA+ I SL+ VPD +R++LS LLCDSRRM+
Sbjct: 483  WVPKRQEQDLAISKNLIPSLQELCLSVLAKNADEIVSLESVPDALRHQLSHLLCDSRRMN 542

Query: 1392 CHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRAT 1213
             HF +LL +GSP+E+ ++DCSW+TEEE TK F  CD  NLTVLQLDQCGRC+PDYIL +T
Sbjct: 543  THFFELLVQGSPTEVRLRDCSWLTEEEFTKSFQLCDITNLTVLQLDQCGRCLPDYILNST 602

Query: 1212 LARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSL 1033
            LARS + L +LVS+SL G CR++DVGL ALV+S PALRSLNL QCSLLTS  I+ +A+SL
Sbjct: 603  LARSANCLPSLVSLSLSGACRLSDVGLGALVSSVPALRSLNLSQCSLLTSSSIDTLANSL 662

Query: 1032 GTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKEL 853
            G++L+ELY++DCQSIDAM ILPALKK EHLEVL +  I+ VCD FI  FI+  G N+KEL
Sbjct: 663  GSLLKELYLNDCQSIDAMQILPALKKFEHLEVLWLPGIENVCDDFIKEFISARGHNLKEL 722

Query: 852  VLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFS 673
             L +C  LTD+S+KV+AE+CSGLCALDL NLHKLTD ++G+LANGC++IQTLK CRN+FS
Sbjct: 723  SLTDCINLTDSSVKVLAETCSGLCALDLFNLHKLTDYSLGYLANGCRAIQTLKFCRNSFS 782

Query: 672  DESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLI 493
            DE++AAFLE SGE LKELSLNN+ +VG NTAISLAR S+NL  LDLSWCRNL DEA+GLI
Sbjct: 783  DEAVAAFLETSGECLKELSLNNITKVGDNTAISLARHSRNLHCLDLSWCRNLTDEALGLI 842

Query: 492  VDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 313
            VD+C SL++LKLFGCTQIT+ F +GHSNP V+IIG+++TPIL+ + V D    PL Y +V
Sbjct: 843  VDSCLSLKMLKLFGCTQITDLFLSGHSNPDVKIIGVRMTPILKDVRVPDPAAGPLHYSAV 902


>ref|XP_007012655.1| DNA repair protein rhp7, putative [Theobroma cacao]
            gi|508783018|gb|EOY30274.1| DNA repair protein rhp7,
            putative [Theobroma cacao]
          Length = 742

 Score =  627 bits (1617), Expect = e-177
 Identities = 336/591 (56%), Positives = 420/591 (71%), Gaps = 12/591 (2%)
 Frame = -3

Query: 2058 SNVKKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQL 1879
            +N ++R++ E KGKG+L+ +T                   +LE      +  + SG+   
Sbjct: 170  ANCRRRFSAEGKGKGKLVVET-------------------ILESKAKSSVDGSVSGVNLS 210

Query: 1878 QEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDED--------- 1726
             E   +  E++  K + R    R       F+D ARQ+ASR+AHF  ++ED         
Sbjct: 211  AEKVRLPDEKRTKKNKKRGYGGRTE----HFRDVARQNASRYAHFDAQEEDDNIFSVEAE 266

Query: 1725 -YIAPLPEPNEE--IEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSN 1555
              I+P  E  EE  +EDWPGPFSTAMKIIRDRA K N ++  S+SG      ++ W P  
Sbjct: 267  REISPENEQPEETGVEDWPGPFSTAMKIIRDRAEKLNLQRGRSSSGNVQSVQIM-WVPQK 325

Query: 1554 NEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDL 1375
             +G++RS+ L PSL D+   IL  NA+AI SL  VPD +R+KL Q+LCDSRRM+ +F+DL
Sbjct: 326  GKGKDRSKRLPPSLLDMCFRILVNNADAIASLDHVPDALRHKLCQMLCDSRRMNSNFLDL 385

Query: 1374 LGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPS 1195
            L  GSPSEI ++DCSW+TEE+ T+ F  CDT  LTVLQLDQCG C+PDYIL +TLA+S +
Sbjct: 386  LVSGSPSEIRLRDCSWLTEEQFTRCFDGCDTTKLTVLQLDQCGCCIPDYILLSTLAQSSN 445

Query: 1194 SLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRE 1015
            SL AL+++SL G  R++D GLNALV+SAPALRS+NL Q SLLT+   + +A+SL +VL E
Sbjct: 446  SLPALINLSLTGAFRLSDAGLNALVSSAPALRSINLSQSSLLTASAFDTLANSLASVLLE 505

Query: 1014 LYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCG 835
            LYI+DCQSIDA LILPALKKLEHLEVLSVA +++V D FI  FI   G  +KEL+L  C 
Sbjct: 506  LYINDCQSIDAKLILPALKKLEHLEVLSVAGLESVTDCFIKEFIIARGHGIKELILTGCR 565

Query: 834  KLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAA 655
            KL+D+SLK+IAE+C  L ALD+ NL KLTD  +G+LANGCQS+Q LK CRNAFSD++IAA
Sbjct: 566  KLSDSSLKIIAETCPNLRALDVGNLSKLTDSTLGYLANGCQSLQLLKFCRNAFSDDAIAA 625

Query: 654  FLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSS 475
            FLE SGE LKELSLNNV +VGHNTA+SLAR SKNLLSLDLSWCRNL DEAVGLIVD+C S
Sbjct: 626  FLETSGEVLKELSLNNVGKVGHNTALSLARRSKNLLSLDLSWCRNLTDEAVGLIVDSCLS 685

Query: 474  LRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY 322
            LRVLKLFGCTQITN F +GHSN  V IIGLK +P+LEH+ V D  E PLRY
Sbjct: 686  LRVLKLFGCTQITNVFLDGHSNSKVEIIGLKFSPLLEHIKVPDSQEGPLRY 736


>ref|XP_002516283.1| rad7, putative [Ricinus communis] gi|223544769|gb|EEF46285.1| rad7,
            putative [Ricinus communis]
          Length = 765

 Score =  616 bits (1588), Expect = e-173
 Identities = 367/691 (53%), Positives = 454/691 (65%), Gaps = 36/691 (5%)
 Frame = -3

Query: 2403 RRRSERLGSST-----KGKGKRKL--------EFDLNSPPLDLGISDGSDKGFF-NLRSG 2266
            RRRS RL S +      G  KRK+        E +  +    +  +D  D     +LRSG
Sbjct: 47   RRRSLRLASKSVPRDQNGSRKRKISSIEKEKEETEEQNSAFQVNDNDNVDSEMILSLRSG 106

Query: 2265 ARISKRKIE-----GIRVDSNGI-----DNVGEKLAXXXXXXXXXXXXXXXKFXXXXXXX 2116
             R+ KRK+E      + +++  +     +NV +K                 K        
Sbjct: 107  KRVVKRKVEYDSGENLVIEAKDLNVEEFENVSDK--------DKGKAKLTEKLMEKQSVV 158

Query: 2115 XXXXXXXVNPNTIADEMGPS-NVKKRYTREEKGKGRLIKDTWL-SIGIDPVELDLKPEE- 1945
                   +  N  + E   S   K+RY+REEKGK  L  D    SIG D +EL  K +E 
Sbjct: 159  EGNCSSRLEVNKFSHESSNSMRTKRRYSREEKGKANLDDDGLSNSIGKDELELQSKVKEL 218

Query: 1944 -ENLLEDAVSGLIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQ 1768
              +L E+ V  L+P                +ERQ        K    R+D+  F+D A +
Sbjct: 219  GHSLGENVV--LLPG---------------NERQTMNINTSNKN-ESRMDQ--FRDIATR 258

Query: 1767 SASRFAHF-QPEDEDYIAPLP-------EPNEEIEDWPGPFSTAMKIIRDRATKQNGRQM 1612
            +ASRFA F + EDE+  + +        E NE IEDWPGPFSTAMKIIRDRA  +N +Q 
Sbjct: 259  NASRFAQFDRQEDENLPSEVDNVEISSVEENERIEDWPGPFSTAMKIIRDRANMRNSQQG 318

Query: 1611 NSTSGVTNPAPVIEWKPSNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRN 1432
             ST       P I W P+ N    +SR  VPSL++L M I+ KN +A+TSL  VPD +R+
Sbjct: 319  ASTLEKPQSVP-ITWVPTRNR---QSRTCVPSLQELCMRIIVKNVDAVTSLDHVPDALRH 374

Query: 1431 KLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQ 1252
            +L QLLCD R+M+  F+DLL RGSP+EI VKDCSWM+EEE  K F  CDTNNL+VLQLDQ
Sbjct: 375  RLCQLLCDCRKMNSSFLDLLVRGSPTEIRVKDCSWMSEEELVKCFEGCDTNNLSVLQLDQ 434

Query: 1251 CGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSL 1072
            CGRCMPDY++ ATLARS  SL AL+++SL G CR++D+GL+ LVASA +LRS+NL QCS 
Sbjct: 435  CGRCMPDYVIPATLARSSRSLPALITLSLCGACRLSDIGLSLLVASATSLRSINLSQCSH 494

Query: 1071 LTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIH 892
            LTS  I  +ADSLG+VLRELYIDDCQS+DAMLILP+LKKLEHLEVLS+A IQTVCD F+ 
Sbjct: 495  LTSTSIGTLADSLGSVLRELYIDDCQSLDAMLILPSLKKLEHLEVLSLAGIQTVCDDFVR 554

Query: 891  GFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQ 712
             F+  CG N+KE  LA+C KLTD+SLKVIAE+C GLCAL+LVNL KLTD  +G LANGC+
Sbjct: 555  EFVVACGHNIKEFGLADCTKLTDSSLKVIAETCPGLCALNLVNLRKLTDSTLGFLANGCR 614

Query: 711  SIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLS 532
             IQTLKLCRNAFSDE IAAFLE+SG+ LKELSLNNVK+VGH+TAISLAR S+NL+SLDLS
Sbjct: 615  EIQTLKLCRNAFSDEGIAAFLESSGDLLKELSLNNVKKVGHHTAISLARRSRNLISLDLS 674

Query: 531  WCRNLADEAVGLIVDNCSSLRVLKLFGCTQI 439
            WCRNL+DEAVGLIVD+CSSLRVLKLFGC Q+
Sbjct: 675  WCRNLSDEAVGLIVDSCSSLRVLKLFGCGQV 705


>ref|XP_003546506.2| PREDICTED: uncharacterized protein LOC100808150 [Glycine max]
          Length = 826

 Score =  602 bits (1552), Expect = e-169
 Identities = 297/495 (60%), Positives = 383/495 (77%), Gaps = 2/495 (0%)
 Frame = -3

Query: 1791 RFQDAARQSASRFAHFQPEDEDY--IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGR 1618
            RF D AR++ASRFA F PE ED+    P+    +EIEDWPGPFSTAMKIIRDR +K    
Sbjct: 330  RFHDIARENASRFAFFAPEGEDHDRSPPVEPERDEIEDWPGPFSTAMKIIRDRGSKLQNA 389

Query: 1617 QMNSTSGVTNPAPVIEWKPSNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGM 1438
            + +S + +      I+W P+   G       VPSL+++ + IL KN +AI SL+ VPD +
Sbjct: 390  EASSQASLCES---IKWVPNAKRGNAGVNVSVPSLQEMCLKILVKNVDAIASLESVPDAL 446

Query: 1437 RNKLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQL 1258
            R++LSQLLCDSRR++ HF++LL RG+P+EI ++DCSW+TEE+ T+ F  CDT NL VLQL
Sbjct: 447  RHRLSQLLCDSRRINGHFLELLVRGTPTEIRLRDCSWLTEEQFTESFRTCDTENLVVLQL 506

Query: 1257 DQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQC 1078
            DQCGRC+PDY++ +TLA+SP  L++L ++SL G CR++D GL ALV+SAPALRS+NL QC
Sbjct: 507  DQCGRCLPDYVVVSTLAQSPRHLSSLSTLSLSGACRLSDGGLRALVSSAPALRSINLSQC 566

Query: 1077 SLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAF 898
            SLLTS  +  +A+SL ++L+ELY+DDCQ IDA LI+PAL +LEHLEVLSVA IQTVCD F
Sbjct: 567  SLLTSSSVYILAESLKSLLKELYLDDCQGIDAALIVPALIELEHLEVLSVAGIQTVCDEF 626

Query: 897  IHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANG 718
            +  +I   G NMKELVL +C  LTD S+K I E C GLC LDL+NLHKLTD++IGHLANG
Sbjct: 627  VKNYIVARGQNMKELVLKDCINLTDASIKAIVEHCPGLCVLDLMNLHKLTDLSIGHLANG 686

Query: 717  CQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLD 538
            C+++ TLKLCRN FSDE+IAAF+E +G SLKELSLNN+K+VG++T +SLA  +KNL SLD
Sbjct: 687  CRALHTLKLCRNPFSDEAIAAFVETTGGSLKELSLNNIKKVGYHTTLSLANHAKNLHSLD 746

Query: 537  LSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHL 358
            LSWCRNL D A+GLIVD+C +LR LKLFGC+Q+T+ F NGHSN  ++IIGLK++P+LEH+
Sbjct: 747  LSWCRNLTDNALGLIVDSCLALRSLKLFGCSQVTDAFLNGHSNLQIQIIGLKMSPVLEHV 806

Query: 357  NVLDRPEAPLRY*SV 313
             V D  +  L Y SV
Sbjct: 807  KVPDPHQGALNYSSV 821


>ref|XP_004243936.1| PREDICTED: DNA repair protein RAD7-like [Solanum lycopersicum]
          Length = 902

 Score =  585 bits (1509), Expect = e-164
 Identities = 317/601 (52%), Positives = 414/601 (68%), Gaps = 22/601 (3%)
 Frame = -3

Query: 2046 KRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEIS 1867
            +R +REEKGK  +  D  L  G+D +E   K   E   ++ VS  I              
Sbjct: 316  RRISREEKGKQVMAGDD-LCHGVDTLEGKSKNGAEKPADEIVSRAIN------------L 362

Query: 1866 AVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDYIAP-----LP 1708
             ++   Q+A       A RR + R RF+D AR++ASRFAHF  Q E E+ +A       P
Sbjct: 363  TIQDGEQVADADGSATATRR-VHRERFRDVARRNASRFAHFSSQAEHENDVADEAAEEFP 421

Query: 1707 EP---NEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQNR 1537
            +     EEIEDWPGPFSTAM IIRDR      +Q N +         + W P  ++    
Sbjct: 422  QEVAETEEIEDWPGPFSTAMNIIRDREMNMKHQQQNKSE---KSKIEVVWVPKTDQQGQS 478

Query: 1536 SRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSP 1357
             + +VPSL DL M+IL KNA+AITSL  +PD +R+K+ Q LCDSR M   F+ LL  GSP
Sbjct: 479  RKMVVPSLHDLCMDILVKNADAITSLDGLPDALRHKICQSLCDSREMTYQFLQLLISGSP 538

Query: 1356 SEICVKDCSWMTEEEATKIFGACDTNN-----------LTVLQLDQCGRCMPDYILRATL 1210
            +EI ++DCSW+ EE  T+ F  CDTNN           L VLQLDQCGRC+PDYIL  TL
Sbjct: 539  TEIRIRDCSWLNEENFTQSFKGCDTNNFESFKGCDTNNLVVLQLDQCGRCLPDYILLVTL 598

Query: 1209 ARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLG 1030
            AR P++L AL ++SL+G CR++D GL A++++AP LRS+NL QCSLLT  GI+++++SLG
Sbjct: 599  ARRPNNLPALTTLSLKGACRLSDAGLEAIISAAPNLRSINLSQCSLLTCDGISSLSNSLG 658

Query: 1029 TVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELV 850
            +VLRELY+D+C+++  +LILPAL KL+HLEVLSVA IQTVCDAFI  F+T  G +++E++
Sbjct: 659  SVLRELYLDNCEAVHPILILPALLKLQHLEVLSVAGIQTVCDAFIKEFVTNRGQSLREII 718

Query: 849  LANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSD 670
            L  C +LTD SLK I+++C  L A+DL +L KLTD AI HLA GC+ +  LKLCRN FSD
Sbjct: 719  LKGCMELTDRSLKDISQNCPKLRAIDLSDLCKLTDSAIEHLATGCREVDNLKLCRNPFSD 778

Query: 669  ESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIV 490
            E++AA++E SG SLKELSLN +K+V HNTA+SLA+CSKNL+SLDLSWCRNL +EA+GLIV
Sbjct: 779  EAVAAYVEISGVSLKELSLNRIKKVSHNTAMSLAKCSKNLISLDLSWCRNLTNEALGLIV 838

Query: 489  DNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDR-PEAPLRY*SV 313
            D+C SL VLKLFGC+Q+T+ F +GHSNP V+IIGLK+TPILEH+   D   + PLRY +V
Sbjct: 839  DSCLSLEVLKLFGCSQVTSVFLDGHSNPQVKIIGLKMTPILEHIEAPDSLQQGPLRYSAV 898

Query: 312  P 310
            P
Sbjct: 899  P 899


>ref|XP_007138661.1| hypothetical protein PHAVU_009G227500g, partial [Phaseolus vulgaris]
            gi|561011748|gb|ESW10655.1| hypothetical protein
            PHAVU_009G227500g, partial [Phaseolus vulgaris]
          Length = 771

 Score =  581 bits (1497), Expect = e-163
 Identities = 297/520 (57%), Positives = 386/520 (74%), Gaps = 3/520 (0%)
 Frame = -3

Query: 1863 VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDED--YIAPLPEP-NEE 1693
            V    + +  R R   +RR     RF D AR++ASRFA F PE+ED     P+PE  +EE
Sbjct: 251  VRERSRNSNARERRSGLRRNDYMERFHDIARENASRFAFFAPEEEDDGRSPPVPEAASEE 310

Query: 1692 IEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQNRSRPLVPSL 1513
            IEDWPGPFSTAMKIIRDR       Q   TS   N    I+W P  ++G       VPSL
Sbjct: 311  IEDWPGPFSTAMKIIRDRGMNLQNAQ---TSSQANLCESIKWVPKAHKGDVGVLS-VPSL 366

Query: 1512 RDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDC 1333
            +D+   IL  N +AI SL+ VPD +R++LSQLLCDSRR++ HF++LL RG+P+EI ++DC
Sbjct: 367  QDMCFRILVNNVDAIASLESVPDALRHRLSQLLCDSRRINGHFLELLVRGTPTEIRLRDC 426

Query: 1332 SWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGC 1153
            SW+TEE+ T+ F  C+T NL+VLQLDQCGRC+PD+++ ATLARSP +LA L ++SLRG C
Sbjct: 427  SWLTEEQFTECFRMCNTENLSVLQLDQCGRCLPDFVIVATLARSPRNLARLTTLSLRGAC 486

Query: 1152 RITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLI 973
            R++D GL ALV+SAPALRS+NL QCSLLTS  I  +A+SL  +L+EL++DDCQ IDA LI
Sbjct: 487  RLSDGGLRALVSSAPALRSINLSQCSLLTSASIYLLAESLSYLLKELFLDDCQGIDAALI 546

Query: 972  LPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESC 793
            +PAL +LEHLEVLSVA I TVCD F+  +I   G NMKELVL +C  LTD+S+KVI E C
Sbjct: 547  VPALIELEHLEVLSVAGIPTVCDEFVKNYIVARGQNMKELVLKDCINLTDSSIKVIVEHC 606

Query: 792  SGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSL 613
             GL  LD++NL++LTD+++G+L NGC+ + TLKLCRN FSDE+IAAF+E +G SLKEL L
Sbjct: 607  PGLRVLDIMNLNRLTDLSVGYLTNGCRVLHTLKLCRNPFSDEAIAAFVETTGGSLKELLL 666

Query: 612  NNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITN 433
            NN+K+VG++T +SLA  +K L  LDLSWCRNL D A+GLIVD+C +LR+L+LFGCTQ+T+
Sbjct: 667  NNIKKVGYHTTLSLANHAKKLHYLDLSWCRNLTDNALGLIVDSCLALRLLRLFGCTQVTD 726

Query: 432  TFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 313
             F NGHSN  ++IIGLK++P+L+ + V D  +  L Y SV
Sbjct: 727  AFLNGHSNLQIQIIGLKMSPVLQDVKVPDPHQGALNYSSV 766


>ref|XP_006297020.1| hypothetical protein CARUB_v10013011mg [Capsella rubella]
            gi|482565729|gb|EOA29918.1| hypothetical protein
            CARUB_v10013011mg [Capsella rubella]
          Length = 791

 Score =  569 bits (1467), Expect = e-159
 Identities = 304/572 (53%), Positives = 405/572 (70%), Gaps = 8/572 (1%)
 Frame = -3

Query: 2049 KKRYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEI 1870
            ++ YTREEKGKG  +++    I I+  E ++  E ENL+ D  +   PDA+        +
Sbjct: 232  RRIYTREEKGKGIQVENVASPITIEICEEEM--EMENLINDG-NPPTPDASVPESSAMTV 288

Query: 1869 SAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQP--EDEDYIAPLPEPNE 1696
            +A + + Q           R       F+D AR++ASRFA++    E+E+ ++   E  +
Sbjct: 289  NAEQTQNQNGNQIGNGGRSRH------FRDIARRNASRFAYYDARMEEEEDLSDR-EGEQ 341

Query: 1695 EIEDWPGPFSTAMKIIRDRATKQNGRQMNSTS----GVTNP--APVIEWKPSNNEGQNRS 1534
            ++EDWPGPFSTAMKII+DR       + NSTS    GV+N   + +  W P  N      
Sbjct: 342  QVEDWPGPFSTAMKIIKDR-------EENSTSYFGIGVSNKEKSSLTIWVPRINFSVAPR 394

Query: 1533 RPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPS 1354
            +   PSL++LS+ IL KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL RGSP+
Sbjct: 395  K--APSLQELSLQILVKNADAITSLDYVPDALRVKLCQLLCDSRRMDVHFLDLLVRGSPT 452

Query: 1353 EICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVS 1174
            EICV DCSW+TEE+ T+ F  CDT+NL VLQLDQCGRCMPDY+L +TLARSP  L  L S
Sbjct: 453  EICVPDCSWLTEEQFTECFKNCDTSNLMVLQLDQCGRCMPDYVLHSTLARSPKQLPMLSS 512

Query: 1173 ISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQ 994
            +SL G CR++D GL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI++CQ
Sbjct: 513  LSLSGACRLSDAGLKTLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYINECQ 572

Query: 993  SIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSL 814
            SID   IL ALKK E LE+LS+AD+ +V   F+  F+T  G  +K+L+L N G+LTD+S+
Sbjct: 573  SIDVKRILSALKKFEKLEILSLADLPSVKGQFLKEFVTARGQTLKQLILTNSGRLTDSSI 632

Query: 813  KVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGE 634
            KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E +G 
Sbjct: 633  KVISENCPNLSVLDLANICKLTDSSLGYLANGCQALEKLIFCRNTFSDEAVAAFIETAGG 692

Query: 633  SLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLF 454
             L ELSLNNVK+VGHNTA+++A+ S  L  LD+SWCR+++D+ +G IVDNCSSL+VLK+F
Sbjct: 693  CLNELSLNNVKKVGHNTAVAIAKHSTKLQILDVSWCRDMSDDLLGYIVDNCSSLKVLKVF 752

Query: 453  GCTQITNTFFNGHSNPLVRIIGLKLTPILEHL 358
            GCTQ+T+ F NGHSNP V+I+GLK+ P L HL
Sbjct: 753  GCTQVTDVFVNGHSNPTVKILGLKMVPFLGHL 784


>gb|EXB38942.1| hypothetical protein L484_027377 [Morus notabilis]
          Length = 775

 Score =  566 bits (1459), Expect = e-158
 Identities = 340/676 (50%), Positives = 439/676 (64%), Gaps = 7/676 (1%)
 Frame = -3

Query: 2370 KGKGKRKLEF-DLNSPPLDLGISDGSDKGFFNLRSGARISKRKIEGIRVDSNGIDNVGE- 2197
            + KGKRKL   D   P L+         G  +LRSG R+SKR  +GI     G   VGE 
Sbjct: 137  EAKGKRKLGVVDGYLPSLECSEDGEGGIGVLSLRSGKRVSKRGNDGIE----GGRQVGEF 192

Query: 2196 -KLAXXXXXXXXXXXXXXXKFXXXXXXXXXXXXXXVNPN-TIADEMGPSNVKKRYTREEK 2023
             K+                +F                    IADE G +       R+ K
Sbjct: 193  GKIGEDKGKAILDSEEASGEFRIPKISKGKRKISDSGEEEVIADENGDN-------RKRK 245

Query: 2022 GKGRLIKDTWLSIGID-PVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEISAVEHERQ 1846
            GKG L++D  L    +  VE+ L+ E EN   D V                   V +E Q
Sbjct: 246  GKGLLVEDDGLVSNSNLDVEIRLETEVENNSGDNV-------------------VSNEGQ 286

Query: 1845 IAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDEDYIAPLPEPNEE--IEDWPGP 1672
                 VR + M R      F+D AR++A RFAHF  E+ED   P  E ++E  IEDWPGP
Sbjct: 287  -----VRNEFMER------FRDIARRNAYRFAHFDGEEEDN-EPHSEVDDEPDIEDWPGP 334

Query: 1671 FSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQNRSRPLVPSLRDLSMNI 1492
            FSTA+KIIRDR  K+N +  NS+S    PA V+ W P +N+    S+ +VPSL++LS+  
Sbjct: 335  FSTALKIIRDRE-KKNQQPGNSSSREKKPADVV-WFPKSNQDCKWSKNVVPSLQELSLRC 392

Query: 1491 LAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWMTEEE 1312
            LA NA+ + SL   PD ++++LSQLLCDSRRM+ H   LL +GSP+E+CVKDCSW+TEEE
Sbjct: 393  LANNADKLVSLDYFPDCLKHRLSQLLCDSRRMNAHVFKLLLQGSPTEVCVKDCSWLTEEE 452

Query: 1311 ATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGL 1132
             TK F   D +NL VLQL  CGRC+PD++L +TLA + +SL  L ++S+RG CR++D+GL
Sbjct: 453  FTKCFQNFDPSNLMVLQLGFCGRCLPDFLLCSTLACAENSLPVLTTLSVRGACRLSDIGL 512

Query: 1131 NALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKL 952
             +LV+SAPALRSLNL +CSLLTS  I+ +A+SLG +LRELY+D C SID ML LPALKKL
Sbjct: 513  KSLVSSAPALRSLNLTECSLLTSSSIDTLANSLGLILRELYLDQCLSIDVMLTLPALKKL 572

Query: 951  EHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALD 772
            E LEVLS+A I TVCD FI  FI++ G NMKEL+LA+C  LTD+SLK+IAE C GL A+D
Sbjct: 573  EQLEVLSLAGIATVCDKFIREFISIRGHNMKELILADCVNLTDSSLKIIAEKCPGLRAVD 632

Query: 771  LVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVG 592
            L NL KLTD ++G+LAN C++IQ L L R+ FSD+SIAAFLE SGE L+ELSLN+V++VG
Sbjct: 633  LSNLRKLTDSSLGYLANCCRAIQRLILSRDLFSDKSIAAFLETSGECLEELSLNSVRKVG 692

Query: 591  HNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHS 412
             +TA+S+AR  + L SL+LS+CR L D A+G IVD+C SLRVLK+FGCTQ+T+ F NGHS
Sbjct: 693  CHTALSIARRLRVLRSLNLSFCRGLTDNALGFIVDSCLSLRVLKIFGCTQVTSVFVNGHS 752

Query: 411  NPLVRIIGLKLTPILE 364
            NP V+IIGL + P+LE
Sbjct: 753  NPDVKIIGLPMCPVLE 768


>ref|XP_007012421.1| Rad7, putative isoform 1 [Theobroma cacao]
            gi|508782784|gb|EOY30040.1| Rad7, putative isoform 1
            [Theobroma cacao]
          Length = 714

 Score =  564 bits (1453), Expect = e-158
 Identities = 315/614 (51%), Positives = 407/614 (66%), Gaps = 32/614 (5%)
 Frame = -3

Query: 2067 MGPSNVKKRYTREEKGKGRLIK------------DTWLS-IGIDPVELDLKP--EEENLL 1933
            +G  + K+R++ EEKGK +L              D  L+ IGID       P  E E   
Sbjct: 101  VGSPSKKRRFSVEEKGKAKLDGFDEEEEKLNLDLDLGLTQIGIDKAISSFGPPIEAEEQK 160

Query: 1932 EDAVSGLIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRF 1753
            +  V  L        + L  +  ++++R        +   R+R +  R  + AR+ A R 
Sbjct: 161  DTEVEFLGSTNTLNTIDLV-VGEIDYKRNDETEEFYVS--RKREESRRHHEIARKFAQRL 217

Query: 1752 AHFQPEDEDYIAPLPEPNEE----------------IEDWPGPFSTAMKIIRDRATKQNG 1621
            AH    + D +    + N++                 ED   PF  A+++I+ R +    
Sbjct: 218  AHEVDSEGDLLKSFSKTNKDGALKNVVVVVDDDDDKAEDSESPFGMALEMIKTRNSSSTD 277

Query: 1620 RQMNSTSGVTNPAPVIEWKPSNNEGQNRSRPL-VPSLRDLSMNILAKNAEAITSLKDVPD 1444
            ++  S  G+       +W P N +G + S    VPSL DLS+  LAKNAEA+ SL+ VPD
Sbjct: 278  KKKYSRGGLEAE---FKWVPKNYKGSSISMARDVPSLLDLSLRALAKNAEAMVSLEHVPD 334

Query: 1443 GMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVL 1264
             +R+KLSQL+CD+R+MD HF++LL RGSP+EI V DCS +TE+E TK+FG CDT NL VL
Sbjct: 335  VLRHKLSQLVCDNRKMDAHFLELLVRGSPTEIRVNDCSGVTEDEFTKMFGCCDTKNLIVL 394

Query: 1263 QLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLG 1084
            QLD CG C+PDY+L+ TLA S +SL ALV++SL G  R++D GLN L  SAPAL+S+NL 
Sbjct: 395  QLDLCGSCLPDYVLQGTLAHSSNSLPALVTLSLDGAYRLSDKGLNLLALSAPALQSINLS 454

Query: 1083 QCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCD 904
            QCSLLTS GINN+A    + LRELY+D+CQ+I AM++LPALKKL+ LEVLS+A IQTVCD
Sbjct: 455  QCSLLTSAGINNLASCFESTLRELYLDECQNIQAMVVLPALKKLKCLEVLSLAGIQTVCD 514

Query: 903  AFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLA 724
             F+ G +  CG NMKELVLANC +LTD SLK + ++CS LCALDL  LH LTD ++ +LA
Sbjct: 515  DFVVGMVEACGKNMKELVLANCVELTDISLKFVGKNCSRLCALDLSYLHNLTDSSMRYLA 574

Query: 723  NGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLS 544
            NGC+SI  LKLCRN FSDE+IAAFLEASG SL ELSLNN+  VG NTA+SL++CS+ L S
Sbjct: 575  NGCRSITKLKLCRNGFSDEAIAAFLEASGGSLTELSLNNIISVGLNTALSLSKCSRKLFS 634

Query: 543  LDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILE 364
            LDLSWCRNL DEA+GLIVD+C  LR+LKLFGCTQIT  F  GHSN  V+IIGLK+T IL+
Sbjct: 635  LDLSWCRNLTDEALGLIVDSCLLLRLLKLFGCTQITEVFLGGHSNAQVQIIGLKMTTILK 694

Query: 363  HLNVLDRPEAPLRY 322
            HLN+L+  EAPLRY
Sbjct: 695  HLNMLEPQEAPLRY 708


>dbj|BAD43070.1| hypothetical protein [Arabidopsis thaliana]
            gi|62318624|dbj|BAD95072.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 762

 Score =  561 bits (1445), Expect = e-157
 Identities = 300/576 (52%), Positives = 403/576 (69%), Gaps = 3/576 (0%)
 Frame = -3

Query: 2073 DEMGPSNVKKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAA 1897
            ++   S  ++RYTREEKGKG + ++D    + ++ VE+D   EEE  +E  V+   P   
Sbjct: 196  EKQSSSMGRRRYTREEKGKGIMQVEDVSSPVTVEIVEVD---EEEMEIESLVNSEKPPDV 252

Query: 1896 SGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDY 1723
            S    + E++A     + A+ R     +        F+D A + A RFAHF  Q E+E+ 
Sbjct: 253  S----VTELAATMANVEQAQNRENSNEIGNDSRTQHFRDIAERIAHRFAHFDAQVEEEED 308

Query: 1722 IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQ 1543
            ++   E  +++EDWPGPFSTAMKII+DR            S     +P I W P +N   
Sbjct: 309  LSD-KEGEQQVEDWPGPFSTAMKIIKDREEYTTPHVGIGVSNKERSSPTI-WVPRSNFSF 366

Query: 1542 NRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRG 1363
               +   PSL++LS+ +L KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL +G
Sbjct: 367  PPRK--APSLQELSLRVLVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDLHFLDLLVQG 424

Query: 1362 SPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAA 1183
            SP+EICV DCSW+TEEE T+ F  CDT+NL VLQLDQCGRCMPDYIL  TLARSP  L  
Sbjct: 425  SPTEICVPDCSWLTEEEFTECFKNCDTSNLMVLQLDQCGRCMPDYILPFTLARSPKVLPM 484

Query: 1182 LVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYID 1003
            L ++S+ G CR++DVGL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI+
Sbjct: 485  LSTLSISGACRLSDVGLRQLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYIN 544

Query: 1002 DCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTD 823
            +CQ+ID   IL ALKK E LEVLS+AD+ +V   F+  F+T  G  +K+L+L N  KL+D
Sbjct: 545  ECQNIDMKHILAALKKFEKLEVLSLADLPSVKGRFLKEFVTARGQTLKQLILTNSRKLSD 604

Query: 822  TSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEA 643
            +S+KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E 
Sbjct: 605  SSIKVISENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNPFSDEAVAAFVET 664

Query: 642  SGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVL 463
            +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G IVDN SSL+VL
Sbjct: 665  AGGSLKELSLNNVKKVGHNTALALAKHSDKLQILDISWCREMSNDLLGYIVDNSSSLKVL 724

Query: 462  KLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLN 355
            K+FGC+Q+T+ F  GHSNP V+I+G+K+ P L HL+
Sbjct: 725  KVFGCSQVTDVFVKGHSNPNVKILGVKMDPFLGHLS 760


>ref|NP_178661.2| uncharacterized protein [Arabidopsis thaliana]
            gi|330250903|gb|AEC05997.1| uncharacterized protein
            AT2G06040 [Arabidopsis thaliana]
          Length = 762

 Score =  561 bits (1445), Expect = e-157
 Identities = 300/576 (52%), Positives = 403/576 (69%), Gaps = 3/576 (0%)
 Frame = -3

Query: 2073 DEMGPSNVKKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAA 1897
            ++   S  ++RYTREEKGKG + ++D    + ++ VE+D   EEE  +E  V+   P   
Sbjct: 196  EKQSSSMGRRRYTREEKGKGIMQVEDVSSPVTVEIVEVD---EEEMEIESLVNSEKPPDV 252

Query: 1896 SGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDY 1723
            S    + E++A     + A+ R     +        F+D A + A RFAHF  Q E+E+ 
Sbjct: 253  S----VTELAATMANVEQAQNRENSNEIGNDSRTQHFRDIAERIAHRFAHFDAQVEEEED 308

Query: 1722 IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQ 1543
            ++   E  +++EDWPGPFSTAMKII+DR            S     +P I W P +N   
Sbjct: 309  LSD-KEGEQQVEDWPGPFSTAMKIIKDREEYTTPHVGIGVSNKERSSPTI-WVPRSNFSF 366

Query: 1542 NRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRG 1363
               +   PSL++LS+ +L KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL +G
Sbjct: 367  PPRK--APSLQELSLRVLVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDLHFLDLLVQG 424

Query: 1362 SPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAA 1183
            SP+EICV DCSW+TEEE T+ F  CDT+NL VLQLDQCGRCMPDYIL  TLARSP  L  
Sbjct: 425  SPTEICVPDCSWLTEEEFTECFKNCDTSNLMVLQLDQCGRCMPDYILPFTLARSPKVLPM 484

Query: 1182 LVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYID 1003
            L ++S+ G CR++DVGL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI+
Sbjct: 485  LSTLSISGACRLSDVGLRQLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYIN 544

Query: 1002 DCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTD 823
            +CQ+ID   IL ALKK E LEVLS+AD+ +V   F+  F+T  G  +K+L+L N  KL+D
Sbjct: 545  ECQNIDMKHILAALKKFEKLEVLSLADLPSVKGRFLKEFVTARGQTLKQLILTNSRKLSD 604

Query: 822  TSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEA 643
            +S+KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E 
Sbjct: 605  SSIKVISENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNPFSDEAVAAFVET 664

Query: 642  SGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVL 463
            +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G IVDN SSL+VL
Sbjct: 665  AGGSLKELSLNNVKKVGHNTALALAKHSDKLQILDISWCREMSNDLLGYIVDNSSSLKVL 724

Query: 462  KLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLN 355
            K+FGC+Q+T+ F  GHSNP V+I+G+K+ P L HL+
Sbjct: 725  KVFGCSQVTDVFVKGHSNPNVKILGVKMDPFLGHLS 760


>ref|XP_002309465.2| hypothetical protein POPTR_0006s23720g [Populus trichocarpa]
            gi|550336952|gb|EEE92988.2| hypothetical protein
            POPTR_0006s23720g [Populus trichocarpa]
          Length = 679

 Score =  560 bits (1443), Expect = e-156
 Identities = 310/608 (50%), Positives = 408/608 (67%), Gaps = 29/608 (4%)
 Frame = -3

Query: 2058 SNVKKRYTREEKGKGRLIKDTWLSIGI---------DPVE--LDLKPEEENLLEDA--VS 1918
            S+ + RYT EEKGK ++  +  L   +         DPVE  +D  P E  LL     + 
Sbjct: 71   SSKRLRYTTEEKGKAKVDCEVNLDFDLNLDLWGFEKDPVEGKMDTWPFEAGLLSSGPVMH 130

Query: 1917 GLIPDAASGLVQLQEISA------VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASR 1756
               PD+     Q++           E  ++IA + VR +  RR+  +   ++ AR  A R
Sbjct: 131  NFFPDSVERNTQVENYDVPRKDIVFEQRKEIALSSVRKRQSRRKEQKLMQREIARNVAPR 190

Query: 1755 FAHFQPEDE------DYIAPLPEPNEEIE----DWPGPFSTAMKIIRDRATKQNGRQMNS 1606
            FAH  P+++      +    L E + E+E    D   PFS A++ I+ R T + G    S
Sbjct: 191  FAHLGPQEQQMKQHKEKKVKLREVDLEMELDLDDSQSPFSLALEAIKMRQTVRKG----S 246

Query: 1605 TSGVTNPAPVIEWKPSNNEGQNRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKL 1426
             +G +    + +W P+  +  +  +  VP+L DLS+N LAKNA+AI SL+ VPD +R++L
Sbjct: 247  LTGFSES--LFKWVPAKAKDCDALKRDVPTLLDLSLNALAKNADAIVSLEHVPDKLRHRL 304

Query: 1425 SQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCG 1246
            SQL+ D   +D HFV+LL RGSP+EI +++ S +TEEE +KIF  CDT +LTVLQLD CG
Sbjct: 305  SQLVSDCGVVDAHFVELLARGSPTEIRLRNISRLTEEEFSKIFSVCDTKDLTVLQLDLCG 364

Query: 1245 RCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLT 1066
            RCMPDYIL  TLARS   L +L +ISL+G  R++D+GL  L  SAPAL+S+NL QCSLLT
Sbjct: 365  RCMPDYILNGTLARSSHRLPSLATISLKGAHRLSDIGLTQLAVSAPALQSINLSQCSLLT 424

Query: 1065 SIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGF 886
            S GI++      + LRELYIDDCQ+IDA +ILPALKKL+ LEVLSVA I+TVCD F+ G 
Sbjct: 425  SQGISDFVSCFESTLRELYIDDCQNIDATIILPALKKLKCLEVLSVAGIETVCDNFVIGL 484

Query: 885  ITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSI 706
            +   G NMKEL  ANC +LTD SL+++ ++C  LCALDL  LH LTD A+ HLANGCQSI
Sbjct: 485  VKALGINMKELGFANCVQLTDISLRIVGKNCPNLCALDLSYLHNLTDSALKHLANGCQSI 544

Query: 705  QTLKLCRNAFSDESIAAFLEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWC 526
            + LKL RN FSDE+I+AFLE SG+SL  LS+NN+ RV HNTA+S+A+CS+NL+SLDLSWC
Sbjct: 545  RRLKLHRNDFSDEAISAFLEVSGQSLDALSVNNIHRVAHNTALSIAKCSRNLVSLDLSWC 604

Query: 525  RNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLD 346
            R L DEA+G+IVD+C SL++LKLFGCTQIT  F NGHSNP+VRIIG K  P+LEHL+ L+
Sbjct: 605  RRLTDEALGMIVDSCLSLKLLKLFGCTQITEAFLNGHSNPMVRIIGCKTGPVLEHLDALE 664

Query: 345  RPEAPLRY 322
              E PLRY
Sbjct: 665  PQENPLRY 672


>ref|XP_007012423.1| Rad7, putative isoform 3 [Theobroma cacao]
            gi|508782786|gb|EOY30042.1| Rad7, putative isoform 3
            [Theobroma cacao]
          Length = 715

 Score =  560 bits (1442), Expect = e-156
 Identities = 315/615 (51%), Positives = 407/615 (66%), Gaps = 33/615 (5%)
 Frame = -3

Query: 2067 MGPSNVKKRYTREEKGKGRLIK------------DTWLS-IGIDPVELDLKP--EEENLL 1933
            +G  + K+R++ EEKGK +L              D  L+ IGID       P  E E   
Sbjct: 101  VGSPSKKRRFSVEEKGKAKLDGFDEEEEKLNLDLDLGLTQIGIDKAISSFGPPIEAEEQK 160

Query: 1932 EDAVSGLIPDAASGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRF 1753
            +  V  L        + L  +  ++++R        +   R+R +  R  + AR+ A R 
Sbjct: 161  DTEVEFLGSTNTLNTIDLV-VGEIDYKRNDETEEFYVS--RKREESRRHHEIARKFAQRL 217

Query: 1752 AHFQPEDEDYIAPLPEPNEE----------------IEDWPGPFSTAMKIIRDRATKQNG 1621
            AH    + D +    + N++                 ED   PF  A+++I+ R +    
Sbjct: 218  AHEVDSEGDLLKSFSKTNKDGALKNVVVVVDDDDDKAEDSESPFGMALEMIKTRNSSSTD 277

Query: 1620 RQMNSTSGVTNPAPVIEWKPSNNEGQNRSRPL-VPSLRDLSMNILAKNAEAITSLKDVPD 1444
            ++  S  G+       +W P N +G + S    VPSL DLS+  LAKNAEA+ SL+ VPD
Sbjct: 278  KKKYSRGGLEAE---FKWVPKNYKGSSISMARDVPSLLDLSLRALAKNAEAMVSLEHVPD 334

Query: 1443 GMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEICVKDCSWMTEEEATKIFGACDTNNLTVL 1264
             +R+KLSQL+CD+R+MD HF++LL RGSP+EI V DCS +TE+E TK+FG CDT NL VL
Sbjct: 335  VLRHKLSQLVCDNRKMDAHFLELLVRGSPTEIRVNDCSGVTEDEFTKMFGCCDTKNLIVL 394

Query: 1263 QLDQCGRCMPDYILRATLARSPSSLAALVSISLRGGCRITDVGLNALVASAPALRSLNLG 1084
            QLD CG C+PDY+L+ TLA S +SL ALV++SL G  R++D GLN L  SAPAL+S+NL 
Sbjct: 395  QLDLCGSCLPDYVLQGTLAHSSNSLPALVTLSLDGAYRLSDKGLNLLALSAPALQSINLS 454

Query: 1083 QCSLLTSIGINNVADSLGTVLRELYIDDCQSIDAMLILPALKKLEHLEVLSVADIQTVCD 904
            QCSLLTS GINN+A    + LRELY+D+CQ+I AM++LPALKKL+ LEVLS+A IQTVCD
Sbjct: 455  QCSLLTSAGINNLASCFESTLRELYLDECQNIQAMVVLPALKKLKCLEVLSLAGIQTVCD 514

Query: 903  AFIHGFITVCGSNMKELVLANCGKLTDTSLKVIAESCSGLCALDLVNLHKLTDVAIGHLA 724
             F+ G +  CG NMKELVLANC +LTD SLK + ++CS LCALDL  LH LTD ++ +LA
Sbjct: 515  DFVVGMVEACGKNMKELVLANCVELTDISLKFVGKNCSRLCALDLSYLHNLTDSSMRYLA 574

Query: 723  NGCQSIQTLKLCRNAFSDESIAAFLEASGESLKELSLNN-VKRVGHNTAISLARCSKNLL 547
            NGC+SI  LKLCRN FSDE+IAAFLEASG SL ELSLNN +  VG NTA+SL++CS+ L 
Sbjct: 575  NGCRSITKLKLCRNGFSDEAIAAFLEASGGSLTELSLNNIISVVGLNTALSLSKCSRKLF 634

Query: 546  SLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPIL 367
            SLDLSWCRNL DEA+GLIVD+C  LR+LKLFGCTQIT  F  GHSN  V+IIGLK+T IL
Sbjct: 635  SLDLSWCRNLTDEALGLIVDSCLLLRLLKLFGCTQITEVFLGGHSNAQVQIIGLKMTTIL 694

Query: 366  EHLNVLDRPEAPLRY 322
            +HLN+L+  EAPLRY
Sbjct: 695  KHLNMLEPQEAPLRY 709


>dbj|BAF01173.1| hypothetical protein [Arabidopsis thaliana]
          Length = 762

 Score =  559 bits (1441), Expect = e-156
 Identities = 299/576 (51%), Positives = 403/576 (69%), Gaps = 3/576 (0%)
 Frame = -3

Query: 2073 DEMGPSNVKKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAA 1897
            ++   S  ++RYTREEKGKG + ++D    + ++ VE+D   EEE  +E  V+   P   
Sbjct: 196  EKQSSSMGRRRYTREEKGKGIMQVEDVSSPVTVEIVEVD---EEEMEIESLVNSEKPPDV 252

Query: 1896 SGLVQLQEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDY 1723
            S    + E++A     + A+ R     +        F+D A + A RFAHF  Q E+E+ 
Sbjct: 253  S----VTELAATMANVEQAQNRENSNEIGNDSRTQHFRDIAERIAHRFAHFDAQVEEEED 308

Query: 1722 IAPLPEPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQ 1543
            ++   E  +++EDWPGPFSTAMKII+DR            S     +P I W P +N   
Sbjct: 309  LSD-KEGEQQVEDWPGPFSTAMKIIKDREEYTTPHVGIGVSNKERSSPTI-WVPRSNFSF 366

Query: 1542 NRSRPLVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRG 1363
               +   PSL++LS+ +L KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL +G
Sbjct: 367  PPRK--APSLQELSLRVLVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDLHFLDLLVQG 424

Query: 1362 SPSEICVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAA 1183
            SP+EICV DCSW+TEEE T+ F  CDT+NL VLQLDQCGRCMPDYIL  TLARSP  L  
Sbjct: 425  SPTEICVPDCSWLTEEEFTECFKNCDTSNLMVLQLDQCGRCMPDYILPFTLARSPKVLPM 484

Query: 1182 LVSISLRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYID 1003
            L ++S+ G CR++DVGL  LV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI+
Sbjct: 485  LSTLSISGACRLSDVGLRQLVSSAPAITSINLNQCSLLTSSSIDMLSDSLGSVLRELYIN 544

Query: 1002 DCQSIDAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTD 823
            +CQ+ID   IL AL+K E LEVLS+AD+ +V   F+  F+T  G  +K+L+L N  KL+D
Sbjct: 545  ECQNIDMKHILAALEKFEKLEVLSLADLPSVKGRFLKEFVTARGQTLKQLILTNSRKLSD 604

Query: 822  TSLKVIAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEA 643
            +S+KVI+E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN FSDE++AAF+E 
Sbjct: 605  SSIKVISENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNPFSDEAVAAFVET 664

Query: 642  SGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVL 463
            +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G IVDN SSL+VL
Sbjct: 665  AGGSLKELSLNNVKKVGHNTALALAKHSDKLQILDISWCREMSNDLLGYIVDNSSSLKVL 724

Query: 462  KLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHLN 355
            K+FGC+Q+T+ F  GHSNP V+I+G+K+ P L HL+
Sbjct: 725  KVFGCSQVTDVFVKGHSNPNVKILGVKMDPFLGHLS 760


>gb|EYU28335.1| hypothetical protein MIMGU_mgv1a001385mg [Mimulus guttatus]
          Length = 827

 Score =  558 bits (1437), Expect = e-156
 Identities = 299/585 (51%), Positives = 410/585 (70%), Gaps = 8/585 (1%)
 Frame = -3

Query: 2043 RYTREEKGKGRLIKDTWLSIGIDPVELDLKPEEENLLEDAVSGLIPDAASGLVQLQEISA 1864
            R ++EEKGK ++      S G +  EL ++   ++ +   +     ++  G  Q++E   
Sbjct: 262  RLSKEEKGKLKIEIKAASSSGTNTSELIIQNVSDSSVSATLHAAANESLPGGSQVREADV 321

Query: 1863 VEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHFQPEDE--------DYIAPLP 1708
              ++               R+ R RF++ AR++ASRFAHF P +E            P+P
Sbjct: 322  NGNDAG-------------RVHRERFRNFARRNASRFAHFSPHEEFGNNAPVGGIQIPVP 368

Query: 1707 EPNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQNRSRP 1528
            E +  +EDWPGPFSTA+KII D   K+ G   + +  V      ++W P   E   +S+ 
Sbjct: 369  EADNGLEDWPGPFSTAIKIINDG--KRRGASTDKSGAVE-----LKWIPKMQE-LRKSQK 420

Query: 1527 LVPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEI 1348
             VPSL++L ++ILAKNA+AITSL  VPD +R+K+   LCD+R+MD HF++LL  GSP+EI
Sbjct: 421  HVPSLQELCLSILAKNADAITSLDFVPDVLRHKICWFLCDNRKMDSHFLELLVHGSPTEI 480

Query: 1347 CVKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSIS 1168
             V+DCSW++EE  TK F  C+ + LTV Q DQ G C+PDY L ATLARS +SL AL ++S
Sbjct: 481  RVRDCSWLSEELFTKTFEGCNASKLTVFQFDQGGACLPDYTLYATLARSTNSLPALTTVS 540

Query: 1167 LRGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSI 988
            L+G  R++D GLN LV++A +L+S+++ QC +LTS GI ++A+SL  VLRELYID+C  I
Sbjct: 541  LKGAYRLSDAGLNTLVSAAHSLKSIDISQCPMLTSDGICSLANSLQLVLRELYIDNCHGI 600

Query: 987  DAMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKV 808
            DAM ILPAL KLE+LEVLS+A IQTVCD F+  F+++ G  MKELVLA+C +LTD+S+KV
Sbjct: 601  DAMSILPALLKLENLEVLSLAGIQTVCDDFVSKFVSIHGCRMKELVLADCIELTDSSIKV 660

Query: 807  IAESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAFSDESIAAFLEASGESL 628
            I ++CS L A+DL NL KLTD++IGHLANGC++IQ LK CRNAFSDE+IAA+L+  G  L
Sbjct: 661  IGDTCSKLRAIDLSNLCKLTDISIGHLANGCRAIQMLKFCRNAFSDEAIAAYLDVRGALL 720

Query: 627  KELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSLRVLKLFGC 448
             +LSLNNV +V ++TA+SLAR  +NL SLDLSWCRNL +EA+GL+VD+CSSL VLKLFGC
Sbjct: 721  NDLSLNNVIQVSNHTALSLARNCRNLRSLDLSWCRNLTNEALGLVVDSCSSLEVLKLFGC 780

Query: 447  TQITNTFFNGHSNPLVRIIGLKLTPILEHLNVLDRPEAPLRY*SV 313
            TQ+TN F +GHSN  V++IGLK+TP+ +H++V D    PLRY S+
Sbjct: 781  TQVTNVFLDGHSNSEVKLIGLKMTPVFKHIDVPDFLLGPLRYSSI 825


>ref|XP_002885812.1| hypothetical protein ARALYDRAFT_319345 [Arabidopsis lyrata subsp.
            lyrata] gi|297331652|gb|EFH62071.1| hypothetical protein
            ARALYDRAFT_319345 [Arabidopsis lyrata subsp. lyrata]
          Length = 773

 Score =  556 bits (1432), Expect = e-155
 Identities = 302/578 (52%), Positives = 401/578 (69%), Gaps = 14/578 (2%)
 Frame = -3

Query: 2049 KKRYTREEKGKGRL-IKDTWLSIGIDPVELDLKPEEENLL--EDAVSGLIPDAASGLVQL 1879
            +++YTREEKGKG + ++D    I I+  E  +  E ENL+  E+     +P+ A+  V +
Sbjct: 206  RRKYTREEKGKGVIQVEDVSSPITIEVGEEAM--EIENLVNNEEPPVVSVPELAAAGVNV 263

Query: 1878 QEISAVEHERQIAKTRVRMKAMRRRLDRGRFQDAARQSASRFAHF--QPEDEDYIAPLPE 1705
            ++            +R R            F+D A+++ASRFA F  Q E+E+ ++   E
Sbjct: 264  EQTQNHNSNEIGNGSRTR-----------HFRDIAKRNASRFARFDAQMEEEEDLSD-KE 311

Query: 1704 PNEEIEDWPGPFSTAMKIIRDRATKQNGRQMNSTSGVTNPAPVIEWKPSNNEGQNRSRPL 1525
               ++EDWPGPFSTA+KII+DR            S     +P I W P  N      +  
Sbjct: 312  GELQVEDWPGPFSTAIKIIKDREENTTPYVGIGVSNKERSSPPI-WVPKRNCSLTPRK-- 368

Query: 1524 VPSLRDLSMNILAKNAEAITSLKDVPDGMRNKLSQLLCDSRRMDCHFVDLLGRGSPSEIC 1345
             PSL++LS+ IL KNA+AITSL  VPD +R KL QLLCDSRRMD HF+DLL +GSP+EIC
Sbjct: 369  APSLQELSLRILVKNADAITSLDYVPDTLRVKLCQLLCDSRRMDVHFLDLLVQGSPTEIC 428

Query: 1344 VKDCSWMTEEEATKIFGACDTNNLTVLQLDQCGRCMPDYILRATLARSPSSLAALVSISL 1165
            V DCSW+TEE+ T+ F  CDT+NL VLQLDQCGRCMPDY+L +TLARSP  L  L S+SL
Sbjct: 429  VPDCSWLTEEQFTECFKNCDTSNLMVLQLDQCGRCMPDYVLHSTLARSPKQLPMLSSLSL 488

Query: 1164 RGGCRITDVGLNALVASAPALRSLNLGQCSLLTSIGINNVADSLGTVLRELYIDDCQSID 985
             G CR++DVGL ALV+SAPA+ S+NL QCSLLTS  I+ ++DSLG+VLRELYI++CQ+ID
Sbjct: 489  SGACRLSDVGLRALVSSAPAITSINLSQCSLLTSSSIDMLSDSLGSVLRELYINECQNID 548

Query: 984  AMLILPALKKLEHLEVLSVADIQTVCDAFIHGFITVCGSNMKELVLANCGKLTDTSLKVI 805
              LI+ ALKK E LEVLS+ADI +V   F+  F+T  G  +K+L+L N GKLTD+S+K I
Sbjct: 549  MKLIVSALKKFEKLEVLSLADIPSVKGQFLKEFVTAIGQTLKQLILTNSGKLTDSSVKAI 608

Query: 804  AESCSGLCALDLVNLHKLTDVAIGHLANGCQSIQTLKLCRNAF---------SDESIAAF 652
            +E+C  L  LDL N+ KLTD ++G+LANGCQ+++ L  CRN+F         SDE++AAF
Sbjct: 609  SENCPNLSVLDLANVCKLTDSSLGYLANGCQALEKLIFCRNSFRQTLHMSLYSDEAVAAF 668

Query: 651  LEASGESLKELSLNNVKRVGHNTAISLARCSKNLLSLDLSWCRNLADEAVGLIVDNCSSL 472
            +E +G SLKELSLNNVK+VGHNTA++LA+ S  L  LD+SWCR ++++ +G  VDNCSSL
Sbjct: 669  VETAGSSLKELSLNNVKKVGHNTALALAKHSDKLQILDVSWCREMSNDLLGYFVDNCSSL 728

Query: 471  RVLKLFGCTQITNTFFNGHSNPLVRIIGLKLTPILEHL 358
            +VLK+FGCTQ+T+ F  GHSNP V+I+GLK+ P L HL
Sbjct: 729  KVLKVFGCTQVTDVFVKGHSNPNVKILGLKMNPFLGHL 766

