BLASTX nr result

ID: Akebia24_contig00019531 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00019531
         (1193 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276974.2| PREDICTED: uncharacterized protein LOC100266...   187   8e-45
ref|XP_006489180.1| PREDICTED: uncharacterized protein LOC102618...   160   1e-36
ref|XP_006419694.1| hypothetical protein CICLE_v10004184mg [Citr...   156   2e-35
ref|XP_006489178.1| PREDICTED: uncharacterized protein LOC102618...   155   2e-35
ref|XP_002314392.2| transcription activation domain-interacting ...   152   3e-34
ref|XP_007035445.1| BRCT domain-containing DNA repair protein, p...   152   3e-34
ref|XP_007035444.1| BRCT domain-containing DNA repair protein, p...   152   3e-34
ref|XP_007035443.1| BRCT domain-containing DNA repair protein, p...   152   3e-34
ref|XP_007035442.1| BRCT domain-containing DNA repair protein, p...   152   3e-34
ref|XP_007035440.1| BRCT domain-containing DNA repair protein, p...   152   3e-34
ref|XP_007227074.1| hypothetical protein PRUPE_ppa000432mg [Prun...   146   2e-32
gb|EXB74824.1| PAX-interacting protein 1 [Morus notabilis]            129   2e-27
emb|CBI26129.3| unnamed protein product [Vitis vinifera]              128   4e-27
ref|XP_004486073.1| PREDICTED: uncharacterized protein LOC101501...   126   2e-26
ref|XP_002516852.1| pax transcription activation domain interact...   124   8e-26
ref|XP_003542911.2| PREDICTED: uncharacterized protein LOC100776...   120   1e-24
ref|XP_006594468.1| PREDICTED: uncharacterized protein LOC100776...   120   1e-24
ref|XP_006597548.1| PREDICTED: uncharacterized protein LOC100817...   118   6e-24
ref|XP_006597547.1| PREDICTED: uncharacterized protein LOC100817...   114   1e-22
ref|XP_003547218.1| PREDICTED: uncharacterized protein LOC100817...   114   1e-22

>ref|XP_002276974.2| PREDICTED: uncharacterized protein LOC100266667 [Vitis vinifera]
          Length = 1294

 Score =  187 bits (475), Expect = 8e-45
 Identities = 139/338 (41%), Positives = 176/338 (52%), Gaps = 42/338 (12%)
 Frame = +3

Query: 306  VQFDDTVRLE--VETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVDSDGEGSDRT 479
            V FDDTV LE   ETQL+NL  ETQ LD  D  +N+RTQLL  +++ V ++SDGEG+DRT
Sbjct: 129  VPFDDTVPLEDAFETQLVNLGGETQVLDDPDCTENIRTQLLDGFDDEVVIESDGEGTDRT 188

Query: 480  EVLS----YSDDECERV---------------CCEQ------------IGKDFVMDSDPS 566
            EVLS     SDD   R                 CEQ            IG+     S P 
Sbjct: 189  EVLSDNEGLSDDNSVRSIGVFPVDKENVHNVSACEQDEKGSLLEPHPLIGEQCNAGSVPR 248

Query: 567  SDKEYGSASLRASGHGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVR 746
                  +A+LRASG   A  M   G    P       ++ KE+     IR  ++VG  V 
Sbjct: 249  GFTSVRAAALRASGLA-ARAMTLNGTKSGPLK-----QNDKENKISS-IRGQSAVGAEVA 301

Query: 747  EINRDHDTQNC-NENLKGSRYETKCKGANSTVKKVFNKDA---PTWDNGSVHDN-----V 899
                    +NC  E  +G R ETKC+ + STV+K+F +D     +    ++H N     +
Sbjct: 302  P-------ENCFGEYNEGLRNETKCRVSRSTVRKLFTEDTFAEKSRSTNNIHSNDEGTDL 354

Query: 900  PELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPR 1079
             +LL C ++ AGLSYV SQEP E SQANALD VD+FL +N +E   EV   KT K     
Sbjct: 355  SQLLACGNKSAGLSYVDSQEPEEASQANALDFVDRFLQVNMLEFDQEVDHGKTTKTKSIT 414

Query: 1080 VSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            VS   G QSLAK  + RN V + EIFDW D+ EDEGGG
Sbjct: 415  VSSAKGPQSLAKASNRRNTVGQSEIFDWDDNREDEGGG 452


>ref|XP_006489180.1| PREDICTED: uncharacterized protein LOC102618575 isoform X3 [Citrus
            sinensis]
          Length = 1154

 Score =  160 bits (405), Expect = 1e-36
 Identities = 141/406 (34%), Positives = 196/406 (48%), Gaps = 61/406 (15%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPI-IINGCSTEDFDTQXXXXXXXXXXGEEPKDVQFD--DTVR 329
            M SLGD GD+      +NP  +     T+ FD+Q           E+  + Q +  DTV 
Sbjct: 1    MGSLGD-GDSPNDSSKTNPNDVFARADTQVFDSQFSPPPSPGEKVEDGNNYQLNIYDTVP 59

Query: 330  LEV-------------------ETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVD 452
            +E                    ETQ L L  ETQ LD  +  +N+ TQLL E++  V  D
Sbjct: 60   VEDTFETQVVGDYETQAWNLGDETQALYLGDETQALDFFNDIENMETQLLDEFDYGVAND 119

Query: 453  SDGEGSDRTEVL-----SYSDDECERVC--------------CEQIGKDFVM-------- 551
            SD EGS RTEVL        +D   R C              CEQ   D  +        
Sbjct: 120  SDNEGSGRTEVLRDGEGMPDEDSARRGCNQSLEQEKTQCTSICEQGSHDCTLRPVFQSTP 179

Query: 552  DSDPSSDKEYGS---ASLRASGHGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDP 722
             S+P S + + S   ASLRASG   A  M SK  +   S  +++ +   +   +D +R+ 
Sbjct: 180  RSEPGSVRRFTSIRAASLRASGL-AARSMASK-EISIDSCFVQSADLSPD---QDAVRND 234

Query: 723  TSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWDNGSVH---- 890
             S   +V EI+  HD ++ NE  KG R    C+  +STV+K+F +D+ + D G  +    
Sbjct: 235  GSEPKVVEEIDNIHDLKD-NETEKGLRNGNSCRVGSSTVRKLFTEDSVSQDKGLPNNGDN 293

Query: 891  ----DNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSIND-VELSPEVGLVK 1055
                +N+ +  V + E AGLSYV SQEPGE S+ANAL  V++F+  N+ V+   EV L K
Sbjct: 294  AAGGENLLQFPVNDDELAGLSYVDSQEPGEFSEANALTFVEQFIEKNNFVDFDHEVDLGK 353

Query: 1056 TVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            +  G    VS   G QSLAK  + R+   +  I+DW DS EDEGGG
Sbjct: 354  SKGGKSKPVSTAKGPQSLAKKSNDRSKAGKTGIYDWDDSHEDEGGG 399


>ref|XP_006419694.1| hypothetical protein CICLE_v10004184mg [Citrus clementina]
            gi|557521567|gb|ESR32934.1| hypothetical protein
            CICLE_v10004184mg [Citrus clementina]
          Length = 1168

 Score =  156 bits (394), Expect = 2e-35
 Identities = 142/420 (33%), Positives = 195/420 (46%), Gaps = 75/420 (17%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPI-IINGCSTEDFDTQXXXXXXXXXXGEEPKDVQFD--DTVR 329
            M SLGD GD+      +NP  +     T+ FD+Q           E+  + Q +  DTV 
Sbjct: 1    MGSLGD-GDSPNDSSKTNPNDVFARADTQVFDSQFSPPPAPGEKVEDGNNYQLNIYDTVP 59

Query: 330  LEV-------------------ETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVD 452
            +E                    ETQ L L  ETQ LD  +  +N+ TQLL E++  +  D
Sbjct: 60   VEDTFETQVVGDYETQAWNLGDETQALYLGDETQALDFFNDIENMETQLLDEFDYGIAND 119

Query: 453  SDGEGSDRTEVLS-----YSDDECERVC--------------CEQIGKDF---------- 545
            SD EGS RTEVL        DD   R C              CEQ  KD           
Sbjct: 120  SDNEGSGRTEVLRDGEGIPDDDSARRGCNQSLEQEKTQCTSICEQGEKDLREQRDGSNLG 179

Query: 546  ------------VMDSDPSSDKEYGS---ASLRASGHGTAHVMISKGNVDEPSSILKNIE 680
                           S+P S + + S   ASLRASG   A  M SK  +   S  +++ +
Sbjct: 180  SHDCTLRPVFQSTPRSEPGSVRRFTSIRAASLRASGL-AARSMASK-EISIDSCFVQSAD 237

Query: 681  SCKEHLYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKD 860
               +   +D +R+  S   +V EI+  HD ++ NE  KG R    C+  +STV+K+F +D
Sbjct: 238  LSPD---QDAVRNDGSEPKVVEEIDNIHDLKD-NETEKGLRNGNSCRVGSSTVRKLFTED 293

Query: 861  APTWDNGSVH--------DNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSI 1016
            + + D G  +        +N+ +  V + E AGLSYV SQEPGE SQAN L  V++F+  
Sbjct: 294  SVSQDKGLPNNGDNAAGGENLLQFPVNDGELAGLSYVDSQEPGEFSQANVLTFVEQFIEK 353

Query: 1017 ND-VELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            N+ V+   EV L K+  G    VS   G QSLAK  + R+   +  I+DW DS EDEGGG
Sbjct: 354  NNFVDFDHEVDLGKSKGGKSKPVSTAKGPQSLAKKSNDRSKAGKTGIYDWDDSREDEGGG 413


>ref|XP_006489178.1| PREDICTED: uncharacterized protein LOC102618575 isoform X1 [Citrus
            sinensis] gi|568872031|ref|XP_006489179.1| PREDICTED:
            uncharacterized protein LOC102618575 isoform X2 [Citrus
            sinensis]
          Length = 1168

 Score =  155 bits (393), Expect = 2e-35
 Identities = 142/420 (33%), Positives = 196/420 (46%), Gaps = 75/420 (17%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPI-IINGCSTEDFDTQXXXXXXXXXXGEEPKDVQFD--DTVR 329
            M SLGD GD+      +NP  +     T+ FD+Q           E+  + Q +  DTV 
Sbjct: 1    MGSLGD-GDSPNDSSKTNPNDVFARADTQVFDSQFSPPPSPGEKVEDGNNYQLNIYDTVP 59

Query: 330  LEV-------------------ETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVD 452
            +E                    ETQ L L  ETQ LD  +  +N+ TQLL E++  V  D
Sbjct: 60   VEDTFETQVVGDYETQAWNLGDETQALYLGDETQALDFFNDIENMETQLLDEFDYGVAND 119

Query: 453  SDGEGSDRTEVL-----SYSDDECERVC--------------CEQIGKDF---------- 545
            SD EGS RTEVL        +D   R C              CEQ  KD           
Sbjct: 120  SDNEGSGRTEVLRDGEGMPDEDSARRGCNQSLEQEKTQCTSICEQGEKDLREQRDGSNLG 179

Query: 546  ------------VMDSDPSSDKEYGS---ASLRASGHGTAHVMISKGNVDEPSSILKNIE 680
                           S+P S + + S   ASLRASG   A  M SK  +   S  +++ +
Sbjct: 180  SHDCTLRPVFQSTPRSEPGSVRRFTSIRAASLRASGL-AARSMASK-EISIDSCFVQSAD 237

Query: 681  SCKEHLYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKD 860
               +   +D +R+  S   +V EI+  HD ++ NE  KG R    C+  +STV+K+F +D
Sbjct: 238  LSPD---QDAVRNDGSEPKVVEEIDNIHDLKD-NETEKGLRNGNSCRVGSSTVRKLFTED 293

Query: 861  APTWDNGSVH--------DNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSI 1016
            + + D G  +        +N+ +  V + E AGLSYV SQEPGE S+ANAL  V++F+  
Sbjct: 294  SVSQDKGLPNNGDNAAGGENLLQFPVNDDELAGLSYVDSQEPGEFSEANALTFVEQFIEK 353

Query: 1017 ND-VELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            N+ V+   EV L K+  G    VS   G QSLAK  + R+   +  I+DW DS EDEGGG
Sbjct: 354  NNFVDFDHEVDLGKSKGGKSKPVSTAKGPQSLAKKSNDRSKAGKTGIYDWDDSHEDEGGG 413


>ref|XP_002314392.2| transcription activation domain-interacting family protein [Populus
            trichocarpa] gi|550328889|gb|EEF00563.2| transcription
            activation domain-interacting family protein [Populus
            trichocarpa]
          Length = 1102

 Score =  152 bits (384), Expect = 3e-34
 Identities = 136/385 (35%), Positives = 182/385 (47%), Gaps = 40/385 (10%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCS-TEDFDTQXXXXXXXXXXGEEPKDVQF------- 314
            M SLGDD D   K G  +P      S T+ FD+Q          GE+  ++QF       
Sbjct: 1    MGSLGDDDDGEIKAGREDPNANFAPSYTQPFDSQFLPSPLPGEKGEDANELQFLQSTMLF 60

Query: 315  DDTVRLE--VETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVDSDGEGSDRTEVL 488
            +DTVR+E   ETQ+++L  ETQ LD LDW  NV TQL+ E      +DSDGEG+DRTEVL
Sbjct: 61   EDTVRVEDAFETQVVDLGGETQALDDLDWFQNVDTQLIDEI-----IDSDGEGTDRTEVL 115

Query: 489  S----YSDDE------CERVCCEQIG---------KDFVMDSDPSSDKEY--GSA----- 590
                  SDDE      CE +  E+I          K  V  SD  +D+++  GSA     
Sbjct: 116  DDGNELSDDESGRRGKCESLDGEKIQDTSLSKHGEKGLVEQSDALTDEQHLSGSALKYTS 175

Query: 591  ----SLRASGHGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVREINR 758
                SLR SG   A    S G  +  S  L       E    +  R  T    I  E+  
Sbjct: 176  VRVESLRVSGIA-ARSSASNGTNNSDSCSLVTDGQISEQFTVNTNRSKTK---IPEEVVW 231

Query: 759  DHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWDNGSVHDNVPELLVCNHEFAGL 938
             HD    ++ +K     ++C    S ++K+F +++     G       E+ +C+   AGL
Sbjct: 232  RHDMWRSDDEVKEFSNGSRCNIGCSAMRKLFAENSFIETKGHFVGG-KEVPICDDGVAGL 290

Query: 939  SYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQSLAKG 1118
            SY+ SQEPG+ SQA+AL  V K +  + V L  EV L K  +     +S   G QSLAK 
Sbjct: 291  SYIDSQEPGDLSQADALLCVQKLIEESKV-LFDEVDLGKIDRRKSSHISAAKGVQSLAKK 349

Query: 1119 MSLRNPVKEVEIFDWRDSCEDEGGG 1193
             +      +  IFDW D  EDEGGG
Sbjct: 350  TTDGGTKGKSRIFDWDDGLEDEGGG 374


>ref|XP_007035445.1| BRCT domain-containing DNA repair protein, putative isoform 6
            [Theobroma cacao] gi|508714474|gb|EOY06371.1| BRCT
            domain-containing DNA repair protein, putative isoform 6
            [Theobroma cacao]
          Length = 1254

 Score =  152 bits (384), Expect = 3e-34
 Identities = 129/406 (31%), Positives = 178/406 (43%), Gaps = 61/406 (15%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTE---DFDTQXXXXXXXXXXGEEPKD-------- 305
            M SLGDD   I K    NP   +  +     DFD+Q           +   D        
Sbjct: 1    MGSLGDDNGKI-KPSQMNPKTDSSLAETQPFDFDSQFSLPAVSGDKVDNEDDDGLQYLWS 59

Query: 306  -VQFDD------------TVRLEVETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVC 446
               FDD             V    ETQ+LN   ETQ LD +D  +N+ TQLL E+++ V 
Sbjct: 60   SAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLLDEFDDEVA 119

Query: 447  VDSDGEGSDRTEVLSYSDDE---------CERVCCEQIGKDFVMDSDPSSDKEYGSA--- 590
            +D+DGEG+D TEVL+  D++         C R   ++  K+ +   + S D++  SA   
Sbjct: 120  LDNDGEGTDVTEVLADGDEDSNDDLSRGDCGRFLGQEEKKESLEQCNASIDEQRSSAVHV 179

Query: 591  -------------------------SLRASGHGTAHVMISKGNVDEPSSILKNIESCKEH 695
                                     SLRASG    +  + +G   E  SI  + +     
Sbjct: 180  STPDVEAVPESKPGSVRRFTSVRAASLRASGLAARNAAL-RGMNSESCSIRTDSQ----- 233

Query: 696  LYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWD 875
              +  I +   +   V +IN+ HD  N +E     R    C    ST +K+F +    + 
Sbjct: 234  FSDQCIGNSDGLNPKVEKINQAHDQGNHDEKSISLRNGVNCSVGCSTARKLFAEKEGPFC 293

Query: 876  NGSVHDNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVK 1055
             G   D    LL  +   AG SY+ SQEPGE SQANAL+ V++F+  N +EL  EV L K
Sbjct: 294  RGENADAKEGLLQRDGSLAGFSYIDSQEPGELSQANALNFVERFVIDNLMELDGEVDLGK 353

Query: 1056 TVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            +  G    +S   G QSLAK    R+   E  IFDW D  EDEGGG
Sbjct: 354  STSGKSKLISSAKGLQSLAKKTIERSTAGETRIFDWDDFIEDEGGG 399


>ref|XP_007035444.1| BRCT domain-containing DNA repair protein, putative isoform 5
            [Theobroma cacao] gi|508714473|gb|EOY06370.1| BRCT
            domain-containing DNA repair protein, putative isoform 5
            [Theobroma cacao]
          Length = 1035

 Score =  152 bits (384), Expect = 3e-34
 Identities = 129/406 (31%), Positives = 178/406 (43%), Gaps = 61/406 (15%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTE---DFDTQXXXXXXXXXXGEEPKD-------- 305
            M SLGDD   I K    NP   +  +     DFD+Q           +   D        
Sbjct: 1    MGSLGDDNGKI-KPSQMNPKTDSSLAETQPFDFDSQFSLPAVSGDKVDNEDDDGLQYLWS 59

Query: 306  -VQFDD------------TVRLEVETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVC 446
               FDD             V    ETQ+LN   ETQ LD +D  +N+ TQLL E+++ V 
Sbjct: 60   SAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLLDEFDDEVA 119

Query: 447  VDSDGEGSDRTEVLSYSDDE---------CERVCCEQIGKDFVMDSDPSSDKEYGSA--- 590
            +D+DGEG+D TEVL+  D++         C R   ++  K+ +   + S D++  SA   
Sbjct: 120  LDNDGEGTDVTEVLADGDEDSNDDLSRGDCGRFLGQEEKKESLEQCNASIDEQRSSAVHV 179

Query: 591  -------------------------SLRASGHGTAHVMISKGNVDEPSSILKNIESCKEH 695
                                     SLRASG    +  + +G   E  SI  + +     
Sbjct: 180  STPDVEAVPESKPGSVRRFTSVRAASLRASGLAARNAAL-RGMNSESCSIRTDSQ----- 233

Query: 696  LYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWD 875
              +  I +   +   V +IN+ HD  N +E     R    C    ST +K+F +    + 
Sbjct: 234  FSDQCIGNSDGLNPKVEKINQAHDQGNHDEKSISLRNGVNCSVGCSTARKLFAEKEGPFC 293

Query: 876  NGSVHDNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVK 1055
             G   D    LL  +   AG SY+ SQEPGE SQANAL+ V++F+  N +EL  EV L K
Sbjct: 294  RGENADAKEGLLQRDGSLAGFSYIDSQEPGELSQANALNFVERFVIDNLMELDGEVDLGK 353

Query: 1056 TVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            +  G    +S   G QSLAK    R+   E  IFDW D  EDEGGG
Sbjct: 354  STSGKSKLISSAKGLQSLAKKTIERSTAGETRIFDWDDFIEDEGGG 399


>ref|XP_007035443.1| BRCT domain-containing DNA repair protein, putative isoform 4
            [Theobroma cacao] gi|508714472|gb|EOY06369.1| BRCT
            domain-containing DNA repair protein, putative isoform 4
            [Theobroma cacao]
          Length = 1140

 Score =  152 bits (384), Expect = 3e-34
 Identities = 129/406 (31%), Positives = 178/406 (43%), Gaps = 61/406 (15%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTE---DFDTQXXXXXXXXXXGEEPKD-------- 305
            M SLGDD   I K    NP   +  +     DFD+Q           +   D        
Sbjct: 1    MGSLGDDNGKI-KPSQMNPKTDSSLAETQPFDFDSQFSLPAVSGDKVDNEDDDGLQYLWS 59

Query: 306  -VQFDD------------TVRLEVETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVC 446
               FDD             V    ETQ+LN   ETQ LD +D  +N+ TQLL E+++ V 
Sbjct: 60   SAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLLDEFDDEVA 119

Query: 447  VDSDGEGSDRTEVLSYSDDE---------CERVCCEQIGKDFVMDSDPSSDKEYGSA--- 590
            +D+DGEG+D TEVL+  D++         C R   ++  K+ +   + S D++  SA   
Sbjct: 120  LDNDGEGTDVTEVLADGDEDSNDDLSRGDCGRFLGQEEKKESLEQCNASIDEQRSSAVHV 179

Query: 591  -------------------------SLRASGHGTAHVMISKGNVDEPSSILKNIESCKEH 695
                                     SLRASG    +  + +G   E  SI  + +     
Sbjct: 180  STPDVEAVPESKPGSVRRFTSVRAASLRASGLAARNAAL-RGMNSESCSIRTDSQ----- 233

Query: 696  LYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWD 875
              +  I +   +   V +IN+ HD  N +E     R    C    ST +K+F +    + 
Sbjct: 234  FSDQCIGNSDGLNPKVEKINQAHDQGNHDEKSISLRNGVNCSVGCSTARKLFAEKEGPFC 293

Query: 876  NGSVHDNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVK 1055
             G   D    LL  +   AG SY+ SQEPGE SQANAL+ V++F+  N +EL  EV L K
Sbjct: 294  RGENADAKEGLLQRDGSLAGFSYIDSQEPGELSQANALNFVERFVIDNLMELDGEVDLGK 353

Query: 1056 TVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            +  G    +S   G QSLAK    R+   E  IFDW D  EDEGGG
Sbjct: 354  STSGKSKLISSAKGLQSLAKKTIERSTAGETRIFDWDDFIEDEGGG 399


>ref|XP_007035442.1| BRCT domain-containing DNA repair protein, putative isoform 3
            [Theobroma cacao] gi|508714471|gb|EOY06368.1| BRCT
            domain-containing DNA repair protein, putative isoform 3
            [Theobroma cacao]
          Length = 1200

 Score =  152 bits (384), Expect = 3e-34
 Identities = 129/406 (31%), Positives = 178/406 (43%), Gaps = 61/406 (15%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTE---DFDTQXXXXXXXXXXGEEPKD-------- 305
            M SLGDD   I K    NP   +  +     DFD+Q           +   D        
Sbjct: 1    MGSLGDDNGKI-KPSQMNPKTDSSLAETQPFDFDSQFSLPAVSGDKVDNEDDDGLQYLWS 59

Query: 306  -VQFDD------------TVRLEVETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVC 446
               FDD             V    ETQ+LN   ETQ LD +D  +N+ TQLL E+++ V 
Sbjct: 60   SAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLLDEFDDEVA 119

Query: 447  VDSDGEGSDRTEVLSYSDDE---------CERVCCEQIGKDFVMDSDPSSDKEYGSA--- 590
            +D+DGEG+D TEVL+  D++         C R   ++  K+ +   + S D++  SA   
Sbjct: 120  LDNDGEGTDVTEVLADGDEDSNDDLSRGDCGRFLGQEEKKESLEQCNASIDEQRSSAVHV 179

Query: 591  -------------------------SLRASGHGTAHVMISKGNVDEPSSILKNIESCKEH 695
                                     SLRASG    +  + +G   E  SI  + +     
Sbjct: 180  STPDVEAVPESKPGSVRRFTSVRAASLRASGLAARNAAL-RGMNSESCSIRTDSQ----- 233

Query: 696  LYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWD 875
              +  I +   +   V +IN+ HD  N +E     R    C    ST +K+F +    + 
Sbjct: 234  FSDQCIGNSDGLNPKVEKINQAHDQGNHDEKSISLRNGVNCSVGCSTARKLFAEKEGPFC 293

Query: 876  NGSVHDNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVK 1055
             G   D    LL  +   AG SY+ SQEPGE SQANAL+ V++F+  N +EL  EV L K
Sbjct: 294  RGENADAKEGLLQRDGSLAGFSYIDSQEPGELSQANALNFVERFVIDNLMELDGEVDLGK 353

Query: 1056 TVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            +  G    +S   G QSLAK    R+   E  IFDW D  EDEGGG
Sbjct: 354  STSGKSKLISSAKGLQSLAKKTIERSTAGETRIFDWDDFIEDEGGG 399


>ref|XP_007035440.1| BRCT domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao] gi|590660596|ref|XP_007035441.1| BRCT
            domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao] gi|508714469|gb|EOY06366.1| BRCT
            domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao] gi|508714470|gb|EOY06367.1| BRCT
            domain-containing DNA repair protein, putative isoform 1
            [Theobroma cacao]
          Length = 1225

 Score =  152 bits (384), Expect = 3e-34
 Identities = 129/406 (31%), Positives = 178/406 (43%), Gaps = 61/406 (15%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTE---DFDTQXXXXXXXXXXGEEPKD-------- 305
            M SLGDD   I K    NP   +  +     DFD+Q           +   D        
Sbjct: 1    MGSLGDDNGKI-KPSQMNPKTDSSLAETQPFDFDSQFSLPAVSGDKVDNEDDDGLQYLWS 59

Query: 306  -VQFDD------------TVRLEVETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVC 446
               FDD             V    ETQ+LN   ETQ LD +D  +N+ TQLL E+++ V 
Sbjct: 60   SAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLLDEFDDEVA 119

Query: 447  VDSDGEGSDRTEVLSYSDDE---------CERVCCEQIGKDFVMDSDPSSDKEYGSA--- 590
            +D+DGEG+D TEVL+  D++         C R   ++  K+ +   + S D++  SA   
Sbjct: 120  LDNDGEGTDVTEVLADGDEDSNDDLSRGDCGRFLGQEEKKESLEQCNASIDEQRSSAVHV 179

Query: 591  -------------------------SLRASGHGTAHVMISKGNVDEPSSILKNIESCKEH 695
                                     SLRASG    +  + +G   E  SI  + +     
Sbjct: 180  STPDVEAVPESKPGSVRRFTSVRAASLRASGLAARNAAL-RGMNSESCSIRTDSQ----- 233

Query: 696  LYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKCKGANSTVKKVFNKDAPTWD 875
              +  I +   +   V +IN+ HD  N +E     R    C    ST +K+F +    + 
Sbjct: 234  FSDQCIGNSDGLNPKVEKINQAHDQGNHDEKSISLRNGVNCSVGCSTARKLFAEKEGPFC 293

Query: 876  NGSVHDNVPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVK 1055
             G   D    LL  +   AG SY+ SQEPGE SQANAL+ V++F+  N +EL  EV L K
Sbjct: 294  RGENADAKEGLLQRDGSLAGFSYIDSQEPGELSQANALNFVERFVIDNLMELDGEVDLGK 353

Query: 1056 TVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            +  G    +S   G QSLAK    R+   E  IFDW D  EDEGGG
Sbjct: 354  STSGKSKLISSAKGLQSLAKKTIERSTAGETRIFDWDDFIEDEGGG 399


>ref|XP_007227074.1| hypothetical protein PRUPE_ppa000432mg [Prunus persica]
            gi|462424010|gb|EMJ28273.1| hypothetical protein
            PRUPE_ppa000432mg [Prunus persica]
          Length = 1188

 Score =  146 bits (368), Expect = 2e-32
 Identities = 114/321 (35%), Positives = 156/321 (48%), Gaps = 36/321 (11%)
 Frame = +3

Query: 339  ETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVDSDGEGSDRTEVLS----YSDDE 506
            ETQ+++   ETQ LD ++  +N+ TQLL E+E+ V  D+D E SD TEV       + DE
Sbjct: 104  ETQVMDFGGETQVLDDINCVENMETQLL-EFEDEVVSDTDSEESDTTEVFDDNKHLTHDE 162

Query: 507  C-----------ERVCC---EQIGKDFVMDSDPSSDKEYGS------------ASLRASG 608
                        E++CC   E   K  +  ++ S  ++  +            ASLRASG
Sbjct: 163  SVRRGSGQVVNEEKICCTPFENNVKGLMEQANNSIHEKQNAGSVHMHFTSVRAASLRASG 222

Query: 609  HGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVRE-INRDHDTQNCNE 785
                     KG   E  S+  N +  +    +D         TI  E +N++HD   CNE
Sbjct: 223  LAAR----LKGTNSESPSVPSNSQCLEPLSGKDNAVSLLWGSTIGGEKVNQEHDMGRCNE 278

Query: 786  NLKGSRYETKCKGANSTVKKVFNKDAPTWDNGSVHDNVPE-----LLVCNHEFAGLSYVK 950
             ++ S  E  C+  NST +K+FN+D+   + G  H++        LL      AGLSY+ 
Sbjct: 279  KIRRSTNENNCRIGNSTARKLFNEDSDDEEKGFPHNSSSGEEGEGLLQFPCNLAGLSYID 338

Query: 951  SQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLR 1130
            SQEPGE SQANALD VDKFL +N  E   EV            VS   G Q LAK    +
Sbjct: 339  SQEPGELSQANALDFVDKFLQVNVEEFDKEVDRGTCAGENSKFVSSAKGPQRLAKKAIDK 398

Query: 1131 NPVKEVEIFDWRDSCEDEGGG 1193
            + V+ V IFDW DS E+E GG
Sbjct: 399  SIVQNVGIFDWDDSRENEEGG 419


>gb|EXB74824.1| PAX-interacting protein 1 [Morus notabilis]
          Length = 1069

 Score =  129 bits (325), Expect = 2e-27
 Identities = 109/319 (34%), Positives = 147/319 (46%), Gaps = 34/319 (10%)
 Frame = +3

Query: 339  ETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVDSDGEGSDRTEVLSYSDD----- 503
            ETQ  ++  ETQ LD  +  +++ TQLL +Y      DSDGEGSD TEVL   DD     
Sbjct: 46   ETQEADICGETQVLDDDNCFEHMETQLLDDYGNEDVSDSDGEGSDATEVLGDKDDLTDDF 105

Query: 504  -----ECERV---------CCEQIGKDFVMDSDPSSDKEYG-----------SASLRASG 608
                 EC  V          C    K     +  S  +  G           +ASLRASG
Sbjct: 106  LVGEGECHSVDKKKGQFFLVCNNDLKLIEQPNGASHQQNNGGSGTMRFTSVRAASLRASG 165

Query: 609  HGTAHVMISKGNVDEPSSILKNIESCKEHL-YEDGIRDPTSVGTIVREINRDHDTQNCNE 785
                ++ + +      S    N+ S K  +   D        G   +E +++ D    N 
Sbjct: 166  LAARNMALKETKSASSSIPTNNLASEKTDVSVTDNAVSAMEPG---KEGDQERDLGRYNG 222

Query: 786  NLKGSRYETKCKGANSTVKKVFNKDAPTWDNGSVHD-NVPELLVC--NHEFAGLSYVKSQ 956
             +  S+ E   +G N T +K+F +D          D N  E LV    ++ AGLSYV SQ
Sbjct: 223  IVNSSKDENMARGGNLTARKLFTEDLDIETEELPRDTNGGEELVKLRTYDLAGLSYVDSQ 282

Query: 957  EPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLRNP 1136
            EPGE SQANALD VD+F+  N  E   E+    T  G    VS + G Q LAK  + ++ 
Sbjct: 283  EPGELSQANALDFVDRFIKENVAEFDKEIVRGSTA-GNSKCVSSIKGPQKLAKKANEQSM 341

Query: 1137 VKEVEIFDWRDSCEDEGGG 1193
            + E+ I+DW DS EDEGGG
Sbjct: 342  IGELGIYDWDDSHEDEGGG 360


>emb|CBI26129.3| unnamed protein product [Vitis vinifera]
          Length = 1055

 Score =  128 bits (322), Expect = 4e-27
 Identities = 94/238 (39%), Positives = 123/238 (51%), Gaps = 13/238 (5%)
 Frame = +3

Query: 519  CCEQIGKDFVMD----SDPSSDKEYGSASLRASGHGTAHVMISKGNVDEPSSILKNIESC 686
            CC ++   F       S P       +A+LRASG   A  M   G    P       ++ 
Sbjct: 151  CCSKLHFFFQFSCYSGSVPRGFTSVRAAALRASGLA-ARAMTLNGTKSGPLK-----QND 204

Query: 687  KEHLYEDGIRDPTSVGTIVREINRDHDTQNC-NENLKGSRYETKCKGANSTVKKVFNKDA 863
            KE+     IR  ++VG  V         +NC  E  +G R ETKC+ + STV+K+F +D 
Sbjct: 205  KENKISS-IRGQSAVGAEVAP-------ENCFGEYNEGLRNETKCRVSRSTVRKLFTEDT 256

Query: 864  ---PTWDNGSVHDN-----VPELLVCNHEFAGLSYVKSQEPGEESQANALDIVDKFLSIN 1019
                +    ++H N     + +LL C ++ AGLSYV SQEP E SQANALD VD+FL +N
Sbjct: 257  FAEKSRSTNNIHSNDEGTDLSQLLACGNKSAGLSYVDSQEPEEASQANALDFVDRFLQVN 316

Query: 1020 DVELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
             +E   EV   KT K     VS   G QSLAK  + RN V + EIFDW D+ EDEGGG
Sbjct: 317  MLEFDQEVDHGKTTKTKSITVSSAKGPQSLAKASNRRNTVGQSEIFDWDDNREDEGGG 374


>ref|XP_004486073.1| PREDICTED: uncharacterized protein LOC101501524 [Cicer arietinum]
          Length = 1139

 Score =  126 bits (317), Expect = 2e-26
 Identities = 118/373 (31%), Positives = 179/373 (47%), Gaps = 36/373 (9%)
 Frame = +3

Query: 183  DNIRKHGNSNPIIINGCSTEDFDTQXXXXXXXXXXGEEPKDVQ---FDDTVRL-----EV 338
            D+ R H N++   +N      FDTQ          G++  D +   F+DTV L     E+
Sbjct: 6    DHHRIHSNTS---LNS-DFNHFDTQPFDDSS----GDDDDDDECRYFEDTVPLDDDDEEL 57

Query: 339  ETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVDSDGEGSDRTEVLSYSDDEC--E 512
            ETQ++N+D ETQ L+         TQLL +++ T  ++ + E SD T VL   DDE   +
Sbjct: 58   ETQVVNVDDETQVLEIAG-----ETQLLDDFD-TELLEEEIE-SDGTHVLENVDDEVSDD 110

Query: 513  RVCCEQIGKDFVMDSDPSS------DKEYGSAS------------LRASGHGTAHVMISK 638
               C   G+     +DPS+      +KE GS S            LR +G    ++ + K
Sbjct: 111  DPQCRDSGQS----ADPSNRERGRDEKETGSGSMPPRFTFIRAESLREAGLAKRNMNL-K 165

Query: 639  GNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVREINRDHDTQNCNENLKGSRYETKC 818
               D+ +S++   + C+E L  +               ++      C+E ++    E   
Sbjct: 166  HTQDQSNSVMGMNQFCQEPLAVE---------------SKGKSFLGCSEKVREVDQEFNH 210

Query: 819  KGANSTVKKVFNKDAPTWDNGSVHDN--------VPELLVCNHEFAGLSYVKSQEPGEES 974
              + + V+K+FN D P   NG    N        + +    + E   LSY+ SQEPGE S
Sbjct: 211  DFSRNAVRKLFNDDLPGETNGPSLSNNDFNEGESLGKFPDYHGELERLSYINSQEPGELS 270

Query: 975  QANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLRNPVKEVEI 1154
            Q NALD VD+FL  N +EL+ E   VK ++     + R+ G QSL+K ++ R+  K+ EI
Sbjct: 271  QINALDCVDRFLKSNFMELNQENNCVKKLEKKSESLPRIKGQQSLSKIINDRSKAKKTEI 330

Query: 1155 FDWRDSCEDEGGG 1193
            FDW D+CEDEGGG
Sbjct: 331  FDWDDNCEDEGGG 343


>ref|XP_002516852.1| pax transcription activation domain interacting protein, putative
            [Ricinus communis] gi|223543940|gb|EEF45466.1| pax
            transcription activation domain interacting protein,
            putative [Ricinus communis]
          Length = 1178

 Score =  124 bits (311), Expect = 8e-26
 Identities = 104/326 (31%), Positives = 150/326 (46%), Gaps = 30/326 (9%)
 Frame = +3

Query: 306  VQFDDTVRLE--VETQLLNLDAETQELDGLDWDDNVRTQLLSEYEETVCVDSDGEGSDRT 479
            V F DTV +E   ETQ+++L  ETQ LD  D  +++ TQ++        ++SDGE +D+T
Sbjct: 53   VPFSDTVAVEDAFETQVIDLCDETQVLDDPDCFEHMETQVIDG------LNSDGEETDKT 106

Query: 480  EVLSYSDD---------------ECERVCCEQIGKDFVMDSDPSSDK-------EYGSAS 593
            EVL  +++               + E    E      V D D +             +AS
Sbjct: 107  EVLDDTNELSDGESLRRGKCDSLDVENTSLELTNNRLVEDLDENHISIAAPRFLSVRAAS 166

Query: 594  LRASGHGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVREINRDHDTQ 773
             R SG       +   N +  S +  N     +H  ED ++D  S      E ++  D  
Sbjct: 167  FRVSGLAARRKYLEGINSESSSLLTSN-----QHSEEDTVKDNGS--KTWEEADQVSDEG 219

Query: 774  NCNENLKGSRYETKCKGANSTVKKVFNKD-----APTWDNGSVHDN-VPELLVCNHEFAG 935
               + +KG      CK    T++K+F++D       +  N SV D  + +L   +   AG
Sbjct: 220  RYTDEVKGLINRNSCKIGCPTMRKLFDEDFEIEGLASSSNKSVEDEEMLQLPAADDGLAG 279

Query: 936  LSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQSLAK 1115
            LSY+ SQEPGE SQANAL  V + +  N V    E  L K+ KG    +S   G QSLAK
Sbjct: 280  LSYIDSQEPGESSQANALACVQRLIEENKVLFDNEFDLGKSSKGKSNLISTAKGPQSLAK 339

Query: 1116 GMSLRNPVKEVEIFDWRDSCEDEGGG 1193
              + R   ++  IFDW D  EDEGGG
Sbjct: 340  KANDRGTDRKTRIFDWDDGREDEGGG 365


>ref|XP_003542911.2| PREDICTED: uncharacterized protein LOC100776747 isoform X1 [Glycine
            max]
          Length = 1088

 Score =  120 bits (301), Expect = 1e-24
 Identities = 105/333 (31%), Positives = 158/333 (47%), Gaps = 39/333 (11%)
 Frame = +3

Query: 312  FDDTVRLE----VETQLLNLDAETQELD---GLDWDDNV-RTQLLSEYEETVCVDSDGE- 464
            F+DTV       +ET+ +NL  ETQ LD     D DD V  T+ L+   ET  +D DG+ 
Sbjct: 39   FEDTVPFGDDGVLETEAVNLAGETQALDDGDAFDDDDGVLETEALNLAGETQALD-DGDT 97

Query: 465  -------GSDRTEVL-SYSDDECERVCCEQIGKDFVMDSDPSSDKEYGSAS-------LR 599
                    SDRT+VL +  DD+ + V    +  + V      S ++  S S       LR
Sbjct: 98   QLLEEESDSDRTQVLENVDDDDVDEVSVGNVNGEAVDSKKGESSQQNSSGSMPPRFTVLR 157

Query: 600  ASGHGTAHVMISKGNVDEPSSILKNIESCKEHLY-EDGIRDPTSVGTIVREINRDHDTQN 776
            A     A +  +  ++ E   +  ++E   +       ++D  + G+ +R   +D     
Sbjct: 158  AESLRQAALACNM-DLKETQDVTNSVEGTSQFCQVPQAVKD--NGGSFLRCSEKDDGVDQ 214

Query: 777  CNENLK------GSRYETKCKGANSTVKKVFNKDAPTWDNG--------SVHDNVPELLV 914
             N++ K      G + ++ CK ANSTV+K+FN   P   N         +  D++ +L +
Sbjct: 215  ENKHRKYSVEVGGFKSKSMCKVANSTVRKLFNDVLPVETNQPSLRSNDFNEGDDLDKLPI 274

Query: 915  CNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVS 1094
             + E  GLSYV+SQEPG  SQ NALD VD+FL  N +E   E   VK ++     +    
Sbjct: 275  YHDELTGLSYVESQEPGVLSQDNALDFVDRFLKDNTLEFDQETNSVKKIEEKSKSIPSTK 334

Query: 1095 GTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
               SLAK ++ R       I+DW D+ EDEGGG
Sbjct: 335  RQHSLAKTVNDRGKSGRTGIYDWDDNREDEGGG 367


>ref|XP_006594468.1| PREDICTED: uncharacterized protein LOC100776747 isoform X2 [Glycine
            max]
          Length = 1102

 Score =  120 bits (301), Expect = 1e-24
 Identities = 105/333 (31%), Positives = 158/333 (47%), Gaps = 39/333 (11%)
 Frame = +3

Query: 312  FDDTVRLE----VETQLLNLDAETQELD---GLDWDDNV-RTQLLSEYEETVCVDSDGE- 464
            F+DTV       +ET+ +NL  ETQ LD     D DD V  T+ L+   ET  +D DG+ 
Sbjct: 39   FEDTVPFGDDGVLETEAVNLAGETQALDDGDAFDDDDGVLETEALNLAGETQALD-DGDT 97

Query: 465  -------GSDRTEVL-SYSDDECERVCCEQIGKDFVMDSDPSSDKEYGSAS-------LR 599
                    SDRT+VL +  DD+ + V    +  + V      S ++  S S       LR
Sbjct: 98   QLLEEESDSDRTQVLENVDDDDVDEVSVGNVNGEAVDSKKGESSQQNSSGSMPPRFTVLR 157

Query: 600  ASGHGTAHVMISKGNVDEPSSILKNIESCKEHLY-EDGIRDPTSVGTIVREINRDHDTQN 776
            A     A +  +  ++ E   +  ++E   +       ++D  + G+ +R   +D     
Sbjct: 158  AESLRQAALACNM-DLKETQDVTNSVEGTSQFCQVPQAVKD--NGGSFLRCSEKDDGVDQ 214

Query: 777  CNENLK------GSRYETKCKGANSTVKKVFNKDAPTWDNG--------SVHDNVPELLV 914
             N++ K      G + ++ CK ANSTV+K+FN   P   N         +  D++ +L +
Sbjct: 215  ENKHRKYSVEVGGFKSKSMCKVANSTVRKLFNDVLPVETNQPSLRSNDFNEGDDLDKLPI 274

Query: 915  CNHEFAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVS 1094
             + E  GLSYV+SQEPG  SQ NALD VD+FL  N +E   E   VK ++     +    
Sbjct: 275  YHDELTGLSYVESQEPGVLSQDNALDFVDRFLKDNTLEFDQETNSVKKIEEKSKSIPSTK 334

Query: 1095 GTQSLAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
               SLAK ++ R       I+DW D+ EDEGGG
Sbjct: 335  RQHSLAKTVNDRGKSGRTGIYDWDDNREDEGGG 367


>ref|XP_006597548.1| PREDICTED: uncharacterized protein LOC100817763 isoform X3 [Glycine
            max]
          Length = 1137

 Score =  118 bits (295), Expect = 6e-24
 Identities = 117/376 (31%), Positives = 176/376 (46%), Gaps = 31/376 (8%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTEDFDTQXXXXXXXXXXGEEPKDV--QFDDTVRL 332
            +DS G+D   I +         +   T+ FDT           GEE  DV   F+DTV  
Sbjct: 4    IDSTGNDDRKIHQD-------FDFVDTQPFDTD----------GEED-DVCGYFEDTVPF 45

Query: 333  E-----VETQLLNLDAETQELDGLDWDDNV--RTQLLSEYEETVCVDSDGE--------G 467
            +     +ET+ ++L  ETQ LD  D  D+V   T+ ++  EE   +D DG+         
Sbjct: 46   DEDDDVLETEAVDLAGETQALDDGDAFDDVLLETEAVNLAEEIQALD-DGDTQLLEEESD 104

Query: 468  SDRTEVLSYSDDECERVCCEQIGKDFVMDSDPSSDKEYGSASLRASGHGTAHVMISKGNV 647
            SDRT+VL   DD+   V  + +  +        S ++    SLR +    A  M  K  +
Sbjct: 105  SDRTQVLETVDDD--EVSVDNVNGEAADSKKVESSQQNSYESLRQAA--LACDMDLKETL 160

Query: 648  DEPSSILKNIESCKEHLYEDGIRDPTSVGTIVREINRDHDTQNCNENLK------GSRYE 809
            D  +S+    + C+E L    ++D     + +R   +D      NE+ K      G + +
Sbjct: 161  DVTNSVKGTSQFCQEPLV---VKDKGE--SFLRCSEKDGGVDQENEHGKYSVEVGGFKSK 215

Query: 810  TKCKGANSTVKKVFNKDAPTWDNG--------SVHDNVPELLVCNHEFAGLSYVKSQEPG 965
            + CK ANSTV+K+FN   P   N         +  D++ +L + + E +GLSYV SQEPG
Sbjct: 216  SMCKVANSTVRKLFNDVLPVETNQPSLSSNDFNEGDDLDKLPIYHGELSGLSYVNSQEPG 275

Query: 966  EESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQSLAKGMSLRNPVKE 1145
              SQ NAL  VD+FL  N +E   E   +K ++G    +       SLAK ++ +   + 
Sbjct: 276  VLSQDNALCFVDRFLKDNIMEFDQETNCLK-MEGKSKSIPSTKRQHSLAKTVNDKGKARR 334

Query: 1146 VEIFDWRDSCEDEGGG 1193
              I+DW DS EDEGGG
Sbjct: 335  TGIYDWDDSREDEGGG 350


>ref|XP_006597547.1| PREDICTED: uncharacterized protein LOC100817763 isoform X2 [Glycine
            max]
          Length = 1149

 Score =  114 bits (284), Expect = 1e-22
 Identities = 122/389 (31%), Positives = 180/389 (46%), Gaps = 44/389 (11%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTEDFDTQXXXXXXXXXXGEEPKDV--QFDDTVRL 332
            +DS G+D   I +         +   T+ FDT           GEE  DV   F+DTV  
Sbjct: 4    IDSTGNDDRKIHQD-------FDFVDTQPFDTD----------GEED-DVCGYFEDTVPF 45

Query: 333  E-----VETQLLNLDAETQELDGLDWDDNV--RTQLLSEYEETVCVDSDGE--------G 467
            +     +ET+ ++L  ETQ LD  D  D+V   T+ ++  EE   +D DG+         
Sbjct: 46   DEDDDVLETEAVDLAGETQALDDGDAFDDVLLETEAVNLAEEIQALD-DGDTQLLEEESD 104

Query: 468  SDRTEVLSYSDDECERVCCEQIGKDFVMDS---DPSSDKEYGSA----------SLRASG 608
            SDRT+VL   DD+   V  + +  +   DS   + S    YGS           SLR + 
Sbjct: 105  SDRTQVLETVDDD--EVSVDNVNGE-AADSKKVESSQQNSYGSMPPRFNFLHAESLRQAA 161

Query: 609  HGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVREINRDHDTQNCNEN 788
               A  M  K  +D  +S+    + C+E L    ++D     + +R   +D      NE+
Sbjct: 162  --LACDMDLKETLDVTNSVKGTSQFCQEPLV---VKDKGE--SFLRCSEKDGGVDQENEH 214

Query: 789  LK------GSRYETKCKGANSTVKKVFNKDAPTWDNG--------SVHDNVPELLVCNHE 926
             K      G + ++ CK ANSTV+K+FN   P   N         +  D++ +L + + E
Sbjct: 215  GKYSVEVGGFKSKSMCKVANSTVRKLFNDVLPVETNQPSLSSNDFNEGDDLDKLPIYHGE 274

Query: 927  FAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQS 1106
             +GLSYV SQEPG  SQ NAL  VD+FL  N +E   E   +K ++G    +       S
Sbjct: 275  LSGLSYVNSQEPGVLSQDNALCFVDRFLKDNIMEFDQETNCLK-MEGKSKSIPSTKRQHS 333

Query: 1107 LAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            LAK ++ +   +   I+DW DS EDEGGG
Sbjct: 334  LAKTVNDKGKARRTGIYDWDDSREDEGGG 362


>ref|XP_003547218.1| PREDICTED: uncharacterized protein LOC100817763 isoform X1 [Glycine
            max]
          Length = 1147

 Score =  114 bits (284), Expect = 1e-22
 Identities = 122/389 (31%), Positives = 180/389 (46%), Gaps = 44/389 (11%)
 Frame = +3

Query: 159  MDSLGDDGDNIRKHGNSNPIIINGCSTEDFDTQXXXXXXXXXXGEEPKDV--QFDDTVRL 332
            +DS G+D   I +         +   T+ FDT           GEE  DV   F+DTV  
Sbjct: 4    IDSTGNDDRKIHQD-------FDFVDTQPFDTD----------GEED-DVCGYFEDTVPF 45

Query: 333  E-----VETQLLNLDAETQELDGLDWDDNV--RTQLLSEYEETVCVDSDGE--------G 467
            +     +ET+ ++L  ETQ LD  D  D+V   T+ ++  EE   +D DG+         
Sbjct: 46   DEDDDVLETEAVDLAGETQALDDGDAFDDVLLETEAVNLAEEIQALD-DGDTQLLEEESD 104

Query: 468  SDRTEVLSYSDDECERVCCEQIGKDFVMDS---DPSSDKEYGSA----------SLRASG 608
            SDRT+VL   DD+   V  + +  +   DS   + S    YGS           SLR + 
Sbjct: 105  SDRTQVLETVDDD--EVSVDNVNGE-AADSKKVESSQQNSYGSMPPRFNFLHAESLRQAA 161

Query: 609  HGTAHVMISKGNVDEPSSILKNIESCKEHLYEDGIRDPTSVGTIVREINRDHDTQNCNEN 788
               A  M  K  +D  +S+    + C+E L    ++D     + +R   +D      NE+
Sbjct: 162  --LACDMDLKETLDVTNSVKGTSQFCQEPLV---VKDKGE--SFLRCSEKDGGVDQENEH 214

Query: 789  LK------GSRYETKCKGANSTVKKVFNKDAPTWDNG--------SVHDNVPELLVCNHE 926
             K      G + ++ CK ANSTV+K+FN   P   N         +  D++ +L + + E
Sbjct: 215  GKYSVEVGGFKSKSMCKVANSTVRKLFNDVLPVETNQPSLSSNDFNEGDDLDKLPIYHGE 274

Query: 927  FAGLSYVKSQEPGEESQANALDIVDKFLSINDVELSPEVGLVKTVKGTPPRVSRVSGTQS 1106
             +GLSYV SQEPG  SQ NAL  VD+FL  N +E   E   +K ++G    +       S
Sbjct: 275  LSGLSYVNSQEPGVLSQDNALCFVDRFLKDNIMEFDQETNCLK-MEGKSKSIPSTKRQHS 333

Query: 1107 LAKGMSLRNPVKEVEIFDWRDSCEDEGGG 1193
            LAK ++ +   +   I+DW DS EDEGGG
Sbjct: 334  LAKTVNDKGKARRTGIYDWDDSREDEGGG 362

