BLASTX nr result

ID: Akebia22_contig00028018 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00028018
         (1319 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264...   287   8e-75
ref|XP_007023577.1| HAT transposon superfamily protein, putative...   256   1e-65
ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251...   233   1e-58
ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589...   229   2e-57
ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247...   229   3e-57
ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247...   229   3e-57
ref|XP_002530377.1| protein dimerization, putative [Ricinus comm...   221   4e-55
emb|CBI29151.3| unnamed protein product [Vitis vinifera]              147   1e-32
ref|XP_002273287.1| PREDICTED: uncharacterized protein LOC100260...   147   1e-32
emb|CAN67823.1| hypothetical protein VITISV_028004 [Vitis vinifera]   147   1e-32
ref|XP_006299218.1| hypothetical protein CARUB_v10015366mg [Caps...   146   2e-32
ref|XP_004160654.1| PREDICTED: uncharacterized LOC101213851 [Cuc...   146   2e-32
ref|XP_004147666.1| PREDICTED: uncharacterized protein LOC101213...   146   2e-32
ref|XP_004140930.1| PREDICTED: uncharacterized protein LOC101215...   144   1e-31
ref|XP_007051268.1| HAT dimerization domain-containing protein i...   142   3e-31
ref|XP_007051264.1| HAT dimerization domain-containing protein i...   142   3e-31
ref|XP_007051263.1| HAT dimerization domain-containing protein i...   142   3e-31
ref|XP_002512206.1| DNA binding protein, putative [Ricinus commu...   142   3e-31
ref|XP_002519322.1| DNA binding protein, putative [Ricinus commu...   141   6e-31
ref|XP_003546544.1| PREDICTED: uncharacterized protein LOC100784...   140   1e-30

>ref|XP_002269161.1| PREDICTED: uncharacterized protein LOC100264734 [Vitis vinifera]
          Length = 714

 Score =  287 bits (734), Expect = 8e-75
 Identities = 148/308 (48%), Positives = 194/308 (62%), Gaps = 4/308 (1%)
 Frame = +1

Query: 403  MSTTKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENV 582
            M+  +VS NV +HG V D+QK RV+CNYCAK++SGFSRL+ HLG ++GDV PCGEVPENV
Sbjct: 1    MNANEVSANVHDHGKVVDQQKNRVQCNYCAKLMSGFSRLRYHLGCVKGDVTPCGEVPENV 60

Query: 583  KLRMRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGK--RKLV 756
            K  M+  L E K   L +EV  L + D+P KR    + S +   +L++ Q  G   RK V
Sbjct: 61   KELMKTKLLELKRGSLGKEVGTLEYPDLPWKRKWYPSPSAIEHRKLQTTQKAGSDSRKDV 120

Query: 757  --DSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKS 930
              D++ +  V + V    G   S      +E E  S R A++CIGRFFYE G D S A S
Sbjct: 121  QKDTVSENGVTKEVSLPNGRRGSQKVEDHKEREDSSSRQAKKCIGRFFYELGTDLSAATS 180

Query: 931  LSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDG 1110
             SFQRMI++ +GCG   Y  PSC +LKGWIL EE+KEM  YVK+VR++W +TGCSILLDG
Sbjct: 181  PSFQRMITAALGCGQIGYKLPSCQELKGWILKEEVKEMQQYVKDVRNSWANTGCSILLDG 240

Query: 1111 WTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLDGXXXXXXXXXXXXXXTY 1290
            W +E GR LIN+L DC +G I++RS +I++   DV  L   ++               TY
Sbjct: 241  WMDEKGRNLINVLADCPKGTIYIRSCDISAFIADVDALQFFIEQIIEEVGVENVVQIITY 300

Query: 1291 TISDCMVA 1314
            +ISDCM A
Sbjct: 301  SISDCMAA 308


>ref|XP_007023577.1| HAT transposon superfamily protein, putative [Theobroma cacao]
            gi|508778943|gb|EOY26199.1| HAT transposon superfamily
            protein, putative [Theobroma cacao]
          Length = 709

 Score =  256 bits (655), Expect = 1e-65
 Identities = 129/283 (45%), Positives = 181/283 (63%), Gaps = 4/283 (1%)
 Frame = +1

Query: 403  MSTTKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENV 582
            M++++ S NV +HG   D +K+RV+CNYC K +SGF RLK HLGG+RGDV+PC  V E+V
Sbjct: 1    MASSEASINVHDHGKAVDGKKQRVQCNYCGKEMSGFFRLKYHLGGVRGDVIPCEMVSEDV 60

Query: 583  KLRMRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKR----K 750
            K   +N L ER    LS+EV  L   D+P KRN C  S+     + +S + +G R    +
Sbjct: 61   KELFKNMLPERGGR-LSQEVRDLSRQDLPWKRNGCPNSNVAKKMRRQSCKSSGSRSGEDE 119

Query: 751  LVDSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKS 930
            ++DS+ ++DV E  +       S +       E  S +  +RCIGRFFYE GID ++  S
Sbjct: 120  IIDSMSEDDVKEPAILPSARIVSQSAVTGDPEEEPSCKQNKRCIGRFFYETGIDLTLVNS 179

Query: 931  LSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDG 1110
             SFQRMI+     G   Y  PSC +LKGWIL +E+KEM  YV+++R +W S+GCSILLDG
Sbjct: 180  PSFQRMINDTHCPGQTNYKIPSCQELKGWILKDEVKEMQEYVEKIRQSWASSGCSILLDG 239

Query: 1111 WTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
            W +E GR L++ +VDC +GPI+L S+++++S DDV  L  L D
Sbjct: 240  WIDEKGRNLVSFIVDCPQGPIYLHSSDVSASVDDVDALQLLFD 282


>ref|XP_002269962.2| PREDICTED: uncharacterized protein LOC100251332 [Vitis vinifera]
          Length = 709

 Score =  233 bits (595), Expect = 1e-58
 Identities = 129/307 (42%), Positives = 175/307 (57%), Gaps = 2/307 (0%)
 Frame = +1

Query: 403  MSTTKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENV 582
            M+T  VS    +HG   DEQKK+ +CNYC KVVSGF+RLK HL G RGDV  CGEVP NV
Sbjct: 1    MATNDVSAKFHDHGKAVDEQKKKAQCNYCGKVVSGFTRLKYHLAGKRGDVSACGEVPANV 60

Query: 583  KLRMRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKR--KLV 756
            K  M+  + E +   L + V ++   D+ LKR     S  +   ++ + Q  G    K  
Sbjct: 61   KELMKEKIHELERRKLRKGVEKMNPPDLSLKRKSSLESKNVKQRKVGTIQSAGSDSGKHA 120

Query: 757  DSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLS 936
             + P   V E V F   +  S   +  +E E   +  A++CIGRF YE G DFS A   S
Sbjct: 121  KNDPVSRVNEIVSFSVLSIGSKKASSDKEGEDIPVSQAKKCIGRFLYEMGTDFSAATPTS 180

Query: 937  FQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWT 1116
             +RMI+ I  C   EY  PS  +LKG IL +E+KEM H+V  +R TW +TGCSI++DGW 
Sbjct: 181  LRRMINGIHSCHQVEYEFPSHQELKGCILQDEVKEMLHHVHGIRDTWATTGCSIVVDGWK 240

Query: 1117 NENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLDGXXXXXXXXXXXXXXTYTI 1296
            +E GR L+N LVDC  GPI LR  +I++ +DDV +L+ L +               +++ 
Sbjct: 241  DEKGRNLMNFLVDCPWGPICLRLCDISTLSDDVHSLVLLFEQVIAEVGVENVVQIVSHSA 300

Query: 1297 SDCMVAV 1317
            S+CM AV
Sbjct: 301  SECMAAV 307


>ref|XP_006366948.1| PREDICTED: uncharacterized protein LOC102589543 isoform X1 [Solanum
            tuberosum] gi|565402986|ref|XP_006366949.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X2 [Solanum
            tuberosum] gi|565402988|ref|XP_006366950.1| PREDICTED:
            uncharacterized protein LOC102589543 isoform X3 [Solanum
            tuberosum]
          Length = 686

 Score =  229 bits (584), Expect = 2e-57
 Identities = 126/302 (41%), Positives = 170/302 (56%)
 Frame = +1

Query: 412  TKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENVKLR 591
            T+   ++ +HG+  D++K +VKCNYC KVVSGFSRLKQHLGGIRGDV PC E P  VK  
Sbjct: 2    TRDKIDIHQHGVPVDQKKLKVKCNYCGKVVSGFSRLKQHLGGIRGDVTPCLETPILVKEA 61

Query: 592  MRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPK 771
            +   +  +KN  L +EV QL H ++PLKRN C    E N             K  +S+ K
Sbjct: 62   LEAEILNKKNGNLIKEVGQLQHPNLPLKRNWCPRDGEPN-------------KTSESVNK 108

Query: 772  EDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRMI 951
            +        H G    ++      +   S +   + IGRFFYE GID    +  SFQRM+
Sbjct: 109  K--------HNGV---NSKVAGTSVVDSSSQEISKSIGRFFYEAGIDLDAIRLPSFQRMV 157

Query: 952  SSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENGR 1131
             + +  G      PSC +L+GWIL + +KEM  YV E+R++W STGCSILLDGW + NGR
Sbjct: 158  KATLSPGK-TVKFPSCQELRGWILQDAVKEMQQYVMEIRNSWASTGCSILLDGWIDSNGR 216

Query: 1132 TLINILVDCQRGPIFLRSAEITSSADDVSTLISLLDGXXXXXXXXXXXXXXTYTISDCMV 1311
             LINILV C RG I+LRS++I+S   +V  ++   +                Y+ S CM+
Sbjct: 217  NLINILVYCPRGTIYLRSSDISSFNGNVDAMLLFFEEVLEEVGVETVVQIVAYSTSACMM 276

Query: 1312 AV 1317
             V
Sbjct: 277  EV 278


>ref|XP_004246748.1| PREDICTED: uncharacterized protein LOC101247551 isoform 2 [Solanum
            lycopersicum]
          Length = 682

 Score =  229 bits (583), Expect = 3e-57
 Identities = 126/300 (42%), Positives = 169/300 (56%)
 Frame = +1

Query: 412  TKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENVKLR 591
            T+   ++R+HG+  D++K +VKCNYC KVVSGFSRLKQHLGGIRGDV PC + P  VK  
Sbjct: 2    TRDKIDIRQHGVPVDQKKLKVKCNYCGKVVSGFSRLKQHLGGIRGDVTPCLKTPILVKEA 61

Query: 592  MRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPK 771
            +   +  +KNE L ++V QL H  +PLKRN C    E N             K  +S+ K
Sbjct: 62   LEAEILNKKNENLIKKVGQLQHPSLPLKRNWCPRDGEPN-------------KTSESVNK 108

Query: 772  EDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRMI 951
            +        H G    ++      +   S +   + IGRFFYE GIDF   +  SFQRM+
Sbjct: 109  K--------HNGV---NSNVAGTSVVDSSSQEISKSIGRFFYEAGIDFDAIRLPSFQRML 157

Query: 952  SSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENGR 1131
             + +  G      PSC +LKGWIL + +KEM  YV E+R +W STGCSILLDGW +  GR
Sbjct: 158  KATLSPGK-TIKFPSCQELKGWILQDAVKEMQQYVTEIRKSWASTGCSILLDGWIDSKGR 216

Query: 1132 TLINILVDCQRGPIFLRSAEITSSADDVSTLISLLDGXXXXXXXXXXXXXXTYTISDCMV 1311
             LINILV C RG I+LRS++I+S   +V  ++   +                Y+ S CM+
Sbjct: 217  NLINILVYCPRGTIYLRSSDISSFNGNVDAMLVFFEEVLEEVGVETVVQIVGYSTSACMM 276


>ref|XP_004246747.1| PREDICTED: uncharacterized protein LOC101247551 isoform 1 [Solanum
            lycopersicum]
          Length = 692

 Score =  229 bits (583), Expect = 3e-57
 Identities = 126/300 (42%), Positives = 169/300 (56%)
 Frame = +1

Query: 412  TKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENVKLR 591
            T+   ++R+HG+  D++K +VKCNYC KVVSGFSRLKQHLGGIRGDV PC + P  VK  
Sbjct: 12   TRDKIDIRQHGVPVDQKKLKVKCNYCGKVVSGFSRLKQHLGGIRGDVTPCLKTPILVKEA 71

Query: 592  MRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPK 771
            +   +  +KNE L ++V QL H  +PLKRN C    E N             K  +S+ K
Sbjct: 72   LEAEILNKKNENLIKKVGQLQHPSLPLKRNWCPRDGEPN-------------KTSESVNK 118

Query: 772  EDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRMI 951
            +        H G    ++      +   S +   + IGRFFYE GIDF   +  SFQRM+
Sbjct: 119  K--------HNGV---NSNVAGTSVVDSSSQEISKSIGRFFYEAGIDFDAIRLPSFQRML 167

Query: 952  SSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENGR 1131
             + +  G      PSC +LKGWIL + +KEM  YV E+R +W STGCSILLDGW +  GR
Sbjct: 168  KATLSPGK-TIKFPSCQELKGWILQDAVKEMQQYVTEIRKSWASTGCSILLDGWIDSKGR 226

Query: 1132 TLINILVDCQRGPIFLRSAEITSSADDVSTLISLLDGXXXXXXXXXXXXXXTYTISDCMV 1311
             LINILV C RG I+LRS++I+S   +V  ++   +                Y+ S CM+
Sbjct: 227  NLINILVYCPRGTIYLRSSDISSFNGNVDAMLVFFEEVLEEVGVETVVQIVGYSTSACMM 286


>ref|XP_002530377.1| protein dimerization, putative [Ricinus communis]
            gi|223530094|gb|EEF32010.1| protein dimerization,
            putative [Ricinus communis]
          Length = 698

 Score =  221 bits (564), Expect = 4e-55
 Identities = 125/283 (44%), Positives = 172/283 (60%), Gaps = 4/283 (1%)
 Frame = +1

Query: 403  MSTTKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENV 582
            M+T K   N  +HG   +  K RV+CNYC KVVSG +RLK HLGGIR DVVPC +VPENV
Sbjct: 1    MATGKGFINTYDHGTALE--KNRVQCNYCGKVVSGITRLKCHLGGIRKDVVPCEKVPENV 58

Query: 583  KLRMRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNG----KRK 750
            K   RN L E K E L++E  +    D+P KRN   T + +   + E+ Q  G    K+ 
Sbjct: 59   KEAFRNMLQEIKKEALAKEFGKQCQPDLPWKRNWSPTPNGVKHIKHEASQTAGCESNKQV 118

Query: 751  LVDSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKS 930
             +DS  ++   E +               +  E  S R A+RCIGRFFYE GIDFS A S
Sbjct: 119  DMDSGAEDGAAEYLPVCNRRVDPEFAINGEAKEDASSRQAKRCIGRFFYETGIDFSNANS 178

Query: 931  LSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDG 1110
             SF+RM+++ +G G  +   P+ ++ KGWIL +E+KE   YVK++R++W STGCS+LLDG
Sbjct: 179  PSFKRMLNTTLGDG--QVKIPTIHEFKGWILWDELKETQEYVKKIRNSWASTGCSLLLDG 236

Query: 1111 WTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
            W NE G+ L++ +V+   G I+LRSA ++   +D+  L  LLD
Sbjct: 237  WMNEKGQNLVSFVVEGPEGLIYLRSANVSDIINDLDALQLLLD 279


>emb|CBI29151.3| unnamed protein product [Vitis vinifera]
          Length = 718

 Score =  147 bits (371), Expect = 1e-32
 Identities = 88/275 (32%), Positives = 144/275 (52%), Gaps = 10/275 (3%)
 Frame = +1

Query: 439  HGMVFDEQKKRVKCNYCAKVV--SGFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLFE 612
            HG++ +  ++++KC YC KV+   G SRLKQHL G RG+V PC EVPE+VK++++  L  
Sbjct: 80   HGIMVNGGRQKIKCKYCHKVILGGGISRLKQHLAGERGNVAPCEEVPEDVKVQIQQHLGF 139

Query: 613  RKNEMLSREV--------LQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIP 768
            +  E L R+         L  ++ D     +    S +   ++  S +  GK     +  
Sbjct: 140  KVLEKLKRQKGLKSSKNSLVPYYQDREGGADDVQRSPKAASARGISRKRRGKEIDEGTSY 199

Query: 769  KEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRM 948
            K+   +  LF   T  +  +        +S+  A   + RF YE G+ FS A S  FQ+M
Sbjct: 200  KKKRHKKQLFPTATPVAQVSIHNSFASQESMDQADMAVARFMYEAGVPFSAANSYYFQQM 259

Query: 949  ISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENG 1128
              +I   G   Y  PSC+ L+G +L+  ++++    +E+R +W  TGCS+++D  T+  G
Sbjct: 260  ADAIAAVGP-GYKMPSCHSLRGKLLNRSVQDVEGLCEELRRSWEVTGCSVMVDRCTDRTG 318

Query: 1129 RTLINILVDCQRGPIFLRSAEITSSADDVSTLISL 1233
             T++N  V C +G +FLRS   +  A+    L+SL
Sbjct: 319  HTVLNFYVYCPKGTVFLRSVYASDIANSTEALLSL 353


>ref|XP_002273287.1| PREDICTED: uncharacterized protein LOC100260844 [Vitis vinifera]
          Length = 758

 Score =  147 bits (371), Expect = 1e-32
 Identities = 88/275 (32%), Positives = 144/275 (52%), Gaps = 10/275 (3%)
 Frame = +1

Query: 439  HGMVFDEQKKRVKCNYCAKVV--SGFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLFE 612
            HG++ +  ++++KC YC KV+   G SRLKQHL G RG+V PC EVPE+VK++++  L  
Sbjct: 34   HGIMVNGGRQKIKCKYCHKVILGGGISRLKQHLAGERGNVAPCEEVPEDVKVQIQQHLGF 93

Query: 613  RKNEMLSREV--------LQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIP 768
            +  E L R+         L  ++ D     +    S +   ++  S +  GK     +  
Sbjct: 94   KVLEKLKRQKGLKSSKNSLVPYYQDREGGADDVQRSPKAASARGISRKRRGKEIDEGTSY 153

Query: 769  KEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRM 948
            K+   +  LF   T  +  +        +S+  A   + RF YE G+ FS A S  FQ+M
Sbjct: 154  KKKRHKKQLFPTATPVAQVSIHNSFASQESMDQADMAVARFMYEAGVPFSAANSYYFQQM 213

Query: 949  ISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENG 1128
              +I   G   Y  PSC+ L+G +L+  ++++    +E+R +W  TGCS+++D  T+  G
Sbjct: 214  ADAIAAVGP-GYKMPSCHSLRGKLLNRSVQDVEGLCEELRRSWEVTGCSVMVDRCTDRTG 272

Query: 1129 RTLINILVDCQRGPIFLRSAEITSSADDVSTLISL 1233
             T++N  V C +G +FLRS   +  A+    L+SL
Sbjct: 273  HTVLNFYVYCPKGTVFLRSVYASDIANSTEALLSL 307


>emb|CAN67823.1| hypothetical protein VITISV_028004 [Vitis vinifera]
          Length = 896

 Score =  147 bits (370), Expect = 1e-32
 Identities = 88/275 (32%), Positives = 144/275 (52%), Gaps = 10/275 (3%)
 Frame = +1

Query: 439  HGMVFDEQKKRVKCNYCAKVV--SGFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLFE 612
            HG++ +  ++++KC YC KV+   G SRLKQHL G RG+V PC EVPE+VK++++  L  
Sbjct: 81   HGIMVNGGRQKIKCKYCHKVILGGGISRLKQHLAGERGNVAPCEEVPEDVKVQIQQHLGF 140

Query: 613  RKNEMLSREV--------LQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIP 768
            +  E L R+         L  ++ D     +    S +   ++  S +  GK     +  
Sbjct: 141  KVLEKLKRQKGLKSSKNSLVPYYQDREGGADDVQRSPKAASARGISRKRRGKEIDEGTSY 200

Query: 769  KEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRM 948
            K+   +  LF   T  +  +        +S+  A   + RF YE G+ FS A S  FQ+M
Sbjct: 201  KKKRHKKQLFPTATPVAQVSIHNSFASQESMDQADMAVARFMYEAGVPFSAANSYYFQQM 260

Query: 949  ISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENG 1128
              +I   G   Y  PSC+ L+G +L+  ++++    +E+R +W  TGCS+++D  T+  G
Sbjct: 261  ADAIAAVGP-GYKMPSCHSLRGKLLNRSVQDVEGLCEELRRSWEVTGCSVMVDRCTDRTG 319

Query: 1129 RTLINILVDCQRGPIFLRSAEITSSADDVSTLISL 1233
             T++N  V C +G +FLRS   +  A+    L+SL
Sbjct: 320  HTVLNFYVYCPKGTVFLRSVYASXIANSTEALLSL 354


>ref|XP_006299218.1| hypothetical protein CARUB_v10015366mg [Capsella rubella]
            gi|482567927|gb|EOA32116.1| hypothetical protein
            CARUB_v10015366mg [Capsella rubella]
          Length = 596

 Score =  146 bits (368), Expect = 2e-32
 Identities = 89/277 (32%), Positives = 136/277 (49%)
 Frame = +1

Query: 412  TKVSNNVREHGMVFDEQKKRVKCNYCAKVVSGFSRLKQHLGGIRGDVVPCGEVPENVKLR 591
            T V  N+ EHG  + E + RV+CN C K ++ F RLK HL  +  DV  C  V    ++ 
Sbjct: 5    TDVKMNIHEHG-TWVEGRTRVQCNSCGKRMTNFYRLKLHLAYVGKDVTYCPRVSLTTRMA 63

Query: 592  MRNSLFERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPK 771
                L E                   L+      S  +N ++    +P   RK V     
Sbjct: 64   FYTMLME-------------------LRSRKSGASKNVNAAKPSPGRP---RKRVSP--- 98

Query: 772  EDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRMI 951
                EN              +   +E D  + +QRC+ RF YE+G+DFS   S SFQ ++
Sbjct: 99   ----EN--------------ENAAVEAD--KQSQRCLARFLYEHGVDFSALDSTSFQELM 138

Query: 952  SSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENGR 1131
            +++ G G      P   DL GW+L E +KE+   VKE++ +W  TGCSILLD W ++ GR
Sbjct: 139  TTVTG-GKLALKIPDSRDLNGWMLQEALKEVQDRVKEIKDSWEITGCSILLDAWIDQKGR 197

Query: 1132 TLINILVDCQRGPIFLRSAEITSSADDVSTLISLLDG 1242
             L++ + DC  G ++L+S++++    DV+ L SL++G
Sbjct: 198  DLVSFVADCPAGAVYLKSSDVSGIKTDVTALKSLVNG 234


>ref|XP_004160654.1| PREDICTED: uncharacterized LOC101213851 [Cucumis sativus]
          Length = 565

 Score =  146 bits (368), Expect = 2e-32
 Identities = 100/307 (32%), Positives = 144/307 (46%), Gaps = 38/307 (12%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL-- 606
            EHG+  DE+KK+VKCNYC K+VSG  +R KQHL  I G+V PC   PE V L+++ ++  
Sbjct: 140  EHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAPCKHAPEEVYLKIKENMKW 199

Query: 607  -------------------FERKNEMLSREVLQLFH---------SDVPLKRNLCSTSSE 702
                                +  NE    E  +  H          D  L ++L ST   
Sbjct: 200  HRTGRRHVQTDANEISAYFMQSDNEEEEEEKEESLHHISKERFIDGDKRLSKDLKSTFRG 259

Query: 703  LNCSQLESFQPNGKRKLVDSI-------PKEDVMENVLFHQGTSHSHTTAKRQEIEGDSL 861
            +  S     +P+ KR  +DS+         E V +  L  +G +              S 
Sbjct: 260  M--SPGGGSEPSVKRSRLDSVFLKTTKRQTEQVQKQALVKRGGNRR------------SR 305

Query: 862  RNAQRCIGRFFYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKE 1041
            +     I +FF   GI F  A S+ F +M+ ++   GS     PSC  + G +L EE+  
Sbjct: 306  KEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGS-GLVGPSCQLMSGRLLQEEVAT 364

Query: 1042 MHHYVKEVRHTWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVST 1221
            +  Y+ E++ +W  TGCSIL+D W + +GR  IN LV C RG  F+ S +     DD S 
Sbjct: 365  IKSYLVELKASWAVTGCSILVDNWKDSDGRAFINFLVSCPRGVYFVSSVDAMEIVDDPSN 424

Query: 1222 LISLLDG 1242
            L S+LDG
Sbjct: 425  LFSVLDG 431



 Score = 72.4 bits (176), Expect = 4e-10
 Identities = 34/58 (58%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL 606
           EHG+  DE+KK+VKCNYC K+VS G  RLKQHL  + G+V  C + PE V LRMR +L
Sbjct: 16  EHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKAPEEVYLRMRENL 73


>ref|XP_004147666.1| PREDICTED: uncharacterized protein LOC101213851 [Cucumis sativus]
          Length = 1018

 Score =  146 bits (368), Expect = 2e-32
 Identities = 100/307 (32%), Positives = 144/307 (46%), Gaps = 38/307 (12%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL-- 606
            EHG+  DE+KK+VKCNYC K+VSG  +R KQHL  I G+V PC   PE V L+++ ++  
Sbjct: 140  EHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAPCKHAPEEVYLKIKENMKW 199

Query: 607  -------------------FERKNEMLSREVLQLFH---------SDVPLKRNLCSTSSE 702
                                +  NE    E  +  H          D  L ++L ST   
Sbjct: 200  HRTGRRHVQTDANEISAYFMQSDNEEEEEEKEESLHHISKERFIDGDKRLSKDLKSTFRG 259

Query: 703  LNCSQLESFQPNGKRKLVDSI-------PKEDVMENVLFHQGTSHSHTTAKRQEIEGDSL 861
            +  S     +P+ KR  +DS+         E V +  L  +G +              S 
Sbjct: 260  M--SPGGGSEPSVKRSRLDSVFLKTTKRQTEQVQKQALVKRGGNRR------------SR 305

Query: 862  RNAQRCIGRFFYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKE 1041
            +     I +FF   GI F  A S+ F +M+ ++   GS     PSC  + G +L EE+  
Sbjct: 306  KEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGS-GLVGPSCQLMSGRLLQEEVAT 364

Query: 1042 MHHYVKEVRHTWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVST 1221
            +  Y+ E++ +W  TGCSIL+D W + +GR  IN LV C RG  F+ S +     DD S 
Sbjct: 365  IKSYLVELKASWAVTGCSILVDNWKDSDGRAFINFLVSCPRGVYFVSSVDAMEIVDDPSN 424

Query: 1222 LISLLDG 1242
            L S+LDG
Sbjct: 425  LFSVLDG 431



 Score = 72.4 bits (176), Expect = 4e-10
 Identities = 34/58 (58%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL 606
           EHG+  DE+KK+VKCNYC K+VS G  RLKQHL  + G+V  C + PE V LRMR +L
Sbjct: 16  EHGVAQDEKKKKVKCNYCGKIVSGGIYRLKQHLARVSGEVTYCDKAPEEVYLRMRENL 73


>ref|XP_004140930.1| PREDICTED: uncharacterized protein LOC101215032 [Cucumis sativus]
          Length = 728

 Score =  144 bits (362), Expect = 1e-31
 Identities = 84/271 (30%), Positives = 146/271 (53%), Gaps = 4/271 (1%)
 Frame = +1

Query: 439  HGMVFDEQKKRVKCNYCAKVV--SGFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLFE 612
            HG++ +  ++++KC YC KV+   G SRLKQHL G RG+V PC EVPE VK++++  L  
Sbjct: 21   HGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIQQLLGF 80

Query: 613  RKNEMLSREVLQLFH--SDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPKEDVME 786
            +  E L R+     +  S  P +  +   +  +  S+  S +   K ++ + + KE   +
Sbjct: 81   KVLEKLKRQKNGSKNAVSCFPSREEINDGTHGVQNSRRHSLRRKAK-EVQEGVTKEAKRK 139

Query: 787  NVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRMISSIIG 966
                H  TS    +  +   + +S+  A   + +F Y+ GI  +V  S  FQ+M  +I  
Sbjct: 140  KK--HLPTSFVTQSVNQNTAQIESIEQADMVVAKFVYQAGIPITVVNSQYFQQMADAIAA 197

Query: 967  CGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENGRTLINI 1146
             G   Y  P+ + L G +L   ++++  YV+E+R +W  TGCS+L+D W +  G  +IN 
Sbjct: 198  VGP-GYKMPTYHSLMGKLLDRSVQDVGEYVEELRKSWEVTGCSVLVDRWMDRTGSVVINF 256

Query: 1147 LVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
             V C +G +FL+S +++  ++    L++L D
Sbjct: 257  FVYCSKGTMFLKSVDLSEISESAEGLLNLFD 287


>ref|XP_007051268.1| HAT dimerization domain-containing protein isoform 6 [Theobroma
            cacao] gi|508703529|gb|EOX95425.1| HAT dimerization
            domain-containing protein isoform 6 [Theobroma cacao]
          Length = 897

 Score =  142 bits (359), Expect = 3e-31
 Identities = 91/296 (30%), Positives = 144/296 (48%), Gaps = 28/296 (9%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLF- 609
            EH +  DE+KKRVKCNYC K++SG  +R KQHL  I G+V  C + PE V L+++ ++  
Sbjct: 139  EHCVAQDEKKKRVKCNYCEKIISGGINRFKQHLARIPGEVAYCEKAPEEVYLKIKENMKW 198

Query: 610  ----ERKNEMLSREVLQLF-HSDVP------------LKRNLCSTSSELNCSQLES---- 726
                 R  +  ++E+   + HSD              + +++ +   +++ S + +    
Sbjct: 199  HRTGRRHRKPDTKEISAFYLHSDNEDEGGEEDGYLQCISKDILAIDDKVSDSDIRNNNVR 258

Query: 727  -----FQPNGKRKLVDSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRF 891
                    NG   L+     + V    L  Q ++H   T  +   E  + R     I +F
Sbjct: 259  GRSPGSSGNGAEPLLKRSRLDSVFLKSLKSQTSAHYKQTRAKIGFEKKTRREVISAICKF 318

Query: 892  FYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRH 1071
            FY  GI  + A S  F +M+  ++G      + PS   + G +L EE+  +  Y+ E + 
Sbjct: 319  FYHAGIPSNAANSPYFHKMLE-VVGQYGQGLHGPSSRIISGRLLQEEIANIKEYLAEFKA 377

Query: 1072 TWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
            +W  TGCS++ D W +  GRTLIN LV C RG  FL S + T   +D + L  LLD
Sbjct: 378  SWAITGCSVMADSWNDAQGRTLINFLVSCPRGVCFLSSVDATDMIEDAANLFKLLD 433



 Score = 71.2 bits (173), Expect = 9e-10
 Identities = 34/58 (58%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL 606
           EHG+  DE+KK+VKCNYC K+VS G  RLKQHL  + G+V  C +VPE V L MR +L
Sbjct: 15  EHGIAQDERKKKVKCNYCGKIVSGGIFRLKQHLARLSGEVTHCEKVPEEVCLNMRKNL 72


>ref|XP_007051264.1| HAT dimerization domain-containing protein isoform 2 [Theobroma
            cacao] gi|590720197|ref|XP_007051265.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
            gi|590720203|ref|XP_007051267.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
            gi|590720210|ref|XP_007051269.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
            gi|508703525|gb|EOX95421.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
            gi|508703526|gb|EOX95422.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
            gi|508703528|gb|EOX95424.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
            gi|508703530|gb|EOX95426.1| HAT dimerization
            domain-containing protein isoform 2 [Theobroma cacao]
          Length = 901

 Score =  142 bits (359), Expect = 3e-31
 Identities = 91/296 (30%), Positives = 144/296 (48%), Gaps = 28/296 (9%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLF- 609
            EH +  DE+KKRVKCNYC K++SG  +R KQHL  I G+V  C + PE V L+++ ++  
Sbjct: 143  EHCVAQDEKKKRVKCNYCEKIISGGINRFKQHLARIPGEVAYCEKAPEEVYLKIKENMKW 202

Query: 610  ----ERKNEMLSREVLQLF-HSDVP------------LKRNLCSTSSELNCSQLES---- 726
                 R  +  ++E+   + HSD              + +++ +   +++ S + +    
Sbjct: 203  HRTGRRHRKPDTKEISAFYLHSDNEDEGGEEDGYLQCISKDILAIDDKVSDSDIRNNNVR 262

Query: 727  -----FQPNGKRKLVDSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRF 891
                    NG   L+     + V    L  Q ++H   T  +   E  + R     I +F
Sbjct: 263  GRSPGSSGNGAEPLLKRSRLDSVFLKSLKSQTSAHYKQTRAKIGFEKKTRREVISAICKF 322

Query: 892  FYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRH 1071
            FY  GI  + A S  F +M+  ++G      + PS   + G +L EE+  +  Y+ E + 
Sbjct: 323  FYHAGIPSNAANSPYFHKMLE-VVGQYGQGLHGPSSRIISGRLLQEEIANIKEYLAEFKA 381

Query: 1072 TWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
            +W  TGCS++ D W +  GRTLIN LV C RG  FL S + T   +D + L  LLD
Sbjct: 382  SWAITGCSVMADSWNDAQGRTLINFLVSCPRGVCFLSSVDATDMIEDAANLFKLLD 437



 Score = 71.2 bits (173), Expect = 9e-10
 Identities = 34/58 (58%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL 606
           EHG+  DE+KK+VKCNYC K+VS G  RLKQHL  + G+V  C +VPE V L MR +L
Sbjct: 19  EHGIAQDERKKKVKCNYCGKIVSGGIFRLKQHLARLSGEVTHCEKVPEEVCLNMRKNL 76


>ref|XP_007051263.1| HAT dimerization domain-containing protein isoform 1 [Theobroma
            cacao] gi|590720200|ref|XP_007051266.1| HAT dimerization
            domain-containing protein isoform 1 [Theobroma cacao]
            gi|508703524|gb|EOX95420.1| HAT dimerization
            domain-containing protein isoform 1 [Theobroma cacao]
            gi|508703527|gb|EOX95423.1| HAT dimerization
            domain-containing protein isoform 1 [Theobroma cacao]
          Length = 937

 Score =  142 bits (359), Expect = 3e-31
 Identities = 91/296 (30%), Positives = 144/296 (48%), Gaps = 28/296 (9%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLF- 609
            EH +  DE+KKRVKCNYC K++SG  +R KQHL  I G+V  C + PE V L+++ ++  
Sbjct: 143  EHCVAQDEKKKRVKCNYCEKIISGGINRFKQHLARIPGEVAYCEKAPEEVYLKIKENMKW 202

Query: 610  ----ERKNEMLSREVLQLF-HSDVP------------LKRNLCSTSSELNCSQLES---- 726
                 R  +  ++E+   + HSD              + +++ +   +++ S + +    
Sbjct: 203  HRTGRRHRKPDTKEISAFYLHSDNEDEGGEEDGYLQCISKDILAIDDKVSDSDIRNNNVR 262

Query: 727  -----FQPNGKRKLVDSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRF 891
                    NG   L+     + V    L  Q ++H   T  +   E  + R     I +F
Sbjct: 263  GRSPGSSGNGAEPLLKRSRLDSVFLKSLKSQTSAHYKQTRAKIGFEKKTRREVISAICKF 322

Query: 892  FYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRH 1071
            FY  GI  + A S  F +M+  ++G      + PS   + G +L EE+  +  Y+ E + 
Sbjct: 323  FYHAGIPSNAANSPYFHKMLE-VVGQYGQGLHGPSSRIISGRLLQEEIANIKEYLAEFKA 381

Query: 1072 TWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
            +W  TGCS++ D W +  GRTLIN LV C RG  FL S + T   +D + L  LLD
Sbjct: 382  SWAITGCSVMADSWNDAQGRTLINFLVSCPRGVCFLSSVDATDMIEDAANLFKLLD 437



 Score = 71.2 bits (173), Expect = 9e-10
 Identities = 34/58 (58%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL 606
           EHG+  DE+KK+VKCNYC K+VS G  RLKQHL  + G+V  C +VPE V L MR +L
Sbjct: 19  EHGIAQDERKKKVKCNYCGKIVSGGIFRLKQHLARLSGEVTHCEKVPEEVCLNMRKNL 76


>ref|XP_002512206.1| DNA binding protein, putative [Ricinus communis]
            gi|223548750|gb|EEF50240.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 739

 Score =  142 bits (358), Expect = 3e-31
 Identities = 87/271 (32%), Positives = 137/271 (50%), Gaps = 4/271 (1%)
 Frame = +1

Query: 439  HGMVFDEQKKRVKCNYCAKVV--SGFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSLFE 612
            HG + +  ++++KC YC K+    G SRLKQHL G RG+V PC +VPE VK++++  L  
Sbjct: 21   HGTMVNGGRQKIKCKYCHKIFLGGGISRLKQHLAGERGNVAPCEDVPEEVKVQIQQHLGF 80

Query: 613  RKNEMLSR--EVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPKEDVME 786
            +  E L +  E     +S +   R+       L   Q E+ +   K  L     +    +
Sbjct: 81   KVLERLKKQKEANGSKNSYMLYLRDREEDDVNLGSGQKEASRRRDKEVLEGISKRTKRRK 140

Query: 787  NVLFHQGTSHSHTTAKRQEIEGDSLRNAQRCIGRFFYENGIDFSVAKSLSFQRMISSIIG 966
               +   TS       +     +++  A   + RFFYE GI F+ A S  FQ+M  +II 
Sbjct: 141  KQNYSMATSVITQPICQSFAPPENIELADVAVARFFYEAGIPFTAANSYFFQQMADNIIA 200

Query: 967  CGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRHTWVSTGCSILLDGWTNENGRTLINI 1146
             G   Y  PS   L+G +L+  +++   Y  E+R +W  TGC++L+D W +   RT+IN 
Sbjct: 201  AGP-GYKMPSYTSLRGKLLNRCIQDAEEYCSELRKSWEVTGCTVLVDRWMHGRDRTVINF 259

Query: 1147 LVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
             V C +G +FLRS + +     V  L++L D
Sbjct: 260  FVYCPKGTMFLRSVDASGITKSVEALLNLFD 290


>ref|XP_002519322.1| DNA binding protein, putative [Ricinus communis]
            gi|223541637|gb|EEF43186.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 906

 Score =  141 bits (356), Expect = 6e-31
 Identities = 90/296 (30%), Positives = 143/296 (48%), Gaps = 28/296 (9%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL-- 606
            EHG+  DE+KK+VKCNYC KVVSG  +R KQHL  I G+V PC   PE V L+++ ++  
Sbjct: 136  EHGVAQDERKKKVKCNYCDKVVSGGINRFKQHLARIPGEVAPCKNAPEEVYLKIKENMKW 195

Query: 607  ---------------------FERKNEMLSREVLQLFHSDVPLKRNLCSTSSELNCSQLE 723
                                  + ++E    E   LFH     K  +      L      
Sbjct: 196  HRTGRRPRQPDTKPISTFYKQSDNEDEEDEPEQDALFHKS---KERMVIGDKRLGKDLRI 252

Query: 724  SFQPNGKRKLVDSIPKEDVMENVLFHQGTSHSHTTAKRQEIEGDSLRNAQR----CIGRF 891
            +++        +S+ K+  +++V  +   S   ++ K+ +++  S R +++     I +F
Sbjct: 253  TYKGMSSSNASESLCKKSRLDSVFLNTPNSLIPSSCKQLKVKTRSCRKSRKEVISAICKF 312

Query: 892  FYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYVKEVRH 1071
            FY  G+    A SL F +M+  +   G      P    + G  L EE+  + +Y+ E + 
Sbjct: 313  FYHAGVPLQAANSLYFHKMLELVAQYGQ-GLVGPRSQVISGRFLQEEIATIKNYLFEYKA 371

Query: 1072 TWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLLD 1239
            +W  TGCSIL D W +   RTLIN+LV C  G  F+ S + ++  +D S+L  LLD
Sbjct: 372  SWAVTGCSILADSWVDVEDRTLINLLVSCPHGVYFVASVDASNMLEDASSLFKLLD 427



 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 34/58 (58%), Positives = 42/58 (72%), Gaps = 1/58 (1%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL 606
           EHG+  DE+KK+VKCNYC KVVS G  RLKQHL  + G+V  C + PE V LRM+ +L
Sbjct: 15  EHGVAQDERKKKVKCNYCGKVVSGGIYRLKQHLARVSGEVTYCDKAPEEVYLRMKANL 72


>ref|XP_003546544.1| PREDICTED: uncharacterized protein LOC100784818 isoform X1 [Glycine
            max] gi|571519886|ref|XP_006597914.1| PREDICTED:
            uncharacterized protein LOC100784818 isoform X2 [Glycine
            max] gi|571519888|ref|XP_006597915.1| PREDICTED:
            uncharacterized protein LOC100784818 isoform X3 [Glycine
            max]
          Length = 900

 Score =  140 bits (354), Expect = 1e-30
 Identities = 95/301 (31%), Positives = 146/301 (48%), Gaps = 33/301 (10%)
 Frame = +1

Query: 436  EHGMVFDEQKKRVKCNYCAKVVSG-FSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL-F 609
            EHG+  DE+KK+VKCNYC K+VSG  +R KQHL  I G+V PC   PE+V L+++ ++ +
Sbjct: 137  EHGVAQDERKKKVKCNYCEKIVSGGINRFKQHLARIPGEVAPCKSAPEDVYLKIKENMKW 196

Query: 610  ERKNEMLSR-EVLQLFHSDVPLKRNLCSTSSELNCSQLESFQPNGKRKLVDSIPK--EDV 780
             R    L R E+ +L    +P      S + +  C  +E      K  L+D   +  +D+
Sbjct: 197  HRTGRRLRRPEIKEL----MPFYAK--SDNDDDECELVEDLHHMNKETLMDVDKRFSKDI 250

Query: 781  MENVLFHQGTSHS---HTTAKRQEIEGDSLRNAQR------------------------- 876
            M+    ++G SHS       +R  ++   L+  +                          
Sbjct: 251  MKT---YKGVSHSTGPEPVLRRSRLDNVYLKLPKNQTPQAYKQVKVKTGPTKKLRKEVIS 307

Query: 877  CIGRFFYENGIDFSVAKSLSFQRMISSIIGCGSFEYNAPSCNDLKGWILHEEMKEMHHYV 1056
             I +FFY  GI    A SL F +M+  ++G        P+   + G  L EE+  + +Y+
Sbjct: 308  SICKFFYHAGIPIQAADSLYFHKMLE-VVGQYGQGLVCPASQLMSGRFLQEEINSIKNYL 366

Query: 1057 KEVRHTWVSTGCSILLDGWTNENGRTLINILVDCQRGPIFLRSAEITSSADDVSTLISLL 1236
             E + +W  TGCSI+ D W +  GRT+IN LV C  G  F+ S + T+  +D   L  LL
Sbjct: 367  VEYKASWAITGCSIMADSWIDTQGRTIINFLVSCPHGVYFVSSVDATNVVEDAPNLFKLL 426

Query: 1237 D 1239
            D
Sbjct: 427  D 427



 Score = 69.3 bits (168), Expect = 4e-09
 Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 5/81 (6%)
 Frame = +1

Query: 436 EHGMVFDEQKKRVKCNYCAKVVS-GFSRLKQHLGGIRGDVVPCGEVPENVKLRMRNSL-- 606
           +HG+  DE+KK+V+CNYC K+VS G  RLKQHL  + G+V  C + P+ V L+M+ +L  
Sbjct: 15  DHGIAQDERKKKVRCNYCGKIVSGGIYRLKQHLARVSGEVTYCEKAPDEVYLKMKENLEG 74

Query: 607 --FERKNEMLSREVLQLFHSD 663
               +K + +  +    FHS+
Sbjct: 75  CRSHKKQKQVDTQAYMNFHSN 95

