BLASTX nr result

ID: Akebia23_contig00019550 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00019550
         (2465 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   842   0.0  
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   840   0.0  
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   816   0.0  
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   790   0.0  
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   783   0.0  
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   777   0.0  
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   777   0.0  
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   770   0.0  
ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   768   0.0  
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         766   0.0  
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   760   0.0  
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   756   0.0  
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   734   0.0  
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   720   0.0  
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   716   0.0  
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   708   0.0  
ref|NP_001051738.1| Os03g0822900 [Oryza sativa Japonica Group] g...   680   0.0  
gb|EAY92386.1| hypothetical protein OsI_14116 [Oryza sativa Indi...   677   0.0  
ref|XP_006651967.1| PREDICTED: uncharacterized protein LOC102714...   671   0.0  
ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757...   652   0.0  

>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  842 bits (2174), Expect = 0.0
 Identities = 420/725 (57%), Positives = 538/725 (74%), Gaps = 1/725 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            +RVQLKCLYC KLF GGGIHRIKEHLACQKGNAS CSRVPL+V+  MQQSLDGV VKKKK
Sbjct: 29   DRVQLKCLYCFKLFRGGGIHRIKEHLACQKGNASTCSRVPLDVRLAMQQSLDGVVVKKKK 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNRISD 2104
            KQ+IAE+I +  P   E+  F +Q +V  GL LL   +T E+   +   R+  + N   D
Sbjct: 89   KQKIAEEITNNNPTFGEVYAFTDQGDVTPGLPLLDDSNTPEACSNLVVSRDV-ISNTTGD 147

Query: 2103 RRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDAVNSHY 1924
            +RKR R +N+S      NA    +   +L +TR    + MAVGRFLYD+G  LDAVNS Y
Sbjct: 148  KRKRWRGKNSSV-----NAYTGAMISASLDATRGNNPIFMAVGRFLYDIGAPLDAVNSEY 202

Query: 1923 FQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADEWVT 1744
            FQPM+DAIAS GP     SYHD+RGWILK +VEE+   +++Y  TWGKTGCS+L D+W T
Sbjct: 203  FQPMVDAIASGGPEAAMPSYHDIRGWILKNSVEEVKNDVDRYTTTWGKTGCSILVDQWNT 262

Query: 1743 ETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITDSTD 1564
            E GR L+ F  YCPEGT+FLKSVDA+ I+ S D LY+LLK VVEEVG+ +VLQVIT S +
Sbjct: 263  EAGRTLLCFLAYCPEGTVFLKSVDASGIMNSSDALYELLKQVVEEVGVRHVLQVITSSEE 322

Query: 1563 HYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHSLVL 1384
             +I AG+RL  TF ++YWTPCAAR ++L+LEDF K+EWIN  +E A+++TRF+YNHS+VL
Sbjct: 323  QFIAAGRRLTDTFPTLYWTPCAARCLDLILEDFAKLEWINAIIEQARAVTRFVYNHSVVL 382

Query: 1383 NMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEGIAV 1204
            NM+RRYT G D+++P +TRSAT+FTTL+ M++LK NLQAMV+SQEWMDC + K+P G+ +
Sbjct: 383  NMLRRYTFGNDIVEPGITRSATNFTTLRRMISLKPNLQAMVTSQEWMDCPYSKKPGGLEM 442

Query: 1203 MEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELVENK 1024
            ++I+ NQSFWS C ++V LT+PL+ +LR+VGS+ RP++GY+ AGMYR K+A+K EL++  
Sbjct: 443  LDIVSNQSFWSSCGLIVCLTNPLLRLLRIVGSERRPSIGYVYAGMYRAKDALKKELIKRD 502

Query: 1023 EYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVPDIN 844
            EYMVYWNIID  W +    PLH AGFFLNP+FFYS++GD  ++I S MFDCIERLVPD  
Sbjct: 503  EYMVYWNIIDHWWEQLWHLPLHAAGFFLNPKFFYSIKGDIHNEIVSRMFDCIERLVPDTK 562

Query: 843  IQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHILSQTCS 664
            +QDKI++E+  YK+A GDFGRK+AIRAR TLLPAEWW+TYGG CPNLARLA  I SQTCS
Sbjct: 563  VQDKISKEINLYKDAVGDFGRKMAIRARDTLLPAEWWSTYGGSCPNLARLATRIQSQTCS 622

Query: 663  ASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISSDSF 484
            +    R+ I FE++  TRN LE QRL DLVFVQYNLRL+ +   K K+ + +DP+S DSF
Sbjct: 623  SLADTRNQIHFERIYDTRNCLERQRLIDLVFVQYNLRLKHM-VSKKKQQDSMDPMSFDSF 681

Query: 483  DFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPNGVK 307
               E+W+T K    ED G SDW A++ P   + M+L S +DE E +  GF+  E    VK
Sbjct: 682  STLEEWITGKDICLEDYGSSDWKAVEPP-SGSPMLLGSSDDEVEELAGGFDDYEIFTRVK 740

Query: 306  DDEGD 292
            + E +
Sbjct: 741  EGEDE 745


>ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED
            zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  840 bits (2170), Expect = 0.0
 Identities = 419/726 (57%), Positives = 544/726 (74%), Gaps = 1/726 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ERVQLKC+YC K+F GGGIHRIKEHLA QKGNAS C  VP +V+ +M++SLDGV VKK+K
Sbjct: 29   ERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGNASTCFHVPSDVRLLMRESLDGVEVKKRK 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNRISD 2104
            KQ+IAE++ +    S+E+DT+ NQ + NTGL ++  PDTL+ +  +   R EG  N   D
Sbjct: 89   KQKIAEEMSNANQVSSEIDTYDNQVDTNTGLLMIEGPDTLQPSSSLLVNR-EGTSNVSGD 147

Query: 2103 RRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDAVNSHY 1924
            RRKRG+ ++++       ++ L V  + L + R    VH+A+GRFL+D+G  LDAVNS Y
Sbjct: 148  RRKRGKGKSSAA-----ESNALVVNTVGLGAKRVNNHVHVAIGRFLFDIGAPLDAVNSVY 202

Query: 1923 FQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADEWVT 1744
            FQPM+DAI S G G+   S  DL+GWILK +VEE+    +K    W +TGCS+L ++W T
Sbjct: 203  FQPMVDAIISGGSGVLMPSCSDLQGWILKKSVEEVKSDNDKVTAAWVRTGCSILVNQWNT 262

Query: 1743 ETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITDSTD 1564
            +TGRIL+NF VYCPEGT+FLKSVDA+ +I S D LY+LLK VVEEVG  +VLQVIT++ +
Sbjct: 263  QTGRILLNFLVYCPEGTVFLKSVDASSVINSSDALYELLKQVVEEVGSKHVLQVITNAEE 322

Query: 1563 HYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHSLVL 1384
             YIVAG+RL  TF ++YWTPCAA  INL+LEDF K+EWINV +E A+SITRF+YNHS+VL
Sbjct: 323  QYIVAGRRLAETFPTLYWTPCAAHCINLILEDFAKLEWINVIIEQARSITRFVYNHSVVL 382

Query: 1383 NMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEGIAV 1204
            NM+RRYT G D+++P +T SAT+FTTLK M++LK+NLQAMV+SQEWMDC + K+P G+ +
Sbjct: 383  NMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDLKNNLQAMVTSQEWMDCPYSKKPGGLEM 442

Query: 1203 MEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELVENK 1024
            ++++ N SFWS   ++  LT+PL+ VLRMVGS  RPAMGY+ AGMYR KE IK ELV+  
Sbjct: 443  LDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSKKRPAMGYVYAGMYRAKETIKKELVKRN 502

Query: 1023 EYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVPDIN 844
            EYM+YWNIID  W +Q  HPLH AGF+LNP+FFYS+EGD  +++ SGM DCIE+LVPD+ 
Sbjct: 503  EYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFFYSMEGDMPNEMLSGMLDCIEKLVPDVK 562

Query: 843  IQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHILSQTCS 664
            +QDKI++E+ SYKN  GDFGRK+A+RAR TLLPAEWW+TYGG CPNLARLAIH+LSQTCS
Sbjct: 563  VQDKISKEINSYKNTVGDFGRKMAVRARDTLLPAEWWSTYGGSCPNLARLAIHVLSQTCS 622

Query: 663  ASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISSDSF 484
                K+++I FE++ +TRN LE QR RDL+FVQ NL+L+QI   ++KE   + P+S D+ 
Sbjct: 623  TLGLKQNSIPFEKLHETRNFLEQQRFRDLIFVQCNLQLRQIGC-ESKEQVSMQPMSFDA- 680

Query: 483  DFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPNGVK 307
               EDWV     F E+   SDW ALD PL  N M+L   +DE E + AGF+  E  NGVK
Sbjct: 681  -TIEDWVMGNDAFLENYTHSDWTALD-PLSVNTMLLGPSSDEVEELGAGFDDYEIFNGVK 738

Query: 306  DDEGDE 289
            + E  E
Sbjct: 739  EQENAE 744


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
            gi|223536481|gb|EEF38128.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 753

 Score =  816 bits (2107), Expect = 0.0
 Identities = 414/732 (56%), Positives = 534/732 (72%), Gaps = 8/732 (1%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ERVQLKC+YC K+F GGGIHRIKEHLA QKGNAS C +VP +V+ IMQQSLDGV VKK+K
Sbjct: 30   ERVQLKCVYCGKIFKGGGIHRIKEHLAGQKGNASTCLQVPTDVKLIMQQSLDGVVVKKRK 89

Query: 2283 KQRIAEDIRSLTP--GSNEMDTFGN-QCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNR 2113
            KQ+IAE+I +L P  G  E++ F N Q EV+TG++L+   + +E +  +     EG  N+
Sbjct: 90   KQKIAEEITNLNPVIGGGEIEVFANDQIEVSTGMELIGVSNVIEPSSSLLISGQEGKANK 149

Query: 2112 ISDRRKRGRPE----NASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSL 1945
              +RRKRGR +    NA+ + V+ N++ + +G     + R  + VHMA+GRFLYD+G  L
Sbjct: 150  GGERRKRGRSKGSGANANAI-VSMNSNRMALG-----AKRVNDHVHMAIGRFLYDIGAPL 203

Query: 1944 DAVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSV 1765
            DAVNS YFQPM+DAIAS G  +   S HDLRGWILK +VEE+   ++K+  TW +TGCSV
Sbjct: 204  DAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWILKNSVEEVKTEVDKHMATWARTGCSV 263

Query: 1764 LADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQ 1585
            L D+W T  GR L++F VYC EG +FLKSVDA+DII S D LY+L+K VVEEVG+ +VLQ
Sbjct: 264  LVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASDIINSSDALYELIKKVVEEVGVRHVLQ 323

Query: 1584 VITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFI 1405
            VIT   + YIV G+RL  TF ++Y  PCAA  I+L+LEDF K+EWI+  +  A+SITRF+
Sbjct: 324  VITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCIDLILEDFAKLEWISTVILQARSITRFV 383

Query: 1404 YNHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLK 1225
            YNHS+VLNM++RYT G +++   +T  AT+F TLK MV+LK  LQ MV+SQEWMDC + K
Sbjct: 384  YNHSVVLNMVKRYTFGSEIVATGLTHFATNFETLKRMVDLKHTLQTMVTSQEWMDCPYSK 443

Query: 1224 QPEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIK 1045
            +P G+ +++++ NQSFWS C ++ +LT+PL+ +LR+V S  RP MGY+ AG+YR KEAIK
Sbjct: 444  KPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVSSKKRPPMGYVYAGIYRAKEAIK 503

Query: 1044 VELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIE 865
             ELV+ K+YMVYWNIID  W +Q   PLH AGFFLNP+  YS+EGD  ++I SGMFDCIE
Sbjct: 504  KELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFFLNPKVLYSIEGDLHNEILSGMFDCIE 563

Query: 864  RLVPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIH 685
            +LVPD+ +QDKI +E+ SYKNA GDFGRK+A+RAR TLLPAEWW+TYGG CPNLARLAI 
Sbjct: 564  KLVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETLLPAEWWSTYGGSCPNLARLAIR 623

Query: 684  ILSQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVD 505
            +LSQ CS+   K + I+ EQ+  T+N LE QRL DLVFVQYNLRL+Q+  GK++E + VD
Sbjct: 624  VLSQPCSSFGYKLNHISLEQIHDTKNCLERQRLSDLVFVQYNLRLKQM-VGKSEEQDSVD 682

Query: 504  PISSDSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQ 328
            P+S D     EDW+  K    ED  +SDWMALD P VN       P+DE + + AGF   
Sbjct: 683  PLSFDCISILEDWIKEKDISTEDYANSDWMALDPPSVNTR----QPHDEVDELGAGFHDY 738

Query: 327  ERPNGVKDDEGD 292
            E  N VKD E D
Sbjct: 739  EIFNRVKDTEDD 750


>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca
            subsp. vesca]
          Length = 754

 Score =  790 bits (2039), Expect = 0.0
 Identities = 406/731 (55%), Positives = 530/731 (72%), Gaps = 9/731 (1%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            +R+QLKC+YC KLF GGGIHRIKEHLA QKGNAS C RVP +V+ +MQQSLDGV VKK+ 
Sbjct: 25   DRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTCLRVPPDVRGLMQQSLDGVVVKKRN 84

Query: 2283 KQRIAEDIRSLTPGSN-EMDTFGN-QCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKN-R 2113
            +Q++ E+I ++TP  + ++D+ G  Q +VN  +QL+       S + V     EG+ + R
Sbjct: 85   RQKLDEEITNITPPQDGDVDSLGGTQSDVNNAVQLVGVSVEPISRLLV---NREGVTSVR 141

Query: 2112 ISDRRKRGRPENASPLPVTPNASMLPVGDLN---LRSTREKELVHMAVGRFLYDVGVSLD 1942
              DRRKRGR +++        +S    G  N   L S +    VH A+GRFL+D+G   +
Sbjct: 142  SMDRRKRGRGKSSW-------SSHGVHGVCNGGALVSRKVNSYVHEAIGRFLFDIGAPPE 194

Query: 1941 AVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVL 1762
            AVNS YFQPMIDAIAS GPG+EP + HDLR WILK +VEE    ++K+R TWG+TGCS+L
Sbjct: 195  AVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWILKNSVEEARNNIDKHRATWGRTGCSIL 254

Query: 1761 ADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQV 1582
             D+W TE   ++++F VY PEGT+FL+SVDA+ II S D LYDLL+ VVE+VG+ +V+QV
Sbjct: 255  VDQWNTELDNVMLSFLVYSPEGTVFLESVDASAIINSSDALYDLLRRVVEDVGVGDVVQV 314

Query: 1581 ITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIY 1402
            IT   + ++VAG+RL  TF +++W PCAAR ++L+LEDFG ++WI+  +E A+SIT+F+Y
Sbjct: 315  ITSGEEQFVVAGRRLADTFPNLFWIPCAARCLDLILEDFGSLDWIHAVIEQARSITKFVY 374

Query: 1401 NHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQ 1222
            NH++VLN++RR T G D+++P +TR  T FTTLK +V+LK  LQ MV+SQEWMDC + K+
Sbjct: 375  NHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTLKRLVDLKHCLQVMVTSQEWMDCPYSKE 434

Query: 1221 PEGIAVMEII--YNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAI 1048
            P G+ + ++I   +QSFWS C+++V LT PL+ VLRMVG + RPAMG+I AGMYR KEAI
Sbjct: 435  PGGLEISDLISDRDQSFWSSCTLIVRLTSPLLRVLRMVGCEKRPAMGFIYAGMYRAKEAI 494

Query: 1047 KVELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCI 868
            K ELV+ +EYMVYWNIID RW +    PLH AGF+LNP+ FYS+EGD  + I SGM+DCI
Sbjct: 495  KKELVKREEYMVYWNIIDQRWEQHWNFPLHAAGFYLNPKIFYSIEGDIHNSIQSGMYDCI 554

Query: 867  ERLVPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAI 688
            ER+VPDI +QDKI +E+ISYKNA GDF RK+AIRAR TLLPAEWW+TYGGGCPNLARLAI
Sbjct: 555  ERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAIRARDTLLPAEWWSTYGGGCPNLARLAI 614

Query: 687  HILSQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPV 508
             ILSQTC +   ++S I FE+    RN LE QRLRDLVFVQYNLRL+Q+   KN   + +
Sbjct: 615  RILSQTCGSIGYRQSQIPFEKAHGIRNCLERQRLRDLVFVQYNLRLRQM-VDKNNGEDCM 673

Query: 507  DPISSDSFDFAEDWVTMKKEFFEDGD-SDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEY 331
            DPIS DS    EDWVT K    ED + S WM+LD P  +  M+L   ND+ E + +GF  
Sbjct: 674  DPISFDSISLVEDWVTGKDVCSEDFEGSSWMSLDSPSAST-MLLGPSNDDAEDLGSGFYD 732

Query: 330  QERPNGVKDDE 298
             E  +  KD E
Sbjct: 733  GEIFSRGKDGE 743


>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine
            max] gi|571542833|ref|XP_006601996.1| PREDICTED:
            uncharacterized protein LOC100806265 isoform X2 [Glycine
            max]
          Length = 758

 Score =  783 bits (2022), Expect = 0.0
 Identities = 397/732 (54%), Positives = 521/732 (71%), Gaps = 8/732 (1%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ++VQLKC+YCLK+F GGGIHRIKEHLACQKGNAS CSRVP +V+  MQQSLDGV VKK++
Sbjct: 29   DKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRR 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQ---CEVNTGLQLLA--HPDTLESNMGVFDRRNEGMK 2119
            KQRI E+I S+ P +  +++  N     +VN GLQ +   H  TL  N G      EGM 
Sbjct: 89   KQRIEEEIMSVNPLTTVVNSLPNNNQVVDVNQGLQAIGVEHNSTLVVNPG------EGMS 142

Query: 2118 NRISDRRKRGRPENASPLPVTPNASMLPVGDLN-LRSTREKELVHMAVGRFLYDVGVSLD 1942
              +  R+K    +N  P  V  N+  +   + N L   +    ++MA+GRFLYD+G   D
Sbjct: 143  RNMERRKKMRAAKN--PAAVYANSEDVVAVEKNGLFPKKMDNHIYMAIGRFLYDIGAPFD 200

Query: 1941 AVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVL 1762
            AVN  +FQ M+DAIAS+G G E  S+H+LRGWILK +VEE+   +++ + TWG+TGCS+L
Sbjct: 201  AVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSIL 260

Query: 1761 ADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQV 1582
             D+W TET RILI+F  YCPEG +FLKS+DAT+I+ S D LYDL+K VVEE+G+  V+QV
Sbjct: 261  VDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTSPDFLYDLIKQVVEEIGVGKVVQV 320

Query: 1581 ITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIY 1402
            IT   + Y +AG+RL+ TF ++YW+P AA  I+L+LEDFG +EWI+  +E AKS+TRF+Y
Sbjct: 321  ITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILEDFGNLEWISAVIEQAKSVTRFVY 380

Query: 1401 NHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQ 1222
            N+S +LNM++RYT G D++ P  +R AT+FTTLK MV+LK NLQA+V+SQEW DC + KQ
Sbjct: 381  NYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMVDLKHNLQALVTSQEWADCPYSKQ 440

Query: 1221 PEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKV 1042
              G+ +++ + NQ+FWS C ++V LT PL+ VLR+ GS+ RP MGY+ AGMYRVKEAIK 
Sbjct: 441  TAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAGSEMRPGMGYVYAGMYRVKEAIKK 500

Query: 1041 ELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIER 862
             L + +EYMVYWNII  RW R   HPLH AGF+LNP+FFYS++GD   +I SGMFDCIER
Sbjct: 501  ALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPKFFYSIQGDILGQIVSGMFDCIER 560

Query: 861  LVPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHI 682
            LVPD  IQDKI +E+  YK+A GDFGRK+A+RAR  LLP+EWW+TYGGGCPNL+RLAI I
Sbjct: 561  LVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPNLSRLAIRI 620

Query: 681  LSQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDP 502
            LSQT S    KR+ + FEQ+  TRN +E Q L DLVFV  NLRL+Q+    +KE    DP
Sbjct: 621  LSQTSSVMSCKRNQVPFEQIINTRNYIERQHLTDLVFVHCNLRLRQMFM--SKEQNFSDP 678

Query: 501  ISSDSFDFAEDWVTMKKEFFED--GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQ 328
            +S D+    E+W+  +  + +D  G+SDWMALD   VN  M+L   NDE E +  G++  
Sbjct: 679  LSFDNVSNVEEWIRPRDLYVDDECGNSDWMALDPSSVNT-MLLRPLNDETEDLGEGYDDY 737

Query: 327  ERPNGVKDDEGD 292
            E  +  KD E +
Sbjct: 738  EIFSFGKDSEDE 749


>ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036895|gb|ESW35425.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score =  777 bits (2006), Expect = 0.0
 Identities = 387/728 (53%), Positives = 517/728 (71%), Gaps = 4/728 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ++VQLKC+YC K+F GGGIHRIKEHLACQKGNAS CSRVP +V+  MQQSLDGV VKK++
Sbjct: 29   DKVQLKCIYCQKMFKGGGIHRIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRR 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTF--GNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNRI 2110
            KQ+I E+I S+ P +  +++    NQ +VN GLQ +     ++ N  +     EGM   +
Sbjct: 89   KQKIEEEIMSVNPLTTVVNSLPNNNQVDVNQGLQAIG----VDHNSSLVVNPGEGMSKNM 144

Query: 2109 SDRRKRGRPENASPLPVTPNASMLPVGDLN-LRSTREKELVHMAVGRFLYDVGVSLDAVN 1933
              R+K    +N  P  +  N+  +   + N L   R    +HMA+GRFLYD+G   DAVN
Sbjct: 145  ERRKKMRASKN--PAAIYANSEGVVAVEKNGLFPKRVDNHIHMAIGRFLYDIGAPFDAVN 202

Query: 1932 SHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADE 1753
            S YF  M+DAI+SRG G E  S+H+LRGWILK +VEE+   +++ + TWG+TGCS+L D+
Sbjct: 203  SVYFHEMVDAISSRGAGFERPSHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILVDQ 262

Query: 1752 WVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITD 1573
            W TETGR+LI+F  YCPEG +FLKS+DAT+I  S D LYD++K VV+EVG+  VLQVIT 
Sbjct: 263  WATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYDMIKQVVDEVGVGQVLQVITS 322

Query: 1572 STDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHS 1393
              + Y VAG+RL  TF ++YW+P AA  I+ +LEDFG +EWI+  +E AKS+TRF+YN+S
Sbjct: 323  GEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLEWISAVIEQAKSVTRFVYNYS 382

Query: 1392 LVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEG 1213
             +L M++RYT G D++ P  ++ AT+FTTLK MV+LK NLQA+V+SQEW DC + K+  G
Sbjct: 383  AILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNLQALVTSQEWADCPYSKKSAG 442

Query: 1212 IAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELV 1033
            + +++ + +Q+FWS C ++V LT PL+ VLR+  S+ RPAMGYI AG+YR KEAIK  L 
Sbjct: 443  LEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPAMGYIYAGIYRAKEAIKKALG 502

Query: 1032 ENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVP 853
            + +EYMVYWNII  RW R   HPLH AGF+LNP+FFYS++GD   +I SGMFDCIERLV 
Sbjct: 503  KREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHSQIVSGMFDCIERLVS 562

Query: 852  DINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHILSQ 673
            D  IQDKI +E+  YK+A GDFGRK+A+RAR  LLP+EWW+TYGGGCPNL+RLAI ILSQ
Sbjct: 563  DTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPNLSRLAIRILSQ 622

Query: 672  TCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISS 493
            T S    KR+ I FEQ+  TRN +E Q L DLVFV  NLRL+Q+   K+++    DP+S 
Sbjct: 623  TSSVMSCKRNQIPFEQIVNTRNYIERQHLTDLVFVHCNLRLRQMFTSKDQDFS--DPLSF 680

Query: 492  DSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPN 316
            D+  + ++W+  +  + ++ G+SDWMALD   VN  M+L   NDE E +  GF+  E  +
Sbjct: 681  DTISYVDEWIRPRDLYIDEYGNSDWMALDPSSVNT-MLLRPLNDEAEELDEGFDDDEIFS 739

Query: 315  GVKDDEGD 292
              KD E +
Sbjct: 740  CGKDSEDE 747


>ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036894|gb|ESW35424.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score =  777 bits (2006), Expect = 0.0
 Identities = 387/728 (53%), Positives = 517/728 (71%), Gaps = 4/728 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ++VQLKC+YC K+F GGGIHRIKEHLACQKGNAS CSRVP +V+  MQQSLDGV VKK++
Sbjct: 142  DKVQLKCIYCQKMFKGGGIHRIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRR 201

Query: 2283 KQRIAEDIRSLTPGSNEMDTF--GNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNRI 2110
            KQ+I E+I S+ P +  +++    NQ +VN GLQ +     ++ N  +     EGM   +
Sbjct: 202  KQKIEEEIMSVNPLTTVVNSLPNNNQVDVNQGLQAIG----VDHNSSLVVNPGEGMSKNM 257

Query: 2109 SDRRKRGRPENASPLPVTPNASMLPVGDLN-LRSTREKELVHMAVGRFLYDVGVSLDAVN 1933
              R+K    +N  P  +  N+  +   + N L   R    +HMA+GRFLYD+G   DAVN
Sbjct: 258  ERRKKMRASKN--PAAIYANSEGVVAVEKNGLFPKRVDNHIHMAIGRFLYDIGAPFDAVN 315

Query: 1932 SHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADE 1753
            S YF  M+DAI+SRG G E  S+H+LRGWILK +VEE+   +++ + TWG+TGCS+L D+
Sbjct: 316  SVYFHEMVDAISSRGAGFERPSHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILVDQ 375

Query: 1752 WVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITD 1573
            W TETGR+LI+F  YCPEG +FLKS+DAT+I  S D LYD++K VV+EVG+  VLQVIT 
Sbjct: 376  WATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYDMIKQVVDEVGVGQVLQVITS 435

Query: 1572 STDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHS 1393
              + Y VAG+RL  TF ++YW+P AA  I+ +LEDFG +EWI+  +E AKS+TRF+YN+S
Sbjct: 436  GEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLEWISAVIEQAKSVTRFVYNYS 495

Query: 1392 LVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEG 1213
             +L M++RYT G D++ P  ++ AT+FTTLK MV+LK NLQA+V+SQEW DC + K+  G
Sbjct: 496  AILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNLQALVTSQEWADCPYSKKSAG 555

Query: 1212 IAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELV 1033
            + +++ + +Q+FWS C ++V LT PL+ VLR+  S+ RPAMGYI AG+YR KEAIK  L 
Sbjct: 556  LEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPAMGYIYAGIYRAKEAIKKALG 615

Query: 1032 ENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVP 853
            + +EYMVYWNII  RW R   HPLH AGF+LNP+FFYS++GD   +I SGMFDCIERLV 
Sbjct: 616  KREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHSQIVSGMFDCIERLVS 675

Query: 852  DINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHILSQ 673
            D  IQDKI +E+  YK+A GDFGRK+A+RAR  LLP+EWW+TYGGGCPNL+RLAI ILSQ
Sbjct: 676  DTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPNLSRLAIRILSQ 735

Query: 672  TCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISS 493
            T S    KR+ I FEQ+  TRN +E Q L DLVFV  NLRL+Q+   K+++    DP+S 
Sbjct: 736  TSSVMSCKRNQIPFEQIVNTRNYIERQHLTDLVFVHCNLRLRQMFTSKDQDFS--DPLSF 793

Query: 492  DSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPN 316
            D+  + ++W+  +  + ++ G+SDWMALD   VN  M+L   NDE E +  GF+  E  +
Sbjct: 794  DTISYVDEWIRPRDLYIDEYGNSDWMALDPSSVNT-MLLRPLNDEAEELDEGFDDDEIFS 852

Query: 315  GVKDDEGD 292
              KD E +
Sbjct: 853  CGKDSEDE 860


>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine
            max] gi|571489936|ref|XP_006591345.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X2 [Glycine
            max] gi|571489939|ref|XP_006591346.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X3 [Glycine
            max]
          Length = 759

 Score =  770 bits (1988), Expect = 0.0
 Identities = 391/731 (53%), Positives = 515/731 (70%), Gaps = 7/731 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ++VQLKC+YCLK+F GGGIHRIKEHLACQKGNAS CSRVP +V+  MQQSLDGV VKK++
Sbjct: 29   DKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRR 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQ----CEVNTGLQLLAHPDTLESNMGVFDRRNEGMKN 2116
            KQRI E+I S+ P +  +++  N      +VN GLQ +     +E N  +     EGM  
Sbjct: 89   KQRIEEEIMSVNPLTTVVNSLPNNNNRVVDVNQGLQAIG----VEHNSSLVVNPGEGMSR 144

Query: 2115 RISDRRKRGRPENASPLPVTPNAS-MLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDA 1939
             +  R+K    +N  P  V  N+  ++ V    L   +    ++MA+GRFLYD+G   DA
Sbjct: 145  NMERRKKMRATKN--PAAVYANSEGVIAVEKNGLFPKKMDNHIYMAIGRFLYDIGAPFDA 202

Query: 1938 VNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLA 1759
            VNS YFQ M+DAIASRG G E   +H+LRGWILK +VEE+   +++ + TWG+TGCS+L 
Sbjct: 203  VNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILV 262

Query: 1758 DEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVI 1579
            D+W TETG+ILI+F  YCPEG +FL+S+DAT+I  S D LYDL+K VVEEVG   V+QVI
Sbjct: 263  DQWTTETGKILISFLAYCPEGLVFLRSLDATEISTSADFLYDLIKQVVEEVGAGQVVQVI 322

Query: 1578 TDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYN 1399
            T   + Y +AG+RL  TF ++Y +P AA  I+L+LEDFG +EWI+  +E A+S+TRF+YN
Sbjct: 323  TSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILEDFGNLEWISAVIEQARSVTRFVYN 382

Query: 1398 HSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQP 1219
            +S +LNM++RYT G D++ P  +  AT+FTTLK MV+LK NLQA+V+SQEW D  + KQ 
Sbjct: 383  YSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMVDLKHNLQALVTSQEWADSPYSKQT 442

Query: 1218 EGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVE 1039
             G+ +++ + NQ+FWS C ++V LT PL+ V+R+  S+ RPAMGY+ AGMYR KEAIK  
Sbjct: 443  AGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIASSEMRPAMGYVYAGMYRAKEAIKKA 502

Query: 1038 LVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERL 859
            L + +EYMVYWNII  RW R   HPLH AGF+LNP+FFYS++GD   +I SGMFDCIERL
Sbjct: 503  LGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHGQIVSGMFDCIERL 562

Query: 858  VPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHIL 679
            VPD  IQDKI +E+  YK+A GDFGRK+A+RAR  LLP+EWW+TYGGGCPNL+RLAI IL
Sbjct: 563  VPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNLLPSEWWSTYGGGCPNLSRLAIRIL 622

Query: 678  SQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPI 499
            SQT S    KR+ I FEQ+  TRN +E Q L DLVFV  NLRL+Q+    +KE +  DP+
Sbjct: 623  SQTSSVMSCKRNQIPFEQIINTRNYIERQHLTDLVFVHCNLRLRQMFM--SKEQDFSDPL 680

Query: 498  SSDSFDFAEDWVTMKKEFFED--GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQE 325
            S D+    E+W+  +  + +D  G+SDWMALD   VN  M+L   NDE E +  G++  E
Sbjct: 681  SFDNISNVEEWIRPRDLYIDDECGNSDWMALDPSSVNT-MLLRPLNDEAEDLGEGYDDYE 739

Query: 324  RPNGVKDDEGD 292
              +  KD E +
Sbjct: 740  IFSCGKDSEDE 750


>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score =  768 bits (1984), Expect = 0.0
 Identities = 392/726 (53%), Positives = 512/726 (70%), Gaps = 4/726 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ERVQLKC+YC K+F GGGIHRIKEHLA QKGNAS C RV  +V+ +MQ SL+GV +KK+K
Sbjct: 29   ERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNASTCLRVQPDVRLLMQDSLNGVVMKKRK 88

Query: 2283 KQRIAEDIRSLTPGSNEMDT---FGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNR 2113
            KQ++AE+I +   G+   D    F + C ++T + LL  P  +E    +F  R++G  N 
Sbjct: 89   KQKLAEEITTYNAGTATSDIAAEFTDTCGLDTQVDLLPMPQAIEHTSNLFLNRDQG-PNN 147

Query: 2112 ISDRRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDAVN 1933
            I  R+K+ R    +      NA +LP+     +S R    VHMAV RFL D  V LDAVN
Sbjct: 148  IGARKKKSRIRKGASSS-NNNAMLLPIN----QSKRVNNHVHMAVARFLLDARVPLDAVN 202

Query: 1932 SHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADE 1753
            S YFQPMID IAS+GP +   SYH+LR W+LK +V+E+   +++   TW ++GCSVL DE
Sbjct: 203  SVYFQPMIDVIASQGPQVSAPSYHELRSWVLKASVQEVRNDIDQCSSTWARSGCSVLVDE 262

Query: 1752 WVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITD 1573
            W+T  G+ L+NF VYCPEGTMFL+SVDA+ +I S D LY+LLK VVEEVG+ NVLQV+T 
Sbjct: 263  WITGKGKTLLNFLVYCPEGTMFLRSVDASTLINSTDYLYELLKEVVEEVGVRNVLQVVTS 322

Query: 1572 STDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHS 1393
            + + YI+AGKRL   + +++WTPCAA SI+LMLED  K+EWI+  +E AKSI+RFIYN++
Sbjct: 323  NEERYIIAGKRLTDAYPTLFWTPCAAHSIDLMLEDLKKLEWIDTIMEQAKSISRFIYNNN 382

Query: 1392 LVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEG 1213
            ++L+MMR++T G DL+   +TRSATDF TLK MVN+K NLQ+MV+S EW +  + K+PEG
Sbjct: 383  ILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMVNIKHNLQSMVTSVEWAESPYSKKPEG 442

Query: 1212 IAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELV 1033
             A+++ I NQSFWS CS+V  LTDP++ +LRMV S+ RPAM Y+ AG+YR KE IK ELV
Sbjct: 443  FALLDYIGNQSFWSTCSLVCRLTDPILRLLRMVSSEERPAMAYVYAGVYRAKETIKKELV 502

Query: 1032 ENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVP 853
              K+Y VYWNIID RW    +HPLH AGF+LNP+FFY+ E D    I S ++DCIE+LVP
Sbjct: 503  NKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFYTTEEDVHLHIRSLVYDCIEKLVP 562

Query: 852  DINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHILSQ 673
            D  IQDKI +E  SY N+ GDFGRK+A+RAR TL PAEWW+TYGGGCPNLARLAI ILSQ
Sbjct: 563  DPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPAEWWSTYGGGCPNLARLAIRILSQ 622

Query: 672  TCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISS 493
            T S   SK   +  E++ +T+N +EHQRL DL FVQYNL L+Q    KN EP+ +D IS 
Sbjct: 623  TSSLIRSKPGRVPLEEMHETKNCIEHQRLNDLAFVQYNLWLRQ---RKNLEPDCMDSISY 679

Query: 492  DSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPN 316
            +  +   +WV+ +++  ED   SDWM +D PL  ++  L    D+ E + AGF+  E   
Sbjct: 680  EKMEVVHNWVSRREQISEDLESSDWMTVDPPL-GSIAPLGPLIDDIEALGAGFDDFEIFG 738

Query: 315  GVKDDE 298
            G KD E
Sbjct: 739  GPKDSE 744


>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score =  766 bits (1977), Expect = 0.0
 Identities = 381/726 (52%), Positives = 516/726 (71%), Gaps = 2/726 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            +RVQLKCLYC KLF GGGIHRIKEHLA QKGNAS C  VP EVQ IMQ+SLDGV +KK+K
Sbjct: 29   DRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRK 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNRISD 2104
            +Q++ E++ ++   + E+D   N  ++++ + L+   + L++N  +     EG  N++  
Sbjct: 89   RQKLDEEMTNVNAMTAEVDAISNHMDMDSSIHLIEVAEPLDTNSALLLTHEEGTSNKVG- 147

Query: 2103 RRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDAVNSHY 1924
             RK+G    +S   +     ++P G   L S R++  VHMA+GRFLYD+G SL+AVNS Y
Sbjct: 148  -RKKGSKGKSSSC-LDREMIVIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAY 205

Query: 1923 FQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADEWVT 1744
            FQPMI++IA  G G+ P SYHD+RGWILK +VEE+ G  ++ + TWG TGCSV+ D+W T
Sbjct: 206  FQPMIESIALAGTGIIPPSYHDIRGWILKNSVEEVRGDFDRCKATWGMTGCSVMVDQWCT 265

Query: 1743 ETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITDSTD 1564
            E GR ++NF VYCP+GT+FL+SVDA+ I+ S D LY+LLK VVE+VG+ +V+QVIT   +
Sbjct: 266  EAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEE 325

Query: 1563 HYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHSLVL 1384
            ++ +AG++L  T+ ++YWTPCAA  ++L+L D G IE +N  +E A+SITRF+YN+S+VL
Sbjct: 326  NFAIAGRKLSDTYPTLYWTPCAASCVDLILADIGNIEDVNTVIEQARSITRFVYNNSMVL 385

Query: 1383 NMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEGIAV 1204
            NM+R+ T G D+++P +TRSAT+F TL  MV+LK  LQ MV+SQEWMD  + K+P G+ +
Sbjct: 386  NMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEM 445

Query: 1203 MEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELVENK 1024
            +++I ++SFWS C+ ++ LT+PL+ VLR+VGS  RPAMGY+ A MY  K AIK EL+   
Sbjct: 446  LDLISSESFWSSCNSIIRLTNPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRD 505

Query: 1023 EYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVPDIN 844
             YMVYWNIID RW    +HPL  AGF+LNP++FYS+EGD   +I SGMFDCIERLV D N
Sbjct: 506  RYMVYWNIIDQRWEHHWRHPLCAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTN 565

Query: 843  IQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYG-GGCPNLARLAIHILSQTC 667
            +QDKI +E+ SYKNA GDF RK AIRAR TLLPAEWW+T G GGCPNL RLA  ILSQTC
Sbjct: 566  VQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC 625

Query: 666  SASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISSDS 487
            S+   K++ + F+++  TRN +EHQRL DLVFV+ NL+L+Q+    N E  P DP+S D 
Sbjct: 626  SSVGFKQNQVFFDKLHDTRNHIEHQRLSDLVFVRSNLQLKQMATNVN-EHYPTDPLSFDG 684

Query: 486  FDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPNGV 310
                +DWV  K    ED G+ +W  L+ P  +  M L   ND  + +VAGF+  E     
Sbjct: 685  LGIVDDWVWKKDLSAEDCGNLEWTVLENPPFSPPMRLPQ-NDGYDDLVAGFDDLEVFKRQ 743

Query: 309  KDDEGD 292
            ++ E D
Sbjct: 744  RESEDD 749


>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score =  760 bits (1963), Expect = 0.0
 Identities = 380/726 (52%), Positives = 515/726 (70%), Gaps = 2/726 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            +RVQLKCLYC KLF GGGIHRIKEHLA QKGNAS C  VP EVQ IMQ+SLDGV +KK+K
Sbjct: 29   DRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNASTCHSVPPEVQNIMQESLDGVMMKKRK 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNRISD 2104
            +Q++ E++ ++   + E+D   N  ++++ + L+   + LE+N  +     +G  N++  
Sbjct: 89   RQKLDEEMTNVNTMTGEVDGISNHMDMDSSIHLIEVAEPLETNSVLLLTHEKGTSNKVG- 147

Query: 2103 RRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDAVNSHY 1924
             RK+G    +S   +     ++P G   L S R++  VHMAVGRFLYD+G SL+AVNS Y
Sbjct: 148  -RKKGSKGKSSSC-LEREMIVIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAY 205

Query: 1923 FQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADEWVT 1744
            FQPMI++IA  G G+ P SYHD+RGWILK ++EE+    ++ + TWG TGCSV+ D+W T
Sbjct: 206  FQPMIESIALAGTGIIPPSYHDIRGWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCT 265

Query: 1743 ETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITDSTD 1564
            E GR ++NF VYCP+GT+FL+SVDA+ I+ S D LY+LLK VVE+VG+ +V+QVIT   +
Sbjct: 266  EAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEE 325

Query: 1563 HYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHSLVL 1384
            ++ +AG++L  T+ ++YWTPCAA  ++L+L D G IE +N  +E A+SITRF+YN+S+VL
Sbjct: 326  NFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVL 385

Query: 1383 NMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEGIAV 1204
            NM+R+ T G D+++P +TRSAT+F TL  MV+LK  LQ MV+SQEWMD  + K+P G+ +
Sbjct: 386  NMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEM 445

Query: 1203 MEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELVENK 1024
            +++I ++SFWS C+ ++ LT+PL+ VLR+VGS  RPAMGY+ A MY  K AIK EL+   
Sbjct: 446  LDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRD 505

Query: 1023 EYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVPDIN 844
             YMVYWNIID RW    +HPL+ AGF+LNP++FYS+EGD   +I SGMFDCIERLV D N
Sbjct: 506  RYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTN 565

Query: 843  IQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYG-GGCPNLARLAIHILSQTC 667
            +QDKI +E+ SYKNA GDF RK AIRAR TLLPAEWW+T G GGCPNL RLA  ILSQTC
Sbjct: 566  VQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCPNLTRLATRILSQTC 625

Query: 666  SASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISSDS 487
            S+   K++   F+++  TRN +EHQRL DLVFV+ NL+L+Q+    N E  P DP+S D 
Sbjct: 626  SSVGFKQNDALFDKLHDTRNHIEHQRLSDLVFVRSNLQLKQMATNVN-EHYPTDPLSFDE 684

Query: 486  FDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPNGV 310
                +DWV  K    ED G+ +W  LD P  +  M L   +D  + +VAGF+  E     
Sbjct: 685  LGIVDDWVWKKDLSAEDCGNLEWTVLDNPPFSPPMRLPQ-SDGYDDLVAGFDDLEVFKRQ 743

Query: 309  KDDEGD 292
            ++ E D
Sbjct: 744  RESEDD 749


>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum
            lycopersicum]
          Length = 748

 Score =  756 bits (1951), Expect = 0.0
 Identities = 386/729 (52%), Positives = 508/729 (69%), Gaps = 7/729 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            +RVQLKC+YC K+F GGGIHRIKEHLA QKGNAS C RV  +V+ +MQ SL+GV +KK+K
Sbjct: 29   DRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNASTCLRVQPDVRLLMQDSLNGVVMKKRK 88

Query: 2283 KQRIAEDIRSLTPGSNEMDT------FGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGM 2122
            KQ++AE+I +     N +DT      F + C +NT + LL     +E    +F  R++G 
Sbjct: 89   KQKLAEEITTY----NAIDTSDIAAEFTDTCGLNTQVDLLPMSQAIEHTSSLFLNRDQGP 144

Query: 2121 KNRISDRRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLD 1942
             NR    R R    +++ LP+              +S R    VHMAV RFL D  V LD
Sbjct: 145  NNRKKKSRIRKGASSSNNLPIIN------------QSKRVNNQVHMAVARFLLDARVPLD 192

Query: 1941 AVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVL 1762
            AVNS YFQPMID IAS+GP +   SYHDLR W+LK +V+E+   +++   TW +TGCSVL
Sbjct: 193  AVNSVYFQPMIDVIASQGPPVSAPSYHDLRSWVLKSSVQEVRTDIDQCSSTWARTGCSVL 252

Query: 1761 ADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQV 1582
             DE +T  G+IL+NF VYCP+GTMFL+SVDA+ +I S D LY+LLK VV+E+G+ NVLQV
Sbjct: 253  IDELITGKGKILLNFLVYCPQGTMFLRSVDASTLINSTDYLYELLKEVVDEIGVRNVLQV 312

Query: 1581 ITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIY 1402
            +T + + Y++AGKRL   + +++WTPCAA SI+LMLEDF K+EWI+  +E AKSI+RFIY
Sbjct: 313  VTSNEERYVIAGKRLTDAYPTLFWTPCAAHSIDLMLEDFNKLEWIDTIMEQAKSISRFIY 372

Query: 1401 NHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQ 1222
            N++++L+MMR++T G DL+   +TRSATDF TLK M N+K NLQ+MV+S EW +  + K+
Sbjct: 373  NNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMQNIKHNLQSMVTSVEWAESPYSKK 432

Query: 1221 PEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKV 1042
            PEG A+++ I NQSFWS CS++  LTDP++ +LRMV S+ RPAM Y+ AG+YR KE IK 
Sbjct: 433  PEGFALLDYISNQSFWSTCSLICRLTDPILRLLRMVSSEERPAMPYVYAGVYRAKETIKK 492

Query: 1041 ELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIER 862
            ELV  K+Y VYWNIID RW    +HPLH AGF+LNP+FFY+ E D    I S ++DCIE+
Sbjct: 493  ELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFYTTEEDVHLHIRSLVYDCIEK 552

Query: 861  LVPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHI 682
            LVPD  IQDKI +E  SY N+ GDFGRK+A+RAR TL PAEWW+TYGGGCPNLARLAI I
Sbjct: 553  LVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPAEWWSTYGGGCPNLARLAIRI 612

Query: 681  LSQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDP 502
            LSQT S   SK   I  E++ +T N +EHQRL DL FVQYN+ L+Q    KN+EP+ +D 
Sbjct: 613  LSQTSSLIRSKPGRIPIEEMHETTNCIEHQRLNDLAFVQYNMWLRQ---RKNQEPDCMDS 669

Query: 501  ISSDSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQE 325
            IS +  +   +WV+ +++  ED   SDWMA+D PL  ++  L    D+ E +  GF+  E
Sbjct: 670  ISYEKMELVHNWVSRREQMSEDLESSDWMAVDPPL-GSIAPLGPLIDDIEALGTGFDDFE 728

Query: 324  RPNGVKDDE 298
               G KD E
Sbjct: 729  IFGGPKDSE 737


>ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao]
            gi|508701288|gb|EOX93184.1| HAT transposon superfamily,
            putative [Theobroma cacao]
          Length = 750

 Score =  734 bits (1896), Expect = 0.0
 Identities = 362/726 (49%), Positives = 503/726 (69%), Gaps = 4/726 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ER+Q+KC+YC K+F GGGIHR KEHLA +KG    C +VP  V+ +MQ+SL+GV +K+  
Sbjct: 29   ERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQGPICEQVPPGVRALMQESLNGVLLKQDN 88

Query: 2283 KQRIAEDIRSL---TPGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDRRNEGMKNR 2113
            KQ    ++ +    +P + E+D      +VN G++ +   ++LE +  +       +   
Sbjct: 89   KQNAIPELLACGGSSPHAGEIDKSAYSDDVNNGVKPIQVLNSLEPDSSLVLNGKGEVSQG 148

Query: 2112 ISDRRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDAVN 1933
            I D +KRGR  +     +  N+      DL L S   +  VHMA+GRFLYD+GV+LDAVN
Sbjct: 149  IRDSKKRGRDRS-----LLANSHSCAKSDLALVSIGAENPVHMAIGRFLYDIGVNLDAVN 203

Query: 1932 SHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLADE 1753
            S YFQPMIDAIAS G G+ P S  DLRGWILK  +EE+   +++ +  WGKTGCS+L ++
Sbjct: 204  SVYFQPMIDAIASTGSGIVPPSSQDLRGWILKNVMEEVKDDIDRNKTMWGKTGCSILVEQ 263

Query: 1752 WVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVITD 1573
            W  ++GR L++F VYCP+ T+FLKSVDA+ +I S D L +LLK VVEEVG++NV+QVIT+
Sbjct: 264  WSPKSGRTLLSFLVYCPQATVFLKSVDASRVIFSADHLNELLKQVVEEVGVENVVQVITN 323

Query: 1572 STDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYNHS 1393
              + Y +AGKRL+ +F S+YW PC    +++MLEDF  +EWI+ T+E AKS+TRF+YNHS
Sbjct: 324  CEEQYFLAGKRLMESFPSLYWAPCLVHCVDMMLEDFANLEWISETIEQAKSVTRFVYNHS 383

Query: 1392 LVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQPEG 1213
            +VLNMMRR+T   D+++P +TR A++F TLK M +LK  LQAMV+SQ+W +C + K+P G
Sbjct: 384  VVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADLKLKLQAMVNSQDWSECPYAKKPGG 443

Query: 1212 IAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVELV 1033
            + +++I+ N+SFW+ C ++V L  PL+ VL +VGS  R  MGY+ AG+YR KE IK ELV
Sbjct: 444  LVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSKKRSTMGYVYAGIYRAKETIKKELV 503

Query: 1032 ENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERLVP 853
            +  +YMVYWNIID RW +Q   PL+ A FFLNP+FFYS+EG+  + I S MFDCIERLVP
Sbjct: 504  KKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFFYSIEGNIHNDILSSMFDCIERLVP 563

Query: 852  DINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHILSQ 673
            D N+QD+I RE+  YKNA GD GR +A+RAR  LLP EWW+ YGGGCPNL  LAI ILSQ
Sbjct: 564  DTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLPGEWWSMYGGGCPNLQHLAIRILSQ 623

Query: 672  TCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPISS 493
            TCS+  SK + I+ E++  TRN LEHQRL DLV+V+YNL L+Q+   ++++ +  DP+S 
Sbjct: 624  TCSSIGSKPNKISIEEIHDTRNFLEHQRLSDLVYVRYNLYLRQMVL-RSQDKDSADPLSF 682

Query: 492  DSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQERPN 316
            +S +  +DW+       ED G SDWM+LD P+ + M+   S ++  +++  GF   E  N
Sbjct: 683  NSKEIRDDWIAYNAVCEEDYGSSDWMSLDPPVGSRMLSGTSGDETEDFLGTGFADLEIFN 742

Query: 315  GVKDDE 298
            G+   E
Sbjct: 743  GLNGVE 748


>ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa]
            gi|550335284|gb|ERP58729.1| hypothetical protein
            POPTR_0006s02210g [Populus trichocarpa]
          Length = 847

 Score =  720 bits (1858), Expect = 0.0
 Identities = 378/724 (52%), Positives = 499/724 (68%), Gaps = 7/724 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQK-GNASCCSRVPLEVQRIMQQSLDGVTVKKK 2287
            +RVQ+KC YC KLF GGGIHR KEHLA +  G    C+RVP +V+ +M+Q L  + V+++
Sbjct: 136  KRVQIKCNYCAKLFKGGGIHRFKEHLAGRNSGGVPSCTRVPSDVRDLMEQHLSPIVVRQR 195

Query: 2286 KKQRIA----EDIRSLTPGSNEMDTFGNQCE-VNTGLQLLAHPDTLESNMGVFDRRNEGM 2122
            KK++      +D+ S  PG  ++  F +  + + T L+ +A  + +E N   F    EG 
Sbjct: 196  KKRKSKREKLDDVDS-PPGGEDVYIFADYSDDMITPLRAVAACNLVEVNSD-FLLDGEGT 253

Query: 2121 KNRISDRRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLD 1942
             N     RK     +A  +  + +A  L    + + S      +H   GRFLYD+G SLD
Sbjct: 254  SNGNLGTRK-----SAIAVAASDDADAL----IAMGSETADNPIHAIWGRFLYDIGASLD 304

Query: 1941 AVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVL 1762
            A++S++ QP+ID +A   PG+   S+ DLRG ILK  VEE+   + +Y+  W KTGCS+L
Sbjct: 305  AMDSNFSQPLIDTVAYGRPGIAAPSHQDLRGRILKSLVEEVKSDINQYKTRWVKTGCSLL 364

Query: 1761 ADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQV 1582
             +E  +E+G   +NF VYC +GT+FLKSVDA+++I S D LY+LLK +VEEVG  N+LQV
Sbjct: 365  VEECNSESGVTTLNFLVYCSKGTVFLKSVDASNLIHSTDGLYELLKLMVEEVGAGNILQV 424

Query: 1581 ITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIY 1402
            IT+  +HYI AGK+L+ TF S+YW PCAAR I+L+LED GK++WIN  LE AKS+TRF+Y
Sbjct: 425  ITNGEEHYIAAGKKLMDTFPSLYWAPCAARCIDLILEDIGKLDWINTVLEQAKSVTRFVY 484

Query: 1401 NHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQ 1222
            N+S VLN+MR++T G D++Q  +TRSAT+FT LK M N K NLQ MV+SQEWMDC + KQ
Sbjct: 485  NNSAVLNLMRKFTSGSDIVQQGITRSATNFTALKRMANFKLNLQTMVTSQEWMDCPYSKQ 544

Query: 1221 PEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKV 1042
            P G+A+++II N+SFWS C +++ LT PL+ VL +V S+ R AMGY+ +G+YR KE IK 
Sbjct: 545  PGGLAMVDIITNRSFWSSCILIIRLTSPLLQVLVIVSSEKRAAMGYVFSGIYRAKETIKK 604

Query: 1041 ELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIER 862
            ELV+ ++YMVYWNIID RW +Q Q PLH AGFF NP+FFYS+EGD  +KI S MFDCIER
Sbjct: 605  ELVKREDYMVYWNIIDHRWEQQWQTPLHAAGFFFNPKFFYSIEGDMHNKILSRMFDCIER 664

Query: 861  LVPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHI 682
            LVPD  +QDKI +EL  YKNAEG  G+K+AIRAR T+LP +WW+ YGG CPNLARLAI I
Sbjct: 665  LVPDTEVQDKIVKELTLYKNAEGHLGKKLAIRARGTMLPTDWWSMYGGSCPNLARLAIRI 724

Query: 681  LSQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDP 502
            LSQTCSA     + I FE+V +TRN L+ QRL DLVFVQYNLRL+Q+  G NK+  P DP
Sbjct: 725  LSQTCSAIGCSHNHIPFEKVHRTRNFLQRQRLTDLVFVQYNLRLRQMVDG-NKKQIPEDP 783

Query: 501  ISSDSFDFAEDWVTMKKEFFED-GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQE 325
            IS D     EDW+T  +   ED G SDWM+L    VN M +  S  DE E + +GF+  E
Sbjct: 784  ISFDDVSLVEDWITQNELCLEDSGSSDWMSLVPRSVNTMPLAPS-TDESEDVASGFDDFE 842

Query: 324  RPNG 313
              NG
Sbjct: 843  IFNG 846



 Score = 74.7 bits (182), Expect = 2e-10
 Identities = 35/84 (41%), Positives = 53/84 (63%), Gaps = 1/84 (1%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            E+ ++KC+YC ++F GGGIHR KEHLA  KG    C  VP +V+ +MQQ LD +T K+  
Sbjct: 24   EKTRMKCIYCGEIFEGGGIHRFKEHLAGPKGGGPMCQSVPPDVRLLMQQDLDVITAKQNS 83

Query: 2283 KQ-RIAEDIRSLTPGSNEMDTFGN 2215
            +Q +I E+   +    +++  F N
Sbjct: 84   QQLKIQEEESDVNLPLSDVGMFSN 107


>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine
            max]
          Length = 729

 Score =  716 bits (1849), Expect = 0.0
 Identities = 373/731 (51%), Positives = 491/731 (67%), Gaps = 7/731 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            ++VQLKC+YCLK+F GGGIHRIKEHLACQKGNAS CSRVP +V+  MQQSLDGV VKK++
Sbjct: 29   DKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNASTCSRVPHDVRLHMQQSLDGVVVKKRR 88

Query: 2283 KQRIAEDIRSLTPGSNEMDTFGNQ----CEVNTGLQLLAHPDTLESNMGVFDRRNEGMKN 2116
            KQRI E+I S+ P +  +++  N      +VN GLQ +     +E N  +     EGM  
Sbjct: 89   KQRIEEEIMSVNPLTTVVNSLPNNNNRVVDVNQGLQAIG----VEHNSSLVVNPGEGMSR 144

Query: 2115 RISDRRKRGRPENASPLPVTPNA-SMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDA 1939
             +  R+K    +N  P  V  N+  ++ V    L   +    ++MA+GRFLYD+G   DA
Sbjct: 145  NMERRKKMRATKN--PAAVYANSEGVIAVEKNGLFPKKMDNHIYMAIGRFLYDIGAPFDA 202

Query: 1938 VNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLA 1759
            VNS YFQ M+DAIASRG G E   +H+LRGWILK +VEE+   +++ + TWG+TGCS+L 
Sbjct: 203  VNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNSVEEVKNDIDRCKMTWGRTGCSILV 262

Query: 1758 DEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVI 1579
            D+W TET                              D LYDL+K VVEEVG   V+QVI
Sbjct: 263  DQWTTET------------------------------DFLYDLIKQVVEEVGAGQVVQVI 292

Query: 1578 TDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYN 1399
            T   + Y +AG+RL  TF ++Y +P AA  I+L+LEDFG +EWI+  +E A+S+TRF+YN
Sbjct: 293  TSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILEDFGNLEWISAVIEQARSVTRFVYN 352

Query: 1398 HSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQP 1219
            +S +LNM++RYT G D++ P  +  AT+FTTLK MV+LK NLQA+V+SQEW D  + KQ 
Sbjct: 353  YSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMVDLKHNLQALVTSQEWADSPYSKQT 412

Query: 1218 EGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVE 1039
             G+ +++ + NQ+FWS C ++V LT PL+ V+R+  S+ RPAMGY+ AGMYR KEAIK  
Sbjct: 413  AGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIASSEMRPAMGYVYAGMYRAKEAIKKA 472

Query: 1038 LVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERL 859
            L + +EYMVYWNII  RW R   HPLH AGF+LNP+FFYS++GD   +I SGMFDCIERL
Sbjct: 473  LGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHGQIVSGMFDCIERL 532

Query: 858  VPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHIL 679
            VPD  IQDKI +E+  YK+A GDFGRK+A+RAR  LLP+EWW+TYGGGCPNL+RLAI IL
Sbjct: 533  VPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNLLPSEWWSTYGGGCPNLSRLAIRIL 592

Query: 678  SQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPI 499
            SQT S    KR+ I FEQ+  TRN +E Q L DLVFV  NLRL+Q+    +KE +  DP+
Sbjct: 593  SQTSSVMSCKRNQIPFEQIINTRNYIERQHLTDLVFVHCNLRLRQMFM--SKEQDFSDPL 650

Query: 498  SSDSFDFAEDWVTMKKEFFED--GDSDWMALDQPLVNNMMMLDSPNDEPEYMVAGFEYQE 325
            S D+    E+W+  +  + +D  G+SDWMALD   VN  M+L   NDE E +  G++  E
Sbjct: 651  SFDNISNVEEWIRPRDLYIDDECGNSDWMALDPSSVNT-MLLRPLNDEAEDLGEGYDDYE 709

Query: 324  RPNGVKDDEGD 292
              +  KD E +
Sbjct: 710  IFSCGKDSEDE 720


>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
            gi|223539752|gb|EEF41333.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 854

 Score =  708 bits (1827), Expect = 0.0
 Identities = 367/715 (51%), Positives = 489/715 (68%), Gaps = 6/715 (0%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            +RVQ+KC YC K+F GGGIHR KEHLA +KG A  C RVP +V+ +MQQ L  V  K+KK
Sbjct: 141  DRVQIKCNYCGKVFKGGGIHRFKEHLAGRKGAAPICDRVPSDVRLLMQQCLHEVVPKQKK 200

Query: 2283 KQRIAEDIRSLT--PGSNEMDTFGNQC---EVNTGLQLLAHPDTLESNMGVFDRRNEGMK 2119
            ++ + E+  ++   P     DTF N     + + G      P ++E N  +    ++ + 
Sbjct: 201  QKVVIEETINVDSPPVPLNTDTFANHFGDEDDDNGA-----PISVEFNSNLSLEEDDVLN 255

Query: 2118 NRISDRRKRGRPENASPLPVTPNASMLPVGDLNLRSTREKELVHMAVGRFLYDVGVSLDA 1939
                  RKRGR + ++   +  +   L V  L +       ++H  VGRFLYD+G + DA
Sbjct: 256  QGNLHTRKRGRGKTSA---IVDHGDPLDVVHLKMIDN----VIHTTVGRFLYDIGANFDA 308

Query: 1938 VNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTVEEMNGVLEKYRETWGKTGCSVLA 1759
            ++S YF+ +ID ++S   G    S HDLRGWILK  VEE+   +++ R TW +TGCSVL 
Sbjct: 309  LDSIYFRSLIDMLSSGASGAVAPSNHDLRGWILKKLVEEIKNDIDQSRTTWARTGCSVLV 368

Query: 1758 DEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSIDPLYDLLKSVVEEVGIDNVLQVI 1579
            +EW +E+G  L+NF V C +GT+FLKSV+A+ II S D LY LLK VVEEVG  NVLQVI
Sbjct: 369  EEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYSPDGLYVLLKQVVEEVGASNVLQVI 428

Query: 1578 TDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLEDFGKIEWINVTLEHAKSITRFIYN 1399
            T+  +HY VAGKRL+  F S++W PCA   ++L+LEDF K+EWI+  +E AKS+TRF+YN
Sbjct: 429  TNGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILEDFAKLEWIDAVIEQAKSVTRFVYN 488

Query: 1398 HSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVNLKDNLQAMVSSQEWMDCQFLKQP 1219
            HS VLN+MR++T G+D++Q  +TRSAT+FT L+ M + K NLQ M++SQEWMDC + KQ 
Sbjct: 489  HSAVLNLMRKFTYGKDIVQQGLTRSATNFTMLQRMADFKLNLQTMITSQEWMDCPYSKQH 548

Query: 1218 EGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGSDNRPAMGYILAGMYRVKEAIKVE 1039
             G+A+++II N+SFWS C +++ LT PL+ VL + G   + AMGYI AG+YR KE IK E
Sbjct: 549  GGLAMLDIISNRSFWSSCILIIRLTSPLIRVLGIAGGKRKAAMGYIFAGIYRAKETIKRE 608

Query: 1038 LVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRFFYSLEGDARDKIPSGMFDCIERL 859
            LV+ ++YMVYWNIID RW+++   PLH AGFFLNP+FFYS+EGD  ++I S +FDCIERL
Sbjct: 609  LVKREDYMVYWNIIDHRWDQRRHPPLHVAGFFLNPKFFYSIEGDVHNEILSRVFDCIERL 668

Query: 858  VPDINIQDKINRELISYKNAEGDFGRKIAIRARHTLLPAEWWATYGGGCPNLARLAIHIL 679
            VPDI +QDKI +EL  YKNA GD GRK+AIR+R TLLPAEWW+TYGGGCPNLARLA+ IL
Sbjct: 669  VPDIEVQDKIAKELNIYKNAVGDLGRKMAIRSRGTLLPAEWWSTYGGGCPNLARLALRIL 728

Query: 678  SQTCSASVSKRSAIAFEQVQQTRNRLEHQRLRDLVFVQYNLRLQQIQFGKNKEPEPVDPI 499
            SQTCS+   + + I FE+V  TRN LE +R  DLVFVQ NLRL+++   ++K   P+DPI
Sbjct: 729  SQTCSSIGCRSNHIPFEKVHATRNCLEQKRRSDLVFVQCNLRLKEM-VDESKNQVPLDPI 787

Query: 498  SSDSFDFAEDWVTMKKEFFEDGDS-DWMALDQPLVNNMMMLDSPNDEPEYMVAGF 337
            S D+    EDW+       ED +S DWM+L  P  NN M   S  DE E +  GF
Sbjct: 788  SFDNISIVEDWILQNDICLEDYESADWMSLVPPSANN-MPAGSAVDEIEDLGVGF 841



 Score = 67.4 bits (163), Expect = 3e-08
 Identities = 33/84 (39%), Positives = 50/84 (59%), Gaps = 5/84 (5%)
 Frame = -3

Query: 2463 ERVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKK 2284
            E+V +KC YC K+F GGGI R KEHLA +KG    C  VP +V+ +M+Q+LD  + K+  
Sbjct: 27   EKVHIKCSYCGKIFKGGGIFRFKEHLAGRKGGGPMCLNVPADVRLLMEQTLDVSSAKQSS 86

Query: 2283 KQ-----RIAEDIRSLTPGSNEMD 2227
            ++     ++  ++ SL    N  D
Sbjct: 87   RRQSSRLKMTPELPSLPNNKNSDD 110


>ref|NP_001051738.1| Os03g0822900 [Oryza sativa Japonica Group]
            gi|108711817|gb|ABF99612.1| hAT family dimerisation
            domain containing protein, expressed [Oryza sativa
            Japonica Group] gi|113550209|dbj|BAF13652.1| Os03g0822900
            [Oryza sativa Japonica Group]
            gi|215704668|dbj|BAG94296.1| unnamed protein product
            [Oryza sativa Japonica Group] gi|222626069|gb|EEE60201.1|
            hypothetical protein OsJ_13162 [Oryza sativa Japonica
            Group]
          Length = 796

 Score =  680 bits (1755), Expect = 0.0
 Identities = 369/760 (48%), Positives = 482/760 (63%), Gaps = 39/760 (5%)
 Frame = -3

Query: 2460 RVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKKK 2281
            RV+LKC+YC K F GGGIHR KEHLA + GNA CC +VP EVQ  M  SLD V  KKK+K
Sbjct: 39   RVRLKCVYCHKHFLGGGIHRFKEHLANRPGNACCCPKVPREVQETMLHSLDAVAAKKKRK 98

Query: 2280 QRIAEDIRSLT------PGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDR---RNE 2128
            Q +AE IR +T        S       +  E+ + + ++   + L+      +       
Sbjct: 99   QSLAEGIRRITHSAPAAAASASPPAPADAAEMESPIHMIPLNEVLDLGSVPLEETPPETR 158

Query: 2127 GMKNRISDRRKRGRPENAS----------PLPVTPNASMLPVGDLNL------------- 2017
             MK  IS +RK+     AS          PL  TP     P   + +             
Sbjct: 159  EMKGSISKKRKKLAARQASTAPLAHQNQQPLQSTPAGLTQPFHQMVVAFDSAASQLMHFD 218

Query: 2016 RSTREKELVHMAVGRFLYDVGVSLDAVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILK 1837
            +    KE V+MA+GRFLYD GVSL+AVNS YFQPM++A+AS G   E  SYHD RG ILK
Sbjct: 219  QPGSNKEQVYMAIGRFLYDAGVSLEAVNSVYFQPMLEAVASAGGKPEAFSYHDFRGSILK 278

Query: 1836 YTVEEMNGVLEKYRETWGKTGCSVLADEWVTETGRILINFFVYCPEGTMFLKSVDATDII 1657
             +++E+   LE Y+ +W +TGC++LADEW T+ GR LINF VYCPEGTMFLKSVDATDI+
Sbjct: 279  KSLDEVTAQLEFYKGSWTRTGCTLLADEWTTDRGRTLINFSVYCPEGTMFLKSVDATDIV 338

Query: 1656 GSIDPLYDLLKSVVEEVGIDNVLQVITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLM 1477
             S DPLY+LLK+VVEEVG  NV+QVIT++++ + VAGKRL  TF +++W+ C+ + I+ M
Sbjct: 339  VSSDPLYELLKNVVEEVGEKNVVQVITNNSEIHAVAGKRLCETFPTLFWSQCSFQCIDGM 398

Query: 1476 LEDFGKIEWINVTLEHAKSITRFIYNHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKC 1297
            LEDF K+  IN  + +AK IT FIYN +   N+M+R+  G+DL+ P  TR+A +F TLK 
Sbjct: 399  LEDFSKVGAINEIICNAKVITGFIYNSAFAFNLMKRHLHGKDLLVPAETRAAMNFVTLKN 458

Query: 1296 MVNLKDNLQAMVSSQEWMDCQFLKQPEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRM 1117
            M NLKD+L+AM+SS EW+     K+P G+ V  +I N  FWS C+ VV +T+PLV +L++
Sbjct: 459  MYNLKDSLEAMISSDEWIHYLLPKKPGGVEVTNLIGNLQFWSSCAAVVRITEPLVHLLKL 518

Query: 1116 VGSDNRPAMGYILAGMYRVKEAIKVELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLN 937
            VGS+ RP+MGY+ AG+Y+ K AIK ELV   +YM YW+IID RWN+    PLH AGFFLN
Sbjct: 519  VGSNKRPSMGYVYAGLYQAKAAIKKELVRKNDYMAYWDIIDWRWNKDAPRPLHLAGFFLN 578

Query: 936  PRFFYSLEGDARDKIPSGMFDCIERLVPDINIQDKINRELISYKN-AEGDFGRKIAIRAR 760
            P FF  + G    +I SGM DC+ERLV D+ IQDKI +EL  Y++ A GDF R++AIRAR
Sbjct: 579  PLFFDGVRGGTSSEIFSGMLDCVERLVSDVKIQDKIQKELNVYRSEAAGDFRRQMAIRAR 638

Query: 759  HTLLPAEWWATYGGGCPNLARLAIHILSQTCSASVSKRSAIAFEQVQQTR-NRLEHQRLR 583
            HTL PAEWW TYGG CPNL RLA+ ILSQTCSA    R  I+FEQ+   R N  E QR+ 
Sbjct: 639  HTLPPAEWWYTYGGACPNLTRLAVRILSQTCSAKGCDRRHISFEQIHDQRMNLFERQRMH 698

Query: 582  DLVFVQYNLRLQQIQFGKNKEPEPVDPISSDSFDFAEDWVTMKKEFF--EDGDSDWMALD 409
             L FVQYNLRLQ  Q  K K     DP+S D+ D  +DWV  +      +   S+W  ++
Sbjct: 699  HLTFVQYNLRLQHRQQHKTK---AFDPVSVDNIDIVDDWVVDRSALISGQAEQSNWTEIN 755

Query: 408  QPLVNNMMMLDSPNDEPEYMVAGFE---YQERPNGVKDDE 298
            QP+ N   M  S +DE E  + G +    Q    G ++D+
Sbjct: 756  QPVNNITSMGPSDDDEFESFIEGVDDKMIQGASRGTQEDD 795


>gb|EAY92386.1| hypothetical protein OsI_14116 [Oryza sativa Indica Group]
          Length = 796

 Score =  677 bits (1748), Expect = 0.0
 Identities = 369/760 (48%), Positives = 481/760 (63%), Gaps = 39/760 (5%)
 Frame = -3

Query: 2460 RVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKKK 2281
            RV+LKC+YC K F GGGIHR KEHLA + GNA CC +VP EVQ  M  SLD V  KKK+K
Sbjct: 39   RVRLKCVYCHKHFLGGGIHRFKEHLARRPGNACCCPKVPREVQETMLHSLDAVAAKKKRK 98

Query: 2280 QRIAEDIRSLT------PGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDR---RNE 2128
            Q +AE IR +T        S       +  E+ + + ++   + L+      +       
Sbjct: 99   QSLAEGIRRITHSAPAAAASASPPAPADAAEMESPIHMIPLNEVLDLGSVPLEETPPETR 158

Query: 2127 GMKNRISDRRKRGRPENAS----------PLPVTPNASMLPVGDLNL------------- 2017
             MK  IS +RK+     AS          PL  TP     P   + +             
Sbjct: 159  EMKGSISKKRKKLAARQASTAPLAHQNQQPLQSTPAGLTQPFHQMVVAFDSAASQLRHFD 218

Query: 2016 RSTREKELVHMAVGRFLYDVGVSLDAVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILK 1837
            +    KE V+MA+GRFLYD GVSL+AVNS YFQPM++A+AS G   E  SYHD RG ILK
Sbjct: 219  QPGSNKEQVYMAIGRFLYDAGVSLEAVNSVYFQPMLEAVASAGGKPEAFSYHDFRGSILK 278

Query: 1836 YTVEEMNGVLEKYRETWGKTGCSVLADEWVTETGRILINFFVYCPEGTMFLKSVDATDII 1657
             +++E+   LE Y+ +W +TGC++LADEW T+ GR LINF VYCPEGTMFLKSVDATDI+
Sbjct: 279  KSLDEVTAQLEFYKGSWTRTGCTLLADEWTTDRGRTLINFSVYCPEGTMFLKSVDATDIV 338

Query: 1656 GSIDPLYDLLKSVVEEVGIDNVLQVITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLM 1477
             S DPLY+LLK+VVEEVG  NV+QVIT++++ + VAGKRL  TF +++W+ C+ + I+ M
Sbjct: 339  VSSDPLYELLKNVVEEVGEKNVVQVITNNSEIHAVAGKRLCETFPTLFWSQCSFQCIDGM 398

Query: 1476 LEDFGKIEWINVTLEHAKSITRFIYNHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKC 1297
            LEDF K+  IN  + +AK IT FIYN +   N+M+R+  G+DL+ P  TR+A +F TLK 
Sbjct: 399  LEDFSKVGAINEIICNAKVITGFIYNSAFAFNLMKRHLHGKDLLVPAETRAAMNFVTLKN 458

Query: 1296 MVNLKDNLQAMVSSQEWMDCQFLKQPEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRM 1117
            M NLKD+L+AM+SS EW+     K+P G+ V  +I N  FWS C+ VV +T+PLV +L++
Sbjct: 459  MYNLKDSLEAMISSDEWIHYLLPKKPGGVEVTNLIGNLQFWSSCAAVVRITEPLVHLLKL 518

Query: 1116 VGSDNRPAMGYILAGMYRVKEAIKVELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLN 937
            VGS+ RP+MGY+ AG+Y+ K AIK ELV   +YM YW+IID RWN+    PLH AGFFLN
Sbjct: 519  VGSNKRPSMGYVYAGLYQAKAAIKKELVRKNDYMAYWDIIDWRWNKDAPRPLHLAGFFLN 578

Query: 936  PRFFYSLEGDARDKIPSGMFDCIERLVPDINIQDKINRELISYKN-AEGDFGRKIAIRAR 760
            P FF  + G    +I SGM DCIERLV D+ IQDKI +EL  Y++ A GDF R++AIRAR
Sbjct: 579  PLFFDGVRGGTSSEIFSGMLDCIERLVSDVKIQDKIQKELNVYRSEAAGDFRRQMAIRAR 638

Query: 759  HTLLPAEWWATYGGGCPNLARLAIHILSQTCSASVSKRSAIAFEQVQQTR-NRLEHQRLR 583
             TL PAEWW TYGG CPNL RLA+ ILSQTCSA    R  I+FEQ+   R N  E QR+ 
Sbjct: 639  RTLPPAEWWYTYGGACPNLTRLAVRILSQTCSAKGCDRRHISFEQIHDQRMNLFERQRMH 698

Query: 582  DLVFVQYNLRLQQIQFGKNKEPEPVDPISSDSFDFAEDWVTMKKEFF--EDGDSDWMALD 409
             L FVQYNLRLQ  Q  K K     DP+S D+ D  +DWV  +      +   S+W  ++
Sbjct: 699  HLTFVQYNLRLQHRQQHKTK---AFDPVSVDNIDIVDDWVVDRSALISGQAEQSNWTEIN 755

Query: 408  QPLVNNMMMLDSPNDEPEYMVAGFE---YQERPNGVKDDE 298
            QP+ N   M  S +DE E  + G +    Q    G ++D+
Sbjct: 756  QPVNNITSMGPSDDDEFESFIEGVDDKMIQGASRGTQEDD 795


>ref|XP_006651967.1| PREDICTED: uncharacterized protein LOC102714280 [Oryza brachyantha]
          Length = 787

 Score =  671 bits (1732), Expect = 0.0
 Identities = 370/742 (49%), Positives = 480/742 (64%), Gaps = 33/742 (4%)
 Frame = -3

Query: 2460 RVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKKK 2281
            RV+LKC+YC K F GGGIHR KEHLA + GNASCC +VP EVQ  M  SLD V  KKK+K
Sbjct: 39   RVKLKCVYCHKHFLGGGIHRFKEHLARRPGNASCCPKVPPEVQETMHHSLDVVAAKKKRK 98

Query: 2280 QRIAEDIRSLT-------------PGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFD 2140
            Q +AE IR +T              G+ EM++      +N  L L + P  LE       
Sbjct: 99   QSLAEGIRRMTHSAPPAAAPPVDATGAAEMESPIRMIPLNEVLDLGSVP--LEET----P 152

Query: 2139 RRNEGMKNRISDRRKR--------GRPENASPLPVT-PNASMLPVGDLNL-------RST 2008
                 MK   S +RK+          P + +P P T P   M+   D          +S 
Sbjct: 153  PEAREMKGSTSKKRKKLAARHASAAPPAHQNPAPQTQPFHQMVMAFDAAASQLRHFDQSA 212

Query: 2007 REKELVHMAVGRFLYDVGVSLDAVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYTV 1828
              KE V+MA+GRFLYD GVSL+AVNS YFQPM++A+AS G   E  SYHD RG ILK ++
Sbjct: 213  SNKEQVYMAIGRFLYDAGVSLEAVNSVYFQPMLEAVASAGGRPEAFSYHDFRGSILKKSL 272

Query: 1827 EEMNGVLEKYRETWGKTGCSVLADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGSI 1648
            +E+   +E Y+ +W +TGC++LADEW T+ GR LINF VYCPEGTMFLKSVDATD++ S 
Sbjct: 273  DEVTAQVEFYKGSWTRTGCTLLADEWTTDRGRTLINFSVYCPEGTMFLKSVDATDMVVSS 332

Query: 1647 DPLYDLLKSVVEEVGIDNVLQVITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLED 1468
            DPLY+LLK+VVEEVG  NV+QVIT++++ + VAGKRL  TF +++W+PC+ + I+ MLED
Sbjct: 333  DPLYELLKNVVEEVGEKNVVQVITNNSEIHAVAGKRLGETFPTLFWSPCSFQCIDGMLED 392

Query: 1467 FGKIEWINVTLEHAKSITRFIYNHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMVN 1288
            F K+  IN  + +AK+IT FIYN +  LN+M+R+  G+DL+    TR+A +F TLK M N
Sbjct: 393  FSKVGAINEIICNAKAITGFIYNSAFALNLMKRHLHGKDLLVRAETRAAMNFVTLKNMYN 452

Query: 1287 LKDNLQAMVSSQEWMDCQFLKQPEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVGS 1108
            LK++L+AM+SS EW+     K+P G+ V  +I N  FWS C+ VV +T+PLV +L++V S
Sbjct: 453  LKESLEAMISSDEWIHYLLPKKPGGVEVTNLIGNLQFWSSCAAVVRITEPLVHLLKLVSS 512

Query: 1107 DNRPAMGYILAGMYRVKEAIKVELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPRF 928
            + RP MGY+ AG+Y+ K AIK ELV   +YM YW+IID RW++    PL  AGFFLNP F
Sbjct: 513  NKRPPMGYVYAGLYQAKAAIKKELVRKNDYMAYWDIIDWRWDKHAPRPLDLAGFFLNPLF 572

Query: 927  FYSLEGDARDKIPSGMFDCIERLVPDINIQDKINRELISYKN-AEGDFGRKIAIRARHTL 751
            F  + GD  ++I SGM DCIERLV D+ IQDKI +EL  Y++ A GDF R++AIRAR TL
Sbjct: 573  FDGVRGDISNEIFSGMLDCIERLVSDVKIQDKIQKELNVYRSEAAGDFRRQMAIRARRTL 632

Query: 750  LPAEWWATYGGGCPNLARLAIHILSQTCSASVSKRSAIAFEQVQQTR-NRLEHQRLRDLV 574
             PAEWW TYGG CPNL RLA+ ILSQTCSA    R  I+FEQ+   R N  E QR+  L 
Sbjct: 633  PPAEWWYTYGGACPNLTRLAVRILSQTCSAKGRDRQHISFEQIHDQRMNFFERQRMHHLT 692

Query: 573  FVQYNLRLQQIQFGKNKEPEPVDPISSDSFDFAEDWVTMKKEFF--EDGDSDWMALDQPL 400
            FVQYNLRLQ  Q  K K     DP+S D+ D  EDWV  +      +   S+W  ++QP+
Sbjct: 693  FVQYNLRLQHRQQHKAK---AFDPVSVDNIDIVEDWVLDRSTLMSGQAEQSNWTEINQPV 749

Query: 399  VNNMMMLDSPNDEPEYMVAGFE 334
             N   M  S +DE E  + G +
Sbjct: 750  NNITSMGPSDDDEFESFIEGVD 771


>ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757413 [Setaria italica]
          Length = 803

 Score =  652 bits (1682), Expect = 0.0
 Identities = 358/760 (47%), Positives = 475/760 (62%), Gaps = 37/760 (4%)
 Frame = -3

Query: 2460 RVQLKCLYCLKLFSGGGIHRIKEHLACQKGNASCCSRVPLEVQRIMQQSLDGVTVKKKKK 2281
            RV+LKC YC K F GGGIHR KEHLA + GNA CC +VP +VQ  M +SLD V  KK ++
Sbjct: 45   RVRLKCAYCGKHFLGGGIHRFKEHLARRPGNACCCPKVPRDVQDTMMRSLDAVAAKKMQR 104

Query: 2280 QRIAE----DIRSLTPGSNEMDTFGNQCEVNTGLQLLAHPDTLESNMGVFDRRN----EG 2125
            +        D+R   P      +  +    ++ + ++   + L+      D +     E 
Sbjct: 105  KLANALPPGDMRRFAPTDASPASAASGGATDSPIHMIPLNEVLDFEPVPLDEQRPPLPET 164

Query: 2124 MKNRISDRRKRGRPENASPLPVTPNA----------------------SMLPVGDLNLRS 2011
            M+  +S ++KR    NAS  P+TP                        ++ P       +
Sbjct: 165  MRGSVSSKKKRKMLSNASTPPLTPPTLQQHVPSTPQTNPLHQVVMAVDAVTPSSGHFGHA 224

Query: 2010 TREKELVHMAVGRFLYDVGVSLDAVNSHYFQPMIDAIASRGPGLEPTSYHDLRGWILKYT 1831
              +KE V +AVGRFLYDVGV L+AVNS YFQPM++AIAS G   E  SYHD RG ILK +
Sbjct: 225  GLDKEQVSVAVGRFLYDVGVPLEAVNSVYFQPMLEAIASAGGRPEALSYHDFRGHILKKS 284

Query: 1830 VEEMNGVLEKYRETWGKTGCSVLADEWVTETGRILINFFVYCPEGTMFLKSVDATDIIGS 1651
            +++    LE ++ +W +TGCSVLADEW+T+ GR LINF VYCPEGTMFLKSVDAT I+ S
Sbjct: 285  LDDATSRLEFFKGSWTRTGCSVLADEWITDKGRTLINFSVYCPEGTMFLKSVDATSIVAS 344

Query: 1650 IDPLYDLLKSVVEEVGIDNVLQVITDSTDHYIVAGKRLIATFRSMYWTPCAARSINLMLE 1471
             D LY+LLKSVVEEVG   V+QVIT++++ +  AGK+L  TF +++W+PC+ + I+ MLE
Sbjct: 345  SDALYELLKSVVEEVGEKKVVQVITNNSEIHAAAGKKLGETFPTLFWSPCSFQCIDGMLE 404

Query: 1470 DFGKIEWINVTLEHAKSITRFIYNHSLVLNMMRRYTGGRDLIQPVMTRSATDFTTLKCMV 1291
            DF K+  I+  + +AK+IT F YN +  LN+M++Y  G+DL+ P  TR++ +F TLK M 
Sbjct: 405  DFSKVGAISEIISNAKAITGFFYNSAFALNLMKKYLHGKDLLVPAETRASMNFVTLKNMY 464

Query: 1290 NLKDNLQAMVSSQEWMDCQFLKQPEGIAVMEIIYNQSFWSRCSVVVHLTDPLVGVLRMVG 1111
             LK+ LQAMV+S EW+    L +  GI V  ++ +  FWS C+ VVH+T+PLV +L++VG
Sbjct: 465  GLKEALQAMVNSDEWIHF-LLPKKGGIEVSNLVNSLQFWSSCAAVVHITEPLVHLLKLVG 523

Query: 1110 SDNRPAMGYILAGMYRVKEAIKVELVENKEYMVYWNIIDSRWNRQMQHPLHEAGFFLNPR 931
            S  RPAMGYI AG+Y+ K AIK ELV   +YM YWNIID RW+ Q   PLH AGFFLNP 
Sbjct: 524  STKRPAMGYIYAGLYQAKAAIKKELVSKNDYMAYWNIIDWRWDNQTPRPLHSAGFFLNPL 583

Query: 930  FFYSLEGDARDKIPSGMFDCIERLVPDINIQDKINRELISYKN-AEGDFGRKIAIRARHT 754
            FF  + GD  + I SGM DCIERLV D+ IQDKI REL  Y++   GDF R++AIR+R T
Sbjct: 584  FFDGIRGDVSNGIFSGMLDCIERLVSDVKIQDKIQRELNMYRSETAGDFRRQMAIRSRRT 643

Query: 753  LLPAEWWATYGGGCPNLARLAIHILSQTCSASVSKRSAIAFEQVQQTR-NRLEHQRLRDL 577
            L PAEWW TYGG CPNL RLA+ ILSQTCSA    R+ I FEQV   R N  E QR+ DL
Sbjct: 644  LPPAEWWYTYGGACPNLTRLAVRILSQTCSARGCDRAHIPFEQVHDERLNSFERQRMHDL 703

Query: 576  VFVQYNLRLQQIQFGKNKEPEPVDPISSDSFDFAEDWVTMKKEFFED--GDSDWMALDQP 403
             FVQYNLRLQQ Q    ++ +  DP+S D  D  +DWV      F       +WM ++Q 
Sbjct: 704  TFVQYNLRLQQRQ---QRKVKAFDPVSVDYVDIVDDWVVDTPALFSGPVEQPNWMEINQS 760

Query: 402  LVNNMMMLDSPNDEPEYMVAGFE---YQERPNGVKDDEGD 292
             V+ +   +   DE E  + G +    Q    G+++D+ D
Sbjct: 761  -VSRIAPREPNEDEFESFIEGVDDEMIQGAAQGIQEDDDD 799

