BLASTX nr result

ID: Mentha24_contig00023821 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00023821
         (1349 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   553   e-155
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   548   e-153
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       541   e-151
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   486   e-135
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         480   e-133
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   476   e-131
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   473   e-131
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   473   e-131
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   471   e-130
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   469   e-129
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   469   e-129
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   468   e-129
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   464   e-128
ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prun...   448   e-123
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   439   e-120
ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310...   439   e-120
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   438   e-120
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   433   e-119
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   432   e-118
ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757...   395   e-107
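
Note on reading the table above: each row holds a sequence identifier, a description (truncated with "..." when long), a bit score, and an E-value. This legacy report prints very small E-values without a mantissa, so "e-155" stands for 1e-155. The following is a minimal sketch of how such rows could be parsed, not part of BLAST itself; the names HIT_LINE, parse_evalue and parse_hit are hypothetical and chosen only for illustration.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# identifier, description (possibly truncated), bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(text):
    """Convert an E-value string to float, restoring the mantissa
    that this report format omits (e.g. "e-155" -> 1e-155)."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit(line):
    """Return (id, description, bit_score, evalue), or None if the
    line is not a hit row (for example the table header)."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return m["id"], m["desc"], float(m["bits"]), parse_evalue(m["evalue"])


if __name__ == "__main__":
    row = "gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         480   e-133"
    print(parse_hit(row))
```

The description is matched lazily so that the last two whitespace-separated fields are always taken as score and E-value, which matches every row in this table.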

>ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum]
          Length = 755

 Score =  553 bits (1426), Expect = e-155
 Identities = 265/427 (62%), Positives = 331/427 (77%)
 Frame = -2

Query: 1282 PALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSW 1103
            P   SK+  + V MA+ RF  D  +P DAVNS YFQPM+D IASQG  V  PSY++LRSW
Sbjct: 172  PINQSKRVNNHVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPQVSAPSYHELRSW 231

Query: 1102 ILKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXX 923
            +LK SV EVR D++QC+S W R+GCS+LV EW + K KT +N   Y PEGT+FLR     
Sbjct: 232  VLKASVQEVRNDIDQCSSTWARSGCSVLVDEWITGKGKTLLNFLVYCPEGTMFLRSVDAS 291

Query: 922  XXXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCI 743
                  D+LYELLKE VE+VG+ NV+QVVT+ EERY+IAGKRLTD YPT+FWTPCA + I
Sbjct: 292  TLINSTDYLYELLKEVVEEVGVRNVLQVVTSNEERYIIAGKRLTDAYPTLFWTPCAAHSI 351

Query: 742  DLMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMT 563
            DLML+D+ +L  +  I+ QAKSIS +IY++   ++M+R++T GVDLVDLG TRS+TDF+T
Sbjct: 352  DLMLEDLKKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLT 411

Query: 562  LKRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHL 383
            LKRM+N++ NLQSMVTS EW  S  S+K EG A+LD + +QSFWSTC+ V RLTDPIL L
Sbjct: 412  LKRMVNIKHNLQSMVTSVEWAESPYSKKPEGFALLDYIGNQSFWSTCSLVCRLTDPILRL 471

Query: 382  LKLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGF 203
            L++V S++ P+M +VYAG+YR KE IKKEL++  DY VYW+IIDHRWE L+RHPLHAAGF
Sbjct: 472  LRMVSSEERPAMAYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGF 531

Query: 202  YLNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIR 23
            YLNPK F + EED H HIRSLV+DCIEKLV DP IQDKI++E  SYL+  GDFGRKMA+R
Sbjct: 532  YLNPKFFYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVR 591

Query: 22   SRDTILP 2
            +RDT+ P
Sbjct: 592  ARDTLFP 598
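
Note on the coordinates above: the query is nucleotide and the subject is protein, so every aligned residue covers three query bases, and "Frame = -2" means the translation runs on the reverse strand (query coordinates decrease from 1282 to 2). The sketch below checks that arithmetic for this alignment. It assumes the usual convention that frame -1 begins at the last base of the query; that convention is an assumption here, although it agrees with the numbers in this report. The function names are hypothetical.

```python
def reverse_frame(query_len, start):
    """Frame of a minus-strand translation whose first codon begins at
    query position `start` (the higher coordinate of the alignment).
    Assumes frame -1 starts at the last base of the query."""
    return -((query_len - start) % 3 + 1)


def aligned_codons(start, end):
    """Number of codons spanned by query coordinates start..end (inclusive)."""
    span = abs(start - end) + 1
    if span % 3 != 0:
        raise ValueError("coordinate span is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    # Values taken from the report: query length 1349, alignment 1282..2.
    print(reverse_frame(1349, 1282))  # -2, matching "Frame = -2"
    print(aligned_codons(1282, 2))    # 427, matching "Identities = 265/427"
```

The codon count equals the alignment length only when the query row contains no gap characters; an alignment reported with gaps in the query has a correspondingly longer alignment length.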


>ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum
            lycopersicum]
          Length = 748

 Score =  548 bits (1412), Expect = e-153
 Identities = 262/428 (61%), Positives = 330/428 (77%), Gaps = 1/428 (0%)
 Frame = -2

Query: 1282 PALNSKKKVS-VVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRS 1106
            P +N  K+V+  V MA+ RF  D  +P DAVNS YFQPM+D IASQG  V  PSY+DLRS
Sbjct: 164  PIINQSKRVNNQVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPPVSAPSYHDLRS 223

Query: 1105 WILKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXX 926
            W+LK+SV EVR D++QC+S W RTGCS+L+ E  + K K  +N   Y P+GT+FLR    
Sbjct: 224  WVLKSSVQEVRTDIDQCSSTWARTGCSVLIDELITGKGKILLNFLVYCPQGTMFLRSVDA 283

Query: 925  XXXXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYC 746
                   D+LYELLKE V+++G+ NV+QVVT+ EERYVIAGKRLTD YPT+FWTPCA + 
Sbjct: 284  STLINSTDYLYELLKEVVDEIGVRNVLQVVTSNEERYVIAGKRLTDAYPTLFWTPCAAHS 343

Query: 745  IDLMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFM 566
            IDLML+D  +L  +  I+ QAKSIS +IY++   ++M+R++T GVDLVDLG TRS+TDF+
Sbjct: 344  IDLMLEDFNKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFL 403

Query: 565  TLKRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILH 386
            TLKRM N++ NLQSMVTS EW  S  S+K EG A+LD + +QSFWSTC+ + RLTDPIL 
Sbjct: 404  TLKRMQNIKHNLQSMVTSVEWAESPYSKKPEGFALLDYISNQSFWSTCSLICRLTDPILR 463

Query: 385  LLKLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAG 206
            LL++V S++ P+M +VYAG+YR KE IKKEL++  DY VYW+IIDHRWE L+RHPLHAAG
Sbjct: 464  LLRMVSSEERPAMPYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAG 523

Query: 205  FYLNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAI 26
            FYLNPK F + EED H HIRSLV+DCIEKLV DP IQDKI++E  SYL+  GDFGRKMA+
Sbjct: 524  FYLNPKFFYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAV 583

Query: 25   RSRDTILP 2
            R+RDT+ P
Sbjct: 584  RARDTLFP 591


>gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]
          Length = 724

 Score =  541 bits (1393), Expect = e-151
 Identities = 265/416 (63%), Positives = 326/416 (78%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1070
            V MA+GRFF DVGLPA+A NSAYFQPM++AIASQ AGV+GPSY DLRSWILKN VHE RY
Sbjct: 175  VHMAVGRFFVDVGLPAEAANSAYFQPMVEAIASQEAGVIGPSYQDLRSWILKNLVHETRY 234

Query: 1069 DVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLYE 890
            DV+Q  +AW RTGC++LV +W+S K +TF+N F Y+ E TIF R           D LYE
Sbjct: 235  DVDQYANAWERTGCTVLVDDWNSGKGETFVNFFVYNSEATIFYRSANVSHGIVSADDLYE 294

Query: 889  LLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGELP 710
            LLKETVEQ+G+ NV+QV+T+ E++Y  AGKRL  TYP++FW+PCAG C+DLMLQD+  LP
Sbjct: 295  LLKETVEQIGVKNVLQVITSCEDQYAFAGKRLATTYPSVFWSPCAGLCVDLMLQDMEHLP 354

Query: 709  EVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQNL 530
             VK+ L QAKSIS YIYS+   +NM+RR+T G+DL+D G T SST+FMTLKRML++R +L
Sbjct: 355  MVKVTLEQAKSISRYIYSNGFVLNMLRRHTFGLDLLDEGITPSSTNFMTLKRMLSMRHHL 414

Query: 529  QSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMPS 350
            QSMVTSE+W+ S  S+K EG A+LD++ SQSFWS CAS+  L DP+L LL+++ S K P+
Sbjct: 415  QSMVTSEDWIQSPHSQKPEGFALLDTMTSQSFWSACASITNLIDPLLRLLRIISSGKKPA 474

Query: 349  MGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSLE 170
            MG+VYAGLYR KEAIKK  + S DYLVY +IID RWEQL++HPLH AGFYLNPK F SLE
Sbjct: 475  MGYVYAGLYRAKEAIKKHFV-SEDYLVYLNIIDRRWEQLQQHPLHGAGFYLNPKFFYSLE 533

Query: 169  EDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
             D     RS+V+DCIE+LV DP +QDKIM+E   Y    GDFGRKMAIR+RDT+LP
Sbjct: 534  GDALLRSRSMVYDCIERLVPDPEVQDKIMKEMTYYHGGVGDFGRKMAIRARDTLLP 589


>ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis]
          Length = 745

 Score =  486 bits (1252), Expect = e-135
 Identities = 225/426 (52%), Positives = 322/426 (75%)
 Frame = -2

Query: 1279 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 1100
            +L++ +  + + MA+GRF +D+G P DAVNS YFQPM+DAIAS G     PSY+D+R WI
Sbjct: 170  SLDATRGNNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMPSYHDIRGWI 229

Query: 1099 LKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXX 920
            LKNSV EV+ DV++ T+ WG+TGCSILV +W+++  +T +   AY PEGT+FL+      
Sbjct: 230  LKNSVEEVKNDVDRYTTTWGKTGCSILVDQWNTEAGRTLLCFLAYCPEGTVFLKSVDASG 289

Query: 919  XXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCID 740
                 D LYELLK+ VE+VG+ +V+QV+T+ EE+++ AG+RLTDT+PT++WTPCA  C+D
Sbjct: 290  IMNSSDALYELLKQVVEEVGVRHVLQVITSSEEQFIAAGRRLTDTFPTLYWTPCAARCLD 349

Query: 739  LMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTL 560
            L+L+D  +L  +  I+ QA++++ ++Y+ +  +NM+RRYT G D+V+ G TRS+T+F TL
Sbjct: 350  LILEDFAKLEWINAIIEQARAVTRFVYNHSVVLNMLRRYTFGNDIVEPGITRSATNFTTL 409

Query: 559  KRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLL 380
            +RM++++ NLQ+MVTS+EWM    S+K  G+ +LD V +QSFWS+C  +V LT+P+L LL
Sbjct: 410  RRMISLKPNLQAMVTSQEWMDCPYSKKPGGLEMLDIVSNQSFWSSCGLIVCLTNPLLRLL 469

Query: 379  KLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFY 200
            ++V S++ PS+G+VYAG+YR K+A+KKEL+   +Y+VYW+IIDH WEQL   PLHAAGF+
Sbjct: 470  RIVGSERRPSIGYVYAGMYRAKDALKKELIKRDEYMVYWNIIDHWWEQLWHLPLHAAGFF 529

Query: 199  LNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRS 20
            LNPK F S++ D H+ I S +FDCIE+LV D  +QDKI +E   Y    GDFGRKMAIR+
Sbjct: 530  LNPKFFYSIKGDIHNEIVSRMFDCIERLVPDTKVQDKISKEINLYKDAVGDFGRKMAIRA 589

Query: 19   RDTILP 2
            RDT+LP
Sbjct: 590  RDTLLP 595


>gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]
          Length = 752

 Score =  480 bits (1236), Expect = e-133
 Identities = 224/434 (51%), Positives = 319/434 (73%)
 Frame = -2

Query: 1303 IEVPPGYPALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPS 1124
            I +P G   L+S +  + V MAIGRF +D+G   +AVNSAYFQPM+++IA  G G++ PS
Sbjct: 165  IVIPNGGGILDSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPS 224

Query: 1123 YYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIF 944
            Y+D+R WILKNSV EVR D ++C + WG TGCS++V +W ++  +T +N   Y P+GT+F
Sbjct: 225  YHDIRGWILKNSVEEVRGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVF 284

Query: 943  LRXXXXXXXXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWT 764
            L            D LYELLK+ VEQVG+ +VVQV+T  EE + IAG++L+DTYPT++WT
Sbjct: 285  LESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWT 344

Query: 763  PCAGYCIDLMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTR 584
            PCA  C+DL+L DIG + +V  ++ QA+SI+ ++Y+++  +NM+R+ T G D+V+   TR
Sbjct: 345  PCAASCVDLILADIGNIEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTR 404

Query: 583  SSTDFMTLKRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRL 404
            S+T+F TL RM+++++ LQ+MVTS+EWM S  S++  G+ +LD + S+SFWS+C S++RL
Sbjct: 405  SATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIIRL 464

Query: 403  TDPILHLLKLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERH 224
            T+P+L +L++V S K P+MG+VYA +Y  K AIK EL++   Y+VYW+IID RWE   RH
Sbjct: 465  TNPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRH 524

Query: 223  PLHAAGFYLNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDF 44
            PL AAGFYLNPK+F S+E D H  I S +FDCIE+LV+D N+QDKI++E  SY +  GDF
Sbjct: 525  PLCAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDF 584

Query: 43   GRKMAIRSRDTILP 2
             RK AIR+R T+LP
Sbjct: 585  ARKTAIRARGTLLP 598


>ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus]
          Length = 752

 Score =  476 bits (1225), Expect = e-131
 Identities = 221/434 (50%), Positives = 318/434 (73%)
 Frame = -2

Query: 1303 IEVPPGYPALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPS 1124
            I +P G   L+S +  + V MA+GRF +D+G   +AVNSAYFQPM+++IA  G G++ PS
Sbjct: 165  IVIPNGGGILDSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPS 224

Query: 1123 YYDLRSWILKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIF 944
            Y+D+R WILKNS+ EVR D ++C + WG TGCS++V +W ++  +T +N   Y P+GT+F
Sbjct: 225  YHDIRGWILKNSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVF 284

Query: 943  LRXXXXXXXXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWT 764
            L            D LYELLK+ VEQVG+ +VVQV+T  EE + IAG++L+DTYPT++WT
Sbjct: 285  LESVDASGIMDSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWT 344

Query: 763  PCAGYCIDLMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTR 584
            PCA  C+DL+L DIG +  V  ++ QA+SI+ ++Y+++  +NM+R+ T G D+V+   TR
Sbjct: 345  PCAASCVDLILGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTR 404

Query: 583  SSTDFMTLKRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRL 404
            S+T+F TL RM+++++ LQ+MVTS+EWM S  S++  G+ +LD + S+SFWS+C S++ L
Sbjct: 405  SATNFATLNRMVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISL 464

Query: 403  TDPILHLLKLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERH 224
            T+P+L +L++V S K P+MG+VYA +Y  K AIK EL++   Y+VYW+IID RWE   RH
Sbjct: 465  TNPLLRVLRIVGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRH 524

Query: 223  PLHAAGFYLNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDF 44
            PL+AAGFYLNPK+F S+E D H  I S +FDCIE+LV+D N+QDKI++E  SY +  GDF
Sbjct: 525  PLYAAGFYLNPKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDF 584

Query: 43   GRKMAIRSRDTILP 2
             RK AIR+R T+LP
Sbjct: 585  ARKTAIRARGTLLP 598


>ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036895|gb|ESW35425.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 756

 Score =  473 bits (1218), Expect = e-131
 Identities = 215/416 (51%), Positives = 311/416 (74%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1070
            + MAIGRF +D+G P DAVNS YF  M+DAI+S+GAG   PS+++LR WILKNSV EV+ 
Sbjct: 183  IHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNSVEEVKN 242

Query: 1069 DVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLYE 890
            D+++C   WGRTGCSILV +W+++  +  I+  AY PEG +FL+           DFLY+
Sbjct: 243  DIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYD 302

Query: 889  LLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGELP 710
            ++K+ V++VG+  V+QV+T+GEE+Y +AG+RLTDT+PT++W+P A +CID +L+D G L 
Sbjct: 303  MIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLE 362

Query: 709  EVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQNL 530
             +  ++ QAKS++ ++Y+ +A + M++RYT G D+VD   ++ +T+F TLKRM++++ NL
Sbjct: 363  WISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNL 422

Query: 529  QSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMPS 350
            Q++VTS+EW     S+K+ G+ +LD + SQ+FWS+C  +VRLT P+L +L++  S+  P+
Sbjct: 423  QALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPA 482

Query: 349  MGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSLE 170
            MG++YAG+YR KEAIKK L    +Y+VYW+II HRWE+L  HPLHAAGFYLNPK F S++
Sbjct: 483  MGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQ 542

Query: 169  EDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
             D H  I S +FDCIE+LV+D  IQDKI++E   Y S  GDFGRKMA+R+RD +LP
Sbjct: 543  GDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLP 598


>ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris]
            gi|561036894|gb|ESW35424.1| hypothetical protein
            PHAVU_001G234100g [Phaseolus vulgaris]
          Length = 869

 Score =  473 bits (1218), Expect = e-131
 Identities = 215/416 (51%), Positives = 311/416 (74%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1070
            + MAIGRF +D+G P DAVNS YF  M+DAI+S+GAG   PS+++LR WILKNSV EV+ 
Sbjct: 296  IHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNSVEEVKN 355

Query: 1069 DVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLYE 890
            D+++C   WGRTGCSILV +W+++  +  I+  AY PEG +FL+           DFLY+
Sbjct: 356  DIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYD 415

Query: 889  LLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGELP 710
            ++K+ V++VG+  V+QV+T+GEE+Y +AG+RLTDT+PT++W+P A +CID +L+D G L 
Sbjct: 416  MIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLE 475

Query: 709  EVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQNL 530
             +  ++ QAKS++ ++Y+ +A + M++RYT G D+VD   ++ +T+F TLKRM++++ NL
Sbjct: 476  WISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNL 535

Query: 529  QSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMPS 350
            Q++VTS+EW     S+K+ G+ +LD + SQ+FWS+C  +VRLT P+L +L++  S+  P+
Sbjct: 536  QALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPA 595

Query: 349  MGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSLE 170
            MG++YAG+YR KEAIKK L    +Y+VYW+II HRWE+L  HPLHAAGFYLNPK F S++
Sbjct: 596  MGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQ 655

Query: 169  EDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
             D H  I S +FDCIE+LV+D  IQDKI++E   Y S  GDFGRKMA+R+RD +LP
Sbjct: 656  GDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLP 711


>ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative
            [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED
            zinc finger domain-containing protein, putative
            [Theobroma cacao]
          Length = 749

 Score =  471 bits (1211), Expect = e-130
 Identities = 221/425 (52%), Positives = 313/425 (73%)
 Frame = -2

Query: 1276 LNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWIL 1097
            L +K+  + V +AIGRF FD+G P DAVNS YFQPM+DAI S G+GV+ PS  DL+ WIL
Sbjct: 171  LGAKRVNNHVHVAIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWIL 230

Query: 1096 KNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXX 917
            K SV EV+ D ++ T+AW RTGCSILV +W+++  +  +N   Y PEGT+FL+       
Sbjct: 231  KKSVEEVKSDNDKVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSV 290

Query: 916  XXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDL 737
                D LYELLK+ VE+VG  +V+QV+T  EE+Y++AG+RL +T+PT++WTPCA +CI+L
Sbjct: 291  INSSDALYELLKQVVEEVGSKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINL 350

Query: 736  MLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLK 557
            +L+D  +L  + +I+ QA+SI+ ++Y+ +  +NM+RRYT G D+V+   T S+T+F TLK
Sbjct: 351  ILEDFAKLEWINVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLK 410

Query: 556  RMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLK 377
            +M++++ NLQ+MVTS+EWM    S+K  G+ +LD V + SFWS+   + +LT+P+L +L+
Sbjct: 411  QMIDLKNNLQAMVTSQEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLR 470

Query: 376  LVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYL 197
            +V S+K P+MG+VYAG+YR KE IKKEL+   +Y++YW+IIDH WEQ   HPLH AGFYL
Sbjct: 471  MVGSKKRPAMGYVYAGMYRAKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYL 530

Query: 196  NPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSR 17
            NPK F S+E D  + + S + DCIEKLV D  +QDKI +E  SY +  GDFGRKMA+R+R
Sbjct: 531  NPKFFYSMEGDMPNEMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRAR 590

Query: 16   DTILP 2
            DT+LP
Sbjct: 591  DTLLP 595


>ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca
            subsp. vesca]
          Length = 754

 Score =  469 bits (1207), Expect = e-129
 Identities = 225/428 (52%), Positives = 309/428 (72%), Gaps = 2/428 (0%)
 Frame = -2

Query: 1279 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 1100
            AL S+K  S V  AIGRF FD+G P +AVNSAYFQPM+DAIAS G G+  P+ +DLRSWI
Sbjct: 168  ALVSRKVNSYVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWI 227

Query: 1099 LKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXX 920
            LKNSV E R ++++  + WGRTGCSILV +W+++     ++   YSPEGT+FL       
Sbjct: 228  LKNSVEEARNNIDKHRATWGRTGCSILVDQWNTELDNVMLSFLVYSPEGTVFLESVDASA 287

Query: 919  XXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCID 740
                 D LY+LL+  VE VG+ +VVQV+T+GEE++V+AG+RL DT+P +FW PCA  C+D
Sbjct: 288  IINSSDALYDLLRRVVEDVGVGDVVQVITSGEEQFVVAGRRLADTFPNLFWIPCAARCLD 347

Query: 739  LMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTL 560
            L+L+D G L  +  ++ QA+SI+ ++Y+    +N++RR T G D+V+ G TR  T F TL
Sbjct: 348  LILEDFGSLDWIHAVIEQARSITKFVYNHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTL 407

Query: 559  KRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVC--SQSFWSTCASVVRLTDPILH 386
            KR+++++  LQ MVTS+EWM    S++  G+ + D +    QSFWS+C  +VRLT P+L 
Sbjct: 408  KRLVDLKHCLQVMVTSQEWMDCPYSKEPGGLEISDLISDRDQSFWSSCTLIVRLTSPLLR 467

Query: 385  LLKLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAG 206
            +L++V  +K P+MGF+YAG+YR KEAIKKEL+   +Y+VYW+IID RWEQ    PLHAAG
Sbjct: 468  VLRMVGCEKRPAMGFIYAGMYRAKEAIKKELVKREEYMVYWNIIDQRWEQHWNFPLHAAG 527

Query: 205  FYLNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAI 26
            FYLNPK F S+E D H+ I+S ++DCIE++V D  +QDKIM+E  SY +  GDF RKMAI
Sbjct: 528  FYLNPKIFYSIEGDIHNSIQSGMYDCIERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAI 587

Query: 25   RSRDTILP 2
            R+RDT+LP
Sbjct: 588  RARDTLLP 595


>ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine
            max] gi|571489936|ref|XP_006591345.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X2 [Glycine
            max] gi|571489939|ref|XP_006591346.1| PREDICTED:
            uncharacterized protein LOC100817502 isoform X3 [Glycine
            max]
          Length = 759

 Score =  469 bits (1207), Expect = e-129
 Identities = 221/422 (52%), Positives = 310/422 (73%)
 Frame = -2

Query: 1267 KKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNS 1088
            KK  + + MAIGRF +D+G P DAVNS YFQ M+DAIAS+G G   P +++LR WILKNS
Sbjct: 179  KKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNS 238

Query: 1087 VHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXX 908
            V EV+ D+++C   WGRTGCSILV +W+++  K  I+  AY PEG +FLR          
Sbjct: 239  VEEVKNDIDRCKMTWGRTGCSILVDQWTTETGKILISFLAYCPEGLVFLRSLDATEISTS 298

Query: 907  XDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQ 728
             DFLY+L+K+ VE+VG   VVQV+T+GEE+Y IAG+RLTDT+PT++ +P A +CIDL+L+
Sbjct: 299  ADFLYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILE 358

Query: 727  DIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRML 548
            D G L  +  ++ QA+S++ ++Y+ +A +NM++RYT G D+VD   +  +T+F TLKRM+
Sbjct: 359  DFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMV 418

Query: 547  NVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVD 368
            +++ NLQ++VTS+EW  S  S++  G+ +LD + +Q+FWS+C  +V LT P+L ++++  
Sbjct: 419  DLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIAS 478

Query: 367  SQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPK 188
            S+  P+MG+VYAG+YR KEAIKK L    +Y+VYW+II HRWE+L  HPLHAAGFYLNPK
Sbjct: 479  SEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 538

Query: 187  HFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTI 8
             F S++ D H  I S +FDCIE+LV D  IQDKI++E   Y S  GDFGRKMA+R+RD +
Sbjct: 539  FFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNL 598

Query: 7    LP 2
            LP
Sbjct: 599  LP 600


>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine
            max] gi|571542833|ref|XP_006601996.1| PREDICTED:
            uncharacterized protein LOC100806265 isoform X2 [Glycine
            max]
          Length = 758

 Score =  468 bits (1203), Expect = e-129
 Identities = 219/422 (51%), Positives = 310/422 (73%)
 Frame = -2

Query: 1267 KKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNS 1088
            KK  + + MAIGRF +D+G P DAVN  +FQ M+DAIAS+G G   PS+++LR WILKNS
Sbjct: 178  KKMDNHIYMAIGRFLYDIGAPFDAVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNS 237

Query: 1087 VHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXX 908
            V EV+ D+++C   WGRTGCSILV +W+++  +  I+  AY PEG +FL+          
Sbjct: 238  VEEVKNDIDRCKMTWGRTGCSILVDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTS 297

Query: 907  XDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQ 728
             DFLY+L+K+ VE++G+  VVQV+T+GEE+Y IAG+RL DT+PT++W+P A +CIDL+L+
Sbjct: 298  PDFLYDLIKQVVEEIGVGKVVQVITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILE 357

Query: 727  DIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRML 548
            D G L  +  ++ QAKS++ ++Y+ +A +NM++RYT G D+VD   +R +T+F TLKRM+
Sbjct: 358  DFGNLEWISAVIEQAKSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMV 417

Query: 547  NVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVD 368
            +++ NLQ++VTS+EW     S++  G+ +LD + +Q+FWS+C  +V LT P+L +L++  
Sbjct: 418  DLKHNLQALVTSQEWADCPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAG 477

Query: 367  SQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPK 188
            S+  P MG+VYAG+YRVKEAIKK L    +Y+VYW+II HRWE+L  HPLHAAGFYLNPK
Sbjct: 478  SEMRPGMGYVYAGMYRVKEAIKKALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPK 537

Query: 187  HFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTI 8
             F S++ D    I S +FDCIE+LV D  IQDKI++E   Y S  GDFGRKMA+R+RD +
Sbjct: 538  FFYSIQGDILGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNL 597

Query: 7    LP 2
            LP
Sbjct: 598  LP 599


>ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis]
            gi|223536481|gb|EEF38128.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 753

 Score =  464 bits (1195), Expect = e-128
 Identities = 220/426 (51%), Positives = 309/426 (72%)
 Frame = -2

Query: 1279 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 1100
            AL +K+    V MAIGRF +D+G P DAVNS YFQPM+DAIAS G  V  PS +DLR WI
Sbjct: 178  ALGAKRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRGWI 237

Query: 1099 LKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXX 920
            LKNSV EV+ +V++  + W RTGCS+LV +W++   +T ++   Y  EG +FL+      
Sbjct: 238  LKNSVEEVKTEVDKHMATWARTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASD 297

Query: 919  XXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCID 740
                 D LYEL+K+ VE+VG+ +V+QV+T+ EE+Y++ G+RLTDT+PT++  PCA +CID
Sbjct: 298  IINSSDALYELIKKVVEEVGVRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCID 357

Query: 739  LMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTL 560
            L+L+D  +L  +  ++ QA+SI+ ++Y+ +  +NM++RYT G ++V  G T  +T+F TL
Sbjct: 358  LILEDFAKLEWISTVILQARSITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFETL 417

Query: 559  KRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLL 380
            KRM++++  LQ+MVTS+EWM    S+K  G+ +LD + +QSFWS+C  +  LT+P+L LL
Sbjct: 418  KRMVDLKHTLQTMVTSQEWMDCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLL 477

Query: 379  KLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFY 200
            ++V S+K P MG+VYAG+YR KEAIKKEL+   DY+VYW+IIDH WEQ    PLHAAGF+
Sbjct: 478  RIVSSKKRPPMGYVYAGIYRAKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFF 537

Query: 199  LNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRS 20
            LNPK   S+E D H+ I S +FDCIEKLV D  +QDKI +E  SY +  GDFGRKMA+R+
Sbjct: 538  LNPKVLYSIEGDLHNEILSGMFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAVRA 597

Query: 19   RDTILP 2
            R+T+LP
Sbjct: 598  RETLLP 603


>ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica]
            gi|462411082|gb|EMJ16131.1| hypothetical protein
            PRUPE_ppa001359mg [Prunus persica]
          Length = 845

 Score =  448 bits (1152), Expect = e-123
 Identities = 215/417 (51%), Positives = 297/417 (71%), Gaps = 1/417 (0%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAV-NSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVR 1073
            + MAIGRF +++  P D V NS YFQPM+DAIAS G G + PSY DLR WILKN+V EV+
Sbjct: 278  IHMAIGRFLYEIQAPLDVVKNSVYFQPMIDAIASGGKGTIAPSYDDLRGWILKNAVGEVK 337

Query: 1072 YDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLY 893
             D+ Q    W RTGCS+LV +WSS+K KT +N     PEGTI+L+           D L+
Sbjct: 338  SDIHQHMETWARTGCSLLVNQWSSEKGKTLLNFAVQCPEGTIYLKSVDASYFIFSPDALF 397

Query: 892  ELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGEL 713
            E LKE VE+VG+ +V+QV+T  EE++ +AGKRL DT+PT++W+PC    IDL+L+D G++
Sbjct: 398  EFLKEVVEEVGVGHVLQVITNTEEQFAVAGKRLMDTFPTLYWSPCVATSIDLILEDFGKV 457

Query: 712  PEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQN 533
              +  ++ QA+S++ +IY     +NM+RRYT G D+V LG TR +T+F TLK+M +++ N
Sbjct: 458  EWINSVIEQARSVTRFIYKHVVILNMMRRYTFGNDIVRLGVTRFATNFTTLKQMADLKFN 517

Query: 532  LQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMP 353
            LQSMVTS+EWM    S+  EG AVLD + + SFWS C  V  LT+P+L +L++V SQK  
Sbjct: 518  LQSMVTSKEWMCCPYSKTPEGSAVLDVLSNHSFWSACILVTHLTNPLLRVLRIVGSQKRA 577

Query: 352  SMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSL 173
            +MG+V+AG+YR KE IK+EL+   +Y+VYW IID+RW++L   PLHAAGFYLNPK F S+
Sbjct: 578  AMGYVFAGIYRAKETIKRELVKREEYMVYWDIIDYRWKKLWPLPLHAAGFYLNPKFFYSV 637

Query: 172  EEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
            + D H+ I S +FDCIE+LV D  IQD++++E   Y +  GD GR +A+R+RD +LP
Sbjct: 638  KGDLHNEIISRMFDCIERLVPDIKIQDEVIKEINLYKNAVGDLGRNLAVRARDNLLP 694


>ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao]
            gi|508701288|gb|EOX93184.1| HAT transposon superfamily,
            putative [Theobroma cacao]
          Length = 750

 Score =  439 bits (1129), Expect = e-120
 Identities = 205/416 (49%), Positives = 297/416 (71%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1070
            V MAIGRF +D+G+  DAVNS YFQPM+DAIAS G+G+V PS  DLR WILKN + EV+ 
Sbjct: 184  VHMAIGRFLYDIGVNLDAVNSVYFQPMIDAIASTGSGIVPPSSQDLRGWILKNVMEEVKD 243

Query: 1069 DVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLYE 890
            D+++  + WG+TGCSILV +WS K  +T ++   Y P+ T+FL+           D L E
Sbjct: 244  DIDRNKTMWGKTGCSILVEQWSPKSGRTLLSFLVYCPQATVFLKSVDASRVIFSADHLNE 303

Query: 889  LLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGELP 710
            LLK+ VE+VG+ NVVQV+T  EE+Y +AGKRL +++P+++W PC  +C+D+ML+D   L 
Sbjct: 304  LLKQVVEEVGVENVVQVITNCEEQYFLAGKRLMESFPSLYWAPCLVHCVDMMLEDFANLE 363

Query: 709  EVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQNL 530
             +   + QAKS++ ++Y+ +  +NM+RR+T   D+V+   TR +++F TLKRM +++  L
Sbjct: 364  WISETIEQAKSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADLKLKL 423

Query: 529  QSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMPS 350
            Q+MV S++W     ++K  G+ +LD V ++SFW++C  +VRL  P+L +L++V S+K  +
Sbjct: 424  QAMVNSQDWSECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSKKRST 483

Query: 349  MGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSLE 170
            MG+VYAG+YR KE IKKEL+   DY+VYW+IIDHRWEQ    PL+AA F+LNPK F S+E
Sbjct: 484  MGYVYAGIYRAKETIKKELVKKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFFYSIE 543

Query: 169  EDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
             + H+ I S +FDCIE+LV D N+QD+I+RE   Y +  GD GR MA+R+RD +LP
Sbjct: 544  GNIHNDILSSMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLP 599


>ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310825 [Fragaria vesca
            subsp. vesca]
          Length = 869

 Score =  439 bits (1129), Expect = e-120
 Identities = 209/417 (50%), Positives = 295/417 (70%), Gaps = 1/417 (0%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAV-NSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVR 1073
            + MAIGRF +++  P DAV NS YFQPM+DAIAS G     PSY+DLR WIL ++  EV+
Sbjct: 299  IQMAIGRFLYEIQAPLDAVKNSLYFQPMIDAIASGGMESKAPSYHDLRGWILNDAAEEVK 358

Query: 1072 YDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLY 893
             ++ Q T++W R GCS+LV +++S+K +  +N   Y PEGT +L+           D LY
Sbjct: 359  NEIYQHTNSWERNGCSLLVNQFNSEKGRILLNFSVYCPEGTTYLKSVDASTFINSPDALY 418

Query: 892  ELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGEL 713
            E+LK+ VE+VG+  V+QV+T  EE YV+AGKRL DT+PT++W+PCA  CI+ +L+D G+ 
Sbjct: 419  EILKQVVEEVGVRRVLQVITNSEEHYVVAGKRLMDTFPTLYWSPCAAACINSILEDFGKF 478

Query: 712  PEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQN 533
              +  I+ QA+S++ +IY     +NM+RRYT G D+V LG TR +TDFMTLK+M +++ N
Sbjct: 479  EWINSIIAQARSVTRFIYKHVVILNMMRRYTFGNDIVKLGITRYATDFMTLKQMADLKFN 538

Query: 532  LQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMP 353
            LQ+MVTS+EW G   S+  EG+A+LD + + +FWS+C  + R T+P+L +L++V SQK  
Sbjct: 539  LQTMVTSKEWEGCPYSKTPEGLAMLDLLSNHTFWSSCIMITRFTNPLLQVLRIVGSQKKA 598

Query: 352  SMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSL 173
            +MG+V+ G+YR KE IK+EL+    Y  YW+IID+RW +L  HPLHAAGFYLNPK F S+
Sbjct: 599  AMGYVFGGMYRAKETIKRELVKKEVYTAYWNIIDYRWAKLWDHPLHAAGFYLNPKFFYSI 658

Query: 172  EEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
            + + H  I S +FDCIEKLV D  +QD+I +E   Y +  GD GR +AIR+RDT+LP
Sbjct: 659  KGEMHKVIMSRMFDCIEKLVPDLKVQDEISKEINLYQNAVGDMGRNLAIRARDTLLP 715


>ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine
            max]
          Length = 729

 Score =  438 bits (1126), Expect = e-120
 Identities = 211/422 (50%), Positives = 298/422 (70%)
 Frame = -2

Query: 1267 KKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNS 1088
            KK  + + MAIGRF +D+G P DAVNS YFQ M+DAIAS+G G   P +++LR WILKNS
Sbjct: 179  KKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKNS 238

Query: 1087 VHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXX 908
            V EV+ D+++C   WGRTGCSILV +W+++                              
Sbjct: 239  VEEVKNDIDRCKMTWGRTGCSILVDQWTTET----------------------------- 269

Query: 907  XDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQ 728
             DFLY+L+K+ VE+VG   VVQV+T+GEE+Y IAG+RLTDT+PT++ +P A +CIDL+L+
Sbjct: 270  -DFLYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILE 328

Query: 727  DIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRML 548
            D G L  +  ++ QA+S++ ++Y+ +A +NM++RYT G D+VD   +  +T+F TLKRM+
Sbjct: 329  DFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMV 388

Query: 547  NVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVD 368
            +++ NLQ++VTS+EW  S  S++  G+ +LD + +Q+FWS+C  +V LT P+L ++++  
Sbjct: 389  DLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIAS 448

Query: 367  SQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPK 188
            S+  P+MG+VYAG+YR KEAIKK L    +Y+VYW+II HRWE+L  HPLHAAGFYLNPK
Sbjct: 449  SEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPK 508

Query: 187  HFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTI 8
             F S++ D H  I S +FDCIE+LV D  IQDKI++E   Y S  GDFGRKMA+R+RD +
Sbjct: 509  FFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNL 568

Query: 7    LP 2
            LP
Sbjct: 569  LP 570


>ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa]
            gi|550335284|gb|ERP58729.1| hypothetical protein
            POPTR_0006s02210g [Populus trichocarpa]
          Length = 847

 Score =  433 bits (1114), Expect = e-119
 Identities = 203/426 (47%), Positives = 300/426 (70%)
 Frame = -2

Query: 1279 ALNSKKKVSVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWI 1100
            A+ S+   + +    GRF +D+G   DA++S + QP++D +A    G+  PS+ DLR  I
Sbjct: 278  AMGSETADNPIHAIWGRFLYDIGASLDAMDSNFSQPLIDTVAYGRPGIAAPSHQDLRGRI 337

Query: 1099 LKNSVHEVRYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXX 920
            LK+ V EV+ D+ Q  + W +TGCS+LV E +S+   T +N   Y  +GT+FL+      
Sbjct: 338  LKSLVEEVKSDINQYKTRWVKTGCSLLVEECNSESGVTTLNFLVYCSKGTVFLKSVDASN 397

Query: 919  XXXXXDFLYELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCID 740
                 D LYELLK  VE+VG  N++QV+T GEE Y+ AGK+L DT+P+++W PCA  CID
Sbjct: 398  LIHSTDGLYELLKLMVEEVGAGNILQVITNGEEHYIAAGKKLMDTFPSLYWAPCAARCID 457

Query: 739  LMLQDIGELPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTL 560
            L+L+DIG+L  +  +L QAKS++ ++Y+++A +N++R++TSG D+V  G TRS+T+F  L
Sbjct: 458  LILEDIGKLDWINTVLEQAKSVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATNFTAL 517

Query: 559  KRMLNVRQNLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLL 380
            KRM N + NLQ+MVTS+EWM    S++  G+A++D + ++SFWS+C  ++RLT P+L +L
Sbjct: 518  KRMANFKLNLQTMVTSQEWMDCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPLLQVL 577

Query: 379  KLVDSQKMPSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFY 200
             +V S+K  +MG+V++G+YR KE IKKEL+   DY+VYW+IIDHRWEQ  + PLHAAGF+
Sbjct: 578  VIVSSEKRAAMGYVFSGIYRAKETIKKELVKREDYMVYWNIIDHRWEQQWQTPLHAAGFF 637

Query: 199  LNPKHFNSLEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRS 20
             NPK F S+E D H+ I S +FDCIE+LV D  +QDKI++E   Y + +G  G+K+AIR+
Sbjct: 638  FNPKFFYSIEGDMHNKILSRMFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKLAIRA 697

Query: 19   RDTILP 2
            R T+LP
Sbjct: 698  RGTMLP 703


>ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis]
            gi|223539752|gb|EEF41333.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 854

 Score =  432 bits (1112), Expect = e-118
 Identities = 195/418 (46%), Positives = 293/418 (70%)
 Frame = -2

Query: 1255 SVVDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEV 1076
            +V+   +GRF +D+G   DA++S YF+ ++D ++S  +G V PS +DLR WILK  V E+
Sbjct: 289  NVIHTTVGRFLYDIGANFDALDSIYFRSLIDMLSSGASGAVAPSNHDLRGWILKKLVEEI 348

Query: 1075 RYDVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFL 896
            + D++Q  + W RTGCS+LV EW+S+   T +N      +GT+FL+           D L
Sbjct: 349  KNDIDQSRTTWARTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYSPDGL 408

Query: 895  YELLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGE 716
            Y LLK+ VE+VG +NV+QV+T G E Y +AGKRL + +P++FW PCA +C+DL+L+D  +
Sbjct: 409  YVLLKQVVEEVGASNVLQVITNGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILEDFAK 468

Query: 715  LPEVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQ 536
            L  +  ++ QAKS++ ++Y+ +A +N++R++T G D+V  G TRS+T+F  L+RM + + 
Sbjct: 469  LEWIDAVIEQAKSVTRFVYNHSAVLNLMRKFTYGKDIVQQGLTRSATNFTMLQRMADFKL 528

Query: 535  NLQSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKM 356
            NLQ+M+TS+EWM    S++  G+A+LD + ++SFWS+C  ++RLT P++ +L +   ++ 
Sbjct: 529  NLQTMITSQEWMDCPYSKQHGGLAMLDIISNRSFWSSCILIIRLTSPLIRVLGIAGGKRK 588

Query: 355  PSMGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNS 176
             +MG+++AG+YR KE IK+EL+   DY+VYW+IIDHRW+Q    PLH AGF+LNPK F S
Sbjct: 589  AAMGYIFAGIYRAKETIKRELVKREDYMVYWNIIDHRWDQRRHPPLHVAGFFLNPKFFYS 648

Query: 175  LEEDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLSCKGDFGRKMAIRSRDTILP 2
            +E D H+ I S VFDCIE+LV D  +QDKI +E   Y +  GD GRKMAIRSR T+LP
Sbjct: 649  IEGDVHNEILSRVFDCIERLVPDIEVQDKIAKELNIYKNAVGDLGRKMAIRSRGTLLP 706


>ref|XP_004981234.1| PREDICTED: uncharacterized protein LOC101757413 [Setaria italica]
          Length = 803

 Score =  395 bits (1014), Expect = e-107
 Identities = 200/417 (47%), Positives = 276/417 (66%), Gaps = 1/417 (0%)
 Frame = -2

Query: 1249 VDMAIGRFFFDVGLPADAVNSAYFQPMLDAIASQGAGVVGPSYYDLRSWILKNSVHEVRY 1070
            V +A+GRF +DVG+P +AVNS YFQPML+AIAS G      SY+D R  ILK S+ +   
Sbjct: 231  VSVAVGRFLYDVGVPLEAVNSVYFQPMLEAIASAGGRPEALSYHDFRGHILKKSLDDATS 290

Query: 1069 DVEQCTSAWGRTGCSILVYEWSSKKCKTFINLFAYSPEGTIFLRXXXXXXXXXXXDFLYE 890
             +E    +W RTGCS+L  EW + K +T IN   Y PEGT+FL+           D LYE
Sbjct: 291  RLEFFKGSWTRTGCSVLADEWITDKGRTLINFSVYCPEGTMFLKSVDATSIVASSDALYE 350

Query: 889  LLKETVEQVGLNNVVQVVTTGEERYVIAGKRLTDTYPTIFWTPCAGYCIDLMLQDIGELP 710
            LLK  VE+VG   VVQV+T   E +  AGK+L +T+PT+FW+PC+  CID ML+D  ++ 
Sbjct: 351  LLKSVVEEVGEKKVVQVITNNSEIHAAAGKKLGETFPTLFWSPCSFQCIDGMLEDFSKVG 410

Query: 709  EVKMILNQAKSISSYIYSDTATINMIRRYTSGVDLVDLGTTRSSTDFMTLKRMLNVRQNL 530
             +  I++ AK+I+ + Y+    +N++++Y  G DL+    TR+S +F+TLK M  +++ L
Sbjct: 411  AISEIISNAKAITGFFYNSAFALNLMKKYLHGKDLLVPAETRASMNFVTLKNMYGLKEAL 470

Query: 529  QSMVTSEEWMGSYCSEKAEGIAVLDSVCSQSFWSTCASVVRLTDPILHLLKLVDSQKMPS 350
            Q+MV S+EW+  +   K  GI V + V S  FWS+CA+VV +T+P++HLLKLV S K P+
Sbjct: 471  QAMVNSDEWI-HFLLPKKGGIEVSNLVNSLQFWSSCAAVVHITEPLVHLLKLVGSTKRPA 529

Query: 349  MGFVYAGLYRVKEAIKKELLDSGDYLVYWSIIDHRWEQLERHPLHAAGFYLNPKHFNSLE 170
            MG++YAGLY+ K AIKKEL+   DY+ YW+IID RW+     PLH+AGF+LNP  F+ + 
Sbjct: 530  MGYIYAGLYQAKAAIKKELVSKNDYMAYWNIIDWRWDNQTPRPLHSAGFFLNPLFFDGIR 589

Query: 169  EDGHHHIRSLVFDCIEKLVTDPNIQDKIMRERASYLS-CKGDFGRKMAIRSRDTILP 2
             D  + I S + DCIE+LV+D  IQDKI RE   Y S   GDF R+MAIRSR T+ P
Sbjct: 590  GDVSNGIFSGMLDCIERLVSDVKIQDKIQRELNMYRSETAGDFRRQMAIRSRRTLPP 646

