BLASTX nr result
ID: Mentha25_contig00017749
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00017749 (1189 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...  534   e-149
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...  530   e-148
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]      517   e-144
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...  469   e-129
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...  465   e-128
ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310...  462   e-127
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...  462   e-127
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...  460   e-127
ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prun...  456   e-126
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...  456   e-125
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...  451   e-124
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...  451   e-124
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...  450   e-124
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...  448   e-123
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...  444   e-122
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]        441   e-121
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...  437   e-120
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...  426   e-117
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...
418 e-114 ref|NP_188861.2| hAT dimerization domain-containing protein [Ara... 387 e-105 >ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum] Length = 755 Score = 534 bits (1376), Expect = e-149 Identities = 256/392 (65%), Positives = 311/392 (79%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 R+GCS+LVDE +GKGKT +NF YCPEGT+FL D LYEL+KE+VEEVG Sbjct: 253 RSGCSVLVDEWITGKGKTLLNFLVYCPEGTMFLRSVDASTLINSTDYLYELLKEVVEEVG 312 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 +RNVLQVVT+ E+RY+IAGKRLTD YP++FWTPCA H IDLML+D+ + + +++QA+ Sbjct: 313 VRNVLQVVTSNEERYIIAGKRLTDAYPTLFWTPCAAHSIDLMLEDLKKLEWIDTIMEQAK 372 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 SISR+IY+N +++MMR++T GVDLVD+G TRS TDF+TLKR+VNI+H+LQSMV S EW Sbjct: 373 SISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMVNIKHNLQSMVTSVEWA 432 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 ES YSK E FA+ D I NQSFWS+C+ + RLTDP+LRL R+V S + PAM YV+AG+YR Sbjct: 433 ESPYSKKPEGFALLDYIGNQSFWSTCSLVCRLTDPILRLLRMVSSEERPAMAYVYAGVYR 492 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IKKEL K++Y YW+IID RWE LQRHPLHAAGFYLNPKFFY+ E D H HI+SL Sbjct: 493 AKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFYTTEEDVHLHIRSL 552 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 V DCIE+LVPD K+ DKI+KE SY AGDFGRKMA+RARDTL P EWW TYGG CPNL Sbjct: 553 VYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPAEWWSTYGGGCPNL 612 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9 ARLAIRILSQT LI+ K +VPLE +H+ N Sbjct: 613 ARLAIRILSQTSSLIRSKPGRVPLEEMHETKN 644 >ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum] Length = 748 Score = 530 bits (1365), Expect = e-148 Identities = 253/392 (64%), Positives = 310/392 (79%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGCS+L+DE +GKGK +NF YCP+GT+FL D 
LYEL+KE+V+E+G Sbjct: 246 RTGCSVLIDELITGKGKILLNFLVYCPQGTMFLRSVDASTLINSTDYLYELLKEVVDEIG 305 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 +RNVLQVVT+ E+RYVIAGKRLTD YP++FWTPCA H IDLML+D + + +++QA+ Sbjct: 306 VRNVLQVVTSNEERYVIAGKRLTDAYPTLFWTPCAAHSIDLMLEDFNKLEWIDTIMEQAK 365 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 SISR+IY+N +++MMR++T GVDLVD+G TRS TDF+TLKR+ NI+H+LQSMV S EW Sbjct: 366 SISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMQNIKHNLQSMVTSVEWA 425 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 ES YSK E FA+ D ISNQSFWS+C+ I RLTDP+LRL R+V S + PAM YV+AG+YR Sbjct: 426 ESPYSKKPEGFALLDYISNQSFWSTCSLICRLTDPILRLLRMVSSEERPAMPYVYAGVYR 485 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IKKEL K++Y YW+IID RWE LQRHPLHAAGFYLNPKFFY+ E D H HI+SL Sbjct: 486 AKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFYTTEEDVHLHIRSL 545 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 V DCIE+LVPD K+ DKI+KE SY AGDFGRKMA+RARDTL P EWW TYGG CPNL Sbjct: 546 VYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPAEWWSTYGGGCPNL 605 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9 ARLAIRILSQT LI+ K ++P+E +H+ TN Sbjct: 606 ARLAIRILSQTSSLIRSKPGRIPIEEMHETTN 637 >gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea] Length = 724 Score = 517 bits (1331), Expect = e-144 Identities = 254/393 (64%), Positives = 309/393 (78%), Gaps = 1/393 (0%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGC++LVD+ SGKG+TFVNFF Y E TIF D LYEL+KE VE++G Sbjct: 245 RTGCTVLVDDWNSGKGETFVNFFVYNSEATIFYRSANVSHGIVSADDLYELLKETVEQIG 304 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 ++NVLQV+T+ ED+Y AGKRL TYPS+FW+PCAG C+DLMLQD+ P VK+ L+QA+ Sbjct: 305 VKNVLQVITSCEDQYAFAGKRLATTYPSVFWSPCAGLCVDLMLQDMEHLPMVKVTLEQAK 364 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 SISRYIYSN V+NM+RR+TFG+DL+D G T S 
T+FMTLKR++++RH LQSMV SE+W Sbjct: 365 SISRYIYSNGFVLNMLRRHTFGLDLLDEGITPSSTNFMTLKRMLSMRHHLQSMVTSEDWI 424 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 +S +S+ E FA+ D++++QSFWS+CASI L DPLLRL RI+ S K PAMGYV+AGLYR Sbjct: 425 QSPHSQKPEGFALLDTMTSQSFWSACASITNLIDPLLRLLRIISSGKKPAMGYVYAGLYR 484 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKEAIKK E+YL Y +IID RWEQLQ+HPLH AGFYLNPKFFYSLEGD +S+ Sbjct: 485 AKEAIKKHF-VSEDYLVYLNIIDRRWEQLQQHPLHGAGFYLNPKFFYSLEGDALLRSRSM 543 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 V DCIERLVPD +V DKIMKE YH G GDFGRKMAIRARDTLLPTEWW+ YGG CPNL Sbjct: 544 VYDCIERLVPDPEVQDKIMKEMTYYHGGVGDFGRKMAIRARDTLLPTEWWIAYGGSCPNL 603 Query: 104 ARLAIRILSQTCCLIQHK-LDKVPLEHLHKRTN 9 +RLA+++LSQTC IQ K LDK+PLE +H+ N Sbjct: 604 SRLAVQVLSQTCGFIQLKLLDKLPLETMHRIKN 636 >ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis] Length = 745 Score = 469 bits (1206), Expect = e-129 Identities = 219/395 (55%), Positives = 298/395 (75%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 G+TGCSILVD+ + G+T + F AYCPEGT+FL D LYEL+K++VEEV Sbjct: 249 GKTGCSILVDQWNTEAGRTLLCFLAYCPEGTVFLKSVDASGIMNSSDALYELLKQVVEEV 308 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+R+VLQV+T+ E++++ AG+RLTDT+P+++WTPCA C+DL+L+D A+ + +++QA Sbjct: 309 GVRHVLQVITSSEEQFIAAGRRLTDTFPTLYWTPCAARCLDLILEDFAKLEWINAIIEQA 368 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 R+++R++Y+++ V+NM+RRYTFG D+V+ G TRS T+F TL+R+++++ +LQ+MV S+EW Sbjct: 369 RAVTRFVYNHSVVLNMLRRYTFGNDIVEPGITRSATNFTTLRRMISLKPNLQAMVTSQEW 428 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 + YSK + D +SNQSFWSSC I+ LT+PLLRL RIV S + P++GYV+AG+Y Sbjct: 429 MDCPYSKKPGGLEMLDIVSNQSFWSSCGLIVCLTNPLLRLLRIVGSERRPSIGYVYAGMY 488 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 RAK+A+KKEL ++EY+ YW+IID WEQL 
PLHAAGF+LNPKFFYS++GD H+ I S Sbjct: 489 RAKDALKKELIKRDEYMVYWNIIDHWWEQLWHLPLHAAGFFLNPKFFYSIKGDIHNEIVS 548 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLVPD KV DKI KE Y GDFGRKMAIRARDTLLP EWW TYGG CPN Sbjct: 549 RMFDCIERLVPDTKVQDKISKEINLYKDAVGDFGRKMAIRARDTLLPAEWWSTYGGSCPN 608 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 LARLA RI SQTC + +++ E ++ N L Sbjct: 609 LARLATRIQSQTCSSLADTRNQIHFERIYDTRNCL 643 >ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] Length = 749 Score = 465 bits (1196), Expect = e-128 Identities = 217/394 (55%), Positives = 294/394 (74%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGCSILV++ + G+ +NF YCPEGT+FL D LYEL+K++VEEVG Sbjct: 250 RTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINSSDALYELLKQVVEEVG 309 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 ++VLQV+T E++Y++AG+RL +T+P+++WTPCA HCI+L+L+D A+ + ++++QAR Sbjct: 310 SKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINLILEDFAKLEWINVIIEQAR 369 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 SI+R++Y+++ V+NM+RRYT G D+V+ T S T+F TLK++++++++LQ+MV S+EW Sbjct: 370 SITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDLKNNLQAMVTSQEWM 429 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 + YSK + D +SN SFWSS I +LT+PLLR+ R+V S K PAMGYV+AG+YR Sbjct: 430 DCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSKKRPAMGYVYAGMYR 489 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IKKEL + EY+ YW+IID WEQ HPLH AGFYLNPKFFYS+EGD + + S Sbjct: 490 AKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFFYSMEGDMPNEMLSG 549 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 + DCIE+LVPD+KV DKI KE SY GDFGRKMA+RARDTLLP EWW TYGG CPNL Sbjct: 550 MLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTLLPAEWWSTYGGSCPNL 
609 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 ARLAI +LSQTC + K + +P E LH+ N+L Sbjct: 610 ARLAIHVLSQTCSTLGLKQNSIPFEKLHETRNFL 643 >ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310825 [Fragaria vesca subsp. vesca] Length = 869 Score = 462 bits (1189), Expect = e-127 Identities = 220/394 (55%), Positives = 285/394 (72%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 R GCS+LV++ S KG+ +NF YCPEGT +L D LYE++K++VEEVG Sbjct: 370 RNGCSLLVNQFNSEKGRILLNFSVYCPEGTTYLKSVDASTFINSPDALYEILKQVVEEVG 429 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 +R VLQV+T E+ YV+AGKRL DT+P+++W+PCA CI+ +L+D +F + ++ QAR Sbjct: 430 VRRVLQVITNSEEHYVVAGKRLMDTFPTLYWSPCAAACINSILEDFGKFEWINSIIAQAR 489 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 S++R+IY + ++NMMRRYTFG D+V +G TR TDFMTLK++ +++ +LQ+MV S+EW Sbjct: 490 SVTRFIYKHVVILNMMRRYTFGNDIVKLGITRYATDFMTLKQMADLKFNLQTMVTSKEWE 549 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 YSK E A+ D +SN +FWSSC I R T+PLL++ RIV S K AMGYVF G+YR Sbjct: 550 GCPYSKTPEGLAMLDLLSNHTFWSSCIMITRFTNPLLQVLRIVGSQKKAAMGYVFGGMYR 609 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IK+EL KE Y YW+IID RW +L HPLHAAGFYLNPKFFYS++G+ H I S Sbjct: 610 AKETIKRELVKKEVYTAYWNIIDYRWAKLWDHPLHAAGFYLNPKFFYSIKGEMHKVIMSR 669 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 + DCIE+LVPDLKV D+I KE Y GD GR +AIRARDTLLP EWW TYG CPN+ Sbjct: 670 MFDCIEKLVPDLKVQDEISKEINLYQNAVGDMGRNLAIRARDTLLPAEWWSTYGSGCPNM 729 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 ARLA+ ILSQTC LIQ K +++P + LHK N L Sbjct: 730 ARLAVHILSQTCSLIQCKENQIPFDQLHKTRNSL 763 >ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis] gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis] Length = 753 Score = 462 bits (1188), Expect = e-127 Identities = 219/394 (55%), Positives = 292/394 (74%) Frame = -3 Query: 
1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGCS+LVD+ + G+T ++F YC EG +FL D LYEL+K++VEEVG Sbjct: 258 RTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDASDIINSSDALYELIKKVVEEVG 317 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 +R+VLQV+T++E++Y++ G+RLTDT+P+++ PCA HCIDL+L+D A+ + V+ QAR Sbjct: 318 VRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHCIDLILEDFAKLEWISTVILQAR 377 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 SI+R++Y+++ V+NM++RYTFG ++V G T T+F TLKR+V+++H+LQ+MV S+EW Sbjct: 378 SITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFETLKRMVDLKHTLQTMVTSQEWM 437 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 + YSK + D +SNQSFWSSC I LT+PLLRL RIV S K P MGYV+AG+YR Sbjct: 438 DCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLRLLRIVSSKKRPPMGYVYAGIYR 497 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKEAIKKEL +++Y+ YW+IID WEQ PLHAAGF+LNPK YS+EGD H+ I S Sbjct: 498 AKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAGFFLNPKVLYSIEGDLHNEILSG 557 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 + DCIE+LVPD+ V DKI KE SY +GDFGRKMA+RAR+TLLP EWW TYGG CPNL Sbjct: 558 MFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAVRARETLLPAEWWSTYGGSCPNL 617 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 ARLAIR+LSQ C +KL+ + LE +H N L Sbjct: 618 ARLAIRVLSQPCSSFGYKLNHISLEQIHDTKNCL 651 >ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca subsp. 
vesca] Length = 754 Score = 460 bits (1184), Expect = e-127 Identities = 222/397 (55%), Positives = 290/397 (73%), Gaps = 2/397 (0%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 GRTGCSILVD+ + ++F Y PEGT+FL D LY+L++ +VE+V Sbjct: 247 GRTGCSILVDQWNTELDNVMLSFLVYSPEGTVFLESVDASAIINSSDALYDLLRRVVEDV 306 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+ +V+QV+T+ E+++V+AG+RL DT+P++FW PCA C+DL+L+D + V++QA Sbjct: 307 GVGDVVQVITSGEEQFVVAGRRLADTFPNLFWIPCAARCLDLILEDFGSLDWIHAVIEQA 366 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 RSI++++Y++ V+N++RR TFG D+V+ G TR T F TLKR+V+++H LQ MV S+EW Sbjct: 367 RSITKFVYNHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTLKRLVDLKHCLQVMVTSQEW 426 Query: 647 TESSYSKDQEAFAVQDSISN--QSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAG 474 + YSK+ + D IS+ QSFWSSC I+RLT PLLR+ R+V K PAMG+++AG Sbjct: 427 MDCPYSKEPGGLEISDLISDRDQSFWSSCTLIVRLTSPLLRVLRMVGCEKRPAMGFIYAG 486 Query: 473 LYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHI 294 +YRAKEAIKKEL +EEY+ YW+IID RWEQ PLHAAGFYLNPK FYS+EGD H+ I Sbjct: 487 MYRAKEAIKKELVKREEYMVYWNIIDQRWEQHWNFPLHAAGFYLNPKIFYSIEGDIHNSI 546 Query: 293 QSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGEC 114 QS + DCIER+VPD+KV DKIMKE SY AGDF RKMAIRARDTLLP EWW TYGG C Sbjct: 547 QSGMYDCIERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAIRARDTLLPAEWWSTYGGGC 606 Query: 113 PNLARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 PNLARLAIRILSQTC I ++ ++P E H N L Sbjct: 607 PNLARLAIRILSQTCGSIGYRQSQIPFEKAHGIRNCL 643 >ref|XP_007214932.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica] gi|462411082|gb|EMJ16131.1| hypothetical protein PRUPE_ppa001359mg [Prunus persica] Length = 845 Score = 456 bits (1174), Expect = e-126 Identities = 218/394 (55%), Positives = 289/394 (73%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGCS+LV++ +S KGKT +NF CPEGTI+L D L+E +KE+VEEVG Sbjct: 349 RTGCSLLVNQWSSEKGKTLLNFAVQCPEGTIYLKSVDASYFIFSPDALFEFLKEVVEEVG 408 
Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 + +VLQV+T E+++ +AGKRL DT+P+++W+PC IDL+L+D + + V++QAR Sbjct: 409 VGHVLQVITNTEEQFAVAGKRLMDTFPTLYWSPCVATSIDLILEDFGKVEWINSVIEQAR 468 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 S++R+IY + ++NMMRRYTFG D+V +G TR T+F TLK++ +++ +LQSMV S+EW Sbjct: 469 SVTRFIYKHVVILNMMRRYTFGNDIVRLGVTRFATNFTTLKQMADLKFNLQSMVTSKEWM 528 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 YSK E AV D +SN SFWS+C + LT+PLLR+ RIV S K AMGYVFAG+YR Sbjct: 529 CCPYSKTPEGSAVLDVLSNHSFWSACILVTHLTNPLLRVLRIVGSQKRAAMGYVFAGIYR 588 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IK+EL +EEY+ YW IID RW++L PLHAAGFYLNPKFFYS++GD H+ I S Sbjct: 589 AKETIKRELVKREEYMVYWDIIDYRWKKLWPLPLHAAGFYLNPKFFYSVKGDLHNEIISR 648 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 + DCIERLVPD+K+ D+++KE Y GD GR +A+RARD LLP EWW TYG CPNL Sbjct: 649 MFDCIERLVPDIKIQDEVIKEINLYKNAVGDLGRNLAVRARDNLLPAEWWSTYGSSCPNL 708 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 ARLAIRILSQTC ++Q + +++P E LHK N L Sbjct: 709 ARLAIRILSQTCSIVQGQENQIPFELLHKTRNSL 742 >ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max] gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max] gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max] Length = 759 Score = 456 bits (1172), Expect = e-125 Identities = 215/395 (54%), Positives = 292/395 (73%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 GRTGCSILVD+ T+ GK ++F AYCPEG +FL D LY+L+K++VEEV Sbjct: 254 GRTGCSILVDQWTTETGKILISFLAYCPEGLVFLRSLDATEISTSADFLYDLIKQVVEEV 313 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+D + V++QA Sbjct: 314 GAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILEDFGNLEWISAVIEQA 
373 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 RS++R++Y+ +A++NM++RYT G D+VD + T+F TLKR+V+++H+LQ++V S+EW Sbjct: 374 RSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMVDLKHNLQALVTSQEW 433 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 +S YSK + D +SNQ+FWSSC I+ LT PLL++ RI S PAMGYV+AG+Y Sbjct: 434 ADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIASSEMRPAMGYVYAGMY 493 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 RAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKFFYS++GD H I S Sbjct: 494 RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHGQIVS 553 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLVPD ++ DKI+KE Y +GDFGRKMA+RARD LLP+EWW TYGG CPN Sbjct: 554 GMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 613 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 L+RLAIRILSQT ++ K +++P E + N++ Sbjct: 614 LSRLAIRILSQTSSVMSCKRNQIPFEQIINTRNYI 648 >ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036895|gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 756 Score = 451 bits (1159), Expect = e-124 Identities = 207/395 (52%), Positives = 291/395 (73%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 GRTGCSILVD+ + G+ ++F AYCPEG +FL D LY+++K++V+EV Sbjct: 252 GRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYDMIKQVVDEV 311 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+ VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+D + V++QA Sbjct: 312 GVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLEWISAVIEQA 371 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 +S++R++Y+ +A++ M++RYT G D+VD ++ T+F TLKR+V+++H+LQ++V S+EW Sbjct: 372 KSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNLQALVTSQEW 431 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 + YSK + D +S+Q+FWSSC I+RLT PLL++ RI S PAMGY++AG+Y 
Sbjct: 432 ADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPAMGYIYAGIY 491 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 RAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKFFYS++GD H I S Sbjct: 492 RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHSQIVS 551 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLV D ++ DKI+KE Y AGDFGRKMA+RARD LLP+EWW TYGG CPN Sbjct: 552 GMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 611 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 L+RLAIRILSQT ++ K +++P E + N++ Sbjct: 612 LSRLAIRILSQTSSVMSCKRNQIPFEQIVNTRNYI 646 >ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036894|gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 869 Score = 451 bits (1159), Expect = e-124 Identities = 207/395 (52%), Positives = 291/395 (73%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 GRTGCSILVD+ + G+ ++F AYCPEG +FL D LY+++K++V+EV Sbjct: 365 GRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSADFLYDMIKQVVDEV 424 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+ VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+D + V++QA Sbjct: 425 GVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILEDFGNLEWISAVIEQA 484 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 +S++R++Y+ +A++ M++RYT G D+VD ++ T+F TLKR+V+++H+LQ++V S+EW Sbjct: 485 KSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVDLKHNLQALVTSQEW 544 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 + YSK + D +S+Q+FWSSC I+RLT PLL++ RI S PAMGY++AG+Y Sbjct: 545 ADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASSEMRPAMGYIYAGIY 604 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 RAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKFFYS++GD H I S Sbjct: 605 RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHSQIVS 664 Query: 287 
LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLV D ++ DKI+KE Y AGDFGRKMA+RARD LLP+EWW TYGG CPN Sbjct: 665 GMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 724 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 L+RLAIRILSQT ++ K +++P E + N++ Sbjct: 725 LSRLAIRILSQTSSVMSCKRNQIPFEQIVNTRNYI 759 >ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] gi|550335284|gb|ERP58729.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] Length = 847 Score = 450 bits (1158), Expect = e-124 Identities = 217/394 (55%), Positives = 284/394 (72%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 +TGCS+LV+E S G T +NF YC +GT+FL D LYEL+K +VEEVG Sbjct: 358 KTGCSLLVEECNSESGVTTLNFLVYCSKGTVFLKSVDASNLIHSTDGLYELLKLMVEEVG 417 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 N+LQV+T E+ Y+ AGK+L DT+PS++W PCA CIDL+L+DI + + VL+QA+ Sbjct: 418 AGNILQVITNGEEHYIAAGKKLMDTFPSLYWAPCAARCIDLILEDIGKLDWINTVLEQAK 477 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 S++R++Y+N+AV+N+MR++T G D+V G TRS T+F LKR+ N + +LQ+MV S+EW Sbjct: 478 SVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATNFTALKRMANFKLNLQTMVTSQEWM 537 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 + YSK A+ D I+N+SFWSSC IIRLT PLL++ IV S K AMGYVF+G+YR Sbjct: 538 DCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPLLQVLVIVSSEKRAAMGYVFSGIYR 597 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IKKEL +E+Y+ YW+IID RWEQ + PLHAAGF+ NPKFFYS+EGD H+ I S Sbjct: 598 AKETIKKELVKREDYMVYWNIIDHRWEQQWQTPLHAAGFFFNPKFFYSIEGDMHNKILSR 657 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 + DCIERLVPD +V DKI+KE Y G G+K+AIRAR T+LPT+WW YGG CPNL Sbjct: 658 MFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKLAIRARGTMLPTDWWSMYGGSCPNL 717 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 ARLAIRILSQTC I + +P E +H+ N+L Sbjct: 718 ARLAIRILSQTCSAIGCSHNHIPFEKVHRTRNFL 751 
>ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max] gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max] Length = 758 Score = 448 bits (1152), Expect = e-123 Identities = 210/395 (53%), Positives = 289/395 (73%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 GRTGCSILVD+ T+ + ++F AYCPEG +FL D LY+L+K++VEE+ Sbjct: 253 GRTGCSILVDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTSPDFLYDLIKQVVEEI 312 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+ V+QV+T+ E++Y IAG+RL DT+P+++W+P A HCIDL+L+D + V++QA Sbjct: 313 GVGKVVQVITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILEDFGNLEWISAVIEQA 372 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 +S++R++Y+ +A++NM++RYT G D+VD +R T+F TLKR+V+++H+LQ++V S+EW Sbjct: 373 KSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMVDLKHNLQALVTSQEW 432 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 + YSK + D +SNQ+FWSSC I+ LT PLL++ RI S P MGYV+AG+Y Sbjct: 433 ADCPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAGSEMRPGMGYVYAGMY 492 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 R KEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKFFYS++GD I S Sbjct: 493 RVKEAIKKALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPKFFYSIQGDILGQIVS 552 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLVPD ++ DKI+KE Y AGDFGRKMA+RARD LLP+EWW TYGG CPN Sbjct: 553 GMFDCIERLVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 612 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 L+RLAIRILSQT ++ K ++VP E + N++ Sbjct: 613 LSRLAIRILSQTSSVMSCKRNQVPFEQIINTRNYI 647 >ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis] gi|223539752|gb|EEF41333.1| DNA binding protein, putative [Ricinus communis] Length = 854 Score = 444 bits (1143), Expect = e-122 Identities = 211/394 (53%), Positives = 281/394 (71%) Frame = -3 Query: 1184 
RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGCS+LV+E S G T +NF C +GT+FL D LY L+K++VEEVG Sbjct: 361 RTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVEASHIIYSPDGLYVLLKQVVEEVG 420 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 NVLQV+T + Y +AGKRL + +PS+FW PCA HC+DL+L+D A+ + V++QA+ Sbjct: 421 ASNVLQVITNGNEHYTVAGKRLMEAFPSLFWAPCAVHCLDLILEDFAKLEWIDAVIEQAK 480 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 S++R++Y+++AV+N+MR++T+G D+V G TRS T+F L+R+ + + +LQ+M+ S+EW Sbjct: 481 SVTRFVYNHSAVLNLMRKFTYGKDIVQQGLTRSATNFTMLQRMADFKLNLQTMITSQEWM 540 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 + YSK A+ D ISN+SFWSSC IIRLT PL+R+ I + AMGY+FAG+YR Sbjct: 541 DCPYSKQHGGLAMLDIISNRSFWSSCILIIRLTSPLIRVLGIAGGKRKAAMGYIFAGIYR 600 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 AKE IK+EL +E+Y+ YW+IID RW+Q + PLH AGF+LNPKFFYS+EGD H+ I S Sbjct: 601 AKETIKRELVKREDYMVYWNIIDHRWDQRRHPPLHVAGFFLNPKFFYSIEGDVHNEILSR 660 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 V DCIERLVPD++V DKI KE Y GD GRKMAIR+R TLLP EWW TYGG CPNL Sbjct: 661 VFDCIERLVPDIEVQDKIAKELNIYKNAVGDLGRKMAIRSRGTLLPAEWWSTYGGGCPNL 720 Query: 104 ARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 ARLA+RILSQTC I + + +P E +H N L Sbjct: 721 ARLALRILSQTCSSIGCRSNHIPFEKVHATRNCL 754 >gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. 
melo] Length = 752 Score = 441 bits (1133), Expect = e-121 Identities = 216/394 (54%), Positives = 284/394 (72%), Gaps = 1/394 (0%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 G TGCS++VD+ + G+T +NF YCP+GT+FL D+LYEL+K++VE+V Sbjct: 252 GMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQV 311 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+++V+QV+T E+ + IAG++L+DTYP+++WTPCA C+DL+L DI V V++QA Sbjct: 312 GVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILADIGNIEDVNTVIEQA 371 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 RSI+R++Y+N+ V+NM+R+ TFG D+V+ TRS T+F TL R+V+++ LQ+MV S+EW Sbjct: 372 RSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEW 431 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 +S YSK + D IS++SFWSSC SIIRLT+PLLR+ RIV S K PAMGYV+A +Y Sbjct: 432 MDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLTNPLLRVLRIVGSGKRPAMGYVYAAMY 491 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 AK AIK EL ++ Y+ YW+IID RWE RHPL AAGFYLNPK+FYS+EGD H I S Sbjct: 492 NAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLCAAGFYLNPKYFYSIEGDMHGEILS 551 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYG-GECP 111 + DCIERLV D V DKI+KE SY +GDF RK AIRAR TLLP EWW T G G CP Sbjct: 552 GMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCP 611 Query: 110 NLARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9 NL RLA RILSQTC + K ++V + LH N Sbjct: 612 NLTRLATRILSQTCSSVGFKQNQVFFDKLHDTRN 645 >ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus] Length = 752 Score = 437 bits (1123), Expect = e-120 Identities = 214/394 (54%), Positives = 282/394 (71%), Gaps = 1/394 (0%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 G TGCS++VD+ + G+T +NF YCP+GT+FL D+LYEL+K++VE+V Sbjct: 252 GITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIMDSPDLLYELLKKVVEQV 311 Query: 1007 
GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+++V+QV+T E+ + IAG++L+DTYP+++WTPCA C+DL+L DI V V++QA Sbjct: 312 GVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLILGDIGNIEGVNTVIEQA 371 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 RSI+R++Y+N+ V+NM+R+ TFG D+V+ TRS T+F TL R+V+++ LQ+MV S+EW Sbjct: 372 RSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNRMVDLKRCLQNMVTSQEW 431 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 +S YSK + D IS++SFWSSC SII LT+PLLR+ RIV S K PAMGYV+A +Y Sbjct: 432 MDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRIVGSGKRPAMGYVYAAMY 491 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 AK AIK EL ++ Y+ YW+IID RWE RHPL+AAGFYLNPK+FYS+EGD H I S Sbjct: 492 NAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLNPKYFYSIEGDMHGEILS 551 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYG-GECP 111 + DCIERLV D V DKI+KE SY +GDF RK AIRAR TLLP EWW T G G CP Sbjct: 552 GMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARGTLLPAEWWSTCGEGGCP 611 Query: 110 NLARLAIRILSQTCCLIQHKLDKVPLEHLHKRTN 9 NL RLA RILSQTC + K + + LH N Sbjct: 612 NLTRLATRILSQTCSSVGFKQNDALFDKLHDTRN 645 >ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao] gi|508701288|gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao] Length = 750 Score = 426 bits (1096), Expect = e-117 Identities = 199/395 (50%), Positives = 280/395 (70%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 G+TGCSILV++ + G+T ++F YCP+ T+FL D L EL+K++VEEV Sbjct: 253 GKTGCSILVEQWSPKSGRTLLSFLVYCPQATVFLKSVDASRVIFSADHLNELLKQVVEEV 312 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G+ NV+QV+T E++Y +AGKRL +++PS++W PC HC+D+ML+D A + ++QA Sbjct: 313 GVENVVQVITNCEEQYFLAGKRLMESFPSLYWAPCLVHCVDMMLEDFANLEWISETIEQA 372 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 +S++R++Y+++ V+NMMRR+TF D+V+ TR ++F TLKR+ +++ LQ+MVNS++W Sbjct: 373 
KSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRMADLKLKLQAMVNSQDW 432 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 +E Y+K + D + N+SFW+SC I+RL PLL++ IV S K MGYV+AG+Y Sbjct: 433 SECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIVGSKKRSTMGYVYAGIY 492 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 RAKE IKKEL K++Y+ YW+IID RWEQ + PL+AA F+LNPKFFYS+EG+ H+ I S Sbjct: 493 RAKETIKKELVKKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNPKFFYSIEGNIHNDILS 552 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLVPD V D+I++E Y GD GR MA+RARD LLP EWW YGG CPN Sbjct: 553 SMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDNLLPGEWWSMYGGGCPN 612 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 L LAIRILSQTC I K +K+ +E +H N+L Sbjct: 613 LQHLAIRILSQTCSSIGSKPNKISIEEIHDTRNFL 647 >ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine max] Length = 729 Score = 418 bits (1075), Expect = e-114 Identities = 205/395 (51%), Positives = 279/395 (70%) Frame = -3 Query: 1187 GRTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEV 1008 GRTGCSILVD+ T T +F LY+L+K++VEEV Sbjct: 254 GRTGCSILVDQWT-----TETDF-------------------------LYDLIKQVVEEV 283 Query: 1007 GLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQA 828 G V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L+D + V++QA Sbjct: 284 GAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLILEDFGNLEWISAVIEQA 343 Query: 827 RSISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEW 648 RS++R++Y+ +A++NM++RYT G D+VD + T+F TLKR+V+++H+LQ++V S+EW Sbjct: 344 RSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRMVDLKHNLQALVTSQEW 403 Query: 647 TESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLY 468 +S YSK + D +SNQ+FWSSC I+ LT PLL++ RI S PAMGYV+AG+Y Sbjct: 404 ADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIASSEMRPAMGYVYAGMY 463 Query: 467 RAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQS 288 RAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKFFYS++GD H I S Sbjct: 464 
RAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKFFYSIQGDIHGQIVS 523 Query: 287 LVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPN 108 + DCIERLVPD ++ DKI+KE Y +GDFGRKMA+RARD LLP+EWW TYGG CPN Sbjct: 524 GMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDNLLPSEWWSTYGGGCPN 583 Query: 107 LARLAIRILSQTCCLIQHKLDKVPLEHLHKRTNWL 3 L+RLAIRILSQT ++ K +++P E + N++ Sbjct: 584 LSRLAIRILSQTSSVMSCKRNQIPFEQIINTRNYI 618 >ref|NP_188861.2| hAT dimerization domain-containing protein [Arabidopsis thaliana] gi|79313325|ref|NP_001030742.1| hAT dimerization domain-containing protein [Arabidopsis thaliana] gi|11994740|dbj|BAB03069.1| transposase-like protein [Arabidopsis thaliana] gi|28393360|gb|AAO42104.1| unknown protein [Arabidopsis thaliana] gi|28827622|gb|AAO50655.1| unknown protein [Arabidopsis thaliana] gi|332643084|gb|AEE76605.1| hAT dimerization domain-containing protein [Arabidopsis thaliana] gi|332643085|gb|AEE76606.1| hAT dimerization domain-containing protein [Arabidopsis thaliana] Length = 761 Score = 387 bits (995), Expect = e-105 Identities = 186/372 (50%), Positives = 253/372 (68%) Frame = -3 Query: 1184 RTGCSILVDESTSGKGKTFVNFFAYCPEGTIFLXXXXXXXXXXXXDVLYELMKEIVEEVG 1005 RTGCS+LV E S +G + F YCPE +FL D LYEL+KE+VEE+G Sbjct: 260 RTGCSVLVQELNSNEGPLILKFLVYCPEKVVFLKSVDASEILDSEDKLYELLKEVVEEIG 319 Query: 1004 LRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAEFPTVKMVLDQAR 825 NV+QV+T ED Y AGK+L D YPS++W PCA HCID ML++ + ++ +++QAR Sbjct: 320 DTNVVQVITKCEDHYAAAGKKLMDVYPSLYWVPCAAHCIDKMLEEFGKMDWIREIIEQAR 379 Query: 824 SISRYIYSNTAVINMMRRYTFGVDLVDVGTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWT 645 +++R IY+++ V+N+MR++TFG D+V T S T+F T+ RI +++ LQ+MV S EW Sbjct: 380 TVTRIIYNHSGVLNLMRKFTFGNDIVQPVCTSSATNFTTMGRIADLKPYLQAMVTSSEWN 439 Query: 644 ESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYR 465 + SYSK+ A+ ++I+++ FW + +T P+LR+ RIV S + PAMGYV+A +YR Sbjct: 440 DCSYSKEAGGLAMTETINDEDFWKALTLANHITAPILRVLRIVCSERKPAMGYVYAAMYR 499 Query: 464 AKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSL 285 
AKEAIK L +EEY+ YW IID W Q PL+AAGFYLNPKFFYS++ + I Sbjct: 500 AKEAIKTNLAHREEYIVYWKIIDRWWLQ---QPLYAAGFYLNPKFFYSIDEEMRSEIHLA 556 Query: 284 VNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTEWWLTYGGECPNL 105 V DCIE+LVPD+ + D ++K+ SY G FGR +AIRARDT+LP EWW TYG C NL Sbjct: 557 VVDCIEKLVPDVNIQDIVIKDINSYKNAVGIFGRNLAIRARDTMLPAEWWSTYGESCLNL 616 Query: 104 ARLAIRILSQTC 69 +R AIRILSQTC Sbjct: 617 SRFAIRILSQTC 628