BLASTX nr result
ID: Mentha27_contig00020362
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00020362 (2356 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)   Value

ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591...   910   0.0
ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254...   905   0.0
gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea]       841   0.0
ref|XP_002524204.1| DNA binding protein, putative [Ricinus commu...   789   0.0
ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626...   777   0.0
ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302...   773   0.0
ref|XP_007009265.1| HAT and BED zinc finger domain-containing pr...   754   0.0
ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817...   725   0.0
ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phas...   719   0.0
ref|XP_007163430.1| hypothetical protein PHAVU_001G234100g [Phas...   719   0.0
ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806...   716   0.0
gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. melo]         716   0.0
ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226...   714   0.0
ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817...   691   0.0
ref|XP_002521049.1| DNA binding protein, putative [Ricinus commu...   686   0.0
ref|XP_007049027.1| HAT transposon superfamily, putative [Theobr...   676   0.0
ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Popu...   672   0.0
gb|AAM98154.1| putative protein [Arabidopsis thaliana]                621   e-175
ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thal... 
621 e-175 ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310... 611 e-172 >ref|XP_006346820.1| PREDICTED: uncharacterized protein LOC102591442 [Solanum tuberosum] Length = 755 Score = 910 bits (2353), Expect = 0.0 Identities = 449/760 (59%), Positives = 566/760 (74%), Gaps = 2/760 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E V V SQKHDPAWKHC+MFK G+RVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA+ Sbjct: 3 SNLEPVPVTSQKHDPAWKHCEMFKNGERVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAS 62 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAH--NSCGLNSD 400 TCLRVQ DVR+ M +SLNGV ++KRKKQKLAEE++ + N G + +I A ++CGL++ Sbjct: 63 TCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTY-NAGTATSDIAAEFTDTCGLDTQ 121 Query: 401 MVLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSFK 580 + LLP+P+ IEH + A +SNN A+ ++P K Sbjct: 122 VDLLPMPQAIEHTSNLFLNRDQGPNNIGARKKKSRIRKGAS---SSNNNAM-LLPINQSK 177 Query: 581 KANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVI 760 + N+ V+MAV RF D +P DA NS YFQPMID IASQG + PSYH+LR+ +LK + Sbjct: 178 RVNNHVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPQVSAPSYHELRSWVLKASV 237 Query: 761 HEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXX 940 EVR D+DQC + W R+GCS+LVDE +GKGKT +NF YC EGT+FL Sbjct: 238 QEVRNDIDQCSSTWARSGCSVLVDEWITGKGKTLLNFLVYCPEGTMFLRSVDASTLINST 297 Query: 941 XVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQD 1120 LYEL+KE+VEEVG+RNVLQVVT+ E+RY+IAGKRLTD YP++FWTPCA H IDLML+D Sbjct: 298 DYLYELLKEVVEEVGVRNVLQVVTSNEERYIIAGKRLTDAYPTLFWTPCAAHSIDLMLED 357 Query: 1121 IAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVN 1300 + + + +++QA+SISR+IY+N +++MMR+FT GVDLVD+G TRS TDF+TLKR+VN Sbjct: 358 LKKLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMVN 417 Query: 1301 IRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRS 1480 I+H+LQSMV S EW ES YSK E FA+ D I NQSFWS+C+ + RLTDP+LRL R+V S Sbjct: 418 IKHNLQSMVTSVEWAESPYSKKPEGFALLDYIGNQSFWSTCSLVCRLTDPILRLLRMVSS 477 Query: 1481 
LKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKF 1660 + PAM YV+AG+YRAKE IKKEL K++Y YW+IID RWE LQRHPLHAAGFYLNPKF Sbjct: 478 EERPAMAYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKF 537 Query: 1661 FYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLL 1840 FY+ E D H HI+SLV DCIE+LVPD K+ DKI+KE SY AGDFGRKMA+RARDTL Sbjct: 538 FYTTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLF 597 Query: 1841 PTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYV 2020 P EWW TYGGGCPNLAR AIRILSQT LI++K +VPLE +H+ N +EH+RL+D+ +V Sbjct: 598 PAEWWSTYGGGCPNLARLAIRILSQTSSLIRSKPGRVPLEEMHETKNCIEHQRLNDLAFV 657 Query: 2021 QYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAV 2200 QYN+ L+ + + + +D+ISYE++++V WV+ ++ SED WM VDPPLGS Sbjct: 658 QYNLWLRQ--RKNLEPDCMDSISYEKMEVVHNWVSRREQISEDLESSDWMTVDPPLGSIA 715 Query: 2201 HLGPQIDDVQALGAGFDDFDIFEAAKDSEEEIADKNIGND 2320 LGP IDD++ALGAGFDDF+IF KDSEEEI ++N N+ Sbjct: 716 PLGPLIDDIEALGAGFDDFEIFGGPKDSEEEIGEENTVNE 755 >ref|XP_004240774.1| PREDICTED: uncharacterized protein LOC101254391 [Solanum lycopersicum] Length = 748 Score = 905 bits (2340), Expect = 0.0 Identities = 446/758 (58%), Positives = 560/758 (73%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E VAV SQKHDPAWKHC+MFK GDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA+ Sbjct: 3 SNLEPVAVTSQKHDPAWKHCEMFKNGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAS 62 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDMV 406 TCLRVQ DVR+ M +SLNGV ++KRKKQKLAEE++ ++ S + ++CGLN+ + Sbjct: 63 TCLRVQPDVRLLMQDSLNGVVMKKRKKQKLAEEITTYNAIDTSDIAAEFTDTCGLNTQVD 122 Query: 407 LLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSFKKA 586 LLP+ + IEH R+ G + +SNN + K+ Sbjct: 123 LLPMSQAIEHTSSLFLN--RDQGPNNRKKKSRIRKGAS----SSNNLPII----NQSKRV 172 Query: 587 NSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIHE 766 N+ V+MAV RF D +P DA NS YFQPMID IASQG PSYHDLR+ +LK+ + E Sbjct: 173 NNQVHMAVARFLLDARVPLDAVNSVYFQPMIDVIASQGPPVSAPSYHDLRSWVLKSSVQE 232 Query: 767 
VRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXXV 946 VR D+DQC + W RTGCS+L+DE +GKGK +NF YC +GT+FL Sbjct: 233 VRTDIDQCSSTWARTGCSVLIDELITGKGKILLNFLVYCPQGTMFLRSVDASTLINSTDY 292 Query: 947 LYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIA 1126 LYEL+KE+V+E+G+RNVLQVVT+ E+RYVIAGKRLTD YP++FWTPCA H IDLML+D Sbjct: 293 LYELLKEVVDEIGVRNVLQVVTSNEERYVIAGKRLTDAYPTLFWTPCAAHSIDLMLEDFN 352 Query: 1127 EFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVNIR 1306 + + +++QA+SISR+IY+N +++MMR+FT GVDLVD+G TRS TDF+TLKR+ NI+ Sbjct: 353 KLEWIDTIMEQAKSISRFIYNNNILLSMMRKFTLGVDLVDLGVTRSATDFLTLKRMQNIK 412 Query: 1307 HSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLK 1486 H+LQSMV S EW ES YSK E FA+ D ISNQSFWS+C+ I RLTDP+LRL R+V S + Sbjct: 413 HNLQSMVTSVEWAESPYSKKPEGFALLDYISNQSFWSTCSLICRLTDPILRLLRMVSSEE 472 Query: 1487 IPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFY 1666 PAM YV+AG+YRAKE IKKEL K++Y YW+IID RWE LQRHPLHAAGFYLNPKFFY Sbjct: 473 RPAMPYVYAGVYRAKETIKKELVNKKDYSVYWNIIDHRWESLQRHPLHAAGFYLNPKFFY 532 Query: 1667 SLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPT 1846 + E D H HI+SLV DCIE+LVPD K+ DKI+KE SY AGDFGRKMA+RARDTL P Sbjct: 533 TTEEDVHLHIRSLVYDCIEKLVPDPKIQDKIVKETTSYLNSAGDFGRKMAVRARDTLFPA 592 Query: 1847 EWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYVQY 2026 EWW TYGGGCPNLAR AIRILSQT LI++K ++P+E +H+ TN +EH+RL+D+ +VQY Sbjct: 593 EWWSTYGGGCPNLARLAIRILSQTSSLIRSKPGRIPIEEMHETTNCIEHQRLNDLAFVQY 652 Query: 2027 NMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAVHL 2206 NM L+ +++ + +D+ISYE+++LV WV+ ++ SED WM VDPPLGS L Sbjct: 653 NMWLRQ--RKNQEPDCMDSISYEKMELVHNWVSRREQMSEDLESSDWMAVDPPLGSIAPL 710 Query: 2207 GPQIDDVQALGAGFDDFDIFEAAKDSEEEIADKNIGND 2320 GP IDD++ALG GFDDF+IF KDSEEEI ++N N+ Sbjct: 711 GPLIDDIEALGTGFDDFEIFGGPKDSEEEIGEENTVNE 748 >gb|EPS63146.1| hypothetical protein M569_11643 [Genlisea aurea] Length = 724 Score = 841 bits (2172), Expect = 0.0 Identities = 425/734 (57%), Positives = 535/734 (72%), Gaps = 2/734 (0%) 
Frame = +2 Query: 50 NMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAAT 229 +MELV + SQKHDPAWKHCQMFK +++ LKCIYCGKIFKGGGIHRIKEHLAGQKGNA+T Sbjct: 4 HMELVPMTSQKHDPAWKHCQMFKTEEKIHLKCIYCGKIFKGGGIHRIKEHLAGQKGNAST 63 Query: 230 CLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDMVL 409 CLRV +V+ QML+SLNGVAV+K+KK KL E++SG+ NP + + H+S LNS+ Sbjct: 64 CLRVLPEVKQQMLDSLNGVAVKKKKKLKLTEQLSGYDNPAD---RVNEHSS--LNSEAFF 118 Query: 410 LPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSFKKAN 589 LP PE++EH E+G + + + ++A++ S + + Sbjct: 119 LPGPEIVEHDDDAYEEG--EEGTTSKRGPRQKRP----QIRKNPSESMALMSLPSVQPCS 172 Query: 590 SVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIHEV 769 V+MAVGRFF DVGLPA+AANS YFQPM++AIASQ A +GPSY DLR+ ILKN++HE Sbjct: 173 KKVHMAVGRFFVDVGLPAEAANSAYFQPMVEAIASQEAGVIGPSYQDLRSWILKNLVHET 232 Query: 770 RYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXXVL 949 RYDVDQ AWERTGC++LVD+ SGKG+TFVNFF Y SE TIF L Sbjct: 233 RYDVDQYANAWERTGCTVLVDDWNSGKGETFVNFFVYNSEATIFYRSANVSHGIVSADDL 292 Query: 950 YELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAE 1129 YEL+KE VE++G++NVLQV+T+ ED+Y AGKRL TYPS+FW+PCAG C+DLMLQD+ Sbjct: 293 YELLKETVEQIGVKNVLQVITSCEDQYAFAGKRLATTYPSVFWSPCAGLCVDLMLQDMEH 352 Query: 1130 FPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVNIRH 1309 P VK+ L+QA+SISRYIYSN V+NM+RR TFG+DL+D G T S T+FMTLKR++++RH Sbjct: 353 LPMVKVTLEQAKSISRYIYSNGFVLNMLRRHTFGLDLLDEGITPSSTNFMTLKRMLSMRH 412 Query: 1310 SLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSLKI 1489 LQSMV SE+W +S +S+ E FA+ D++++QSFWS+CASI L DPLLRL RI+ S K Sbjct: 413 HLQSMVTSEDWIQSPHSQKPEGFALLDTMTSQSFWSACASITNLIDPLLRLLRIISSGKK 472 Query: 1490 PAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFFYS 1669 PAMGYV+AGLYRAKEAIKK E+YL Y +IID RWEQLQ+HPLH AGFYLNPKFFYS Sbjct: 473 PAMGYVYAGLYRAKEAIKKHF-VSEDYLVYLNIIDRRWEQLQQHPLHGAGFYLNPKFFYS 531 Query: 1670 LEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLPTE 1849 LEGD +S+V DCIERLVPD +V DKIMKE YH G GDFGRKMAIRARDTLLPTE Sbjct: 532 
LEGDALLRSRSMVYDCIERLVPDPEVQDKIMKEMTYYHGGVGDFGRKMAIRARDTLLPTE 591 Query: 1850 WWLTYGGGCPNLARFAIRILSQTCCLIQNK-LDKVPLEHLHKRTNWLEHRRLSDIVYVQY 2026 WW+ YGG CPNL+R A+++LSQTC IQ K LDK+PLE +H+ N LE +RL+ +V+V Y Sbjct: 592 WWIAYGGSCPNLSRLAVQVLSQTCGFIQLKLLDKLPLETMHRIKNPLERQRLNHLVFVHY 651 Query: 2027 NMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKG-WMDVDPPLGSAVH 2203 NM +K + S R + D I+YE D+ ++W+ + S + + WM VDP LG V Sbjct: 652 NMRVKQLVSAKRTRRVSDPIAYEHDDMFDDWIVGNEALSVGSSGEAEWMTVDPALG--VD 709 Query: 2204 LGPQIDDVQALGAG 2245 P++DD +G G Sbjct: 710 AIPEVDDADDMGGG 723 >ref|XP_002524204.1| DNA binding protein, putative [Ricinus communis] gi|223536481|gb|EEF38128.1| DNA binding protein, putative [Ricinus communis] Length = 753 Score = 789 bits (2038), Expect = 0.0 Identities = 389/758 (51%), Positives = 530/758 (69%), Gaps = 6/758 (0%) Frame = +2 Query: 38 MASNNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKG 217 M S+++E + + SQKHDPAWKHCQMFK G+RVQLKC+YCGKIFKGGGIHRIKEHLAGQKG Sbjct: 1 MDSDDLEPIPITSQKHDPAWKHCQMFKNGERVQLKCVYCGKIFKGGGIHRIKEHLAGQKG 60 Query: 218 NAATCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNP--GNSGVEIVAHNSCGL 391 NA+TCL+V DV++ M +SL+GV V+KRKKQK+AEE++ NP G +E+ A++ + Sbjct: 61 NASTCLQVPTDVKLIMQQSLDGVVVKKRKKQKIAEEITNL-NPVIGGGEIEVFANDQIEV 119 Query: 392 NSDMVLLPVPEMIEHXXXXXXXXXR----EDGMXXXXXXXXXXXXXALDVVNSNNTALAV 559 ++ M L+ V +IE + G A +V+ N+ +A+ Sbjct: 120 STGMELIGVSNVIEPSSSLLISGQEGKANKGGERRKRGRSKGSGANANAIVSMNSNRMAL 179 Query: 560 IPAGSFKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRN 739 K+ N V+MA+GRF +D+G P DA NS YFQPM+DAIAS G + PS HDLR Sbjct: 180 ----GAKRVNDHVHMAIGRFLYDIGAPLDAVNSVYFQPMVDAIASGGLDVGMPSCHDLRG 235 Query: 740 SILKNVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXX 919 ILKN + EV+ +VD+ +A W RTGCS+LVD+ + G+T ++F YCSEG +FL Sbjct: 236 WILKNSVEEVKTEVDKHMATWARTGCSVLVDQWNTLMGRTLLSFLVYCSEGVVFLKSVDA 295 Query: 920 XXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHC 1099 LYEL+K++VEEVG+R+VLQV+T++E++Y++ G+RLTDT+P+++ PCA HC Sbjct: 296 
SDIINSSDALYELIKKVVEEVGVRHVLQVITSMEEQYIVVGRRLTDTFPTLYRAPCAAHC 355 Query: 1100 IDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFM 1279 IDL+L+D A+ + V+ QARSI+R++Y+++ V+NM++R+TFG ++V G T T+F Sbjct: 356 IDLILEDFAKLEWISTVILQARSITRFVYNHSVVLNMVKRYTFGSEIVATGLTHFATNFE 415 Query: 1280 TLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLR 1459 TLKR+V+++H+LQ+MV S+EW + YSK + D +SNQSFWSSC I LT+PLLR Sbjct: 416 TLKRMVDLKHTLQTMVTSQEWMDCPYSKKPRGLEMLDLLSNQSFWSSCVLITNLTNPLLR 475 Query: 1460 LFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAG 1639 L RIV S K P MGYV+AG+YRAKEAIKKEL +++Y+ YW+IID WEQ PLHAAG Sbjct: 476 LLRIVSSKKRPPMGYVYAGIYRAKEAIKKELVKRKDYMVYWNIIDHWWEQQSNLPLHAAG 535 Query: 1640 FYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAI 1819 F+LNPK YS+EGD H+ I S + DCIE+LVPD+ V DKI KE SY +GDFGRKMA+ Sbjct: 536 FFLNPKVLYSIEGDLHNEILSGMFDCIEKLVPDVTVQDKITKEINSYKNASGDFGRKMAV 595 Query: 1820 RARDTLLPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRR 1999 RAR+TLLP EWW TYGG CPNLAR AIR+LSQ C KL+ + LE +H N LE +R Sbjct: 596 RARETLLPAEWWSTYGGSCPNLARLAIRVLSQPCSSFGYKLNHISLEQIHDTKNCLERQR 655 Query: 2000 LSDIVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVD 2179 LSD+V+VQYN+ LK M +Q+ VD +S++ I ++E+W+ +KD +ED A WM +D Sbjct: 656 LSDLVFVQYNLRLKQMVGKSEEQDSVDPLSFDCISILEDWIKEKDISTEDYANSDWMALD 715 Query: 2180 PPLGSAVHLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 PP +V+ D+V LGAGF D++IF KD+E++ Sbjct: 716 PP---SVNTRQPHDEVDELGAGFHDYEIFNRVKDTEDD 750 >ref|XP_006486394.1| PREDICTED: uncharacterized protein LOC102626522 [Citrus sinensis] Length = 745 Score = 777 bits (2007), Expect = 0.0 Identities = 382/751 (50%), Positives = 521/751 (69%), Gaps = 1/751 (0%) Frame = +2 Query: 44 SNNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA 223 ++ +E + ++SQKHDPAWKHCQMFK GDRVQLKC+YC K+F+GGGIHRIKEHLA QKGNA Sbjct: 2 ASGLEPIPISSQKHDPAWKHCQMFKNGDRVQLKCLYCFKLFRGGGIHRIKEHLACQKGNA 61 Query: 224 ATCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDM 403 +TC RV DVR+ M +SL+GV 
V+K+KKQK+AEE++ +NP E+ A G D+ Sbjct: 62 STCSRVPLDVRLAMQQSLDGVVVKKKKKQKIAEEITN-NNPTFG--EVYAFTDQG---DV 115 Query: 404 VL-LPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSFK 580 LP+ + D + VN+ A+ + + Sbjct: 116 TPGLPLLDDSNTPEACSNLVVSRDVISNTTGDKRKRWRGKNSSVNAYTGAM-ISASLDAT 174 Query: 581 KANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVI 760 + N+ + MAVGRF +D+G P DA NS YFQPM+DAIAS G EA PSYHD+R ILKN + Sbjct: 175 RGNNPIFMAVGRFLYDIGAPLDAVNSEYFQPMVDAIASGGPEAAMPSYHDIRGWILKNSV 234 Query: 761 HEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXX 940 EV+ DVD+ W +TGCSILVD+ + G+T + F AYC EGT+FL Sbjct: 235 EEVKNDVDRYTTTWGKTGCSILVDQWNTEAGRTLLCFLAYCPEGTVFLKSVDASGIMNSS 294 Query: 941 XVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQD 1120 LYEL+K++VEEVG+R+VLQV+T+ E++++ AG+RLTDT+P+++WTPCA C+DL+L+D Sbjct: 295 DALYELLKQVVEEVGVRHVLQVITSSEEQFIAAGRRLTDTFPTLYWTPCAARCLDLILED 354 Query: 1121 IAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVN 1300 A+ + +++QAR+++R++Y+++ V+NM+RR+TFG D+V+ G TRS T+F TL+R+++ Sbjct: 355 FAKLEWINAIIEQARAVTRFVYNHSVVLNMLRRYTFGNDIVEPGITRSATNFTTLRRMIS 414 Query: 1301 IRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRS 1480 ++ +LQ+MV S+EW + YSK + D +SNQSFWSSC I+ LT+PLLRL RIV S Sbjct: 415 LKPNLQAMVTSQEWMDCPYSKKPGGLEMLDIVSNQSFWSSCGLIVCLTNPLLRLLRIVGS 474 Query: 1481 LKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKF 1660 + P++GYV+AG+YRAK+A+KKEL ++EY+ YW+IID WEQL PLHAAGF+LNPKF Sbjct: 475 ERRPSIGYVYAGMYRAKDALKKELIKRDEYMVYWNIIDHWWEQLWHLPLHAAGFFLNPKF 534 Query: 1661 FYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLL 1840 FYS++GD H+ I S + DCIERLVPD KV DKI KE Y GDFGRKMAIRARDTLL Sbjct: 535 FYSIKGDIHNEIVSRMFDCIERLVPDTKVQDKISKEINLYKDAVGDFGRKMAIRARDTLL 594 Query: 1841 PTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYV 2020 P EWW TYGG CPNLAR A RI SQTC + + +++ E ++ N LE +RL D+V+V Sbjct: 595 PAEWWSTYGGSCPNLARLATRIQSQTCSSLADTRNQIHFERIYDTRNCLERQRLIDLVFV 654 Query: 2021 
QYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAV 2200 QYN+ LKHM S +QQ+ +D +S++ +EEW+T KD C ED W V+PP GS + Sbjct: 655 QYNLRLKHMVSKKKQQDSMDPMSFDSFSTLEEWITGKDICLEDYGSSDWKAVEPPSGSPM 714 Query: 2201 HLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 LG D+V+ L GFDD++IF K+ E+E Sbjct: 715 LLGSSDDEVEELAGGFDDYEIFTRVKEGEDE 745 >ref|XP_004307479.1| PREDICTED: uncharacterized protein LOC101302111 [Fragaria vesca subsp. vesca] Length = 754 Score = 773 bits (1996), Expect = 0.0 Identities = 391/758 (51%), Positives = 508/758 (67%), Gaps = 3/758 (0%) Frame = +2 Query: 53 MELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATC 232 ME V + SQKHDPAWKHCQMFK GDR+QLKCIYC K+F+GGGIHRIKEHLAGQKGNA+TC Sbjct: 1 MEPVPITSQKHDPAWKHCQMFKSGDRIQLKCIYCSKLFRGGGIHRIKEHLAGQKGNASTC 60 Query: 233 LRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDMVLL 412 LRV DVR M +SL+GV V+KR +QKL EE++ + P + V+ + +N+ + L+ Sbjct: 61 LRVPPDVRGLMQQSLDGVVVKKRNRQKLDEEITNITPPQDGDVDSLGGTQSDVNNAVQLV 120 Query: 413 PVP-EMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSFKKAN 589 V E I M + V N V +K N Sbjct: 121 GVSVEPISRLLVNREGVTSVRSMDRRKRGRGKSSWSSHGVHGVCNGGALVS-----RKVN 175 Query: 590 SVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIHEV 769 S V+ A+GRF FD+G P +A NS YFQPMIDAIAS G P+ HDLR+ ILKN + E Sbjct: 176 SYVHEAIGRFLFDIGAPPEAVNSAYFQPMIDAIASGGPGMEPPTCHDLRSWILKNSVEEA 235 Query: 770 RYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXXVL 949 R ++D+ A W RTGCSILVD+ + ++F Y EGT+FL L Sbjct: 236 RNNIDKHRATWGRTGCSILVDQWNTELDNVMLSFLVYSPEGTVFLESVDASAIINSSDAL 295 Query: 950 YELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDIAE 1129 Y+L++ +VE+VG+ +V+QV+T+ E+++V+AG+RL DT+P++FW PCA C+DL+L+D Sbjct: 296 YDLLRRVVEDVGVGDVVQVITSGEEQFVVAGRRLADTFPNLFWIPCAARCLDLILEDFGS 355 Query: 1130 FPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVNIRH 1309 + V++QARSI++++Y++ V+N++RR TFG D+V+ G TR T F TLKR+V+++H Sbjct: 356 LDWIHAVIEQARSITKFVYNHNVVLNLVRRSTFGNDIVEPGVTRFGTSFTTLKRLVDLKH 415 Query: 1310 
SLQSMVNSEEWTESSYSKDQEAFAVQDSISN--QSFWSSCASIIRLTDPLLRLFRIVRSL 1483 LQ MV S+EW + YSK+ + D IS+ QSFWSSC I+RLT PLLR+ R+V Sbjct: 416 CLQVMVTSQEWMDCPYSKEPGGLEISDLISDRDQSFWSSCTLIVRLTSPLLRVLRMVGCE 475 Query: 1484 KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1663 K PAMG+++AG+YRAKEAIKKEL +EEY+ YW+IID RWEQ PLHAAGFYLNPK F Sbjct: 476 KRPAMGFIYAGMYRAKEAIKKELVKREEYMVYWNIIDQRWEQHWNFPLHAAGFYLNPKIF 535 Query: 1664 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1843 YS+EGD H+ IQS + DCIER+VPD+KV DKIMKE SY AGDF RKMAIRARDTLLP Sbjct: 536 YSIEGDIHNSIQSGMYDCIERMVPDIKVQDKIMKEIISYKNAAGDFRRKMAIRARDTLLP 595 Query: 1844 TEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYVQ 2023 EWW TYGGGCPNLAR AIRILSQTC I + ++P E H N LE +RL D+V+VQ Sbjct: 596 AEWWSTYGGGCPNLARLAIRILSQTCGSIGYRQSQIPFEKAHGIRNCLERQRLRDLVFVQ 655 Query: 2024 YNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAVH 2203 YN+ L+ M + ++ +D IS++ I LVE+WVT KD CSED WM +D P S + Sbjct: 656 YNLRLRQMVDKNNGEDCMDPISFDSISLVEDWVTGKDVCSEDFEGSSWMSLDSPSASTML 715 Query: 2204 LGPQIDDVQALGAGFDDFDIFEAAKDSEEEIADKNIGN 2317 LGP DD + LG+GF D +IF KD E EI + N+ N Sbjct: 716 LGPSNDDAEDLGSGFYDGEIFSRGKDGEIEILEDNVEN 753 >ref|XP_007009265.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] gi|508726178|gb|EOY18075.1| HAT and BED zinc finger domain-containing protein, putative [Theobroma cacao] Length = 749 Score = 754 bits (1948), Expect = 0.0 Identities = 370/756 (48%), Positives = 517/756 (68%) Frame = +2 Query: 44 SNNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA 223 ++N+E + + SQKHDPAWKHCQMF+ G+RVQLKCIYCGKIF+GGGIHRIKEHLAGQKGNA Sbjct: 2 ASNLEPIPITSQKHDPAWKHCQMFRNGERVQLKCIYCGKIFRGGGIHRIKEHLAGQKGNA 61 Query: 224 ATCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDM 403 +TC V +DVR+ M ESL+GV V+KRKKQK+AEEMS +N +S ++ N N+ + Sbjct: 62 STCFHVPSDVRLLMRESLDGVEVKKRKKQKIAEEMSN-ANQVSSEIDTY-DNQVDTNTGL 119 Query: 404 VLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSFKK 583 +++ P+ ++ +G SN 
+ + G+ K+ Sbjct: 120 LMIEGPDTLQ---PSSSLLVNREGTSNVSGDRRKRGKGKSSAAESNALVVNTVGLGA-KR 175 Query: 584 ANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIH 763 N+ V++A+GRF FD+G P DA NS YFQPM+DAI S G+ + PS DL+ ILK + Sbjct: 176 VNNHVHVAIGRFLFDIGAPLDAVNSVYFQPMVDAIISGGSGVLMPSCSDLQGWILKKSVE 235 Query: 764 EVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXX 943 EV+ D D+ AAW RTGCSILV++ + G+ +NF YC EGT+FL Sbjct: 236 EVKSDNDKVTAAWVRTGCSILVNQWNTQTGRILLNFLVYCPEGTVFLKSVDASSVINSSD 295 Query: 944 VLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDI 1123 LYEL+K++VEEVG ++VLQV+T E++Y++AG+RL +T+P+++WTPCA HCI+L+L+D Sbjct: 296 ALYELLKQVVEEVGSKHVLQVITNAEEQYIVAGRRLAETFPTLYWTPCAAHCINLILEDF 355 Query: 1124 AEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVNI 1303 A+ + ++++QARSI+R++Y+++ V+NM+RR+T G D+V+ T S T+F TLK+++++ Sbjct: 356 AKLEWINVIIEQARSITRFVYNHSVVLNMVRRYTLGNDIVEPAVTCSATNFTTLKQMIDL 415 Query: 1304 RHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSL 1483 +++LQ+MV S+EW + YSK + D +SN SFWSS I +LT+PLLR+ R+V S Sbjct: 416 KNNLQAMVTSQEWMDCPYSKKPGGLEMLDLVSNPSFWSSSVLITQLTNPLLRVLRMVGSK 475 Query: 1484 KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1663 K PAMGYV+AG+YRAKE IKKEL + EY+ YW+IID WEQ HPLH AGFYLNPKFF Sbjct: 476 KRPAMGYVYAGMYRAKETIKKELVKRNEYMIYWNIIDHWWEQQWHHPLHGAGFYLNPKFF 535 Query: 1664 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1843 YS+EGD + + S + DCIE+LVPD+KV DKI KE SY GDFGRKMA+RARDTLLP Sbjct: 536 YSMEGDMPNEMLSGMLDCIEKLVPDVKVQDKISKEINSYKNTVGDFGRKMAVRARDTLLP 595 Query: 1844 TEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYVQ 2023 EWW TYGG CPNLAR AI +LSQTC + K + +P E LH+ N+LE +R D+++VQ Sbjct: 596 AEWWSTYGGSCPNLARLAIHVLSQTCSTLGLKQNSIPFEKLHETRNFLEQQRFRDLIFVQ 655 Query: 2024 YNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAVH 2203 N+ L+ +G ++Q + +S++ +E+WV D E+ W +DP + + Sbjct: 656 CNLQLRQIGCESKEQVSMQPMSFDA--TIEDWVMGNDAFLENYTHSDWTALDPLSVNTML 713 Query: 2204 LGPQIDDVQALGAGFDDFDIFEAAKDSEEEIADKNI 2311 LGP 
D+V+ LGAGFDD++IF K E+E A+ N+ Sbjct: 714 LGPSSDEVEELGAGFDDYEIFNGVK--EQENAEDNV 747 >ref|XP_003538417.1| PREDICTED: uncharacterized protein LOC100817502 isoform X1 [Glycine max] gi|571489936|ref|XP_006591345.1| PREDICTED: uncharacterized protein LOC100817502 isoform X2 [Glycine max] gi|571489939|ref|XP_006591346.1| PREDICTED: uncharacterized protein LOC100817502 isoform X3 [Glycine max] Length = 759 Score = 725 bits (1871), Expect = 0.0 Identities = 364/754 (48%), Positives = 506/754 (67%), Gaps = 5/754 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E V + SQKHDPAWKH QMFK GD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+ Sbjct: 3 SNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNAS 62 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNS---CGLNS 397 TC RV DVR+ M +SL+GV V+KR+KQ++ EE+ NP + V + +N+ +N Sbjct: 63 TCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSV-NPLTTVVNSLPNNNNRVVDVNQ 121 Query: 398 DMVLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSF 577 + + V EH V ++ +AV G F Sbjct: 122 GLQAIGV----EHNSSLVVNPGEGMSRNMERRKKMRATKNPAAVYANSEGVIAVEKNGLF 177 Query: 578 -KKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKN 754 KK ++ + MA+GRF +D+G P DA NS YFQ M+DAIAS+G P +H+LR ILKN Sbjct: 178 PKKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKN 237 Query: 755 VIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXX 934 + EV+ D+D+C W RTGCSILVD+ T+ GK ++F AYC EG +FL Sbjct: 238 SVEEVKNDIDRCKMTWGRTGCSILVDQWTTETGKILISFLAYCPEGLVFLRSLDATEIST 297 Query: 935 XXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLML 1114 LY+L+K++VEEVG V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L Sbjct: 298 SADFLYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLIL 357 Query: 1115 QDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRI 1294 +D + V++QARS++R++Y+ +A++NM++R+T G D+VD + T+F TLKR+ Sbjct: 358 EDFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRM 417 Query: 1295 VNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIV 
1474 V+++H+LQ++V S+EW +S YSK + D +SNQ+FWSSC I+ LT PLL++ RI Sbjct: 418 VDLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIA 477 Query: 1475 RSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNP 1654 S PAMGYV+AG+YRAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNP Sbjct: 478 SSEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNP 537 Query: 1655 KFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDT 1834 KFFYS++GD H I S + DCIERLVPD ++ DKI+KE Y +GDFGRKMA+RARD Sbjct: 538 KFFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDN 597 Query: 1835 LLPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIV 2014 LLP+EWW TYGGGCPNL+R AIRILSQT ++ K +++P E + N++E + L+D+V Sbjct: 598 LLPSEWWSTYGGGCPNLSRLAIRILSQTSSVMSCKRNQIPFEQIINTRNYIERQHLTDLV 657 Query: 2015 YVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKD-FCSEDPAKKGWMDVDPPLG 2191 +V N+ L+ M ++Q+ D +S++ I VEEW+ +D + ++ WM +DP Sbjct: 658 FVHCNLRLRQMFM-SKEQDFSDPLSFDNISNVEEWIRPRDLYIDDECGNSDWMALDPSSV 716 Query: 2192 SAVHLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 + + L P D+ + LG G+DD++IF KDSE+E Sbjct: 717 NTMLLRPLNDEAEDLGEGYDDYEIFSCGKDSEDE 750 >ref|XP_007163431.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036895|gb|ESW35425.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 756 Score = 719 bits (1856), Expect = 0.0 Identities = 352/751 (46%), Positives = 504/751 (67%), Gaps = 2/751 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E V + SQKHDPAWKH QM+K GD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+ Sbjct: 3 SNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGNAS 62 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNS-CGLNSDM 403 TC RV DVR+ M +SL+GV V+KR+KQK+ EE+ NP + V + +N+ +N + Sbjct: 63 TCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSV-NPLTTVVNSLPNNNQVDVNQGL 121 Query: 404 VLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSF-K 580 + V +H + ++ +AV G F K Sbjct: 122 QAIGV----DHNSSLVVNPGEGMSKNMERRKKMRASKNPAAIYANSEGVVAVEKNGLFPK 177 Query: 581 
KANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVI 760 + ++ ++MA+GRF +D+G P DA NS YF M+DAI+S+GA PS+H+LR ILKN + Sbjct: 178 RVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNSV 237 Query: 761 HEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXX 940 EV+ D+D+C W RTGCSILVD+ + G+ ++F AYC EG +FL Sbjct: 238 EEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSA 297 Query: 941 XVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQD 1120 LY+++K++V+EVG+ VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+D Sbjct: 298 DFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILED 357 Query: 1121 IAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVN 1300 + V++QA+S++R++Y+ +A++ M++R+T G D+VD ++ T+F TLKR+V+ Sbjct: 358 FGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVD 417 Query: 1301 IRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRS 1480 ++H+LQ++V S+EW + YSK + D +S+Q+FWSSC I+RLT PLL++ RI S Sbjct: 418 LKHNLQALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASS 477 Query: 1481 LKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKF 1660 PAMGY++AG+YRAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKF Sbjct: 478 EMRPAMGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKF 537 Query: 1661 FYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLL 1840 FYS++GD H I S + DCIERLV D ++ DKI+KE Y AGDFGRKMA+RARD LL Sbjct: 538 FYSIQGDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLL 597 Query: 1841 PTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYV 2020 P+EWW TYGGGCPNL+R AIRILSQT ++ K +++P E + N++E + L+D+V+V Sbjct: 598 PSEWWSTYGGGCPNLSRLAIRILSQTSSVMSCKRNQIPFEQIVNTRNYIERQHLTDLVFV 657 Query: 2021 QYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAV 2200 N+ L+ M + + Q+ D +S++ I V+EW+ +D ++ WM +DP + + Sbjct: 658 HCNLRLRQMFT-SKDQDFSDPLSFDTISYVDEWIRPRDLYIDEYGNSDWMALDPSSVNTM 716 Query: 2201 HLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 L P D+ + L GFDD +IF KDSE+E Sbjct: 717 LLRPLNDEAEELDEGFDDDEIFSCGKDSEDE 747 >ref|XP_007163430.1| 
hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] gi|561036894|gb|ESW35424.1| hypothetical protein PHAVU_001G234100g [Phaseolus vulgaris] Length = 869 Score = 719 bits (1856), Expect = 0.0 Identities = 352/751 (46%), Positives = 504/751 (67%), Gaps = 2/751 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E V + SQKHDPAWKH QM+K GD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+ Sbjct: 116 SNLEPVPITSQKHDPAWKHVQMYKNGDKVQLKCIYCQKMFKGGGIHRIKEHLACQKGNAS 175 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNS-CGLNSDM 403 TC RV DVR+ M +SL+GV V+KR+KQK+ EE+ NP + V + +N+ +N + Sbjct: 176 TCSRVPHDVRLHMQQSLDGVVVKKRRKQKIEEEIMSV-NPLTTVVNSLPNNNQVDVNQGL 234 Query: 404 VLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSF-K 580 + V +H + ++ +AV G F K Sbjct: 235 QAIGV----DHNSSLVVNPGEGMSKNMERRKKMRASKNPAAIYANSEGVVAVEKNGLFPK 290 Query: 581 KANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVI 760 + ++ ++MA+GRF +D+G P DA NS YF M+DAI+S+GA PS+H+LR ILKN + Sbjct: 291 RVDNHIHMAIGRFLYDIGAPFDAVNSVYFHEMVDAISSRGAGFERPSHHELRGWILKNSV 350 Query: 761 HEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXX 940 EV+ D+D+C W RTGCSILVD+ + G+ ++F AYC EG +FL Sbjct: 351 EEVKNDIDRCKMTWGRTGCSILVDQWATETGRVLISFLAYCPEGVVFLKSMDATEISTSA 410 Query: 941 XVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQD 1120 LY+++K++V+EVG+ VLQV+T+ E++Y +AG+RLTDT+P+++W+P A HCID +L+D Sbjct: 411 DFLYDMIKQVVDEVGVGQVLQVITSGEEQYAVAGRRLTDTFPTLYWSPSAAHCIDFILED 470 Query: 1121 IAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVN 1300 + V++QA+S++R++Y+ +A++ M++R+T G D+VD ++ T+F TLKR+V+ Sbjct: 471 FGNLEWISAVIEQAKSVTRFVYNYSAILIMVKRYTLGNDIVDPSFSQFATNFTTLKRMVD 530 Query: 1301 IRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRS 1480 ++H+LQ++V S+EW + YSK + D +S+Q+FWSSC I+RLT PLL++ RI S Sbjct: 531 LKHNLQALVTSQEWADCPYSKKSAGLEMLDCLSSQTFWSSCDMIVRLTAPLLKVLRIASS 590 Query: 1481 LKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKF 1660 PAMGY++AG+YRAKEAIKK L 
+EEY+ YW+II RWE+L HPLHAAGFYLNPKF Sbjct: 591 EMRPAMGYIYAGIYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNPKF 650 Query: 1661 FYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLL 1840 FYS++GD H I S + DCIERLV D ++ DKI+KE Y AGDFGRKMA+RARD LL Sbjct: 651 FYSIQGDIHSQIVSGMFDCIERLVSDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLL 710 Query: 1841 PTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYV 2020 P+EWW TYGGGCPNL+R AIRILSQT ++ K +++P E + N++E + L+D+V+V Sbjct: 711 PSEWWSTYGGGCPNLSRLAIRILSQTSSVMSCKRNQIPFEQIVNTRNYIERQHLTDLVFV 770 Query: 2021 QYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSAV 2200 N+ L+ M + + Q+ D +S++ I V+EW+ +D ++ WM +DP + + Sbjct: 771 HCNLRLRQMFT-SKDQDFSDPLSFDTISYVDEWIRPRDLYIDEYGNSDWMALDPSSVNTM 829 Query: 2201 HLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 L P D+ + L GFDD +IF KDSE+E Sbjct: 830 LLRPLNDEAEELDEGFDDDEIFSCGKDSEDE 860 >ref|XP_003552872.1| PREDICTED: uncharacterized protein LOC100806265 isoform X1 [Glycine max] gi|571542833|ref|XP_006601996.1| PREDICTED: uncharacterized protein LOC100806265 isoform X2 [Glycine max] Length = 758 Score = 716 bits (1849), Expect = 0.0 Identities = 356/751 (47%), Positives = 503/751 (66%), Gaps = 2/751 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E V + SQKHDPAWKH QMFK GD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+ Sbjct: 3 SNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNAS 62 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDMV 406 TC RV DVR+ M +SL+GV V+KR+KQ++ EE+ NP + V + +N+ ++ + Sbjct: 63 TCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSV-NPLTTVVNSLPNNNQVVDVNQG 121 Query: 407 LLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSF-KK 583 L + +EH V ++ +AV G F KK Sbjct: 122 LQAIG--VEHNSTLVVNPGEGMSRNMERRKKMRAAKNPAAVYANSEDVVAVEKNGLFPKK 179 Query: 584 ANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNVIH 763 ++ + MA+GRF +D+G P DA N +FQ M+DAIAS+G PS+H+LR ILKN + Sbjct: 180 MDNHIYMAIGRFLYDIGAPFDAVNLVFFQEMVDAIASKGTGFERPSHHELRGWILKNSVE 239 Query: 764 
EVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXXXX 943 EV+ D+D+C W RTGCSILVD+ T+ + ++F AYC EG +FL Sbjct: 240 EVKNDIDRCKMTWGRTGCSILVDQWTTETSRILISFLAYCPEGLVFLKSLDATEILTSPD 299 Query: 944 VLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQDI 1123 LY+L+K++VEE+G+ V+QV+T+ E++Y IAG+RL DT+P+++W+P A HCIDL+L+D Sbjct: 300 FLYDLIKQVVEEIGVGKVVQVITSGEEQYGIAGRRLMDTFPTLYWSPSAAHCIDLILEDF 359 Query: 1124 AEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIVNI 1303 + V++QA+S++R++Y+ +A++NM++R+T G D+VD +R T+F TLKR+V++ Sbjct: 360 GNLEWISAVIEQAKSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSRFATNFTTLKRMVDL 419 Query: 1304 RHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVRSL 1483 +H+LQ++V S+EW + YSK + D +SNQ+FWSSC I+ LT PLL++ RI S Sbjct: 420 KHNLQALVTSQEWADCPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVLRIAGSE 479 Query: 1484 KIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPKFF 1663 P MGYV+AG+YR KEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNPKFF Sbjct: 480 MRPGMGYVYAGMYRVKEAIKKALGKREEYMVYWNIIHHRWERLWNHPLHAAGFYLNPKFF 539 Query: 1664 YSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTLLP 1843 YS++GD I S + DCIERLVPD ++ DKI+KE Y AGDFGRKMA+RARD LLP Sbjct: 540 YSIQGDILGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSAAGDFGRKMAVRARDNLLP 599 Query: 1844 TEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVYVQ 2023 +EWW TYGGGCPNL+R AIRILSQT ++ K ++VP E + N++E + L+D+V+V Sbjct: 600 SEWWSTYGGGCPNLSRLAIRILSQTSSVMSCKRNQVPFEQIINTRNYIERQHLTDLVFVH 659 Query: 2024 YNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKD-FCSEDPAKKGWMDVDPPLGSAV 2200 N+ L+ M ++Q D +S++ + VEEW+ +D + ++ WM +DP + + Sbjct: 660 CNLRLRQMFM-SKEQNFSDPLSFDNVSNVEEWIRPRDLYVDDECGNSDWMALDPSSVNTM 718 Query: 2201 HLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 L P D+ + LG G+DD++IF KDSE+E Sbjct: 719 LLRPLNDETEDLGEGYDDYEIFSFGKDSEDE 749 >gb|ADN34075.1| DNA binding protein [Cucumis melo subsp. 
melo] Length = 752 Score = 716 bits (1848), Expect = 0.0 Identities = 367/756 (48%), Positives = 503/756 (66%), Gaps = 6/756 (0%) Frame = +2 Query: 44 SNNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA 223 S+ ++ V + QKHDPAWKHCQMFK GDRVQLKC+YC K+FKGGGIHRIKEHLAGQKGNA Sbjct: 2 SSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNA 61 Query: 224 ATCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDM 403 +TC V +V+ M ESL+GV ++KRK+QKL EEM+ N + V+ ++ N ++S + Sbjct: 62 STCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNV-NAMTAEVDAIS-NHMDMDSSI 119 Query: 404 VLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGS--- 574 L+ V E ++ E+G + ++ + VIP G Sbjct: 120 HLIEVAEPLD--TNSALLLTHEEGTSNKVGRKKGSKGKSSSCLDRE---MIVIPNGGGIL 174 Query: 575 -FKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILK 751 + + V+MA+GRF +D+G +A NS YFQPMI++IA G + PSYHD+R ILK Sbjct: 175 DSNRDRNQVHMAIGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILK 234 Query: 752 NVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXX 931 N + EVR D D+C A W TGCS++VD+ + G+T +NF YC +GT+FL Sbjct: 235 NSVEEVRGDFDRCKATWGMTGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIM 294 Query: 932 XXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLM 1111 +LYEL+K++VE+VG+++V+QV+T E+ + IAG++L+DTYP+++WTPCA C+DL+ Sbjct: 295 DSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLI 354 Query: 1112 LQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKR 1291 L DI V V++QARSI+R++Y+N+ V+NM+R+ TFG D+V+ TRS T+F TL R Sbjct: 355 LADIGNIEDVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNR 414 Query: 1292 IVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRI 1471 +V+++ LQ+MV S+EW +S YSK + D IS++SFWSSC SIIRLT+PLLR+ RI Sbjct: 415 MVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIIRLTNPLLRVLRI 474 Query: 1472 VRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLN 1651 V S K PAMGYV+A +Y AK AIK EL ++ Y+ YW+IID RWE RHPL AAGFYLN Sbjct: 475 VGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLCAAGFYLN 534 Query: 1652 
PKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARD 1831 PK+FYS+EGD H I S + DCIERLV D V DKI+KE SY +GDF RK AIRAR Sbjct: 535 PKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARG 594 Query: 1832 TLLPTEWWLTYG-GGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSD 2008 TLLP EWW T G GGCPNL R A RILSQTC + K ++V + LH N +EH+RLSD Sbjct: 595 TLLPAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNQVFFDKLHDTRNHIEHQRLSD 654 Query: 2009 IVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVD-PP 2185 +V+V+ N+ LK M + + D +S++ + +V++WV KD +ED W ++ PP Sbjct: 655 LVFVRSNLQLKQMATNVNEHYPTDPLSFDGLGIVDDWVWKKDLSAEDCGNLEWTVLENPP 714 Query: 2186 LGSAVHLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 + L PQ D L AGFDD ++F+ ++SE++ Sbjct: 715 FSPPMRL-PQNDGYDDLVAGFDDLEVFKRQRESEDD 749 >ref|XP_004169404.1| PREDICTED: uncharacterized protein LOC101226173 [Cucumis sativus] Length = 752 Score = 714 bits (1842), Expect = 0.0 Identities = 368/756 (48%), Positives = 499/756 (66%), Gaps = 6/756 (0%) Frame = +2 Query: 44 SNNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNA 223 S+ ++ V + QKHDPAWKHCQMFK GDRVQLKC+YC K+FKGGGIHRIKEHLAGQKGNA Sbjct: 2 SSGLQPVPITPQKHDPAWKHCQMFKNGDRVQLKCLYCHKLFKGGGIHRIKEHLAGQKGNA 61 Query: 224 ATCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDM 403 +TC V +V+ M ESL+GV ++KRK+QKL EEM+ N V+ ++ N ++S + Sbjct: 62 STCHSVPPEVQNIMQESLDGVMMKKRKRQKLDEEMTNV-NTMTGEVDGIS-NHMDMDSSI 119 Query: 404 VLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGS--- 574 L+ V E +E E G + + + VIP G Sbjct: 120 HLIEVAEPLE--TNSVLLLTHEKGTSNKVGRKKGSKGKSSSCLERE---MIVIPNGGGIL 174 Query: 575 -FKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILK 751 + + V+MAVGRF +D+G +A NS YFQPMI++IA G + PSYHD+R ILK Sbjct: 175 DSNRDRNQVHMAVGRFLYDIGASLEAVNSAYFQPMIESIALAGTGIIPPSYHDIRGWILK 234 Query: 752 NVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXX 931 N + EVR D D+C A W TGCS++VD+ + G+T +NF YC +GT+FL Sbjct: 235 NSMEEVRSDFDRCKATWGITGCSVMVDQWCTEAGRTMLNFLVYCPKGTVFLESVDASGIM 294 Query: 932 
XXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLM 1111 +LYEL+K++VE+VG+++V+QV+T E+ + IAG++L+DTYP+++WTPCA C+DL+ Sbjct: 295 DSPDLLYELLKKVVEQVGVKHVVQVITRFEENFAIAGRKLSDTYPTLYWTPCAASCVDLI 354 Query: 1112 LQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKR 1291 L DI V V++QARSI+R++Y+N+ V+NM+R+ TFG D+V+ TRS T+F TL R Sbjct: 355 LGDIGNIEGVNTVIEQARSITRFVYNNSMVLNMVRKCTFGNDIVEPCLTRSATNFATLNR 414 Query: 1292 IVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRI 1471 +V+++ LQ+MV S+EW +S YSK + D IS++SFWSSC SII LT+PLLR+ RI Sbjct: 415 MVDLKRCLQNMVTSQEWMDSPYSKRPGGLEMLDLISSESFWSSCNSIISLTNPLLRVLRI 474 Query: 1472 VRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLN 1651 V S K PAMGYV+A +Y AK AIK EL ++ Y+ YW+IID RWE RHPL+AAGFYLN Sbjct: 475 VGSGKRPAMGYVYAAMYNAKLAIKTELINRDRYMVYWNIIDQRWEHHWRHPLYAAGFYLN 534 Query: 1652 PKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARD 1831 PK+FYS+EGD H I S + DCIERLV D V DKI+KE SY +GDF RK AIRAR Sbjct: 535 PKYFYSIEGDMHGEILSGMFDCIERLVSDTNVQDKIIKEITSYKNASGDFARKTAIRARG 594 Query: 1832 TLLPTEWWLTYG-GGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSD 2008 TLLP EWW T G GGCPNL R A RILSQTC + K + + LH N +EH+RLSD Sbjct: 595 TLLPAEWWSTCGEGGCPNLTRLATRILSQTCSSVGFKQNDALFDKLHDTRNHIEHQRLSD 654 Query: 2009 IVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVD-PP 2185 +V+V+ N+ LK M + + D +S++++ +V++WV KD +ED W +D PP Sbjct: 655 LVFVRSNLQLKQMATNVNEHYPTDPLSFDELGIVDDWVWKKDLSAEDCGNLEWTVLDNPP 714 Query: 2186 LGSAVHLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 + L PQ D L AGFDD ++F+ ++SE++ Sbjct: 715 FSPPMRL-PQSDGYDDLVAGFDDLEVFKRQRESEDD 749 >ref|XP_006591347.1| PREDICTED: uncharacterized protein LOC100817502 isoform X4 [Glycine max] Length = 729 Score = 691 bits (1782), Expect = 0.0 Identities = 356/754 (47%), Positives = 495/754 (65%), Gaps = 5/754 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 +N+E V + SQKHDPAWKH QMFK GD+VQLKCIYC K+FKGGGIHRIKEHLA QKGNA+ Sbjct: 3 
SNLEPVPITSQKHDPAWKHVQMFKNGDKVQLKCIYCLKMFKGGGIHRIKEHLACQKGNAS 62 Query: 227 TCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNS---CGLNS 397 TC RV DVR+ M +SL+GV V+KR+KQ++ EE+ NP + V + +N+ +N Sbjct: 63 TCSRVPHDVRLHMQQSLDGVVVKKRRKQRIEEEIMSV-NPLTTVVNSLPNNNNRVVDVNQ 121 Query: 398 DMVLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSF 577 + + V EH V ++ +AV G F Sbjct: 122 GLQAIGV----EHNSSLVVNPGEGMSRNMERRKKMRATKNPAAVYANSEGVIAVEKNGLF 177 Query: 578 -KKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKN 754 KK ++ + MA+GRF +D+G P DA NS YFQ M+DAIAS+G P +H+LR ILKN Sbjct: 178 PKKMDNHIYMAIGRFLYDIGAPFDAVNSVYFQEMVDAIASRGVGFERPWHHELRGWILKN 237 Query: 755 VIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXX 934 + EV+ D+D+C W RTGCSILVD+ T T +F Sbjct: 238 SVEEVKNDIDRCKMTWGRTGCSILVDQWT-----TETDF--------------------- 271 Query: 935 XXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLML 1114 LY+L+K++VEEVG V+QV+T+ E++Y IAG+RLTDT+P+++ +P A HCIDL+L Sbjct: 272 ----LYDLIKQVVEEVGAGQVVQVITSGEEQYGIAGRRLTDTFPTLYLSPSAAHCIDLIL 327 Query: 1115 QDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRI 1294 +D + V++QARS++R++Y+ +A++NM++R+T G D+VD + T+F TLKR+ Sbjct: 328 EDFGNLEWISAVIEQARSVTRFVYNYSAILNMVKRYTLGNDIVDPSFSHFATNFTTLKRM 387 Query: 1295 VNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIV 1474 V+++H+LQ++V S+EW +S YSK + D +SNQ+FWSSC I+ LT PLL++ RI Sbjct: 388 VDLKHNLQALVTSQEWADSPYSKQTAGLEMLDCLSNQTFWSSCDMIVCLTAPLLKVMRIA 447 Query: 1475 RSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNP 1654 S PAMGYV+AG+YRAKEAIKK L +EEY+ YW+II RWE+L HPLHAAGFYLNP Sbjct: 448 SSEMRPAMGYVYAGMYRAKEAIKKALGKREEYMVYWNIIHHRWERLWHHPLHAAGFYLNP 507 Query: 1655 KFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDT 1834 KFFYS++GD H I S + DCIERLVPD ++ DKI+KE Y +GDFGRKMA+RARD Sbjct: 508 KFFYSIQGDIHGQIVSGMFDCIERLVPDTRIQDKIIKEINLYKSASGDFGRKMAVRARDN 567 Query: 1835 LLPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIV 2014 LLP+EWW TYGGGCPNL+R AIRILSQT ++ K +++P E + N++E + 
L+D+V Sbjct: 568 LLPSEWWSTYGGGCPNLSRLAIRILSQTSSVMSCKRNQIPFEQIINTRNYIERQHLTDLV 627 Query: 2015 YVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKD-FCSEDPAKKGWMDVDPPLG 2191 +V N+ L+ M ++Q+ D +S++ I VEEW+ +D + ++ WM +DP Sbjct: 628 FVHCNLRLRQMFM-SKEQDFSDPLSFDNISNVEEWIRPRDLYIDDECGNSDWMALDPSSV 686 Query: 2192 SAVHLGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 + + L P D+ + LG G+DD++IF KDSE+E Sbjct: 687 NTMLLRPLNDEAEDLGEGYDDYEIFSCGKDSEDE 720 >ref|XP_002521049.1| DNA binding protein, putative [Ricinus communis] gi|223539752|gb|EEF41333.1| DNA binding protein, putative [Ricinus communis] Length = 854 Score = 686 bits (1770), Expect = 0.0 Identities = 343/755 (45%), Positives = 471/755 (62%), Gaps = 15/755 (1%) Frame = +2 Query: 62 VAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCLRV 241 + V K D AWK+CQ K GDRVQ+KC YCGK+FKGGGIHR KEHLAG+KG A C RV Sbjct: 120 IIVTRHKKDMAWKYCQPSKYGDRVQIKCNYCGKVFKGGGIHRFKEHLAGRKGAAPICDRV 179 Query: 242 QADVRMQMLESLNGVAVRKRKKQKLAEEMSGFSNPGNSGVEIVAHNSCGLNSDMVLLPVP 421 +DVR+ M + L+ V +++K++ + EE +P PVP Sbjct: 180 PSDVRLLMQQCLHEVVPKQKKQKVVIEETINVDSP----------------------PVP 217 Query: 422 EMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNT---------ALAVIPAGS 574 + ++G DV+N N A++ G Sbjct: 218 LNTDTFANHFGDEDDDNGAPISVEFNSNLSLEEDDVLNQGNLHTRKRGRGKTSAIVDHGD 277 Query: 575 ------FKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLR 736 K ++V++ VGRF +D+G DA +S YF+ +ID ++S + AV PS HDLR Sbjct: 278 PLDVVHLKMIDNVIHTTVGRFLYDIGANFDALDSIYFRSLIDMLSSGASGAVAPSNHDLR 337 Query: 737 NSILKNVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXX 916 ILK ++ E++ D+DQ W RTGCS+LV+E S G T +NF CS+GT+FL Sbjct: 338 GWILKKLVEEIKNDIDQSRTTWARTGCSVLVEEWNSESGITLLNFLVNCSQGTVFLKSVE 397 Query: 917 XXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGH 1096 LY L+K++VEEVG NVLQV+T + Y +AGKRL + +PS+FW PCA H Sbjct: 398 ASHIIYSPDGLYVLLKQVVEEVGASNVLQVITNGNEHYTVAGKRLMEAFPSLFWAPCAVH 457 Query: 1097 CIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDF 1276 C+DL+L+D A+ + V++QA+S++R++Y+++AV+N+MR+FT+G D+V G TRS 
T+F Sbjct: 458 CLDLILEDFAKLEWIDAVIEQAKSVTRFVYNHSAVLNLMRKFTYGKDIVQQGLTRSATNF 517 Query: 1277 MTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLL 1456 L+R+ + + +LQ+M+ S+EW + YSK A+ D ISN+SFWSSC IIRLT PL+ Sbjct: 518 TMLQRMADFKLNLQTMITSQEWMDCPYSKQHGGLAMLDIISNRSFWSSCILIIRLTSPLI 577 Query: 1457 RLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAA 1636 R+ I + AMGY+FAG+YRAKE IK+EL +E+Y+ YW+IID RW+Q + PLH A Sbjct: 578 RVLGIAGGKRKAAMGYIFAGIYRAKETIKRELVKREDYMVYWNIIDHRWDQRRHPPLHVA 637 Query: 1637 GFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMA 1816 GF+LNPKFFYS+EGD H+ I S V DCIERLVPD++V DKI KE Y GD GRKMA Sbjct: 638 GFFLNPKFFYSIEGDVHNEILSRVFDCIERLVPDIEVQDKIAKELNIYKNAVGDLGRKMA 697 Query: 1817 IRARDTLLPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHR 1996 IR+R TLLP EWW TYGGGCPNLAR A+RILSQTC I + + +P E +H N LE + Sbjct: 698 IRSRGTLLPAEWWSTYGGGCPNLARLALRILSQTCSSIGCRSNHIPFEKVHATRNCLEQK 757 Query: 1997 RLSDIVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDV 2176 R SD+V+VQ N+ LK M + Q +D IS++ I +VE+W+ D C ED WM + Sbjct: 758 RRSDLVFVQCNLRLKEMVDESKNQVPLDPISFDNISIVEDWILQNDICLEDYESADWMSL 817 Query: 2177 DPPLGSAVHLGPQIDDVQALGAGFDDFDIFEAAKD 2281 PP + + G +D+++ LG GF DF+IFE K+ Sbjct: 818 VPPSANNMPAGSAVDEIEDLGVGFTDFEIFERLKE 852 Score = 98.6 bits (244), Expect = 1e-17 Identities = 47/97 (48%), Positives = 62/97 (63%), Gaps = 5/97 (5%) Frame = +2 Query: 80 KHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATCLRVQADVRM 259 KHD WK+C+M K G++V +KC YCGKIFKGGGI R KEHLAG+KG CL V ADVR+ Sbjct: 12 KHDLGWKYCEMIKEGEKVHIKCSYCGKIFKGGGIFRFKEHLAGRKGGGPMCLNVPADVRL 71 Query: 260 QMLESLNGVAV-----RKRKKQKLAEEMSGFSNPGNS 355 M ++L+ + R+ + K+ E+ N NS Sbjct: 72 LMEQTLDVSSAKQSSRRQSSRLKMTPELPSLPNNKNS 108 >ref|XP_007049027.1| HAT transposon superfamily, putative [Theobroma cacao] gi|508701288|gb|EOX93184.1| HAT transposon superfamily, putative [Theobroma cacao] Length = 750 Score = 676 bits (1743), Expect = 0.0 Identities = 339/753 (45%), Positives = 483/753 (64%), Gaps = 6/753 (0%) Frame 
= +2 Query: 50 NMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAAT 229 N+ +++ QK DPAW HC+ FK G+R+Q+KC+YCGK+FKGGGIHR KEHLAG+KG Sbjct: 4 NLTPISITKQKQDPAWNHCEAFKNGERLQIKCMYCGKMFKGGGIHRFKEHLAGRKGQGPI 63 Query: 230 CLRVQADVRMQMLESLNGVAVRKRKKQKLAEEM--SGFSNPGNSGVEIVAHNSCGLNSDM 403 C +V VR M ESLNGV +++ KQ E+ G S+P ++ A++ +N+ + Sbjct: 64 CEQVPPGVRALMQESLNGVLLKQDNKQNAIPELLACGGSSPHAGEIDKSAYSD-DVNNGV 122 Query: 404 VLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTA---LAVIPAGS 574 + V +E E L NS++ A LA++ G Sbjct: 123 KPIQVLNSLEPDSSLVLNGKGEVSQGIRDSKKRGRDRSLL--ANSHSCAKSDLALVSIG- 179 Query: 575 FKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKN 754 A + V+MA+GRF +D+G+ DA NS YFQPMIDAIAS G+ V PS DLR ILKN Sbjct: 180 ---AENPVHMAIGRFLYDIGVNLDAVNSVYFQPMIDAIASTGSGIVPPSSQDLRGWILKN 236 Query: 755 VIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXX 934 V+ EV+ D+D+ W +TGCSILV++ + G+T ++F YC + T+FL Sbjct: 237 VMEEVKDDIDRNKTMWGKTGCSILVEQWSPKSGRTLLSFLVYCPQATVFLKSVDASRVIF 296 Query: 935 XXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLML 1114 L EL+K++VEEVG+ NV+QV+T E++Y +AGKRL +++PS++W PC HC+D+ML Sbjct: 297 SADHLNELLKQVVEEVGVENVVQVITNCEEQYFLAGKRLMESFPSLYWAPCLVHCVDMML 356 Query: 1115 QDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRI 1294 +D A + ++QA+S++R++Y+++ V+NMMRRFTF D+V+ TR ++F TLKR+ Sbjct: 357 EDFANLEWISETIEQAKSVTRFVYNHSVVLNMMRRFTFHNDIVEPAVTRFASNFATLKRM 416 Query: 1295 VNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIV 1474 +++ LQ+MVNS++W+E Y+K + D + N+SFW+SC I+RL PLL++ IV Sbjct: 417 ADLKLKLQAMVNSQDWSECPYAKKPGGLVMLDIVKNRSFWNSCILIVRLIYPLLQVLEIV 476 Query: 1475 RSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNP 1654 S K MGYV+AG+YRAKE IKKEL K++Y+ YW+IID RWEQ + PL+AA F+LNP Sbjct: 477 GSKKRSTMGYVYAGIYRAKETIKKELVKKDDYMVYWNIIDHRWEQQRHIPLYAAAFFLNP 536 Query: 1655 KFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDT 1834 KFFYS+EG+ H+ I S + DCIERLVPD V D+I++E Y GD GR MA+RARD Sbjct: 537 
KFFYSIEGNIHNDILSSMFDCIERLVPDTNVQDQIVREIHLYKNATGDLGRPMAVRARDN 596 Query: 1835 LLPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIV 2014 LLP EWW YGGGCPNL AIRILSQTC I +K +K+ +E +H N+LEH+RLSD+V Sbjct: 597 LLPGEWWSMYGGGCPNLQHLAIRILSQTCSSIGSKPNKISIEEIHDTRNFLEHQRLSDLV 656 Query: 2015 YVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGS 2194 YV+YN+ L+ M + ++ D +S+ ++ ++W+ C ED WM +DPP+GS Sbjct: 657 YVRYNLYLRQMVLRSQDKDSADPLSFNSKEIRDDWIAYNAVCEEDYGSSDWMSLDPPVGS 716 Query: 2195 AVHLGPQIDDVQ-ALGAGFDDFDIFEAAKDSEE 2290 + G D+ + LG GF D +IF E+ Sbjct: 717 RMLSGTSGDETEDFLGTGFADLEIFNGLNGVED 749 >ref|XP_006380932.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] gi|550335284|gb|ERP58729.1| hypothetical protein POPTR_0006s02210g [Populus trichocarpa] Length = 847 Score = 672 bits (1734), Expect = 0.0 Identities = 345/746 (46%), Positives = 476/746 (63%), Gaps = 3/746 (0%) Frame = +2 Query: 47 NNMELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQ-KGNA 223 +N E +K D WKHC+M K RVQ+KC YC K+FKGGGIHR KEHLAG+ G Sbjct: 110 SNFESAPGMRRKKDVGWKHCEMLKNEKRVQIKCNYCAKLFKGGGIHRFKEHLAGRNSGGV 169 Query: 224 ATCLRVQADVRMQMLESLNGVAVRKRKKQKLAEEM--SGFSNPGNSGVEIVAHNSCGLNS 397 +C RV +DVR M + L+ + VR+RKK+K E S PG V I A S Sbjct: 170 PSCTRVPSDVRDLMEQHLSPIVVRQRKKRKSKREKLDDVDSPPGGEDVYIFAD-----YS 224 Query: 398 DMVLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDVVNSNNTALAVIPAGSF 577 D ++ P+ + DG + V +++ A A+I GS Sbjct: 225 DDMITPLRAVAACNLVEVNSDFLLDG--EGTSNGNLGTRKSAIAVAASDDADALIAMGS- 281 Query: 578 KKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEAVGPSYHDLRNSILKNV 757 + A++ ++ GRF +D+G DA +S + QP+ID +A PS+ DLR ILK++ Sbjct: 282 ETADNPIHAIWGRFLYDIGASLDAMDSNFSQPLIDTVAYGRPGIAAPSHQDLRGRILKSL 341 Query: 758 IHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXXXXXXX 937 + EV+ D++Q W +TGCS+LV+E S G T +NF YCS+GT+FL Sbjct: 342 VEEVKSDINQYKTRWVKTGCSLLVEECNSESGVTTLNFLVYCSKGTVFLKSVDASNLIHS 401 Query: 938 XXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCIDLMLQ 1117 LYEL+K +VEEVG N+LQV+T E+ Y+ AGK+L 
DT+PS++W PCA CIDL+L+ Sbjct: 402 TDGLYELLKLMVEEVGAGNILQVITNGEEHYIAAGKKLMDTFPSLYWAPCAARCIDLILE 461 Query: 1118 DIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMTLKRIV 1297 DI + + VL+QA+S++R++Y+N+AV+N+MR+FT G D+V G TRS T+F LKR+ Sbjct: 462 DIGKLDWINTVLEQAKSVTRFVYNNSAVLNLMRKFTSGSDIVQQGITRSATNFTALKRMA 521 Query: 1298 NIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRLFRIVR 1477 N + +LQ+MV S+EW + YSK A+ D I+N+SFWSSC IIRLT PLL++ IV Sbjct: 522 NFKLNLQTMVTSQEWMDCPYSKQPGGLAMVDIITNRSFWSSCILIIRLTSPLLQVLVIVS 581 Query: 1478 SLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGFYLNPK 1657 S K AMGYVF+G+YRAKE IKKEL +E+Y+ YW+IID RWEQ + PLHAAGF+ NPK Sbjct: 582 SEKRAAMGYVFSGIYRAKETIKKELVKREDYMVYWNIIDHRWEQQWQTPLHAAGFFFNPK 641 Query: 1658 FFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIRARDTL 1837 FFYS+EGD H+ I S + DCIERLVPD +V DKI+KE Y G G+K+AIRAR T+ Sbjct: 642 FFYSIEGDMHNKILSRMFDCIERLVPDTEVQDKIVKELTLYKNAEGHLGKKLAIRARGTM 701 Query: 1838 LPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRLSDIVY 2017 LPT+WW YGG CPNLAR AIRILSQTC I + +P E +H+ N+L+ +RL+D+V+ Sbjct: 702 LPTDWWSMYGGSCPNLARLAIRILSQTCSAIGCSHNHIPFEKVHRTRNFLQRQRLTDLVF 761 Query: 2018 VQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDPPLGSA 2197 VQYN+ L+ M G+++Q D IS++ + LVE+W+T + C ED WM + P + Sbjct: 762 VQYNLRLRQMVDGNKKQIPEDPISFDDVSLVEDWITQNELCLEDSGSSDWMSLVPRSVNT 821 Query: 2198 VHLGPQIDDVQALGAGFDDFDIFEAA 2275 + L P D+ + + +GFDDF+IF + Sbjct: 822 MPLAPSTDESEDVASGFDDFEIFNGS 847 Score = 100 bits (249), Expect = 3e-18 Identities = 50/99 (50%), Positives = 66/99 (66%), Gaps = 1/99 (1%) Frame = +2 Query: 53 MELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATC 232 ME V S K D AWKHCQMF+ G++ ++KCIYCG+IF+GGGIHR KEHLAG KG C Sbjct: 1 MEEFLVPSPK-DLAWKHCQMFEEGEKTRMKCIYCGEIFEGGGIHRFKEHLAGPKGGGPMC 59 Query: 233 LRVQADVRMQMLESLNGVAVRKRKKQ-KLAEEMSGFSNP 346 V DVR+ M + L+ + ++ +Q K+ EE S + P Sbjct: 60 QSVPPDVRLLMQQDLDVITAKQNSQQLKIQEEESDVNLP 98 >gb|AAM98154.1| putative protein [Arabidopsis thaliana] Length = 768 Score = 
621 bits (1602), Expect = e-175 Identities = 329/772 (42%), Positives = 469/772 (60%), Gaps = 25/772 (3%) Frame = +2 Query: 53 MELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATC 232 +E VA+ QK D AWKHC+++K GDR+Q++C+YC K+FKGGGI R+KEHLAG+KG C Sbjct: 5 LEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQGTIC 64 Query: 233 LRVQADVRMQMLESLNGVAVRKRKKQKLAEEM---------------------SGFSNPG 349 +V DVR+ + + ++G R+RK+ K + E GF +PG Sbjct: 65 DQVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPPIEGDMMVVQPDVNDGFKSPG 124 Query: 350 NSGVEIVAHNSCGLNSDMVLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDV 529 +S ++V N L+ + E+G L Sbjct: 125 SS--DVVVQNESLLSG---------RTKQRTYRSKKNAFENGSASNNVDLIGRDMDNLIP 173 Query: 530 VNSNNTALAVIPAGSFKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEA 709 V ++ V P SF+ + ++MA+GRF F +G DA NS FQPMIDAIAS G Sbjct: 174 VAISSVKNIVHP--SFRDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGV 231 Query: 710 VGPSYHDLRNSILKNVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSE 889 P++ DLR ILKN + E+ ++D+C A W+RTGCSILV+E S KG +NF YC E Sbjct: 232 SAPTHDDLRGWILKNCVEEMAKEIDECKAMWKRTGCSILVEELNSDKGFKVLNFLVYCPE 291 Query: 890 GTIFLXXXXXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPS 1069 +FL L+EL+ E+VEEVG NV+QV+T +D YV AGKRL YPS Sbjct: 292 KVVFLKSVDASEVLSSADKLFELLSELVEEVGSTNVVQVITKCDDYYVDAGKRLMLVYPS 351 Query: 1070 IFWTPCAGHCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDV 1249 ++W PCA HCID ML++ + + ++QA++I+R++Y+++ V+N+M +FT G D++ Sbjct: 352 LYWVPCAAHCIDQMLEEFGKLGWISETIEQAQAITRFVYNHSGVLNLMWKFTSGNDILLP 411 Query: 1250 GTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCAS 1429 + S T+F TL RI ++ +LQ+MV S EW E SYS++ V +++++++FW + A Sbjct: 412 AFSSSATNFATLGRIAELKSNLQAMVTSAEWNECSYSEEPSGL-VMNALTDEAFWKAVAL 470 Query: 1430 IIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQ 1609 + LT PLLR RIV S K PAMGYV+A LYRAK+AIK L +E+Y+ YW IID WEQ Sbjct: 471 VNHLTSPLLRALRIVCSEKRPAMGYVYAALYRAKDAIKTHLVNREDYIIYWKIIDRWWEQ 530 Query: 1610 LQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIG 1789 Q PL AAGF+LNPK 
FY+ + + V DCIERLVPD K+ DKI+KE SY Sbjct: 531 QQHIPLLAAGFFLNPKLFYNTNEEMRSELILSVLDCIERLVPDDKIQDKIIKELTSYKTA 590 Query: 1790 AGDFGRKMAIRARDTLLPTEWWLTYGGGCPNLARFAIRILSQTC-CLIQNKLDKVPLEHL 1966 G FGR +AIRARDT+LP EWW TYG C NL+RFAIRILSQTC + + +++P+EH+ Sbjct: 591 GGVFGRNLAIRARDTMLPAEWWSTYGESCLNLSRFAIRILSQTCSSSVSCRRNQIPVEHI 650 Query: 1967 HKRTNWLEHRRLSDIVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSE 2146 ++ N +E +RLSD+V+VQYNM L+ +G G + +D +S+ +ID+++EWV+ C E Sbjct: 651 YQSKNSIEQKRLSDLVFVQYNMRLRQLGPGS-GDDTLDPLSHNRIDVLKEWVSGDQACVE 709 Query: 2147 DPAKKGWMDVDPPLGSAVH---LGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 W ++ ++H + P IDD + LG+GFDD +IF+ K+ +E Sbjct: 710 GNGSADWKSLE-----SIHRNQVAPIIDDTEDLGSGFDDIEIFKVEKEVRDE 756 >ref|NP_001154234.1| hAT transposon superfamily [Arabidopsis thaliana] gi|240255844|ref|NP_193238.5| hAT transposon superfamily [Arabidopsis thaliana] gi|332658140|gb|AEE83540.1| hAT transposon superfamily [Arabidopsis thaliana] gi|332658141|gb|AEE83541.1| hAT transposon superfamily [Arabidopsis thaliana] Length = 768 Score = 621 bits (1601), Expect = e-175 Identities = 329/772 (42%), Positives = 469/772 (60%), Gaps = 25/772 (3%) Frame = +2 Query: 53 MELVAVNSQKHDPAWKHCQMFKVGDRVQLKCIYCGKIFKGGGIHRIKEHLAGQKGNAATC 232 +E VA+ QK D AWKHC+++K GDR+Q++C+YC K+FKGGGI R+KEHLAG+KG C Sbjct: 5 LEPVALTPQKQDNAWKHCEIYKYGDRLQMRCLYCRKMFKGGGITRVKEHLAGKKGQGTIC 64 Query: 233 LRVQADVRMQMLESLNGVAVRKRKKQKLAEEM---------------------SGFSNPG 349 +V DVR+ + + ++G R+RK+ K + E GF +PG Sbjct: 65 DQVPEDVRLFLQQCIDGTVRRQRKRHKSSSEPLSVASLPPIEGDMMVVQPDVNDGFKSPG 124 Query: 350 NSGVEIVAHNSCGLNSDMVLLPVPEMIEHXXXXXXXXXREDGMXXXXXXXXXXXXXALDV 529 +S ++V N L+ + E+G L Sbjct: 125 SS--DVVVQNESLLSG---------RTKQRTYRSKKNAFENGSASNNVDLIGRDMDNLIP 173 Query: 530 VNSNNTALAVIPAGSFKKANSVVNMAVGRFFFDVGLPADAANSPYFQPMIDAIASQGAEA 709 V ++ V P SF+ + ++MA+GRF F +G DA NS FQPMIDAIAS G Sbjct: 174 VAISSVKNIVHP--SFRDRENTIHMAIGRFLFGIGADFDAVNSVNFQPMIDAIASGGFGV 231 Query: 710 VGPSYHDLRNSILKNVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSE 889 P++ DLR ILKN 
+ E+ ++D+C A W+RTGCSILV+E S KG +NF YC E Sbjct: 232 SAPTHDDLRGWILKNCVEEMAKEIDECKAMWKRTGCSILVEELNSDKGFKVLNFLVYCPE 291 Query: 890 GTIFLXXXXXXXXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPS 1069 +FL L+EL+ E+VEEVG NV+QV+T +D YV AGKRL YPS Sbjct: 292 KVVFLKSVDASEVLSSADKLFELLSELVEEVGSTNVVQVITKCDDYYVDAGKRLMLVYPS 351 Query: 1070 IFWTPCAGHCIDLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDV 1249 ++W PCA HCID ML++ + + ++QA++I+R++Y+++ V+N+M +FT G D++ Sbjct: 352 LYWVPCAAHCIDQMLEEFGKLGWISETIEQAQAITRFVYNHSGVLNLMWKFTSGNDILLP 411 Query: 1250 GTTRSFTDFMTLKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCAS 1429 + S T+F TL RI ++ +LQ+MV S EW E SYS++ V +++++++FW + A Sbjct: 412 AFSSSATNFATLGRIAELKSNLQAMVTSAEWNECSYSEEPSGL-VMNALTDEAFWKAVAL 470 Query: 1430 IIRLTDPLLRLFRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQ 1609 + LT PLLR RIV S K PAMGYV+A LYRAK+AIK L +E+Y+ YW IID WEQ Sbjct: 471 VNHLTSPLLRALRIVCSEKRPAMGYVYAALYRAKDAIKTHLVNREDYIIYWKIIDRWWEQ 530 Query: 1610 LQRHPLHAAGFYLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIG 1789 Q PL AAGF+LNPK FY+ + + V DCIERLVPD K+ DKI+KE SY Sbjct: 531 QQHIPLLAAGFFLNPKLFYNTNEEIRSELILSVLDCIERLVPDDKIQDKIIKELTSYKTA 590 Query: 1790 AGDFGRKMAIRARDTLLPTEWWLTYGGGCPNLARFAIRILSQTC-CLIQNKLDKVPLEHL 1966 G FGR +AIRARDT+LP EWW TYG C NL+RFAIRILSQTC + + +++P+EH+ Sbjct: 591 GGVFGRNLAIRARDTMLPAEWWSTYGESCLNLSRFAIRILSQTCSSSVSCRRNQIPVEHI 650 Query: 1967 HKRTNWLEHRRLSDIVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSE 2146 ++ N +E +RLSD+V+VQYNM L+ +G G + +D +S+ +ID+++EWV+ C E Sbjct: 651 YQSKNSIEQKRLSDLVFVQYNMRLRQLGPGS-GDDTLDPLSHNRIDVLKEWVSGDQACVE 709 Query: 2147 DPAKKGWMDVDPPLGSAVH---LGPQIDDVQALGAGFDDFDIFEAAKDSEEE 2293 W ++ ++H + P IDD + LG+GFDD +IF+ K+ +E Sbjct: 710 GNGSADWKSLE-----SIHRNQVAPIIDDTEDLGSGFDDIEIFKVEKEVRDE 756 >ref|XP_004305893.1| PREDICTED: uncharacterized protein LOC101310825 [Fragaria vesca subsp. 
vesca] Length = 869 Score = 611 bits (1576), Expect = e-172 Identities = 296/568 (52%), Positives = 397/568 (69%), Gaps = 2/568 (0%) Frame = +2 Query: 569 GSFKKANSV-VNMAVGRFFFDVGLPADAA-NSPYFQPMIDAIASQGAEAVGPSYHDLRNS 742 G +KANS + MA+GRF +++ P DA NS YFQPMIDAIAS G E+ PSYHDLR Sbjct: 289 GEVEKANSQQIQMAIGRFLYEIQAPLDAVKNSLYFQPMIDAIASGGMESKAPSYHDLRGW 348 Query: 743 ILKNVIHEVRYDVDQCIAAWERTGCSILVDESTSGKGKTFVNFFAYCSEGTIFLXXXXXX 922 IL + EV+ ++ Q +WER GCS+LV++ S KG+ +NF YC EGT +L Sbjct: 349 ILNDAAEEVKNEIYQHTNSWERNGCSLLVNQFNSEKGRILLNFSVYCPEGTTYLKSVDAS 408 Query: 923 XXXXXXXVLYELMKEIVEEVGLRNVLQVVTTIEDRYVIAGKRLTDTYPSIFWTPCAGHCI 1102 LYE++K++VEEVG+R VLQV+T E+ YV+AGKRL DT+P+++W+PCA CI Sbjct: 409 TFINSPDALYEILKQVVEEVGVRRVLQVITNSEEHYVVAGKRLMDTFPTLYWSPCAAACI 468 Query: 1103 DLMLQDIAEFPTVKMVLDQARSISRYIYSNTAVINMMRRFTFGVDLVDVGTTRSFTDFMT 1282 + +L+D +F + ++ QARS++R+IY + ++NMMRR+TFG D+V +G TR TDFMT Sbjct: 469 NSILEDFGKFEWINSIIAQARSVTRFIYKHVVILNMMRRYTFGNDIVKLGITRYATDFMT 528 Query: 1283 LKRIVNIRHSLQSMVNSEEWTESSYSKDQEAFAVQDSISNQSFWSSCASIIRLTDPLLRL 1462 LK++ +++ +LQ+MV S+EW YSK E A+ D +SN +FWSSC I R T+PLL++ Sbjct: 529 LKQMADLKFNLQTMVTSKEWEGCPYSKTPEGLAMLDLLSNHTFWSSCIMITRFTNPLLQV 588 Query: 1463 FRIVRSLKIPAMGYVFAGLYRAKEAIKKELDTKEEYLPYWSIIDSRWEQLQRHPLHAAGF 1642 RIV S K AMGYVF G+YRAKE IK+EL KE Y YW+IID RW +L HPLHAAGF Sbjct: 589 LRIVGSQKKAAMGYVFGGMYRAKETIKRELVKKEVYTAYWNIIDYRWAKLWDHPLHAAGF 648 Query: 1643 YLNPKFFYSLEGDGHHHIQSLVNDCIERLVPDLKVLDKIMKEKASYHIGAGDFGRKMAIR 1822 YLNPKFFYS++G+ H I S + DCIE+LVPDLKV D+I KE Y GD GR +AIR Sbjct: 649 YLNPKFFYSIKGEMHKVIMSRMFDCIEKLVPDLKVQDEISKEINLYQNAVGDMGRNLAIR 708 Query: 1823 ARDTLLPTEWWLTYGGGCPNLARFAIRILSQTCCLIQNKLDKVPLEHLHKRTNWLEHRRL 2002 ARDTLLP EWW TYG GCPN+AR A+ ILSQTC LIQ K +++P + LHK N LEH+RL Sbjct: 709 ARDTLLPAEWWSTYGSGCPNMARLAVHILSQTCSLIQCKENQIPFDQLHKTRNSLEHQRL 768 Query: 2003 SDIVYVQYNMSLKHMGSGDRQQEGVDTISYEQIDLVEEWVTDKDFCSEDPAKKGWMDVDP 2182 SD V++QYN+ L+ M +++ VD IS+E +VE+WVT+ + E+ W +DP Sbjct: 769 
SDFVFLQYNLQLRQMVHKNKEHAYVDPISFENTGVVEDWVTEPEMYLENDENTDWKALDP 828 Query: 2183 PLGSAVHLGPQIDDVQALGAGFDDFDIF 2266 P ++ L +D+ + LG+GFDD++IF Sbjct: 829 PSYNSRLLELSVDEGEDLGSGFDDYEIF 856 Score = 99.8 bits (247), Expect = 5e-18 Identities = 52/104 (50%), Positives = 74/104 (71%), Gaps = 5/104 (4%) Frame = +2 Query: 56 ELVAVNSQKHDPAWKHCQMF-KVGD-RVQLK-CIYCGKIFKGGGIHRIKEHLAGQKGNAA 226 E VA++ K DP WKHCQ+F K+GD +V++K C+YCGK+F+GGGI R+K HLAG+KGN Sbjct: 8 EPVAISPHKQDPGWKHCQIFSKIGDPKVEVKKCLYCGKVFQGGGISRLKFHLAGRKGNGP 67 Query: 227 TCLRVQADVRMQMLESLN-GVAVRKRKKQKLAEEMS-GFSNPGN 352 C +V DVR+ ML++L+ V +++K +L +S FS GN Sbjct: 68 ICDQVPPDVRVSMLQNLDEKVGTSRQRKSQLGTNLSHSFSELGN 111