BLASTX nr result
ID: Mentha28_contig00005380
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha28_contig00005380 (2140 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU39705.1| hypothetical protein MIMGU_mgv1a004483mg [Mimulus... 484 e-134 ref|XP_002284460.1| PREDICTED: uncharacterized protein LOC100259... 333 2e-88 ref|XP_006472760.1| PREDICTED: transcriptional regulator ATRX ho... 332 4e-88 ref|XP_006351897.1| PREDICTED: lisH domain-containing protein C1... 330 2e-87 ref|XP_006434168.1| hypothetical protein CICLE_v10000938mg [Citr... 330 2e-87 ref|XP_003522580.1| PREDICTED: transcriptional regulator ATRX ho... 328 8e-87 ref|XP_002300995.2| hypothetical protein POPTR_0002s08550g [Popu... 323 3e-85 ref|XP_002516334.1| conserved hypothetical protein [Ricinus comm... 320 1e-84 ref|XP_006581582.1| PREDICTED: DNA ligase 1-like isoform X2 [Gly... 320 2e-84 ref|XP_003527934.1| PREDICTED: DNA ligase 1-like isoform X1 [Gly... 320 2e-84 ref|XP_007019032.1| Uncharacterized protein isoform 2 [Theobroma... 315 7e-83 ref|XP_007222349.1| hypothetical protein PRUPE_ppa004840mg [Prun... 311 8e-82 ref|XP_004169339.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 311 1e-81 ref|XP_004145363.1| PREDICTED: uncharacterized protein LOC101217... 311 1e-81 ref|XP_007019033.1| Uncharacterized protein isoform 3 [Theobroma... 310 2e-81 ref|XP_004500560.1| PREDICTED: transcriptional regulator ATRX ho... 308 8e-81 ref|XP_006578974.1| PREDICTED: transcriptional regulator ATRX ho... 301 8e-79 ref|XP_004290855.1| PREDICTED: uncharacterized protein LOC101302... 300 2e-78 ref|XP_007137404.1| hypothetical protein PHAVU_009G124200g [Phas... 297 1e-77 ref|XP_006434169.1| hypothetical protein CICLE_v10000938mg [Citr... 297 1e-77 >gb|EYU39705.1| hypothetical protein MIMGU_mgv1a004483mg [Mimulus guttatus] Length = 525 Score = 484 bits (1245), Expect = e-134 Identities = 271/441 (61%), Positives = 319/441 (72%), Gaps = 19/441 (4%) Frame = -1 Query: 2047 MAEGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 1868 MAE +GEK+ E QLE AV +RLQHFKDQADSLTLESVRRLLEKDLGLEK ALDAHKRFI Sbjct: 1 MAE-EGEKQGIEQQLEHAVCSRLQHFKDQADSLTLESVRRLLEKDLGLEKFALDAHKRFI 59 Query: 1867 RHYLEKIMDGADESNSSPATVNM-EGGVLLSKEEEKVIPKQEEANSESKKASTGNEETME 1691 RHYLEK M+ AD+ N E V LSKE+ ++PKQ E+N++ KK+STG+EE ME Sbjct: 60 RHYLEKKMEDADDCKPETEKENENEKDVHLSKEDATILPKQNESNNDLKKSSTGDEEMME 119 Query: 1690 DSPIMGVLTPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 1511 DSPIMGVLTPKSE+G Q LSES I+KAILERADH ANS+ ++L GVRRLLEEDLGLDK Sbjct: 120 DSPIMGVLTPKSEIGAQGPLSESRIEKAILERADHFLANSENLTLAGVRRLLEEDLGLDK 179 Query: 1510 NTLDAYKNFISRQVDLVLXXXXXXXXXXXKR---SEDVKSRKSRKVNSED-SDTSQSGSD 1343 N LD +K FIS+Q+D VL + SE +KS+K + V+SE+ S++ S SD Sbjct: 180 NDLDPFKKFISQQIDQVLNPPKATKSVKNVKKKTSESLKSKKVKTVSSEEGSESLPSESD 239 Query: 1342 EMSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK-----------QIEED 1196 EM DK K +KE+ RKN KK EQP+KR+ +D+D+S KKP K EED Sbjct: 240 EMEDKVKSKKESASRKNSKKLEQPKKRK----SDLDVSAKKPSKLQKRQKEEDNDSKEED 295 Query: 1195 NNSDEGGSISEDGQSQLS---LEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYK 1025 NNS E GS+SEDGQSQ S LEKPA RKEK P YGK+VENLKSIIKACGMS+PP IYK Sbjct: 296 NNSGEDGSLSEDGQSQSSVEKLEKPAQRKEKPVPAYGKKVENLKSIIKACGMSIPPVIYK 355 Query: 1024 KVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXX 845 K K VPD+KRE ++++ELEGIL REGLSKNP+EKEIKDC+K+KE ARELEGID Sbjct: 356 KAKQVPDNKREAVIIQELEGILLREGLSKNPSEKEIKDCKKRKETARELEGIDMSNIISS 415 Query: 844 XXXXXXXXFVAPERPVVRAKK 782 F AP +P RAKK Sbjct: 416 SRRRSTFSFGAPAKPEARAKK 436 >ref|XP_002284460.1| PREDICTED: uncharacterized protein LOC100259114 [Vitis vinifera] gi|302141832|emb|CBI19035.3| unnamed protein product [Vitis vinifera] Length = 502 Score = 333 bits (854), Expect = 2e-88 Identities = 199/429 (46%), Positives = 267/429 (62%), Gaps = 17/429 (3%) Frame = -1 Query: 2029 EKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEK 1850 E + E Q++ A+S+R+ HFK+QADSLT E VRRLLEKDLGLE ALD HKRF++ +L + Sbjct: 18 EAQEIESQIKAAMSSRVGHFKEQADSLTFEGVRRLLEKDLGLETYALDVHKRFVKQFLLE 77 Query: 1849 IMDGADESNSSPATVNMEG-GVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMG 1673 ++ A + N S + G V +K E P+ ++ + K+ S+G+EE +E SP++G Sbjct: 78 CINAAADDNPSKKSGETRGKNVCSTKGEAAEPPETVKSKKDVKEPSSGDEEKIEGSPVLG 137 Query: 1672 VLT----PKSEVGTQSSL------SESTIKKAILERADHLQANSDKISLGGVRRLLEEDL 1523 ++T KSE SESTI+KAI +RA + +A S+ I++ GVRR+LEEDL Sbjct: 138 LMTGQKIAKSETEETQGKENKEVPSESTIRKAIRKRASYFKAKSENITMAGVRRVLEEDL 197 Query: 1522 GLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK----SRKSRKVNSEDSDTS- 1358 LDK TLD YK FIS Q+D VL + K SR SRK +SE S S Sbjct: 198 KLDKKTLDPYKKFISEQLDEVLKSPQVSKPTTGVKKGSPKKNSHSRASRKTSSEGSSESL 257 Query: 1357 QSGSDEMSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNN-SDE 1181 +S SDE K K + + + + RKR +E R K + + EDN+ +++ Sbjct: 258 ESESDEEEVKPKTKMAPKGKTQNSEDLRKRKRPVTETKMPSKKRSKTAETVSEDNSDAED 317 Query: 1180 GGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDD 1001 G++S+DG SQ S EKP RKE SAP YGKRVENLKSIIK+C MSVPP++YK+VK P++ Sbjct: 318 SGNVSDDGHSQSSSEKPVKRKEVSAPAYGKRVENLKSIIKSCAMSVPPSVYKRVKQAPEN 377 Query: 1000 KRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXX 821 KRE L+KELE ILS+EGLSKNP+EK+IK+ RKKKERA+ELEGID Sbjct: 378 KREAHLIKELEEILSKEGLSKNPSEKDIKEVRKKKERAKELEGIDTSNIVLSSRRRSTRS 437 Query: 820 FVAPERPVV 794 FVAP +P + Sbjct: 438 FVAPPKPKI 446 >ref|XP_006472760.1| PREDICTED: transcriptional regulator ATRX homolog [Citrus sinensis] Length = 497 Score = 332 bits (851), Expect = 4e-88 Identities = 188/431 (43%), Positives = 267/431 (61%), Gaps = 15/431 (3%) Frame = -1 Query: 2020 AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 1841 + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE ALD HK+FI+ L + MD Sbjct: 19 SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78 Query: 1840 GADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLTP 1661 GA ++S + + S +EE+ P+ ++ + K+ N E MEDSP++G++T Sbjct: 79 GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138 Query: 1660 KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 1511 + G + SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK Sbjct: 139 NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198 Query: 1510 NTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSR---KSRKVNSEDSDTSQSGSDE 1340 TLD++K IS+++D VL ++ + +K K+++V+SE S S G + Sbjct: 199 FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258 Query: 1339 MSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 1166 D+ K RK+ + ++ E +KR+ E +KK K K EDNN E GS+S Sbjct: 259 EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318 Query: 1165 EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 986 +DG SQ S EKP +K S P YGKRVE+LK++IK+CGMS+PP++YKKVK P++KRE Sbjct: 319 DDGHSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCGMSIPPSVYKKVKQAPENKREAQ 378 Query: 985 LVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXXFVAPE 806 L+KELEGILSREGLS NP+EKEIK+ +KKKERARELEGID FV P Sbjct: 379 LIKELEGILSREGLSSNPSEKEIKEVKKKKERARELEGIDMSNIVSSSRRRSATSFVPPP 438 Query: 805 RPVVRAKKYKG 773 +P + + G Sbjct: 439 KPKIPDESESG 449 >ref|XP_006351897.1| PREDICTED: lisH domain-containing protein C1711.05-like [Solanum tuberosum] Length = 476 Score = 330 bits (846), Expect = 2e-87 Identities = 202/427 (47%), Positives = 280/427 (65%), Gaps = 13/427 (3%) Frame = -1 Query: 2041 EGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRH 1862 E + EK+ E+++E A+ +R+QHFK+ ADS TLE VRRL+E+DL LEK ALD HKR I+ Sbjct: 4 EVNEEKQGIEVKIEEALRSRIQHFKENADSFTLERVRRLIEEDLELEKYALDVHKRSIKL 63 Query: 1861 YLEKIMDGA-DESNSSPATVNMEGGVLLSKEEEKVI--PKQEEANSESKKASTGNEETME 1691 LEK+M+ A D+ + + N+E L+K+E++V+ PK++ + K+ + +E M+ Sbjct: 64 ILEKLMENAADDGDPKDSQENLEKDASLTKQEKEVLESPKKQVIKKDIKEPAF-DEAEMD 122 Query: 1690 DSPIMGVLTPKSE-VGTQS-SLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGL 1517 DSPIMGV++ KSE V QS SES+IKKAI ERA H + NS+ I+L GVRRLLEEDLGL Sbjct: 123 DSPIMGVMSSKSESVDAQSVKASESSIKKAIWERAAHFRDNSESITLAGVRRLLEEDLGL 182 Query: 1516 DKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSRKSRKVNSEDSDTSQSGSDEM 1337 +KNTLDA+K FI Q+D VL K+S + KS+ ++K + E+S++ S + Sbjct: 183 EKNTLDAFKKFIQIQIDEVLTPSEAPKSSSVKKSPEKKSKTAKK-SGENSNSFSSKRKHI 241 Query: 1336 SDKEKLRKEAGLRKNIKKFE--QPRKRRNSENADMDISRKKPKKQIEEDNNS-DEGGSIS 1166 ++K K RK + ++ ++K E + RK+ NSE+ +K+ K + ++N+ D S S Sbjct: 242 AEKVKSRKSSAAKETVEKSEGLKKRKKPNSEDNVPAKKQKEVSKNLSDENSDGDTDKSDS 301 Query: 1165 EDGQSQLSLEKPAPRKE-----KSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDD 1001 EDGQS S E + +K+ + GYGKRVE+LKSI KACGMSV P+IYK+ K V DD Sbjct: 302 EDGQSGSSAEIISAKKKVVKGASANTGYGKRVEHLKSIFKACGMSVAPSIYKRAKQVSDD 361 Query: 1000 KRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXX 821 KRE L+KELE ILS EGLS NPTEKEIK+ +K+K+ A+ELEGID Sbjct: 362 KREGFLIKELEKILSAEGLSTNPTEKEIKEVKKRKQTAKELEGIDLSNIVSNTRRRSTTS 421 Query: 820 FVAPERP 800 FVAP RP Sbjct: 422 FVAPPRP 428 >ref|XP_006434168.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] gi|557536290|gb|ESR47408.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] Length = 497 Score = 330 bits (846), Expect = 2e-87 Identities = 187/431 (43%), Positives = 267/431 (61%), Gaps = 15/431 (3%) Frame = -1 Query: 2020 AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 1841 + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE ALD HK+FI+ L + MD Sbjct: 19 SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78 Query: 1840 GADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLTP 1661 GA ++S + + S +EE+ P+ ++ + K+ N E MEDSP++G++T Sbjct: 79 GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138 Query: 1660 KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 1511 + G + SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK Sbjct: 139 NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198 Query: 1510 NTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSR---KSRKVNSEDSDTSQSGSDE 1340 TLD++K IS+++D VL ++ + +K K+++V+SE S S G + Sbjct: 199 FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258 Query: 1339 MSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 1166 D+ K RK+ + ++ E +KR+ E +KK K K EDNN E GS+S Sbjct: 259 EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318 Query: 1165 EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 986 +DG+SQ S EKP +K S P YGKRVE+LK++IK+C MS+PP++YKKVK P++KRE Sbjct: 319 DDGRSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCAMSIPPSVYKKVKQAPENKREAQ 378 Query: 985 LVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXXFVAPE 806 L+KELEGILSREGLS NP+EKEIK+ +KKKERARELEGID FV P Sbjct: 379 LIKELEGILSREGLSSNPSEKEIKEVKKKKERARELEGIDMSNIVSSSRRRSATSFVPPP 438 Query: 805 RPVVRAKKYKG 773 +P + + G Sbjct: 439 KPKIPDESESG 449 >ref|XP_003522580.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Glycine max] Length = 490 Score = 328 bits (840), Expect = 8e-87 Identities = 197/414 (47%), Positives = 269/414 (64%), Gaps = 21/414 (5%) Frame = -1 Query: 2044 AEGDGEKRAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 1868 +EG +K E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI Sbjct: 5 SEGTTKKEEILESQIETAMRSRVSHFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64 Query: 1867 RHYLEKIMDGADESNSSPATVNMEG--GVLLSKEEEKVIPKQEEANSESKKASTGNEETM 1694 + L K ++G + + P EG G + + EE PK+E + ++K +EE M Sbjct: 65 KQCLLKCLEGVGDDDG-PKISGKEGEKGSSIQESEE---PKEECESKDAKDLCPEDEEKM 120 Query: 1693 EDSPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVR 1544 EDSP++G+L + GT+ SE+ IKKA+ +R+ +++AN++KI++ G+R Sbjct: 121 EDSPVLGLLKEQKRAKLETKDDKGNGTKVVPSEALIKKAVRKRSSYIKANAEKITMAGLR 180 Query: 1543 RLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK---SRKSRKVNSE 1373 RLLEEDL LDK TLD YK F+S+Q+D VL + K ++ ++KV+SE Sbjct: 181 RLLEEDLKLDKFTLDPYKKFVSQQLDEVLTSSEVPEPAKNAKKIVKKKPDTKVTKKVSSE 240 Query: 1372 D-SDTSQSGSDEMSDKE---KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQI 1205 + SDTS +DE +E K RK+ + +K QP+KR+ E+ R KP K Sbjct: 241 ENSDTSDKETDEEESEEDEVKPRKKILPKGKVKTSVQPKKRKGEESDLSSKKRVKPAKAA 300 Query: 1204 EEDNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIY 1028 EDN+ +++ G SED QS S EKP+ +KE S P YGKRVE+LKS+IKACGMSVPP IY Sbjct: 301 SEDNSDAEDNGKNSEDDQSHSSPEKPSKKKEVSNPVYGKRVEHLKSVIKACGMSVPPVIY 360 Query: 1027 KKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGID 866 KKVK VP++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RA+ELEGID Sbjct: 361 KKVKQVPENKREGQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414 >ref|XP_002300995.2| hypothetical protein POPTR_0002s08550g [Populus trichocarpa] gi|550344567|gb|EEE80268.2| hypothetical protein POPTR_0002s08550g [Populus trichocarpa] Length = 476 Score = 323 bits (827), Expect = 3e-85 Identities = 197/435 (45%), Positives = 271/435 (62%), Gaps = 14/435 (3%) Frame = -1 Query: 2056 EELMAEGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHK 1877 E + E E E Q++ A+ +R+ HFK QADSLT E VRRLLEKDLGL+KLALD HK Sbjct: 11 ETVKKETPDESLDIESQVKEAMLSRVSHFKKQADSLTFEGVRRLLEKDLGLDKLALDVHK 70 Query: 1876 RFIRHYLEKIMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEET 1697 RF++ L + +DGA N+S + + + S +E P++ + + K+ + +EE Sbjct: 71 RFVKQCLFECLDGAVTDNASKDSGDTVEKHVDSPKEVTESPERRDLKNNIKEPCSEDEEK 130 Query: 1696 MEDSPIMGVL----TPKSEV-GTQSSL-----SESTIKKAILERADHLQANSDKISLGGV 1547 MEDSP+MG+L T KS+ TQ++ SE +IKKA++ RA +++ANS++I++ G+ Sbjct: 131 MEDSPVMGLLSGQKTTKSKAKDTQANEVKEVPSEGSIKKAMMRRASYIKANSEEITMAGL 190 Query: 1546 RRLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSRKSRKVNSEDS 1367 RRLLEEDL LDK +LD YK FIS+Q+D +V SR+S + Sbjct: 191 RRLLEEDLKLDKFSLDPYKKFISKQLD------------------EVSSRES-------A 225 Query: 1366 DTSQSGSDEMSDKEK-LRKEAGLRKNIKKFEQPRKRRNSENADMDISRK--KPKKQIEED 1196 D+S S+E ++ K +K+ G+ + ++ E +KRR +E + K KP + ED Sbjct: 226 DSSDKESEEEDEEVKPKKKKIGVERKMQNSEGSKKRRRTEKETKVSANKRIKPLETAAED 285 Query: 1195 NNSDE-GGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKV 1019 N+ E G+ SED S S EKP +KE S P YGKRVE+LKS+IK+CGMSVPP+IYKKV Sbjct: 286 NSDSEVSGNASEDNNSPSSAEKPVKKKEASTPAYGKRVEHLKSVIKSCGMSVPPSIYKKV 345 Query: 1018 KGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXX 839 K P++KRE L+KELE ILSREGLS NP+EKEIK+ RK+KERA+ELEGID Sbjct: 346 KQAPENKREARLIKELEEILSREGLSSNPSEKEIKEVRKRKERAKELEGIDLSNIVTTSR 405 Query: 838 XXXXXXFVAPERPVV 794 FVAP +P V Sbjct: 406 RRSATSFVAPPKPKV 420 >ref|XP_002516334.1| conserved hypothetical protein [Ricinus communis] gi|223544564|gb|EEF46081.1| conserved hypothetical protein [Ricinus communis] Length = 517 Score = 320 bits (821), Expect = 1e-84 Identities = 183/419 (43%), Positives = 263/419 (62%), Gaps = 12/419 (2%) Frame = -1 Query: 2014 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 1835 E Q++ A+ +R+ +F +Q++SLT E VRRLLEKDLGL++ ALD HKRF++ L + +DG Sbjct: 26 ESQIKDAMRSRVNYFNEQSNSLTFEGVRRLLEKDLGLQEYALDVHKRFVKQCLLQCLDGD 85 Query: 1834 DESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLT--- 1664 + S S T E G K E P+ E+ K+ + +EE E+SP+MG+LT Sbjct: 86 NASKDSGETD--EKGSRSIKGEATESPEGHESKDHIKEPCSEDEEKTEESPVMGLLTGKK 143 Query: 1663 -PKSEVG---TQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTLDA 1496 PKSE + + +ES IKKA+ +RA +++ANSDK+++ G+RRLLEEDL LDK+ LD Sbjct: 144 TPKSETDKTLVKEAPTESIIKKALSKRASYIKANSDKVTMAGLRRLLEEDLRLDKHALDP 203 Query: 1495 YKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSRKSRKVNSEDSDTSQSGSDEMSDKEKLR 1316 YK FIS Q+D VL + + + S+K+ +E+S S + D+++++ Sbjct: 204 YKKFISAQLDEVLQSSEVSEPKKKSVKTNSQGKASKKMRTEESSDSSGKEMDTEDEDEVK 263 Query: 1315 KEAGLRKNIKKFE----QPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSISEDGQS 1151 + + N K + RKR E R KP +++ ED++ +++ G+ SEDG+S Sbjct: 264 PKKKIAPNKKMINSEGSKKRKRFEKETKVTSKKRVKPTEKVAEDSSDAEDSGNASEDGRS 323 Query: 1150 QLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVKEL 971 Q S EKP +KE P YGKRVE+LKS+IK+CGMSVPP +YKKVK VP++KRE L+KEL Sbjct: 324 QSSAEKPVKKKEAPTPVYGKRVEHLKSVIKSCGMSVPPVVYKKVKQVPENKREAQLIKEL 383 Query: 970 EGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXXFVAPERPVV 794 E ILS+EGLS NP+EKEIK+ RK+KERA+ELEGID +V P +P + Sbjct: 384 EEILSKEGLSSNPSEKEIKEVRKRKERAKELEGIDMSNIVSSSRRRSATSYVPPPKPKI 442 >ref|XP_006581582.1| PREDICTED: DNA ligase 1-like isoform X2 [Glycine max] Length = 486 Score = 320 bits (820), Expect = 2e-84 Identities = 191/412 (46%), Positives = 264/412 (64%), Gaps = 19/412 (4%) Frame = -1 Query: 2044 AEGDGEKRAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 1868 +EG +K E Q+E A+ +R+ FK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI Sbjct: 5 SEGTAKKEEILESQIETAMRSRVSLFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64 Query: 1867 RHYLEKIMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMED 1688 + L K ++G + + A ++ + G + +E PK+E ++K +EE MED Sbjct: 65 KQCLLKCLEGVGDDDG--AKISGKEGEKGTSTQESEEPKEECEAKDAKDLCPEDEEKMED 122 Query: 1687 SPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRL 1538 SP++G+L + GT+ E+ IKKA+ +R+ +++AN++KI++ G+RRL Sbjct: 123 SPVLGLLKEQKRAKLETKDDKGNGTKVVPIEALIKKAVRKRSSYIKANAEKITMAGLRRL 182 Query: 1537 LEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK---SRKSRKVNSED- 1370 LEEDL LDK TLD YK F+S+Q+D VL + K ++ ++KV+SE+ Sbjct: 183 LEEDLKLDKFTLDPYKKFVSQQLDEVLASSEVPKPSNNAKKIVKKKPDTKVTKKVSSEEN 242 Query: 1369 SDTSQSGSDEMSDKE---KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEE 1199 SDTS +DE +E K RK+ + +K QP+KR+ E R KP K E Sbjct: 243 SDTSDKETDEEESEEDEVKPRKKIVPKGKVKTSVQPKKRKGEETDLSSKKRVKPAKATSE 302 Query: 1198 DNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKK 1022 DN+ +++ G SED QS S EKP+ +KE S P YGK VE+LKS+IKACGMSVPP IYKK Sbjct: 303 DNSDAEDDGKNSEDDQSSSSPEKPSKKKEVSTPVYGKHVEHLKSVIKACGMSVPPVIYKK 362 Query: 1021 VKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGID 866 VK VP++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RA+ELEGID Sbjct: 363 VKQVPENKREEQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414 >ref|XP_003527934.1| PREDICTED: DNA ligase 1-like isoform X1 [Glycine max] Length = 488 Score = 320 bits (820), Expect = 2e-84 Identities = 191/412 (46%), Positives = 264/412 (64%), Gaps = 19/412 (4%) Frame = -1 Query: 2044 AEGDGEKRAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 1868 +EG +K E Q+E A+ +R+ FK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI Sbjct: 5 SEGTAKKEEILESQIETAMRSRVSLFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64 Query: 1867 RHYLEKIMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMED 1688 + L K ++G + + A ++ + G + +E PK+E ++K +EE MED Sbjct: 65 KQCLLKCLEGVGDDDG--AKISGKEGEKGTSTQESEEPKEECEAKDAKDLCPEDEEKMED 122 Query: 1687 SPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRL 1538 SP++G+L + GT+ E+ IKKA+ +R+ +++AN++KI++ G+RRL Sbjct: 123 SPVLGLLKEQKRAKLETKDDKGNGTKVVPIEALIKKAVRKRSSYIKANAEKITMAGLRRL 182 Query: 1537 LEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK---SRKSRKVNSED- 1370 LEEDL LDK TLD YK F+S+Q+D VL + K ++ ++KV+SE+ Sbjct: 183 LEEDLKLDKFTLDPYKKFVSQQLDEVLASSEVPKPSNNAKKIVKKKPDTKVTKKVSSEEN 242 Query: 1369 SDTSQSGSDEMSDKE---KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEE 1199 SDTS +DE +E K RK+ + +K QP+KR+ E R KP K E Sbjct: 243 SDTSDKETDEEESEEDEVKPRKKIVPKGKVKTSVQPKKRKGEETDLSSKKRVKPAKATSE 302 Query: 1198 DNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKK 1022 DN+ +++ G SED QS S EKP+ +KE S P YGK VE+LKS+IKACGMSVPP IYKK Sbjct: 303 DNSDAEDDGKNSEDDQSSSSPEKPSKKKEVSTPVYGKHVEHLKSVIKACGMSVPPVIYKK 362 Query: 1021 VKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGID 866 VK VP++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RA+ELEGID Sbjct: 363 VKQVPENKREEQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414 >ref|XP_007019032.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508724360|gb|EOY16257.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 523 Score = 315 bits (806), Expect = 7e-83 Identities = 187/445 (42%), Positives = 266/445 (59%), Gaps = 38/445 (8%) Frame = -1 Query: 2014 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 1835 E ++ A+ +R+ HFK+QADSLT E VRRLLEKDLGLE ALD HKRF++ L K +DG Sbjct: 29 ESRITTAMRSRVGHFKEQADSLTFEGVRRLLEKDLGLETFALDVHKRFVKQCLLKCLDGG 88 Query: 1834 DESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLTPKS 1655 D+ ++ ++ L + E PK ++ + K+A + +EE +EDSP++G+LT Sbjct: 89 DDDDAPKSSGETGEKNLSTTTEVTESPKGRQSKKDVKEAFSEDEEKLEDSPVLGLLTGHK 148 Query: 1654 EVGTQS---------SLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTL 1502 T++ + ESTIKKAI +RA +++ANS+K+++ G+RRLLEEDL LDK+TL Sbjct: 149 TTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKDTL 208 Query: 1501 DAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK-------SRKSRKVNSEDSDTSQSGSD 1343 D YK FI+ Q+D VL + ++K S+K+ K S S S+S + Sbjct: 209 DPYKKFITEQLDEVLKSREVSAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSESDEE 268 Query: 1342 EMSDKE-------------------KLRKEAGLRKNIKKFEQPRKRR-NSENADMDISRK 1223 E ++E K +K+ + IK E +KR+ + A+M ++ Sbjct: 269 EGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMPSKKR 328 Query: 1222 KPKKQIEEDNNSD--EGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGM 1049 + D+NSD + GS+S+D +S+ S K RKE S P YGK VE+LKS+IK+CGM Sbjct: 329 SKHAESISDDNSDAEDSGSVSDDNRSRSSAAKAVKRKETSTPVYGKHVEHLKSVIKSCGM 388 Query: 1048 SVPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGI 869 SVPP IYK+VK VP++ RE L+KELE ILS+EGLS NP+EKEIK+ RK+KERA+ELEGI Sbjct: 389 SVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKELEGI 448 Query: 868 DXXXXXXXXXXXXXXXFVAPERPVV 794 D FVAP +P + Sbjct: 449 DTSNIVLSSRRRSTTSFVAPPKPKI 473 Score = 61.2 bits (147), Expect = 2e-06 Identities = 82/373 (21%), Positives = 152/373 (40%), Gaps = 59/373 (15%) Frame = -1 Query: 2074 HSQLPPEELMAEGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKL 1895 H E + E K FE +++A+ R + + ++ +T+ +RRLLE+DL L+K Sbjct: 147 HKTTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKD 206 Query: 1894 ALDAHKRFIRHYLEKIMDGADESNSSPATVNMEGGV---LLSKEEEKVIPKQEEANSESK 1724 LD +K+FI L++++ + S+PA+V + + SK +K K A+S S+ Sbjct: 207 TLDPYKKFITEQLDEVLKSREV--SAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSE 264 Query: 1723 KASTGNEETMEDSPIMGV----------LTPKSEVGTQSSL--SESTIKKAI-------- 1604 EE ++ V + PK ++ + + SE K+ I Sbjct: 265 SDEEEGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMP 324 Query: 1603 -LERADHLQA----NSDKISLGGV------RRLLEEDLGLDKNTLDAYKNFISRQVDLVL 1457 +R+ H ++ NSD G V R + + + + Y + ++ Sbjct: 325 SKKRSKHAESISDDNSDAEDSGSVSDDNRSRSSAAKAVKRKETSTPVYGKHVEHLKSVIK 384 Query: 1456 XXXXXXXXXXXKRSEDV--KSRKSRKVNSEDSDTSQSG-SDEMSDKE----KLRKE---- 1310 KR + V +R+++ + + S+ G S S+KE + RKE Sbjct: 385 SCGMSVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKE 444 Query: 1309 -----------AGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEED---NNSDEGGS 1172 + R++ F P K + + +D D S + ++D +N DE G Sbjct: 445 LEGIDTSNIVLSSRRRSTTSFVAPPKPKIPDASDDDESEESDDNDDDDDDDEDNDDEDGG 504 Query: 1171 ISEDGQSQLSLEK 1133 + QS+ S E+ Sbjct: 505 DEGNSQSEGSDEE 517 >ref|XP_007222349.1| hypothetical protein PRUPE_ppa004840mg [Prunus persica] gi|462419285|gb|EMJ23548.1| hypothetical protein PRUPE_ppa004840mg [Prunus persica] Length = 489 Score = 311 bits (797), Expect = 8e-82 Identities = 184/429 (42%), Positives = 274/429 (63%), Gaps = 17/429 (3%) Frame = -1 Query: 2029 EKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEK 1850 E + Q++ A+ +R+ +FK+Q+DSLT E VRRLLEKDLGLE ALD HKRF++ +L + Sbjct: 18 EAHDIQSQIKDAMRSRVPYFKEQSDSLTFEGVRRLLEKDLGLETFALDVHKRFVKEHLVE 77 Query: 1849 IMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGV 1670 ++GA + N+S ++ + ++ K E P+ ++N + K+ + +EE MEDSP+MG+ Sbjct: 78 CLEGAGDDNTSKSSGETDEKSII-KGEAAESPEGYKSNKDVKETYSEDEEKMEDSPVMGL 136 Query: 1669 L----TPKS------EVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLG 1520 L T KS ++ + SE+ IK A+ +R +++ANS+KI++ G+RRLLEEDL Sbjct: 137 LAGNKTAKSGTEETKSTKSKKAPSETVIKSALRKRVSYIKANSEKITMAGLRRLLEEDLK 196 Query: 1519 LDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRS--EDVKSRKSRKVNSEDSD-TSQSG 1349 L+K TLD K FI+ +D VL K++ + V+ + S KV S++S +S + Sbjct: 197 LEKYTLDPCKKFINEHLDKVLESCEISEPAPVKKNVKKSVQRKASTKVRSDESSGSSDNE 256 Query: 1348 SDEMSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK----QIEEDNNSDE 1181 SDE D+ K R ++ + ++ +KR+ N + +IS KK K + E+ ++++ Sbjct: 257 SDEEEDEVKPRNKSVPKGKMQNSNDLKKRKRMAN-ETNISGKKRIKPSETEPEDKSDAEV 315 Query: 1180 GGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDD 1001 G++SED +SQ S EKP +KE S P YGKRVE+L+S+IKACGMSV P++YKKVK VP+ Sbjct: 316 SGNVSEDDRSQSSAEKPVKKKEVSTPAYGKRVEHLRSVIKACGMSVAPSVYKKVKQVPES 375 Query: 1000 KRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXX 821 KRE L+KELE ILS+EGLS +PTEKEIK+ +KKKERA+ELEGID Sbjct: 376 KREAHLIKELEEILSKEGLSAHPTEKEIKEVKKKKERAKELEGIDMSNIVTSSRRRSTTS 435 Query: 820 FVAPERPVV 794 FV P +P + Sbjct: 436 FVPPPKPKI 444 Score = 62.0 bits (149), Expect = 1e-06 Identities = 57/267 (21%), Positives = 110/267 (41%), Gaps = 9/267 (3%) Frame = -1 Query: 1666 TPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTLDAYKN 1487 +P +V ++ +S IK A+ R + + SD ++ GVRRLLE+DLGL+ LD +K Sbjct: 10 SPVKQVKQEAHDIQSQIKDAMRSRVPYFKEQSDSLTFEGVRRLLEKDLGLETFALDVHKR 69 Query: 1486 FISRQV---------DLVLXXXXXXXXXXXKRSEDVKSRKSRKVNSEDSDTSQSGSDEMS 1334 F+ + D + E +S + K N + +T ++M Sbjct: 70 FVKEHLVECLEGAGDDNTSKSSGETDEKSIIKGEAAESPEGYKSNKDVKETYSEDEEKME 129 Query: 1333 DKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNNSDEGGSISEDGQ 1154 D + AG + E+ + ++ + + + +K++ + E I+ G Sbjct: 130 DSPVMGLLAGNKTAKSGTEETKSTKSKKAPSETVIKSALRKRVSYIKANSE--KITMAGL 187 Query: 1153 SQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVKE 974 +L E K P E+L ++++C +S P + K VK K T + + Sbjct: 188 RRLLEEDLKLEKYTLDPCKKFINEHLDKVLESCEISEPAPVKKNVKKSVQRKASTKVRSD 247 Query: 973 LEGILSREGLSKNPTEKEIKDCRKKKE 893 G S N +++E + + + + Sbjct: 248 -----ESSGSSDNESDEEEDEVKPRNK 269 >ref|XP_004169339.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101229552 [Cucumis sativus] Length = 488 Score = 311 bits (796), Expect = 1e-81 Identities = 188/416 (45%), Positives = 256/416 (61%), Gaps = 17/416 (4%) Frame = -1 Query: 2062 PPEELMAEGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDA 1883 P EE M G E ++ A+ +R+ HFK+QADSLT E VRRLLEKDL +E LD Sbjct: 11 PKEEPMDVAVG----IETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCMETYTLDV 66 Query: 1882 HKRFIRHYLEKIMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNE 1703 HKR+++ L K ++ E N S + + G ++KEE P+ ++ +K+ +E Sbjct: 67 HKRYVKQCLVKCLEADLEDNVSKDS-ELTGRKSVNKEEAPESPEGHQSKKGAKEPCLEDE 125 Query: 1702 ETMEDSPIMGVLTPKSEVGTQSS-------------LSESTIKKAILERADHLQANSDKI 1562 E MEDSP+MG+LT +S +S SESTI KAI +R +L+ANS+K+ Sbjct: 126 EKMEDSPVMGLLTGRSTKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKANSEKV 185 Query: 1561 SLGGVRRLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSRKSRKV 1382 ++ GVRRLLE+DL L KN LD+ K FIS+QV+ +L +E V + KS K Sbjct: 186 TMAGVRRLLEDDLKLTKNVLDSCKKFISQQVEEILTSCEA--------AEQVSNLKSPKK 237 Query: 1381 NSEDSDTSQSGS--DEMSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQ 1208 S++S S GS +E +D+ K + I + +KR+ S + ++ Q Sbjct: 238 ISKESSYSTEGSSSEEENDEVNPGKTNATKGRIPDANETKKRKRSTKKTVSAQKQSKHVQ 297 Query: 1207 IEEDNNSDEGG-SISEDGQSQLSLEKPAPRK-EKSAPGYGKRVENLKSIIKACGMSVPPN 1034 D +SDEGG ++SEDG+S S EKP ++ S P YGKRVE+LKS+IK+CGMSVPP+ Sbjct: 298 DTSDEDSDEGGGNVSEDGRSGSSNEKPVKKEVSSSTPVYGKRVEHLKSVIKSCGMSVPPS 357 Query: 1033 IYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGID 866 IYKKVK P+ KRE+ L+KELEGILSREGLS N TEKEIK+ +KKKERA+ELEGID Sbjct: 358 IYKKVKQAPESKRESQLIKELEGILSREGLSANSTEKEIKEVKKKKERAKELEGID 413 >ref|XP_004145363.1| PREDICTED: uncharacterized protein LOC101217045 [Cucumis sativus] Length = 488 Score = 311 bits (796), Expect = 1e-81 Identities = 188/416 (45%), Positives = 256/416 (61%), Gaps = 17/416 (4%) Frame = -1 Query: 2062 PPEELMAEGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDA 1883 P EE M G E ++ A+ +R+ HFK+QADSLT E VRRLLEKDL +E LD Sbjct: 11 PKEEPMDVAVG----IETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCMETYTLDV 66 Query: 1882 HKRFIRHYLEKIMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNE 1703 HKR+++ L K ++ E N S + + G ++KEE P+ ++ +K+ +E Sbjct: 67 HKRYVKQCLVKCLEADLEDNVSKDS-ELTGRKSVNKEEAPESPEGHQSKKGAKEPCLEDE 125 Query: 1702 ETMEDSPIMGVLTPKSEVGTQSS-------------LSESTIKKAILERADHLQANSDKI 1562 E MEDSP+MG+LT +S +S SESTI KAI +R +L+ANS+K+ Sbjct: 126 EKMEDSPVMGLLTGRSTKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKANSEKV 185 Query: 1561 SLGGVRRLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSRKSRKV 1382 ++ GVRRLLE+DL L KN LD+ K FIS+QV+ +L +E V + KS K Sbjct: 186 TMAGVRRLLEDDLKLTKNVLDSCKKFISQQVEEILTSCEA--------AEQVSNLKSPKK 237 Query: 1381 NSEDSDTSQSGS--DEMSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQ 1208 S++S S GS +E +D+ K + I + +KR+ S + ++ Q Sbjct: 238 ISKESSYSTEGSSSEEENDEVNPGKTNATKGRIPDSNETKKRKRSTKKTVSAQKQSKHVQ 297 Query: 1207 IEEDNNSDEGG-SISEDGQSQLSLEKPAPRK-EKSAPGYGKRVENLKSIIKACGMSVPPN 1034 D +SDEGG ++SEDG+S S EKP ++ S P YGKRVE+LKS+IK+CGMSVPP+ Sbjct: 298 DTSDEDSDEGGGNVSEDGRSGSSNEKPVKKEVSSSTPVYGKRVEHLKSVIKSCGMSVPPS 357 Query: 1033 IYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGID 866 IYKKVK P+ KRE+ L+KELEGILSREGLS N TEKEIK+ +KKKERA+ELEGID Sbjct: 358 IYKKVKQAPESKRESQLIKELEGILSREGLSANSTEKEIKEVKKKKERAKELEGID 413 >ref|XP_007019033.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508724361|gb|EOY16258.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 521 Score = 310 bits (794), Expect = 2e-81 Identities = 187/445 (42%), Positives = 266/445 (59%), Gaps = 38/445 (8%) Frame = -1 Query: 2014 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 1835 E ++ A+ +R+ HFK+QADSLT E VRRLLEKDLGLE ALD HKRF++ L K +DG Sbjct: 29 ESRITTAMRSRVGHFKEQADSLTFEGVRRLLEKDLGLETFALDVHKRFVKQCLLKCLDGG 88 Query: 1834 DESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLTPKS 1655 D+ ++ ++ L + E PK ++ + K+A + +EE +EDSP++G+LT Sbjct: 89 DDDDAPKSSGETGEKNLSTTTEVTESPKGRQSKKDVKEAFSEDEEKLEDSPVLGLLTGHK 148 Query: 1654 EVGTQS---------SLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTL 1502 T++ + ESTIKKAI +RA +++ANS+K+++ G+RRLLEEDL LDK+TL Sbjct: 149 TTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKDTL 208 Query: 1501 DAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK-------SRKSRKVNSEDSDTSQSGSD 1343 D YK FI+ Q+D VL + ++K S+K+ K S S S+S + Sbjct: 209 DPYKKFITEQLDEVLKSREVSAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSESDEE 268 Query: 1342 EMSDKE-------------------KLRKEAGLRKNIKKFEQPRKRR-NSENADMDISRK 1223 E ++E K +K+ + IK E +KR+ + A+M ++ Sbjct: 269 EGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMPSKKR 328 Query: 1222 KPKKQIEEDNNSD--EGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGM 1049 + D+NSD + GS+S+D +S+ S K RKE S P YGK VE+LKS+IK+CGM Sbjct: 329 SKHAESISDDNSDAEDSGSVSDDNRSRSSAAKA--RKETSTPVYGKHVEHLKSVIKSCGM 386 Query: 1048 SVPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGI 869 SVPP IYK+VK VP++ RE L+KELE ILS+EGLS NP+EKEIK+ RK+KERA+ELEGI Sbjct: 387 SVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKELEGI 446 Query: 868 DXXXXXXXXXXXXXXXFVAPERPVV 794 D FVAP +P + Sbjct: 447 DTSNIVLSSRRRSTTSFVAPPKPKI 471 >ref|XP_004500560.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Cicer arietinum] gi|502130188|ref|XP_004500561.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Cicer arietinum] Length = 497 Score = 308 bits (788), Expect = 8e-81 Identities = 189/429 (44%), Positives = 266/429 (62%), Gaps = 19/429 (4%) Frame = -1 Query: 2014 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 1835 E Q++ A+ +R+ HFK QADSLT E VRRLLEKDLG E+ +LD+HKRFI+ LEK ++ Sbjct: 16 ESQIQTAMLSRVPHFKQQADSLTFEGVRRLLEKDLGFEEYSLDSHKRFIKQCLEKCLEEV 75 Query: 1834 DESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLTPKS 1655 + ++S + E + ++V K+EE S+ +K T +EE MEDSP++G+L + Sbjct: 76 GDDDASKMSGEEEEK---GESTQEVEGKKEEHQSKDEKDLTEDEEKMEDSPVLGLLKEQK 132 Query: 1654 EVGTQSSLSEST----------IKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 1505 V ++ +E IKKAI++R+ +L+AN+D++++ G+RRLLEEDL LDK + Sbjct: 133 RVKNETKKAEGNGKKVVPNEALIKKAIIKRSSYLKANADEVTVAGLRRLLEEDLKLDKFS 192 Query: 1504 LDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK---SRKSRKVNSED-SDTSQSGSDEM 1337 LD +K FI +Q+D VL + K S+ ++KV++E+ SDTS S+E Sbjct: 193 LDPFKKFIRQQLDEVLMSSEVLEPAKSAKKIVKKKPDSKVTKKVSTEENSDTSDKVSEEE 252 Query: 1336 SDKEKLRKEAGLRKNIKKFEQ---PRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSI 1169 +E K +K++ K + P+KR+ E R KP K+ EDN+ +++GG Sbjct: 253 ESQEDEVKPK--KKSVPKGKASVGPKKRKGEEIKSPSKKRAKPDKEASEDNSDAEDGGKN 310 Query: 1168 SEDGQSQLSLEKPAPRKEKSAPG-YGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRE 992 SED QS S E +K+ S P Y KRVE+LKS+IKACGMSVPP IYKKVK VP++KRE Sbjct: 311 SEDDQSHSSAENTTQKKQVSTPVVYSKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKRE 370 Query: 991 TILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXXXXXXXXXXXXXXFVA 812 L+KELE ILSREGLS NP+EKEIK+ ++KKERA+ELEGID F A Sbjct: 371 GQLIKELEEILSREGLSSNPSEKEIKEVKRKKERAKELEGIDMSNIVSSTRRRATTSFAA 430 Query: 811 PERPVVRAK 785 P P + K Sbjct: 431 PPPPKPKPK 439 >ref|XP_006578974.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Glycine max] Length = 408 Score = 301 bits (771), Expect = 8e-79 Identities = 184/395 (46%), Positives = 252/395 (63%), Gaps = 21/395 (5%) Frame = -1 Query: 2044 AEGDGEKRAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 1868 +EG +K E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI Sbjct: 5 SEGTTKKEEILESQIETAMRSRVSHFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64 Query: 1867 RHYLEKIMDGADESNSSPATVNMEG--GVLLSKEEEKVIPKQEEANSESKKASTGNEETM 1694 + L K ++G + + P EG G + + EE PK+E + ++K +EE M Sbjct: 65 KQCLLKCLEGVGDDDG-PKISGKEGEKGSSIQESEE---PKEECESKDAKDLCPEDEEKM 120 Query: 1693 EDSPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVR 1544 EDSP++G+L + GT+ SE+ IKKA+ +R+ +++AN++KI++ G+R Sbjct: 121 EDSPVLGLLKEQKRAKLETKDDKGNGTKVVPSEALIKKAVRKRSSYIKANAEKITMAGLR 180 Query: 1543 RLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK---SRKSRKVNSE 1373 RLLEEDL LDK TLD YK F+S+Q+D VL + K ++ ++KV+SE Sbjct: 181 RLLEEDLKLDKFTLDPYKKFVSQQLDEVLTSSEVPEPAKNAKKIVKKKPDTKVTKKVSSE 240 Query: 1372 D-SDTSQSGSDEMSDKE---KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQI 1205 + SDTS +DE +E K RK+ + +K QP+KR+ E+ R KP K Sbjct: 241 ENSDTSDKETDEEESEEDEVKPRKKILPKGKVKTSVQPKKRKGEESDLSSKKRVKPAKAA 300 Query: 1204 EEDNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIY 1028 EDN+ +++ G SED QS S EKP+ +KE S P YGKRVE+LKS+IKACGMSVPP IY Sbjct: 301 SEDNSDAEDNGKNSEDDQSHSSPEKPSKKKEVSNPVYGKRVEHLKSVIKACGMSVPPVIY 360 Query: 1027 KKVKGVPDDKRETILVKELEGILSREGLSKNPTEK 923 KKVK VP++KRE L+KELE ILSREGLS NP+EK Sbjct: 361 KKVKQVPENKREGQLIKELEEILSREGLSSNPSEK 395 >ref|XP_004290855.1| PREDICTED: uncharacterized protein LOC101302129 [Fragaria vesca subsp. vesca] Length = 490 Score = 300 bits (768), Expect = 2e-78 Identities = 180/443 (40%), Positives = 273/443 (61%), Gaps = 17/443 (3%) Frame = -1 Query: 2071 SQLPPEELMAEGDGEKRAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLA 1892 S+ P ++ GD E + E A+ AR+ HFK+Q+DSLT +VRR+LEKDLGLE A Sbjct: 8 SEAPMKKEEETGDMESKILE-----AMKARVPHFKEQSDSLTFVNVRRVLEKDLGLEPSA 62 Query: 1891 LDAHKRFIRHYLEKIMDGADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKAST 1712 LDAHK F++ +L K ++GA E N+S ++ + L+ K E + ++N + K+ S+ Sbjct: 63 LDAHKGFVKEHLLKCLEGAGEDNNSKSSGQTDEKSLI-KGEATGSTEGHQSNKDMKETSS 121 Query: 1711 GNEETMEDSPIMGVLTPKSEV-----GTQSSLS-----ESTIKKAILERADHLQANSDKI 1562 +EE +EDSP +LT G++SS + E+ IK A+ +R +++AN +K+ Sbjct: 122 ADEEKVEDSPASELLTEHKTAKVKAEGSKSSNNKKAPTEAMIKSALGKRGSYIKANIEKL 181 Query: 1561 SLGGVRRLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSRKS--- 1391 ++G +RR+LE+DL LD +LD +K FI++Q+D VL + K ++ Sbjct: 182 TMGELRRVLEKDLKLDTYSLDPFKKFINQQLDEVLESCVDPEPVKNVKKNVKKPQRKPTP 241 Query: 1390 RKVNSEDSDTSQSGSDEMSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK 1211 +++ E S + SG+DE D+ K RK++ + ++ + +KR++ + +IS KK K Sbjct: 242 EEISEESSGPANSGTDEEEDEVKPRKKSVTKGKMQNSDGLKKRKSLAK-ETNISGKKRIK 300 Query: 1210 QI----EEDNNSDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSV 1043 + EE +++ + ++SED S+ S EKP +KE S P YGKRVE+L+S+IKACGMSV Sbjct: 301 SLKADSEEKSDAKDSENVSEDEDSKSSAEKPVKKKEVSTPAYGKRVEHLRSVIKACGMSV 360 Query: 1042 PPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDX 863 PP+IYKKVK VP++KRE L+KELE IL REGLS +PTEKEIK+ +KKKE+A+ELEGID Sbjct: 361 PPSIYKKVKQVPENKREAQLIKELEDILGREGLSSSPTEKEIKEVKKKKEKAKELEGIDM 420 Query: 862 XXXXXXXXXXXXXXFVAPERPVV 794 FV P +P + Sbjct: 421 SNIVTSSRRRSTTSFVPPPKPKI 443 >ref|XP_007137404.1| hypothetical protein PHAVU_009G124200g [Phaseolus vulgaris] gi|561010491|gb|ESW09398.1| hypothetical protein PHAVU_009G124200g [Phaseolus vulgaris] Length = 493 Score = 297 bits (761), Expect = 1e-77 Identities = 186/439 (42%), Positives = 258/439 (58%), Gaps = 23/439 (5%) Frame = -1 Query: 2047 MAEGDGEKRA---FELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHK 1877 MAE E + E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HK Sbjct: 1 MAEDSEEMKKGENIESQIETAMLSRVSHFKEQSDSLTFEGVRRLLEKDLGLEECALDVHK 60 Query: 1876 RFIRHYLEKIMDGA--DESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNE 1703 RFI+ L + ++G D EG L +E PK++ + K +E Sbjct: 61 RFIKQCLLECLEGVGDDAGPRISEKAGEEGAGTLEPDE----PKEKCELKDEKDLCPEDE 116 Query: 1702 ETMEDSPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLG 1553 E MEDSP++G+L + G + SE+ + KA+ +R+ +++AN++ I++ Sbjct: 117 EKMEDSPVLGLLKEQKRAKLETKDDKGNGNKVVPSEALVMKAVKKRSSYIKANAETITMA 176 Query: 1552 GVRRLLEEDLGLDKNTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVK---SRKSRKV 1382 G+RRLLE+DL LDK TLD YK FIS+Q+D VL + K ++ ++KV Sbjct: 177 GLRRLLEDDLKLDKFTLDLYKKFISQQLDEVLASSVVSEPAKNAKKIVKKKPDTKVTKKV 236 Query: 1381 NSED-SDTSQSGSDE---MSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK 1214 +SE+ SDTS DE D+ K K+ + + Q +KR+ E R KP Sbjct: 237 SSEENSDTSDKEIDEDESQEDEVKPMKKVVPKGKAQTPVQSKKRKGEETDLSSKKRMKPA 296 Query: 1213 KQIEED-NNSDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPP 1037 K E+ +++++ G SED QS S EKP+ +KE S P YGKRVE LKS+IKACGM VPP Sbjct: 297 KAASEEISDAEDSGKNSEDDQSHSSSEKPSKKKEVSTPVYGKRVETLKSVIKACGMGVPP 356 Query: 1036 NIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERARELEGIDXXX 857 +IYKK+K V ++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RA+ELEGID Sbjct: 357 SIYKKIKQVSENKREGQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGIDVSN 416 Query: 856 XXXXXXXXXXXXFVAPERP 800 ++AP P Sbjct: 417 IVSSSRRRSTSSYIAPPPP 435 >ref|XP_006434169.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] gi|557536291|gb|ESR47409.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] Length = 451 Score = 297 bits (761), Expect = 1e-77 Identities = 166/381 (43%), Positives = 241/381 (63%), Gaps = 15/381 (3%) Frame = -1 Query: 2020 AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 1841 + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE ALD HK+FI+ L + MD Sbjct: 19 SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78 Query: 1840 GADESNSSPATVNMEGGVLLSKEEEKVIPKQEEANSESKKASTGNEETMEDSPIMGVLTP 1661 GA ++S + + S +EE+ P+ ++ + K+ N E MEDSP++G++T Sbjct: 79 GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138 Query: 1660 KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 1511 + G + SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK Sbjct: 139 NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198 Query: 1510 NTLDAYKNFISRQVDLVLXXXXXXXXXXXKRSEDVKSR---KSRKVNSEDSDTSQSGSDE 1340 TLD++K IS+++D VL ++ + +K K+++V+SE S S G + Sbjct: 199 FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258 Query: 1339 MSDKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 1166 D+ K RK+ + ++ E +KR+ E +KK K K EDNN E GS+S Sbjct: 259 EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318 Query: 1165 EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 986 +DG+SQ S EKP +K S P YGKRVE+LK++IK+C MS+PP++YKKVK P++KRE Sbjct: 319 DDGRSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCAMSIPPSVYKKVKQAPENKREAQ 378 Query: 985 LVKELEGILSREGLSKNPTEK 923 L+KELEGILSREGLS NP+EK Sbjct: 379 LIKELEGILSREGLSSNPSEK 399