BLASTX nr result
ID: Mentha29_contig00009141
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009141
         (2069 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU39705.1| hypothetical protein MIMGU_mgv1a004483mg [Mimulus...   438   e-120
ref|XP_006472760.1| PREDICTED: transcriptional regulator ATRX ho...   308   8e-81
ref|XP_006434168.1| hypothetical protein CICLE_v10000938mg [Citr...   306   3e-80
ref|XP_002284460.1| PREDICTED: uncharacterized protein LOC100259...   306   3e-80
ref|XP_002300995.2| hypothetical protein POPTR_0002s08550g [Popu...   302   3e-79
ref|XP_006351897.1| PREDICTED: lisH domain-containing protein C1...   300   1e-78
ref|XP_003522580.1| PREDICTED: transcriptional regulator ATRX ho...   298   8e-78
ref|XP_002516334.1| conserved hypothetical protein [Ricinus comm...   296   2e-77
ref|XP_007019032.1| Uncharacterized protein isoform 2 [Theobroma...   293   2e-76
ref|XP_006581582.1| PREDICTED: DNA ligase 1-like isoform X2 [Gly...   290   1e-75
ref|XP_003527934.1| PREDICTED: DNA ligase 1-like isoform X1 [Gly...   290   1e-75
ref|XP_007019033.1| Uncharacterized protein isoform 3 [Theobroma...   288   5e-75
ref|XP_007222349.1| hypothetical protein PRUPE_ppa004840mg [Prun...   283   2e-73
ref|XP_004500560.1| PREDICTED: transcriptional regulator ATRX ho...   283   3e-73
ref|XP_004169339.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   281   8e-73
ref|XP_004145363.1| PREDICTED: uncharacterized protein LOC101217...   281   8e-73
ref|XP_004290855.1| PREDICTED: uncharacterized protein LOC101302...   275   6e-71
ref|XP_006434169.1| hypothetical protein CICLE_v10000938mg [Citr...
274 1e-70 ref|XP_007137404.1| hypothetical protein PHAVU_009G124200g [Phas... 273 3e-70 ref|XP_006578974.1| PREDICTED: transcriptional regulator ATRX ho... 270 2e-69 >gb|EYU39705.1| hypothetical protein MIMGU_mgv1a004483mg [Mimulus guttatus] Length = 525 Score = 438 bits (1127), Expect = e-120 Identities = 252/441 (57%), Positives = 294/441 (66%), Gaps = 19/441 (4%) Frame = +2 Query: 38 MAEGDGEKGAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 217 MAE +GEK E QLE AV +RLQHFKDQADSLTLESVRRLLEKDLGLEK ALDAHKRFI Sbjct: 1 MAE-EGEKQGIEQQLEHAVCSRLQHFKDQADSLTLESVRRLLEKDLGLEKFALDAHKRFI 59 Query: 218 RHYLEKIMDGADESNSSPATVNM-EGGVLLSXXXXXXXXXXXXANSESKKASTGNKETME 394 RHYLEK M+ AD+ N E V LS +N++ KK+STG++E ME Sbjct: 60 RHYLEKKMEDADDCKPETEKENENEKDVHLSKEDATILPKQNESNNDLKKSSTGDEEMME 119 Query: 395 DSPIMGVLTPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574 DSPIMGVLTPKSE+G Q LSES I+KAILERADH ANS+ ++L GVRRLLEEDLGLDK Sbjct: 120 DSPIMGVLTPKSEIGAQGPLSESRIEKAILERADHFLANSENLTLAGVRRLLEEDLGLDK 179 Query: 575 NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKV----NXXXXXXXXXXXX 742 N LD +K IS+Q+D VL + + +S KSKKV + Sbjct: 180 NDLDPFKKFISQQIDQVLNPPKATKSVKNVKKKTSESLKSKKVKTVSSEEGSESLPSESD 239 Query: 743 XXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK-----------QIEED 889 K K +KE+ RKN KK EQP+KR++ D+D+S KKP K EED Sbjct: 240 EMEDKVKSKKESASRKNSKKLEQPKKRKS----DLDVSAKKPSKLQKRQKEEDNDSKEED 295 Query: 890 NNSDEGGSISEDGQSQLS---LEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYK 1060 NNS E GS+SEDGQSQ S LEKPA RKEK P YGK+VENLKSIIKACGMS+PP IYK Sbjct: 296 NNSGEDGSLSEDGQSQSSVEKLEKPAQRKEKPVPAYGKKVENLKSIIKACGMSIPPVIYK 355 Query: 1061 KVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXX 1240 K K VPD+KRE ++++ELEGIL REGLSKNP+EKEIKDC+K+KE A+ELEGID Sbjct: 356 KAKQVPDNKREAVIIQELEGILLREGLSKNPSEKEIKDCKKRKETARELEGIDMSNIISS 415 Query: 1241 XXXXXXXXXVAPERPVVRAKK 1303 AP +P RAKK Sbjct: 416 SRRRSTFSFGAPAKPEARAKK 436 >ref|XP_006472760.1| PREDICTED: transcriptional regulator ATRX homolog [Citrus sinensis] Length = 497 Score = 308 bits (788), 
Expect = 8e-81 Identities = 178/431 (41%), Positives = 252/431 (58%), Gaps = 15/431 (3%) Frame = +2 Query: 65 AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 244 + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE ALD HK+FI+ L + MD Sbjct: 19 SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78 Query: 245 GADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTP 424 GA ++S + + S + + K+ N E MEDSP++G++T Sbjct: 79 GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138 Query: 425 KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574 + G + SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK Sbjct: 139 NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198 Query: 575 NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSR---KSKKVNXXXXXXXXXXXXX 745 TLD++K +IS+++D VL + + +K K+K+V+ Sbjct: 199 FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258 Query: 746 XXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 919 + K RK+ + ++ E +KR+ E +KK K K EDNN E GS+S Sbjct: 259 EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318 Query: 920 EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099 +DG SQ S EKP +K S P YGKRVE+LK++IK+CGMS+PP++YKKVK P++KRE Sbjct: 319 DDGHSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCGMSIPPSVYKKVKQAPENKREAQ 378 Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPE 1279 L+KELEGILSREGLS NP+EKEIK+ +KKKERA+ELEGID V P Sbjct: 379 LIKELEGILSREGLSSNPSEKEIKEVKKKKERARELEGIDMSNIVSSSRRRSATSFVPPP 438 Query: 1280 RPVVRAKKYKG 1312 +P + + G Sbjct: 439 KPKIPDESESG 449 >ref|XP_006434168.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] gi|557536290|gb|ESR47408.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] Length = 497 Score = 306 bits (783), Expect = 3e-80 Identities = 177/431 (41%), Positives = 252/431 (58%), Gaps = 15/431 (3%) Frame = +2 Query: 65 AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 244 + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE ALD HK+FI+ L + MD Sbjct: 19 
SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78 Query: 245 GADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTP 424 GA ++S + + S + + K+ N E MEDSP++G++T Sbjct: 79 GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138 Query: 425 KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574 + G + SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK Sbjct: 139 NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198 Query: 575 NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSR---KSKKVNXXXXXXXXXXXXX 745 TLD++K +IS+++D VL + + +K K+K+V+ Sbjct: 199 FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258 Query: 746 XXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 919 + K RK+ + ++ E +KR+ E +KK K K EDNN E GS+S Sbjct: 259 EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318 Query: 920 EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099 +DG+SQ S EKP +K S P YGKRVE+LK++IK+C MS+PP++YKKVK P++KRE Sbjct: 319 DDGRSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCAMSIPPSVYKKVKQAPENKREAQ 378 Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPE 1279 L+KELEGILSREGLS NP+EKEIK+ +KKKERA+ELEGID V P Sbjct: 379 LIKELEGILSREGLSSNPSEKEIKEVKKKKERARELEGIDMSNIVSSSRRRSATSFVPPP 438 Query: 1280 RPVVRAKKYKG 1312 +P + + G Sbjct: 439 KPKIPDESESG 449 >ref|XP_002284460.1| PREDICTED: uncharacterized protein LOC100259114 [Vitis vinifera] gi|302141832|emb|CBI19035.3| unnamed protein product [Vitis vinifera] Length = 502 Score = 306 bits (783), Expect = 3e-80 Identities = 189/444 (42%), Positives = 260/444 (58%), Gaps = 19/444 (4%) Frame = +2 Query: 17 QLPPAELMAEGDGEKGA-FELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLA 193 ++ +E + +G E+ E Q++ A+S+R+ HFK+QADSLT E VRRLLEKDLGLE A Sbjct: 4 EMQDSEPITKGTEEEAQEIESQIKAAMSSRVGHFKEQADSLTFEGVRRLLEKDLGLETYA 63 Query: 194 LDAHKRFIRHYLEKIMDGADESNSSPATVNMEG-GVLLSXXXXXXXXXXXXANSESKKAS 370 LD HKRF++ +L + ++ A + N S + G V + + + K+ S Sbjct: 64 LDVHKRFVKQFLLECINAAADDNPSKKSGETRGKNVCSTKGEAAEPPETVKSKKDVKEPS 123 Query: 
371 TGNKETMEDSPIMGVLT----PKSEVGTQSSL------SESTIKKAILERADHLQANSDK 520 +G++E +E SP++G++T KSE SESTI+KAI +RA + +A S+ Sbjct: 124 SGDEEKIEGSPVLGLMTGQKIAKSETEETQGKENKEVPSESTIRKAIRKRASYFKAKSEN 183 Query: 521 ISLGGVRRLLEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVK----SR 688 I++ GVRR+LEEDL LDK TLD YK IS Q+D VL + K SR Sbjct: 184 ITMAGVRRVLEEDLKLDKKTLDPYKKFISEQLDEVLKSPQVSKPTTGVKKGSPKKNSHSR 243 Query: 689 KSKKVNXXXXXXXXXXXXXXXXKEKLRKEA--GLRKNIKKFEQPRKRRNSENADMDISRK 862 S+K + + K A G +N + + RKR +E R Sbjct: 244 ASRKTSSEGSSESLESESDEEEVKPKTKMAPKGKTQNSEDLRK-RKRPVTETKMPSKKRS 302 Query: 863 KPKKQIEEDNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMS 1039 K + + EDN+ +++ G++S+DG SQ S EKP RKE SAP YGKRVENLKSIIK+C MS Sbjct: 303 KTAETVSEDNSDAEDSGNVSDDGHSQSSSEKPVKRKEVSAPAYGKRVENLKSIIKSCAMS 362 Query: 1040 VPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219 VPP++YK+VK P++KRE L+KELE ILS+EGLSKNP+EK+IK+ RKKKERAKELEGID Sbjct: 363 VPPSVYKRVKQAPENKREAHLIKELEEILSKEGLSKNPSEKDIKEVRKKKERAKELEGID 422 Query: 1220 XXXXXXXXXXXXXXXXVAPERPVV 1291 VAP +P + Sbjct: 423 TSNIVLSSRRRSTRSFVAPPKPKI 446 >ref|XP_002300995.2| hypothetical protein POPTR_0002s08550g [Populus trichocarpa] gi|550344567|gb|EEE80268.2| hypothetical protein POPTR_0002s08550g [Populus trichocarpa] Length = 476 Score = 302 bits (774), Expect = 3e-79 Identities = 185/420 (44%), Positives = 248/420 (59%), Gaps = 13/420 (3%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E Q++ A+ +R+ HFK QADSLT E VRRLLEKDLGL+KLALD HKRF++ L + +DGA Sbjct: 25 ESQVKEAMLSRVSHFKKQADSLTFEGVRRLLEKDLGLDKLALDVHKRFVKQCLFECLDGA 84 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL---- 418 N+S + + + S + K+ + ++E MEDSP+MG+L Sbjct: 85 VTDNASKDSGDTVEKHVDSPKEVTESPERRDLKNNIKEPCSEDEEKMEDSPVMGLLSGQK 144 Query: 419 TPKSEV-GTQSSL-----SESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580 T KS+ TQ++ SE +IKKA++ RA +++ANS++I++ G+RRLLEEDL LDK + Sbjct: 145 TTKSKAKDTQANEVKEVPSEGSIKKAMMRRASYIKANSEEITMAGLRRLLEEDLKLDKFS 204 Query: 
581 LDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXXXKE 760 LD YK IS+Q+D V ED + + KK Sbjct: 205 LDPYKKFISKQLDEVSSRESADSSDKESEEEDEEVKPKKK-------------------- 244 Query: 761 KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRK--KPKKQIEEDNNSDE-GGSISEDGQ 931 + G+ + ++ E +KRR +E + K KP + EDN+ E G+ SED Sbjct: 245 ----KIGVERKMQNSEGSKKRRRTEKETKVSANKRIKPLETAAEDNSDSEVSGNASEDNN 300 Query: 932 SQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVKE 1111 S S EKP +KE S P YGKRVE+LKS+IK+CGMSVPP+IYKKVK P++KRE L+KE Sbjct: 301 SPSSAEKPVKKKEASTPAYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPENKREARLIKE 360 Query: 1112 LEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPERPVV 1291 LE ILSREGLS NP+EKEIK+ RK+KERAKELEGID VAP +P V Sbjct: 361 LEEILSREGLSSNPSEKEIKEVRKRKERAKELEGIDLSNIVTTSRRRSATSFVAPPKPKV 420 >ref|XP_006351897.1| PREDICTED: lisH domain-containing protein C1711.05-like [Solanum tuberosum] Length = 476 Score = 300 bits (769), Expect = 1e-78 Identities = 192/426 (45%), Positives = 254/426 (59%), Gaps = 12/426 (2%) Frame = +2 Query: 44 EGDGEKGAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRH 223 E + EK E+++E A+ +R+QHFK+ ADS TLE VRRL+E+DL LEK ALD HKR I+ Sbjct: 4 EVNEEKQGIEVKIEEALRSRIQHFKENADSFTLERVRRLIEEDLELEKYALDVHKRSIKL 63 Query: 224 YLEKIMDGA-DESNSSPATVNMEGGVLLSXXXXXXXXXXXX-ANSESKKASTGNKETMED 397 LEK+M+ A D+ + + N+E L+ + K ++ M+D Sbjct: 64 ILEKLMENAADDGDPKDSQENLEKDASLTKQEKEVLESPKKQVIKKDIKEPAFDEAEMDD 123 Query: 398 SPIMGVLTPKSE-VGTQS-SLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLD 571 SPIMGV++ KSE V QS SES+IKKAI ERA H + NS+ I+L GVRRLLEEDLGL+ Sbjct: 124 SPIMGVMSSKSESVDAQSVKASESSIKKAIWERAAHFRDNSESITLAGVRRLLEEDLGLE 183 Query: 572 KNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXX 751 KNTLDA+K I Q+D VL +S + KS+ +KK + Sbjct: 184 KNTLDAFKKFIQIQIDEVLTPSEAPKSSSVKKSPEKKSKTAKK-SGENSNSFSSKRKHIA 242 Query: 752 XKEKLRKEAGLRKNIKKFE--QPRKRRNSENADMDISRKKPKKQIEEDNNS-DEGGSISE 922 K K RK + ++ ++K E + RK+ NSE+ +K+ K + ++N+ D S SE Sbjct: 243 EKVKSRKSSAAKETVEKSEGLKKRKKPNSEDNVPAKKQKEVSKNLSDENSDGDTDKSDSE 302 
Query: 923 DGQSQLSLEKPAPRKE-----KSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDK 1087 DGQS S E + +K+ + GYGKRVE+LKSI KACGMSV P+IYK+ K V DDK Sbjct: 303 DGQSGSSAEIISAKKKVVKGASANTGYGKRVEHLKSIFKACGMSVAPSIYKRAKQVSDDK 362 Query: 1088 RETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXX 1267 RE L+KELE ILS EGLS NPTEKEIK+ +K+K+ AKELEGID Sbjct: 363 REGFLIKELEKILSAEGLSTNPTEKEIKEVKKRKQTAKELEGIDLSNIVSNTRRRSTTSF 422 Query: 1268 VAPERP 1285 VAP RP Sbjct: 423 VAPPRP 428 >ref|XP_003522580.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Glycine max] Length = 490 Score = 298 bits (762), Expect = 8e-78 Identities = 177/401 (44%), Positives = 242/401 (60%), Gaps = 18/401 (4%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI+ L K ++G Sbjct: 16 ESQIETAMRSRVSHFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFIKQCLLKCLEGV 75 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430 + + ++ + G S + ++K ++E MEDSP++G+L + Sbjct: 76 GDDDGPK--ISGKEGEKGSSIQESEEPKEECESKDAKDLCPEDEEKMEDSPVLGLLKEQK 133 Query: 431 EV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580 GT+ SE+ IKKA+ +R+ +++AN++KI++ G+RRLLEEDL LDK T Sbjct: 134 RAKLETKDDKGNGTKVVPSEALIKKAVRKRSSYIKANAEKITMAGLRRLLEEDLKLDKFT 193 Query: 581 LDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVNXXXXXXXXXXX 739 LD YK +S+Q+D VL + V + S + N Sbjct: 194 LDPYKKFVSQQLDEVLTSSEVPEPAKNAKKIVKKKPDTKVTKKVSSEENSDTSDKETDEE 253 Query: 740 XXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSI 916 + K RK+ + +K QP+KR+ E+ R KP K EDN+ +++ G Sbjct: 254 ESEEDEVKPRKKILPKGKVKTSVQPKKRKGEESDLSSKKRVKPAKAASEDNSDAEDNGKN 313 Query: 917 SEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRET 1096 SED QS S EKP+ +KE S P YGKRVE+LKS+IKACGMSVPP IYKKVK VP++KRE Sbjct: 314 SEDDQSHSSPEKPSKKKEVSNPVYGKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKREG 373 Query: 1097 ILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219 L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID Sbjct: 374 
QLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414 >ref|XP_002516334.1| conserved hypothetical protein [Ricinus communis] gi|223544564|gb|EEF46081.1| conserved hypothetical protein [Ricinus communis] Length = 517 Score = 296 bits (759), Expect = 2e-77 Identities = 175/419 (41%), Positives = 249/419 (59%), Gaps = 12/419 (2%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E Q++ A+ +R+ +F +Q++SLT E VRRLLEKDLGL++ ALD HKRF++ L + +DG Sbjct: 26 ESQIKDAMRSRVNYFNEQSNSLTFEGVRRLLEKDLGLQEYALDVHKRFVKQCLLQCLDGD 85 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLT--- 421 + S S T E G + K+ + ++E E+SP+MG+LT Sbjct: 86 NASKDSGETD--EKGSRSIKGEATESPEGHESKDHIKEPCSEDEEKTEESPVMGLLTGKK 143 Query: 422 -PKSEVG---TQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTLDA 589 PKSE + + +ES IKKA+ +RA +++ANSDK+++ G+RRLLEEDL LDK+ LD Sbjct: 144 TPKSETDKTLVKEAPTESIIKKALSKRASYIKANSDKVTMAGLRRLLEEDLRLDKHALDP 203 Query: 590 YKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXXXKEKLR 769 YK IS Q+D VL + + + SKK+ +++++ Sbjct: 204 YKKFISAQLDEVLQSSEVSEPKKKSVKTNSQGKASKKMRTEESSDSSGKEMDTEDEDEVK 263 Query: 770 KEAGLRKNIKKFE----QPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSISEDGQS 934 + + N K + RKR E R KP +++ ED++ +++ G+ SEDG+S Sbjct: 264 PKKKIAPNKKMINSEGSKKRKRFEKETKVTSKKRVKPTEKVAEDSSDAEDSGNASEDGRS 323 Query: 935 QLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVKEL 1114 Q S EKP +KE P YGKRVE+LKS+IK+CGMSVPP +YKKVK VP++KRE L+KEL Sbjct: 324 QSSAEKPVKKKEAPTPVYGKRVEHLKSVIKSCGMSVPPVVYKKVKQVPENKREAQLIKEL 383 Query: 1115 EGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPERPVV 1291 E ILS+EGLS NP+EKEIK+ RK+KERAKELEGID V P +P + Sbjct: 384 EEILSKEGLSSNPSEKEIKEVRKRKERAKELEGIDMSNIVSSSRRRSATSYVPPPKPKI 442 >ref|XP_007019032.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508724360|gb|EOY16257.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 523 Score = 293 bits (750), Expect = 2e-76 Identities = 178/445 (40%), Positives = 252/445 (56%), Gaps = 38/445 (8%) 
Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E ++ A+ +R+ HFK+QADSLT E VRRLLEKDLGLE ALD HKRF++ L K +DG Sbjct: 29 ESRITTAMRSRVGHFKEQADSLTFEGVRRLLEKDLGLETFALDVHKRFVKQCLLKCLDGG 88 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL---- 418 D+ ++ ++ L + + + K+A + ++E +EDSP++G+L Sbjct: 89 DDDDAPKSSGETGEKNLSTTTEVTESPKGRQSKKDVKEAFSEDEEKLEDSPVLGLLTGHK 148 Query: 419 -----TPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTL 583 T ++E + ESTIKKAI +RA +++ANS+K+++ G+RRLLEEDL LDK+TL Sbjct: 149 TTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKDTL 208 Query: 584 DAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVK----SRKSKKVNXXXXXXXXXXXXXXX 751 D YK I+ Q+D VL + ++K S+ SKK + Sbjct: 209 DPYKKFITEQLDEVLKSREVSAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSESDEE 268 Query: 752 XKE----------------------KLRKEAGLRKNIKKFEQPRKRR-NSENADMDISRK 862 E K +K+ + IK E +KR+ + A+M ++ Sbjct: 269 EGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMPSKKR 328 Query: 863 KPKKQIEEDNNSD--EGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGM 1036 + D+NSD + GS+S+D +S+ S K RKE S P YGK VE+LKS+IK+CGM Sbjct: 329 SKHAESISDDNSDAEDSGSVSDDNRSRSSAAKAVKRKETSTPVYGKHVEHLKSVIKSCGM 388 Query: 1037 SVPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGI 1216 SVPP IYK+VK VP++ RE L+KELE ILS+EGLS NP+EKEIK+ RK+KERAKELEGI Sbjct: 389 SVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKELEGI 448 Query: 1217 DXXXXXXXXXXXXXXXXVAPERPVV 1291 D VAP +P + Sbjct: 449 DTSNIVLSSRRRSTTSFVAPPKPKI 473 >ref|XP_006581582.1| PREDICTED: DNA ligase 1-like isoform X2 [Glycine max] Length = 486 Score = 290 bits (743), Expect = 1e-75 Identities = 177/412 (42%), Positives = 243/412 (58%), Gaps = 19/412 (4%) Frame = +2 Query: 41 AEGDGEKGAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 217 +EG +K E Q+E A+ +R+ FK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI Sbjct: 5 SEGTAKKEEILESQIETAMRSRVSLFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64 Query: 218 RHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMED 397 + L K ++G + + A ++ + G 
+ ++K ++E MED Sbjct: 65 KQCLLKCLEGVGDDDG--AKISGKEGEKGTSTQESEEPKEECEAKDAKDLCPEDEEKMED 122 Query: 398 SPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRL 547 SP++G+L + GT+ E+ IKKA+ +R+ +++AN++KI++ G+RRL Sbjct: 123 SPVLGLLKEQKRAKLETKDDKGNGTKVVPIEALIKKAVRKRSSYIKANAEKITMAGLRRL 182 Query: 548 LEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVN 706 LEEDL LDK TLD YK +S+Q+D VL + V + S + N Sbjct: 183 LEEDLKLDKFTLDPYKKFVSQQLDEVLASSEVPKPSNNAKKIVKKKPDTKVTKKVSSEEN 242 Query: 707 XXXXXXXXXXXXXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEE 886 + K RK+ + +K QP+KR+ E R KP K E Sbjct: 243 SDTSDKETDEEESEEDEVKPRKKIVPKGKVKTSVQPKKRKGEETDLSSKKRVKPAKATSE 302 Query: 887 DNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKK 1063 DN+ +++ G SED QS S EKP+ +KE S P YGK VE+LKS+IKACGMSVPP IYKK Sbjct: 303 DNSDAEDDGKNSEDDQSSSSPEKPSKKKEVSTPVYGKHVEHLKSVIKACGMSVPPVIYKK 362 Query: 1064 VKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219 VK VP++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID Sbjct: 363 VKQVPENKREEQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414 >ref|XP_003527934.1| PREDICTED: DNA ligase 1-like isoform X1 [Glycine max] Length = 488 Score = 290 bits (743), Expect = 1e-75 Identities = 177/412 (42%), Positives = 243/412 (58%), Gaps = 19/412 (4%) Frame = +2 Query: 41 AEGDGEKGAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 217 +EG +K E Q+E A+ +R+ FK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI Sbjct: 5 SEGTAKKEEILESQIETAMRSRVSLFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64 Query: 218 RHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMED 397 + L K ++G + + A ++ + G + ++K ++E MED Sbjct: 65 KQCLLKCLEGVGDDDG--AKISGKEGEKGTSTQESEEPKEECEAKDAKDLCPEDEEKMED 122 Query: 398 SPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRL 547 SP++G+L + GT+ E+ IKKA+ +R+ +++AN++KI++ G+RRL Sbjct: 123 SPVLGLLKEQKRAKLETKDDKGNGTKVVPIEALIKKAVRKRSSYIKANAEKITMAGLRRL 182 Query: 548 LEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVN 706 LEEDL LDK TLD YK +S+Q+D VL + V + S + N Sbjct: 183 
LEEDLKLDKFTLDPYKKFVSQQLDEVLASSEVPKPSNNAKKIVKKKPDTKVTKKVSSEEN 242 Query: 707 XXXXXXXXXXXXXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEE 886 + K RK+ + +K QP+KR+ E R KP K E Sbjct: 243 SDTSDKETDEEESEEDEVKPRKKIVPKGKVKTSVQPKKRKGEETDLSSKKRVKPAKATSE 302 Query: 887 DNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKK 1063 DN+ +++ G SED QS S EKP+ +KE S P YGK VE+LKS+IKACGMSVPP IYKK Sbjct: 303 DNSDAEDDGKNSEDDQSSSSPEKPSKKKEVSTPVYGKHVEHLKSVIKACGMSVPPVIYKK 362 Query: 1064 VKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219 VK VP++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID Sbjct: 363 VKQVPENKREEQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414 >ref|XP_007019033.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508724361|gb|EOY16258.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 521 Score = 288 bits (738), Expect = 5e-75 Identities = 178/445 (40%), Positives = 252/445 (56%), Gaps = 38/445 (8%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E ++ A+ +R+ HFK+QADSLT E VRRLLEKDLGLE ALD HKRF++ L K +DG Sbjct: 29 ESRITTAMRSRVGHFKEQADSLTFEGVRRLLEKDLGLETFALDVHKRFVKQCLLKCLDGG 88 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL---- 418 D+ ++ ++ L + + + K+A + ++E +EDSP++G+L Sbjct: 89 DDDDAPKSSGETGEKNLSTTTEVTESPKGRQSKKDVKEAFSEDEEKLEDSPVLGLLTGHK 148 Query: 419 -----TPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTL 583 T ++E + ESTIKKAI +RA +++ANS+K+++ G+RRLLEEDL LDK+TL Sbjct: 149 TTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKDTL 208 Query: 584 DAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVK----SRKSKKVNXXXXXXXXXXXXXXX 751 D YK I+ Q+D VL + ++K S+ SKK + Sbjct: 209 DPYKKFITEQLDEVLKSREVSAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSESDEE 268 Query: 752 XKE----------------------KLRKEAGLRKNIKKFEQPRKRR-NSENADMDISRK 862 E K +K+ + IK E +KR+ + A+M ++ Sbjct: 269 EGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMPSKKR 328 Query: 863 KPKKQIEEDNNSD--EGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGM 1036 + D+NSD + GS+S+D +S+ 
S K RKE S P YGK VE+LKS+IK+CGM Sbjct: 329 SKHAESISDDNSDAEDSGSVSDDNRSRSSAAK--ARKETSTPVYGKHVEHLKSVIKSCGM 386 Query: 1037 SVPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGI 1216 SVPP IYK+VK VP++ RE L+KELE ILS+EGLS NP+EKEIK+ RK+KERAKELEGI Sbjct: 387 SVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKELEGI 446 Query: 1217 DXXXXXXXXXXXXXXXXVAPERPVV 1291 D VAP +P + Sbjct: 447 DTSNIVLSSRRRSTTSFVAPPKPKI 471 >ref|XP_007222349.1| hypothetical protein PRUPE_ppa004840mg [Prunus persica] gi|462419285|gb|EMJ23548.1| hypothetical protein PRUPE_ppa004840mg [Prunus persica] Length = 489 Score = 283 bits (725), Expect = 2e-73 Identities = 169/421 (40%), Positives = 248/421 (58%), Gaps = 16/421 (3%) Frame = +2 Query: 77 QLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGADE 256 Q++ A+ +R+ +FK+Q+DSLT E VRRLLEKDLGLE ALD HKRF++ +L + ++GA + Sbjct: 25 QIKDAMRSRVPYFKEQSDSLTFEGVRRLLEKDLGLETFALDVHKRFVKEHLVECLEGAGD 84 Query: 257 SNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL----TP 424 N+S ++ + ++ +N + K+ + ++E MEDSP+MG+L T Sbjct: 85 DNTSKSSGETDEKSIIKGEAAESPEGYK-SNKDVKETYSEDEEKMEDSPVMGLLAGNKTA 143 Query: 425 KS------EVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTLD 586 KS ++ + SE+ IK A+ +R +++ANS+KI++ G+RRLLEEDL L+K TLD Sbjct: 144 KSGTEETKSTKSKKAPSETVIKSALRKRVSYIKANSEKITMAGLRRLLEEDLKLEKYTLD 203 Query: 587 AYKNLISRQVDLVLXXXXXXXXXXXXRS--EDVKSRKSKKVNXXXXXXXXXXXXXXXXKE 760 K I+ +D VL ++ + V+ + S KV E Sbjct: 204 PCKKFINEHLDKVLESCEISEPAPVKKNVKKSVQRKASTKVRSDESSGSSDNESDEEEDE 263 Query: 761 KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK----QIEEDNNSDEGGSISEDG 928 + + K + K+R + +IS KK K + E+ ++++ G++SED Sbjct: 264 VKPRNKSVPKGKMQNSNDLKKRKRMANETNISGKKRIKPSETEPEDKSDAEVSGNVSEDD 323 Query: 929 QSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVK 1108 +SQ S EKP +KE S P YGKRVE+L+S+IKACGMSV P++YKKVK VP+ KRE L+K Sbjct: 324 RSQSSAEKPVKKKEVSTPAYGKRVEHLRSVIKACGMSVAPSVYKKVKQVPESKREAHLIK 383 Query: 1109 ELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPERPV 1288 ELE ILS+EGLS 
+PTEKEIK+ +KKKERAKELEGID V P +P Sbjct: 384 ELEEILSKEGLSAHPTEKEIKEVKKKKERAKELEGIDMSNIVTSSRRRSTTSFVPPPKPK 443 Query: 1289 V 1291 + Sbjct: 444 I 444 >ref|XP_004500560.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Cicer arietinum] gi|502130188|ref|XP_004500561.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Cicer arietinum] Length = 497 Score = 283 bits (723), Expect = 3e-73 Identities = 175/428 (40%), Positives = 244/428 (57%), Gaps = 18/428 (4%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E Q++ A+ +R+ HFK QADSLT E VRRLLEKDLG E+ +LD+HKRFI+ LEK ++ Sbjct: 16 ESQIQTAMLSRVPHFKQQADSLTFEGVRRLLEKDLGFEEYSLDSHKRFIKQCLEKCLEEV 75 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430 + ++S + E S+ +K T ++E MEDSP++G+L + Sbjct: 76 GDDDASKMSGEEEEK---GESTQEVEGKKEEHQSKDEKDLTEDEEKMEDSPVLGLLKEQK 132 Query: 431 EVGTQSSLSEST----------IKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580 V ++ +E IKKAI++R+ +L+AN+D++++ G+RRLLEEDL LDK + Sbjct: 133 RVKNETKKAEGNGKKVVPNEALIKKAIIKRSSYLKANADEVTVAGLRRLLEEDLKLDKFS 192 Query: 581 LDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXXXKE 760 LD +K I +Q+D VL + + VK + KV +E Sbjct: 193 LDPFKKFIRQQLDEVLMSSEVLEPAKSAK-KIVKKKPDSKVTKKVSTEENSDTSDKVSEE 251 Query: 761 KLRKEAGLRKNIKKFEQ------PRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSIS 919 + +E ++ K + P+KR+ E R KP K+ EDN+ +++GG S Sbjct: 252 EESQEDEVKPKKKSVPKGKASVGPKKRKGEEIKSPSKKRAKPDKEASEDNSDAEDGGKNS 311 Query: 920 EDGQSQLSLEKPAPRKEKSAPG-YGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRET 1096 ED QS S E +K+ S P Y KRVE+LKS+IKACGMSVPP IYKKVK VP++KRE Sbjct: 312 EDDQSHSSAENTTQKKQVSTPVVYSKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKREG 371 Query: 1097 ILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAP 1276 L+KELE ILSREGLS NP+EKEIK+ ++KKERAKELEGID AP Sbjct: 372 QLIKELEEILSREGLSSNPSEKEIKEVKRKKERAKELEGIDMSNIVSSTRRRATTSFAAP 431 Query: 1277 ERPVVRAK 1300 P + K Sbjct: 432 PPPKPKPK 439 >ref|XP_004169339.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized 
protein LOC101229552 [Cucumis sativus] Length = 488 Score = 281 bits (719), Expect = 8e-73 Identities = 171/400 (42%), Positives = 235/400 (58%), Gaps = 17/400 (4%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E ++ A+ +R+ HFK+QADSLT E VRRLLEKDL +E LD HKR+++ L K ++ Sbjct: 23 ETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCMETYTLDVHKRYVKQCLVKCLEAD 82 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430 E N S + + G ++ + +K+ ++E MEDSP+MG+LT +S Sbjct: 83 LEDNVSKDS-ELTGRKSVNKEEAPESPEGHQSKKGAKEPCLEDEEKMEDSPVMGLLTGRS 141 Query: 431 EVGTQSS-------------LSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLD 571 +S SESTI KAI +R +L+ANS+K+++ GVRRLLE+DL L Sbjct: 142 TKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLT 201 Query: 572 KNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXX 751 KN LD+ K IS+QV+ +L +E V + KS K Sbjct: 202 KNVLDSCKKFISQQVEEILTSCEA--------AEQVSNLKSPKKISKESSYSTEGSSSEE 253 Query: 752 XKEKLR--KEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNNSDEGG-SISE 922 +++ K + I + +KR+ S + ++ Q D +SDEGG ++SE Sbjct: 254 ENDEVNPGKTNATKGRIPDANETKKRKRSTKKTVSAQKQSKHVQDTSDEDSDEGGGNVSE 313 Query: 923 DGQSQLSLEKPAPRK-EKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099 DG+S S EKP ++ S P YGKRVE+LKS+IK+CGMSVPP+IYKKVK P+ KRE+ Sbjct: 314 DGRSGSSNEKPVKKEVSSSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQ 373 Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219 L+KELEGILSREGLS N TEKEIK+ +KKKERAKELEGID Sbjct: 374 LIKELEGILSREGLSANSTEKEIKEVKKKKERAKELEGID 413 >ref|XP_004145363.1| PREDICTED: uncharacterized protein LOC101217045 [Cucumis sativus] Length = 488 Score = 281 bits (719), Expect = 8e-73 Identities = 171/400 (42%), Positives = 235/400 (58%), Gaps = 17/400 (4%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E ++ A+ +R+ HFK+QADSLT E VRRLLEKDL +E LD HKR+++ L K ++ Sbjct: 23 ETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCMETYTLDVHKRYVKQCLVKCLEAD 82 Query: 251 
DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430 E N S + + G ++ + +K+ ++E MEDSP+MG+LT +S Sbjct: 83 LEDNVSKDS-ELTGRKSVNKEEAPESPEGHQSKKGAKEPCLEDEEKMEDSPVMGLLTGRS 141 Query: 431 EVGTQSS-------------LSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLD 571 +S SESTI KAI +R +L+ANS+K+++ GVRRLLE+DL L Sbjct: 142 TKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLT 201 Query: 572 KNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXX 751 KN LD+ K IS+QV+ +L +E V + KS K Sbjct: 202 KNVLDSCKKFISQQVEEILTSCEA--------AEQVSNLKSPKKISKESSYSTEGSSSEE 253 Query: 752 XKEKLR--KEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNNSDEGG-SISE 922 +++ K + I + +KR+ S + ++ Q D +SDEGG ++SE Sbjct: 254 ENDEVNPGKTNATKGRIPDSNETKKRKRSTKKTVSAQKQSKHVQDTSDEDSDEGGGNVSE 313 Query: 923 DGQSQLSLEKPAPRK-EKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099 DG+S S EKP ++ S P YGKRVE+LKS+IK+CGMSVPP+IYKKVK P+ KRE+ Sbjct: 314 DGRSGSSNEKPVKKEVSSSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQ 373 Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219 L+KELEGILSREGLS N TEKEIK+ +KKKERAKELEGID Sbjct: 374 LIKELEGILSREGLSANSTEKEIKEVKKKKERAKELEGID 413 >ref|XP_004290855.1| PREDICTED: uncharacterized protein LOC101302129 [Fragaria vesca subsp. 
vesca] Length = 490 Score = 275 bits (703), Expect = 6e-71 Identities = 167/438 (38%), Positives = 258/438 (58%), Gaps = 17/438 (3%) Frame = +2 Query: 29 AELMAEGDGEKGAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHK 208 +E + + E G E ++ A+ AR+ HFK+Q+DSLT +VRR+LEKDLGLE ALDAHK Sbjct: 8 SEAPMKKEEETGDMESKILEAMKARVPHFKEQSDSLTFVNVRRVLEKDLGLEPSALDAHK 67 Query: 209 RFIRHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKET 388 F++ +L K ++GA E N+S ++ + L+ +N + K+ S+ ++E Sbjct: 68 GFVKEHLLKCLEGAGEDNNSKSSGQTDEKSLIKGEATGSTEGHQ-SNKDMKETSSADEEK 126 Query: 389 MEDSPIMGVLTPKSEV-----GTQSSLS-----ESTIKKAILERADHLQANSDKISLGGV 538 +EDSP +LT G++SS + E+ IK A+ +R +++AN +K+++G + Sbjct: 127 VEDSPASELLTEHKTAKVKAEGSKSSNNKKAPTEAMIKSALGKRGSYIKANIEKLTMGEL 186 Query: 539 RRLLEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKS---KKVNX 709 RR+LE+DL LD +LD +K I++Q+D VL + K ++ ++++ Sbjct: 187 RRVLEKDLKLDTYSLDPFKKFINQQLDEVLESCVDPEPVKNVKKNVKKPQRKPTPEEISE 246 Query: 710 XXXXXXXXXXXXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQI--- 880 + K RK++ + ++ + +KR++ + +IS KK K + Sbjct: 247 ESSGPANSGTDEEEDEVKPRKKSVTKGKMQNSDGLKKRKSLAK-ETNISGKKRIKSLKAD 305 Query: 881 -EEDNNSDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIY 1057 EE +++ + ++SED S+ S EKP +KE S P YGKRVE+L+S+IKACGMSVPP+IY Sbjct: 306 SEEKSDAKDSENVSEDEDSKSSAEKPVKKKEVSTPAYGKRVEHLRSVIKACGMSVPPSIY 365 Query: 1058 KKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXX 1237 KKVK VP++KRE L+KELE IL REGLS +PTEKEIK+ +KKKE+AKELEGID Sbjct: 366 KKVKQVPENKREAQLIKELEDILGREGLSSSPTEKEIKEVKKKKEKAKELEGIDMSNIVT 425 Query: 1238 XXXXXXXXXXVAPERPVV 1291 V P +P + Sbjct: 426 SSRRRSTTSFVPPPKPKI 443 >ref|XP_006434169.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] gi|557536291|gb|ESR47409.1| hypothetical protein CICLE_v10000938mg [Citrus clementina] Length = 451 Score = 274 bits (701), Expect = 1e-70 Identities = 158/381 (41%), Positives = 227/381 (59%), Gaps = 15/381 (3%) Frame = +2 Query: 65 AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 244 + E 
Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE ALD HK+FI+ L + MD Sbjct: 19 SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78 Query: 245 GADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTP 424 GA ++S + + S + + K+ N E MEDSP++G++T Sbjct: 79 GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138 Query: 425 KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574 + G + SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK Sbjct: 139 NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198 Query: 575 NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSR---KSKKVNXXXXXXXXXXXXX 745 TLD++K +IS+++D VL + + +K K+K+V+ Sbjct: 199 FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258 Query: 746 XXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 919 + K RK+ + ++ E +KR+ E +KK K K EDNN E GS+S Sbjct: 259 EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318 Query: 920 EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099 +DG+SQ S EKP +K S P YGKRVE+LK++IK+C MS+PP++YKKVK P++KRE Sbjct: 319 DDGRSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCAMSIPPSVYKKVKQAPENKREAQ 378 Query: 1100 LVKELEGILSREGLSKNPTEK 1162 L+KELEGILSREGLS NP+EK Sbjct: 379 LIKELEGILSREGLSSNPSEK 399 >ref|XP_007137404.1| hypothetical protein PHAVU_009G124200g [Phaseolus vulgaris] gi|561010491|gb|ESW09398.1| hypothetical protein PHAVU_009G124200g [Phaseolus vulgaris] Length = 493 Score = 273 bits (697), Expect = 3e-70 Identities = 174/438 (39%), Positives = 245/438 (55%), Gaps = 22/438 (5%) Frame = +2 Query: 38 MAEGDGE--KGA-FELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHK 208 MAE E KG E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HK Sbjct: 1 MAEDSEEMKKGENIESQIETAMLSRVSHFKEQSDSLTFEGVRRLLEKDLGLEECALDVHK 60 Query: 209 RFIRHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKET 388 RFI+ L + ++G + + ++ + G + + K ++E Sbjct: 61 RFIKQCLLECLEGVGDD--AGPRISEKAGEEGAGTLEPDEPKEKCELKDEKDLCPEDEEK 118 Query: 389 MEDSPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGV 538 MEDSP++G+L + G + SE+ + KA+ 
+R+ +++AN++ I++ G+ Sbjct: 119 MEDSPVLGLLKEQKRAKLETKDDKGNGNKVVPSEALVMKAVKKRSSYIKANAETITMAGL 178 Query: 539 RRLLEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXX 718 RRLLE+DL LDK TLD YK IS+Q+D VL + + VK + KV Sbjct: 179 RRLLEDDLKLDKFTLDLYKKFISQQLDEVLASSVVSEPAKNAK-KIVKKKPDTKVTKKVS 237 Query: 719 XXXXXXXXXXXXKEKLRKEAGLRKNIK-----KFEQPRKRRNSENADMDISRKKPKKQI- 880 E +E ++ K K + P + + + + D+S KK K Sbjct: 238 SEENSDTSDKEIDEDESQEDEVKPMKKVVPKGKAQTPVQSKKRKGEETDLSSKKRMKPAK 297 Query: 881 ---EEDNNSDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPN 1051 EE +++++ G SED QS S EKP+ +KE S P YGKRVE LKS+IKACGM VPP+ Sbjct: 298 AASEEISDAEDSGKNSEDDQSHSSSEKPSKKKEVSTPVYGKRVETLKSVIKACGMGVPPS 357 Query: 1052 IYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXX 1231 IYKK+K V ++KRE L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID Sbjct: 358 IYKKIKQVSENKREGQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGIDVSNI 417 Query: 1232 XXXXXXXXXXXXVAPERP 1285 +AP P Sbjct: 418 VSSSRRRSTSSYIAPPPP 435 >ref|XP_006578974.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Glycine max] Length = 408 Score = 270 bits (690), Expect = 2e-69 Identities = 163/382 (42%), Positives = 225/382 (58%), Gaps = 18/382 (4%) Frame = +2 Query: 71 ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250 E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI+ L K ++G Sbjct: 16 ESQIETAMRSRVSHFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFIKQCLLKCLEGV 75 Query: 251 DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430 + + ++ + G S + ++K ++E MEDSP++G+L + Sbjct: 76 GDDDGPK--ISGKEGEKGSSIQESEEPKEECESKDAKDLCPEDEEKMEDSPVLGLLKEQK 133 Query: 431 EV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580 GT+ SE+ IKKA+ +R+ +++AN++KI++ G+RRLLEEDL LDK T Sbjct: 134 RAKLETKDDKGNGTKVVPSEALIKKAVRKRSSYIKANAEKITMAGLRRLLEEDLKLDKFT 193 Query: 581 LDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVNXXXXXXXXXXX 739 LD YK +S+Q+D VL + V + S + N Sbjct: 194 LDPYKKFVSQQLDEVLTSSEVPEPAKNAKKIVKKKPDTKVTKKVSSEENSDTSDKETDEE 253 Query: 740 
XXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSI 916 + K RK+ + +K QP+KR+ E+ R KP K EDN+ +++ G Sbjct: 254 ESEEDEVKPRKKILPKGKVKTSVQPKKRKGEESDLSSKKRVKPAKAASEDNSDAEDNGKN 313 Query: 917 SEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRET 1096 SED QS S EKP+ +KE S P YGKRVE+LKS+IKACGMSVPP IYKKVK VP++KRE Sbjct: 314 SEDDQSHSSPEKPSKKKEVSNPVYGKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKREG 373 Query: 1097 ILVKELEGILSREGLSKNPTEK 1162 L+KELE ILSREGLS NP+EK Sbjct: 374 QLIKELEEILSREGLSSNPSEK 395