BLASTX nr result
ID: Zingiber25_contig00033977
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Zingiber25_contig00033977 (1388 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003576067.1| PREDICTED: UPF0361 protein C3orf37 homolog [... 372 e-100 ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [... 367 5e-99 ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ... 363 9e-98 ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu... 362 3e-97 ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [... 360 1e-96 ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog i... 358 3e-96 ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy... 357 5e-96 ref|XP_004982141.1| PREDICTED: UPF0361 protein C3orf37 homolog [... 356 1e-95 gb|EMJ00266.1| hypothetical protein PRUPE_ppa018685mg [Prunus pe... 355 3e-95 gb|ESW12729.1| hypothetical protein PHAVU_008G137400g [Phaseolus... 351 4e-94 ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hy... 346 2e-92 ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [A... 345 2e-92 ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arab... 342 3e-91 ref|XP_002527247.1| conserved hypothetical protein [Ricinus comm... 342 3e-91 ref|NP_001144583.1| uncharacterized protein LOC100277594 [Zea ma... 342 3e-91 ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] ... 339 1e-90 gb|ACF82411.1| unknown [Zea mays] gi|414588288|tpg|DAA38859.1| T... 339 1e-90 gb|EOX93768.1| Uncharacterized protein TCM_002685 [Theobroma cacao] 338 3e-90 gb|EMS46705.1| hypothetical protein TRIUR3_27289 [Triticum urartu] 337 7e-90 gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis] 332 2e-88 >ref|XP_003576067.1| PREDICTED: UPF0361 protein C3orf37 homolog [Brachypodium distachyon] Length = 421 Score = 372 bits (954), Expect = e-100 Identities = 208/421 (49%), Positives = 270/421 (64%), Gaps = 45/421 (10%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGF----------------------AASIPTRQIDRYRPSYNV 213 MCGR RCTL+P ++ARA GF A ++PT Q+DR+RPSYNV Sbjct: 1 MCGRARCTLSPAQIARAFGFPTTGAAGGGDGGGGAGAAGGGDAPAVPTLQMDRFRPSYNV 60 Query: 214 SPGAYLPVLLLERTT-------GEAEASPSICCMKWGLVPSFTKKTEKPDHYKMFNARSE 372 SPGAYLPV + RT GE E P I CMKWGLVPSFT KTEKPDH++MFNARSE Sbjct: 61 SPGAYLPVGVRARTVDGDGGREGEGELEPVIQCMKWGLVPSFTSKTEKPDHFRMFNARSE 120 Query: 373 SVKEKPSFCRLLPTNRCVVAVEGFYEWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSE 552 S+KE+ SF RL+P NR +VAVEGFYEWKKD SKKQPYYIHF+D RPLVFAAL+D+WK+SE Sbjct: 121 SIKERASFRRLVPKNRGLVAVEGFYEWKKDGSKKQPYYIHFQDQRPLVFAALFDTWKNSE 180 Query: 553 GDILYTFTILTVGSSKSLQWLHDRMPVILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVW 732 G+ L+TF+ILT +S SL+WLHDRMPVILGD+ SV+ WLNNG K E + PYE DLVW Sbjct: 181 GETLHTFSILTTCASTSLKWLHDRMPVILGDNNSVNAWLNNGSVKLEEITVPYEGADLVW 240 Query: 733 YPVTTAVGKPSFDGPDCITEIKLKRPVENQIAKFFTKKADGKNQ-MEEEHTKRLELSPKG 909 YPVTTA+GK SF+G +CI E+KL RP E I++FFTKKA Q ++ E T R + Sbjct: 241 YPVTTAMGKTSFNGLECIQEVKL-RPSEKPISEFFTKKAAVNCQGIKPEKTSREITESQV 299 Query: 910 DRVDDTKKDASRTENVANFKEESP-QNDMFNCMKEDCEHHVD-EKHSSNGLLKKE----- 1068 R + D S + ++ P +N C+ +D ++ + +++KE Sbjct: 300 FRTAKEECDESEENQLDKTDKQQPAENQEAACVVKDEPATLELQTFHPAQIIEKEAVTVP 359 Query: 1069 ---NVSPDIFGTKRQTQEIPLDSGSTSEK-----VSSLLKKARRVKNVDDKQASLLSYFG 1224 N D+F TKR+ ++ +++ ++K + + KK + K+ D QASLLS+F Sbjct: 360 DDANQKDDLFRTKRKIEDTEVNAEVKTQKSCRSTILPVKKKEKGAKSSSDGQASLLSFFA 419 Query: 1225 K 1227 K Sbjct: 420 K 420 >ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera] gi|296090568|emb|CBI40918.3| unnamed protein product [Vitis vinifera] Length = 392 Score = 367 bits (943), Expect = 5e-99 Identities = 212/408 (51%), Positives = 254/408 (62%), Gaps = 31/408 (7%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTR--QIDRYRPSYNVSPGAYLPVLLLERTTGEAEA 273 MCGR RCTL PD +ARAC ++PT+ Q+DRYRPSYNVSPGA LPV+ R G E Sbjct: 1 MCGRARCTLRPDNIARACNLN-TLPTQNIQMDRYRPSYNVSPGANLPVV---RRGGGTEG 56 Query: 274 SPSIC-CMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYE 450 +I CMKWGLVPSFTKK+EKPDHYKMFNARSESV EK SF RL+P NRC+VAVEGFYE Sbjct: 57 EEAIVHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVAVEGFYE 116 Query: 451 WKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMP 630 WKKD SKKQPYYIH KD RPLVFAAL+DSW +SEG+ILYT TILT SS +LQWLHDRMP Sbjct: 117 WKKDGSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSSALQWLHDRMP 176 Query: 631 VILGDDVSVDVWLN-NGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKR 807 VILGD S D WLN + + VL+PYED DLVWYPVT A+GKPSF+GP+CI EI+LK Sbjct: 177 VILGDKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIKEIQLKN 236 Query: 808 PVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESPQN 987 + I+KFF+ K KN+ + L P + + K+ EN + + Sbjct: 237 E-QRPISKFFSTKGI-KNE------QGLSNEPVKSNLPQSLKEEPAIENSTGLPSSTVKG 288 Query: 988 DMFNCMKEDCEHHVDEKHSS-----NGLLKKENVSPDIFG-------------------T 1095 D C + ++ S+ LK+E + D G Sbjct: 289 D----HDSTCSRSIPQEESTWFTNLPKSLKQEPETEDKTGLPFPGDHDSKCDEEATKLPI 344 Query: 1096 KRQTQEIPLDS---GSTSEKVSSLLKKARRVKNVDDKQASLLSYFGKA 1230 KR +E DS T EK S + KK + KN DKQ +L SYFGK+ Sbjct: 345 KRDFEEFSADSKPNTDTVEKPSPVTKKGKLNKNAGDKQPTLFSYFGKS 392 >ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula] gi|355497798|gb|AES79001.1| hypothetical protein MTR_7g052250 [Medicago truncatula] Length = 354 Score = 363 bits (932), Expect = 9e-98 Identities = 194/382 (50%), Positives = 240/382 (62%), Gaps = 6/382 (1%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTR--QIDRYRPSYNVSPGAYLPVLLLERTTGEAEA 273 MCGRTRC+L D V RAC + P+R IDRYRPS NVSPG +PV+ E Sbjct: 1 MCGRTRCSLRADDVPRAC-HRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESD 59 Query: 274 SPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEW 453 + CMKWGL+PSFTKKT+KPDHYKMFNARSES+ EK SF RLLP NRC+VAVEGFYEW Sbjct: 60 GHVVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEW 119 Query: 454 KKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPV 633 KKD SKKQPYYIHFKD RPLVFAALYDSW++SEG+ILYTFTI+T SS + +WLHDRMPV Sbjct: 120 KKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSSAFKWLHDRMPV 179 Query: 634 ILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPV 813 ILGD + D WL++ + V++PYE+ DLVWYPVT A+GKPSFDGP+CI EI++K Sbjct: 180 ILGDKDTTDTWLSSA-SSFKSVMKPYEESDLVWYPVTPAMGKPSFDGPECIKEIQIKTEG 238 Query: 814 ENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESPQNDM 993 I+KFF+KK + EH K ++ D ++A E + K Sbjct: 239 YIPISKFFSKKEAEVEDTKPEHKILSHEPVKTEQTKDVSEEAKTEEGDTDLK-------- 290 Query: 994 FNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDS----GSTSEKVSSLL 1161 S+G+ +NV+ F KR+ I DS + + ++ Sbjct: 291 -----------------SSGISPSQNVNR--FAIKREYDAISSDSKPSLANNDQVSANPA 331 Query: 1162 KKARRVKNVDDKQASLLSYFGK 1227 KK + K DDKQ +L SYFGK Sbjct: 332 KKKEKAKTADDKQPTLFSYFGK 353 >ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa] gi|222844806|gb|EEE82353.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa] Length = 367 Score = 362 bits (928), Expect = 3e-97 Identities = 196/389 (50%), Positives = 252/389 (64%), Gaps = 13/389 (3%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGF-AASIPTRQIDRYRPSYNVSPGAYLPVLLLERTTGEAEAS 276 MCGR RCTL D + RAC A++ + +DRYRPSYN SPG+ L V+ + AS Sbjct: 1 MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60 Query: 277 P----SICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGF 444 +I CMKWGL+P FTKK+EKPD YKMFNARSES+ EK SF RL+P +RC+VAVEGF Sbjct: 61 GGDGYAIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVAVEGF 120 Query: 445 YEWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDR 624 YEWKKD SKKQPYYIHFKD RPLVFAALYDSW++SEG+ILYTFTI+T +S ++QWLH+R Sbjct: 121 YEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQWLHER 180 Query: 625 MPVILGDDVSVDVWLN-NGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKL 801 MPVILGD + D WL+ + K + VL+PYE DLVWYPVT A+GKPSFDGP+CI EI L Sbjct: 181 MPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIKEIHL 240 Query: 802 KRPVENQIAKFFTKK--ADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEE 975 K + I+KFF++K + N E H K L+L PK + + EN + K E Sbjct: 241 KMEEKGTISKFFSRKEFKEESNPEESTHGKSLKLEPK----------SVKEENESEEKLE 290 Query: 976 SPQNDMFNCMKEDCEHHVDEK-----HSSNGLLKKENVSPDIFGTKRQTQEIPLDSGSTS 1140 +P C + ++ + + H K + ++ +K +T EI S + Sbjct: 291 TP------CSAKTVDYDLKSELETFSHEGETKCKTKRDREELVDSKLKTDEIVKPRASPA 344 Query: 1141 EKVSSLLKKARRVKNVDDKQASLLSYFGK 1227 +K ++L K+VDDKQ +LLSYFGK Sbjct: 345 KKKANL-------KSVDDKQPTLLSYFGK 366 >ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cicer arietinum] Length = 375 Score = 360 bits (923), Expect = 1e-96 Identities = 201/383 (52%), Positives = 251/383 (65%), Gaps = 7/383 (1%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTR--QIDRYRPSYNVSPGAYLPVLLLERTTGEAEA 273 MCGR RCTL PD + AC + PTR +DRYRPS+NVSPG ++PV+ E + E+E Sbjct: 21 MCGRGRCTLRPDDIPTAC-HRTTAPTRLLHVDRYRPSHNVSPGFHMPVVRREDAS-ESEG 78 Query: 274 SPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEW 453 + CMKWGL+PSFTKKTEKPDHY+MFNARSES+ EK SF RLLP NRC+VAVEGFYEW Sbjct: 79 HV-LHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKNRCLVAVEGFYEW 137 Query: 454 KKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPV 633 KKD SKKQPYYIHFKD RPLVFAALYDSW++SEG+ LYTFTI+T SS +LQWLHDRMPV Sbjct: 138 KKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSTLQWLHDRMPV 197 Query: 634 ILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPV 813 IL D S D WLN+ + VL+PYE+ DL WYPVT A+GKPSFDGP+CI EI++K Sbjct: 198 ILSDKDSTDTWLNSA-SSFKSVLKPYEECDLAWYPVTPAMGKPSFDGPECIKEIQVKAEG 256 Query: 814 ENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESPQNDM 993 I+KFF++K G+ + + K L L + + + T KD S K E ++D+ Sbjct: 257 NIPISKFFSRKG-GEGEDTKSGHKILSLCHEPVKTEQTTKDLSE-----GAKTEEGESDL 310 Query: 994 FNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDS----GSTSEKVSS-L 1158 S+G +NV+ F KR+ I DS G + +++ Sbjct: 311 ----------------KSSG-SSPQNVTK--FTVKREYDAISSDSKPSLGINDQVIANPP 351 Query: 1159 LKKARRVKNVDDKQASLLSYFGK 1227 KK + KN DDKQ +L S+FGK Sbjct: 352 TKKKEKAKNADDKQPTLFSFFGK 374 >ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 1 [Fragaria vesca subsp. vesca] Length = 366 Score = 358 bits (919), Expect = 3e-96 Identities = 198/386 (51%), Positives = 250/386 (64%), Gaps = 9/386 (2%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTRQI--DRYRPSYNVSPGAYLPVLLLERTTG-EAE 270 MCGR RCTL D ++RAC + P R + DRY+P YNVSPGA LPV+ R G + E Sbjct: 1 MCGRARCTLRADDISRAC-YRNHGPVRSVNMDRYQPRYNVSPGANLPVV--RRGDGADGE 57 Query: 271 ASPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYE 450 + CMKWGL+PSFTKKTEKPDHY+MFNARSES+ EK SF RL+P +RCVVAVEGFYE Sbjct: 58 DGVVLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCVVAVEGFYE 117 Query: 451 WKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMP 630 WKKD SKKQPYY+HFKD RPL+FAALYDSW++SEG+ LYTFTI+T SS +L WLHDRMP Sbjct: 118 WKKDGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSALGWLHDRMP 177 Query: 631 VILGDDVSVDVWLNNGMPKS-EIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKR 807 V+LGD SVD WL+ + + +L+PYE DLVWYPVT A+GK SFDGP+C EIKLK Sbjct: 178 VVLGDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSNEIKLKT 237 Query: 808 PVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESPQN 987 N I KFF+ K TK+ E++PK + D+ E++ N + E+ + Sbjct: 238 DGTNSITKFFSTKG----------TKKEEINPKDTSLHDSSVKTEFPESL-NEEPETKEE 286 Query: 988 DMFNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEI-----PLDSGSTSEKVS 1152 + CE + SS +L +E+ S + TKR +E PL + S + + Sbjct: 287 KVQPSSTVKCE----DSKSSVSILSQEDASKE--QTKRDYEEFLADSKPLPNESDKKSSA 340 Query: 1153 SLLKKARRVKNVDDKQASLLSYFGKA 1230 S KK +K DKQ +L SYF K+ Sbjct: 341 SPAKKKVNLKTSHDKQPTLFSYFRKS 366 >ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein-like [Glycine max] Length = 382 Score = 357 bits (917), Expect = 5e-96 Identities = 196/388 (50%), Positives = 250/388 (64%), Gaps = 11/388 (2%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTR--QIDRYRPSYNVSPGAYLPVLLLERTTGEAEA 273 MCGR RCTL D V RAC + S PTR IDRYRP+YNVSPG +PV+ + +G Sbjct: 1 MCGRARCTLRADDVPRACHRSTS-PTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGE-- 57 Query: 274 SPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEW 453 + CMKWGL+PSFTKKTEKPDHY+MFNARSES+ EK SF RLLP +RC+VAVEGFYEW Sbjct: 58 GYVLQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVAVEGFYEW 117 Query: 454 KKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPV 633 KKD SKKQPYYIHFKD RPLVFAALYDSW++SEG+ LYTFTI+T SS +LQWLHDRMPV Sbjct: 118 KKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQWLHDRMPV 177 Query: 634 ILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPV 813 ILG S D+WL++ + V++PYE+ DLVWYPVT+A+GK SFDGP+CI EI++K Sbjct: 178 ILGSKESTDIWLSSSASSFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIKEIQVKAQG 237 Query: 814 ENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDD--TKKDASRTENVAN--FKEESP 981 I+ FF+KK D + E K + +D KD + ++ F + P Sbjct: 238 NTSISMFFSKKGDESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQKTSSHEFVKTEP 297 Query: 982 QNDMFNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDSG----STSEKV 1149 D+ K + E D K +G +NVS + KR+ + + +++ Sbjct: 298 TEDLRERAKTE-EGGNDLKF--HGSSHSQNVS--MLPIKREYETFSAADSKPALANHDQI 352 Query: 1150 S-SLLKKARRVKNVDDKQASLLSYFGKA 1230 S + KK + K +DKQ +L SYFGK+ Sbjct: 353 SPNPAKKKEKAKTANDKQPTLFSYFGKS 380 >ref|XP_004982141.1| PREDICTED: UPF0361 protein C3orf37 homolog [Setaria italica] Length = 416 Score = 356 bits (914), Expect = 1e-95 Identities = 205/432 (47%), Positives = 256/432 (59%), Gaps = 56/432 (12%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGF----------------AASIPTRQIDRYRPSYNVSPGAYL 231 MCGR RCTL+ + ARA GF A ++ T +DR+RPSYNVSPGAYL Sbjct: 1 MCGRARCTLSAAQAARAFGFPTTTAAAAGSGGGAGDAPAVRTLDLDRFRPSYNVSPGAYL 60 Query: 232 PVLLLERTT--------GEAEASPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEK 387 PV + G A P I CMKWGLVPSFT KTEKPDH++MFNARSESVKEK Sbjct: 61 PVGTVRAQPAAGSDGGRGGDGAEPVIQCMKWGLVPSFTGKTEKPDHFRMFNARSESVKEK 120 Query: 388 PSFCRLLPTNRCVVAVEGFYEWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILY 567 SF RL+P NRC+VAVEGFYEWKKD SKKQPYYIHF+D RPLVFAALYD+W +SEG++++ Sbjct: 121 ASFRRLIPKNRCLVAVEGFYEWKKDGSKKQPYYIHFQDHRPLVFAALYDTWTNSEGEVIH 180 Query: 568 TFTILTVGSSKSLQWLHDRMPVILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTT 747 TFTILT +S SL+WLHDRMPVILGD+ SV+VWLN+ K E + PYE DLVWYPVT+ Sbjct: 181 TFTILTTRASTSLKWLHDRMPVILGDNDSVNVWLNDASVKLEEITSPYEGADLVWYPVTS 240 Query: 748 AVGKPSFDGPDCITEIKLKRPVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDT 927 A+GK SFDGP+CI E+ + P E I+KFFTKK+ +Q + P+ ++ Sbjct: 241 AMGKTSFDGPECIKELHM-GPSEKPISKFFTKKSTAHDQ---------SVKPEKTTLEFA 290 Query: 928 KKDASRTENVANFKEESPQNDMFNCMKEDCEHHVDEKHSSNGLLKKENVS---------- 1077 + +SR V +ES QN ED E+ +++ +K E VS Sbjct: 291 ETHSSRASKVE--CDESVQN-----QPEDVNQQHGEERTTSSTVKDEPVSLGPQVIGKPQ 343 Query: 1078 -----------------PDIFGTKRQTQEIPLDSGSTSEKVSS-----LLKKARRVKNVD 1191 D FG KR+ ++ + + V S KK + K Sbjct: 344 SIKDEDTMTSTGITIEKQDDFGIKRKIEDTEVKAEMMENSVWSCSRPTTTKKGKGAKAAP 403 Query: 1192 DKQASLLSYFGK 1227 D QASLLSYF + Sbjct: 404 DGQASLLSYFAR 415 >gb|EMJ00266.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica] Length = 363 Score = 355 bits (910), Expect = 3e-95 Identities = 195/383 (50%), Positives = 239/383 (62%), Gaps = 6/383 (1%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFA-ASIPTRQIDRYRPSYNVSPGAYLPVLLLERTTGEAEAS 276 MCGR RCTL D + RAC + + T +DR+RP +N SPG+ LPV+ R G Sbjct: 1 MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVV--RREDGGDGDG 58 Query: 277 PSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEWK 456 + CMKWGL+PSFTKKTEKPDHYKMFNARSES+ EK SF RL+P NRC++AVEGFYEWK Sbjct: 59 VVVHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIAVEGFYEWK 118 Query: 457 KDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPVI 636 KD SKKQPYY+HF D RPL+FAALYD W++SEG+ LYTFTI+T SS +L WLHDRMPVI Sbjct: 119 KDGSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSSALGWLHDRMPVI 178 Query: 637 LGDDVSVDVWLNNGMPKS-EIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPV 813 LGD S D WL+ + + +L+PYE DLVWYPVT A+GK SFDGP+CI EI+LK Sbjct: 179 LGDKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECINEIQLKTEG 238 Query: 814 ENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESPQNDM 993 N I KFF K K ++ + T + S K D K++ E K E P + Sbjct: 239 NNSITKFFMSKGTKKEELNPKDTSFYDSSVKNDLPKSVKEEPEGKE-----KTEQPAS-- 291 Query: 994 FNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDSG----STSEKVSSLL 1161 E CE+ S + +E VS TKR +E DS TSE +S Sbjct: 292 ----TEKCEN-----DSKGQTISQEGVSKG--QTKRDYEEFSADSKPVAYETSEMSASPA 340 Query: 1162 KKARRVKNVDDKQASLLSYFGKA 1230 KK K+ DKQ +L SYFGK+ Sbjct: 341 KKKVNPKSSVDKQPTLFSYFGKS 363 >gb|ESW12729.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris] Length = 353 Score = 351 bits (901), Expect = 4e-94 Identities = 196/386 (50%), Positives = 242/386 (62%), Gaps = 10/386 (2%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTRQI--DRYRPSYNVSPGAYLPVLLLERTTGEAEA 273 MCGRTRCTL D V RAC + PTR + DRYRP+YNVSPG+ +PV+ E EA Sbjct: 1 MCGRTRCTLRSDDVPRAC-HRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRRE------EA 53 Query: 274 SPS----ICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEG 441 S S + MKWGL+PSFTKKTEKPDHYKMFNARSES+ EK SF RLLP +RC+VAVEG Sbjct: 54 SDSGGYVLHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSRCLVAVEG 113 Query: 442 FYEWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHD 621 FYEWKKD SKKQPYYIHFKD R LVFAALYDSW++SEG+ L+TFTI+T SS +LQWLHD Sbjct: 114 FYEWKKDGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSSALQWLHD 173 Query: 622 RMPVILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKL 801 RMPVILG S D WL++ + V++PYE+ DLVWYPVT+A+GK SFDGP+CI EI++ Sbjct: 174 RMPVILGSKESTDTWLSSSASSFKSVMKPYEESDLVWYPVTSAMGKTSFDGPECIKEIQV 233 Query: 802 KRPVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESP 981 K I+ FF+K KG DTK + + + F + P Sbjct: 234 KAEGNTSISMFFSK--------------------KGAESKDTKPEQKLSSH--EFVKTEP 271 Query: 982 QNDMFNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDS----GSTSEKV 1149 D+ K + E D K S + K + P KR+ + DS + + Sbjct: 272 TEDLIEGAKAE-EGDNDLKFSGSSHSKNASTLP----IKREYETFSADSKPALANHDQIS 326 Query: 1150 SSLLKKARRVKNVDDKQASLLSYFGK 1227 S+ KK + K +DKQ +L SYFGK Sbjct: 327 SNPAKKKEKTKTANDKQPTLFSYFGK 352 >ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein-like isoform X1 [Citrus sinensis] Length = 398 Score = 346 bits (887), Expect = 2e-92 Identities = 197/404 (48%), Positives = 251/404 (62%), Gaps = 28/404 (6%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAAS-IPTRQIDRYRPSYNVSPGAYLPVLLLERTTGEAEAS 276 MCGR RCTL D + RAC S T +DRYRPSYNV+PG LPV+ R + E Sbjct: 1 MCGRARCTLRADDLPRACHRTGSPARTLNMDRYRPSYNVAPGWNLPVV---RRDDDGEGF 57 Query: 277 PSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEWK 456 + CMKWGL+PSFTKK EKPD YKMFNARSESV EK SF RLLP +RC+ AVEGFYEWK Sbjct: 58 V-LHCMKWGLIPSFTKKNEKPDFYKMFNARSESVTEKASFRRLLPKSRCLAAVEGFYEWK 116 Query: 457 KDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPVI 636 KD SKKQPYY+HFKD RPLVFAALYD+W+SSEG+ILYTFTILT SS +LQWLHDRMPVI Sbjct: 117 KDGSKKQPYYVHFKDGRPLVFAALYDTWQSSEGEILYTFTILTTSSSAALQWLHDRMPVI 176 Query: 637 LGDDVSVDVWLN-NGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPV 813 LGD S D WLN + K + +L+PYE+ DLVWYPVT +GK SF+GP+CI EI LK Sbjct: 177 LGDKESSDAWLNGSSSSKYDTILKPYEESDLVWYPVTPVMGKLSFNGPECIKEIPLKTEG 236 Query: 814 ENQIAKFFTK---------KADGKNQMEEEHTKRLELSPKGDRVDDTKKD-ASRTENVAN 963 +N I+ FF K K D K+ +E L KG+ + + K++ S E + Sbjct: 237 KNPISNFFLKKEIKKEQESKMDEKSSFDESVKTNLPKRMKGEPIKEIKEEPVSGLEEKYS 296 Query: 964 FKEESPQNDMFNCMKEDCEHHVDEKHSSN------------GLLKKENVSPDIFGTKRQT 1107 F + + Q ++ +K++ D + S+ +L E+ ++ KR Sbjct: 297 F-DTTAQTNLPKSVKDEAVTADDIRTQSSVEKGDPDTKSVASVLSDEDTKKEL--QKRDY 353 Query: 1108 QEIPLDS----GSTSEKVSSLLKKARRVKNVDDKQASLLSYFGK 1227 +E DS ++ +S LK+ VK+ +KQ +L SY+ K Sbjct: 354 KEFLADSKPVIDGNNKLETSPLKRKGNVKDAGEKQPTLFSYYSK 397 >ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda] gi|548853962|gb|ERN11922.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda] Length = 413 Score = 345 bits (886), Expect = 2e-92 Identities = 186/383 (48%), Positives = 243/383 (63%), Gaps = 6/383 (1%) Frame = +1 Query: 100 MCGRTRCTLNP-DRVARACGFAASIPTRQIDRYRPSYNVSPGAYLPVLLLERTTGEAEAS 276 MCGR RCTLNP + V RACGF A++PT RYR SYN++PGAYLPVL E+ E++ Sbjct: 40 MCGRARCTLNPVEDVPRACGFNANLPTLHTQRYRLSYNIAPGAYLPVLRKEQ---ESKHG 96 Query: 277 PSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEWK 456 + CMKWGLVPSFTKKTEKPDH+KMFNARSES++EK SF RL+P RC+V VEGFYEWK Sbjct: 97 YVVHCMKWGLVPSFTKKTEKPDHFKMFNARSESIQEKASFRRLVPNKRCLVVVEGFYEWK 156 Query: 457 KDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPVI 636 KD SKKQPYY+HF+D R LVFA LYD+W++SEG+ LYTFTILT S +L WLHDRMPVI Sbjct: 157 KDGSKKQPYYLHFRDGRALVFAGLYDTWENSEGEGLYTFTILTTRCSSALDWLHDRMPVI 216 Query: 637 LGDDVSVDVWLN-NGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPV 813 LG+ ++D WLN PK + +L+PYE DLVWYPVT A+GK F GP+CI EI+LK Sbjct: 217 LGNKEAIDAWLNITPSPKVDSLLQPYEGSDLVWYPVTPAMGKIFFAGPECIKEIQLKSEN 276 Query: 814 ENQIAKFFTKKADGKNQMEEEH-TKRLELSPKGDRVDDTKKDASRTENVANFKEESPQND 990 +N I+K F + + K + E K E S G +++++ ++ E + P +D Sbjct: 277 KNTISKLFMQSHNKKQPISEPSIRKAAEDSTHGHTFENSQEPSNTNE------DWEPIDD 330 Query: 991 MFNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQT---QEIPLDSGSTSEKVSSLL 1161 C+ E + K ++ + K++T +E P+ + Sbjct: 331 FKVCIGIKREASPGNAEETEKRRTKRDIEQLLVDPKKETIVGKENPISGEERQGYMDRGS 390 Query: 1162 KKARRVKNVDDKQASLLSYFGKA 1230 K + KQA+L SYFGK+ Sbjct: 391 HKNGMPRITGGKQANLFSYFGKS 413 >ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arabidopsis lyrata subsp. lyrata] gi|297326641|gb|EFH57061.1| hypothetical protein ARALYDRAFT_481505 [Arabidopsis lyrata subsp. lyrata] Length = 489 Score = 342 bits (876), Expect = 3e-91 Identities = 170/304 (55%), Positives = 219/304 (72%), Gaps = 4/304 (1%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTRQI--DRYRPSYNVSPGAYLPVLLLER-TTGEAE 270 MCGRTRCTL PD + RA ++PTR + DRYRPSYN++PG+Y+PVL E G+ Sbjct: 1 MCGRTRCTLRPDDIQRA-SHRHTVPTRSLHLDRYRPSYNIAPGSYIPVLRRENEVVGDGV 59 Query: 271 ASPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYE 450 + CMKWGLVP FTKKT+KPD +KMFNARSESV EK SF RLLP NRC+VAV+GFYE Sbjct: 60 V---VHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFYE 116 Query: 451 WKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMP 630 WKK+ SKKQPYYIHF+D RPLVFAAL+DSW++S G+ LYTFTILT SS LQWLHDRMP Sbjct: 117 WKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTTSSSPLQWLHDRMP 176 Query: 631 VILGDDVSVDVWLNN-GMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKR 807 VILGD SVD WL++ K + +L PYE DLVWYPVTTA+GKP+FDGP+CI +I LK Sbjct: 177 VILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTTAIGKPTFDGPECIQQIPLKA 236 Query: 808 PVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEESPQN 987 + I+KFF++K + ++ + + + K + + ++A+ +++V +E + Sbjct: 237 SQNSLISKFFSRKTEEGDKETKSTDANISVDLKEEPMVGGYEEATFSDSVKKIEELGGEK 296 Query: 988 DMFN 999 D+ N Sbjct: 297 DILN 300 >ref|XP_002527247.1| conserved hypothetical protein [Ricinus communis] gi|223533340|gb|EEF35091.1| conserved hypothetical protein [Ricinus communis] Length = 409 Score = 342 bits (876), Expect = 3e-91 Identities = 194/410 (47%), Positives = 247/410 (60%), Gaps = 34/410 (8%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTRQI--DRYRPSYNVSPGAYLPVLLLERTTGEAEA 273 MCGR RCTL D + RAC P R + DR+RPSYNVSPG+ +PV+ E + Sbjct: 1 MCGRARCTLRADDIPRACHRTTG-PVRSVNMDRWRPSYNVSPGSNMPVVCREGDGSDGGD 59 Query: 274 SPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEW 453 + CM WGL+PSFTKKTEKPD YKMFNARSESV EK SF RLLP +RC+VA EGFYEW Sbjct: 60 GFFVQCMTWGLIPSFTKKTEKPDFYKMFNARSESVGEKASFRRLLPKSRCLVAAEGFYEW 119 Query: 454 KKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPV 633 KKD SKKQPYYIHFKD RPLVFAALYDSW++SEG+ILYTFTILT SS +L+WLHDRMPV Sbjct: 120 KKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTILTTSSSSALEWLHDRMPV 179 Query: 634 ILGDDVSVDVWLN-NGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRP 810 ILGD S D WLN + K ++VL YE DLVW PVT A+GK SFDGP+C+ EI +K Sbjct: 180 ILGDKESTDTWLNGSSSSKYDVVLESYESSDLVWCPVTPAMGKSSFDGPECVKEIHVKTE 239 Query: 811 VENQIAKFFTKK-ADGKNQM---------------------EEEHTKRLELSPKGDRVD- 921 ++ I+KFF++K G+ ++ E E ++L++ P D Sbjct: 240 SKSTISKFFSRKEIKGEQELNSRESTFDKSVKMDLPESVKEEYESEEKLDIPPSNQINDQ 299 Query: 922 DTKKDASRTENVANFKEESPQNDMFNCMKEDCEH---HVDEKHSSNGLLKKENVSPDIFG 1092 D K + S K + P +D C D + + + + + K + + Sbjct: 300 DLKSNVSTIPCEDETKCQIPDHDETKCQIPDHDETKCQIPDHDLISNVSKLPHEDATLGQ 359 Query: 1093 TKRQTQEIPLD-----SGSTSEKVSSLLKKARRVKNVDDKQASLLSYFGK 1227 KR +E +D G+ + + KKA +K+ DKQ +LLSYF K Sbjct: 360 PKRHHEEALIDRELNPDGNEKLRRNPARKKA-NLKSGGDKQPTLLSYFRK 408 >ref|NP_001144583.1| uncharacterized protein LOC100277594 [Zea mays] gi|195644134|gb|ACG41535.1| hypothetical protein [Zea mays] Length = 408 Score = 342 bits (876), Expect = 3e-91 Identities = 198/422 (46%), Positives = 247/422 (58%), Gaps = 46/422 (10%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAAS------------IPTRQIDRYRPSYNVSPGAYLPVLL 243 MCGR RCTL+P VARA GF + +PT ++R+RPSYNV PGAYLPV Sbjct: 1 MCGRARCTLSPAEVARAFGFPTTSANAGGGGDGPAVPTLHLNRFRPSYNVLPGAYLPVGA 60 Query: 244 LERTTGEAEAS-------PSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCR 402 + G A P I CMKWGLVPSFT K EKPDH++MFNARSESVKEK SF R Sbjct: 61 MRALPGCAHGGGGSDGEGPVIQCMKWGLVPSFTGKAEKPDHFRMFNARSESVKEKVSFRR 120 Query: 403 LLPTNRCVVAVEGFYEWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTIL 582 L+ NRC+VAVEGFYEWKK+ SKKQPYYIHF+D RPLVFAALYD+W +SEG+I +TFTIL Sbjct: 121 LIQKNRCLVAVEGFYEWKKNGSKKQPYYIHFQDHRPLVFAALYDAWTNSEGEITHTFTIL 180 Query: 583 TVGSSKSLQWLHDRMPVILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKP 762 T +S SL WLHDRMPVILG VD WLN+ K E + PYE DLVWYPVT+A+GK Sbjct: 181 TTHASTSLNWLHDRMPVILGSKDYVDAWLNDVSVKLEEITAPYEGADLVWYPVTSALGKA 240 Query: 763 SFDGPDCITEIKLKRPVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDAS 942 SFDGP+CI E+ + + I+KFFTKK+ +LS K + + A Sbjct: 241 SFDGPECIKEVHI-GATDKPISKFFTKKSTA-----------YDLSGKYENMSRELAHAY 288 Query: 943 RTENVANFKEESPQNDMFNCMKEDCEHHVDEKHSSNGLLKKENVS--PDIF--------- 1089 + V + N + +H EK ++N +K E V+ P +F Sbjct: 289 KAAKV------ECDGSVENQGGDGNQHQSREKQTTNCTIKDEPVTLEPQVFETPWSIEHE 342 Query: 1090 ----------------GTKRQTQEIPLDSGSTSEKVSSLLKKARRVKNVDDKQASLLSYF 1221 G KR+ ++ +++ S K S L +K + VK D QASLLSYF Sbjct: 343 DTMTLAGATLETQRDLGFKRKIEDTQVEA---SMKPSQLTRKEKAVKAASDGQASLLSYF 399 Query: 1222 GK 1227 + Sbjct: 400 AR 401 >ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] gi|26449484|dbj|BAC41868.1| unknown protein [Arabidopsis thaliana] gi|29028900|gb|AAO64829.1| At2g26470 [Arabidopsis thaliana] gi|330252748|gb|AEC07842.1| uncharacterized protein AT2G26470 [Arabidopsis thaliana] Length = 487 Score = 339 bits (870), Expect = 1e-90 Identities = 174/309 (56%), Positives = 223/309 (72%), Gaps = 6/309 (1%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTR--QIDRYRPSYNVSPGAYLPVLLL--ERTTGEA 267 MCGRTRCTL PD V RA ++PTR +DRYRPSYNV+PG+Y+PVL E G+ Sbjct: 1 MCGRTRCTLRPDDVPRA-SHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDG 59 Query: 268 EASPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFY 447 + CMKWGLVPSFTKKT+KPD +KMFNARSESV EK SF RLLP NRC+VAV+GFY Sbjct: 60 VV---VHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVAVDGFY 116 Query: 448 EWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRM 627 EWKK+ SKKQPYYIHF+D RPLVFAAL+D+W++S G+ LYTFTILT SS +LQWLHDRM Sbjct: 117 EWKKEGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTFTILTTASSSALQWLHDRM 176 Query: 628 PVILGDDVSVDVWLNN-GMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLK 804 PVILGD S+D WL++ K + +L PYE DLVWYPVT+A+GKP+FDGP+CI +I LK Sbjct: 177 PVILGDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQQIPLK 236 Query: 805 RPVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGD-RVDDTKKDASRTENVANFKEESP 981 + I+KFF+ K ++ ++E TK + + D + + T + + ++++ +E Sbjct: 237 TSQNSLISKFFSTKQPKTDEGDKE-TKSTDANIIVDLKKEPTAEKDTFSDSIKKIEELDG 295 Query: 982 QNDMFNCMK 1008 + DM N K Sbjct: 296 EKDMSNVAK 304 >gb|ACF82411.1| unknown [Zea mays] gi|414588288|tpg|DAA38859.1| TPA: hypothetical protein ZEAMMB73_572218 [Zea mays] Length = 408 Score = 339 bits (870), Expect = 1e-90 Identities = 197/422 (46%), Positives = 247/422 (58%), Gaps = 46/422 (10%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAAS------------IPTRQIDRYRPSYNVSPGAYLPVLL 243 MCGR RCTL+P VARA GF + +PT ++R+RPSYNV PGAYLPV Sbjct: 1 MCGRARCTLSPAEVARAFGFPTTSANAGGGGDGPAVPTLHLNRFRPSYNVLPGAYLPVGA 60 Query: 244 LERTTGEAEAS-------PSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCR 402 + G A P I CMKWGLVPSFT K EKPD+++MFNARSESVKEK SF R Sbjct: 61 MRALPGCAHGGGGSDGEGPVIQCMKWGLVPSFTGKAEKPDYFRMFNARSESVKEKVSFRR 120 Query: 403 LLPTNRCVVAVEGFYEWKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTIL 582 L+ NRC+VAVEGFYEWKK+ SKKQPYYIHF+D RPLVFAALYD+W +SEG+I +TFTIL Sbjct: 121 LIQKNRCLVAVEGFYEWKKNGSKKQPYYIHFQDHRPLVFAALYDAWTNSEGEITHTFTIL 180 Query: 583 TVGSSKSLQWLHDRMPVILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKP 762 T +S SL WLHDRMPVILG VD WLN+ K E + PYE DLVWYPVT+A+GK Sbjct: 181 TTHASTSLNWLHDRMPVILGSKDYVDAWLNDVSVKLEEITAPYEGADLVWYPVTSALGKA 240 Query: 763 SFDGPDCITEIKLKRPVENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDAS 942 SFDGP+CI E+ + + I+KFFTKK+ +LS K + + A Sbjct: 241 SFDGPECIKEVHI-GATDKPISKFFTKKSTA-----------YDLSGKYENMSRELAHAY 288 Query: 943 RTENVANFKEESPQNDMFNCMKEDCEHHVDEKHSSNGLLKKENVS--PDIF--------- 1089 + V + N + +H EK ++N +K E V+ P +F Sbjct: 289 KAAKV------ECDGSVENQGGDGNQHQSREKQTTNCTIKDEPVTLEPQVFETPWSIEHE 342 Query: 1090 ----------------GTKRQTQEIPLDSGSTSEKVSSLLKKARRVKNVDDKQASLLSYF 1221 G KR+ ++ +++ S K S L +K + VK D QASLLSYF Sbjct: 343 DTMTLAGATLETQRDLGFKRKIEDTQVEA---SMKPSQLTRKEKAVKAASDGQASLLSYF 399 Query: 1222 GK 1227 + Sbjct: 400 AR 401 >gb|EOX93768.1| Uncharacterized protein TCM_002685 [Theobroma cacao] Length = 360 Score = 338 bits (867), Expect = 3e-90 Identities = 189/382 (49%), Positives = 239/382 (62%), Gaps = 6/382 (1%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFAASIPTRQI--DRYRPSYNVSPGAYLPVLLLERTTGE-AE 270 MCGR RCTL D + RA P R + DRYRPSYNV PG LPV+ R G + Sbjct: 1 MCGRARCTLRADDIPRA-SHRNDGPVRHVHMDRYRPSYNVGPGMNLPVV--RRDDGSNGD 57 Query: 271 ASPSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYE 450 + CMKWGL+PSFTKKT+KPD YKMFNARSESV EK SF RLLP +RC+VAVEGFYE Sbjct: 58 GGVVLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSRCLVAVEGFYE 117 Query: 451 WKKDASKKQPYYIHFKDSRPLVFAALYDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMP 630 WKKD SKKQPYYIHFKD RPLVFAALYD W++SEG+ LYTFTILT SS + WLHDRMP Sbjct: 118 WKKDGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSSAFLWLHDRMP 177 Query: 631 VILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRP 810 VILGD S D WLN K + +L+PYE+ DLVWYPVT+A+GK SF+GP+C+ E+ LK Sbjct: 178 VILGDKESTDTWLNG--TKIDTLLKPYENPDLVWYPVTSAIGKLSFEGPECVKEVPLKTQ 235 Query: 811 VENQIAKFFTKKADGKNQMEEEHTKRLELSPKGDRVDDTKKDASRTENVANFKEE--SPQ 984 +N I+KFF+ + +++ E +E S + V +T + N KEE SP+ Sbjct: 236 EKNPISKFFSTR-----EVKREQESNMEKSLCDESV--------QTNLLKNLKEEPNSPE 282 Query: 985 NDMFNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDSGSTSEKVS-SLL 1161 + + ++ S + +L TKR +E D+ +++ S Sbjct: 283 DKEIPSLASK-----EDNDSKSSVLVPTCEDVRKCQTKRDYEEFSADTKPAKDEIEVSPA 337 Query: 1162 KKARRVKNVDDKQASLLSYFGK 1227 +K +K V KQ +L +YFGK Sbjct: 338 RKKGNIKGVAGKQPTLFAYFGK 359 >gb|EMS46705.1| hypothetical protein TRIUR3_27289 [Triticum urartu] Length = 368 Score = 337 bits (864), Expect = 7e-90 Identities = 187/369 (50%), Positives = 242/369 (65%), Gaps = 21/369 (5%) Frame = +1 Query: 184 IDRYRPSYNVSPGAYLPVLLLERTT-----GEAEASPSICCMKWGLVPSFTKKTEKPDHY 348 +DR+RPSYNV+PGAYLPV L G E P I CMKWGLVPSF+ KT+KPDH+ Sbjct: 1 MDRFRPSYNVTPGAYLPVGTLRARAAGGEGGAEEQGPVIQCMKWGLVPSFSSKTDKPDHF 60 Query: 349 KMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEWKKDASKKQPYYIHFKDSRPLVFAAL 528 +MFNARSES+KEK SF RL+P NRC+VAVEGFYEWKKD SKKQPYYIHF+D RPLVFAAL Sbjct: 61 RMFNARSESIKEKASFRRLIPKNRCLVAVEGFYEWKKDGSKKQPYYIHFQDERPLVFAAL 120 Query: 529 YDSWKSSEGDILYTFTILTVGSSKSLQWLHDRMPVILGDDVSVDVWLNNGMPKSEIVLRP 708 +D+W +SEG+ L+TFTILT S SL+WLHDRMPVILGD+ SV+ WLNN K E + P Sbjct: 121 FDTWTNSEGETLHTFTILTTHVSTSLKWLHDRMPVILGDEDSVNAWLNNSSVKLEEITVP 180 Query: 709 YEDEDLVWYPVTTAVGKPSFDGPDCITEIKLKRPVENQIAKFFTKKADGKNQMEEEHTKR 888 YE DLVWYPVTTA+GK SF GPDCI E+K+ P E I+ FFTKKA + E+ + Sbjct: 181 YEGTDLVWYPVTTAMGKTSFQGPDCIKEVKI-GPSEKPISNFFTKKAAAPVKSEKASGEF 239 Query: 889 LEL----SPKGDRVDDTKKDAS-RTE--NVANFKEESPQNDMFNCMKEDCEHHVDEKHSS 1047 E + K +R DD+ ++ S +TE + A+ +++S + + + V K + Sbjct: 240 AETQAFKTAKEERDDDSGENPSNKTEQHHQASLEKQSASSTVVKNEHVTLDPQVFYK-AD 298 Query: 1048 NGLLKKENVSP-------DIFGTKRQTQEIPLDSGSTSEKV--SSLLKKARRVKNVDDKQ 1200 G+ K++ + P D FG KR+ ++ +++ K S + ++ K D Q Sbjct: 299 EGIKKEDGMLPDDPVEERDPFGIKRKIEDAGVEAEMEMGKSGRSPVTPVRKKEKGPKDGQ 358 Query: 1201 ASLLSYFGK 1227 ASL SYF K Sbjct: 359 ASLFSYFAK 367 >gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis] Length = 469 Score = 332 bits (852), Expect = 2e-88 Identities = 186/372 (50%), Positives = 228/372 (61%), Gaps = 11/372 (2%) Frame = +1 Query: 100 MCGRTRCTLNPDRVARACGFA-ASIPTRQIDRYRPSYNVSPGAYLPVLLLERTTGEAEAS 276 MCGR RCTL D V RAC S+ T +DRYRPSYNVSPG+ +PV+ R G Sbjct: 1 MCGRARCTLRADDVPRACHRNNGSVRTVNMDRYRPSYNVSPGSNIPVV--RREDGSDGEG 58 Query: 277 PSICCMKWGLVPSFTKKTEKPDHYKMFNARSESVKEKPSFCRLLPTNRCVVAVEGFYEWK 456 + CMKWGL+PSFTKKT+KPDHYKMFNARSES+ EK SF RL+P +RC+VAVEGFYEWK Sbjct: 59 FVVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIGEKVSFRRLIPKSRCLVAVEGFYEWK 118 Query: 457 KDASKKQPYYIHFKDSRPLVFAALYDSWKS--------SEGDILYTFTILTVGSSKSLQW 612 KD SKKQPYYIHFKD RPLVFAALYDSW++ G+ILYTFTILT+ SS +L W Sbjct: 119 KDGSKKQPYYIHFKDGRPLVFAALYDSWENYLVTAIVIPAGEILYTFTILTISSSSALGW 178 Query: 613 LHDRMPVILGDDVSVDVWLNNGMPKSEIVLRPYEDEDLVWYPVTTAVGKPSFDGPDCITE 792 LHDRMPVI GD S D WL K +L+PYED DLVWYPVT A+GKPSFDGP+CI E Sbjct: 179 LHDRMPVIFGDKESSDAWLTGSSSKVGALLKPYEDPDLVWYPVTPAMGKPSFDGPECI-E 237 Query: 793 IKLKRPVENQIAKFFTKKADGKNQMEEEHTKRLELSPK--GDRVDDTKKDASRTENVANF 966 +KLK I+KFF+ K K +L+P+ +VD K + E+ AN Sbjct: 238 MKLKADGNIPISKFFSAKGT---------KKEADLNPEESSSKVDSAKCLEEKPESKAN- 287 Query: 967 KEESPQNDMFNCMKEDCEHHVDEKHSSNGLLKKENVSPDIFGTKRQTQEIPLDSGSTSEK 1146 + + K + + S G +K + KR +++ DS S +++ Sbjct: 288 -----RGPFSSTEKGEADSKSSVSSFSQGGAEKCQI-------KRDHEKLSADSKSNTDE 335 Query: 1147 VSSLLKKARRVK 1182 L R K Sbjct: 336 TKKLFDSPGRKK 347