BLASTX nr result
ID: Catharanthus22_contig00015139
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00015139
         (1483 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Popu...   379   e-102
ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   376   e-101
ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog i...   375   e-101
ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [...   371   e-100
gb|EMJ00266.1| hypothetical protein PRUPE_ppa018685mg [Prunus pe...   370   e-100
gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis]     369   1e-99
gb|EOX93768.1| Uncharacterized protein TCM_002685 [Theobroma cacao]   369   2e-99
gb|ESW12729.1| hypothetical protein PHAVU_008G137400g [Phaseolus...   366   2e-98
ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago ...   365   3e-98
ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hy...   365   3e-98
ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hy...   363   1e-97
ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] ...   351   5e-94
ref|XP_006293960.1| hypothetical protein CARUB_v10022949mg [Caps...   350   7e-94
ref|XP_006293959.1| hypothetical protein CARUB_v10022949mg [Caps...   350   7e-94
ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arab...   349   2e-93
ref|XP_006294385.1| hypothetical protein CARUB_v10023401mg, part...   345   2e-92
ref|XP_006403078.1| hypothetical protein EUTSA_v10003450mg [Eutr...   344   6e-92
ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 prot...
344 6e-92 ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [A... 342 2e-91 ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [... 342 2e-91 >ref|XP_002303080.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa] gi|222844806|gb|EEE82353.1| hypothetical protein POPTR_0002s25190g [Populus trichocarpa] Length = 367 Score = 379 bits (973), Expect = e-102 Identities = 204/407 (50%), Positives = 247/407 (60%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA H N VR V+MDRYRPSYN SPG NL V+RR Sbjct: 1 MCGRARCTLRADDIPRACHRNTATVRSVNMDRYRPSYNASPGSNLAVVRRDDAASGDGAS 60 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 +HCMKWGLIP FTKK+EKPD YKMFNARSES+ EKASFRRL+P +RCLV Sbjct: 61 GGDGY----AIHCMKWGLIPGFTKKSEKPDFYKMFNARSESLSEKASFRRLIPKSRCLVA 116 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI + W Sbjct: 117 VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTAASSAIQW 176 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LH+RMPVI G+KE+ + WL+ ++ FDT+LKPYE DL WYPVTPAMGKPSFDGPECIK Sbjct: 177 LHERMPVILGDKEATDTWLSVSSNSKFDTVLKPYEHSDLVWYPVTPAMGKPSFDGPECIK 236 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQA 443 EI LK E TIS+FFS+K + S P +T ++ ++ P +E+ +E+ ++ Sbjct: 237 EIHLKMEEKGTISKFFSRKEFKEE--SNP-EESTHGKSLKLEPKSVKEENESEEKLETPC 293 Query: 442 ITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDES 263 H+G KR+ EEL K +E K SP +K+ Sbjct: 294 SAKTVDYDLKSELETFSHEGETKCKTKRDREEL-VDSKLKTDEIVKPRASPAKKKAN--- 349 Query: 262 GXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122 S+D+ KQPTLLSYFGK Sbjct: 350 ------------LKSVDD------------------KQPTLLSYFGK 366 >ref|XP_004492204.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cicer arietinum] Length = 375 Score = 376 bits (965), Expect = e-101 Identities = 198/358 (55%), Positives = 241/358 (67%), Gaps = 1/358 (0%) 
Frame = -1 Query: 1348 EEMCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXX 1169 +EMCGR RCTLRPDD A H R + +DRYRPS+NVSPGF++PV+RR Sbjct: 19 DEMCGRGRCTLRPDDIPTACHRTTAPTRLLHVDRYRPSHNVSPGFHMPVVRREDASESEG 78 Query: 1168 XXXXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCL 989 VLHCMKWGLIPSFTKKTEKPDHY+MFNARSES+ EKASFRRL+P NRCL Sbjct: 79 H----------VLHCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKNRCL 128 Query: 988 VEVEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXL 809 V VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI L Sbjct: 129 VAVEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSTL 188 Query: 808 AWLHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPEC 629 WLHDRMPVI +K+S + WLN+ +++F ++LKPYEE DLAWYPVTPAMGKPSFDGPEC Sbjct: 189 QWLHDRMPVILSDKDSTDTWLNS--ASSFKSVLKPYEECDLAWYPVTPAMGKPSFDGPEC 246 Query: 628 IKEIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPP-KTEEDTANEDPKD 452 IKEIQ+KA N IS+FFS+KG G +++ + + H P KTE+ T KD Sbjct: 247 IKEIQVKAEGNIPISKFFSRKG-----GEGEDTKSGHKILSLCHEPVKTEQTT-----KD 296 Query: 451 SQAITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKR 278 + S Q ++KREY+ +S+ K S D+ +PP K+ Sbjct: 297 LSEGAKTEEGESDLKSSGSSPQNVTKFTVKREYDAISSDSKPSLGINDQVIANPPTKK 354 >ref|XP_004290141.1| PREDICTED: UPF0361 protein C3orf37 homolog isoform 1 [Fragaria vesca subsp. 
vesca] Length = 366 Score = 375 bits (962), Expect = e-101 Identities = 192/356 (53%), Positives = 237/356 (66%), Gaps = 1/356 (0%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA + N VR V+MDRY+P YNVSPG NLPV+RR Sbjct: 1 MCGRARCTLRADDISRACYRNHGPVRSVNMDRYQPRYNVSPGANLPVVRRGDGADGEDG- 59 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VVLHCMKWGLIPSFTKKTEKPDHY+MFNARSES+ EKASFRRLVP +RC+V Sbjct: 60 --------VVLHCMKWGLIPSFTKKTEKPDHYRMFNARSESICEKASFRRLVPKSRCVVA 111 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYY+HF+D RP++FAAL+DSW+NS+GE LYTFTI L W Sbjct: 112 VEGFYEWKKDGSKKQPYYVHFKDGRPLLFAALYDSWENSEGEKLYTFTIITTSSSSALGW 171 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPV+ G+KES++ WL+ ++NFD +LKPYE DL WYPVTPAMGK SFDGPEC Sbjct: 172 LHDRMPVVLGDKESVDTWLDGSSASNFDKLLKPYEGPDLVWYPVTPAMGKVSFDGPECSN 231 Query: 622 EIQLKANENRTISEFFSKKGAGRQP-GSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQ 446 EI+LK + +I++FFS KG ++ K S + + TE P E+ ++ K Sbjct: 232 EIKLKTDGTNSITKFFSTKGTKKEEINPKDTSLHDSSVKTEF-PESLNEEPETKEEKVQP 290 Query: 445 AITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKR 278 + T + S + A+ KR+YEE K E+DK+ + P K+ Sbjct: 291 SSTVKCEDSKSSVSILS-QEDASKEQTKRDYEEFLADSKPLPNESDKKSSASPAKK 345 >ref|XP_003635244.1| PREDICTED: UPF0361 protein C3orf37 homolog [Vitis vinifera] gi|296090568|emb|CBI40918.3| unnamed protein product [Vitis vinifera] Length = 392 Score = 371 bits (952), Expect = e-100 Identities = 185/292 (63%), Positives = 212/292 (72%), Gaps = 2/292 (0%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLRPD+ RA +LN +++ MDRYRPSYNVSPG NLPV+RR Sbjct: 1 MCGRARCTLRPDNIARACNLNTLPTQNIQMDRYRPSYNVSPGANLPVVRRGGGTEGEE-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 ++HCMKWGL+PSFTKK+EKPDHYKMFNARSES+ EKASFRRLVP NRCLV Sbjct: 59 
--------AIVHCMKWGLVPSFTKKSEKPDHYKMFNARSESVCEKASFRRLVPKNRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIH +D RP+VFAALFDSW NS+GE LYT TI L W Sbjct: 111 VEGFYEWKKDGSKKQPYYIHLKDGRPLVFAALFDSWANSEGEILYTCTILTTSSSSALQW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KES + WLN S+ F+T+LKPYE+ DL WYPVT AMGKPSF+GPECIK Sbjct: 171 LHDRMPVILGDKESTDAWLNGSSSSQFNTVLKPYEDPDLVWYPVTQAMGKPSFEGPECIK 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPG--SKPYSRNTTEEATEIHPPKTEEDT 473 EIQLK NE R IS+FFS KG + G ++P N + E P E T Sbjct: 231 EIQLK-NEQRPISKFFSTKGIKNEQGLSNEPVKSNLPQSLKE--EPAIENST 279 >gb|EMJ00266.1| hypothetical protein PRUPE_ppa018685mg [Prunus persica] Length = 363 Score = 370 bits (951), Expect = e-100 Identities = 197/407 (48%), Positives = 242/407 (59%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA H + VR V+MDR+RP +N SPG NLPV+RR Sbjct: 1 MCGRARCTLRADDIPRACHRSHGPVRTVNMDRFRPLFNASPGSNLPVVRREDGGDGDG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VV+HCMKWGLIPSFTKKTEKPDHYKMFNARSES+ EKASFRRL+P NRCL+ Sbjct: 59 --------VVVHCMKWGLIPSFTKKTEKPDHYKMFNARSESICEKASFRRLIPKNRCLIA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYY+HF D RP++FAAL+D W+NS+GE LYTFTI L W Sbjct: 111 VEGFYEWKKDGSKKQPYYVHFNDGRPLLFAALYDFWENSEGEKLYTFTIITTSSSSALGW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K S + WL+ ++NFD++LKPYE DL WYPVT AMGK SFDGPECI Sbjct: 171 LHDRMPVILGDKGSTDSWLSGSSTSNFDSLLKPYEGPDLVWYPVTQAMGKVSFDGPECIN 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQA 443 EIQLK N +I++FF KG ++ + + P +E+ ++ + A Sbjct: 231 EIQLKTEGNNSITKFFMSKGTKKEELNPKDTSFYDSSVKNDLPKSVKEEPEGKEKTEQPA 290 Query: 442 ITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDES 263 T +G + KR+YEE S K E + SP +K++ +S Sbjct: 291 
STEKCENDSKGQTIS--QEGVSKGQTKRDYEEFSADSKPVAYETSEMSASPAKKKVNPKS 348 Query: 262 GXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122 S+D+ QPTL SYFGK Sbjct: 349 --------------SVDK-------------------QPTLFSYFGK 362 >gb|EXB84512.1| hypothetical protein L484_015844 [Morus notabilis] Length = 469 Score = 369 bits (948), Expect = 1e-99 Identities = 200/369 (54%), Positives = 240/369 (65%), Gaps = 9/369 (2%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA H N VR V+MDRYRPSYNVSPG N+PV+RR Sbjct: 1 MCGRARCTLRADDVPRACHRNNGSVRTVNMDRYRPSYNVSPGSNIPVVRREDGSDGEGF- 59 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 V+HCMKWGLIPSFTKKT+KPDHYKMFNARSES+ EK SFRRL+P +RCLV Sbjct: 60 ---------VVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIGEKVSFRRLIPKSRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKN--------SKGEALYTFTIXXX 827 VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+N GE LYTFTI Sbjct: 111 VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWENYLVTAIVIPAGEILYTFTILTI 170 Query: 826 XXXXXLAWLHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPS 647 L WLHDRMPVIFG+KES + WL S+ +LKPYE+ DL WYPVTPAMGKPS Sbjct: 171 SSSSALGWLHDRMPVIFGDKESSDAWLTG-SSSKVGALLKPYEDPDLVWYPVTPAMGKPS 229 Query: 646 FDGPECIKEIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTAN 467 FDGPECI E++LKA+ N IS+FFS KG ++ P ++ ++ + K E AN Sbjct: 230 FDGPECI-EMKLKADGNIPISKFFSAKGTKKEADLNPEESSSKVDSAKCLEEK-PESKAN 287 Query: 466 EDPKDSQAITXXXXXXXXXXNFESFHQGAA-NISMKREYEELSTKMKHSDEEADKQHVSP 290 P S + SF QG A +KR++E+LS K + +E K SP Sbjct: 288 RGPFSS----TEKGEADSKSSVSSFSQGGAEKCQIKRDHEKLSADSKSNTDETKKLFDSP 343 Query: 289 PQKRLKDES 263 +K++K +S Sbjct: 344 GRKKVKLKS 352 >gb|EOX93768.1| Uncharacterized protein TCM_002685 [Theobroma cacao] Length = 360 Score = 369 bits (946), Expect = 2e-99 Identities = 191/356 (53%), Positives = 232/356 (65%), Gaps = 1/356 (0%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RASH N VRHV MDRYRPSYNV PG NLPV+RR Sbjct: 
1 MCGRARCTLRADDIPRASHRNDGPVRHVHMDRYRPSYNVGPGMNLPVVRRDDGSNGDGG- 59 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VVLHCMKWGLIPSFTKKT+KPD YKMFNARSES+ EKASFRRL+P +RCLV Sbjct: 60 --------VVLHCMKWGLIPSFTKKTDKPDFYKMFNARSESVCEKASFRRLLPKSRCLVA 111 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+D W+NS+GE LYTFTI W Sbjct: 112 VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDCWENSEGEKLYTFTILTTASSSAFLW 171 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KES + WLN T DT+LKPYE DL WYPVT A+GK SF+GPEC+K Sbjct: 172 LHDRMPVILGDKESTDTWLN---GTKIDTLLKPYENPDLVWYPVTSAIGKLSFEGPECVK 228 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKT-EEDTANEDPKDSQ 446 E+ LK E IS+FFS + R+ S ++ +E+ + + K +E+ + + K+ Sbjct: 229 EVPLKTQEKNPISKFFSTREVKREQESN-MEKSLCDESVQTNLLKNLKEEPNSPEDKEIP 287 Query: 445 AITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKR 278 ++ + KR+YEE S K + +E + VSP +K+ Sbjct: 288 SLASKEDNDSKSSVLVPTCEDVRKCQTKRDYEEFSADTKPAKDEIE---VSPARKK 340 >gb|ESW12729.1| hypothetical protein PHAVU_008G137400g [Phaseolus vulgaris] Length = 353 Score = 366 bits (939), Expect = 2e-98 Identities = 202/407 (49%), Positives = 248/407 (60%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR RCTLR DD RA H + R + MDRYRP+YNVSPG N+PV+RR Sbjct: 1 MCGRTRCTLRSDDVPRACHRSDAPTRTLHMDRYRPAYNVSPGSNMPVVRREEASDSGGY- 59 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VLH MKWGLIPSFTKKTEKPDHYKMFNARSES+ EKASFRRL+P +RCLV Sbjct: 60 ---------VLHSMKWGLIPSFTKKTEKPDHYKMFNARSESIDEKASFRRLLPKSRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIHF+D R +VFAAL+DSW+NS+GE L+TFTI L W Sbjct: 111 VEGFYEWKKDGSKKQPYYIHFKDGRRLVFAALYDSWQNSEGETLHTFTIVTTSSSSALQW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KES + WL++ S +F +++KPYEE DL WYPVT AMGK SFDGPECIK Sbjct: 
171 LHDRMPVILGSKESTDTWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKTSFDGPECIK 229 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANEDPKDSQA 443 EIQ+KA N +IS FFSKKGA +KP + ++ E + P + + A + D+ Sbjct: 230 EIQVKAEGNTSISMFFSKKGA-ESKDTKPEQKLSSHEFVKTEPTEDLIEGAKAEEGDND- 287 Query: 442 ITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDES 263 + S + A+ + +KREYE S K + D+ +P +K+ K ++ Sbjct: 288 ---------LKFSGSSHSKNASTLPIKREYETFSADSKPALANHDQISSNPAKKKEKTKT 338 Query: 262 GXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122 KQPTL SYFGK Sbjct: 339 A---------------------------------NDKQPTLFSYFGK 352 >ref|XP_003622783.1| hypothetical protein MTR_7g052250 [Medicago truncatula] gi|355497798|gb|AES79001.1| hypothetical protein MTR_7g052250 [Medicago truncatula] Length = 354 Score = 365 bits (937), Expect = 3e-98 Identities = 202/408 (49%), Positives = 249/408 (61%), Gaps = 1/408 (0%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR RC+LR DD RA H R + +DRYRPS NVSPGFN+PV+RR Sbjct: 1 MCGRTRCSLRADDVPRACHRTTAPSRLLHIDRYRPSNNVSPGFNIPVVRREDNASAESDG 60 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 V+HCMKWGLIPSFTKKT+KPDHYKMFNARSES+ EKASFRRL+P NRCLV Sbjct: 61 H--------VVHCMKWGLIPSFTKKTDKPDHYKMFNARSESIDEKASFRRLLPKNRCLVA 112 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI W Sbjct: 113 VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGEILYTFTIVTTSSSSAFKW 172 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K++ + WL++ +++F +++KPYEE DL WYPVTPAMGKPSFDGPECIK Sbjct: 173 LHDRMPVILGDKDTTDTWLSS--ASSFKSVMKPYEESDLVWYPVTPAMGKPSFDGPECIK 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEE-DTANEDPKDSQ 446 EIQ+K IS+FFSKK A +KP + + E P KTE+ +E+ K + Sbjct: 231 EIQIKTEGYIPISKFFSKKEA-EVEDTKPEHKILSHE-----PVKTEQTKDVSEEAKTEE 284 Query: 445 AITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSPPQKRLKDE 266 T S Q ++KREY+ +S+ K 
S D+ +P +K+ K + Sbjct: 285 GDTDLKSSGI------SPSQNVNRFAIKREYDAISSDSKPSLANNDQVSANPAKKKEKAK 338 Query: 265 SGXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSYFGK 122 + KQPTL SYFGK Sbjct: 339 TA---------------------------------DDKQPTLFSYFGK 353 >ref|XP_003532247.1| PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding protein-like [Glycine max] Length = 382 Score = 365 bits (936), Expect = 3e-98 Identities = 207/423 (48%), Positives = 253/423 (59%), Gaps = 16/423 (3%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA H + R + +DRYRP+YNVSPGF++PV+RR Sbjct: 1 MCGRARCTLRADDVPRACHRSTSPTRTLHIDRYRPAYNVSPGFDVPVVRRDDASGGEGY- 59 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VL CMKWGLIPSFTKKTEKPDHY+MFNARSES+ EKASFRRL+P +RCLV Sbjct: 60 ---------VLQCMKWGLIPSFTKKTEKPDHYRMFNARSESIDEKASFRRLLPKSRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIHF+D RP+VFAAL+DSW+NS+GE LYTFTI L W Sbjct: 111 VEGFYEWKKDGSKKQPYYIHFKDGRPLVFAALYDSWQNSEGETLYTFTIVTTSSSSALQW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KES ++WL++ S +F +++KPYEE DL WYPVT AMGK SFDGPECIK Sbjct: 171 LHDRMPVILGSKESTDIWLSSSAS-SFKSVMKPYEESDLVWYPVTSAMGKASFDGPECIK 229 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNT---------TEEATEIHPPKTEEDTA 470 EIQ+KA N +IS FFSKKG +KP + + TE+ TE K E+ T+ Sbjct: 230 EIQVKAQGNTSISMFFSKKG-DESKDTKPEQKASCPEVVKTEHTEDLTESKDTKPEQKTS 288 Query: 469 N------EDPKDSQAITXXXXXXXXXXNFESFH-QGAANISMKREYEELSTKMKHSDEEA 311 + E +D + S H Q + + +KREYE S A Sbjct: 289 SHEFVKTEPTEDLRERAKTEEGGNDLKFHGSSHSQNVSMLPIKREYETFSA-ADSKPALA 347 Query: 310 DKQHVSPPQKRLKDESGXXXXXXXXXXXTPSLDETDXXXXXXXXXXXXXXXXKQPTLLSY 131 + +SP + K+++ KQPTL SY Sbjct: 348 NHDQISPNPAKKKEKA-------------------------------KTANDKQPTLFSY 376 Query: 130 FGK 122 FGK Sbjct: 377 FGK 379 >ref|XP_006484827.1| PREDICTED: embryonic stem cell-specific 5-hydroxymethylcytosine-binding 
protein-like isoform X1 [Citrus sinensis] Length = 398 Score = 363 bits (932), Expect = 1e-97 Identities = 176/285 (61%), Positives = 207/285 (72%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA H G R ++MDRYRPSYNV+PG+NLPV+RR Sbjct: 1 MCGRARCTLRADDLPRACHRTGSPARTLNMDRYRPSYNVAPGWNLPVVRRDDDGEGF--- 57 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VLHCMKWGLIPSFTKK EKPD YKMFNARSES+ EKASFRRL+P +RCL Sbjct: 58 ---------VLHCMKWGLIPSFTKKNEKPDFYKMFNARSESVTEKASFRRLLPKSRCLAA 108 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYY+HF+D RP+VFAAL+D+W++S+GE LYTFTI L W Sbjct: 109 VEGFYEWKKDGSKKQPYYVHFKDGRPLVFAALYDTWQSSEGEILYTFTILTTSSSAALQW 168 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KES + WLN S+ +DTILKPYEE DL WYPVTP MGK SF+GPECIK Sbjct: 169 LHDRMPVILGDKESSDAWLNGSSSSKYDTILKPYEESDLVWYPVTPVMGKLSFNGPECIK 228 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPK 488 EI LK IS FF KK ++ SK +++ +E+ + + PK Sbjct: 229 EIPLKTEGKNPISNFFLKKEIKKEQESKMDEKSSFDESVKTNLPK 273 >ref|NP_180215.2| uncharacterized protein [Arabidopsis thaliana] gi|26449484|dbj|BAC41868.1| unknown protein [Arabidopsis thaliana] gi|29028900|gb|AAO64829.1| At2g26470 [Arabidopsis thaliana] gi|330252748|gb|AEC07842.1| uncharacterized protein AT2G26470 [Arabidopsis thaliana] Length = 487 Score = 351 bits (900), Expect = 5e-94 Identities = 170/297 (57%), Positives = 208/297 (70%), Gaps = 1/297 (0%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR RCTLRPDD RASH + R + +DRYRPSYNV+PG +PV+RR Sbjct: 1 MCGRTRCTLRPDDVPRASHRHTVPTRFLHLDRYRPSYNVAPGSYIPVLRRDNEEVVGDG- 59 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VV+HCMKWGL+PSFTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV Sbjct: 60 --------VVVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 111 Query: 982 
VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFD+W+NS GE LYTFTI L W Sbjct: 112 VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDTWQNSGGETLYTFTILTTASSSALQW 171 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K+SI+ WL+ P +T +L PYE+ DL WYPVT A+GKP+FDGPECI+ Sbjct: 172 LHDRMPVILGDKDSIDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 231 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEI-HPPKTEEDTANEDPK 455 +I LK ++N IS+FFS K G K ++ P E+DT ++ K Sbjct: 232 QIPLKTSQNSLISKFFSTKQPKTDEGDKETKSTDANIIVDLKKEPTAEKDTFSDSIK 288 >ref|XP_006293960.1| hypothetical protein CARUB_v10022949mg [Capsella rubella] gi|482562668|gb|EOA26858.1| hypothetical protein CARUB_v10022949mg [Capsella rubella] Length = 540 Score = 350 bits (899), Expect = 7e-94 Identities = 170/294 (57%), Positives = 207/294 (70%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR RCTLRPDD RASH +G + R + DRYRPSYNV+PG +PV+RR Sbjct: 1 MCGRTRCTLRPDDVPRASHRHGVQTRFLHTDRYRPSYNVAPGSYMPVLRRDNEVVGDG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV Sbjct: 59 --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW+NS GE LYTFTI L W Sbjct: 111 VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTASSSSLHW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K+S++ WL+ P +T +L PYE+ DL WYPVT A+GKP+FDGPECI+ Sbjct: 171 LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANED 461 +I LKA++N IS+FFS K PKT+++TA+ D Sbjct: 231 QITLKASQNSLISKFFSTK-----------------------HPKTDKETASTD 261 >ref|XP_006293959.1| hypothetical protein CARUB_v10022949mg [Capsella rubella] gi|482562667|gb|EOA26857.1| hypothetical protein 
CARUB_v10022949mg [Capsella rubella] Length = 521 Score = 350 bits (899), Expect = 7e-94 Identities = 170/294 (57%), Positives = 207/294 (70%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR RCTLRPDD RASH +G + R + DRYRPSYNV+PG +PV+RR Sbjct: 1 MCGRTRCTLRPDDVPRASHRHGVQTRFLHTDRYRPSYNVAPGSYMPVLRRDNEVVGDG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV Sbjct: 59 --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW+NS GE LYTFTI L W Sbjct: 111 VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTASSSSLHW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K+S++ WL+ P +T +L PYE+ DL WYPVT A+GKP+FDGPECI+ Sbjct: 171 LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANED 461 +I LKA++N IS+FFS K PKT+++TA+ D Sbjct: 231 QITLKASQNSLISKFFSTK-----------------------HPKTDKETASTD 261 >ref|XP_002880802.1| hypothetical protein ARALYDRAFT_481505 [Arabidopsis lyrata subsp. lyrata] gi|297326641|gb|EFH57061.1| hypothetical protein ARALYDRAFT_481505 [Arabidopsis lyrata subsp. 
lyrata] Length = 489 Score = 349 bits (895), Expect = 2e-93 Identities = 165/280 (58%), Positives = 203/280 (72%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR RCTLRPDD RASH + R + +DRYRPSYN++PG +PV+RR Sbjct: 1 MCGRTRCTLRPDDIQRASHRHTVPTRSLHLDRYRPSYNIAPGSYIPVLRRENEVVGDG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV Sbjct: 59 --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW+NS GE LYTFTI L W Sbjct: 111 VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWQNSGGETLYTFTILTTTSSSPLQW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K+S++ WL+ P +T +L PYE+ DL WYPVT A+GKP+FDGPECI+ Sbjct: 171 LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTTAIGKPTFDGPECIQ 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATE 503 +I LKA++N IS+FFS+K +K N + + E Sbjct: 231 QIPLKASQNSLISKFFSRKTEEGDKETKSTDANISVDLKE 270 >ref|XP_006294385.1| hypothetical protein CARUB_v10023401mg, partial [Capsella rubella] gi|482563093|gb|EOA27283.1| hypothetical protein CARUB_v10023401mg, partial [Capsella rubella] Length = 389 Score = 345 bits (886), Expect = 2e-92 Identities = 168/294 (57%), Positives = 205/294 (69%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGR CTLRPDD RASH +G + R + DRYRPSYNV+PG +PV+RR Sbjct: 1 MCGRTCCTLRPDDVPRASHRHGVQTRFLHTDRYRPSYNVAPGSYMPVLRRDNEVVGDG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VV+HCMKWGL+P FTKKT+KPD +KMFNARSES+ EK+SFRRL+P NRCLV Sbjct: 59 --------VVVHCMKWGLVPGFTKKTDKPDFFKMFNARSESVAEKSSFRRLLPKNRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 V+GFYEWKK+GSK+QPYYIHF+D RP+VFAALFDSW NS GE LYTFTI L W Sbjct: 111 VDGFYEWKKEGSKKQPYYIHFEDGRPLVFAALFDSWPNSGGETLYTFTILTAASSSALHW 170 Query: 
802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+K+S++ WL+ P +T +L PYE+ DL WYPVT A+GKP+FDGPECI+ Sbjct: 171 LHDRMPVILGDKDSVDTWLDDPSTTKLQPLLSPYEKSDLVWYPVTSAIGKPTFDGPECIQ 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTANED 461 +I LKA++N IS+FFS K PKT+++TA+ D Sbjct: 231 QITLKASQNSLISKFFSTK-----------------------HPKTDKETASTD 261 >ref|XP_006403078.1| hypothetical protein EUTSA_v10003450mg [Eutrema salsugineum] gi|557104185|gb|ESQ44531.1| hypothetical protein EUTSA_v10003450mg [Eutrema salsugineum] Length = 480 Score = 344 bits (882), Expect = 6e-92 Identities = 166/288 (57%), Positives = 201/288 (69%), Gaps = 1/288 (0%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLRPDD RASH +G R + +DRYRPSYNV+PG +PV+RR Sbjct: 1 MCGRARCTLRPDDVPRASHRHGVPARFLHLDRYRPSYNVAPGTYMPVLRRDNDG------ 54 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 + +HCMKWGL+PSFTKKT+KPD +KMFNARSES+ EKASFRRL+P NRCLV Sbjct: 55 --------IAVHCMKWGLVPSFTKKTDKPDFFKMFNARSESVAEKASFRRLLPKNRCLVA 106 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 V+GFYEWKK+GSK+QPYYIHF D RP+VFAALFDSW+NS GE L TFTI L W Sbjct: 107 VDGFYEWKKEGSKKQPYYIHFNDRRPLVFAALFDSWQNSGGETLDTFTILTTTSSSALDW 166 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI +KES++ WL+ P ++N +L PYE DL WYPVT A+GK FDGPECI+ Sbjct: 167 LHDRMPVILNDKESVDTWLDGPSTSNLKPLLVPYENSDLVWYPVTSAIGKLCFDGPECIQ 226 Query: 622 EIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEI-HPPKTE 482 +I LKA++N IS+FFS K G + + ++ PK E Sbjct: 227 QIPLKASQNSLISKFFSAKHPNTDEGDRETKSTDADTPVDLKEKPKVE 274 >ref|XP_004150365.1| PREDICTED: LOW QUALITY PROTEIN: UPF0361 protein C3orf37 homolog, partial [Cucumis sativus] Length = 344 Score = 344 bits (882), Expect = 6e-92 Identities = 177/358 (49%), Positives = 225/358 (62%), Gaps = 7/358 (1%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD 
RA H G VR ++MDR+RP +N SPG +LPV+RR Sbjct: 1 MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VVL CMKWGLIPSFT+K EKP+++KMFNARSES+ EKASF RLVP RCLV Sbjct: 59 --------VVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDG K+QPYYIHF+D +P+ AAL+D W+N +GE LYTFTI L W Sbjct: 111 VEGFYEWKKDGXKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KE ++MWLN S+ +D++LKPYE DL WYPVTP+MGKPSFDGP+CIK Sbjct: 171 LHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIK 230 Query: 622 EIQLKANENRTISEFFSKKGAGRQPG-------SKPYSRNTTEEATEIHPPKTEEDTANE 464 EIQLK + + IS+FFS K ++ S + + E H + ++E Sbjct: 231 EIQLKNDGSNLISKFFSAKETKKEYSVSQEKTCSNTSVKPEASPSLEEHKREVNRGASSE 290 Query: 463 DPKDSQAITXXXXXXXXXXNFESFHQGAANISMKREYEELSTKMKHSDEEADKQHVSP 290 + KD A + + +KR+ E++S+ +K ++ K SP Sbjct: 291 ESKDCLA--------------KCSSDTSLTYQIKRDREDISSDLKSGMDDYSKVGSSP 334 >ref|XP_006850341.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda] gi|548853962|gb|ERN11922.1| hypothetical protein AMTR_s00020p00243160 [Amborella trichopoda] Length = 413 Score = 342 bits (878), Expect = 2e-91 Identities = 169/296 (57%), Positives = 208/296 (70%), Gaps = 1/296 (0%) Frame = -1 Query: 1351 REEMCGRARCTLRP-DDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXX 1175 R++MCGRARCTL P +D RA N + + + RYR SYN++PG LPV+R+ Sbjct: 37 RKKMCGRARCTLNPVEDVPRACGFNAN-LPTLHTQRYRLSYNIAPGAYLPVLRKEQESKH 95 Query: 1174 XXXXXXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNR 995 V+HCMKWGL+PSFTKKTEKPDH+KMFNARSES++EKASFRRLVP R Sbjct: 96 GY-----------VVHCMKWGLVPSFTKKTEKPDHFKMFNARSESIQEKASFRRLVPNKR 144 Query: 994 CLVEVEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXX 815 CLV VEGFYEWKKDGSK+QPYY+HF+D R +VFA L+D+W+NS+GE LYTFTI Sbjct: 145 
CLVVVEGFYEWKKDGSKKQPYYLHFRDGRALVFAGLYDTWENSEGEGLYTFTILTTRCSS 204 Query: 814 XLAWLHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGP 635 L WLHDRMPVI GNKE+I+ WLN PS D++L+PYE DL WYPVTPAMGK F GP Sbjct: 205 ALDWLHDRMPVILGNKEAIDAWLNITPSPKVDSLLQPYEGSDLVWYPVTPAMGKIFFAGP 264 Query: 634 ECIKEIQLKANENRTISEFFSKKGAGRQPGSKPYSRNTTEEATEIHPPKTEEDTAN 467 ECIKEIQLK+ TIS+ F + +QP S+P R E++T H + ++ +N Sbjct: 265 ECIKEIQLKSENKNTISKLFMQSHNKKQPISEPSIRKAAEDSTHGHTFENSQEPSN 320 >ref|XP_004165094.1| PREDICTED: UPF0361 protein C3orf37 homolog [Cucumis sativus] Length = 267 Score = 342 bits (877), Expect = 2e-91 Identities = 164/263 (62%), Positives = 194/263 (73%) Frame = -1 Query: 1342 MCGRARCTLRPDDFIRASHLNGHRVRHVDMDRYRPSYNVSPGFNLPVIRRXXXXXXXXXX 1163 MCGRARCTLR DD RA H G VR ++MDR+RP +N SPG +LPV+RR Sbjct: 1 MCGRARCTLRADDITRACHRTGGPVRSLNMDRFRPLFNASPGSDLPVVRRDDESSDGG-- 58 Query: 1162 XXXXXXXGVVLHCMKWGLIPSFTKKTEKPDHYKMFNARSESMREKASFRRLVPTNRCLVE 983 VVL CMKWGLIPSFT+K EKP+++KMFNARSES+ EKASF RLVP RCLV Sbjct: 59 --------VVLQCMKWGLIPSFTEKFEKPNYFKMFNARSESIHEKASFHRLVPKRRCLVA 110 Query: 982 VEGFYEWKKDGSKRQPYYIHFQDERPMVFAALFDSWKNSKGEALYTFTIXXXXXXXXLAW 803 VEGFYEWKKDGSK+QPYYIHF+D +P+ AAL+D W+N +GE LYTFTI L W Sbjct: 111 VEGFYEWKKDGSKKQPYYIHFKDGQPLALAALYDCWENLEGELLYTFTILTTSSSPALKW 170 Query: 802 LHDRMPVIFGNKESIEMWLNAPPSTNFDTILKPYEEKDLAWYPVTPAMGKPSFDGPECIK 623 LHDRMPVI G+KE ++MWLN S+ +D++LKPYE DL WYPVTP+MGKPSFDGP+CIK Sbjct: 171 LHDRMPVILGDKERMDMWLNDSSSSKYDSVLKPYEAPDLVWYPVTPSMGKPSFDGPDCIK 230 Query: 622 EIQLKANENRTISEFFSKKGAGR 554 EIQLK + + IS+FFS K R Sbjct: 231 EIQLKNDGSNLISKFFSAKETKR 253