BLASTX nr result
ID: Mentha22_contig00024156
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00024156 (1135 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU31416.1| hypothetical protein MIMGU_mgv1a004094mg [Mimulus...  209   1e-51
ref|XP_006349926.1| PREDICTED: uncharacterized protein LOC102586...  176   2e-41
ref|XP_004252993.1| PREDICTED: uncharacterized protein LOC101261...  174   8e-41
emb|CBI17315.3| unnamed protein product [Vitis vinifera]             163   1e-37
ref|XP_002266466.2| PREDICTED: uncharacterized protein LOC100250...  161   5e-37
emb|CAN62042.1| hypothetical protein VITISV_006702 [Vitis vinifera]  160   1e-36
ref|XP_006420909.1| hypothetical protein CICLE_v10004416mg [Citr...  151   6e-34
ref|XP_006493819.1| PREDICTED: intracellular protein transport p...  147   8e-33
ref|XP_006493818.1| PREDICTED: intracellular protein transport p...  147   8e-33
ref|XP_002323777.2| hypothetical protein POPTR_0017s08220g [Popu...  141   4e-31
ref|XP_002518101.1| conserved hypothetical protein [Ricinus comm...  140   1e-30
ref|XP_004300993.1| PREDICTED: uncharacterized protein LOC101300...  138   4e-30
ref|XP_007227345.1| hypothetical protein PRUPE_ppa002653mg [Prun...  134   5e-29
ref|XP_007034368.1| Uncharacterized protein TCM_020328 [Theobrom...  130   1e-27
gb|EXC16951.1| Lysine-specific demethylase 3B [Morus notabilis]      128   4e-27
ref|XP_007049771.1| Uncharacterized protein TCM_002927 [Theobrom...  123   2e-25
gb|EXC25123.1| hypothetical protein L484_003047 [Morus notabilis]    119   2e-24
ref|XP_006840152.1| hypothetical protein AMTR_s00089p00065300 [A...  114   6e-23
ref|XP_004506884.1| PREDICTED: intracellular protein transport p...
114 1e-22 ref|XP_006576617.1| PREDICTED: uncharacterized protein LOC100815... 111 6e-22 >gb|EYU31416.1| hypothetical protein MIMGU_mgv1a004094mg [Mimulus guttatus] Length = 544 Score = 209 bits (533), Expect = 1e-51 Identities = 143/367 (38%), Positives = 181/367 (49%), Gaps = 5/367 (1%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MAC AVQMWSLS L AA+LDL I GF GLNLPCPC+G+F N+H Sbjct: 1 MACQAVQMWSLSNLAAAYLDLAIAYILLFASVVAYVASKFLGFLGLNLPCPCNGMFFNIH 60 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFG--DSCGNGIFELEG 371 SR++C N LLVD P QK+S+VQLS++ +FPF++S P+ +D S+ G +S NG+ E+EG Sbjct: 61 SRNICLNSLLVDFPTQKVSNVQLSIKHRFPFSDSTCPKNHDYSIIGGGNSNVNGVLEIEG 120 Query: 372 EVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXXXXX 551 + SCSS VSDAR+ +DMKGKG +SYR + Sbjct: 121 DASCSS-VSDARKP--------------VDMKGKGAVSYRQRGRFRKHRKASGSIGKYSS 165 Query: 552 XX-YDPPFHEEVTDSDTYHHFSTNKRGSWCTR--DGSPLTDSENHNLEYDKAPKKKRGRR 722 YD P HE Y H ST+K + T D P T E Sbjct: 166 VSSYDLPLHEP------YCHSSTDKGENGFTNGDDSKPSTTLET---------------- 203 Query: 723 RTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXXXX 902 N SSD +T++ + EELQ D+ I Sbjct: 204 --------NRSSDEETHVKRSTH--EELQISSL-------DDKTAIRLLEETLEEERTAR 246 Query: 903 XXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDIL 1082 KER ILRLQ EKA++EME+RQYQR+IEEKSAYD EEM+IL Sbjct: 247 AALYTELEKERSAAASAADEAMAMILRLQAEKAAVEMEARQYQRMIEEKSAYDAEEMNIL 306 Query: 1083 QEILMRR 1103 +EIL+RR Sbjct: 307 KEILVRR 313 >ref|XP_006349926.1| PREDICTED: uncharacterized protein LOC102586054 [Solanum tuberosum] Length = 745 Score = 176 bits (445), Expect = 2e-41 Identities = 130/371 (35%), Positives = 178/371 (47%), Gaps = 10/371 (2%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGI-FINV 194 MAC MWSLSG+V AF+DL I FFGL LPCPCDG+ F V Sbjct: 1 MACEGRYMWSLSGIVGAFVDLAIAYFLLCAATVAFLASKFLDFFGLRLPCPCDGLLFGTV 60 Query: 195 HSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGDSCG-------NG 353 +R+LCF+RLLVD P 
+K+S+VQLS++ FPF ++I + + V G +G Sbjct: 61 PNRNLCFHRLLVDFPAEKVSNVQLSIKANFPFNDTILGKDQNCDVNWRLIGHEKENSPHG 120 Query: 354 IFELEGEVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPK--XXXXXXXXXX 527 E+ E SC SSVSDAR+S + ++S R + +KGKG+++ R + Sbjct: 121 YLEMGDEASC-SSVSDARKSHNIAMIELSPR-NEFGIKGKGVMNQRQRGGVRRRRRKAAV 178 Query: 528 XXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKK 707 YDP + E G P S N Sbjct: 179 DYGRSSSVSSYDPQYEEFPL--------------------GPPSPPSTNKEDGGHPPLVM 218 Query: 708 KRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXX 887 + G+R + E+N SSD + +N+ S EEL+ + + + S+ E N+I Sbjct: 219 RLGQRDSF---ELNGSSDEVEHTEKNIASIEELRHNGEPVSSF--HEENRIRFLEQALEH 273 Query: 888 XXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDE 1067 KER ILRLQEEKASIEM++RQYQR+IEEKSA++ E Sbjct: 274 EREARDALCIELEKERNAAASAADEAMAMILRLQEEKASIEMDARQYQRLIEEKSAFEAE 333 Query: 1068 EMDILQEILMR 1100 EM+IL EI+MR Sbjct: 334 EMNILMEIVMR 344 >ref|XP_004252993.1| PREDICTED: uncharacterized protein LOC101261797 [Solanum lycopersicum] Length = 745 Score = 174 bits (440), Expect = 8e-41 Identities = 131/371 (35%), Positives = 179/371 (48%), Gaps = 10/371 (2%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGI-FINV 194 M+C MWSLSG+V AF+DL I FFGL LPCP DG+ F V Sbjct: 1 MSCEGRYMWSLSGIVGAFVDLAIAYFLLCAATVAFIASKFLDFFGLRLPCPGDGLLFGTV 60 Query: 195 HSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGS----VFGDSCGN---G 353 +R+LCF+RLLVD P +K+S+VQLS+R FPFT++I + + + G GN G Sbjct: 61 PNRNLCFHRLLVDFPAEKVSNVQLSIRANFPFTDTILGKDQNCDLNLRLIGQEKGNSPHG 120 Query: 354 IFELEGEVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPK--XXXXXXXXXX 527 E+ E SC SSVSDAR+S + ++S R + KGKG+++ R + Sbjct: 121 YLEMGDEASC-SSVSDARKSHNIAMIELSPR-NEFGQKGKGVMNQRQRGGVRRRRRKTAV 178 Query: 528 XXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKK 707 YDP + + G P S N A Sbjct: 179 DYGRSSSVSSYDPQYEDFPL--------------------GPPSPPSTNKEDGGHPALVM 218 Query: 708 KRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXX 887 + G+R + E+ SSD ++ +NV S 
EEL+ + + + S+ E N+I Sbjct: 219 RLGQRDSF---ELTGSSDEIEHIEKNVASIEELRHNGEPVSSF--HEGNRIRLLERALEH 273 Query: 888 XXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDE 1067 KER ILRLQEEKA+IEM++RQYQR+IEEKSA++ E Sbjct: 274 EREARDALCIELEKERNAAASAADEAMAMILRLQEEKAAIEMDARQYQRLIEEKSAFEAE 333 Query: 1068 EMDILQEILMR 1100 EM+IL EILMR Sbjct: 334 EMNILMEILMR 344 >emb|CBI17315.3| unnamed protein product [Vitis vinifera] Length = 797 Score = 163 bits (412), Expect = 1e-37 Identities = 122/380 (32%), Positives = 169/380 (44%), Gaps = 18/380 (4%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MAC + W+ GLV A+LDL I FFGL LPCPC+G F N + Sbjct: 1 MACQEIHSWTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNGFFGNPN 60 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPF----TNSISPRKNDGSVFGDSCGNGIFEL 365 + C + LVD P +++S VQL V+ KFPF N SP N + G + +G L Sbjct: 61 GDN-CLQKFLVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDGAVGL 119 Query: 366 EGEVSCSS------------SVSDARESADAVWRQISSRTGKIDMKGKGILSYRPK-XXX 506 EGE SCSS S +R + V + + GK D KGK + + RPK Sbjct: 120 EGEASCSSFWDVMRSPDIAGKDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGVR 179 Query: 507 XXXXXXXXXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRG-SWCTRDGSPLTDSENHNL 683 +DPP + S S ++ G ++ + P Sbjct: 180 RRRRSAVDHGKFSSVSSFDPPRLD--APSGLRSPSSVSETGEAFVGKTLVPDASGGEDGF 237 Query: 684 EYDKAPKKKRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIX 863 + + P R L ++N D D ++ S EE++ + +G S+ G+ N + Sbjct: 238 QDELVPILIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTVR 297 Query: 864 XXXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIE 1043 KER ILR+QEEKASIEME+RQ+QRIIE Sbjct: 298 VLEQALEEEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRIIE 357 Query: 1044 EKSAYDDEEMDILQEILMRR 1103 EKSAYD EEM++L+EIL+RR Sbjct: 358 EKSAYDAEEMNLLKEILLRR 377 >ref|XP_002266466.2| PREDICTED: uncharacterized protein LOC100250255 [Vitis vinifera] Length = 588 Score = 161 bits (407), Expect = 5e-37 Identities = 123/379 (32%), Positives = 165/379 (43%), Gaps = 17/379 (4%) Frame = +3 
Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MAC + W+ GLV A+LDL I FFGL LPCPC+G F N + Sbjct: 1 MACQEIHSWTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNGFFGNPN 60 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPF----TNSISPRKNDGSVFGDSCGNGIFEL 365 + C + LVD P +++S VQL V+ KFPF N SP N + G + +G L Sbjct: 61 GDN-CLQKFLVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDGAVGL 119 Query: 366 EGEVSCSS------------SVSDARESADAVWRQISSRTGKIDMKGKGILSYRPK-XXX 506 EGE SCSS S +R + V + + GK D KGK + + RPK Sbjct: 120 EGEASCSSFWDVMRSPDIAGKDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGVR 179 Query: 507 XXXXXXXXXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLE 686 +DPP + + SP + SE L Sbjct: 180 RRRRSAVDHGKFSSVSSFDPPRLDAPSGL------------------RSPSSVSETDEL- 220 Query: 687 YDKAPKKKRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXX 866 P R L ++N D D ++ S EE++ + +G S+ G+ N + Sbjct: 221 ---VPILIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTVRV 277 Query: 867 XXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEE 1046 KER ILR+QEEKASIEME+RQ+QRIIEE Sbjct: 278 LEQALEEEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRIIEE 337 Query: 1047 KSAYDDEEMDILQEILMRR 1103 KSAYD EEM++L+EIL+RR Sbjct: 338 KSAYDAEEMNLLKEILLRR 356 >emb|CAN62042.1| hypothetical protein VITISV_006702 [Vitis vinifera] Length = 829 Score = 160 bits (404), Expect = 1e-36 Identities = 120/379 (31%), Positives = 168/379 (44%), Gaps = 18/379 (4%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MAC + W+ GLV A+LDL I FFGL LPCPC+G F N + Sbjct: 1 MACQEIHSWTFGGLVGAYLDLAIAYLLLCGSTLAFFASKFLSFFGLCLPCPCNGFFGNPN 60 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPF----TNSISPRKNDGSVFGDSCGNGIFEL 365 + C + LVD P +++S VQL V+ KFPF N SP N + G + +G L Sbjct: 61 GDN-CLQKFLVDYPTERISSVQLCVKSKFPFDSVWANEGSPHPNWKLLKGRNSDDGAVGL 119 Query: 366 EGEVSCSS------------SVSDARESADAVWRQISSRTGKIDMKGKGILSYRPK-XXX 506 EGE SCSS S +R + V + + GK D KGK + + RPK Sbjct: 120 EGEASCSSFWDVMRSPDIAGKDSISRNGSCGVMNTPALKEGKSDTKGKRVSNQRPKTGVR 179 
Query: 507 XXXXXXXXXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRG-SWCTRDGSPLTDSENHNL 683 +DPP + S S ++ G ++ + P Sbjct: 180 RRRRSAVDHGKFSSVSSFDPPRLD--APSGLRSPSSVSETGEAFVGKTLVPDASGGEDGF 237 Query: 684 EYDKAPKKKRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIX 863 + + P R L ++N D D ++ S EE++ + +G S+ G+ N + Sbjct: 238 QDELVPILIDLGERALHGIKLNEHIDEDKPSEKDASSAEEVKCNARGKLSFNGNTENTVR 297 Query: 864 XXXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIE 1043 KER ILR+QEEKASIEME+RQ+QRIIE Sbjct: 298 VLEQALEEEHAARAALYHELEKERSAAASAADEAMAMILRIQEEKASIEMEARQFQRIIE 357 Query: 1044 EKSAYDDEEMDILQEILMR 1100 EKSAYD EEM++L+EIL++ Sbjct: 358 EKSAYDAEEMNLLKEILLK 376 >ref|XP_006420909.1| hypothetical protein CICLE_v10004416mg [Citrus clementina] gi|557522782|gb|ESR34149.1| hypothetical protein CICLE_v10004416mg [Citrus clementina] Length = 738 Score = 151 bits (381), Expect = 6e-34 Identities = 117/366 (31%), Positives = 163/366 (44%), Gaps = 4/366 (1%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIF--IN 191 M C A+ +W+ S LV AFL+L I G FGL+LPCPC+G F N Sbjct: 5 MMCQAIDVWTFSELVGAFLNLAIAYFLLCGSALAYFASKFLGLFGLSLPCPCNGHFGKPN 64 Query: 192 VHSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGDSCGNGIFELEG 371 S C+ LVD P +K+S++Q + KFPF + ++ N S + G EG Sbjct: 65 KISYGNCWQGFLVDCPTEKISNIQFLAKTKFPFDSILASNMNPQSKERE-FDKGHVASEG 123 Query: 372 EVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXXXXX 551 E SC+SS S + + R S + G+ D KGK + S+RP+ Sbjct: 124 ETSCASS------SRERIGRDSSMKEGRFDFKGKVVKSHRPRYGIRRHRKSVFRNEKSLS 177 Query: 552 XXYDPPFHEEVTD--SDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKKRGRRR 725 F + V+D S S +K G+ + S + +E + K+ Sbjct: 178 F---SSFDQLVSDWQSVLSSPSSFSKIGTEISEGSSVPVHRGSETIEDSRGASKE----D 230 Query: 726 TLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXXXXX 905 E N D + + ++ S E L D G G++ + I Sbjct: 231 VTMTSESNEPVDKNNTVEKDASSVEVLNCDLPGELGLDGNDISTIRSLEEALGEEHAARS 290 Query: 906 XXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDILQ 1085 KER ILRLQEEKASIEME+RQYQR+IEEKSAYD 
EEM+IL+ Sbjct: 291 ALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIEEKSAYDAEEMNILK 350 Query: 1086 EILMRR 1103 EI++RR Sbjct: 351 EIIIRR 356 >ref|XP_006493819.1| PREDICTED: intracellular protein transport protein USO1-like isoform X2 [Citrus sinensis] Length = 723 Score = 147 bits (371), Expect = 8e-33 Identities = 116/366 (31%), Positives = 161/366 (43%), Gaps = 4/366 (1%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIF--IN 191 M C A+ +W+ S LV AFL+L I G FGL+LPCPC+G F N Sbjct: 5 MMCQAIDVWTFSELVGAFLNLAIAYFLLCGSALAYFASKFLGLFGLSLPCPCNGHFGKPN 64 Query: 192 VHSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGDSCGNGIFELEG 371 S C+ LVD P +K+S++Q + KFPF + ++ N S + G EG Sbjct: 65 KISYGNCWQGFLVDCPTEKISNIQFLAKTKFPFDSILASNMNPQSNERE-FDKGHVASEG 123 Query: 372 EVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXXXXX 551 E SC+SS S + + R S + G+ D KGK + S RP+ Sbjct: 124 ETSCASS------SREIIGRDSSMKEGRFDFKGKVVKSQRPRYGIRRHRKSAFHNEKSVS 177 Query: 552 XXYDPPFHEEVTD--SDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKKRGRRR 725 F + V+D S S + G+ + S + +E + K+ Sbjct: 178 F---SSFDQLVSDWQSVLPSPSSFSNIGTEISEGSSVPVHRGSETIEDSRGASKE----D 230 Query: 726 TLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXXXXX 905 E N D + + ++ S E L D G G++ + I Sbjct: 231 VTMTSESNEPVDKNNTVEKDASSVEVLNCDLPGELGLDGNDISTIRSLEEALEEEHAARS 290 Query: 906 XXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDILQ 1085 KER ILRLQEEKASIEME+RQYQR+IEEKSAYD EEM+IL+ Sbjct: 291 ALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIEEKSAYDAEEMNILK 350 Query: 1086 EILMRR 1103 EI++RR Sbjct: 351 EIIIRR 356 >ref|XP_006493818.1| PREDICTED: intracellular protein transport protein USO1-like isoform X1 [Citrus sinensis] Length = 738 Score = 147 bits (371), Expect = 8e-33 Identities = 116/366 (31%), Positives = 161/366 (43%), Gaps = 4/366 (1%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIF--IN 191 M C A+ +W+ S LV AFL+L I G FGL+LPCPC+G F N Sbjct: 5 
MMCQAIDVWTFSELVGAFLNLAIAYFLLCGSALAYFASKFLGLFGLSLPCPCNGHFGKPN 64 Query: 192 VHSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGDSCGNGIFELEG 371 S C+ LVD P +K+S++Q + KFPF + ++ N S + G EG Sbjct: 65 KISYGNCWQGFLVDCPTEKISNIQFLAKTKFPFDSILASNMNPQSNERE-FDKGHVASEG 123 Query: 372 EVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXXXXX 551 E SC+SS S + + R S + G+ D KGK + S RP+ Sbjct: 124 ETSCASS------SREIIGRDSSMKEGRFDFKGKVVKSQRPRYGIRRHRKSAFHNEKSVS 177 Query: 552 XXYDPPFHEEVTD--SDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKKRGRRR 725 F + V+D S S + G+ + S + +E + K+ Sbjct: 178 F---SSFDQLVSDWQSVLPSPSSFSNIGTEISEGSSVPVHRGSETIEDSRGASKE----D 230 Query: 726 TLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXXXXX 905 E N D + + ++ S E L D G G++ + I Sbjct: 231 VTMTSESNEPVDKNNTVEKDASSVEVLNCDLPGELGLDGNDISTIRSLEEALEEEHAARS 290 Query: 906 XXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDILQ 1085 KER ILRLQEEKASIEME+RQYQR+IEEKSAYD EEM+IL+ Sbjct: 291 ALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIEEKSAYDAEEMNILK 350 Query: 1086 EILMRR 1103 EI++RR Sbjct: 351 EIIIRR 356 >ref|XP_002323777.2| hypothetical protein POPTR_0017s08220g [Populus trichocarpa] gi|550319756|gb|EEF03910.2| hypothetical protein POPTR_0017s08220g [Populus trichocarpa] Length = 781 Score = 141 bits (356), Expect = 4e-31 Identities = 118/375 (31%), Positives = 167/375 (44%), Gaps = 13/375 (3%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 M C ++ W+ LV A+LDL I G FGL LPCPC+G+F + H Sbjct: 1 MPCQEIKSWAFDELVGAYLDLAIAYFLLCASTFAFFAEKFLGLFGLCLPCPCNGLFGD-H 59 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGD----SCGNGIFEL 365 +R+ C+ +L D P + +S VQ SV+ +FPF + N S G +CG+ L Sbjct: 60 NRNKCWRSVLADRPSENISSVQFSVKSRFPFDSMWDKHLNFESSVGTINEVNCGSDNAGL 119 Query: 366 EGEVSCSSSVSDARESADAVWRQISS----RTGKIDMKGKGILSYRPKXXXXXXXXXXXX 533 EGE C S R+S V R + + + GK D+K +G + + Sbjct: 120 EGEAWCGS--LRERKSGKGVERSVVNVRDVKEGKFDVKERGFSIQKGRYLRRRRKVAADK 177 Query: 534 
XXXXXXXXYDPPFHEEVTDSDTYHH--FSTNK-RGSWCTRDGSPLTD-SENHNLEYDKAP 701 YD H + ++S T+ S NK D P + ++ + E K Sbjct: 178 GLFSSVSSYD---HSQ-SNSRTHPQSPASVNKLMNKHHEGDMVPASSGADALHFEDSKES 233 Query: 702 KKKRGRRRTLSRD-EMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXX 878 G T+S D E N + M + G++L+ QG + G+E + I Sbjct: 234 SVDTGFVGTVSNDFESNEPLGENKPMEKAAPLGDDLKCKAQGEPCFDGEEKHGIRVLEQA 293 Query: 879 XXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAY 1058 KER ILRLQE+KA IEME+RQY R+IEEKSAY Sbjct: 294 SEEEHAAFSALYLELEKERSAAASAADEAMAMILRLQEDKALIEMEARQYHRMIEEKSAY 353 Query: 1059 DDEEMDILQEILMRR 1103 D EEM+IL+EIL+RR Sbjct: 354 DLEEMNILKEILLRR 368 >ref|XP_002518101.1| conserved hypothetical protein [Ricinus communis] gi|223542697|gb|EEF44234.1| conserved hypothetical protein [Ricinus communis] Length = 641 Score = 140 bits (353), Expect = 1e-30 Identities = 124/386 (32%), Positives = 163/386 (42%), Gaps = 24/386 (6%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 M C A++ W+ SGLV AFLDL+I FFGLNLPCPC+G F Sbjct: 1 MPCLAIRRWTFSGLVGAFLDLSITFFLLCASALAYFASKFLAFFGLNLPCPCNGFFAIPD 60 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPR---KNDGSVFGDSCGNGIFELE 368 + + C R VD PLQK+S +Q SV+ KFPF +SI R K++ + + N + E Sbjct: 61 ASNNCLQRQFVDYPLQKISSIQSSVKSKFPF-DSIGNRSQWKSNLENYRNIVKNEVAGSE 119 Query: 369 GEVSC-SSSVSDARESADAVWRQISS-------------RTGKIDMKGKGILSYRPKXXX 506 GE SC SSSV+ A S D ++ + K D K KG+L +R + Sbjct: 120 GESSCISSSVTRAENSRDGDLAKMKEKGFVMGAMNLQDVKERKFDCKWKGLLRHRSRNNL 179 Query: 507 XXXXXXXXXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPL----TDSE- 671 F +D++T R C PL T SE Sbjct: 180 RRRRKDNGKLSQV------SSFKSLWSDAETPQSPPARIRNETCKDGMEPLNYRGTVSEV 233 Query: 672 --NHNLEYDKAPKKKRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGD 845 L+ + G +R +S+ L VD E SD + G+ Sbjct: 234 NCYEILDGKEGSVVDIGSKRKISQG-FELYEPVD----------ENETSDHENTSDLDGN 282 Query: 846 ENNKIXXXXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQ 1025 N I KER I RLQ+EKA IEME+RQ Sbjct: 283 
ARNTIRLLELALEEEHAARAVLYVELEKERSAAATAADEAMAMIQRLQKEKALIEMEARQ 342 Query: 1026 YQRIIEEKSAYDDEEMDILQEILMRR 1103 QR+IEEK AYD EEM+IL+EIL+RR Sbjct: 343 CQRMIEEKYAYDAEEMNILKEILLRR 368 >ref|XP_004300993.1| PREDICTED: uncharacterized protein LOC101300919 [Fragaria vesca subsp. vesca] Length = 869 Score = 138 bits (348), Expect = 4e-30 Identities = 124/422 (29%), Positives = 172/422 (40%), Gaps = 60/422 (14%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MAC + W+LSGLV AFLDL+I FGL+LPCPCDG+F N Sbjct: 1 MACQMIHSWTLSGLVGAFLDLSIAYMLFCASALAFFTSKFLDLFGLSLPCPCDGLFGN-P 59 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGDSCGNGIFELEGEV 377 + CF + LV P +K+ VQL ++ KFPF S + + +G FE EG+ Sbjct: 60 KNNYCFQKQLVQGPSEKIGRVQLQLKSKFPFDVMWSGDPHLHAKCKSDHESGHFEFEGDA 119 Query: 378 SCSS-------------SVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXX 518 SCSS SVS +S ++ ++ R +D KGK + RP Sbjct: 120 SCSSFSDGKGLSGVGRGSVSGNEQSCESGAVKLEER---LDHKGKKVGGRRPSHGLRRCR 176 Query: 519 XXXXXXXXXXXXXYD------------------PPFH-------------EEVTDSDTYH 605 PFH +V Y Sbjct: 177 TGASVDFGKLFSVSSCDVVQSDSQDMVTPINSRRPFHGMQRRRKGCSVDYAKVFSVSPYD 236 Query: 606 HFSTNKRGSWCTRDGSPLTDSENHNLEYD------KAPKKKRGRRRTLSRDEMNLSSDVD 767 F + R S L++ N E +APK +R +SR E+N Sbjct: 237 MFQSGARD--IANPPSSLSNVGNQGAEVPSSSDGMEAPKPER-----VSRLELN------ 283 Query: 768 TYMNENVQSGEELQSDQQGIQSYGGDENNK----------IXXXXXXXXXXXXXXXXXXX 917 E+V + +++D ++++G +E+ K + Sbjct: 284 ----EHVGKTKSIENDASSVENFGANEHEKPAYDSNDKTMVRVLEQALDEEHTARTALYY 339 Query: 918 XXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDILQEILM 1097 KER ILRLQEEKASIEME+RQYQR+I+EKS YD EEM+IL+EIL+ Sbjct: 340 ELEKERSAAATAADEAMAMILRLQEEKASIEMEARQYQRMIQEKSIYDAEEMNILKEILL 399 Query: 1098 RR 1103 RR Sbjct: 400 RR 401 >ref|XP_007227345.1| hypothetical protein PRUPE_ppa002653mg [Prunus persica] gi|462424281|gb|EMJ28544.1| hypothetical protein PRUPE_ppa002653mg [Prunus persica] Length = 648 Score = 134 bits (338), Expect = 5e-29 Identities = 122/385 (31%), Positives = 
158/385 (41%), Gaps = 23/385 (5%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 M C + W + LV AFLDL I FGL LPCPCDG F Sbjct: 1 MTCQMIHSWRFNELVGAFLDLAIAYLLLCAAAVAFFTSKFVSVFGLCLPCPCDGFF-GTP 59 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTN--SISPRKNDGSVFGDSC--GNGIFEL 365 +S CF R DVP +K+S VQ +V+ KFPF S + N S F D NG FE Sbjct: 60 RKSHCFQRQFADVPCEKISAVQWAVKSKFPFDVLWSENSNINSKSKFVDETYYENGHFEF 119 Query: 366 EGEVSCSS-----------SVSDARESADAVWRQISSRTGK---IDMKGKGILSYRPKXX 503 EGE SCSS S S A + + TGK ++K K + RP+ Sbjct: 120 EGEASCSSLSERRLLDMVESDSVAENDQSVEFGVANLETGKEQHFELKPKKVSGRRPR-- 177 Query: 504 XXXXXXXXXXXXXXXXXXYDPPFHEEVTDSDTYH----HFSTNKRGSWCTRDGSPLTDSE 671 Y P V+ D ++ ST+ CT + ++ + Sbjct: 178 -----LRRRRRRRGGSVDYGNPV--SVSSYDVFYSDAGDISTSPSSINCTEAPTYISSPD 230 Query: 672 NHNLEYDKAPKKKRGRRRTLSRDEMN-LSSDVDTYMNENVQSGEELQSDQQGIQSYGGDE 848 + + P+ K S DE D + +GE+L + +E Sbjct: 231 SVS-----RPEFKE------SMDETKPTGKDGSVVEDSGCNAGEKL--------GFDSNE 271 Query: 849 NNKIXXXXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQY 1028 + KER ILRLQEEKASIEME+RQY Sbjct: 272 TTTVRVLEQALEEEHATRAALYLELEKERSAAATAADEAMAMILRLQEEKASIEMEARQY 331 Query: 1029 QRIIEEKSAYDDEEMDILQEILMRR 1103 QR+IEEKSAYD EEM+IL+EIL+RR Sbjct: 332 QRMIEEKSAYDAEEMNILKEILVRR 356 >ref|XP_007034368.1| Uncharacterized protein TCM_020328 [Theobroma cacao] gi|508713397|gb|EOY05294.1| Uncharacterized protein TCM_020328 [Theobroma cacao] Length = 758 Score = 130 bits (327), Expect = 1e-27 Identities = 112/374 (29%), Positives = 155/374 (41%), Gaps = 12/374 (3%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MAC+ + W+ +GLV AFLDL+I G FGL+LPCPC G+F + Sbjct: 1 MACNVINSWTFNGLVGAFLDLSIAYLLLCGSTLSYLASKFLGLFGLSLPCPCSGLFGST- 59 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTN---------SISPRKNDGSVFGDSCGN 350 +S C +LV+ P K+S VQ SV++K PF + ++D D N Sbjct: 60 DKSNCLQAILVNKPSLKISSVQSSVKKKLPFDSIWNNFYDDEDEDEEQHDSQSNVDKWQN 119 Query: 351 
GIFELEGEVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXX 530 E+EGE S S E + V + S T KG G S RP+ Sbjct: 120 RNVEMEGEASSCS----WNEKKNFVGVKKGSFTPFPKWKGFG--SQRPRVGLRRRKRAAS 173 Query: 531 XXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKK 710 T + S K G+ T G+ +SE+ + + Sbjct: 174 GRRGKVLSFSYDSLVSMTTPTGLNSSASIGKFGNDITEGGTTSANSEDGWETSKEIEMPE 233 Query: 711 RGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGI---QSYGGDENNKIXXXXXXX 881 +G D D + + E ++ + + Q + G + N I Sbjct: 234 QG--------SQGFEMDDDPFAENTLIEKEVALAEFKCLPPDQDFDGSDRNAIRVLEQAL 285 Query: 882 XXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYD 1061 KER ILRLQEEKA+IEME+RQYQR+IEEKSAYD Sbjct: 286 EEEHAARTALYLELEKERSAAATAADEAMAMILRLQEEKATIEMEARQYQRMIEEKSAYD 345 Query: 1062 DEEMDILQEILMRR 1103 EEM+IL+EIL+RR Sbjct: 346 AEEMNILKEILLRR 359 >gb|EXC16951.1| Lysine-specific demethylase 3B [Morus notabilis] Length = 2152 Score = 128 bits (322), Expect = 4e-27 Identities = 111/386 (28%), Positives = 151/386 (39%), Gaps = 23/386 (5%) Frame = +3 Query: 15 EMACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINV 194 ++ A+ W+ S LVAAFLDL+I FGL LPCPCDG+F N Sbjct: 2 KLTSQAMNSWTFSELVAAFLDLSIAYCLLCASAFAFFASRFLALFGLCLPCPCDGLFWNP 61 Query: 195 HSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGDSCGNGIFELEGE 374 + S NR LVD P +K+S V SV+ KFPF + + + +G+ G GE Sbjct: 62 RNNS---NRQLVDCPYEKISSVYFSVKSKFPFDSVLGGDEQNGNSHLKLENEGNHGENGE 118 Query: 375 VSCSSSVSDARESADAVWRQISSRTG--------------KIDMKGKGILSYRPKXXXXX 512 SSS S D V + +TG + +G+ ++ RP+ Sbjct: 119 CGTSSSSSPGTRFHDLVETDVKRKTGAEFGAVNFEGSKEEEYGAEGQRVVEQRPRHGLRR 178 Query: 513 XXXXXXXXXXXXXXXYDPPFHEEVTDSDTYHHFSTN---------KRGSWCTRDGSPLTD 665 V+ DT + N K G+ + D + D Sbjct: 179 RRKGGGSVNYGKASF--------VSSYDTLQSDARNIPQSPPSISKMGNEVSEDPNDYGD 230 Query: 666 SENHNLEYDKAPKKKRGRRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGD 845 + G S D+ + S EEL QG + Sbjct: 231 DREAITAFGSLEGVSLGLASNNSNDDCK-------HAEREAVSIEELGRGTQGDFELDKN 283 Query: 846 ENNKIXXXXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQ 1025 E N I KER 
ILRLQ+EKASIEME++Q Sbjct: 284 EKNMIRLLEQALEEEHAACSALYLELEKERSAAASAADEAMAMILRLQKEKASIEMEAKQ 343 Query: 1026 YQRIIEEKSAYDDEEMDILQEILMRR 1103 YQR+IE K+AYD EEM+IL+EIL+RR Sbjct: 344 YQRMIEAKAAYDAEEMNILKEILLRR 369 >ref|XP_007049771.1| Uncharacterized protein TCM_002927 [Theobroma cacao] gi|508702032|gb|EOX93928.1| Uncharacterized protein TCM_002927 [Theobroma cacao] Length = 649 Score = 123 bits (308), Expect = 2e-25 Identities = 105/367 (28%), Positives = 149/367 (40%), Gaps = 5/367 (1%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLN-LPCPCDGIFINV 194 M + W+L GL+ AFLD+ + FGL LPCPC G F Sbjct: 1 MVSRTIPSWTLFGLIRAFLDVAVAYFLLCGSTLGFFAWKFYHVFGLYYLPCPCTGFF-GY 59 Query: 195 HSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFT----NSISPRKNDGSVFGDSCGNGIFE 362 + +LC+++LL++ P +K+ VQ +FPF N N + GNG+ E Sbjct: 60 QNSNLCWHKLLIEWPARKIYSVQKLALNRFPFNLVWFNDQECNLNAKYIKDRKFGNGVIE 119 Query: 363 LEGEVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXX 542 +GE +CSSS S R R + + D KGK I++ + K Sbjct: 120 SDGE-ACSSSPSGLR------LRTMVDKESGYDAKGKKIINQKQKSGIRRCRRAAFGYGK 172 Query: 543 XXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKKRGRR 722 F V + R G P+++ + D P K + Sbjct: 173 SSPVLLSGNFSSAVAGVSCSSYNGGETRSEISEHLG-PVSEID------DSFPDNKNNQT 225 Query: 723 RTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXXXX 902 T D + + S +++ + G GDE N+I Sbjct: 226 GTDGGDGTWHGFEFSNGEEKVSTSMKKINCNTNGKLGITGDEANRIRMLEQALEEEKAAY 285 Query: 903 XXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDIL 1082 KER ILRLQE+KASIEME+ QYQR+IEEK AYD+EEM+IL Sbjct: 286 AALYLELEKERAAAATAADEAMAMILRLQEDKASIEMEAMQYQRMIEEKFAYDEEEMNIL 345 Query: 1083 QEILMRR 1103 +EIL+RR Sbjct: 346 KEILVRR 352 >gb|EXC25123.1| hypothetical protein L484_003047 [Morus notabilis] Length = 445 Score = 119 bits (298), Expect = 2e-24 Identities = 108/363 (29%), Positives = 153/363 (42%), Gaps = 1/363 (0%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MA ++ W+L GL+ AF+DL++ GL LPCPC G F+ Sbjct: 1 
MAWKEIRPWTLCGLIGAFIDLSLAYFVLCSSSFFFFPSKFLSIIGLRLPCPCKG-FLGYQ 59 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTNSISPRKNDGSVFGD-SCGNGIFELEGE 374 + + C++RLL+D P++K+ VQL + +FPF D F D CGNG+ EL G+ Sbjct: 60 NGNFCWHRLLIDWPIRKIHAVQLLAKSRFPF---------DLVWFRDVKCGNGVVELGGD 110 Query: 375 VSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXXXXXX 554 S SSS S +R + ++ R + KGK ++++ + Sbjct: 111 GS-SSSFSGSRN------QNLADRESGCEYKGKRAVNFKQRSGIRRRRRANLAYGKLASV 163 Query: 555 XYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKKRGRRRTLS 734 F E D Y G R+ P N Y K G + S Sbjct: 164 MSYGNFREM---KDRYAEI----LGPLSRREDGPQDGITASNGNY-KGEGTCHGFELSGS 215 Query: 735 RDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXXXXXXXX 914 E N D Y +N E D +S G E +KI Sbjct: 216 FGESN-----DAY--KNSVWFENYIDDAGEKKSIVGSEFDKIRMLERALKEEKAASAALY 268 Query: 915 XXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMDILQEIL 1094 KER I RLQ++K S+E E+RQYQR+IEEK AYD+EE++IL+EIL Sbjct: 269 VELEKERAAAATAADEAMAMISRLQKDKGSMETETRQYQRMIEEKFAYDEEEIEILKEIL 328 Query: 1095 MRR 1103 +RR Sbjct: 329 IRR 331 >ref|XP_006840152.1| hypothetical protein AMTR_s00089p00065300 [Amborella trichopoda] gi|548841851|gb|ERN01827.1| hypothetical protein AMTR_s00089p00065300 [Amborella trichopoda] Length = 903 Score = 114 bits (286), Expect = 6e-23 Identities = 101/390 (25%), Positives = 157/390 (40%), Gaps = 27/390 (6%) Frame = +3 Query: 15 EMACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIF--- 185 +MAC + W+ L+ AFLDL+I G FGL PC C+G+F Sbjct: 116 KMACTVIHSWTFCSLIGAFLDLSIAFFMLCGSAMAFFTAKFMGIFGLYFPCTCNGLFGDP 175 Query: 186 INVHSRSLCFNRLLVDVPLQKLSDVQLSVRQKFPFTN---SISPRKNDGSVFGDSCGNGI 356 +N + C R+LV+ P +K S +Q+S++ KFP+ R++D G++ + Sbjct: 176 MNSGNGHFCIQRVLVEFPPRKSSSLQMSLQNKFPYDTIWLRDIGREHDNLKIGNTSDGAL 235 Query: 357 ---FELEGEVSCSSSVSDARES-------ADAVWRQISSRTG------KIDMKGKGILSY 488 + + E S S+S + S A + W + R + KGKG+ + Sbjct: 236 GLNHKTDDESSSSASEAQTLRSSIVDETRAKSEWDMVEFRASPCQGSVSLSSKGKGVWNP 295 Query: 489 
RPKXXXXXXXXXXXXXXXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDS 668 R + F E S S +R+ + Sbjct: 296 RSRSSLHRRRRASGDARSSIKLSLSRSFGEGGDHSPL--------NSSELSRENPVESFH 347 Query: 669 ENHNLEYDKAPKKKRGRRRTLSRDEMN-----LSSDVDTYMNENVQSGEELQSDQQGIQS 833 + + + +++ G +M+ L S+ + + E+ E L G Q Sbjct: 348 PSSHFKMNQSQYSGEGSDGNAMVGDMHGVSKELFSEDNNTIEEDTSFVEALVEGSLGEQG 407 Query: 834 YGGDENNKIXXXXXXXXXXXXXXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEM 1013 G+E + I KER ILRLQEEKA IEM Sbjct: 408 LMGNEADTIRILGKALEEERTSRTALYNELEKERSAAATAADEAMAMILRLQEEKAVIEM 467 Query: 1014 ESRQYQRIIEEKSAYDDEEMDILQEILMRR 1103 E+RQYQR+IEEK+ YD+EE +L+EIL+RR Sbjct: 468 EARQYQRMIEEKATYDEEERSVLKEILVRR 497 >ref|XP_004506884.1| PREDICTED: intracellular protein transport protein USO1-like isoform X1 [Cicer arietinum] gi|502147710|ref|XP_004506885.1| PREDICTED: intracellular protein transport protein USO1-like isoform X2 [Cicer arietinum] Length = 594 Score = 114 bits (284), Expect = 1e-22 Identities = 99/369 (26%), Positives = 154/369 (41%), Gaps = 7/369 (1%) Frame = +3 Query: 18 MACHAVQMWSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVH 197 MA + W+L GL+ AF+DL + FFGL+LPCPC GI + Sbjct: 1 MALEEIHTWNLVGLIGAFIDLFVAYVLLCVSTIAFLAFNLYRFFGLHLPCPCKGI-LGFK 59 Query: 198 SRSLCFNRLLVDVPLQKLSDVQLSVRQKFPF-------TNSISPRKNDGSVFGDSCGNGI 356 + +LCF+ +L + PL+K+ +Q+ ++FPF +S++ + V D N + Sbjct: 60 NSNLCFHMMLFEWPLKKVCSIQVMAAKRFPFDLVWVKKDHSLNYANENKMV--DVNDNRV 117 Query: 357 FELEGEVSCSSSVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXX 536 ELE E SCS + + +G D KGK ++S + + Sbjct: 118 VELEDESSCS--------GPPRLLSLVDKESG-YDAKGKRVMSLKQRSGIRRRKRGGYDC 168 Query: 537 XXXXXXXYDPPFHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLEYDKAPKKKRG 716 F +V + T S N D H+L+ + Sbjct: 169 GKINSVICCDDFQSDVV-AFTPCSQSINVASGKEVSVHYDEDDRTFHDLDEKTCHSYEFN 227 Query: 717 RRRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXX 896 S SS ++ YM+ VQ ++ +E++++ Sbjct: 228 ASMVDSPVRGIYSSSMEHYMSTTVQDNIQIVK----------NEDDRMKMLENALEEERS 277 Query: 897 
XXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMD 1076 KER I RLQEEKAS+EME RQ++R+IEE++AYD+EEM+ Sbjct: 278 AYAALYLELEKERAAAASAADEAMAMISRLQEEKASMEMEMRQFERLIEERAAYDEEEMN 337 Query: 1077 ILQEILMRR 1103 I+QEIL+RR Sbjct: 338 IMQEILIRR 346 >ref|XP_006576617.1| PREDICTED: uncharacterized protein LOC100815378 isoform X2 [Glycine max] Length = 612 Score = 111 bits (277), Expect = 6e-22 Identities = 107/369 (28%), Positives = 146/369 (39%), Gaps = 15/369 (4%) Frame = +3 Query: 42 WSLSGLVAAFLDLTIXXXXXXXXXXXXXXXXXXGFFGLNLPCPCDGIFINVHSRSLCFNR 221 W+L GL+ AF+DL + FFGL LPCPC G F S C +R Sbjct: 11 WTLGGLIGAFIDLVLAYFLLCGSAFAFFVSKWFRFFGLCLPCPCKGSF-GFRSSRFCVHR 69 Query: 222 LLVDVPLQKLSDVQLSVRQKFPF----TNSISPRKNDGSVFGDSCGNGIFELEGEVSCSS 389 LL + P +K+ +Q+ ++FPF S ND V + N + ELE E SCS Sbjct: 70 LLFEWPSRKICSIQVMAVKRFPFDLVWVKGHSCSANDKVVTERTHDNRVVELEDEASCS- 128 Query: 390 SVSDARESADAVWRQISSRTGKIDMKGKGILSYRPKXXXXXXXXXXXXXXXXXXXXYDPP 569 S S W + D KGK ++S + + YD Sbjct: 129 SCSGPCLLPFVDWENV------YDAKGKRVMSMKRRSGVRCHRRAS----------YDCA 172 Query: 570 FHEEVTDSDTYHHFSTNKRGSWCTRDGSPLTDSENHNLE--------YDKAPKKKRGR-- 719 S+ S C DGS + D N + D A + G Sbjct: 173 KVSSAVPSENLQSDVVLIPCSPC--DGSIIRDKTNAGMSPTSGKGVSVDDAKDDQTGHDL 230 Query: 720 -RRTLSRDEMNLSSDVDTYMNENVQSGEELQSDQQGIQSYGGDENNKIXXXXXXXXXXXX 896 +T + N S ++ + S E ++ G+E +++ Sbjct: 231 DEKTCHSYDFNGSMVDSPGHDKCLLSLEHYINNVCDNVQIVGNEEDRVKMLENALEEEKA 290 Query: 897 XXXXXXXXXXKERXXXXXXXXXXXXXILRLQEEKASIEMESRQYQRIIEEKSAYDDEEMD 1076 KER I RLQEEKAS+E+E RQY RIIEE+ AYD+EEMD Sbjct: 291 AYAALYLELEKERAAAATAADETMAMISRLQEEKASMELEMRQYLRIIEERVAYDEEEMD 350 Query: 1077 ILQEILMRR 1103 ILQEIL+RR Sbjct: 351 ILQEILIRR 359