BLASTX nr result
ID: Mentha22_contig00034002
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00034002 (724 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU18323.1| hypothetical protein MIMGU_mgv1a0010871mg, partia...   446   e-123
ref|XP_007217047.1| hypothetical protein PRUPE_ppa001424mg [Prun...   425   e-117
ref|XP_004238851.1| PREDICTED: uncharacterized protein LOC101257...   424   e-116
ref|XP_003525612.1| PREDICTED: uncharacterized protein LOC100776...   422   e-116
ref|XP_006344223.1| PREDICTED: uncharacterized protein LOC102606...   421   e-115
dbj|BAL63045.1| peptidyl serine alpha-galactosyltransferase [Nic...   421   e-115
ref|XP_006447182.1| hypothetical protein CICLE_v10014283mg [Citr...   421   e-115
ref|XP_007031711.1| F28J7.5 protein isoform 2, partial [Theobrom...   419   e-115
ref|XP_007031710.1| F28J7.5 protein isoform 1 [Theobroma cacao] ...   419   e-115
ref|XP_002526934.1| conserved hypothetical protein [Ricinus comm...   419   e-115
ref|XP_004173110.1| PREDICTED: uncharacterized LOC101221472, par...   418   e-114
ref|XP_004145689.1| PREDICTED: uncharacterized protein LOC101221...   418   e-114
ref|XP_002271170.1| PREDICTED: uncharacterized protein LOC100242...   415   e-114
ref|XP_007151570.1| hypothetical protein PHAVU_004G058000g [Phas...   414   e-113
ref|XP_002298591.2| hypothetical protein POPTR_0001s36250g [Popu...   412   e-113
gb|EXC31392.1| hypothetical protein L484_017674 [Morus notabilis]     412   e-113
gb|EPS70488.1| hypothetical protein M569_04262 [Genlisea aurea]       410   e-112
ref|XP_006599063.1| PREDICTED: uncharacterized protein LOC100783...   407   e-111
ref|NP_566148.2| uncharacterized protein [Arabidopsis thaliana] ...
405 e-111 gb|AAF01555.1|AC009325_25 unknown protein [Arabidopsis thaliana]... 405 e-111 >gb|EYU18323.1| hypothetical protein MIMGU_mgv1a0010871mg, partial [Mimulus guttatus] Length = 883 Score = 446 bits (1148), Expect = e-123 Identities = 203/241 (84%), Positives = 222/241 (92%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKG+PVAAYYGYL+GCDNILAKLHTKHPE+CDKVGGLLA H+DDLR APMWLSKTEEVR Sbjct: 159 EKGKPVAAYYGYLVGCDNILAKLHTKHPEICDKVGGLLAMHIDDLRALAPMWLSKTEEVR 218 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 ADKDHW TN TGDIY GWISEMYGYSFGAAEVGLRHKI DNLMIYPGY+PREG+EPILM Sbjct: 219 ADKDHWPTNYTGDIYGMGWISEMYGYSFGAAEVGLRHKITDNLMIYPGYIPREGVEPILM 278 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGL FS+GNWSFSKL+HHED+IVYDCGRLFPEPPYPRE+ +ESDPNKRRALFL IEC+ Sbjct: 279 HYGLTFSIGNWSFSKLQHHEDNIVYDCGRLFPEPPYPRELTAIESDPNKRRALFLDIECM 338 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 NTLNEGLLL+H GCPKPKWSKYLSFLKS++FAELTRPK+LTPKSR+MME Q+KQ+ Sbjct: 339 NTLNEGLLLHHAARGCPKPKWSKYLSFLKSNKFAELTRPKKLTPKSRQMMEVVVKQQKQE 398 Query: 4 V 2 V Sbjct: 399 V 399 Score = 235 bits (599), Expect = 1e-59 Identities = 118/232 (50%), Positives = 148/232 (63%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 +KGRPV+ Y YLIGCDN LAK HT+HPE CDKVGG++ H+ DL+ FA +WL KTEEVR Sbjct: 565 DKGRPVSTPYDYLIGCDNELAKAHTRHPEACDKVGGVIIMHVRDLKRFALLWLHKTEEVR 624 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 AD HW+ +ITGDIY +GWISEMYGYSF AAE+ LRH I+ +++IYPGY P G+ + Sbjct: 625 ADVGHWSKDITGDIYEAGWISEMYGYSFAAAEMNLRHVISKDMLIYPGYNPVPGVNYRVF 684 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGL F VGNW+F K K IV C FP+PP P +K + D +R LSIEC Sbjct: 685 HYGLEFRVGNWNFDKAKWRRMDIVNKCWAKFPDPPDPSTLKRADEDSFQRD--LLSIECG 742 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEE 29 LNE L ++E+ CP P LS ++ P L+P R+ E Sbjct: 743 
KALNEALQSHYERRKCPDP---NTLSNPVREQAKPAPNPPSLSPPIRKKTPE 791 >ref|XP_007217047.1| hypothetical protein PRUPE_ppa001424mg [Prunus persica] gi|462413197|gb|EMJ18246.1| hypothetical protein PRUPE_ppa001424mg [Prunus persica] Length = 831 Score = 425 bits (1092), Expect = e-117 Identities = 193/235 (82%), Positives = 209/235 (88%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKG+PVAAYYGYL+GCDNIL++LHTKHP+LCDKVGGLLA HMDDLR APMWLSKTEEVR Sbjct: 149 EKGKPVAAYYGYLVGCDNILSQLHTKHPDLCDKVGGLLAMHMDDLRALAPMWLSKTEEVR 208 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HW TNITGDIY GWISEMYGYSFGAAEVGL+HKINDNLMIYPGY PREG+ PIL Sbjct: 209 EDRAHWTTNITGDIYGKGWISEMYGYSFGAAEVGLQHKINDNLMIYPGYTPREGVVPILF 268 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSFSKL HHED IVYDCGRLFPEPPYP+EVK MESDPNKRRAL +++ECI Sbjct: 269 HYGLPFSVGNWSFSKLDHHEDGIVYDCGRLFPEPPYPKEVKLMESDPNKRRALLMNLECI 328 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHV 20 NTLNEGLLL H +GCPKPKWSKYLSFLKS FAELTRPKQLTP + + + HV Sbjct: 329 NTLNEGLLLQHAANGCPKPKWSKYLSFLKSKTFAELTRPKQLTPATLQFEKAVHV 383 Score = 254 bits (648), Expect = 3e-65 Identities = 119/198 (60%), Positives = 143/198 (72%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN LA LHT+HPE CDKVGG++ H+DDLR FA +WL KTEEVRA Sbjct: 517 RGRPVSTPYDYLIGCDNELANLHTRHPEACDKVGGVIIMHIDDLRKFALLWLHKTEEVRA 576 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D H+ATNITGDIY SGWISEMYGYSFGAAE+ LRH+I+ ++IYPGY P+ GI + H Sbjct: 577 DTAHYATNITGDIYESGWISEMYGYSFGAAELKLRHQISSEILIYPGYAPQPGIRYRVFH 636 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL + VGNWSF K +V C FP+PP P + ++D NK + LSIECI Sbjct: 637 YGLEYKVGNWSFDKANWRNVDVVNKCWGQFPDPPDPSTLD--QTDKNKLQTDLLSIECIK 694 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLNE L L+HE+ CP P Sbjct: 695 TLNEALRLHHERRNCPDP 712 >ref|XP_004238851.1| 
PREDICTED: uncharacterized protein LOC101257369 [Solanum lycopersicum] Length = 912 Score = 424 bits (1090), Expect = e-116 Identities = 195/241 (80%), Positives = 213/241 (88%), Gaps = 1/241 (0%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKG+PV+AYYGYLIGCDNILAKLHTKHPE CDKVGGLLA H+DDLR AP+WLSKTEEVR Sbjct: 151 EKGKPVSAYYGYLIGCDNILAKLHTKHPEFCDKVGGLLAMHIDDLRALAPLWLSKTEEVR 210 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HW TN TGDIY +GWISEMYGYSFGAAEVGLRHKINDNLMIYPGY PREG+EPILM Sbjct: 211 EDRAHWPTNYTGDIYGTGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYTPREGVEPILM 270 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF+VGNWSFSKL HHED IVYDC RLFPEPPYPRE+ +MESD NKRRALFL+IECI Sbjct: 271 HYGLPFNVGNWSFSKLDHHEDDIVYDCSRLFPEPPYPREITQMESDHNKRRALFLNIECI 330 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMME-EAHVQEKQ 8 NT+NEGLLL H CPKPKWSKYLSFLKS FAEL+RPK LTP+SR+MME E H + + Sbjct: 331 NTMNEGLLLQHAAFKCPKPKWSKYLSFLKSKTFAELSRPKHLTPQSRQMMEIEIHEEVNK 390 Query: 7 D 5 + Sbjct: 391 E 391 Score = 247 bits (631), Expect = 3e-63 Identities = 113/198 (57%), Positives = 138/198 (69%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 + RP + Y YLIGCDNILAKLHT+HPE CDKVGG++ H+DDLR FA WL KT EVR Sbjct: 533 RSRPASTPYDYLIGCDNILAKLHTRHPEACDKVGGVIIMHVDDLRKFALQWLHKTMEVRL 592 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D+ HW+ NITGDIY SGWISEMYGYSFGAAE+ LRH I+D ++IYPGYVP+ G+ + H Sbjct: 593 DRSHWSKNITGDIYESGWISEMYGYSFGAAELNLRHVISDEILIYPGYVPKPGVNYRVFH 652 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL + VG WSF K +V C FP+PP P + ++D N + LS+EC Sbjct: 653 YGLEYRVGKWSFDKANWRHTDLVNKCWAKFPDPPDPSSLD--QTDNNSLQRDLLSVECAT 710 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLNE L L+HE+ CP P Sbjct: 711 TLNEALRLHHERRKCPDP 728 >ref|XP_003525612.1| PREDICTED: uncharacterized protein LOC100776740 [Glycine max] Length = 821 Score = 422 
bits (1084), Expect = e-116 Identities = 193/240 (80%), Positives = 214/240 (89%), Gaps = 3/240 (1%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFH+DDLRVFAP+WLSKTEEVR Sbjct: 148 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHIDDLRVFAPLWLSKTEEVR 207 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D HWATNITGDIY GWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPIL+ Sbjct: 208 EDTVHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILL 267 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSF+KL HH+D IVY+C +LFPEPPYP+EV+++E DPN+RR LFLS+ECI Sbjct: 268 HYGLPFSVGNWSFNKLAHHDDGIVYECNQLFPEPPYPKEVRQLELDPNRRRGLFLSLECI 327 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMME---EAHVQE 14 N +NEGLLL H +GCPKP WSKYLSFLKS +AELT+PK + P + +MME E HV + Sbjct: 328 NIINEGLLLQHAANGCPKPTWSKYLSFLKSKAYAELTQPKYVNPATLQMMEDIKEEHVDD 387 Score = 232 bits (592), Expect = 8e-59 Identities = 113/192 (58%), Positives = 138/192 (71%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 + PV+ Y YLIGCDN LAKLHT HPE CDKVGG++ H+DDLR FA +WL KTEEVRA Sbjct: 511 RSHPVSTPYDYLIGCDNELAKLHTSHPEACDKVGGVIIMHIDDLRKFAMLWLHKTEEVRA 570 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D+ H+A NITGDIY SGWISEMYGYSFGAAE+ LRH IN+ ++IYPGYVP + + H Sbjct: 571 DRAHYARNITGDIYESGWISEMYGYSFGAAELKLRHTINNEILIYPGYVPVPSVNYRVFH 630 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL FSVGNWSF K +V C FP+PP + ++ ++ + +R L LSIEC Sbjct: 631 YGLRFSVGNWSFDKADWRNVDMVNKCWAKFPDPPDSSPI-DLANNEDLQRDL-LSIECAK 688 Query: 181 TLNEGLLLNHEK 146 TLNE L L+H+K Sbjct: 689 TLNEALNLHHQK 700 >ref|XP_006344223.1| PREDICTED: uncharacterized protein LOC102606280 [Solanum tuberosum] Length = 905 Score = 421 bits (1083), Expect = e-115 Identities = 193/231 (83%), Positives = 209/231 (90%) Frame = -1 Query: 724 
EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKG+PV+AYYGYLIGCDNILAKLHTKHPELCDKVGGLLA H+DDLR AP+WLSKTEEVR Sbjct: 151 EKGKPVSAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRALAPLWLSKTEEVR 210 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 DK HW TN TGDIY +GWISEMYGYSFGAAEVGLRHKINDNLMIYPGY PREG+EPILM Sbjct: 211 EDKVHWPTNYTGDIYGTGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYTPREGVEPILM 270 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF+VGNWSFSKL HHED IVYDC RLFPEPPYPRE+ +MESD +KRRALFL+IECI Sbjct: 271 HYGLPFNVGNWSFSKLDHHEDDIVYDCSRLFPEPPYPREITQMESDHSKRRALFLNIECI 330 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMME 32 NT+NEGLLL H CPKPKWSKYLSFLKS FAEL+RPK+LT +SR+MME Sbjct: 331 NTMNEGLLLQHAAFKCPKPKWSKYLSFLKSKTFAELSRPKRLTAQSRQMME 381 Score = 252 bits (644), Expect = 8e-65 Identities = 116/198 (58%), Positives = 140/198 (70%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 + RP + Y YLIGCDNILAKLHT+HPE CDKVGG++ H+DDLR FA WL KT EVR Sbjct: 533 RSRPASTPYDYLIGCDNILAKLHTRHPEACDKVGGVIIMHVDDLRKFALQWLHKTMEVRL 592 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 DK HW+ NITGDIY SGWISEMYGYSFGAAE+ LRH I+D ++IYPGYVP+ G+ + H Sbjct: 593 DKSHWSKNITGDIYESGWISEMYGYSFGAAELNLRHVISDEILIYPGYVPKPGVNYRVFH 652 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL + VGNWSF K +V C FP+PP P + ++D N + LSIEC Sbjct: 653 YGLEYRVGNWSFDKANWRHADLVNKCWAKFPDPPDPSSLD--QTDNNSLQRDLLSIECAT 710 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLNE L+L+HE+ CP P Sbjct: 711 TLNEALMLHHERRKCPDP 728 >dbj|BAL63045.1| peptidyl serine alpha-galactosyltransferase [Nicotiana tabacum] Length = 898 Score = 421 bits (1083), Expect = e-115 Identities = 192/231 (83%), Positives = 209/231 (90%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPV+AYYGYL+GCDN+LAKLHTKHPELCDKVGGLLA H+DDLR AP+WLSKTEEVR Sbjct: 150 
EKGRPVSAYYGYLVGCDNVLAKLHTKHPELCDKVGGLLAMHIDDLRALAPLWLSKTEEVR 209 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 DK HWATN TGDIY +GWISEMYGYSFGAAEVGLRHKINDNLMIYPGY+PREG+EPILM Sbjct: 210 EDKAHWATNYTGDIYGTGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYIPREGVEPILM 269 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF+VGNWSFSKL+HH D IVY+C RLF EPPYPRE+ +ME D NKRRALFL+IECI Sbjct: 270 HYGLPFNVGNWSFSKLEHHNDDIVYNCNRLFLEPPYPREIAQMEPDRNKRRALFLNIECI 329 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMME 32 NTLNEGLLL H GCPKPKWSKYLSFLKS FAEL+RPK LT +SR+MME Sbjct: 330 NTLNEGLLLQHAAFGCPKPKWSKYLSFLKSKTFAELSRPKPLTSQSRQMME 380 Score = 246 bits (628), Expect = 6e-63 Identities = 111/198 (56%), Positives = 140/198 (70%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +G PV+ Y YLIGCDN+LAKLHT+HPE CDKVGG++ H+DDLR FA WL KT EVR Sbjct: 513 RGHPVSTPYDYLIGCDNVLAKLHTRHPEACDKVGGVIIMHVDDLRKFALQWLHKTVEVRL 572 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D+ HW+ NITGD+Y +GWISEMYGYSFGAAE+ LRH I+ ++IYPGYVP G++ + H Sbjct: 573 DRSHWSKNITGDVYEAGWISEMYGYSFGAAELNLRHVISGEILIYPGYVPAPGVKYRVFH 632 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL + VGNWSF K +V C FP+PP P + + ++D +R LSIEC Sbjct: 633 YGLEYRVGNWSFDKANWRHVDLVNKCWAKFPDPPDPSSLDQSDNDSLQRD--LLSIECAT 690 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLNE L ++HE+ CP P Sbjct: 691 TLNEALRIHHERRKCPDP 708 >ref|XP_006447182.1| hypothetical protein CICLE_v10014283mg [Citrus clementina] gi|568831415|ref|XP_006469963.1| PREDICTED: uncharacterized protein LOC102629731 [Citrus sinensis] gi|557549793|gb|ESR60422.1| hypothetical protein CICLE_v10014283mg [Citrus clementina] Length = 823 Score = 421 bits (1082), Expect = e-115 Identities = 192/222 (86%), Positives = 208/222 (93%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAA YGYLIGC+NILAKLHTKHPELCDKVGGLLA H+DDLR AP+WLSKTEEVR 
Sbjct: 149 EKGRPVAALYGYLIGCNNILAKLHTKHPELCDKVGGLLAMHIDDLRALAPLWLSKTEEVR 208 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATNITGDIY+SGWISEMYGYSFGAAEVGLRHKIND+LMIYPGY+PREG+EPIL+ Sbjct: 209 EDRAHWATNITGDIYASGWISEMYGYSFGAAEVGLRHKINDDLMIYPGYIPREGVEPILL 268 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF VGNWSFSKL+HHED+IVYDCGRLFPEPPYPREVKEME DPN+RRALFL+IECI Sbjct: 269 HYGLPFRVGNWSFSKLEHHEDNIVYDCGRLFPEPPYPREVKEMEPDPNQRRALFLNIECI 328 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQL 59 NT+NEGLLL H +GCPKPKWS+YLSFLKS FAELTRPK L Sbjct: 329 NTINEGLLLQHTANGCPKPKWSRYLSFLKSKSFAELTRPKLL 370 Score = 249 bits (637), Expect = 5e-64 Identities = 117/199 (58%), Positives = 143/199 (71%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 E+GRPV+ Y YLIGC+N LAKLHT+HP+ CDKVGG++ H+DDLR FA +WL KTEEVR Sbjct: 510 ERGRPVSTPYDYLIGCNNELAKLHTRHPDACDKVGGVIIMHIDDLRKFAMLWLHKTEEVR 569 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 ADK H++ NITGD+Y SGWISEMYGYSFGAAE+ LRH IN ++IYPGY+P G++ + Sbjct: 570 ADKAHYSRNITGDVYESGWISEMYGYSFGAAELKLRHIINRKILIYPGYIPEPGVKYRVF 629 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGL FSVGNWSF K + +V C FPEPP P + SD N + LSIEC Sbjct: 630 HYGLEFSVGNWSFDKANWRDADMVNKCWAQFPEPPDPSTLD--RSDKNILQRDLLSIECA 687 Query: 184 NTLNEGLLLNHEKHGCPKP 128 LNE L L+H++ CP P Sbjct: 688 KKLNEALRLHHKRRNCPDP 706 >ref|XP_007031711.1| F28J7.5 protein isoform 2, partial [Theobroma cacao] gi|508710740|gb|EOY02637.1| F28J7.5 protein isoform 2, partial [Theobroma cacao] Length = 605 Score = 419 bits (1076), Expect = e-115 Identities = 195/240 (81%), Positives = 211/240 (87%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKG PV+AYYGYL+GCDNILAKLHTKHPELCDKVGGLLA H++DLRV AP+WLSKTEEVR Sbjct: 147 EKGWPVSAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIEDLRVLAPLWLSKTEEVR 206 Query: 544 
ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATNITGDIY GWISEMYGYSFGAAE GLRHKIND+LMIYPGY PR G+EPIL+ Sbjct: 207 EDRAHWATNITGDIYGKGWISEMYGYSFGAAEAGLRHKINDDLMIYPGYTPRPGVEPILL 266 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLP VGNWSFSKL HHEDSIVYDCGRLFPEPPYPREVK MESDPNKRR LFLSIECI Sbjct: 267 HYGLPIRVGNWSFSKLDHHEDSIVYDCGRLFPEPPYPREVKSMESDPNKRRGLFLSIECI 326 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 NT+NEGLL++H +HGC KPKWSKYLSFLKS FAELT+PK LTP SR E A ++ D Sbjct: 327 NTMNEGLLIHHARHGCLKPKWSKYLSFLKSKTFAELTQPKLLTP-SRVQTEVAEEEKGID 385 Score = 148 bits (373), Expect = 2e-33 Identities = 67/87 (77%), Positives = 75/87 (86%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN LAKLHT+HPE CDKVGG++ H+DDLR FA +WL KTEEVRA Sbjct: 509 RGRPVSTPYEYLIGCDNELAKLHTRHPEACDKVGGVIIMHIDDLREFALLWLLKTEEVRA 568 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSF 461 DK H+ATNITGDIY SGWISEMYGYSF Sbjct: 569 DKAHYATNITGDIYESGWISEMYGYSF 595 >ref|XP_007031710.1| F28J7.5 protein isoform 1 [Theobroma cacao] gi|508710739|gb|EOY02636.1| F28J7.5 protein isoform 1 [Theobroma cacao] Length = 820 Score = 419 bits (1076), Expect = e-115 Identities = 195/240 (81%), Positives = 211/240 (87%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKG PV+AYYGYL+GCDNILAKLHTKHPELCDKVGGLLA H++DLRV AP+WLSKTEEVR Sbjct: 147 EKGWPVSAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIEDLRVLAPLWLSKTEEVR 206 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATNITGDIY GWISEMYGYSFGAAE GLRHKIND+LMIYPGY PR G+EPIL+ Sbjct: 207 EDRAHWATNITGDIYGKGWISEMYGYSFGAAEAGLRHKINDDLMIYPGYTPRPGVEPILL 266 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLP VGNWSFSKL HHEDSIVYDCGRLFPEPPYPREVK MESDPNKRR LFLSIECI Sbjct: 267 HYGLPIRVGNWSFSKLDHHEDSIVYDCGRLFPEPPYPREVKSMESDPNKRRGLFLSIECI 326 Query: 184 
NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 NT+NEGLL++H +HGC KPKWSKYLSFLKS FAELT+PK LTP SR E A ++ D Sbjct: 327 NTMNEGLLIHHARHGCLKPKWSKYLSFLKSKTFAELTQPKLLTP-SRVQTEVAEEEKGID 385 Score = 256 bits (655), Expect = 4e-66 Identities = 120/198 (60%), Positives = 146/198 (73%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN LAKLHT+HPE CDKVGG++ H+DDLR FA +WL KTEEVRA Sbjct: 509 RGRPVSTPYEYLIGCDNELAKLHTRHPEACDKVGGVIIMHIDDLREFALLWLLKTEEVRA 568 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 DK H+ATNITGDIY SGWISEMYGYSFGAAE+ LRH I+ +++YPGYVP G++ + H Sbjct: 569 DKAHYATNITGDIYESGWISEMYGYSFGAAELKLRHHISSKILLYPGYVPEPGVKYRVFH 628 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K + +V C F +PP P V+ ++D N R+ LSIEC Sbjct: 629 YGLEFKVGNWSFDKANWRDTDVVNRCWATFLDPPDPSTVE--QTDENLRQRDLLSIECAK 686 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLNE LLL+H++ CP P Sbjct: 687 TLNEALLLHHKRRNCPDP 704 >ref|XP_002526934.1| conserved hypothetical protein [Ricinus communis] gi|223533686|gb|EEF35421.1| conserved hypothetical protein [Ricinus communis] Length = 817 Score = 419 bits (1076), Expect = e-115 Identities = 196/240 (81%), Positives = 208/240 (86%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYL+GCDNILA+LHTKHPELCDKVGGLLA HMDDLR APMWLSKTEEVR Sbjct: 142 EKGRPVAAYYGYLVGCDNILAQLHTKHPELCDKVGGLLAMHMDDLRALAPMWLSKTEEVR 201 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATNITGDIY GWISEMYGYSFGAAEVGL+HKIND+LMIYPGY PR G++PIL+ Sbjct: 202 EDRAHWATNITGDIYGQGWISEMYGYSFGAAEVGLQHKINDDLMIYPGYTPRPGVQPILL 261 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSF+KL HHED IVYDC RLFPEPPYPREVK MESDPNKRR LFLSIECI Sbjct: 262 HYGLPFSVGNWSFTKLNHHEDDIVYDCDRLFPEPPYPREVKLMESDPNKRRGLFLSIECI 321 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 NTLNEGLLL H +GC 
KPKWSKYLSFLKS FAELTRPK LT +S + E Q D Sbjct: 322 NTLNEGLLLQHAANGCAKPKWSKYLSFLKSKTFAELTRPKLLTSESIKTEAENEQQVIDD 381 Score = 242 bits (617), Expect = 1e-61 Identities = 112/196 (57%), Positives = 143/196 (72%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN LAKLHT++P+ CDKVGG++ H++DLR FA +WL KTEEVRA Sbjct: 504 RGRPVSTPYDYLIGCDNELAKLHTRYPDACDKVGGIIIMHIEDLRKFAMLWLHKTEEVRA 563 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 DK H+ATN TGDIY+SGWISEMYGYSFGAAE+ L+H I+ +++IYPGY+P G++ + H Sbjct: 564 DKAHYATNFTGDIYNSGWISEMYGYSFGAAELQLQHIISRDILIYPGYIPEPGVKYRVFH 623 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K + +V C FP+PP P + ++D +R LSIEC Sbjct: 624 YGLEFKVGNWSFDKANWRDTDMVNKCWAKFPDPPDPSTLDRTDNDILQRDR--LSIECAR 681 Query: 181 TLNEGLLLNHEKHGCP 134 LNE L L+H+K CP Sbjct: 682 KLNEALFLHHKKRKCP 697 >ref|XP_004173110.1| PREDICTED: uncharacterized LOC101221472, partial [Cucumis sativus] Length = 410 Score = 418 bits (1074), Expect = e-114 Identities = 189/232 (81%), Positives = 208/232 (89%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYL+GCDNILAKLHTKHPELCDKVGGLLA H+DDLRVFAPMWLSKTEEVR Sbjct: 123 EKGRPVAAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVR 182 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+DHWATNITGDIY GWISEMYGYSFGAAEVGLRHKIN+NLMIYPGY+PR IEPIL+ Sbjct: 183 EDRDHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINENLMIYPGYIPRPDIEPILL 242 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSFSKL HHED IVYDC RLFPEPPYPRE+++MESD NK+R L ++IECI Sbjct: 243 HYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECI 302 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEE 29 N LNEGLL H+++GCPKP+WSKYLSFLKS F +LT+PK TP S M E+ Sbjct: 303 NLLNEGLLWQHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPASLVMKED 354 >ref|XP_004145689.1| PREDICTED: uncharacterized protein 
LOC101221472 [Cucumis sativus] Length = 800 Score = 418 bits (1074), Expect = e-114 Identities = 189/232 (81%), Positives = 208/232 (89%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYL+GCDNILAKLHTKHPELCDKVGGLLA H+DDLRVFAPMWLSKTEEVR Sbjct: 123 EKGRPVAAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRVFAPMWLSKTEEVR 182 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+DHWATNITGDIY GWISEMYGYSFGAAEVGLRHKIN+NLMIYPGY+PR IEPIL+ Sbjct: 183 EDRDHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINENLMIYPGYIPRPDIEPILL 242 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSFSKL HHED IVYDC RLFPEPPYPRE+++MESD NK+R L ++IECI Sbjct: 243 HYGLPFSVGNWSFSKLNHHEDGIVYDCNRLFPEPPYPREIQQMESDSNKKRGLLINIECI 302 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEE 29 N LNEGLL H+++GCPKP+WSKYLSFLKS F +LT+PK TP S M E+ Sbjct: 303 NLLNEGLLWQHKRNGCPKPQWSKYLSFLKSKTFTDLTKPKYPTPASLVMKED 354 Score = 247 bits (631), Expect = 3e-63 Identities = 116/198 (58%), Positives = 139/198 (70%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN+LAKLHT HPE CDKVGG++ H+DDLR F+ +WL KTEEVRA Sbjct: 485 RGRPVSTPYDYLIGCDNVLAKLHTSHPEACDKVGGVIIMHIDDLRKFSMLWLHKTEEVRA 544 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D+ H+ATNITGDIY SGWISEMYGYSFGAAE+ LRH + +++YPGY P G+ + H Sbjct: 545 DRAHYATNITGDIYQSGWISEMYGYSFGAAELQLRHIRSSEILLYPGYAPDPGVHYRVFH 604 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K E +V C FP PP P + + + D R LSIECI Sbjct: 605 YGLEFKVGNWSFDKANWRETDLVNRCWAQFPAPPDPSTLDQSDKDGFARD--LLSIECIR 662 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLNE L L+H+K C P Sbjct: 663 TLNEALYLHHKKRNCSDP 680 >ref|XP_002271170.1| PREDICTED: uncharacterized protein LOC100242361 [Vitis vinifera] gi|296081317|emb|CBI17699.3| unnamed protein product [Vitis vinifera] Length = 817 Score = 415 bits (1066), Expect = e-114 Identities = 
191/236 (80%), Positives = 208/236 (88%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAA YGYL+GCDNILA+LHTKHPELCDKVGGLLA H+DDLR APMWLSKTEEVR Sbjct: 149 EKGRPVAALYGYLVGCDNILAQLHTKHPELCDKVGGLLAMHIDDLRALAPMWLSKTEEVR 208 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATN TGDIY GWISEMYGYSFGAAEVGLRHKINDNLM+YPGY+P++GIEPIL+ Sbjct: 209 EDRAHWATNFTGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMLYPGYIPQDGIEPILL 268 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF+VGNWSFSKL++HED +VYDCGRLF EPPYP+EVK ME+DP KRRALFLSIECI Sbjct: 269 HYGLPFTVGNWSFSKLEYHEDGVVYDCGRLFAEPPYPKEVKLMEADPRKRRALFLSIECI 328 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQ 17 NTLNEGLLL H +GC KPKWSKYLSFLKS FAELTRPK LTP S + E Q Sbjct: 329 NTLNEGLLLQHAANGCSKPKWSKYLSFLKSKTFAELTRPKFLTPDSLQAEEAVQKQ 384 Score = 249 bits (636), Expect = 7e-64 Identities = 117/198 (59%), Positives = 141/198 (71%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +G+PV+ YGYLIGCDN LA+LHT+HPE CDKVGG++ H+DDLR FA +WL KTEEVRA Sbjct: 511 RGQPVSTPYGYLIGCDNELAQLHTRHPEACDKVGGVIIMHIDDLRKFALLWLHKTEEVRA 570 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 DK H+A NITGDIY SGWISEMYGYSFGAAE+ LRH IN ++IYPGYVP G++ + H Sbjct: 571 DKAHYARNITGDIYESGWISEMYGYSFGAAELNLRHGINREILIYPGYVPEPGVKYRVFH 630 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K + +V C FP+PP P + + D +R LSIEC Sbjct: 631 YGLEFVVGNWSFDKANWRDSDLVNKCWAKFPDPPDPSTLDASDDDILQRD--LLSIECAK 688 Query: 181 TLNEGLLLNHEKHGCPKP 128 LNE L L H++ CP P Sbjct: 689 KLNEALYLYHKRRNCPDP 706 >ref|XP_007151570.1| hypothetical protein PHAVU_004G058000g [Phaseolus vulgaris] gi|593702323|ref|XP_007151571.1| hypothetical protein PHAVU_004G058000g [Phaseolus vulgaris] gi|561024879|gb|ESW23564.1| hypothetical protein PHAVU_004G058000g [Phaseolus vulgaris] gi|561024880|gb|ESW23565.1| hypothetical protein 
PHAVU_004G058000g [Phaseolus vulgaris] Length = 814 Score = 414 bits (1063), Expect = e-113 Identities = 190/240 (79%), Positives = 212/240 (88%), Gaps = 3/240 (1%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EK RPVAAYYGYL GCDNILA+LHTKHPELCDKVGGLLAFH+DDLRVFAP+WLSKTEEVR Sbjct: 145 EKKRPVAAYYGYLKGCDNILAQLHTKHPELCDKVGGLLAFHIDDLRVFAPLWLSKTEEVR 204 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATNITGDIY GWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPIL+ Sbjct: 205 EDRAHWATNITGDIYGKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILL 264 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSF+KL HH+D +VY+C LFPEPPYP+EV+++E D N+RR LFLSIECI Sbjct: 265 HYGLPFSVGNWSFNKLAHHDDGLVYECNSLFPEPPYPKEVRQLELDDNRRRGLFLSIECI 324 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMME---EAHVQE 14 N +NEGLLL H +GCPKP WSKYLSFLKS +AELT+PK +TP + +MME E HV + Sbjct: 325 NIINEGLLLQHAANGCPKPVWSKYLSFLKSKAYAELTQPKYVTPATLQMMEDIKEEHVDD 384 Score = 237 bits (604), Expect = 3e-60 Identities = 112/192 (58%), Positives = 139/192 (72%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +G PV+ Y YLIGCDN LAKLHT HPE CDKVGG++ H++DLR FA +WL KTEEVRA Sbjct: 508 RGHPVSTPYDYLIGCDNELAKLHTSHPEACDKVGGVIIMHIEDLRKFAMLWLHKTEEVRA 567 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D+ H+A NITGDIY SGWISEMYGYSFGAAE+ L+H IND ++IYPGYVP+ G++ + H Sbjct: 568 DRAHYARNITGDIYESGWISEMYGYSFGAAELKLKHTINDEILIYPGYVPQPGVKYRVFH 627 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL FSVGNWSF K +V C FP+PP + + ++ +R LSIEC Sbjct: 628 YGLQFSVGNWSFDKADWRNVDMVNKCWAKFPDPPDSSTLGQANTEDLQRD--LLSIECAK 685 Query: 181 TLNEGLLLNHEK 146 TLNE L L+H++ Sbjct: 686 TLNEALNLHHKR 697 >ref|XP_002298591.2| hypothetical protein POPTR_0001s36250g [Populus trichocarpa] gi|550349003|gb|EEE83396.2| hypothetical protein POPTR_0001s36250g [Populus trichocarpa] Length = 804 Score 
= 412 bits (1059), Expect = e-113 Identities = 189/240 (78%), Positives = 211/240 (87%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYL+GCDNILAKLHTKHPELCDKVGGLLA H+DDLR AP+WLSKTEEVR Sbjct: 146 EKGRPVAAYYGYLVGCDNILAKLHTKHPELCDKVGGLLAMHIDDLRALAPLWLSKTEEVR 205 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HW TNITGDIY +GWISEMYGYSFGAAE GL+HKI+++LMIYPGY+PR+GIEPIL+ Sbjct: 206 EDRTHWGTNITGDIYGAGWISEMYGYSFGAAEAGLQHKISEDLMIYPGYIPRKGIEPILI 265 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFSVGNWSFSKL HHED IVYDCGRLFPEPPYPREV+ + SD NK+RALFL++ECI Sbjct: 266 HYGLPFSVGNWSFSKLDHHEDDIVYDCGRLFPEPPYPREVRLLASDLNKKRALFLNLECI 325 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 NTLNEGLLL H +GCPKPKWS+YLSFLKS FA+LTRPK L P S E E A+ Q+ Sbjct: 326 NTLNEGLLLQHAANGCPKPKWSRYLSFLKSKTFADLTRPKFLAPGSIETKEAANQGGNQE 385 Score = 251 bits (642), Expect = 1e-64 Identities = 118/198 (59%), Positives = 143/198 (72%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN LAKLHT+HP+ CDKVGG++ H+DDLR FA +WL K+EEVRA Sbjct: 513 RGRPVSTPYDYLIGCDNELAKLHTRHPDACDKVGGVIIMHIDDLRKFAMLWLHKSEEVRA 572 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 DK H+ATNITGDIY+SGWISEMYGYSFGAAE+ LRH IN ++IYPGYVP G++ + H Sbjct: 573 DKAHYATNITGDIYASGWISEMYGYSFGAAELKLRHLINSEILIYPGYVPEPGVKYRVFH 632 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K + +V C FP+PP P + D +R LSIEC Sbjct: 633 YGLDFKVGNWSFDKANWRDTDVVNKCWAKFPDPPDPLTLDRSNEDILQRD--LLSIECGK 690 Query: 181 TLNEGLLLNHEKHGCPKP 128 TLN+ L L+H+K CP P Sbjct: 691 TLNDALELHHKKRNCPDP 708 >gb|EXC31392.1| hypothetical protein L484_017674 [Morus notabilis] Length = 811 Score = 412 bits (1058), Expect = e-113 Identities = 187/237 (78%), Positives = 204/237 (86%) Frame = -1 Query: 724 
EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYL+GCDNILA LHTKHPELCDKVGGLLA H+DDLR AP+WLSKTEEVR Sbjct: 152 EKGRPVAAYYGYLVGCDNILADLHTKHPELCDKVGGLLAMHIDDLRKLAPLWLSKTEEVR 211 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HWATN TGDIY GWISEMYGYSFGAAE GLRHKINDNLMIYPGY+PREG+EPIL+ Sbjct: 212 EDRAHWATNFTGDIYGKGWISEMYGYSFGAAEAGLRHKINDNLMIYPGYIPREGVEPILL 271 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF VGNWSFSKL HHED IVY CG+LF EPPYP+EVK ME DPNK+R+L ++ ECI Sbjct: 272 HYGLPFKVGNWSFSKLDHHEDDIVYKCGKLFTEPPYPKEVKMMEPDPNKKRSLLINTECI 331 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQE 14 NTLNEGLL H GCP PKWSKYLSFLKS+ FAELT+PK TP S E+ME+ QE Sbjct: 332 NTLNEGLLAQHAADGCPSPKWSKYLSFLKSNTFAELTKPKHPTPASLELMEDRKPQE 388 Score = 263 bits (672), Expect = 4e-68 Identities = 124/199 (62%), Positives = 149/199 (74%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 E+GRPV+ Y YLIGCDN LAKLHT+HPE CDKVGG++ H+DDLR FA +WL KTEEVR Sbjct: 514 ERGRPVSTPYEYLIGCDNELAKLHTRHPEACDKVGGVIIMHIDDLRKFALLWLHKTEEVR 573 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 ADKDH+ATNITGDIY+SGWISEMYGYSFGAAE+ LRH I+D +MIYPGYVP G++ + Sbjct: 574 ADKDHYATNITGDIYASGWISEMYGYSFGAAELKLRHLISDEIMIYPGYVPEPGVKYRVF 633 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGL F VGNWSF K K + +V C FP+PP P + + + D +R LSIECI Sbjct: 634 HYGLEFRVGNWSFDKAKWRDTDMVNRCWAKFPDPPEPSILNDTDKDIMQRD--LLSIECI 691 Query: 184 NTLNEGLLLNHEKHGCPKP 128 T+NE L L+HE+ C P Sbjct: 692 RTINEALRLHHERRKCQDP 710 >gb|EPS70488.1| hypothetical protein M569_04262 [Genlisea aurea] Length = 796 Score = 410 bits (1054), Expect = e-112 Identities = 185/241 (76%), Positives = 217/241 (90%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 +KG+PV+AYYGYLIGCDNI+AKLHTKHPELCDKVGGLL H+DDLR AP+WLSKTEE+R Sbjct: 156 
QKGKPVSAYYGYLIGCDNIVAKLHTKHPELCDKVGGLLVMHIDDLRALAPLWLSKTEEMR 215 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 DK HWATN TGDIYS+GWISEMYGYSFGAAEVGLRHKI +LM+YPGYVP+EGIEPIL+ Sbjct: 216 DDKAHWATNYTGDIYSAGWISEMYGYSFGAAEVGLRHKIYGSLMLYPGYVPQEGIEPILL 275 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF+VGNWSFSKL+HHED+IVY+CG+LFPEPPYPREV EMESD NKRR LFLS+EC+ Sbjct: 276 HYGLPFNVGNWSFSKLEHHEDAIVYNCGQLFPEPPYPREVMEMESDLNKRRGLFLSLECV 335 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 NTLNEGLLL+H HGCPKPKWSKYLSFLKS+ FA T+PK++ ++ + + E H+ E+++ Sbjct: 336 NTLNEGLLLHHAAHGCPKPKWSKYLSFLKSNTFANQTKPKRVRRETLKSL-EVHIHEEEE 394 Query: 4 V 2 V Sbjct: 395 V 395 Score = 244 bits (623), Expect = 2e-62 Identities = 112/195 (57%), Positives = 139/195 (71%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 KGRPV+ Y YLIGCDNILAK+HT HPELCDKVGG++ H+ DL+ FA +WL KTEEVRA Sbjct: 520 KGRPVSTPYNYLIGCDNILAKIHTSHPELCDKVGGVIIMHISDLKRFALLWLHKTEEVRA 579 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D HW+ N+TGD+Y SGWISEMYGYSFGAAE+ LRH ++ ++IYPGY P G+ ++H Sbjct: 580 DVSHWSRNVTGDVYESGWISEMYGYSFGAAELNLRHVVSSEILIYPGYTPVRGVNYRVLH 639 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K K +V +C FP+PP + SDP+ LS+EC N Sbjct: 640 YGLEFRVGNWSFDKAKWRHMDVVKECWAKFPDPPDKSRLD--SSDPDVLGRDLLSLECGN 697 Query: 181 TLNEGLLLNHEKHGC 137 +LNE L L+HE+ C Sbjct: 698 SLNEALKLHHERSKC 712 >ref|XP_006599063.1| PREDICTED: uncharacterized protein LOC100783769 [Glycine max] Length = 801 Score = 407 bits (1047), Expect = e-111 Identities = 189/240 (78%), Positives = 208/240 (86%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 EKGRPVAAYYGYL GCDNILA+LHTKHPELCDKVGGLLA H+DDLR APMWLSKTEEVR Sbjct: 146 EKGRPVAAYYGYLRGCDNILAQLHTKHPELCDKVGGLLAMHIDDLRALAPMWLSKTEEVR 205 Query: 544 
ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D+ HW NITGDIY GWISEMYGYSFGAAEVGLRHKINDNLMIYPGY PREG+EPIL+ Sbjct: 206 QDRAHWGVNITGDIYEKGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYAPREGVEPILL 265 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPF VGNWSFSK H ED+IVY+CG+LFP+PPYPREV ++E+DPN RR LFLSIECI Sbjct: 266 HYGLPFRVGNWSFSKADHDEDAIVYNCGQLFPQPPYPREVMQLETDPNLRRGLFLSIECI 325 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAHVQEKQD 5 N LNE LLL+H +GCPKP WSKY++FLKS FAELT+PK +TP S EMME+ VQE D Sbjct: 326 NILNEALLLHHVANGCPKPPWSKYVNFLKSKAFAELTKPKLVTPASLEMMEDT-VQEHID 384 Score = 249 bits (637), Expect = 5e-64 Identities = 121/197 (61%), Positives = 141/197 (71%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +G+PV+ Y YLIGCDN LAKLH HPE CDKVGG++ H+DDLR FA +WL KTEEVRA Sbjct: 509 RGKPVSTPYDYLIGCDNELAKLHISHPEACDKVGGVIIMHIDDLRKFALLWLHKTEEVRA 568 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 D+ H+A NITGDIY SGWISEMYGYSFGAAE+ LRH IN +MIYPGYVP GI+ + H Sbjct: 569 DRAHYARNITGDIYESGWISEMYGYSFGAAEMKLRHTINREIMIYPGYVPEPGIKYRVFH 628 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K + E +V C FPEPP + + D +R LSIEC+ Sbjct: 629 YGLEFHVGNWSFDKAEWREIDMVNRCWVKFPEPPDSSTLDHNDEDNFQRN--LLSIECMK 686 Query: 181 TLNEGLLLNHEKHGCPK 131 TLNE L L+HEK CPK Sbjct: 687 TLNEALHLHHEKRNCPK 703 >ref|NP_566148.2| uncharacterized protein [Arabidopsis thaliana] gi|18175797|gb|AAL59929.1| unknown protein [Arabidopsis thaliana] gi|20465701|gb|AAM20319.1| unknown protein [Arabidopsis thaliana] gi|332640186|gb|AEE73707.1| uncharacterized protein AT3G01720 [Arabidopsis thaliana] gi|377652301|dbj|BAL63044.1| peptidyl serine alpha-galactosyltransferase [Arabidopsis thaliana] Length = 802 Score = 405 bits (1041), Expect = e-111 Identities = 179/234 (76%), Positives = 205/234 (87%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 
E+GRP AA+YGYL+GCDN+L +LHTKHPELCDKVGGLLA H+DDLRV AP+WLSKTE+VR Sbjct: 146 ERGRPFAAHYGYLVGCDNLLVRLHTKHPELCDKVGGLLAMHIDDLRVLAPLWLSKTEDVR 205 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D HW TN+TGDIY GWISEMYGYSFGAAE GL+HKIND+LMIYPGYVPREG+EP+LM Sbjct: 206 QDTAHWTTNLTGDIYGKGWISEMYGYSFGAAEAGLKHKINDDLMIYPGYVPREGVEPVLM 265 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFS+GNWSF+KL HHED+IVYDC RLFPEPPYPREVK ME DP+KRR L LS+EC+ Sbjct: 266 HYGLPFSIGNWSFTKLDHHEDNIVYDCNRLFPEPPYPREVKIMEPDPSKRRGLILSLECM 325 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAH 23 NTLNEGL+L H ++GCPKPKW+KYLSFLKS F ELTRPK L P S ++ + H Sbjct: 326 NTLNEGLILRHAENGCPKPKWTKYLSFLKSKTFMELTRPKLLAPGSVHILPDQH 379 Score = 244 bits (624), Expect = 2e-62 Identities = 112/198 (56%), Positives = 140/198 (70%) Frame = -1 Query: 721 KGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVRA 542 +GRPV+ Y YLIGCDN LA+LHT++PE CDKVGG++ H++DLR FA WL KT+EVRA Sbjct: 509 RGRPVSTPYDYLIGCDNDLARLHTRNPEACDKVGGVIIMHIEDLRKFAMYWLLKTQEVRA 568 Query: 541 DKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILMH 362 DK+H+ +TGDIY SGWISEMYGYSFGAAE+ LRH IN +MIYPGYVP G + + H Sbjct: 569 DKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPGYVPEPGADYRVFH 628 Query: 361 YGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECIN 182 YGL F VGNWSF K ++ C FP+PP P V + ++D +R LSIEC Sbjct: 629 YGLEFKVGNWSFDKANWRNTDLINKCWAKFPDPPSPSAVHQTDNDLRQRD--LLSIECGQ 686 Query: 181 TLNEGLLLNHEKHGCPKP 128 LNE L L+H++ CP+P Sbjct: 687 KLNEALFLHHKRRNCPEP 704 >gb|AAF01555.1|AC009325_25 unknown protein [Arabidopsis thaliana] gi|6091716|gb|AAF03428.1|AC010797_4 unknown protein [Arabidopsis thaliana] Length = 814 Score = 405 bits (1041), Expect = e-111 Identities = 179/234 (76%), Positives = 205/234 (87%) Frame = -1 Query: 724 EKGRPVAAYYGYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFAPMWLSKTEEVR 545 E+GRP AA+YGYL+GCDN+L +LHTKHPELCDKVGGLLA H+DDLRV AP+WLSKTE+VR Sbjct: 146 
ERGRPFAAHYGYLVGCDNLLVRLHTKHPELCDKVGGLLAMHIDDLRVLAPLWLSKTEDVR 205 Query: 544 ADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGYVPREGIEPILM 365 D HW TN+TGDIY GWISEMYGYSFGAAE GL+HKIND+LMIYPGYVPREG+EP+LM Sbjct: 206 QDTAHWTTNLTGDIYGKGWISEMYGYSFGAAEAGLKHKINDDLMIYPGYVPREGVEPVLM 265 Query: 364 HYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNKRRALFLSIECI 185 HYGLPFS+GNWSF+KL HHED+IVYDC RLFPEPPYPREVK ME DP+KRR L LS+EC+ Sbjct: 266 HYGLPFSIGNWSFTKLDHHEDNIVYDCNRLFPEPPYPREVKIMEPDPSKRRGLILSLECM 325 Query: 184 NTLNEGLLLNHEKHGCPKPKWSKYLSFLKSDRFAELTRPKQLTPKSREMMEEAH 23 NTLNEGL+L H ++GCPKPKW+KYLSFLKS F ELTRPK L P S ++ + H Sbjct: 326 NTLNEGLILRHAENGCPKPKWTKYLSFLKSKTFMELTRPKLLAPGSVHILPDQH 379 Score = 236 bits (602), Expect = 6e-60 Identities = 112/210 (53%), Positives = 140/210 (66%), Gaps = 12/210 (5%) Frame = -1 Query: 721 KGRPVAAYY------------GYLIGCDNILAKLHTKHPELCDKVGGLLAFHMDDLRVFA 578 +GRPV+ Y YLIGCDN LA+LHT++PE CDKVGG++ H++DLR FA Sbjct: 509 RGRPVSTPYESPLKPSLFLLFSYLIGCDNDLARLHTRNPEACDKVGGVIIMHIEDLRKFA 568 Query: 577 PMWLSKTEEVRADKDHWATNITGDIYSSGWISEMYGYSFGAAEVGLRHKINDNLMIYPGY 398 WL KT+EVRADK+H+ +TGDIY SGWISEMYGYSFGAAE+ LRH IN +MIYPGY Sbjct: 569 MYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGYSFGAAELNLRHSINKEIMIYPGY 628 Query: 397 VPREGIEPILMHYGLPFSVGNWSFSKLKHHEDSIVYDCGRLFPEPPYPREVKEMESDPNK 218 VP G + + HYGL F VGNWSF K ++ C FP+PP P V + ++D + Sbjct: 629 VPEPGADYRVFHYGLEFKVGNWSFDKANWRNTDLINKCWAKFPDPPSPSAVHQTDNDLRQ 688 Query: 217 RRALFLSIECINTLNEGLLLNHEKHGCPKP 128 R LSIEC LNE L L+H++ CP+P Sbjct: 689 RD--LLSIECGQKLNEALFLHHKRRNCPEP 716