BLASTX nr result
ID: Achyranthes22_contig00002195
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Achyranthes22_contig00002195 (3739 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002276602.1| PREDICTED: uncharacterized protein LOC100255... 731 0.0 gb|EXC10838.1| hypothetical protein L484_003082 [Morus notabilis] 720 0.0 ref|XP_002297999.2| hypothetical protein POPTR_0001s10090g [Popu... 720 0.0 ref|XP_002509888.1| transcription factor, putative [Ricinus comm... 717 0.0 gb|EMJ12491.1| hypothetical protein PRUPE_ppa002459mg [Prunus pe... 717 0.0 ref|XP_003525359.1| PREDICTED: uncharacterized protein LOC100776... 716 0.0 ref|XP_006476466.1| PREDICTED: uncharacterized protein LOC102631... 714 0.0 ref|XP_006439437.1| hypothetical protein CICLE_v10019140mg [Citr... 714 0.0 ref|XP_003533040.1| PREDICTED: uncharacterized protein LOC100784... 705 0.0 gb|ABK96565.1| unknown [Populus trichocarpa x Populus deltoides] 704 0.0 gb|ESW32119.1| hypothetical protein PHAVU_002G294600g [Phaseolus... 698 0.0 ref|XP_004298851.1| PREDICTED: uncharacterized protein LOC101298... 696 0.0 ref|XP_006385780.1| hypothetical protein POPTR_0003s13400g [Popu... 696 0.0 ref|XP_004143880.1| PREDICTED: uncharacterized protein LOC101212... 685 0.0 ref|XP_006343911.1| PREDICTED: cell wall protein AWA1-like isofo... 682 0.0 ref|XP_004503653.1| PREDICTED: uncharacterized protein LOC101508... 677 0.0 ref|XP_004245556.1| PREDICTED: uncharacterized protein LOC101268... 666 0.0 gb|EOY24891.1| Uncharacterized protein isoform 1 [Theobroma cacao] 647 0.0 ref|XP_006391666.1| hypothetical protein EUTSA_v10023333mg [Eutr... 643 0.0 ref|NP_176596.2| uncharacterized protein [Arabidopsis thaliana] ... 620 e-174 >ref|XP_002276602.1| PREDICTED: uncharacterized protein LOC100255255 [Vitis vinifera] Length = 684 Score = 731 bits (1887), Expect = 0.0 Identities = 379/639 (59%), Positives = 437/639 (68%), Gaps = 17/639 (2%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF +++ SNAFKN+G+S+ V G+ N+ DT LRL + SKGVKRKW D Sbjct: 8 LGFAANHSSNAFKNLGNSMQVGGARANYCMDTILRLDSPSSSIPDLTASKGVKRKWSLID 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G G V ATACT +SS KE E ESSMDL+LDFTLHLG++ Sbjct: 68 GTRGQQVGSSLSLGLGRSSSSSDSKGSSATACTTMSSAKENEEESSMDLELDFTLHLGNE 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVTPLVVGTTQ 1911 K+PS KK A S+ K + Q ++DLELSLST ESDITSI PL V Sbjct: 128 KTPSTKKYAGSSLKALELQTDIDLELSLSTGPAESDITSIHASSTLLHAMDMPLGVARAA 187 Query: 1912 FVDNETASSHWKVDNTSPVSLFPKAGESFVLKS--IPRKVDLVSVISDVSASTLSVPKSS 2085 +D + SS WK + SL + L S IP+++D S + D+S+S ++ PKSS Sbjct: 188 HLDEGSTSSPWKPGTSLSSSLHAPLIKKTSLFSHQIPQRMDPTSPVPDLSSSIITTPKSS 247 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 VTCTSG K CQ GCGKGARGASGLCIAHGGGRRCQ+ GC+KGAEGR Sbjct: 248 VTCTSGITQQQPQRSTSS--KTCQFKGCGKGARGASGLCIAHGGGRRCQKTGCHKGAEGR 305 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CKAHGGGRRCEFLGCTKSAEGRTD+CIA +RAARGKSGLCIRHGGG Sbjct: 306 TVYCKAHGGGRRCEFLGCTKSAEGRTDYCIAHGGGRRCSHEGCTRAARGKSGLCIRHGGG 365 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQ+E CT+SAEGLSGLCISHGGGRRCQ+PAC+KGAQGSTM+CKAHGGGKRCTV GCTK Sbjct: 366 KRCQKENCTKSAEGLSGLCISHGGGRRCQFPACTKGAQGSTMYCKAHGGGKRCTVPGCTK 425 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGSTPFCKGHGGGKRCSF G CPKSVHGGT+FCVAHGGGKRCA+ CTKSARGRTD Sbjct: 426 GAEGSTPFCKGHGGGKRCSFQGGGICPKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTD 485 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 +CVRHGGGKRCK EGCGKSAQGSTDFCKAHGGGKRC+WGQ GS +G+Q DGPC SFARGK Sbjct: 486 YCVRHGGGKRCKSEGCGKSAQGSTDFCKAHGGGKRCSWGQVGSQFGSQ-DGPCSSFARGK 544 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDPNSA------DMI---DASAGVMSVDGMGF-G 3126 TGLC H ALVQDKRVHGG TL +Q P+ + D++ D + +M + G Sbjct: 545 TGLCASHNALVQDKRVHGGATLAHTVQIPSPSKPEKMKDVVATEDMNVDIMKMMGSSIVN 604 Query: 3127 PVGLSGEEWTASNVVRAQLSCMEKEPRTTP--SPEGRVH 3237 P G +G E + + L E P P +PEGRVH Sbjct: 605 PAGWTGLELKQVGLPQPHLPAREVRPSPVPVLAPEGRVH 643 >gb|EXC10838.1| hypothetical protein L484_003082 [Morus notabilis] Length = 664 Score = 720 bits (1858), Expect = 0.0 Identities = 355/570 (62%), Positives = 408/570 (71%), Gaps = 6/570 (1%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF ++ SNAFK +G+S+H + + ADT LRL G + + +SKG KRKW D Sbjct: 8 LGFAANYSSNAFKVLGNSMHFGVARVETRADTILRLNSPGASLPFMSSSKGTKRKWSLVD 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXX-ATACTAISSVKETEVESSMDLDLDFTLHLGS 1728 ++ VD ATACT +SS KE + ESSMDL+LDFTLHLG+ Sbjct: 68 PSVCPQVDSSLSLGLLGRSSSSSDSKGSSATACTTMSSAKEADEESSMDLELDFTLHLGN 127 Query: 1729 DKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGT 1905 +K SPKK +SN K + Q +DLELSLST ES+ITS+ + Q GV P Sbjct: 128 EKLSSPKKHTISNVKALEVQPKVDLELSLSTGPSESEITSVHLCSNSLQSGVEMPPSAFQ 187 Query: 1906 TQFVDNETASSHWKVD-NTSPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKS 2082 + D + S WK + P+ +SF+ K +P+K+D ++ D+S+S L+ PKS Sbjct: 188 SNSADEGSTSCQWKQEIAVQPLPTSANVRDSFLFKQVPQKIDPSPIVLDLSSSVLTAPKS 247 Query: 2083 SVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEG 2262 SVTCTSG K C+ GCGKGARGASG CI+HGGGRRCQ+ GC+KGAEG Sbjct: 248 SVTCTSGLTQQQQTQLRSTSSKTCEVEGCGKGARGASGRCISHGGGRRCQKQGCHKGAEG 307 Query: 2263 RTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGG 2442 RT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGKSGLCIRHGG Sbjct: 308 RTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGKSGLCIRHGG 367 Query: 2443 GKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCT 2622 GKRCQRE CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT GCT Sbjct: 368 GKRCQRENCTKSAEGLSGLCISHGGGRRCQALGCTKGAQGSTMFCKAHGGGKRCTAPGCT 427 Query: 2623 KGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRT 2793 KGAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARGRT Sbjct: 428 KGAEGSTPFCKGHGGGKRCAFQGGGVCSKSVHGGTNFCVAHGGGKRCAMPECTKSARGRT 487 Query: 2794 DFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARG 2973 D+CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG PGS YGNQ GPC SFARG Sbjct: 488 DYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGNQPTGPCNSFARG 547 Query: 2974 KTGLCTMHGALVQDKRVHGGITLGPLLQDP 3063 KTGLC +H LVQDKRVHGG+TLGPL+QDP Sbjct: 548 KTGLCALHSGLVQDKRVHGGVTLGPLVQDP 577 >ref|XP_002297999.2| hypothetical protein POPTR_0001s10090g [Populus trichocarpa] gi|550346937|gb|EEE82804.2| hypothetical protein POPTR_0001s10090g [Populus trichocarpa] Length = 655 Score = 720 bits (1858), Expect = 0.0 Identities = 372/601 (61%), Positives = 422/601 (70%), Gaps = 10/601 (1%) Frame = +1 Query: 1345 FMGTRNARVMGFCSDNLSNAFKNVGDSLHV-VG-SSINHGADTALRLQPIGPNDAYFPTS 1518 FM R R +GF +D SNA KN+G S+ V VG + ADT LRL +G + Y S Sbjct: 6 FMDNR-LRNLGFAADCPSNASKNLGSSMPVGVGVAGTKFSADTVLRLDSLGSSVPYGSPS 64 Query: 1519 KGVKRKWGCRDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDL 1698 KG+KRK DG+MGL V ATACTA+SS KET+ ESSMDL Sbjct: 65 KGIKRKRNLIDGSMGLNVGSSLSLGLRRSSSSSDSKGSSATACTAMSSAKETDEESSMDL 124 Query: 1699 DLDFTLHLGSDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQH 1878 +LDF+LHLG++K SPKKPA SN K + Q +DLELSLST ESDITSI + + Sbjct: 125 ELDFSLHLGNEKMSSPKKPAGSNLKGMELQPRVDLELSLSTGPSESDITSIHPHSSSLEF 184 Query: 1879 GVT-PLVVGTTQFVDNETASSHWKVDNTSPVSLFP----KAGESFVLKSIPRKVDLVSVI 2043 G+ PL +G VD S WK S ++L P + E+ IPR D S Sbjct: 185 GMDMPLAMGGASNVDERLTSDSWK----SGIALLPLQISQNKEASFFNQIPRTRDPTSSF 240 Query: 2044 SDVSASTLSVPKSSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGR 2223 D S+S ++ PKSSVTCTSG KLCQ GCGKGARGASG CI+HGGGR Sbjct: 241 PDHSSSVIT-PKSSVTCTSGITQQQQPYQRSASSKLCQVEGCGKGARGASGRCISHGGGR 299 Query: 2224 RCQRPGCNKGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRA 2403 RCQ+ GC+KGAEGRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RA Sbjct: 300 RCQKAGCHKGAEGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSREGCARA 359 Query: 2404 ARGKSGLCIRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKA 2583 ARGKSGLCIRHGGGKRCQ+E CT+SAEGLSGLCISHGGGRRCQ+ C+KGAQGSTM CKA Sbjct: 360 ARGKSGLCIRHGGGKRCQKENCTKSAEGLSGLCISHGGGRRCQFSGCTKGAQGSTMLCKA 419 Query: 2584 HGGGKRCTVEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRC 2754 HGGGKRCT GCTKGAEGSTPFCKGHGGGKRC+F C KSVHGGT+FCVAHGGGKRC Sbjct: 420 HGGGKRCTAPGCTKGAEGSTPFCKGHGGGKRCAFQRGGVCSKSVHGGTNFCVAHGGGKRC 479 Query: 2755 AITGCTKSARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYG 2934 A+ CTKSARGRTDFCVRHGGGKRCK EGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Sbjct: 480 AVPECTKSARGRTDFCVRHGGGKRCKVEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYG 539 Query: 2935 NQADGPCGSFARGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDASAGVMSVDG 3114 N GPC SFARGKTGLC +H LVQDKRVHGG+TLGP++QDP + + + V++V+ Sbjct: 540 NLPSGPCTSFARGKTGLCALHSGLVQDKRVHGGVTLGPMVQDPKISQS-EKTKEVVTVED 598 Query: 3115 M 3117 M Sbjct: 599 M 599 >ref|XP_002509888.1| transcription factor, putative [Ricinus communis] gi|223549787|gb|EEF51275.1| transcription factor, putative [Ricinus communis] Length = 677 Score = 717 bits (1852), Expect = 0.0 Identities = 368/635 (57%), Positives = 424/635 (66%), Gaps = 13/635 (2%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF + SNAFK +G S V G + ADT LRL G + + SKG+KRKW D Sbjct: 8 LGFAATCPSNAFKILGSSTLVGGPVAEYCADTVLRLDSPGSSVSCTSQSKGIKRKWNFID 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G MG ++ ATACT +SS KET+ ESS+DL+LDF LHLG++ Sbjct: 68 GTMGQHIGSSLSLGLGCSSSSSDSKGSSATACTTMSSAKETDEESSIDLELDFALHLGNE 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVTP--LVVGT 1905 K SPKK A SN K + Q +DLELSLST ESD+TSI+ + G+ + G Sbjct: 128 KMSSPKKSANSNLKGLELQPKVDLELSLSTGPSESDVTSIYPSSTSLDFGMEMRHAIYGA 187 Query: 1906 TQFVDNETASSHWKVDNTSPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKSS 2085 + VD T S WK ++L +SF +P+ D +SV+ D+S+S ++ P SS Sbjct: 188 SS-VDEGTISCGWKTG----IALLASQDKSFFFNQVPKTSDPISVLPDLSSSVITAPVSS 242 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 VTCTSG KLC+ GCGKGARGASG CI+HGGGRRCQ+PGC+KGAEGR Sbjct: 243 VTCTSGITQRQQPHQRSSNSKLCEVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGR 302 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CK+HGGGRRCEFLGCTKSAEGRTDFCIA +RAARGKSGLCIRHGGG Sbjct: 303 TVYCKSHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSSEGCTRAARGKSGLCIRHGGG 362 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQ+E CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT GCTK Sbjct: 363 KRCQKENCTKSAEGLSGLCISHGGGRRCQSLGCTKGAQGSTMFCKAHGGGKRCTAPGCTK 422 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARGRTD Sbjct: 423 GAEGSTPFCKGHGGGKRCAFQGGGVCTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTD 482 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 FCVRHGGGKRCK EGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q PC SFARGK Sbjct: 483 FCVRHGGGKRCKNEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGVQPTAPCNSFARGK 542 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDAS----AGVMSVDGMG--FGPVGL 3138 GLC +H LVQDKRVHGG TLGP++Q+P S A M+VD +G G Sbjct: 543 KGLCALHSGLVQDKRVHGGATLGPIIQEPKSIQTEKMKEVMIAEDMNVDNLGSSMGASSS 602 Query: 3139 SGEEWTASNVVRAQLSCMEKEPRTTP--SPEGRVH 3237 + V A + E + P PEGRVH Sbjct: 603 KSTDLKHFGVPNAHIPAGEAGMSSMPVFVPEGRVH 637 >gb|EMJ12491.1| hypothetical protein PRUPE_ppa002459mg [Prunus persica] Length = 670 Score = 717 bits (1851), Expect = 0.0 Identities = 359/627 (57%), Positives = 422/627 (67%), Gaps = 5/627 (0%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF ++ SNAF VG+S+ V G+ ADT LRL G + A + +G+KRKW Sbjct: 8 LGFAANFSSNAFNIVGNSMQVGGAGSESCADTILRLNSPGSSMACMSSLQGIKRKWSSIG 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 GN+ + ATACT +SS KET+ ESSMD +LDF LHLG++ Sbjct: 68 GNVTEHFGSSLSLGLGRSTSSSDSKGSSATACTTMSSAKETDEESSMDFELDFALHLGNE 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGTT 1908 K PSPKKPA S + + Q +DLELSLST ES+IT + + G+ L G Sbjct: 128 KVPSPKKPANSKLRALELQPKVDLELSLSTGLSESEITCVNPSSTSPLSGMEMALAAGGA 187 Query: 1909 QFVDNETASSHWKVDNT-SPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKSS 2085 Q D + HWK P+ G SF+ K +P+K+D +++ ++S+S L+ P SS Sbjct: 188 QNADEGSTPFHWKRGIAIQPLQTSFNPGASFLFKQVPQKIDSPAIVPELSSSILTTPNSS 247 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 V+C+SG K CQ GCGKGARGASG CI+HGGGRRCQ+ GC+KGAEGR Sbjct: 248 VSCSSGMTQKQQSQHRSSNSKTCQVEGCGKGARGASGRCISHGGGRRCQKSGCHKGAEGR 307 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGKSGLCIRHGGG Sbjct: 308 TVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGKSGLCIRHGGG 367 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQRE CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT GCTK Sbjct: 368 KRCQRENCTKSAEGLSGLCISHGGGRRCQAIGCTKGAQGSTMFCKAHGGGKRCTAPGCTK 427 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGSTP+CKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARGRTD Sbjct: 428 GAEGSTPYCKGHGGGKRCAFQGGGHCTKSVHGGTNFCVAHGGGKRCAMPECTKSARGRTD 487 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 +CVRHGGGKRCK EGCGKSAQGSTDFCKAHGGGKRC+WG PGS +G QA GPC SFARGK Sbjct: 488 YCVRHGGGKRCKSEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSVFGGQAIGPCNSFARGK 547 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDASAGVMSVDGMGFGPVGLSGEEWT 3156 TGLC +H LVQDKRVHGGITLGP++QDP D V + D M + + T Sbjct: 548 TGLCALHSGLVQDKRVHGGITLGPMVQDPKLGKS-DKKKEVATADDMNVDVMNIGSSIRT 606 Query: 3157 ASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 ++ + + + PEGRVH Sbjct: 607 SATGTCSDMKQAGQSSAPVLIPEGRVH 633 >ref|XP_003525359.1| PREDICTED: uncharacterized protein LOC100776565 isoform X1 [Glycine max] gi|571456987|ref|XP_006580546.1| PREDICTED: uncharacterized protein LOC100776565 isoform X2 [Glycine max] Length = 683 Score = 716 bits (1847), Expect = 0.0 Identities = 373/647 (57%), Positives = 430/647 (66%), Gaps = 17/647 (2%) Frame = +1 Query: 1348 MGTRNARVMGFCSDNLSNAFKNVGDSLHVVGSSIN-HGADTALRLQPIGPN-DAYFPTSK 1521 M R +GF +++ +NAFK +G S+ + G + HG DT LRL G + P+SK Sbjct: 1 MDARFKNFLGFAANHSANAFKILGSSMQIEGRGADYHGTDTILRLDSPGSSIPTSVPSSK 60 Query: 1522 GVKRKWGCRDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLD 1701 G KRKW DG MG VD A ACTA+SS K+ + ESSMD++ Sbjct: 61 GTKRKWDLIDGCMGQRVDSSLSLGLGRSSSSSDSKGSSAAACTAMSSAKDIDEESSMDIE 120 Query: 1702 LDFTLHLGSDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHG 1881 LDF LHLGS+K S KKP SN K + Q DLELSLST ESDITS+ Q Sbjct: 121 LDFFLHLGSEKVQSHKKPVNSNLKTLELQPKFDLELSLSTGPCESDITSVHLNPSPLQLN 180 Query: 1882 VT-PLVVGTTQFVDNETASSHWKVDNTSPVS-LFPKAGESFVLKSIPRKVDLVSVISDVS 2055 + PL TQ D + S W+ P S + G +F+L ++ D ++ D+S Sbjct: 181 MEIPLTFSGTQNTDEGSTSCSWQPGIVLPSSKMSSNTGTNFLLSQSSKQFDHSPIVVDLS 240 Query: 2056 ASTLSVPKSSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQR 2235 ++ PKSSVTCTSG K CQ GCGKGARGASG CI+HGGGRRCQ+ Sbjct: 241 STG---PKSSVTCTSGLTQQQQPLHRPGNSKTCQVEGCGKGARGASGRCISHGGGRRCQK 297 Query: 2236 PGCNKGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGK 2415 PGC+KGAEGRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGK Sbjct: 298 PGCHKGAEGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGK 357 Query: 2416 SGLCIRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGG 2595 SGLCIRHGGGKRCQRE CT+SAEGLSGLCISHGGGRRCQ P C+KGAQGSTMFCKAHGGG Sbjct: 358 SGLCIRHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQVPGCTKGAQGSTMFCKAHGGG 417 Query: 2596 KRCTVEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITG 2766 KRCT GCTKGAEGSTP+CKGHGGGKRC++ G C KSVHGGT+FCVAHGGGKRCA+ G Sbjct: 418 KRCTAPGCTKGAEGSTPYCKGHGGGKRCTYQGGGVCTKSVHGGTNFCVAHGGGKRCAVPG 477 Query: 2767 CTKSARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQAD 2946 CTKSARGRTD CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q D Sbjct: 478 CTKSARGRTDHCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGIQQD 537 Query: 2947 GPCGSFARGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSAD-------MIDASAGV-M 3102 GPC SFARGKTGLC +H LV DKRVHGGI+LG ++QDP+S+ +ID + V M Sbjct: 538 GPCNSFARGKTGLCALHSGLVHDKRVHGGISLGSVVQDPHSSKADELKQVLIDKNMDVNM 597 Query: 3103 SVDGMGFGPVGLSGEEWTASNVVRAQLSCME--KEPRTTPSPEGRVH 3237 G G + ++ A +S E P + PEGRVH Sbjct: 598 MKIGSSLG-AAATCSDFEQLEAATAHVSVKEGGHLPMSVVVPEGRVH 643 >ref|XP_006476466.1| PREDICTED: uncharacterized protein LOC102631154 isoform X1 [Citrus sinensis] gi|568845203|ref|XP_006476467.1| PREDICTED: uncharacterized protein LOC102631154 isoform X2 [Citrus sinensis] Length = 684 Score = 714 bits (1842), Expect = 0.0 Identities = 351/570 (61%), Positives = 395/570 (69%), Gaps = 5/570 (0%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 + F ++ NAFK G S G+ G DT LRL G ++ + SKG+KRKW D Sbjct: 8 LSFAANYSLNAFKTSGSSRQAGGAGAEDGTDTILRLDSPGSSNPHISASKGIKRKWSLID 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G++ V ATACT SS KE E ESSMDLDLDFTLHLG+D Sbjct: 68 GSVHQQVGSTLSLGLGRSSSSSDSKGSSATACTTTSSAKENEEESSMDLDLDFTLHLGND 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGTT 1908 K P+PKK A SN K V Q +DL LSLST PES ITS+ G+ PL+ G T Sbjct: 128 KMPNPKKSAYSNMKGVELQPKVDLMLSLSTGSPESGITSLHPSSSLLHFGMEMPLLAGGT 187 Query: 1909 QFVDNETASSHWKVD-NTSPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKSS 2085 D + S WK + P+ P F R DL + + D+S+S ++ P+SS Sbjct: 188 LNADEGSTSCGWKTGVSLPPLQTAPNKESRFFFNCALRTNDLTANVHDLSSSVVTTPRSS 247 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 VTCTSG K CQ GCGKGARGASG CI+HGGGRRCQ+ GC+KGAEGR Sbjct: 248 VTCTSGITQQHQRLQRSSSSKTCQVEGCGKGARGASGRCISHGGGRRCQKLGCHKGAEGR 307 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CKAHGGGRRCE+LGCTKSAEGRTD+CIA +RAARGKSGLCIRHGGG Sbjct: 308 TVYCKAHGGGRRCEYLGCTKSAEGRTDYCIAHGGGRRCSHEGCTRAARGKSGLCIRHGGG 367 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQ+E CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT GCTK Sbjct: 368 KRCQKENCTKSAEGLSGLCISHGGGRRCQASGCTKGAQGSTMFCKAHGGGKRCTAPGCTK 427 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGST FCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARGRTD Sbjct: 428 GAEGSTSFCKGHGGGKRCAFQGGGVCTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTD 487 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 +CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q+ GPC SFARGK Sbjct: 488 YCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGPQSTGPCNSFARGK 547 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDPN 3066 TGLC +H LVQDKRVHGG TLGP++ DPN Sbjct: 548 TGLCALHSGLVQDKRVHGGFTLGPVVLDPN 577 >ref|XP_006439437.1| hypothetical protein CICLE_v10019140mg [Citrus clementina] gi|567893900|ref|XP_006439438.1| hypothetical protein CICLE_v10019140mg [Citrus clementina] gi|557541699|gb|ESR52677.1| hypothetical protein CICLE_v10019140mg [Citrus clementina] gi|557541700|gb|ESR52678.1| hypothetical protein CICLE_v10019140mg [Citrus clementina] Length = 684 Score = 714 bits (1842), Expect = 0.0 Identities = 351/570 (61%), Positives = 396/570 (69%), Gaps = 5/570 (0%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 + F ++ NAFK G S G+ G DT LRL G ++ + SKG+KRKW D Sbjct: 8 LSFAANYSLNAFKTSGSSRQAGGAGAEDGTDTILRLDSPGSSNPHISASKGIKRKWSLID 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G++ V ATACT SS KE E ESSMDLDLDFTLHLG+D Sbjct: 68 GSVHQQVGSTLSLGLGRPSSSSDSKGSSATACTTTSSAKENEEESSMDLDLDFTLHLGND 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGTT 1908 K P+PKK A SN K V Q +DL LSLST PES ITS+ G+ PL+ G T Sbjct: 128 KMPNPKKSAYSNMKGVELQPKVDLVLSLSTGSPESGITSLHPSSSLLHFGMEMPLLAGGT 187 Query: 1909 QFVDNETASSHWKVD-NTSPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKSS 2085 D+ + S WK + P+ P F R DL + + D+S+S ++ P+SS Sbjct: 188 LNADDGSTSCGWKTGVSLPPLQTAPNKESRFFFDCALRTNDLTANVHDLSSSVVTTPRSS 247 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 VTCTSG K CQ GCGKGARGASG CI+HGGGRRCQ+ GC+KGAEGR Sbjct: 248 VTCTSGITQQHQRLQRSSSSKTCQVEGCGKGARGASGRCISHGGGRRCQKLGCHKGAEGR 307 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CKAHGGGRRCE+LGCTKSAEGRTD+CIA +RAARGKSGLCIRHGGG Sbjct: 308 TVYCKAHGGGRRCEYLGCTKSAEGRTDYCIAHGGGRRCSHEGCTRAARGKSGLCIRHGGG 367 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQ+E CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT GCTK Sbjct: 368 KRCQKENCTKSAEGLSGLCISHGGGRRCQASGCTKGAQGSTMFCKAHGGGKRCTAPGCTK 427 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGST FCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARGRTD Sbjct: 428 GAEGSTSFCKGHGGGKRCAFQGGGVCTKSVHGGTNFCVAHGGGKRCAVPECTKSARGRTD 487 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 +CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q+ GPC SFARGK Sbjct: 488 YCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGPQSTGPCNSFARGK 547 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDPN 3066 TGLC +H LVQDKRVHGG TLGP++ DPN Sbjct: 548 TGLCALHSGLVQDKRVHGGFTLGPVVLDPN 577 >ref|XP_003533040.1| PREDICTED: uncharacterized protein LOC100784837 isoform 1 [Glycine max] Length = 682 Score = 705 bits (1819), Expect = 0.0 Identities = 365/645 (56%), Positives = 422/645 (65%), Gaps = 15/645 (2%) Frame = +1 Query: 1348 MGTRNARVMGFCSDNLSNAFKNVGDSLHVVGSSIN-HGADTALRLQPIGPN-DAYFPTSK 1521 M R +GF +++ +NAFK +G+S+ V G N HG DT LRL G + P+ K Sbjct: 1 MDARFKNFLGFAANHSANAFKILGNSMQVEGRGANYHGTDTILRLDSPGSSIPTSVPSYK 60 Query: 1522 GVKRKWGCRDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLD 1701 G KRKW DG MG VD A ACTA+SS K+ + ESS+D++ Sbjct: 61 GTKRKWDLIDGCMGQRVDSSLSLGLGRSSSSSDSKGSSAAACTAMSSAKDIDEESSLDIE 120 Query: 1702 LDFTLHLGSDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHG 1881 LDF+LHLG +K S KKP + K + Q DLELSLST ESDITS+ Q Sbjct: 121 LDFSLHLGCEKVHSQKKPVNPSLKTMELQPKFDLELSLSTGPCESDITSVHLNPSPLQLN 180 Query: 1882 VT-PLVVGTTQFVDNETASSHWKVDNTSPVS-LFPKAGESFVLKSIPRKVDLVSVISDVS 2055 + PL TQ D + S WK P S G +F+L ++ D ++ ++S Sbjct: 181 MEMPLAFSGTQNTDEGSTSCSWKPGIALPSSKTSSNTGTNFLLNQSSKQFDHSPIVVELS 240 Query: 2056 ASTLSVPKSSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQR 2235 ++ PKS VTC S K CQ GCGKGARGASG CI+HGGGRRCQ+ Sbjct: 241 STR---PKSLVTCISELTQQQQALHRPSNSKTCQVEGCGKGARGASGRCISHGGGRRCQK 297 Query: 2236 PGCNKGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGK 2415 PGC+KGAEGRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGK Sbjct: 298 PGCHKGAEGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCNHEGCTRAARGK 357 Query: 2416 SGLCIRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGG 2595 SGLCIRHGGGKRCQRE CT+SAEGLSGLCISHGGGRRCQ P C+KGAQGSTMFCKAHGGG Sbjct: 358 SGLCIRHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQAPGCTKGAQGSTMFCKAHGGG 417 Query: 2596 KRCTVEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITG 2766 KRCT GCTKGAEGSTP+CKGHGGGKRC++ G C KSVHGGT+FCVAHGGGKRCA+ G Sbjct: 418 KRCTAPGCTKGAEGSTPYCKGHGGGKRCTYQGGGVCTKSVHGGTNFCVAHGGGKRCAVPG 477 Query: 2767 CTKSARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQAD 2946 CTKSARGRTD CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q D Sbjct: 478 CTKSARGRTDHCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGIQQD 537 Query: 2947 GPCGSFARGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSAD-------MIDASAGV-M 3102 GPC SFARGKTGLC +H LV DKRVHGGI+LG ++QDP+S+ ++D + G+ M Sbjct: 538 GPCNSFARGKTGLCALHSGLVHDKRVHGGISLGSVVQDPHSSKTDELKQVLVDKNMGIDM 597 Query: 3103 SVDGMGFGPVGLSGEEWTASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 G G S E + P + PEGRVH Sbjct: 598 MKIGSSLGTATCSDFEQFEAATAHVSAKEGSHLPVSVAVPEGRVH 642 >gb|ABK96565.1| unknown [Populus trichocarpa x Populus deltoides] Length = 681 Score = 704 bits (1817), Expect = 0.0 Identities = 358/608 (58%), Positives = 413/608 (67%), Gaps = 11/608 (1%) Frame = +1 Query: 1366 RVMGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGC 1545 R +GF +D SNAFK +G SL V + + ADT LRL G + Y +SKG+KRKW Sbjct: 6 RNLGFAADYPSNAFKILGSSLSVGAAGTKYSADTVLRLDSPGSSVPYGFSSKGIKRKWNL 65 Query: 1546 RDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLG 1725 +G+MG V ATACT++SS ET+ ESSMDL+ F+LHLG Sbjct: 66 INGSMGQNVGSSLSLGLGRSSSSSDSKGSSATACTSMSSAIETDEESSMDLE--FSLHLG 123 Query: 1726 SDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVG 1902 ++K SPKKPA S K G + +DLEL LST ESD+TSI + + G P +G Sbjct: 124 NEKMLSPKKPARSYLK--GREPKVDLELGLSTGPSESDVTSIHPPSSSLEFGFDMPSAMG 181 Query: 1903 TTQFVDNETASSHWKVDNTS-PVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPK 2079 V+ + S WK T P+ + SF IPR D D S+S ++ PK Sbjct: 182 GASNVNEGSTSCSWKSGITLLPLQISSNKEASFFFNQIPRTRDPTPSFPDHSSSVITTPK 241 Query: 2080 SSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAE 2259 SSVTCTSG KLCQ GCGKGARGASG CI+HGGGRRC++ GC KGAE Sbjct: 242 SSVTCTSGISQQQQPYQRGTSLKLCQVEGCGKGARGASGRCISHGGGRRCRKAGCLKGAE 301 Query: 2260 GRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHG 2439 GRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGKSGLCIRHG Sbjct: 302 GRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGKSGLCIRHG 361 Query: 2440 GGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGC 2619 GGKRCQ+E CT+SAEGLSGLCISHGGGRRCQ+ C+KGAQGSTMFCKAHGGGKRCT GC Sbjct: 362 GGKRCQKENCTKSAEGLSGLCISHGGGRRCQFLGCTKGAQGSTMFCKAHGGGKRCTAPGC 421 Query: 2620 TKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGR 2790 +KGAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARGR Sbjct: 422 SKGAEGSTPFCKGHGGGKRCAFQGGGVCTKSVHGGTNFCVAHGGGKRCAVPECTKSARGR 481 Query: 2791 TDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFAR 2970 T FCVRHGGGKRCKFEGCGKSAQGSTD+CKAHGGGKRC+WG PGS YGNQ GPC SFAR Sbjct: 482 THFCVRHGGGKRCKFEGCGKSAQGSTDYCKAHGGGKRCSWGHPGSEYGNQPTGPCNSFAR 541 Query: 2971 GKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSA------DMIDASAGVMSVDGMGFGPV 3132 GKTGLC +H LV DKRVHGG+TLGP++QDP + +++ A + + MG Sbjct: 542 GKTGLCALHSGLVLDKRVHGGVTLGPMVQDPKISQSEKMKEVVTAEDMTIDIAKMGTSAA 601 Query: 3133 GLSGEEWT 3156 +G T Sbjct: 602 ASTGRTTT 609 >gb|ESW32119.1| hypothetical protein PHAVU_002G294600g [Phaseolus vulgaris] Length = 662 Score = 698 bits (1801), Expect = 0.0 Identities = 368/647 (56%), Positives = 427/647 (65%), Gaps = 17/647 (2%) Frame = +1 Query: 1348 MGTRNARVMGFCSDNLSNAFKNVGDSLHVVGSSINH-GADTALRLQPIGPN-DAYFPTSK 1521 M R + +GF +++ +NAFK +G+S+ G +H G DT LRL G + P+SK Sbjct: 1 MDVRLKQFLGFAANHSANAFKILGNSMQAEGRGADHYGTDTILRLDSPGSSIPTGVPSSK 60 Query: 1522 GVKRKWGCRDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLD 1701 G KRKW DG G VD A ACTA+SS K+T+ ESSMD++ Sbjct: 61 GTKRKWDLIDGCTGQKVDSSLSLGLGRSTCSSDSKGSSAAACTAMSSAKDTDEESSMDIE 120 Query: 1702 LDFTLHLGSDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHG 1881 LDF+LHLG +K S KKP S+ K + Q DLELSLST ESDITS+ Q Sbjct: 121 LDFSLHLGCEKVQSQKKPVNSSLKTLELQPKFDLELSLSTGPCESDITSVHLNPSPLQLN 180 Query: 1882 VT-PLVVGTTQFVDNETASSHWKVDNTSPVS-LFPKAGESFVLKSIPRKVDLVSVISDVS 2055 + PL TQ D + S WK P S G F+L ++ D V+ D+S Sbjct: 181 MEMPLTFSGTQNTDEGSTSCSWKPGIVLPSSKTSSNTGTGFLLNQALKQFDRSPVVLDLS 240 Query: 2056 ASTLSVPKSSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQR 2235 ++ PKSSVTCTS L K CQ GCGKGARGASG CI+HGGGRRCQ+ Sbjct: 241 STR---PKSSVTCTSE-LTQQQQPLRPSNSKTCQVEGCGKGARGASGRCISHGGGRRCQK 296 Query: 2236 PGCNKGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGK 2415 PGC KGAEGRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGK Sbjct: 297 PGCLKGAEGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGK 356 Query: 2416 SGLCIRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGG 2595 SGLCIRHGGGKRCQRE CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGG Sbjct: 357 SGLCIRHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQATGCTKGAQGSTMFCKAHGGG 416 Query: 2596 KRCTVEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITG 2766 KRCT GCTKGAEGSTP+CKGHGGGKRC++ G C KSVHGGT+FCVAHGGGKRCA+ G Sbjct: 417 KRCTAPGCTKGAEGSTPYCKGHGGGKRCTYQGGGVCTKSVHGGTNFCVAHGGGKRCAVPG 476 Query: 2767 CTKSARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQAD 2946 CTKSARGRTD CVRHGGGKRCKF CGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q D Sbjct: 477 CTKSARGRTDHCVRHGGGKRCKFIECGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGIQQD 536 Query: 2947 GPCGSFARGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSAD-------MIDASAGV-M 3102 GPC SFARGKTG+C +H LV DKRVHGGI+LG ++Q+P+S+ ++D + + M Sbjct: 537 GPCNSFARGKTGMCALHSGLVHDKRVHGGISLGSVVQNPHSSKTDELKQLLVDKNMDIDM 596 Query: 3103 SVDGMGFGPVGLSGEEWTASNVVRAQLSCME--KEPRTTPSPEGRVH 3237 G GP + ++ V A +S E P + PEGRVH Sbjct: 597 MKIGSSLGPAA-TCSDFKHYEAVTAHVSAKEGGHLPMSVAVPEGRVH 642 >ref|XP_004298851.1| PREDICTED: uncharacterized protein LOC101298314 [Fragaria vesca subsp. vesca] Length = 674 Score = 696 bits (1797), Expect = 0.0 Identities = 360/640 (56%), Positives = 425/640 (66%), Gaps = 18/640 (2%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF ++N SN FK +G+S+ + G+ + DT LRL G + +Y P KG KRKW D Sbjct: 8 LGFAANNSSNVFKILGNSMPIRGTGGDFRTDTTLRLNSPGSSLSYMPGPKGTKRKWSVVD 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G++ +V AT CT +SS KET+ ESSMDL+LDF+LHLG++ Sbjct: 68 GSLNQHVGSSLSLGIGHSPSSSDSKGSSATVCTTMSSAKETDEESSMDLELDFSLHLGNE 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGTT 1908 K PS K+PA S + Q +DLELSLST ES ITS+ + G+ P V+G Sbjct: 128 KVPSLKRPASSVLTALDIQPKVDLELSLSTVLSESGITSVDRSSTSVHSGMEMPQVIGA- 186 Query: 1909 QFVDNETASSHWKVD-NTSPV--SLFPKA-----------GESFVLKSIPRKVDLVSVIS 2046 Q D + S WK T P SL P A G F+ K +P+K+D + + Sbjct: 187 QNADEVSNSCLWKSGVATKPFQNSLNPGAAAKPFQTSVNPGAHFLFKPVPKKLDSIPTVL 246 Query: 2047 DVSASTLSVPKSSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRR 2226 D+S+S L+ P SS CTS K CQ GCGKGARGASG CI+HGGGRR Sbjct: 247 DISSSILTTPTSSA-CTSAITQRQQPHHRSSNSKTCQVEGCGKGARGASGRCISHGGGRR 305 Query: 2227 CQRPGCNKGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAA 2406 CQR GC+KGAEGRT CKAHGGGRRCE+LGCTKSAEGRTDFCIA +RAA Sbjct: 306 CQRAGCHKGAEGRTVYCKAHGGGRRCEYLGCTKSAEGRTDFCIAHGGGRRCSHDGCTRAA 365 Query: 2407 RGKSGLCIRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAH 2586 RGKSGLCIRHGGGKRCQRE CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAH Sbjct: 366 RGKSGLCIRHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQAIGCTKGAQGSTMFCKAH 425 Query: 2587 GGGKRCTVEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCA 2757 GGGKRCT GCTKGAEGSTP+CKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA Sbjct: 426 GGGKRCTAPGCTKGAEGSTPYCKGHGGGKRCAFQGGGHCTKSVHGGTNFCVAHGGGKRCA 485 Query: 2758 ITGCTKSARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGN 2937 + CTKSARGRTD+CVRHGGGKRCK+ GCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Sbjct: 486 MPECTKSARGRTDYCVRHGGGKRCKYSGCGKSAQGSTDFCKAHGGGKRCSWGHPGSVYGR 545 Query: 2938 QADGPCGSFARGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDASAGVMSVDGM 3117 +A GPC SFARGKTGLC +H LVQDKRVHGG+TLG ++Q+P ++ + D M Sbjct: 546 EAAGPCNSFARGKTGLCALHSGLVQDKRVHGGVTLGTIVQEPKFGKLVREVDEMNVDDAM 605 Query: 3118 GFGPVGLSGEEWTASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 G + + + S + S + + PEGRVH Sbjct: 606 TNGSIATTA-SGSCSGLKNGGSSVGQP---SVIVPEGRVH 641 >ref|XP_006385780.1| hypothetical protein POPTR_0003s13400g [Populus trichocarpa] gi|566162467|ref|XP_002304530.2| hypothetical protein POPTR_0003s13400g [Populus trichocarpa] gi|550343095|gb|ERP63577.1| hypothetical protein POPTR_0003s13400g [Populus trichocarpa] gi|550343096|gb|EEE79509.2| hypothetical protein POPTR_0003s13400g [Populus trichocarpa] Length = 681 Score = 696 bits (1796), Expect = 0.0 Identities = 351/569 (61%), Positives = 396/569 (69%), Gaps = 5/569 (0%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF +D SNAFK +G SL V + + ADT LRL G + +SKG+KRKW + Sbjct: 8 LGFAADYPSNAFKILGSSLSVGAAGTKYSADTVLRLDSPGSSVPSGFSSKGIKRKWNLIN 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G+MG V ATACT++SS ET+ ESSMDLD F+LHLG + Sbjct: 68 GSMGQNVGSSLSLGLGRSSSSSDSKGSSATACTSMSSAIETDEESSMDLD--FSLHLGHE 125 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGTT 1908 K SPKKPA S K G + +DLEL LST ESD+TSI + + PL +G Sbjct: 126 KMLSPKKPARSYLK--GMEPKVDLELGLSTGPSESDVTSIHPPSSSLEFAFDMPLAMGGA 183 Query: 1909 QFVDNETASSHWKVDNTS-PVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKSS 2085 V+ + S WK T P+ + SF IPR D D S+S ++ PKSS Sbjct: 184 SNVNEGSTSCSWKSGITLLPLQISSNKEASFFFNQIPRTRDPTPSFPDHSSSVITTPKSS 243 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 VTCTSG KLCQ GCGKGARGASG CI+HGGGRRC++ GC KGAEGR Sbjct: 244 VTCTSGISQQQQPYQRGTSLKLCQVEGCGKGARGASGRCISHGGGRRCRKAGCLKGAEGR 303 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGKSGLCIRHGGG Sbjct: 304 TVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGKSGLCIRHGGG 363 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQ+E CT+SAEGLSGLCISHGGGRRCQ+ C+KGAQGSTMFCKAHGGGKRCT GC+K Sbjct: 364 KRCQKENCTKSAEGLSGLCISHGGGRRCQFLGCTKGAQGSTMFCKAHGGGKRCTAPGCSK 423 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA CTKSARGRT Sbjct: 424 GAEGSTPFCKGHGGGKRCAFQGGGVCTKSVHGGTNFCVAHGGGKRCAAPECTKSARGRTQ 483 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 FCVRHGGGKRCKFEGCGKSAQGSTD+CKAHGGGKRC+WG GS YGNQ GPC SFARGK Sbjct: 484 FCVRHGGGKRCKFEGCGKSAQGSTDYCKAHGGGKRCSWGHSGSEYGNQPTGPCNSFARGK 543 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDP 3063 TGLC +H LV DKRVHGG+TLGP++QDP Sbjct: 544 TGLCALHSGLVLDKRVHGGVTLGPMVQDP 572 >ref|XP_004143880.1| PREDICTED: uncharacterized protein LOC101212238 isoform 1 [Cucumis sativus] gi|449452268|ref|XP_004143881.1| PREDICTED: uncharacterized protein LOC101212238 isoform 2 [Cucumis sativus] gi|449452270|ref|XP_004143882.1| PREDICTED: uncharacterized protein LOC101212238 isoform 3 [Cucumis sativus] Length = 670 Score = 685 bits (1767), Expect = 0.0 Identities = 353/630 (56%), Positives = 418/630 (66%), Gaps = 8/630 (1%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 + F ++ N FK +G S + ADT LRL G + S G+KRKW + Sbjct: 8 LNFAANYSLNVFKILGKSFQDGKTGAEDSADTILRLDSTGSSVPCGSISNGMKRKWSLVE 67 Query: 1552 GNMG-LYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGS 1728 +MG V TACT +SS KET+ ESSM LDLDF+L+LGS Sbjct: 68 KSMGGQSVGSSLSLGFVHSSSSSDSKGSSGTACTRVSSAKETDEESSMALDLDFSLNLGS 127 Query: 1729 DKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGT 1905 D+ SPK+PA K + +DLELSLST ESD+TSI+ G + Q + PL Sbjct: 128 DRVASPKEPASKPLKVQKVKPKVDLELSLSTGPSESDVTSIYQGFPSLQLSMEKPLTFVE 187 Query: 1906 TQFVDNETASSHWKVDNTSPV---SLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVP 2076 T D+ S WK PV SL P+ G ++ + + + + D+S+S L++P Sbjct: 188 TSNTDDGETSCCWKPGTAQPVVPTSLNPQVG--YIFPPVTEIMIPPANVPDLSSSVLTMP 245 Query: 2077 KSSVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGA 2256 KSSVTCTSG + K+CQ GCGKGARGASG CI+HGGGRRCQ+ GC+KGA Sbjct: 246 KSSVTCTSG-ITQQQRFNRSSNSKICQVEGCGKGARGASGRCISHGGGRRCQKLGCHKGA 304 Query: 2257 EGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRH 2436 EGRT CKAHGGGRRC+ LGCTKSAEGRTD+CIA +RAARGKSGLCIRH Sbjct: 305 EGRTVYCKAHGGGRRCQHLGCTKSAEGRTDYCIAHGGGRRCNREGCTRAARGKSGLCIRH 364 Query: 2437 GGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEG 2616 GGGKRCQ+E CT+SAEGLSGLCISHGGGRRCQ P C+KGAQGSTM+CKAHGGGKRCT G Sbjct: 365 GGGKRCQKENCTKSAEGLSGLCISHGGGRRCQTPGCTKGAQGSTMYCKAHGGGKRCTAPG 424 Query: 2617 CTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARG 2787 CTKGAEGSTPFCKGHGGGKRC F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARG Sbjct: 425 CTKGAEGSTPFCKGHGGGKRCGFQGGGICTKSVHGGTNFCVAHGGGKRCAVPECTKSARG 484 Query: 2788 RTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFA 2967 RTD+CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG PGS YG Q PC SFA Sbjct: 485 RTDYCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPGSEYGTQPCCPCNSFA 544 Query: 2968 RGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDASAGVMSVDGMGFGPVGLSGE 3147 RGK GLC +H LVQDKRVHGG+++GP++QDPN + G++ D M + + G+ Sbjct: 545 RGKMGLCALHSGLVQDKRVHGGVSIGPIIQDPN-LSKTEKMKGIVGEDYMNEDLIKVGGK 603 Query: 3148 EWTASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 N+ S +K + +PEGRVH Sbjct: 604 --VGPNLEHFAGSEADKPSTSVLAPEGRVH 631 >ref|XP_006343911.1| PREDICTED: cell wall protein AWA1-like isoform X1 [Solanum tuberosum] gi|565354012|ref|XP_006343912.1| PREDICTED: cell wall protein AWA1-like isoform X2 [Solanum tuberosum] Length = 679 Score = 682 bits (1759), Expect = 0.0 Identities = 349/581 (60%), Positives = 402/581 (69%), Gaps = 11/581 (1%) Frame = +1 Query: 1366 RVMGFCSDNLSNAFKNVGDSLHV--VGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKW 1539 R +GF + +AFKN+G S+ V G ++ ADT LRL IG + P KG+KRKW Sbjct: 6 RYVGFTVNPQLSAFKNLGKSIAVGEAGVGGSYCADTTLRLDSIGSSVPSIPAPKGIKRKW 65 Query: 1540 GCRDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLH 1719 G+ + ACT++SS +E + ESSMDLDLDF+LH Sbjct: 66 SSIGGSNDQPIGSSLSLRLGHSSSSSDSKGSSGAACTSMSSARENDEESSMDLDLDFSLH 125 Query: 1720 LGSDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGV--TPL 1893 LGS+K+ S +K A K++ L +DLELSLS+ ESD+T++ L S + P Sbjct: 126 LGSEKTSSSRKSAHPEAKKLAKGLAVDLELSLSSGAAESDVTTVHL-LSTSPQSIMKAPQ 184 Query: 1894 VVGTTQFVDNETASSHWKVDNTSPVSLFPKAGE-SFVLKSIPRKVDLVSVISDVSASTLS 2070 + D + + HWK N P+ E S++L ++ L +V D+S+S ++ Sbjct: 185 AMAGAFHTDEVSTAIHWKTSNIFHPLRTPQETEASYLLNQAAMQIKLATVSPDLSSSIIT 244 Query: 2071 VPKSSVTCTSGFLXXXXXXXXXXXX-KLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCN 2247 KSSVTCTSG KLCQ GC KGARGASGLCIAHGGGRRCQ+ GC+ Sbjct: 245 NSKSSVTCTSGLTNQQQQQQQRSSSTKLCQFKGCVKGARGASGLCIAHGGGRRCQKSGCH 304 Query: 2248 KGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLC 2427 KGAEGRTA CKAHGGGRRCEFLGCTKSAEGRTDFCIA SRAARGKSGLC Sbjct: 305 KGAEGRTAFCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSQEGCSRAARGKSGLC 364 Query: 2428 IRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCT 2607 IRHGGGKRCQ E CT+SAEGLSGLCISHGGGRRCQYP C+KGAQGSTMFCKAHGGGKRCT Sbjct: 365 IRHGGGKRCQHEGCTKSAEGLSGLCISHGGGRRCQYPQCTKGAQGSTMFCKAHGGGKRCT 424 Query: 2608 VEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKS 2778 EGC KGAEGST FCKGHGGGKRCSF G CPKSVHGGT FCVAHGGGKRCA++ CT+S Sbjct: 425 FEGCNKGAEGSTAFCKGHGGGKRCSFQGNGLCPKSVHGGTLFCVAHGGGKRCAVSECTRS 484 Query: 2779 ARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCG 2958 ARGRTDFCVRHGGGKRCK EGCGKSAQGSTDFCKAHGGGKRC+WGQP S +G Q DG C Sbjct: 485 ARGRTDFCVRHGGGKRCKVEGCGKSAQGSTDFCKAHGGGKRCSWGQPDSEFG-QGDGLCN 543 Query: 2959 SFARGKTGLCTMHGALVQDKRVHGGITLGPLLQD--PNSAD 3075 SFARGKTGLC HGALVQDKRVHGG TLG ++ D PN ++ Sbjct: 544 SFARGKTGLCASHGALVQDKRVHGGATLGTMVLDLEPNQSE 584 >ref|XP_004503653.1| PREDICTED: uncharacterized protein LOC101508470 isoform X1 [Cicer arietinum] gi|502139125|ref|XP_004503654.1| PREDICTED: uncharacterized protein LOC101508470 isoform X2 [Cicer arietinum] gi|502139128|ref|XP_004503655.1| PREDICTED: uncharacterized protein LOC101508470 isoform X3 [Cicer arietinum] Length = 665 Score = 677 bits (1746), Expect = 0.0 Identities = 358/635 (56%), Positives = 426/635 (67%), Gaps = 13/635 (2%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINH-GADTALRLQPIGPN-DAYFPTSKGVKRKWGC 1545 +GF N +NAFK +G+S+ V GS+ ++ G DT LRL G + ++ +S+G KRKW Sbjct: 8 LGFPVHNSANAFKILGNSMQVEGSASDYYGTDTVLRLDSPGFSIPSHKASSRGTKRKWDL 67 Query: 1546 RDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLG 1725 DG MG V A ACTA+SS K+ + ESSMD++LDFTL+LG Sbjct: 68 IDGCMGQRVGSSLSLGLGLSTSSSDSKGSSAVACTAMSSGKDIDEESSMDIELDFTLNLG 127 Query: 1726 SDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVTPL-VVG 1902 +K S KK SN K + Q DLELSLST ESDITS+ Q + V Sbjct: 128 CEKVHSLKKSFDSNMKTLELQPKFDLELSLSTGPCESDITSVHLNRSLPQLNMEMASVFS 187 Query: 1903 TTQFVDNETASSHWKVDNTSPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKS 2082 TQ D + S WK P SL +A S + P+++D ++ D+S++ Sbjct: 188 GTQNTDEGSTSCSWKPGLVLP-SLNTEA--SILFNQAPKQLDHSPIVLDLSSTR------ 238 Query: 2083 SVTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEG 2262 SVTCTSG K+CQ GCGKGARGASG CI+HGGGRRCQ+PGC+KGAEG Sbjct: 239 SVTCTSGITHQHQPPHRHGNSKICQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEG 298 Query: 2263 RTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGG 2442 RT CKAHGGGRRCE+LGCTKSAEGRTDFCIA SRAARGKSGLCIRHGG Sbjct: 299 RTVYCKAHGGGRRCEYLGCTKSAEGRTDFCIAHGGGRRCSHDGCSRAARGKSGLCIRHGG 358 Query: 2443 GKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCT 2622 GKRCQ+E CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT GCT Sbjct: 359 GKRCQKENCTKSAEGLSGLCISHGGGRRCQVSGCTKGAQGSTMFCKAHGGGKRCTAPGCT 418 Query: 2623 KGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRT 2793 KGAEGSTPFCKGHGGGKRC++ G C KSVHGGT+FCVAHGGGKRCA+TGCTKSARGRT Sbjct: 419 KGAEGSTPFCKGHGGGKRCTYQGGGVCTKSVHGGTNFCVAHGGGKRCAVTGCTKSARGRT 478 Query: 2794 DFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARG 2973 D CVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRC+WG P S +G Q DGPC SFARG Sbjct: 479 DHCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCSWGHPESEHGIQPDGPCTSFARG 538 Query: 2974 KTGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDASAGVMSVDGMGFGPVGLSGEEW 3153 KTGLC +H LV D+RVHGG++LG +QDP + ++ + +M + +G + Sbjct: 539 KTGLCALHSGLVHDRRVHGGVSLGS-VQDPRCSKPVEVT--MMKIG----SSIGNTAAPR 591 Query: 3154 TASNVVRAQL--SCMEKE-----PRTTPSPEGRVH 3237 T S++ + ++ +C E P + PEGRVH Sbjct: 592 TCSDLKQHEVASACASVEEGGRFPMSVAVPEGRVH 626 >ref|XP_004245556.1| PREDICTED: uncharacterized protein LOC101268782 isoform 1 [Solanum lycopersicum] gi|460400067|ref|XP_004245557.1| PREDICTED: uncharacterized protein LOC101268782 isoform 2 [Solanum lycopersicum] Length = 681 Score = 666 bits (1718), Expect = 0.0 Identities = 348/611 (56%), Positives = 407/611 (66%), Gaps = 11/611 (1%) Frame = +1 Query: 1366 RVMGFCSDNLSNAFKNVGDSLHV--VGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKW 1539 R +GF + +AFKN+G S+ V G ++ ADT LRL I + P KG+KRKW Sbjct: 6 RYVGFTVNPQLSAFKNLGKSIAVGEAGVGGSYCADTTLRLDSICSSVPSIPAPKGIKRKW 65 Query: 1540 GCRDGNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLH 1719 + + ACT+I S E + ESSMDLDLDF+LH Sbjct: 66 SSLGVSNDQPIGSSLCLRLGHSSSSSDSKGSSGAACTSICSAIENDEESSMDLDLDFSLH 125 Query: 1720 LGSDKSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGV--TPL 1893 LGS+K+ S +K K++ L +DLELSLS+ ESD+T++ L S + P Sbjct: 126 LGSEKTSSSRKSTHPESKKLAKGLAVDLELSLSSGAAESDVTTVHL-LSTSPQSIMKAPQ 184 Query: 1894 VVGTTQFVDNETASSHWKV-DNTSPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLS 2070 + D + + HWK D P+ S++L + +V ++S+S ++ Sbjct: 185 EMTGAFHTDEVSTAVHWKPSDIFHPLRTSQGTEASYLLNQDATQFKQATVSPNLSSSIIT 244 Query: 2071 VPKSSVTCTSGFLXXXXXXXXXXXX-KLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCN 2247 KSSVTCTSG K CQ GC KGARGASGLCIAHGGGRRCQ+PGC+ Sbjct: 245 NSKSSVTCTSGLTNQQQQQQQRSSSTKQCQFKGCVKGARGASGLCIAHGGGRRCQKPGCH 304 Query: 2248 KGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLC 2427 KGAEGRTA CKAHGGGRRCEFLGCTKSAEGRTDFCIA SRAARGKSGLC Sbjct: 305 KGAEGRTAFCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCSRAARGKSGLC 364 Query: 2428 IRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCT 2607 IRHGGGKRCQ E CT+SAEGLSGLCISHGGGRRCQYP C+KGAQGSTMFCKAHGGGKRCT Sbjct: 365 IRHGGGKRCQHEGCTKSAEGLSGLCISHGGGRRCQYPQCTKGAQGSTMFCKAHGGGKRCT 424 Query: 2608 VEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKS 2778 EGC KGAEGST FCKGHGGGKRCSF G CPKSVHGGT FCVAHGGGKRCA+ CT+S Sbjct: 425 FEGCNKGAEGSTAFCKGHGGGKRCSFQGNGLCPKSVHGGTLFCVAHGGGKRCAVAECTRS 484 Query: 2779 ARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCG 2958 ARGRTDFCVRHGGGKRCK +GCGKSAQGSTDFCKAHGGGKRC+WGQP S +G Q DGPC Sbjct: 485 ARGRTDFCVRHGGGKRCKVDGCGKSAQGSTDFCKAHGGGKRCSWGQPDSEFG-QGDGPCN 543 Query: 2959 SFARGKTGLCTMHGALVQDKRVHGGITLGPLLQD--PNSADMIDASAGVMSVDGMGFGPV 3132 SFARGKTGLC HGAL+QDKRVHGG TLG ++ D PN ++ I +++V+ + F Sbjct: 544 SFARGKTGLCASHGALIQDKRVHGGATLGTIVLDLAPNQSEKIKE---IVNVEDICFDVT 600 Query: 3133 GLSGEEWTASN 3165 + T+SN Sbjct: 601 KMQSIGMTSSN 611 >gb|EOY24891.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 657 Score = 647 bits (1670), Expect = 0.0 Identities = 343/636 (53%), Positives = 397/636 (62%), Gaps = 14/636 (2%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 +GF ++ SNAFK +G S+ V G+ + +G DT LRL G + Y TSKG KRKW D Sbjct: 8 LGFAANFSSNAFKILGGSMQVGGTGVAYGTDTVLRLDSPGSSIPYMSTSKGTKRKWSLMD 67 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G++ V TACT SS KE + ESSMD++LDFTLHLG++ Sbjct: 68 GSVSEQVGSSLSLGLGRSSSSSDSKGSSTTACTTTSSAKEADEESSMDIELDFTLHLGNE 127 Query: 1732 KSPSPKKPALSNFKRVGSQLNLDLELSLSTSQPESDITSIFGGLEASQHGVT-PLVVGTT 1908 K + KK A N K + Q +DL LSLST ESDITS+ Q G+ P+ V Sbjct: 128 KVNNLKKHASPNLKGLELQPKVDLGLSLSTGPSESDITSVHLSSSPIQSGMEMPIAVDGA 187 Query: 1909 QFVDNETASSHWKVDNT-SPVSLFPKAGESFVLKSIPRKVDLVSVISDVSASTLSVPKSS 2085 D + S WK P+ P S K +PR +DL ++ D+S+S ++ PKSS Sbjct: 188 PNADEGSTSCCWKPRMALPPLQSLPGKQTSIFFKEVPRSIDLSPIVPDLSSSVITTPKSS 247 Query: 2086 VTCTSGFLXXXXXXXXXXXXKLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGAEGR 2265 VTCTSG K CQ GCGKGARGASG CI+HGGGRRCQ+PGC+KGAEGR Sbjct: 248 VTCTSGITRQQQPQQRSSSSKTCQVEGCGKGARGASGRCISHGGGRRCQKPGCHKGAEGR 307 Query: 2266 TALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRHGGG 2445 T CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARGKSGLCIRHGGG Sbjct: 308 TVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEGCTRAARGKSGLCIRHGGG 367 Query: 2446 KRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEGCTK 2625 KRCQ+E CT+SAEGLSGLCISHGGGRRCQ+ C+KGAQGSTMFCKAHGGGKRCT CTK Sbjct: 368 KRCQKENCTKSAEGLSGLCISHGGGRRCQFLGCTKGAQGSTMFCKAHGGGKRCTYPDCTK 427 Query: 2626 GAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARGRTD 2796 GAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRC GC KSA+G TD Sbjct: 428 GAEGSTPFCKGHGGGKRCAFQGGGVCTKSVHGGTNFCVAHGGGKRCKFEGCGKSAQGSTD 487 Query: 2797 FCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGYGNQADGPCGSFARGK 2976 FC HGGGKR C+WG PGS YGNQ GPC SFARGK Sbjct: 488 FCKAHGGGKR-------------------------CSWGHPGSEYGNQLSGPCNSFARGK 522 Query: 2977 TGLCTMHGALVQDKRVHGGITLGPLLQDP---NSADMID-ASAGVMSVDGMGFG-----P 3129 TGLC +H LVQDKRVHGG TLGP++QDP S M + +A M+VD M G Sbjct: 523 TGLCALHSGLVQDKRVHGGATLGPIVQDPKVSKSEKMKEIVTAEDMNVDIMKMGSDMEAS 582 Query: 3130 VGLSGEEWTASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 G + V A +S E+ + PEGRVH Sbjct: 583 AGRTCSSLNQYGVPNAHISVGER-GFSVFVPEGRVH 617 >ref|XP_006391666.1| hypothetical protein EUTSA_v10023333mg [Eutrema salsugineum] gi|557088172|gb|ESQ28952.1| hypothetical protein EUTSA_v10023333mg [Eutrema salsugineum] Length = 654 Score = 643 bits (1658), Expect = 0.0 Identities = 349/633 (55%), Positives = 407/633 (64%), Gaps = 12/633 (1%) Frame = +1 Query: 1375 GFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRDG 1554 GF ++ SN+FK +G L V + GADT LRL + + +KG+KRKW DG Sbjct: 9 GFAGNSSSNSFKILGRPLQVEVPEVEFGADTTLRLDSLA---SPLSNTKGIKRKWSLIDG 65 Query: 1555 NMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSDK 1734 G DP ATACT++SS KE E ESSMD++LDF+LHLGS+K Sbjct: 66 AKGQEADPLLSLRLGHSSSSSDSKGSSATACTSLSSAKEIEEESSMDIELDFSLHLGSEK 125 Query: 1735 SPSPKKPALSNFKRVGS-QLNLDLELSLSTSQP-ESDITSIFGGLEASQHGVTPLVVGTT 1908 S KKPA S K + DLELSLS +S+IT++ Q P + Sbjct: 126 PFSNKKPANSKMKDLQVLSPRFDLELSLSGGVSCQSEITAV-----QQQANQFPTLAEML 180 Query: 1909 QFVDNETASSHWKVDNTSPVSLFPKAGE-SFVLKSIPRKVDL-VSVISDVSASTLSV-PK 2079 + + + S W+ SP + E S L P+ + + S + D+S+ST + P Sbjct: 181 RATNEGSTSCGWRPGFASPTLQASSSKEKSSFLAHAPKNIIIPASHVLDLSSSTATTTPI 240 Query: 2080 SSVTCTSGFLXXXXXXXXXXXX-KLCQSVGCGKGARGASGLCIAHGGGRRCQRPGCNKGA 2256 SS TCTSG KLCQ GC KGARGASG CI+HGGGRRCQ+ GC+KGA Sbjct: 241 SSGTCTSGLTQQLKPQHKSSSSSKLCQIEGCQKGARGASGRCISHGGGRRCQKHGCHKGA 300 Query: 2257 EGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSGLCIRH 2436 EGRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARG+SGLCIRH Sbjct: 301 EGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEDCTRAARGRSGLCIRH 360 Query: 2437 GGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKRCTVEG 2616 GGGKRCQRE CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKRCT G Sbjct: 361 GGGKRCQRENCTKSAEGLSGLCISHGGGRRCQANGCTKGAQGSTMFCKAHGGGKRCTHAG 420 Query: 2617 CTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCTKSARG 2787 CTKGAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CTKSARG Sbjct: 421 CTKGAEGSTPFCKGHGGGKRCAFQGGDPCSKSVHGGTNFCVAHGGGKRCAVPECTKSARG 480 Query: 2788 RTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGY-GNQADGPCGSF 2964 RTDFCVRHGGGKRCK EGCGKSAQGSTDFCKAHGGGKRCAWG P + Y G + GPC SF Sbjct: 481 RTDFCVRHGGGKRCKSEGCGKSAQGSTDFCKAHGGGKRCAWGHPETEYAGQSSSGPCTSF 540 Query: 2965 ARGKTGLCTMHGALVQDKRVHGGITLGPLLQD--PNSADMIDASAGVMSVDGMGFGPVGL 3138 ARGKTGLC +H +LVQD RVHGG+T+ Q+ P+S++ + S M Sbjct: 541 ARGKTGLCALHNSLVQDNRVHGGMTVTSESQEPRPSSSETENESQEFSGSQDMNID---- 596 Query: 3139 SGEEWTASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 S + +A + E E +PEGRVH Sbjct: 597 SMKARSAVGSPETDIDLNEYEAGLGLAPEGRVH 629 >ref|NP_176596.2| uncharacterized protein [Arabidopsis thaliana] gi|6692098|gb|AAF24563.1|AC007764_5 F22C12.10 [Arabidopsis thaliana] gi|332196079|gb|AEE34200.1| uncharacterized protein AT1G64140 [Arabidopsis thaliana] Length = 646 Score = 620 bits (1600), Expect = e-174 Identities = 344/637 (54%), Positives = 413/637 (64%), Gaps = 15/637 (2%) Frame = +1 Query: 1372 MGFCSDNLSNAFKNVGDSLHVVGSSINHGADTALRLQPIGPNDAYFPTSKGVKRKWGCRD 1551 + F ++ SN++K +G SL V + ADT LRL + + +KG+KRKW D Sbjct: 8 IAFAGNSSSNSYKILGRSLQV---EVPEAADTTLRLDSLA---SPLSNAKGIKRKWNLID 61 Query: 1552 GNMGLYVDPXXXXXXXXXXXXXXXXXXXATACTAISSVKETEVESSMDLDLDFTLHLGSD 1731 G DP ATACT++SS +ETE SSMD++LDF+LHLG++ Sbjct: 62 G-----ADPLLSLRLGHSSSSSDSKGSSATACTSLSSARETEEASSMDIELDFSLHLGNE 116 Query: 1732 K-SPSPKKPALSNFKRVGSQL---NLDLELSLSTSQP-ESDITSIFGGLEASQHGVTPLV 1896 K + S KKPA N K G Q+ DLELSLS +S+IT++ QH Sbjct: 117 KPTASNKKPA--NLKMKGLQVPSPKFDLELSLSGGGSCQSEITAV------QQHANRFQS 168 Query: 1897 VGTTQFVDNE-TASSHWKVDNTSPVSLFPKAGE-SFVLKSIPRKVDLVSV-ISDVSASTL 2067 + +NE +A+ W+ P + E S L IP+ V + + + ++S++T Sbjct: 169 LADMLRANNEESATCGWRQGFGLPTLQASSSKETSSFLGHIPKNVIIPAAHVLELSSNTA 228 Query: 2068 SV-PKSSVTCTSGFLXXXXXXXXXXXX-KLCQSVGCGKGARGASGLCIAHGGGRRCQRPG 2241 + P SS TCTSG KLCQ GC KGARGASG CI+HGGGRRCQ+ G Sbjct: 229 ATTPISSGTCTSGLSQQLKPQLKNSSSSKLCQVEGCHKGARGASGRCISHGGGRRCQKHG 288 Query: 2242 CNKGAEGRTALCKAHGGGRRCEFLGCTKSAEGRTDFCIAXXXXXXXXXXXXSRAARGKSG 2421 C+KGAEGRT CKAHGGGRRCEFLGCTKSAEGRTDFCIA +RAARG+SG Sbjct: 289 CHKGAEGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHEDCTRAARGRSG 348 Query: 2422 LCIRHGGGKRCQRETCTRSAEGLSGLCISHGGGRRCQYPACSKGAQGSTMFCKAHGGGKR 2601 LCIRHGGGKRCQRE CT+SAEGLSGLCISHGGGRRCQ C+KGAQGSTMFCKAHGGGKR Sbjct: 349 LCIRHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQSNGCTKGAQGSTMFCKAHGGGKR 408 Query: 2602 CTVEGCTKGAEGSTPFCKGHGGGKRCSFPG---CPKSVHGGTSFCVAHGGGKRCAITGCT 2772 CT GCTKGAEGSTPFCKGHGGGKRC+F G C KSVHGGT+FCVAHGGGKRCA+ CT Sbjct: 409 CTHSGCTKGAEGSTPFCKGHGGGKRCAFQGDDPCSKSVHGGTNFCVAHGGGKRCAVPECT 468 Query: 2773 KSARGRTDFCVRHGGGKRCKFEGCGKSAQGSTDFCKAHGGGKRCAWGQPGSGY-GNQADG 2949 KSARGRTDFCVRHGGGKRC+ EGCGKSAQGSTDFCKAHGGGKRCAWGQP + Y G + G Sbjct: 469 KSARGRTDFCVRHGGGKRCQSEGCGKSAQGSTDFCKAHGGGKRCAWGQPETEYAGQSSSG 528 Query: 2950 PCGSFARGKTGLCTMHGALVQDKRVHGGITLGPLLQDPNSADMIDASAGVMSVDGMGFGP 3129 PC SFARGKTGLC +H +LVQD RVHGG+T+ Q+P + +S + G Sbjct: 529 PCTSFARGKTGLCALHNSLVQDNRVHGGMTITSESQEPR----VSSSETENEEEFSGSQD 584 Query: 3130 VGL-SGEEWTASNVVRAQLSCMEKEPRTTPSPEGRVH 3237 + + + + +A+ + E E +PEGRVH Sbjct: 585 MNMDTMKARSATGSPETDVDLNEYEAGLGLAPEGRVH 621