BLASTX nr result
ID: Papaver29_contig00004162
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Papaver29_contig00004162 (1997 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 ... 799 0.0 ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 ... 791 0.0 ref|XP_004508117.1| PREDICTED: CAS1 domain-containing protein 1 ... 784 0.0 ref|XP_004508116.1| PREDICTED: CAS1 domain-containing protein 1 ... 784 0.0 ref|XP_009411616.1| PREDICTED: CAS1 domain-containing protein 1-... 779 0.0 emb|CDO98736.1| unnamed protein product [Coffea canephora] 775 0.0 ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-... 773 0.0 ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-... 773 0.0 ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phas... 773 0.0 ref|XP_003550779.1| PREDICTED: CAS1 domain-containing protein 1-... 773 0.0 ref|XP_007035865.1| O-acetyltransferase family protein isoform 2... 772 0.0 ref|XP_007035864.1| O-acetyltransferase family protein isoform 1... 771 0.0 gb|KRH50688.1| hypothetical protein GLYMA_07G237000 [Glycine max] 771 0.0 ref|XP_003609827.2| O-acetyltransferase family protein [Medicago... 771 0.0 ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-... 771 0.0 gb|KRH50687.1| hypothetical protein GLYMA_07G237000 [Glycine max] 771 0.0 gb|KHN03266.1| CAS1 domain-containing protein 1 [Glycine soja] 771 0.0 ref|XP_006600388.1| PREDICTED: CAS1 domain-containing protein 1-... 771 0.0 ref|XP_006583994.1| PREDICTED: CAS1 domain-containing protein 1-... 771 0.0 ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 ... 770 0.0 >ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Nelumbo nucifera] Length = 547 Score = 799 bits (2063), Expect = 0.0 Identities = 385/543 (70%), Positives = 447/543 (82%), Gaps = 11/543 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M I++P+TPGQVS LLGIIP+ A +Y+E+ EY+K S +SS++GR SDIN Sbjct: 1 MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTS-ISSKVGRHSDINLVELGKETVK 59 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 D + AL+E NL ++ S SV S L RFFLMD SF ENRL LRAISEFGL + Y Sbjct: 60 EDDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYF 119 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 YICDRT+ FG S K+Y+RD+FLFLYFLLI+V+A+TSF IH DKSP +GKSILYLNRHQTE Sbjct: 120 YICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTE 179 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IRIFIAAYVWMTGFGNFSYYYVRKDFS+ RFAQMMW Sbjct: 180 EWKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMW 239 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF AF CIVLNN+YM YYICPMHT FTLMVYGAL ILNKYNEIGSVIALK+A+CF++ Sbjct: 240 RLNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMV 299 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWEVPGVF++VWSPF F LG S L EWH R+GLDRYIWIIGMIYAYY Sbjct: 300 VILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYY 359 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEE E +RRISIK+A A+V +GYLWFEYIYKLD + Y KYHPYTSWIP Sbjct: 360 HPTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIP 419 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYICLRNVTQ FR YSLTLFAWLGKITLETYISQ HIW R+G+ DG+P+ +LSLIPDY Sbjct: 420 ITVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDY 479 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I+VAIS+R+F+LTNTLK++F+PS D+KRLM+N++AA ISI LY+LS Sbjct: 480 PMLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVF 539 Query: 331 CKL 323 ++ Sbjct: 540 LQI 542 >ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 isoform X2 [Nelumbo nucifera] Length = 545 Score = 791 bits (2042), Expect = 0.0 Identities = 380/540 (70%), Positives = 442/540 (81%), Gaps = 8/540 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSS--------EIGRISDINDG 1763 M I++P+TPGQVS LLGIIP+ A +Y+E+ EY+K S S E+G+ + D Sbjct: 1 MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKVHSDINLVELGKETVKEDD 60 Query: 1762 RAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYIC 1583 + AL+E NL ++ S SV S L RFFLMD SF ENRL LRAISEFGL + Y YIC Sbjct: 61 KTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFYIC 120 Query: 1582 DRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEWK 1403 DRT+ FG S K+Y+RD+FLFLYFLLI+V+A+TSF IH DKSP +GKSILYLNRHQTEEWK Sbjct: 121 DRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEEWK 180 Query: 1402 GWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLN 1223 GWMQV+F+MYHYF A EIYN IRIFIAAYVWMTGFGNFSYYYVRKDFS+ RFAQMMWRLN Sbjct: 181 GWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWRLN 240 Query: 1222 FLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIIIL 1043 F AF CIVLNN+YM YYICPMHT FTLMVYGAL ILNKYNEIGSVIALK+A+CF+++IL Sbjct: 241 FFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVVIL 300 Query: 1042 VWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHPT 863 VWEVPGVF++VWSPF F LG S L EWH R+GLDRYIWIIGMIYAYYHPT Sbjct: 301 VWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYHPT 360 Query: 862 VEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPITV 683 VE+WMEKLEE E +RRISIK+A A+V +GYLWFEYIYKLD + Y KYHPYTSWIPITV Sbjct: 361 VERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPITV 420 Query: 682 YICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPML 503 YICLRNVTQ FR YSLTLFAWLGKITLETYISQ HIW R+G+ DG+P+ +LSLIPDYPML Sbjct: 421 YICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYPML 480 Query: 502 NFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICKL 323 NFMLTT+I+VAIS+R+F+LTNTLK++F+PS D+KRLM+N++AA ISI LY+LS ++ Sbjct: 481 NFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFLQI 540 >ref|XP_004508117.1| PREDICTED: CAS1 domain-containing protein 1 isoform X2 [Cicer arietinum] Length = 556 Score = 784 bits (2025), Expect = 0.0 Identities = 381/559 (68%), Positives = 447/559 (79%), Gaps = 15/559 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSS--------EIGRISDINDG 1763 M I +P+TPGQ+S LLGIIP+ A +Y+E EYRKNS S E+ I+ ++ Sbjct: 1 MHILSPVTPGQISFLLGIIPVILAWIYSEILEYRKNSVTSKAHSDINLVEVSSIAVKDED 60 Query: 1762 RAALLED-PNLPLSTINLDSPSV--ASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 R LLE LP S + S+ +S ++RF LMD +F +ENR TLRA+SEFGL L Y Sbjct: 61 REVLLEGGAQLPASPTGSKARSLTASSSVIRFLLMDENFLIENRSTLRAMSEFGLLLGYY 120 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD FG S KSY+RD+FLFLYFLLI+V+AITSFTIH DKSP +GKSILYLNRHQTE Sbjct: 121 YLCDRTDFFGSSKKSYNRDLFLFLYFLLIIVSAITSFTIHHDKSPFSGKSILYLNRHQTE 180 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYY+RKDFS+ARFAQMMW Sbjct: 181 EWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYIRKDFSMARFAQMMW 240 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNFL F C+VLNN YM YYICPMHT FTLMVYGAL ILNKYNEIGSVIA K+ +CFL+ Sbjct: 241 RLNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIAAKIIACFLV 300 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRS----LKEWHVRTGLDRYIWIIGMI 884 +ILVWE PGVFE +WSPF F+LG + P+ ++S L EWH R+GLDRYIWIIGMI Sbjct: 301 VILVWETPGVFEWLWSPFTFMLG----YTDPDPSKSQFSRLHEWHFRSGLDRYIWIIGMI 356 Query: 883 YAYYHPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYT 704 YAYYHPTVE+WMEKLEEAE KRRISIK A +S GYLWFEY+YKLD + Y KYHPYT Sbjct: 357 YAYYHPTVERWMEKLEEAEIKRRISIKAAVVLLSSVTGYLWFEYVYKLDKVTYNKYHPYT 416 Query: 703 SWIPITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSL 524 SWIPITVYICLRN+TQ FR Y+LTLFAWLGK+TLETYISQ HIW R+G+ DG+PKL+LSL Sbjct: 417 SWIPITVYICLRNITQSFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKLLLSL 476 Query: 523 IPDYPMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTL 344 IPDYPMLNFMLTT+I+VAISYRLFQLTNTLKTAF+PS DDKRL++N+V IS+ LY+L Sbjct: 477 IPDYPMLNFMLTTSIYVAISYRLFQLTNTLKTAFVPSKDDKRLINNLVIGTTISVVLYSL 536 Query: 343 SIAICKLSPTAYGMMRQDE 287 S ++ G M++++ Sbjct: 537 SFGFLRIPEMLLGKMKEEK 555 >ref|XP_004508116.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Cicer arietinum] Length = 557 Score = 784 bits (2024), Expect = 0.0 Identities = 384/562 (68%), Positives = 447/562 (79%), Gaps = 18/562 (3%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M I +P+TPGQ+S LLGIIP+ A +Y+E EYRKNS S R SDIN Sbjct: 1 MHILSPVTPGQISFLLGIIPVILAWIYSEILEYRKNSVTSK--ARHSDINLVEVSSIAVK 58 Query: 1768 -DGRAALLED-PNLPLSTINLDSPSV--ASQLLRFFLMDSSFFVENRLTLRAISEFGLYL 1601 + R LLE LP S + S+ +S ++RF LMD +F +ENR TLRA+SEFGL L Sbjct: 59 DEDREVLLEGGAQLPASPTGSKARSLTASSSVIRFLLMDENFLIENRSTLRAMSEFGLLL 118 Query: 1600 VYMYICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRH 1421 Y Y+CDRTD FG S KSY+RD+FLFLYFLLI+V+AITSFTIH DKSP +GKSILYLNRH Sbjct: 119 GYYYLCDRTDFFGSSKKSYNRDLFLFLYFLLIIVSAITSFTIHHDKSPFSGKSILYLNRH 178 Query: 1420 QTEEWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQ 1241 QTEEWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYY+RKDFS+ARFAQ Sbjct: 179 QTEEWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYIRKDFSMARFAQ 238 Query: 1240 MMWRLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASC 1061 MMWRLNFL F C+VLNN YM YYICPMHT FTLMVYGAL ILNKYNEIGSVIA K+ +C Sbjct: 239 MMWRLNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIAAKIIAC 298 Query: 1060 FLIIILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRS----LKEWHVRTGLDRYIWII 893 FL++ILVWE PGVFE +WSPF F+LG + P+ ++S L EWH R+GLDRYIWII Sbjct: 299 FLVVILVWETPGVFEWLWSPFTFMLG----YTDPDPSKSQFSRLHEWHFRSGLDRYIWII 354 Query: 892 GMIYAYYHPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYH 713 GMIYAYYHPTVE+WMEKLEEAE KRRISIK A +S GYLWFEY+YKLD + Y KYH Sbjct: 355 GMIYAYYHPTVERWMEKLEEAEIKRRISIKAAVVLLSSVTGYLWFEYVYKLDKVTYNKYH 414 Query: 712 PYTSWIPITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLV 533 PYTSWIPITVYICLRN+TQ FR Y+LTLFAWLGK+TLETYISQ HIW R+G+ DG+PKL+ Sbjct: 415 PYTSWIPITVYICLRNITQSFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKLL 474 Query: 532 LSLIPDYPMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISL 353 LSLIPDYPMLNFMLTT+I+VAISYRLFQLTNTLKTAF+PS DDKRL++N+V IS+ L Sbjct: 475 LSLIPDYPMLNFMLTTSIYVAISYRLFQLTNTLKTAFVPSKDDKRLINNLVIGTTISVVL 534 Query: 352 YTLSIAICKLSPTAYGMMRQDE 287 Y+LS ++ G M++++ Sbjct: 535 YSLSFGFLRIPEMLLGKMKEEK 556 >ref|XP_009411616.1| PREDICTED: CAS1 domain-containing protein 1-like [Musa acuminata subsp. malaccensis] Length = 546 Score = 779 bits (2011), Expect = 0.0 Identities = 375/540 (69%), Positives = 439/540 (81%), Gaps = 11/540 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 MEIS P+T GQVS +LGIIPL AA +Y+E+ +YRKNS + GR SD+N Sbjct: 1 MEISGPVTAGQVSFMLGIIPLLAAWVYSEFLQYRKNSA-PLKAGRNSDVNLVVLDKEANK 59 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 D +A LLE L ++ S S LRFFLMD +F +ENRL LRAISEFG YL+Y Sbjct: 60 EDDQAVLLES-GLQAASPKAYHLSTTSHFLRFFLMDEAFLLENRLILRAISEFGAYLLYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRT+ FG S K+YSRD+FLFLYFLLI+VA++TSF +HQDKSP +GKSILYLNRHQTE Sbjct: 119 YVCDRTNLFGESKKNYSRDLFLFLYFLLIVVASMTSFKVHQDKSPFSGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFNAKEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF AF CIVLNNDYM YYICPMHT FTLMVYGAL ILNKYNE+G+VIA+K+ +CFL+ Sbjct: 239 RLNFFVAFCCIVLNNDYMLYYICPMHTLFTLMVYGALGILNKYNELGAVIAIKVVACFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +IL+WEVPGVF++VWSPF FLLG S L EWH R+GLDRYIWI+GMIYAYY Sbjct: 299 VILIWEVPGVFDIVWSPFTFLLGYSDPDPSKPKFPRLHEWHFRSGLDRYIWIVGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE ++RISIK + +VSL GYLW+EYIYKLD + Y KYHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAETRKRISIKTSMVTVSLVAGYLWYEYIYKLDRVTYNKYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYI LRN TQ FR SLTLFAWLGKITLETYISQ HIW R+G+ DG+P+ +LSL+PDY Sbjct: 419 ITVYISLRNFTQPFRSCSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLVPDY 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTTAI+VA+S+RLF+LTNTLK AF+PS DDKRL HN++A A+S+ LY++S + Sbjct: 479 PMLNFMLTTAIYVAVSHRLFELTNTLKMAFVPSRDDKRLAHNVIAGIAVSVILYSVSFVL 538 >emb|CDO98736.1| unnamed protein product [Coffea canephora] Length = 544 Score = 775 bits (2002), Expect = 0.0 Identities = 375/540 (69%), Positives = 433/540 (80%), Gaps = 8/540 (1%) Frame = -2 Query: 1912 ISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSS--------EIGRISDINDGRA 1757 I P+TPGQVS LGI+P+FAA +YAE EY+K S S E+G + A Sbjct: 4 IYGPLTPGQVSFFLGIVPMFAAWIYAEILEYKKASVSKSRHSDITLVELGNGGVKEEDSA 63 Query: 1756 ALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYICDR 1577 LLE L ++ + S S ASQ+LRF +MD SF +ENRLTLRAISE G L+Y Y+CDR Sbjct: 64 VLLEGGGLQSASPRVRSSSAASQILRFLMMDESFLLENRLTLRAISELGALLIYFYVCDR 123 Query: 1576 TDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEWKGW 1397 T+ FG S KSY+RD+FLFLYFLLI+V+AITSF IHQDKSP +GKSI+YLNRHQTEEWKGW Sbjct: 124 TNIFGQSKKSYNRDLFLFLYFLLIIVSAITSFKIHQDKSPFSGKSIMYLNRHQTEEWKGW 183 Query: 1396 MQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLNFL 1217 MQV+F+MYHYF A EIYN IRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLNFL Sbjct: 184 MQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLNFL 243 Query: 1216 AAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIIILVW 1037 CI+L+N+Y YYICPMHT FTLMVYGAL ILNKYNE G+VIA K+ +CFL +IL+W Sbjct: 244 VLLCCIILDNNYTLYYICPMHTLFTLMVYGALGILNKYNESGTVIAAKIMTCFLAVILIW 303 Query: 1036 EVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHPTVE 857 E+PGVFEL+WSPF FLLG S P+ L EWH R+GLDRYIWIIGMIYAYYHPTVE Sbjct: 304 EIPGVFELIWSPFTFLLGYSDPSKPPQ--PRLHEWHFRSGLDRYIWIIGMIYAYYHPTVE 361 Query: 856 KWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPITVYI 677 +WMEKLEE E KRRISIK A +SL VGYLW EYIYKL I Y KYHPYTSWIPITVYI Sbjct: 362 RWMEKLEETEVKRRISIKTAVVIISLAVGYLWLEYIYKLPKITYNKYHPYTSWIPITVYI 421 Query: 676 CLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPMLNF 497 CLRNV+QYFR Y+LTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIP+YP+LNF Sbjct: 422 CLRNVSQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPEYPLLNF 481 Query: 496 MLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICKLSP 317 MLTT+I++A+SYRLF+LTN LK+ F+PS D+KRL HNIVAA I+ LY LS + ++ P Sbjct: 482 MLTTSIYIAVSYRLFELTNMLKSTFVPSKDNKRLGHNIVAAVVIASGLYMLSFVLLRIPP 541 >ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X2 [Gossypium raimondii] gi|763805945|gb|KJB72883.1| hypothetical protein B456_011G202400 [Gossypium raimondii] Length = 544 Score = 773 bits (1997), Expect = 0.0 Identities = 374/541 (69%), Positives = 433/541 (80%), Gaps = 9/541 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSS--------EIGRISDINDG 1763 M I PITPGQVS LG+ P+ +A +YAE+ +Y+KNS S EIG ++ + Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKAHSDVSLVEIGNVAVKEED 60 Query: 1762 RAALLEDPNLPLSTINL-DSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYI 1586 RA LLE L + S S S +L+F +MD +F +ENRLTLRAISEFG+ L Y YI Sbjct: 61 RAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYYYI 120 Query: 1585 CDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEW 1406 CDRTD F S KSY+RD+FLFLYFLLI+V+AITSF IH DKSP +GKSILYLNRHQTEEW Sbjct: 121 CDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTEEW 180 Query: 1405 KGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRL 1226 KGWMQV+F+MYHYF A EIYN IRIFIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMWRL Sbjct: 181 KGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRL 240 Query: 1225 NFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIII 1046 NFL F C+VLNN YM YYICPMHT FTLMVYGAL ILNKYNE GSVIALK+ +CFL++I Sbjct: 241 NFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLVVI 300 Query: 1045 LVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHP 866 LVWEVPGVFEL+WSPF F LG T L EWH R+GLDRYIWIIGMIYAYYHP Sbjct: 301 LVWEVPGVFELLWSPFTFFLG--YTDPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYYHP 358 Query: 865 TVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPIT 686 TVE+WMEKLEE E KRR+SIK+A A ++L VG+LWFE+IYKLD + Y KYHPYTSWIPIT Sbjct: 359 TVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIPIT 418 Query: 685 VYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPM 506 VYICLRNVTQ FR YSLTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPDYPM Sbjct: 419 VYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDYPM 478 Query: 505 LNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICK 326 LNFMLTT+I++AISYRLF LTN LK+AF+P+ D+KRL+HN++ +S +Y+LS + Sbjct: 479 LNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVFLR 538 Query: 325 L 323 + Sbjct: 539 I 539 >ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Gossypium raimondii] gi|763805944|gb|KJB72882.1| hypothetical protein B456_011G202400 [Gossypium raimondii] Length = 545 Score = 773 bits (1996), Expect = 0.0 Identities = 374/542 (69%), Positives = 433/542 (79%), Gaps = 10/542 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSS---------EIGRISDIND 1766 M I PITPGQVS LG+ P+ +A +YAE+ +Y+KNS S EIG ++ + Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKARHSDVSLVEIGNVAVKEE 60 Query: 1765 GRAALLEDPNLPLSTINL-DSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMY 1589 RA LLE L + S S S +L+F +MD +F +ENRLTLRAISEFG+ L Y Y Sbjct: 61 DRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYYY 120 Query: 1588 ICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEE 1409 ICDRTD F S KSY+RD+FLFLYFLLI+V+AITSF IH DKSP +GKSILYLNRHQTEE Sbjct: 121 ICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTEE 180 Query: 1408 WKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWR 1229 WKGWMQV+F+MYHYF A EIYN IRIFIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMWR Sbjct: 181 WKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 240 Query: 1228 LNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLII 1049 LNFL F C+VLNN YM YYICPMHT FTLMVYGAL ILNKYNE GSVIALK+ +CFL++ Sbjct: 241 LNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLVV 300 Query: 1048 ILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYH 869 ILVWEVPGVFEL+WSPF F LG T L EWH R+GLDRYIWIIGMIYAYYH Sbjct: 301 ILVWEVPGVFELLWSPFTFFLG--YTDPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYYH 358 Query: 868 PTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPI 689 PTVE+WMEKLEE E KRR+SIK+A A ++L VG+LWFE+IYKLD + Y KYHPYTSWIPI Sbjct: 359 PTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIPI 418 Query: 688 TVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYP 509 TVYICLRNVTQ FR YSLTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPDYP Sbjct: 419 TVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDYP 478 Query: 508 MLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAIC 329 MLNFMLTT+I++AISYRLF LTN LK+AF+P+ D+KRL+HN++ +S +Y+LS Sbjct: 479 MLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVFL 538 Query: 328 KL 323 ++ Sbjct: 539 RI 540 >ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris] gi|561027770|gb|ESW26410.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris] Length = 546 Score = 773 bits (1996), Expect = 0.0 Identities = 379/543 (69%), Positives = 433/543 (79%), Gaps = 11/543 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M I +P+TPGQVS LLGI P+ A +Y+E EYRKNS +SS+ G SDIN Sbjct: 1 MLILSPVTPGQVSFLLGITPVVVAWIYSEILEYRKNS-VSSKAGH-SDINLVEMGSDAVK 58 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 + +A LLE L + S + + ++RF LMD+ F +ENRLTLRA+SEFGL L Y Sbjct: 59 DEDKAVLLEGGALQSGSPRARSLTASPSIIRFLLMDNYFLLENRLTLRAMSEFGLLLAYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD F S KSY+RDIFLFLYFLLI+V+A+TSF IHQDKSP +GKSILYLNRHQTE Sbjct: 119 YLCDRTDFFASSKKSYNRDIFLFLYFLLIIVSAMTSFKIHQDKSPFSGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF F CIVLNN YM YYICPMHT FTLMVYGAL ILNKYNEIGSVIA+K+ +CFL+ Sbjct: 239 RLNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIAVKIIACFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWE+PGVFEL+WSPF F LG + L EWH R+GLDRYIWIIGMIYAYY Sbjct: 299 VILVWEIPGVFELLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD + Y YHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAEIKRRISIKATIVLICSLVGYLWFEHIYKLDKLTYNTYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPDY Sbjct: 419 ITVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I+VAIS RLF LTNTLK AF+PS DDKRL+HN++ A IS+ LY+LS Sbjct: 479 PMLNFMLTTSIYVAISCRLFDLTNTLKVAFVPSKDDKRLVHNLITATTISVVLYSLSFGF 538 Query: 331 CKL 323 +L Sbjct: 539 LRL 541 >ref|XP_003550779.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Glycine max] gi|947052948|gb|KRH02401.1| hypothetical protein GLYMA_17G036600 [Glycine max] gi|947052949|gb|KRH02402.1| hypothetical protein GLYMA_17G036600 [Glycine max] Length = 545 Score = 773 bits (1996), Expect = 0.0 Identities = 375/540 (69%), Positives = 434/540 (80%), Gaps = 8/540 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKN---SKLSSEIGRI---SDI--NDG 1763 M + +P+TPGQVS LLGIIP+ A +Y+E EYR N S+ S+I + SD+ ++ Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEMLEYRNNYVPSRAQSDINLVEIGSDVVKDED 60 Query: 1762 RAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYIC 1583 RAALLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Y+C Sbjct: 61 RAALLEGGALQSGSPKARSLTASPSIIRFLLMDEYFLLENRLTLRAMSEFGLILAYFYLC 120 Query: 1582 DRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEWK 1403 DRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTEEWK Sbjct: 121 DRTDFFASSKKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEEWK 180 Query: 1402 GWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLN 1223 GWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMWRLN Sbjct: 181 GWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 240 Query: 1222 FLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIIIL 1043 F F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ CFL++IL Sbjct: 241 FFVVFSCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIGCFLVVIL 300 Query: 1042 VWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHPT 863 VWE+PGVFE +WSPF F LG + L EWH R+GLDRYIWIIGMIYAYYHPT Sbjct: 301 VWEIPGVFEWLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 360 Query: 862 VEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPITV 683 VE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD + Y KYHPYTSWIPITV Sbjct: 361 VERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKVTYNKYHPYTSWIPITV 420 Query: 682 YICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPML 503 YICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+GI DG+PKL+LSLIPDYPML Sbjct: 421 YICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPDYPML 480 Query: 502 NFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICKL 323 NFMLTT+I+VAISYRLF LTNTLK AF+PS DDKRL+HN++ A IS+ LY+LS+ ++ Sbjct: 481 NFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRLVHNLITATTISVVLYSLSLGFLRV 540 >ref|XP_007035865.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao] gi|508714894|gb|EOY06791.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao] Length = 544 Score = 772 bits (1993), Expect = 0.0 Identities = 377/541 (69%), Positives = 433/541 (80%), Gaps = 9/541 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSS--------EIGRISDINDG 1763 M I PITPGQVS LGI P+ +A +YAE+ EY+KNS S EIG + D Sbjct: 1 MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESKAHSDVNLVEIGNGAVKEDD 60 Query: 1762 RAALLEDPNLPLSTINL-DSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYI 1586 RA LLE L ++ S S S + +F +MD +F VENRLTLRAISEFG L Y YI Sbjct: 61 RAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYYYI 120 Query: 1585 CDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEW 1406 CDRTD F + K+Y+RD+FLFLYFLLI+V+AITSF IH DKSP +GKSILYLNRHQTEEW Sbjct: 121 CDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTEEW 180 Query: 1405 KGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRL 1226 KGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMWRL Sbjct: 181 KGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRL 240 Query: 1225 NFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIII 1046 NFL F C++LNN Y+ YYICPMHT FTLMVYG L ILNKYNE GSVIA K+ +CFL++I Sbjct: 241 NFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLVVI 300 Query: 1045 LVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHP 866 LVWEVPGVFE++WSPF F LG T L EWH R+GLDRYIWIIGMIYAYYHP Sbjct: 301 LVWEVPGVFEILWSPFTFFLG--YTDPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYYHP 358 Query: 865 TVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPIT 686 TVE+WMEKLEEAE KRR+ IK+A A+++LT+GY WFEYIYKLD I Y KYHPYTSWIPIT Sbjct: 359 TVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIPIT 418 Query: 685 VYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPM 506 VYICLRNVTQ FR YSLTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPDYPM Sbjct: 419 VYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDYPM 478 Query: 505 LNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICK 326 LNFMLTT+I+VAISYRLF LTN LKTAF+P+ DDKRL++N++ A IS LY+LS A+ + Sbjct: 479 LNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFALLR 538 Query: 325 L 323 + Sbjct: 539 I 539 >ref|XP_007035864.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao] gi|508714893|gb|EOY06790.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao] Length = 545 Score = 771 bits (1992), Expect = 0.0 Identities = 378/544 (69%), Positives = 434/544 (79%), Gaps = 12/544 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M I PITPGQVS LGI P+ +A +YAE+ EY+KNS S R SD+N Sbjct: 1 MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESK--ARHSDVNLVEIGNGAVK 58 Query: 1768 -DGRAALLEDPNLPLSTINL-DSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVY 1595 D RA LLE L ++ S S S + +F +MD +F VENRLTLRAISEFG L Y Sbjct: 59 EDDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAY 118 Query: 1594 MYICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQT 1415 YICDRTD F + K+Y+RD+FLFLYFLLI+V+AITSF IH DKSP +GKSILYLNRHQT Sbjct: 119 YYICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQT 178 Query: 1414 EEWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMM 1235 EEWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMM Sbjct: 179 EEWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMM 238 Query: 1234 WRLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFL 1055 WRLNFL F C++LNN Y+ YYICPMHT FTLMVYG L ILNKYNE GSVIA K+ +CFL Sbjct: 239 WRLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFL 298 Query: 1054 IIILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAY 875 ++ILVWEVPGVFE++WSPF F LG T L EWH R+GLDRYIWIIGMIYAY Sbjct: 299 VVILVWEVPGVFEILWSPFTFFLG--YTDPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAY 356 Query: 874 YHPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWI 695 YHPTVE+WMEKLEEAE KRR+ IK+A A+++LT+GY WFEYIYKLD I Y KYHPYTSWI Sbjct: 357 YHPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWI 416 Query: 694 PITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPD 515 PITVYICLRNVTQ FR YSLTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPD Sbjct: 417 PITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPD 476 Query: 514 YPMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIA 335 YPMLNFMLTT+I+VAISYRLF LTN LKTAF+P+ DDKRL++N++ A IS LY+LS A Sbjct: 477 YPMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFA 536 Query: 334 ICKL 323 + ++ Sbjct: 537 LLRI 540 >gb|KRH50688.1| hypothetical protein GLYMA_07G237000 [Glycine max] Length = 546 Score = 771 bits (1991), Expect = 0.0 Identities = 376/543 (69%), Positives = 431/543 (79%), Gaps = 11/543 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M + +P+TPGQVS LLGIIP+ A +Y+E EYRKNS S R SDIN Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSR--ARQSDINLVEMGSDVVK 58 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 + RA LLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Sbjct: 59 DEDRAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTE Sbjct: 119 YLCDRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ +CFL+ Sbjct: 239 RLNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWE+PGVFE VWSPF F LG + L EWH R+GLDRYIWIIGMIYAYY Sbjct: 299 VILVWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD I Y KYHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPD+ Sbjct: 419 ITVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDF 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I+VAISYRLF LTNTLK AF+PS DDKR +HN++ A IS+ LY+LS+ Sbjct: 479 PMLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGF 538 Query: 331 CKL 323 ++ Sbjct: 539 LRV 541 >ref|XP_003609827.2| O-acetyltransferase family protein [Medicago truncatula] gi|657390978|gb|AES92024.2| O-acetyltransferase family protein [Medicago truncatula] Length = 548 Score = 771 bits (1991), Expect = 0.0 Identities = 376/541 (69%), Positives = 439/541 (81%), Gaps = 15/541 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLS----SEIGRI---SDI--ND 1766 M I +P+TPGQVS LLG+ P+ A +Y+E E+RKNS S S+IG + +D+ ++ Sbjct: 1 MHILSPVTPGQVSFLLGLFPVIIAWIYSEILEFRKNSLTSKARHSDIGLVEVRTDVVKDE 60 Query: 1765 GRAALLEDPNL-PLS-TINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 LLE L P S T S + ++ ++RFF +D F ENRLTLRA+SEFGL L Y Sbjct: 61 ETTVLLEGGALQPASPTPKARSFTASTSIIRFFFLDEHFLHENRLTLRAMSEFGLLLAYY 120 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD FG S KSY+RD+F+FLYFLLI+V+AITSFTIH DKSP +GKSILYLNRHQTE Sbjct: 121 YLCDRTDFFGSSKKSYNRDLFIFLYFLLIIVSAITSFTIHHDKSPFSGKSILYLNRHQTE 180 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYY+RKDFS+ARFAQMMW Sbjct: 181 EWKGWMQVLFLMYHYFAASEIYNSIRLFIAAYVWMTGFGNFSYYYIRKDFSMARFAQMMW 240 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNFL F C+VLNN YM YYICPMHT FTLMVYGAL ILNKYNE GSVIA K+ +CFL+ Sbjct: 241 RLNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEFGSVIAAKIGACFLV 300 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRS----LKEWHVRTGLDRYIWIIGMI 884 +ILVWE+PGVFE VWSPF F+LG + P+ ++S L EWH R+GLDRYIWIIGMI Sbjct: 301 VILVWEIPGVFEWVWSPFTFMLG----YTDPDPSKSHFTRLHEWHFRSGLDRYIWIIGMI 356 Query: 883 YAYYHPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYT 704 YAYYHPTVE+WMEKLEE E KRRISIK + +S +GYLWFEYIYKLD + Y KYHPYT Sbjct: 357 YAYYHPTVERWMEKLEETEIKRRISIKASVVLISSVMGYLWFEYIYKLDKVTYNKYHPYT 416 Query: 703 SWIPITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSL 524 SWIPITVYICLRN+TQ FR YSLTLFAWLGK+TLETYISQ HIW R+GI DG+PKL+LSL Sbjct: 417 SWIPITVYICLRNITQSFRSYSLTLFAWLGKVTLETYISQIHIWLRSGIPDGQPKLLLSL 476 Query: 523 IPDYPMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTL 344 IPDYPMLNF+LTT+I+VAISYRLFQLTNTLK AF+PS DDKRL+HN++ IS+ LY+L Sbjct: 477 IPDYPMLNFLLTTSIYVAISYRLFQLTNTLKNAFVPSKDDKRLIHNLITGTTISVVLYSL 536 Query: 343 S 341 S Sbjct: 537 S 537 >ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Glycine max] gi|734318066|gb|KHN02869.1| CAS1 domain-containing protein 1 [Glycine soja] Length = 552 Score = 771 bits (1991), Expect = 0.0 Identities = 376/543 (69%), Positives = 431/543 (79%), Gaps = 11/543 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M + +P+TPGQVS LLGIIP+ A +Y+E EYRKNS S R SDIN Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSR--ARQSDINLVEMGSDVVK 58 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 + RA LLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Sbjct: 59 DEDRAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTE Sbjct: 119 YLCDRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ +CFL+ Sbjct: 239 RLNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWE+PGVFE VWSPF F LG + L EWH R+GLDRYIWIIGMIYAYY Sbjct: 299 VILVWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD I Y KYHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPD+ Sbjct: 419 ITVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDF 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I+VAISYRLF LTNTLK AF+PS DDKR +HN++ A IS+ LY+LS+ Sbjct: 479 PMLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGF 538 Query: 331 CKL 323 ++ Sbjct: 539 LRV 541 >gb|KRH50687.1| hypothetical protein GLYMA_07G237000 [Glycine max] Length = 545 Score = 771 bits (1990), Expect = 0.0 Identities = 374/540 (69%), Positives = 434/540 (80%), Gaps = 8/540 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNS---KLSSEIGRI---SDI--NDG 1763 M + +P+TPGQVS LLGIIP+ A +Y+E EYRKNS + S+I + SD+ ++ Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSRAQSDINLVEMGSDVVKDED 60 Query: 1762 RAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYIC 1583 RA LLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Y+C Sbjct: 61 RAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYFYLC 120 Query: 1582 DRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEWK 1403 DRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTEEWK Sbjct: 121 DRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEEWK 180 Query: 1402 GWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLN 1223 GWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMWRLN Sbjct: 181 GWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 240 Query: 1222 FLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIIIL 1043 F F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ +CFL++IL Sbjct: 241 FFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLVVIL 300 Query: 1042 VWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHPT 863 VWE+PGVFE VWSPF F LG + L EWH R+GLDRYIWIIGMIYAYYHPT Sbjct: 301 VWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 360 Query: 862 VEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPITV 683 VE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD I Y KYHPYTSWIPITV Sbjct: 361 VERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIPITV 420 Query: 682 YICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPML 503 YICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPD+PML Sbjct: 421 YICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDFPML 480 Query: 502 NFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICKL 323 NFMLTT+I+VAISYRLF LTNTLK AF+PS DDKR +HN++ A IS+ LY+LS+ ++ Sbjct: 481 NFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGFLRV 540 >gb|KHN03266.1| CAS1 domain-containing protein 1 [Glycine soja] Length = 552 Score = 771 bits (1990), Expect = 0.0 Identities = 376/543 (69%), Positives = 430/543 (79%), Gaps = 11/543 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M + +P+TPGQVS LLGIIP+ A +Y+E EYR N S R SDIN Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEMLEYRNNYVPSR--ARQSDINLVEIGSDVVK 58 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 + RAALLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Sbjct: 59 DEDRAALLEGGALQSGSPKARSLTASPSIIRFLLMDEYFLLENRLTLRAMSEFGLILAYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTE Sbjct: 119 YLCDRTDFFASSKKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ CFL+ Sbjct: 239 RLNFFVVFSCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIGCFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWE+PGVFE +WSPF F LG + L EWH R+GLDRYIWIIGMIYAYY Sbjct: 299 VILVWEIPGVFEWLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD + Y KYHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKVTYNKYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+GI DG+PKL+LSLIPDY Sbjct: 419 ITVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPDY 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I+VAISYRLF LTNTLK AF+PS DDKRL+HN++ A IS+ LY+LS+ Sbjct: 479 PMLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRLVHNLITATTISVVLYSLSLGF 538 Query: 331 CKL 323 ++ Sbjct: 539 LRV 541 >ref|XP_006600388.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X2 [Glycine max] gi|947052946|gb|KRH02399.1| hypothetical protein GLYMA_17G036600 [Glycine max] gi|947052947|gb|KRH02400.1| hypothetical protein GLYMA_17G036600 [Glycine max] Length = 546 Score = 771 bits (1990), Expect = 0.0 Identities = 376/543 (69%), Positives = 430/543 (79%), Gaps = 11/543 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M + +P+TPGQVS LLGIIP+ A +Y+E EYR N S R SDIN Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEMLEYRNNYVPSR--ARQSDINLVEIGSDVVK 58 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 + RAALLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Sbjct: 59 DEDRAALLEGGALQSGSPKARSLTASPSIIRFLLMDEYFLLENRLTLRAMSEFGLILAYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 Y+CDRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTE Sbjct: 119 YLCDRTDFFASSKKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNF F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ CFL+ Sbjct: 239 RLNFFVVFSCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIGCFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWE+PGVFE +WSPF F LG + L EWH R+GLDRYIWIIGMIYAYY Sbjct: 299 VILVWEIPGVFEWLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD + Y KYHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKVTYNKYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+GI DG+PKL+LSLIPDY Sbjct: 419 ITVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPDY 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I+VAISYRLF LTNTLK AF+PS DDKRL+HN++ A IS+ LY+LS+ Sbjct: 479 PMLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRLVHNLITATTISVVLYSLSLGF 538 Query: 331 CKL 323 ++ Sbjct: 539 LRV 541 >ref|XP_006583994.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X2 [Glycine max] Length = 551 Score = 771 bits (1990), Expect = 0.0 Identities = 374/540 (69%), Positives = 434/540 (80%), Gaps = 8/540 (1%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNS---KLSSEIGRI---SDI--NDG 1763 M + +P+TPGQVS LLGIIP+ A +Y+E EYRKNS + S+I + SD+ ++ Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSRAQSDINLVEMGSDVVKDED 60 Query: 1762 RAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYMYIC 1583 RA LLE L + S + + ++RF LMD F +ENRLTLRA+SEFGL L Y Y+C Sbjct: 61 RAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYFYLC 120 Query: 1582 DRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTEEWK 1403 DRTD F S KSY+RD+FLFLYFLLI+V+A+TSF IH DKSP++GKSILYLNRHQTEEWK Sbjct: 121 DRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEEWK 180 Query: 1402 GWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWRLN 1223 GWMQV+F+MYHYF A EIYN IR+FIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMWRLN Sbjct: 181 GWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWRLN 240 Query: 1222 FLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLIIIL 1043 F F CIVLNN YM YYICPMHT FTLMVYGAL IL+KYNEIGSVIA+K+ +CFL++IL Sbjct: 241 FFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLVVIL 300 Query: 1042 VWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYYHPT 863 VWE+PGVFE VWSPF F LG + L EWH R+GLDRYIWIIGMIYAYYHPT Sbjct: 301 VWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYHPT 360 Query: 862 VEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIPITV 683 VE+WMEKLEEAE KRRISIK + VGYLWFE+IYKLD I Y KYHPYTSWIPITV Sbjct: 361 VERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIPITV 420 Query: 682 YICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDYPML 503 YICLRNVTQ FR Y+LTLFAWLGKITLETYISQ HIW R+G+ DG+PKL+LSLIPD+PML Sbjct: 421 YICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDFPML 480 Query: 502 NFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAICKL 323 NFMLTT+I+VAISYRLF LTNTLK AF+PS DDKR +HN++ A IS+ LY+LS+ ++ Sbjct: 481 NFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGFLRV 540 >ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Eucalyptus grandis] gi|629123038|gb|KCW87528.1| hypothetical protein EUGRSUZ_B03976 [Eucalyptus grandis] Length = 546 Score = 770 bits (1989), Expect = 0.0 Identities = 375/542 (69%), Positives = 438/542 (80%), Gaps = 11/542 (2%) Frame = -2 Query: 1918 MEISAPITPGQVSILLGIIPLFAATLYAEWSEYRKNSKLSSEIGRISDIN---------- 1769 M I P+TPGQVS L+GIIP AA +Y+E+ EY++NS +SS++ R SD+N Sbjct: 1 MAIHGPVTPGQVSFLIGIIPTIAAWIYSEFLEYKRNS-VSSKVRR-SDVNLVEMGNDVVK 58 Query: 1768 -DGRAALLEDPNLPLSTINLDSPSVASQLLRFFLMDSSFFVENRLTLRAISEFGLYLVYM 1592 D RA LLE L ++ + S S + RF +MD SF VENRLTLRAI+EF + L Y Sbjct: 59 EDDRAVLLEGGGLQSASPRSRNSSATSPIFRFLVMDESFLVENRLTLRAIAEFSMLLAYF 118 Query: 1591 YICDRTDTFGYSIKSYSRDIFLFLYFLLIMVAAITSFTIHQDKSPITGKSILYLNRHQTE 1412 ++CDRTD F S KSY+RD+FLFLYFLLI+V+A+TSFT H +KSPI+GKSILYLNRHQTE Sbjct: 119 FLCDRTDFFESSKKSYNRDLFLFLYFLLIIVSAMTSFTTHNEKSPISGKSILYLNRHQTE 178 Query: 1411 EWKGWMQVMFVMYHYFGALEIYNGIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMW 1232 EWKGWMQV+F+MYHYF A EIYN IRIFIAAYVWMTGFGNFSYYYVRKDFS+ARFAQMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1231 RLNFLAAFVCIVLNNDYMFYYICPMHTFFTLMVYGALRILNKYNEIGSVIALKMASCFLI 1052 RLNFL F C+VLNN+Y+ YYICPMHT FTLMVYGAL ILNKYNE+G IALK+ +CFL+ Sbjct: 239 RLNFLVLFCCVVLNNNYVLYYICPMHTLFTLMVYGALGILNKYNEVGMAIALKIIACFLV 298 Query: 1051 IILVWEVPGVFELVWSPFMFLLGCSVTFLGPEGTRSLKEWHVRTGLDRYIWIIGMIYAYY 872 +ILVWEVPGVFELVWSPF FLLG S L+EWH R+GLDRYIWI+GMIYAYY Sbjct: 299 VILVWEVPGVFELVWSPFTFLLGYSDPDPSKPKFPLLREWHFRSGLDRYIWIVGMIYAYY 358 Query: 871 HPTVEKWMEKLEEAEFKRRISIKLAAASVSLTVGYLWFEYIYKLDSIPYQKYHPYTSWIP 692 HPTVE+WMEKLEEAE+KRR+ IK A S SLTVGYLWFEY+YKLD I Y KYHPYTSWIP Sbjct: 359 HPTVERWMEKLEEAEWKRRLLIKGAVISTSLTVGYLWFEYVYKLDKITYNKYHPYTSWIP 418 Query: 691 ITVYICLRNVTQYFRCYSLTLFAWLGKITLETYISQFHIWQRTGILDGEPKLVLSLIPDY 512 ITVYI LRNVTQ+ R SLTLFAWLGKITLETYISQ HIW R+GI DG+PKL+LSLIP+Y Sbjct: 419 ITVYISLRNVTQHLRSCSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPNY 478 Query: 511 PMLNFMLTTAIFVAISYRLFQLTNTLKTAFIPSGDDKRLMHNIVAAGAISISLYTLSIAI 332 PMLNFMLTT+I++ +S+RLF LTNTLK AF+PS D+KRL++N+++A IS LY+LS Sbjct: 479 PMLNFMLTTSIYIVVSHRLFNLTNTLKNAFVPSKDNKRLLNNMISAAVISSVLYSLSFVF 538 Query: 331 CK 326 K Sbjct: 539 LK 540