BLASTX nr result
ID: Forsythia22_contig00011478
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00011478
         (2233 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011100735.1| PREDICTED: CAS1 domain-containing protein 1 ...   965   0.0
ref|XP_006338003.1| PREDICTED: CAS1 domain-containing protein 1-...   939   0.0
ref|XP_004229023.1| PREDICTED: CAS1 domain-containing protein 1 ...   938   0.0
ref|XP_009803356.1| PREDICTED: CAS1 domain-containing protein 1 ...   927   0.0
emb|CDO98736.1| unnamed protein product [Coffea canephora]            909   0.0
ref|XP_007035864.1| O-acetyltransferase family protein isoform 1...   899   0.0
ref|XP_007035865.1| O-acetyltransferase family protein isoform 2...   896   0.0
ref|XP_012829945.1| PREDICTED: CAS1 domain-containing protein 1-...   894   0.0
ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 ...   892   0.0
ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-...   892   0.0
gb|KHG01507.1| CAS1 domain-containing 1 [Gossypium arboreum]          890   0.0
ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-...   889   0.0
ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phas...   885   0.0
ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 ...   883   0.0
ref|XP_012084122.1| PREDICTED: probable O-acetyltransferase CAS1...   882   0.0
ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1 ...   882   0.0
gb|EYU43511.1| hypothetical protein MIMGU_mgv1a004398mg [Erythra...   882   0.0
ref|XP_012084124.1| PREDICTED: probable O-acetyltransferase CAS1...   880   0.0
ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 ...
878 0.0 ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-... 878 0.0 >ref|XP_011100735.1| PREDICTED: CAS1 domain-containing protein 1 [Sesamum indicum] Length = 545 Score = 965 bits (2495), Expect = 0.0 Identities = 470/547 (85%), Positives = 499/547 (91%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 MVIYGPLTPGQV+FFLGI+P+ AW+YSEYLE+RK SS +K GRNSD LVEL TVKE Sbjct: 1 MVIYGPLTPGQVAFFLGIVPVFAAWLYSEYLEYRKNSSFSKHGRNSDSKLVELAG-TVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQSASPR RNSSVTSH++RFLTLDESFLLENRLTLRAISEFGALL+YFY Sbjct: 60 DDRAVLLEGGGLQSASPRERNSSVTSHVIRFLTLDESFLLENRLTLRAISEFGALLVYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 ICDRTNLLGESKKSYNRDLF FLYFLLIIVSAITSFKIHNDKSPFSGK IMYLNRHQTEE Sbjct: 120 ICDRTNLLGESKKSYNRDLFLFLYFLLIIVSAITSFKIHNDKSPFSGKSIMYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARF+QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFMQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLVFFCC+VLNN+YMLYYICPMHTLFTLMVYGALGIFNKYN++GTVIA K +ACFLVV Sbjct: 240 LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNDRGTVIAAKFLACFLVV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 ILIWEVPGVF+L W PFTF LGY DP SK K LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILIWEVPGVFDLFWGPFTFLLGYTDP-ASKVKFPLLHEWHFRSGLDRYIWIIGMIYAYYH 358 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEEAE KRR+SIK I++IISLTIGYLWVE+IYKLPKITYNKYHPYTSWIPI Sbjct: 359 PTVERWMEKLEEAETKRRISIKTIIIIISLTIGYLWVEFIYKLPKITYNKYHPYTSWIPI 418 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNVTQHFR YTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK Sbjct: 419 TVYICLRNVTQHFRCYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKLLLSLIPNYP 478 Query: 637 
XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIY+AVSYRLFELTNTLKS FVPTKD+KRLGHNL AVV+A ILY ++ +L+ Sbjct: 479 LLNFMLTTSIYIAVSYRLFELTNTLKSTFVPTKDNKRLGHNLAAAVVIAGILYILAVILV 538 Query: 457 KVPQMLV 437 KVPQ++V Sbjct: 539 KVPQVMV 545 >ref|XP_006338003.1| PREDICTED: CAS1 domain-containing protein 1-like [Solanum tuberosum] Length = 546 Score = 939 bits (2428), Expect = 0.0 Identities = 452/547 (82%), Positives = 494/547 (90%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+IYGPL+PGQVSFFLGI+P+C AW+YSEYLE++K S+S+K R+SDINLVELG E VKE Sbjct: 1 MLIYGPLSPGQVSFFLGIVPVCAAWLYSEYLEYKKNSASSKV-RHSDINLVELGNEAVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQS SPR+R+SSVTS I RF +DE+FLLENR TLRAISEFGALL YFY Sbjct: 60 DDRAVLLEGGGLQSTSPRIRSSSVTSQIARFFLMDETFLLENRSTLRAISEFGALLTYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 + DRTNL GESKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK IMYLNRHQTEE Sbjct: 120 LSDRTNLFGESKKSYNRDLFIFLYFLLIIVSAITSFKIHHDKSPFSGKSIMYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFTQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLVFF C++LNN+YMLYYICPMHTLFTLMVYGALGIFNKYNE GTVIA K IACFLVV Sbjct: 240 LNFLVFFSCVILNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTVIAVKFIACFLVV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WEVPGVFE++WSPFTFFLGYADPDPSK K S LHEW FRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILMWEVPGVFEVVWSPFTFFLGYADPDPSKPKQSLLHEWEFRSGLDRYIWIIGMIYAYYH 359 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E KRR+SIK V ++SLT+GYLW EYIYKLPK+TYNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEETEVKRRISIKAAVALMSLTMGYLWYEYIYKLPKVTYNKYHPYTSWIPI 419 Query: 817 
TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYISLRNVTQ+FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYISLRNVTQYFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPNYP 479 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLT +IYVAVS+RLFELTNTLKS F+P KDDKRLG+N+V A+V++ +LY +S V L Sbjct: 480 LMNFMLTAAIYVAVSHRLFELTNTLKSTFIPMKDDKRLGYNIVAALVVSGLLYVLSSVFL 539 Query: 457 KVPQMLV 437 +VPQMLV Sbjct: 540 RVPQMLV 546 >ref|XP_004229023.1| PREDICTED: CAS1 domain-containing protein 1 [Solanum lycopersicum] Length = 546 Score = 938 bits (2424), Expect = 0.0 Identities = 452/547 (82%), Positives = 493/547 (90%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+IYGPL+PGQVSFFLGI+PIC AW+YSEYLE++K S+S+K R+SDINLVELG+E VKE Sbjct: 1 MLIYGPLSPGQVSFFLGIVPICAAWLYSEYLEYKKNSASSKV-RHSDINLVELGDEAVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQS SPR+R+SSVTS RF +DE+FLLENRLTLRAISEFG LLIYFY Sbjct: 60 DDRAVLLEGGGLQSTSPRIRSSSVTSQFTRFFLMDETFLLENRLTLRAISEFGTLLIYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 I DRTNL GESKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK IMYLNRHQTEE Sbjct: 120 ISDRTNLFGESKKSYNRDLFIFLYFLLIIVSAITSFKIHHDKSPFSGKSIMYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFAQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLVFF C++LNN+YMLYYICPMHTLFTLMVYGALGIFNKYNE GT+IA K I CFL V Sbjct: 240 LNFLVFFSCVILNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTIIAVKFIVCFLFV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WEVPGVFE++WSPFTFFLGYADPDPSK K S LHEW FRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILMWEVPGVFEVVWSPFTFFLGYADPDPSKPKQSLLHEWEFRSGLDRYIWIIGMIYAYYH 359 Query: 997 
PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVE+WMEKLEE E KRR+SIK V I+SLT+GYLW EYIYKLPK TYNKYHPYTSWIPI Sbjct: 360 PTVEKWMEKLEETEVKRRISIKAAVAIMSLTMGYLWYEYIYKLPKETYNKYHPYTSWIPI 419 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYISLRNVTQ+FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYISLRNVTQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPGYP 479 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTT+IYVAVS+RLFELTNTLKS F+P K+DKRLG+N+V A+V++ +LY +S V L Sbjct: 480 LMNFMLTTAIYVAVSHRLFELTNTLKSTFIPMKEDKRLGYNIVAALVVSGLLYVLSSVFL 539 Query: 457 KVPQMLV 437 +VPQMLV Sbjct: 540 RVPQMLV 546 >ref|XP_009803356.1| PREDICTED: CAS1 domain-containing protein 1 [Nicotiana sylvestris] Length = 546 Score = 927 bits (2396), Expect = 0.0 Identities = 447/547 (81%), Positives = 490/547 (89%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M++YGPLTP QVSFFLGI+ IC AW+YSEYLE+ K S+S+K R+SDINLVELG E VKE Sbjct: 1 MLLYGPLTPAQVSFFLGIVSICAAWLYSEYLEYEKNSASSKV-RHSDINLVELGNEAVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQSASPR+R+SSVTS I RF +DESF LENRLTLRAISEFGALL YFY Sbjct: 60 DDRAVLLEGGGLQSASPRMRSSSVTSQIARFCLMDESFFLENRLTLRAISEFGALLTYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 + DRTNL GESKKSYNRDLF FLYFLLII+SAITSF IH+DKSPFSG+ IMYLNRHQTEE Sbjct: 120 LSDRTNLFGESKKSYNRDLFLFLYFLLIIISAITSFTIHHDKSPFSGRSIMYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFTQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLVFF CIVLNN+YMLYYICPMHTLFTLMVYGALGIFNKYNE GT+IAGKII CFLVV Sbjct: 240 LNFLVFFSCIVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTIIAGKIITCFLVV 299 Query: 1177 
ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WEVPGVFE++WSPFT FLGYADPDP K K S LHEW FRSGLDRYIWI+GMIYAYYH Sbjct: 300 ILMWEVPGVFEVVWSPFTCFLGYADPDPLKTKQSLLHEWQFRSGLDRYIWIVGMIYAYYH 359 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E KRR+SIK +V IISL +GYLW E+IYKLPKITYNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEETEVKRRISIKAVVAIISLAMGYLWYEHIYKLPKITYNKYHPYTSWIPI 419 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYISLRNVTQ+F SY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYISLRNVTQYFCSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPNYP 479 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTT+IYVAVS+RLFELTNTLKS F+P KD+KRLG+N++ A+V++ +LY +S V L Sbjct: 480 LLNFMLTTAIYVAVSHRLFELTNTLKSTFIPMKDNKRLGYNIIAALVVSGLLYVLSSVFL 539 Query: 457 KVPQMLV 437 +VPQ+LV Sbjct: 540 RVPQLLV 546 >emb|CDO98736.1| unnamed protein product [Coffea canephora] Length = 544 Score = 909 bits (2349), Expect = 0.0 Identities = 442/547 (80%), Positives = 481/547 (87%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 +VIYGPLTPGQVSFFLGI+P+ AWIY+E LE++K S S R+SDI LVELG VKE Sbjct: 2 VVIYGPLTPGQVSFFLGIVPMFAAWIYAEILEYKKASVSKS--RHSDITLVELGNGGVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 +D AVLLEGG LQSASPRVR+SS S I+RFL +DESFLLENRLTLRAISE GALLIYFY Sbjct: 60 EDSAVLLEGGGLQSASPRVRSSSAASQILRFLMMDESFLLENRLTLRAISELGALLIYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRTN+ G+SKKSYNRDLF FLYFLLIIVSAITSFKIH DKSPFSGK IMYLNRHQTEE Sbjct: 120 VCDRTNIFGQSKKSYNRDLFLFLYFLLIIVSAITSFKIHQDKSPFSGKSIMYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLV 
CCI+L+N+Y LYYICPMHTLFTLMVYGALGI NKYNE GTVIA KI+ CFL V Sbjct: 240 LNFLVLLCCIILDNNYTLYYICPMHTLFTLMVYGALGILNKYNESGTVIAAKIMTCFLAV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 ILIWE+PGVFEL+WSPFTF LGY+ DPSK LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILIWEIPGVFELIWSPFTFLLGYS--DPSKPPQPRLHEWHFRSGLDRYIWIIGMIYAYYH 357 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E KRR+SIK VVIISL +GYLW+EYIYKLPKITYNKYHPYTSWIPI Sbjct: 358 PTVERWMEKLEETEVKRRISIKTAVVIISLAVGYLWLEYIYKLPKITYNKYHPYTSWIPI 417 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNV+Q+FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 418 TVYICLRNVSQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPEYP 477 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIY+AVSYRLFELTN LKS FVP+KD+KRLGHN+V AVV+AS LY +SFVLL Sbjct: 478 LLNFMLTTSIYIAVSYRLFELTNMLKSTFVPSKDNKRLGHNIVAAVVIASGLYMLSFVLL 537 Query: 457 KVPQMLV 437 ++P M+V Sbjct: 538 RIPPMMV 544 >ref|XP_007035864.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao] gi|508714893|gb|EOY06790.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao] Length = 545 Score = 899 bits (2323), Expect = 0.0 Identities = 435/548 (79%), Positives = 482/548 (87%), Gaps = 1/548 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M I+GP+TPGQVSFFLGI P+ AWIY+EYLE++K S +K R+SD+NLVE+G VKE Sbjct: 1 MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESK-ARHSDVNLVEIGNGAVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVT-SHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1721 DDRAVLLEGG LQSASP+ R SS + S I +FL +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 60 DDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYY 119 Query: 1720 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1541 YICDRT++ +KK+YNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 120 YICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 179 Query: 1540 
EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1361 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 180 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 239 Query: 1360 RLNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1181 RLNFLVFFCC++LNNSY+LYYICPMHTLFTLMVYG LGI NKYNE G+VIA KIIACFLV Sbjct: 240 RLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLV 299 Query: 1180 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1001 VIL+WEVPGVFE+LWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 300 VILVWEVPGVFEILWSPFTFFLGYT--DPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYY 357 Query: 1000 HPTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 821 HPTVERWMEKLEEAE KRR+ IKM V I+LT+GY W EYIYKL KITYNKYHPYTSWIP Sbjct: 358 HPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIP 417 Query: 820 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 641 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 418 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 477 Query: 640 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVL 461 NFMLTTSIYVA+SYRLF+LTN LK+AFVPTKDDKRL +NL+ AVV++SILY++SF L Sbjct: 478 PMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFAL 537 Query: 460 LKVPQMLV 437 L++PQMLV Sbjct: 538 LRIPQMLV 545 >ref|XP_007035865.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao] gi|508714894|gb|EOY06791.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao] Length = 544 Score = 896 bits (2316), Expect = 0.0 Identities = 434/548 (79%), Positives = 481/548 (87%), Gaps = 1/548 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M I+GP+TPGQVSFFLGI P+ AWIY+EYLE++K S +K +SD+NLVE+G VKE Sbjct: 1 MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESKA--HSDVNLVEIGNGAVKE 58 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVT-SHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1721 DDRAVLLEGG LQSASP+ R SS + S I +FL +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 59 
DDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYY 118 Query: 1720 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1541 YICDRT++ +KK+YNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 119 YICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178 Query: 1540 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1361 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 179 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1360 RLNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1181 RLNFLVFFCC++LNNSY+LYYICPMHTLFTLMVYG LGI NKYNE G+VIA KIIACFLV Sbjct: 239 RLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLV 298 Query: 1180 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1001 VIL+WEVPGVFE+LWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 299 VILVWEVPGVFEILWSPFTFFLGYT--DPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYY 356 Query: 1000 HPTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 821 HPTVERWMEKLEEAE KRR+ IKM V I+LT+GY W EYIYKL KITYNKYHPYTSWIP Sbjct: 357 HPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIP 416 Query: 820 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 641 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 417 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 476 Query: 640 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVL 461 NFMLTTSIYVA+SYRLF+LTN LK+AFVPTKDDKRL +NL+ AVV++SILY++SF L Sbjct: 477 PMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFAL 536 Query: 460 LKVPQMLV 437 L++PQMLV Sbjct: 537 LRIPQMLV 544 >ref|XP_012829945.1| PREDICTED: CAS1 domain-containing protein 1-like [Erythranthe guttatus] Length = 557 Score = 894 bits (2309), Expect = 0.0 Identities = 447/559 (79%), Positives = 479/559 (85%), Gaps = 12/559 (2%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+IYGPLTPGQVSFFLG++P+ AW+YSEYLE+ K SS K GRNSDINLVEL TVKE Sbjct: 1 
MLIYGPLTPGQVSFFLGLVPVFAAWLYSEYLENAKSSSLPKHGRNSDINLVELAG-TVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG L SASP +RNSSVTS + R L LDESFLLENRL LRA+SEFGALLIYFY Sbjct: 60 DDRAVLLEGGGLHSASPTLRNSSVTSQLTRLLMLDESFLLENRLILRAMSEFGALLIYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 ICDRTNLLGE+KKSYNRDLF FLYFLLIIVSA TSFKIHNDKSP SGK IMYLNRHQTEE Sbjct: 120 ICDRTNLLGEAKKSYNRDLFLFLYFLLIIVSAKTSFKIHNDKSPLSGKSIMYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATE YNAIRVFIA YVWMTGFGNFSYYYIRKDFSLARFIQMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLVFFCC+VLNN+YMLYYICPMHTLFTLMVYGALGIFNKYNE+G VIA KI+ CFLVV Sbjct: 240 LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNERGIVIAAKIVVCFLVV 299 Query: 1177 ILIW----------EVPGVFE--LLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRY 1034 +L+W P FE S F GY DP SK K S LHEWHFRSGLDRY Sbjct: 300 VLLWGSARPICTTINKPFEFEKCTSLSQCEFVSGYTDP-ASKVKLSLLHEWHFRSGLDRY 358 Query: 1033 IWIIGMIYAYYHPTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITY 854 IWIIGMIYAYYHPTVERW+EKLEEAE KRR+ IK IV I+SLTIGYLWVE++YKLPKITY Sbjct: 359 IWIIGMIYAYYHPTVERWLEKLEEAEIKRRILIKSIVGIVSLTIGYLWVEFVYKLPKITY 418 Query: 853 NKYHPYTSWIPITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQ 674 NKYHPYTSWIPITVYI LRNVTQHFR Y+LTLFAWLGKVTLETYISQIHIWLRS VP+GQ Sbjct: 419 NKYHPYTSWIPITVYICLRNVTQHFRCYSLTLFAWLGKVTLETYISQIHIWLRSSVPNGQ 478 Query: 673 PKXXXXXXXXXXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVM 494 PK NFMLT SIY+AVSYRLFELTNTLKSAFVPTKD+KRLGHNL+ AV++ Sbjct: 479 PKLLLSLIPNYPLLNFMLTASIYIAVSYRLFELTNTLKSAFVPTKDNKRLGHNLIAAVII 538 Query: 493 ASILYTVSFVLLKVPQMLV 437 ASILY ++ VL+KVPQM+V Sbjct: 539 ASILYILAVVLIKVPQMIV 557 >ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Nelumbo nucifera] Length = 547 Score = 892 bits (2306), Expect = 0.0 Identities = 
421/546 (77%), Positives = 481/546 (88%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M I P+TPGQVSF LGI+PI VAWIYSE+LE++K S S+K GR+SDINLVELG+ETVKE Sbjct: 1 MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKVGRHSDINLVELGKETVKE 60 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DD+ L+E G LQSASP+ R+SSVTSH+ RF +DESFL ENRL LRAISEFG ++ YFY Sbjct: 61 DDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFY 120 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 ICDRTN+ GESKK+YNRDLF FLYFLLIIVSA+TSFKIH+DKSPFSGK I+YLNRHQTEE Sbjct: 121 ICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEE 180 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF QMMWR Sbjct: 181 WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWR 240 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNF V FCCIVLNN+YMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KI ACF+VV Sbjct: 241 LNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVV 300 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WEVPGVF+++WSPFTFFLGY+DP+PSK K+ LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 301 ILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYH 360 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E +RR+SIK+ V + +GYLW EYIYKL K+ YNKYHPYTSWIPI Sbjct: 361 PTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPI 420 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQP+ Sbjct: 421 TVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYP 480 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIYVA+S+R+FELTNTLKS+FVP+KD+KRL +N++ A V++ +LY++SFV L Sbjct: 481 MLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFL 540 Query: 457 KVPQML 440 ++P++L Sbjct: 541 
QIPKIL 546 >ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Gossypium raimondii] gi|763805944|gb|KJB72882.1| hypothetical protein B456_011G202400 [Gossypium raimondii] Length = 545 Score = 892 bits (2304), Expect = 0.0 Identities = 428/548 (78%), Positives = 486/548 (88%), Gaps = 1/548 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+I+GP+TPGQVSFFLG+ P+ AWIY+EYL+++K S ++K R+SD++LVE+G VKE Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASK-ARHSDVSLVEIGNVAVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1721 +DRAVLLEGG LQS SP+ R+S S S I++F+ +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 60 EDRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYY 119 Query: 1720 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1541 YICDRT++ SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 120 YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 179 Query: 1540 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1361 EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 180 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 239 Query: 1360 RLNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1181 RLNFLVFFCC+VLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV Sbjct: 240 RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 299 Query: 1180 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1001 VIL+WEVPGVFELLWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 300 VILVWEVPGVFELLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 357 Query: 1000 HPTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 821 HPTVERWMEKLEE E KRR+SIK+ V II+L +G+LW E+IYKL K+TYNKYHPYTSWIP Sbjct: 358 HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIP 417 Query: 820 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 641 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 418 
ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 477 Query: 640 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVL 461 NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+ VV++SI+Y++SFV Sbjct: 478 PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVF 537 Query: 460 LKVPQMLV 437 L++PQMLV Sbjct: 538 LRIPQMLV 545 >gb|KHG01507.1| CAS1 domain-containing 1 [Gossypium arboreum] Length = 544 Score = 890 bits (2301), Expect = 0.0 Identities = 429/548 (78%), Positives = 484/548 (88%), Gaps = 1/548 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+I+GP+TPGQVSFFLG+ P+ AWIY+EYL+++K S ++K +SD++LVE+G VKE Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKA--HSDVSLVEIGNGAVKE 58 Query: 1897 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1721 +DRAVLLEGG LQS SP+ R+S S S IV+FL +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 59 EDRAVLLEGGGLQSGSPKARSSTSSVSPIVKFLMMDETFLIENRLTLRAISEFGVLLAYY 118 Query: 1720 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1541 YICDRT++ SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 119 YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178 Query: 1540 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1361 EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1360 RLNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1181 RLNFLVFFCC+VLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV Sbjct: 239 RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 298 Query: 1180 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1001 VIL+WEVPGVFE LWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 299 VILVWEVPGVFEFLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 356 Query: 1000 HPTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 821 HPTVERWMEKLEE E KRR+SIK+ V II+L +G+LW E+IYKL KITYNKYHPYTSWIP Sbjct: 357 
HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKITYNKYHPYTSWIP 416 Query: 820 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 641 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSG+PDGQPK Sbjct: 417 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPDY 476 Query: 640 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVL 461 NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+ VV++SILY++SFV Sbjct: 477 PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSILYSLSFVF 536 Query: 460 LKVPQMLV 437 L++PQMLV Sbjct: 537 LRIPQMLV 544 >ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X2 [Gossypium raimondii] gi|763805945|gb|KJB72883.1| hypothetical protein B456_011G202400 [Gossypium raimondii] Length = 544 Score = 889 bits (2297), Expect = 0.0 Identities = 427/548 (77%), Positives = 485/548 (88%), Gaps = 1/548 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+I+GP+TPGQVSFFLG+ P+ AWIY+EYL+++K S ++K +SD++LVE+G VKE Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKA--HSDVSLVEIGNVAVKE 58 Query: 1897 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1721 +DRAVLLEGG LQS SP+ R+S S S I++F+ +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 59 EDRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYY 118 Query: 1720 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1541 YICDRT++ SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 119 YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178 Query: 1540 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1361 EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1360 RLNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1181 RLNFLVFFCC+VLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV Sbjct: 239 RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 298 Query: 1180 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1001 
VIL+WEVPGVFELLWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 299 VILVWEVPGVFELLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 356 Query: 1000 HPTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 821 HPTVERWMEKLEE E KRR+SIK+ V II+L +G+LW E+IYKL K+TYNKYHPYTSWIP Sbjct: 357 HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIP 416 Query: 820 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 641 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 417 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 476 Query: 640 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVL 461 NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+ VV++SI+Y++SFV Sbjct: 477 PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVF 536 Query: 460 LKVPQMLV 437 L++PQMLV Sbjct: 537 LRIPQMLV 544 >ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris] gi|561027770|gb|ESW26410.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris] Length = 546 Score = 885 bits (2287), Expect = 0.0 Identities = 426/547 (77%), Positives = 475/547 (86%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+I P+TPGQVSF LGI P+ VAWIYSE LE+RK S S+K G +SDINLVE+G + VK+ Sbjct: 1 MLILSPVTPGQVSFLLGITPVVVAWIYSEILEYRKNSVSSKAG-HSDINLVEMGSDAVKD 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 +D+AVLLEGGALQS SPR R+ + + I+RFL +D FLLENRLTLRA+SEFG LL YFY Sbjct: 60 EDKAVLLEGGALQSGSPRARSLTASPSIIRFLLMDNYFLLENRLTLRAMSEFGLLLAYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRT+ SKKSYNRD+F FLYFLLIIVSA+TSFKIH DKSPFSGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFASSKKSYNRDIFLFLYFLLIIVSAMTSFKIHQDKSPFSGKSILYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1357 
LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNF V FCCIVLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KIIACFLVV Sbjct: 240 LNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIAVKIIACFLVV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WE+PGVFELLWSPFTFFLGY DP+P+K+ S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILVWEIPGVFELLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYH 359 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEEAE KRR+SIK +V+I +GYLW E+IYKL K+TYN YHPYTSWIPI Sbjct: 360 PTVERWMEKLEEAEIKRRISIKATIVLICSLVGYLWFEHIYKLDKLTYNTYHPYTSWIPI 419 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNVTQ FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDYP 479 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIYVA+S RLF+LTNTLK AFVP+KDDKRL HNL+ A ++ +LY++SF L Sbjct: 480 MLNFMLTTSIYVAISCRLFDLTNTLKVAFVPSKDDKRLVHNLITATTISVVLYSLSFGFL 539 Query: 457 KVPQMLV 437 ++PQMLV Sbjct: 540 RLPQMLV 546 >ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 isoform X2 [Nelumbo nucifera] Length = 545 Score = 883 bits (2282), Expect = 0.0 Identities = 419/546 (76%), Positives = 479/546 (87%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M I P+TPGQVSF LGI+PI VAWIYSE+LE++K S S+K +SDINLVELG+ETVKE Sbjct: 1 MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKV--HSDINLVELGKETVKE 58 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DD+ L+E G LQSASP+ R+SSVTSH+ RF +DESFL ENRL LRAISEFG ++ YFY Sbjct: 59 DDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFY 118 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 ICDRTN+ GESKK+YNRDLF FLYFLLIIVSA+TSFKIH+DKSPFSGK I+YLNRHQTEE Sbjct: 119 ICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEE 178 Query: 1537 
WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF QMMWR Sbjct: 179 WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWR 238 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNF V FCCIVLNN+YMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KI ACF+VV Sbjct: 239 LNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVV 298 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WEVPGVF+++WSPFTFFLGY+DP+PSK K+ LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 299 ILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYH 358 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E +RR+SIK+ V + +GYLW EYIYKL K+ YNKYHPYTSWIPI Sbjct: 359 PTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPI 418 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQP+ Sbjct: 419 TVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYP 478 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIYVA+S+R+FELTNTLKS+FVP+KD+KRL +N++ A V++ +LY++SFV L Sbjct: 479 MLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFL 538 Query: 457 KVPQML 440 ++P++L Sbjct: 539 QIPKIL 544 >ref|XP_012084122.1| PREDICTED: probable O-acetyltransferase CAS1 isoform X1 [Jatropha curcas] Length = 546 Score = 882 bits (2280), Expect = 0.0 Identities = 428/549 (77%), Positives = 474/549 (86%), Gaps = 2/549 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 MVI P+TPGQVSF LG++PI AWIYSE LE++K +++ K R+SDI L E+G + VKE Sbjct: 1 MVISSPITPGQVSFLLGVVPIIAAWIYSEILEYKKNAAAAK-ARHSDIGLAEMGNDAVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQSASPR + S +S I RFL ++E FL+ENRLTLRAISEFGALL YFY Sbjct: 60 DDRAVLLEGGGLQSASPRAKTSPASSPIFRFLLMEEQFLIENRLTLRAISEFGALLGYFY 119 Query: 1717 
ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRT+ S KS+NRDLFWFLY LLIIVSAITSFKIH+D+SPFSGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFNSSTKSFNRDLFWFLYSLLIIVSAITSFKIHHDRSPFSGKPILYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLV FCC+VLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLVV Sbjct: 240 LNFLVIFCCVVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEKGSVIAVKIIACFLVV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 ILIWE+PGVFEL+WSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILIWEIPGVFELVWSPFTFFLGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 357 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E KRR+SIK V ISL GYLW E+IYK+ K+TYNKYHPYTSWIPI Sbjct: 358 PTVERWMEKLEETEVKRRISIKTAVASISLLTGYLWFEHIYKMDKVTYNKYHPYTSWIPI 417 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK--XXXXXXXX 644 TVYISLRNV+Q+FRSYTLTLFAWLGK+TLETYISQ HIWLRS +PD QPK Sbjct: 418 TVYISLRNVSQYFRSYTLTLFAWLGKITLETYISQFHIWLRSSIPDAQPKLLLSIIPQSE 477 Query: 643 XXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFV 464 NFMLTTSIYVAVSYRLF+LTNTLK AFVP+KDDKRL HN++ A V+ASILY+VSF+ Sbjct: 478 FPLLNFMLTTSIYVAVSYRLFDLTNTLKIAFVPSKDDKRLAHNMITAAVIASILYSVSFI 537 Query: 463 LLKVPQMLV 437 LK+PQ+LV Sbjct: 538 FLKIPQILV 546 >ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1 [Vitis vinifera] gi|296083216|emb|CBI22852.3| unnamed protein product [Vitis vinifera] Length = 544 Score = 882 bits (2279), Expect = 0.0 Identities = 417/547 (76%), Positives = 477/547 (87%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+I GP+TPGQV+FF+G + + AWIY+E+LE++K + +K +SD+NLVEL E TVKE Sbjct: 1 MMIVGPITPGQVAFFIGFVSVFAAWIYAEFLEYKKNAFPSKT--HSDLNLVELNE-TVKE 57 
Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQS SP+ R+SSVTSHI RFL ++ESFL+E RLTLRA+ EFGALL YFY Sbjct: 58 DDRAVLLEGGGLQSVSPKARSSSVTSHIFRFLLMEESFLIEYRLTLRAMCEFGALLAYFY 117 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRTNL G+SKKSYNRDLF FLYFLLIIVSA+TSFK+H+DKS FSGK I+YLNRHQTEE Sbjct: 118 LCDRTNLFGDSKKSYNRDLFIFLYFLLIIVSAVTSFKVHHDKSSFSGKSILYLNRHQTEE 177 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 178 WKGWMQVLFLMYHYFAATEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 237 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLV FCC+VLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE G VIA KI+ACFLVV Sbjct: 238 LNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGLVIAVKIVACFLVV 297 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 +L+WE+PGVFE +WSP TF LGY DPDPSK K S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 298 VLLWEIPGVFEFVWSPLTFILGYTDPDPSKQKFSRLHEWHFRSGLDRYIWIIGMIYAYYH 357 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E K R++IKM + ++LT+GYLW E+IYKL K+TYNKYHPYTSWIPI Sbjct: 358 PTVERWMEKLEETEVKLRVAIKMAIATVALTVGYLWFEHIYKLDKLTYNKYHPYTSWIPI 417 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 +VYI LRN+TQ FR Y+LTLFAWLGK+TLETYISQIHIWLRSG+PD QPK Sbjct: 418 SVYICLRNLTQQFRCYSLTLFAWLGKITLETYISQIHIWLRSGLPDAQPKLLLSLIPSYP 477 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIY+A+SYRLFELTNTLK AFVP+KD+K L HN++ + ++ ILYT+SF+ L Sbjct: 478 LLNFMLTTSIYIAISYRLFELTNTLKVAFVPSKDNKLLMHNIIAGIAISGILYTLSFIFL 537 Query: 457 KVPQMLV 437 +VPQ+LV Sbjct: 538 QVPQLLV 544 >gb|EYU43511.1| hypothetical protein MIMGU_mgv1a004398mg [Erythranthe guttata] Length = 529 Score = 882 bits (2278), Expect = 0.0 Identities = 439/547 (80%), Positives = 473/547 (86%) Frame = -1 Query: 2077 
MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M+IYGPLTPGQVSFFLG++P+ AW+YSEYLE+ K SS+ P +NSDINLVEL TVKE Sbjct: 1 MLIYGPLTPGQVSFFLGLVPVFAAWLYSEYLENAK--SSSLPKQNSDINLVELAG-TVKE 57 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG L SASP +RNSSVTS + R L LDESFLLENRL LRA+SEFGALLIYFY Sbjct: 58 DDRAVLLEGGGLHSASPTLRNSSVTSQLTRLLMLDESFLLENRLILRAMSEFGALLIYFY 117 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 ICDRTNLLGE+KKSYNRDLF FLYFLLIIVSA TSFKIHNDKSP SGK IMYLNRHQTEE Sbjct: 118 ICDRTNLLGEAKKSYNRDLFLFLYFLLIIVSAKTSFKIHNDKSPLSGKSIMYLNRHQTEE 177 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATE YNAIRVFIA YVWMTGFGNFSYYYIRKDFSLARFIQMMWR Sbjct: 178 WKGWMQVLFLMYHYFAATEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 237 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLVFFCC+VLNN+YMLYYICPMHTLFTLMVYGALGIFNKYNE+G VIA KI+ CFLVV Sbjct: 238 LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNERGIVIAAKIVVCFLVV 297 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 +L+W Y DP SK K S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 298 VLLWG----------------RYTDP-ASKVKLSLLHEWHFRSGLDRYIWIIGMIYAYYH 340 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERW+EKLEEAE KRR+ IK IV I+SLTIGYLWVE++YKLPKITYNKYHPYTSWIPI Sbjct: 341 PTVERWLEKLEEAEIKRRILIKSIVGIVSLTIGYLWVEFVYKLPKITYNKYHPYTSWIPI 400 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNVTQHFR Y+LTLFAWLGKVTLETYISQIHIWLRS VP+GQPK Sbjct: 401 TVYICLRNVTQHFRCYSLTLFAWLGKVTLETYISQIHIWLRSSVPNGQPKLLLSLIPNYP 460 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLT SIY+AVSYRLFELTNTLKSAFVPTKD+KRLGHNL+ AV++ASILY ++ VL+ Sbjct: 461 LLNFMLTASIYIAVSYRLFELTNTLKSAFVPTKDNKRLGHNLIAAVIIASILYILAVVLI 520 Query: 457 KVPQMLV 437 KVPQM+V Sbjct: 521 KVPQMIV 527 >ref|XP_012084124.1| PREDICTED: probable O-acetyltransferase CAS1 
isoform X2 [Jatropha curcas] gi|643716182|gb|KDP27955.1| hypothetical protein JCGZ_19035 [Jatropha curcas] Length = 545 Score = 880 bits (2273), Expect = 0.0 Identities = 427/549 (77%), Positives = 473/549 (86%), Gaps = 2/549 (0%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 MVI P+TPGQVSF LG++PI AWIYSE LE++K +++ K +SDI L E+G + VKE Sbjct: 1 MVISSPITPGQVSFLLGVVPIIAAWIYSEILEYKKNAAAAKA--HSDIGLAEMGNDAVKE 58 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQSASPR + S +S I RFL ++E FL+ENRLTLRAISEFGALL YFY Sbjct: 59 DDRAVLLEGGGLQSASPRAKTSPASSPIFRFLLMEEQFLIENRLTLRAISEFGALLGYFY 118 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRT+ S KS+NRDLFWFLY LLIIVSAITSFKIH+D+SPFSGK I+YLNRHQTEE Sbjct: 119 LCDRTDFFNSSTKSFNRDLFWFLYSLLIIVSAITSFKIHHDRSPFSGKPILYLNRHQTEE 178 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 179 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 238 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLV FCC+VLNNSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLVV Sbjct: 239 LNFLVIFCCVVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEKGSVIAVKIIACFLVV 298 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 ILIWE+PGVFEL+WSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 299 ILIWEIPGVFELVWSPFTFFLGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 356 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEE E KRR+SIK V ISL GYLW E+IYK+ K+TYNKYHPYTSWIPI Sbjct: 357 PTVERWMEKLEETEVKRRISIKTAVASISLLTGYLWFEHIYKMDKVTYNKYHPYTSWIPI 416 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK--XXXXXXXX 644 TVYISLRNV+Q+FRSYTLTLFAWLGK+TLETYISQ HIWLRS +PD QPK Sbjct: 417 TVYISLRNVSQYFRSYTLTLFAWLGKITLETYISQFHIWLRSSIPDAQPKLLLSIIPQSE 476 Query: 643 XXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFV 
464 NFMLTTSIYVAVSYRLF+LTNTLK AFVP+KDDKRL HN++ A V+ASILY+VSF+ Sbjct: 477 FPLLNFMLTTSIYVAVSYRLFDLTNTLKIAFVPSKDDKRLAHNMITAAVIASILYSVSFI 536 Query: 463 LLKVPQMLV 437 LK+PQ+LV Sbjct: 537 FLKIPQILV 545 >ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Eucalyptus grandis] gi|629123038|gb|KCW87528.1| hypothetical protein EUGRSUZ_B03976 [Eucalyptus grandis] Length = 546 Score = 878 bits (2268), Expect = 0.0 Identities = 420/545 (77%), Positives = 473/545 (86%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M I+GP+TPGQVSF +GI+P AWIYSE+LE+++ S S+K R SD+NLVE+G + VKE Sbjct: 1 MAIHGPVTPGQVSFLIGIIPTIAAWIYSEFLEYKRNSVSSKV-RRSDVNLVEMGNDVVKE 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 DDRAVLLEGG LQSASPR RNSS TS I RFL +DESFL+ENRLTLRAI+EF LL YF+ Sbjct: 60 DDRAVLLEGGGLQSASPRSRNSSATSPIFRFLVMDESFLVENRLTLRAIAEFSMLLAYFF 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRT+ SKKSYNRDLF FLYFLLIIVSA+TSF HN+KSP SGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFESSKKSYNRDLFLFLYFLLIIVSAMTSFTTHNEKSPISGKSILYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNFLV FCC+VLNN+Y+LYYICPMHTLFTLMVYGALGI NKYNE G IA KIIACFLVV Sbjct: 240 LNFLVLFCCVVLNNNYVLYYICPMHTLFTLMVYGALGILNKYNEVGMAIALKIIACFLVV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WEVPGVFEL+WSPFTF LGY+DPDPSK K L EWHFRSGLDRYIWI+GMIYAYYH Sbjct: 300 ILVWEVPGVFELVWSPFTFLLGYSDPDPSKPKFPLLREWHFRSGLDRYIWIVGMIYAYYH 359 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEEAE KRR+ IK V+ SLT+GYLW EY+YKL KITYNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEEAEWKRRLLIKGAVISTSLTVGYLWFEYVYKLDKITYNKYHPYTSWIPI 419 Query: 817 
TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYISLRNVTQH RS +LTLFAWLGK+TLETYISQIHIWLRSG+PDGQPK Sbjct: 420 TVYISLRNVTQHLRSCSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPNYP 479 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIY+ VS+RLF LTNTLK+AFVP+KD+KRL +N++ A V++S+LY++SFV L Sbjct: 480 MLNFMLTTSIYIVVSHRLFNLTNTLKNAFVPSKDNKRLLNNMISAAVISSVLYSLSFVFL 539 Query: 457 KVPQM 443 K P++ Sbjct: 540 KFPRV 544 >ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Glycine max] gi|734318066|gb|KHN02869.1| CAS1 domain-containing protein 1 [Glycine soja] Length = 552 Score = 878 bits (2268), Expect = 0.0 Identities = 422/547 (77%), Positives = 473/547 (86%) Frame = -1 Query: 2077 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSTKPGRNSDINLVELGEETVKE 1898 M++ P+TPGQVSF LGI+P+ VAWIYSE LE+RK S S++ R SDINLVE+G + VK+ Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSR-ARQSDINLVEMGSDVVKD 59 Query: 1897 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1718 +DRAVLLEGGALQS SP+ R+ + + I+RFL +DE FLLENRLTLRA+SEFG +L YFY Sbjct: 60 EDRAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYFY 119 Query: 1717 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1538 +CDRT+ S KSYNRDLF FLYFLLIIVSA+TSFKIH+DKSP SGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEE 179 Query: 1537 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1358 WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1357 LNFLVFFCCIVLNNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1178 LNF V FCCIVLNNSYMLYYICPMHTLFTLMVYGALGI +KYNE G+VIA KIIACFLVV Sbjct: 240 LNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLVV 299 Query: 1177 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 998 IL+WE+PGVFE +WSPFTFFLGY DP+P+K+ S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 
ILVWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYH 359 Query: 997 PTVERWMEKLEEAEPKRRMSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 818 PTVERWMEKLEEAE KRR+SIK VV+I +GYLW E+IYKL KI YNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIPI 419 Query: 817 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 638 TVYI LRNVTQ FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDFP 479 Query: 637 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVMASILYTVSFVLL 458 NFMLTTSIYVA+SYRLF+LTNTLK AFVP+KDDKR HNL+ A ++ +LY++S L Sbjct: 480 MLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGFL 539 Query: 457 KVPQMLV 437 +VPQMLV Sbjct: 540 RVPQMLV 546