BLASTX nr result
ID: Forsythia21_contig00022172
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00022172
         (2250 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011100735.1| PREDICTED: CAS1 domain-containing protein 1 ...   964   0.0
ref|XP_006338003.1| PREDICTED: CAS1 domain-containing protein 1-...   937   0.0
ref|XP_004229023.1| PREDICTED: CAS1 domain-containing protein 1 ...   936   0.0
ref|XP_009803356.1| PREDICTED: CAS1 domain-containing protein 1 ...   925   0.0
emb|CDO98736.1| unnamed protein product [Coffea canephora]            912   0.0
ref|XP_007035864.1| O-acetyltransferase family protein isoform 1...   898   0.0
ref|XP_007035865.1| O-acetyltransferase family protein isoform 2...   895   0.0
ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 ...   894   0.0
ref|XP_012829945.1| PREDICTED: CAS1 domain-containing protein 1-...   891   0.0
ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-...   890   0.0
gb|KHG01507.1| CAS1 domain-containing 1 [Gossypium arboreum]          889   0.0
ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-...   887   0.0
ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phas...   887   0.0
ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1 ...   885   0.0
ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 ...   884   0.0
ref|XP_012084122.1| PREDICTED: probable O-acetyltransferase CAS1...   881   0.0
gb|EYU43511.1| hypothetical protein MIMGU_mgv1a004398mg [Erythra...   880   0.0
ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 ...   880   0.0
ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-...
879 0.0 ref|XP_012084124.1| PREDICTED: probable O-acetyltransferase CAS1... 878 0.0 >ref|XP_011100735.1| PREDICTED: CAS1 domain-containing protein 1 [Sesamum indicum] Length = 545 Score = 964 bits (2491), Expect = 0.0 Identities = 471/547 (86%), Positives = 497/547 (90%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 MVIYGPLTPGQV+FFLGI+P+ AW+YSEYLE+RK SS SK GRNSD LVEL TVKE Sbjct: 1 MVIYGPLTPGQVAFFLGIVPVFAAWLYSEYLEYRKNSSFSKHGRNSDSKLVELAG-TVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQSASPR RNSSVTSH++RFLTLDESFLLENRLTLRAISEFGALL+YFY Sbjct: 60 DDRAVLLEGGGLQSASPRERNSSVTSHVIRFLTLDESFLLENRLTLRAISEFGALLVYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 ICDRTNLLGESKKSYNRDLF FLYFLLIIVSAITSFKIHNDKSPFSGK IMYLNRHQTEE Sbjct: 120 ICDRTNLLGESKKSYNRDLFLFLYFLLIIVSAITSFKIHNDKSPFSGKSIMYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARF+QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFMQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV FCC+VL+N+YMLYYICPMHTLFTLMVYGALGIFNKYN++GTVIA K +ACFLVV Sbjct: 240 LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNDRGTVIAAKFLACFLVV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 ILIWEVPGVF+L W PFTF LGY DP SK K LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILIWEVPGVFDLFWGPFTFLLGYTDP-ASKVKFPLLHEWHFRSGLDRYIWIIGMIYAYYH 358 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEEAE KRR SIK I++IISLTIGYLWVE+IYKLPKITYNKYHPYTSWIPI Sbjct: 359 PTVERWMEKLEEAETKRRISIKTIIIIISLTIGYLWVEFIYKLPKITYNKYHPYTSWIPI 418 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNVTQHFR YTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK Sbjct: 419 TVYICLRNVTQHFRCYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKLLLSLIPNYP 478 Query: 670 
XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIY+AVSYRLFELTNTLKS FVPTKD+KRLGHNL AVVIA ILY L+ +L+ Sbjct: 479 LLNFMLTTSIYIAVSYRLFELTNTLKSTFVPTKDNKRLGHNLAAAVVIAGILYILAVILV 538 Query: 490 KVPQMLV 470 KVPQ++V Sbjct: 539 KVPQVMV 545 >ref|XP_006338003.1| PREDICTED: CAS1 domain-containing protein 1-like [Solanum tuberosum] Length = 546 Score = 937 bits (2423), Expect = 0.0 Identities = 452/547 (82%), Positives = 492/547 (89%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+IYGPL+PGQVSFFLGI+P+C AW+YSEYLE++K S+SSK R+SDINLVELG E VKE Sbjct: 1 MLIYGPLSPGQVSFFLGIVPVCAAWLYSEYLEYKKNSASSKV-RHSDINLVELGNEAVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQS SPR+R+SSVTS I RF +DE+FLLENR TLRAISEFGALL YFY Sbjct: 60 DDRAVLLEGGGLQSTSPRIRSSSVTSQIARFFLMDETFLLENRSTLRAISEFGALLTYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 + DRTNL GESKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK IMYLNRHQTEE Sbjct: 120 LSDRTNLFGESKKSYNRDLFIFLYFLLIIVSAITSFKIHHDKSPFSGKSIMYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFTQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV F C++L+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE GTVIA K IACFLVV Sbjct: 240 LNFLVFFSCVILNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTVIAVKFIACFLVV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WEVPGVFE++WSPFTFFLGYADPDPSK K S LHEW FRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILMWEVPGVFEVVWSPFTFFLGYADPDPSKPKQSLLHEWEFRSGLDRYIWIIGMIYAYYH 359 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E KRR SIK V ++SLT+GYLW EYIYKLPK+TYNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEETEVKRRISIKAAVALMSLTMGYLWYEYIYKLPKVTYNKYHPYTSWIPI 419 Query: 850 
TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYISLRNVTQ+FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYISLRNVTQYFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPNYP 479 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLT +IYVAVS+RLFELTNTLKS F+P KDDKRLG+N+V A+V++ +LY LS V L Sbjct: 480 LMNFMLTAAIYVAVSHRLFELTNTLKSTFIPMKDDKRLGYNIVAALVVSGLLYVLSSVFL 539 Query: 490 KVPQMLV 470 +VPQMLV Sbjct: 540 RVPQMLV 546 >ref|XP_004229023.1| PREDICTED: CAS1 domain-containing protein 1 [Solanum lycopersicum] Length = 546 Score = 936 bits (2419), Expect = 0.0 Identities = 452/547 (82%), Positives = 491/547 (89%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+IYGPL+PGQVSFFLGI+PIC AW+YSEYLE++K S+SSK R+SDINLVELG+E VKE Sbjct: 1 MLIYGPLSPGQVSFFLGIVPICAAWLYSEYLEYKKNSASSKV-RHSDINLVELGDEAVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQS SPR+R+SSVTS RF +DE+FLLENRLTLRAISEFG LLIYFY Sbjct: 60 DDRAVLLEGGGLQSTSPRIRSSSVTSQFTRFFLMDETFLLENRLTLRAISEFGTLLIYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 I DRTNL GESKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK IMYLNRHQTEE Sbjct: 120 ISDRTNLFGESKKSYNRDLFIFLYFLLIIVSAITSFKIHHDKSPFSGKSIMYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFAQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV F C++L+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE GT+IA K I CFL V Sbjct: 240 LNFLVFFSCVILNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTIIAVKFIVCFLFV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WEVPGVFE++WSPFTFFLGYADPDPSK K S LHEW FRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILMWEVPGVFEVVWSPFTFFLGYADPDPSKPKQSLLHEWEFRSGLDRYIWIIGMIYAYYH 359 Query: 1030 
PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVE+WMEKLEE E KRR SIK V I+SLT+GYLW EYIYKLPK TYNKYHPYTSWIPI Sbjct: 360 PTVEKWMEKLEETEVKRRISIKAAVAIMSLTMGYLWYEYIYKLPKETYNKYHPYTSWIPI 419 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYISLRNVTQ+FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYISLRNVTQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPGYP 479 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTT+IYVAVS+RLFELTNTLKS F+P K+DKRLG+N+V A+V++ +LY LS V L Sbjct: 480 LMNFMLTTAIYVAVSHRLFELTNTLKSTFIPMKEDKRLGYNIVAALVVSGLLYVLSSVFL 539 Query: 490 KVPQMLV 470 +VPQMLV Sbjct: 540 RVPQMLV 546 >ref|XP_009803356.1| PREDICTED: CAS1 domain-containing protein 1 [Nicotiana sylvestris] Length = 546 Score = 925 bits (2391), Expect = 0.0 Identities = 447/547 (81%), Positives = 488/547 (89%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M++YGPLTP QVSFFLGI+ IC AW+YSEYLE+ K S+SSK R+SDINLVELG E VKE Sbjct: 1 MLLYGPLTPAQVSFFLGIVSICAAWLYSEYLEYEKNSASSKV-RHSDINLVELGNEAVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQSASPR+R+SSVTS I RF +DESF LENRLTLRAISEFGALL YFY Sbjct: 60 DDRAVLLEGGGLQSASPRMRSSSVTSQIARFCLMDESFFLENRLTLRAISEFGALLTYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 + DRTNL GESKKSYNRDLF FLYFLLII+SAITSF IH+DKSPFSG+ IMYLNRHQTEE Sbjct: 120 LSDRTNLFGESKKSYNRDLFLFLYFLLIIISAITSFTIHHDKSPFSGRSIMYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFTQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV F CIVL+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE GT+IAGKII CFLVV Sbjct: 240 LNFLVFFSCIVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTIIAGKIITCFLVV 299 Query: 1210 
ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WEVPGVFE++WSPFT FLGYADPDP K K S LHEW FRSGLDRYIWI+GMIYAYYH Sbjct: 300 ILMWEVPGVFEVVWSPFTCFLGYADPDPLKTKQSLLHEWQFRSGLDRYIWIVGMIYAYYH 359 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E KRR SIK +V IISL +GYLW E+IYKLPKITYNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEETEVKRRISIKAVVAIISLAMGYLWYEHIYKLPKITYNKYHPYTSWIPI 419 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYISLRNVTQ+F SY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYISLRNVTQYFCSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPNYP 479 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTT+IYVAVS+RLFELTNTLKS F+P KD+KRLG+N++ A+V++ +LY LS V L Sbjct: 480 LLNFMLTTAIYVAVSHRLFELTNTLKSTFIPMKDNKRLGYNIIAALVVSGLLYVLSSVFL 539 Query: 490 KVPQMLV 470 +VPQ+LV Sbjct: 540 RVPQLLV 546 >emb|CDO98736.1| unnamed protein product [Coffea canephora] Length = 544 Score = 912 bits (2357), Expect = 0.0 Identities = 445/547 (81%), Positives = 480/547 (87%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 +VIYGPLTPGQVSFFLGI+P+ AWIY+E LE++K S S R+SDI LVELG VKE Sbjct: 2 VVIYGPLTPGQVSFFLGIVPMFAAWIYAEILEYKKASVSKS--RHSDITLVELGNGGVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 +D AVLLEGG LQSASPRVR+SS S I+RFL +DESFLLENRLTLRAISE GALLIYFY Sbjct: 60 EDSAVLLEGGGLQSASPRVRSSSAASQILRFLMMDESFLLENRLTLRAISELGALLIYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRTN+ G+SKKSYNRDLF FLYFLLIIVSAITSFKIH DKSPFSGK IMYLNRHQTEE Sbjct: 120 VCDRTNIFGQSKKSYNRDLFLFLYFLLIIVSAITSFKIHQDKSPFSGKSIMYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFS+ARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLVL 
CCI+L N+Y LYYICPMHTLFTLMVYGALGI NKYNE GTVIA KI+ CFL V Sbjct: 240 LNFLVLLCCIILDNNYTLYYICPMHTLFTLMVYGALGILNKYNESGTVIAAKIMTCFLAV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 ILIWE+PGVFEL+WSPFTF LGY+ DPSK LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILIWEIPGVFELIWSPFTFLLGYS--DPSKPPQPRLHEWHFRSGLDRYIWIIGMIYAYYH 357 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E KRR SIK VVIISL +GYLW+EYIYKLPKITYNKYHPYTSWIPI Sbjct: 358 PTVERWMEKLEETEVKRRISIKTAVVIISLAVGYLWLEYIYKLPKITYNKYHPYTSWIPI 417 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNV+Q+FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 418 TVYICLRNVSQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPEYP 477 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIY+AVSYRLFELTN LKS FVP+KD+KRLGHN+V AVVIAS LY LSFVLL Sbjct: 478 LLNFMLTTSIYIAVSYRLFELTNMLKSTFVPSKDNKRLGHNIVAAVVIASGLYMLSFVLL 537 Query: 490 KVPQMLV 470 ++P M+V Sbjct: 538 RIPPMMV 544 >ref|XP_007035864.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao] gi|508714893|gb|EOY06790.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao] Length = 545 Score = 898 bits (2320), Expect = 0.0 Identities = 436/548 (79%), Positives = 480/548 (87%), Gaps = 1/548 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M I+GP+TPGQVSFFLGI P+ AWIY+EYLE++K S SK R+SD+NLVE+G VKE Sbjct: 1 MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESK-ARHSDVNLVEIGNGAVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVT-SHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754 DDRAVLLEGG LQSASP+ R SS + S I +FL +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 60 DDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYY 119 Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574 YICDRT++ +KK+YNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 120 YICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 179 Query: 1573 
EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 180 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 239 Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214 RLNFLV FCC++L+NSY+LYYICPMHTLFTLMVYG LGI NKYNE G+VIA KIIACFLV Sbjct: 240 RLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLV 299 Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034 VIL+WEVPGVFE+LWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 300 VILVWEVPGVFEILWSPFTFFLGYT--DPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYY 357 Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854 HPTVERWMEKLEEAE KRR IKM V I+LT+GY W EYIYKL KITYNKYHPYTSWIP Sbjct: 358 HPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIP 417 Query: 853 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 418 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 477 Query: 673 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494 NFMLTTSIYVA+SYRLF+LTN LK+AFVPTKDDKRL +NL+ AVVI+SILY+LSF L Sbjct: 478 PMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFAL 537 Query: 493 LKVPQMLV 470 L++PQMLV Sbjct: 538 LRIPQMLV 545 >ref|XP_007035865.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao] gi|508714894|gb|EOY06791.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao] Length = 544 Score = 895 bits (2313), Expect = 0.0 Identities = 435/548 (79%), Positives = 479/548 (87%), Gaps = 1/548 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M I+GP+TPGQVSFFLGI P+ AWIY+EYLE++K S SK +SD+NLVE+G VKE Sbjct: 1 MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESKA--HSDVNLVEIGNGAVKE 58 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVT-SHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754 DDRAVLLEGG LQSASP+ R SS + S I +FL +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 59 
DDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYY 118 Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574 YICDRT++ +KK+YNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 119 YICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178 Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 179 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214 RLNFLV FCC++L+NSY+LYYICPMHTLFTLMVYG LGI NKYNE G+VIA KIIACFLV Sbjct: 239 RLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLV 298 Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034 VIL+WEVPGVFE+LWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 299 VILVWEVPGVFEILWSPFTFFLGYT--DPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYY 356 Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854 HPTVERWMEKLEEAE KRR IKM V I+LT+GY W EYIYKL KITYNKYHPYTSWIP Sbjct: 357 HPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIP 416 Query: 853 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 417 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 476 Query: 673 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494 NFMLTTSIYVA+SYRLF+LTN LK+AFVPTKDDKRL +NL+ AVVI+SILY+LSF L Sbjct: 477 PMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFAL 536 Query: 493 LKVPQMLV 470 L++PQMLV Sbjct: 537 LRIPQMLV 544 >ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Nelumbo nucifera] Length = 547 Score = 894 bits (2309), Expect = 0.0 Identities = 423/546 (77%), Positives = 480/546 (87%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M I P+TPGQVSF LGI+PI VAWIYSE+LE++K S SSK GR+SDINLVELG+ETVKE Sbjct: 1 
MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKVGRHSDINLVELGKETVKE 60 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DD+ L+E G LQSASP+ R+SSVTSH+ RF +DESFL ENRL LRAISEFG ++ YFY Sbjct: 61 DDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFY 120 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 ICDRTN+ GESKK+YNRDLF FLYFLLIIVSA+TSFKIH+DKSPFSGK I+YLNRHQTEE Sbjct: 121 ICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEE 180 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF QMMWR Sbjct: 181 WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWR 240 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNF V FCCIVL+N+YMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KI ACF+VV Sbjct: 241 LNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVV 300 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WEVPGVF+++WSPFTFFLGY+DP+PSK K+ LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 301 ILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYH 360 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E +RR SIK+ V + +GYLW EYIYKL K+ YNKYHPYTSWIPI Sbjct: 361 PTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPI 420 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQP+ Sbjct: 421 TVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYP 480 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIYVA+S+R+FELTNTLKS+FVP+KD+KRL +N++ A VI+ +LY+LSFV L Sbjct: 481 MLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFL 540 Query: 490 KVPQML 473 ++P++L Sbjct: 541 QIPKIL 546 >ref|XP_012829945.1| PREDICTED: CAS1 domain-containing protein 1-like [Erythranthe guttatus] Length = 557 Score = 891 bits (2302), Expect = 0.0 Identities = 447/559 (79%), Positives = 
477/559 (85%), Gaps = 12/559 (2%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+IYGPLTPGQVSFFLG++P+ AW+YSEYLE+ K SS K GRNSDINLVEL TVKE Sbjct: 1 MLIYGPLTPGQVSFFLGLVPVFAAWLYSEYLENAKSSSLPKHGRNSDINLVELAG-TVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG L SASP +RNSSVTS + R L LDESFLLENRL LRA+SEFGALLIYFY Sbjct: 60 DDRAVLLEGGGLHSASPTLRNSSVTSQLTRLLMLDESFLLENRLILRAMSEFGALLIYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 ICDRTNLLGE+KKSYNRDLF FLYFLLIIVSA TSFKIHNDKSP SGK IMYLNRHQTEE Sbjct: 120 ICDRTNLLGEAKKSYNRDLFLFLYFLLIIVSAKTSFKIHNDKSPLSGKSIMYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATE YNAIRVFIA YVWMTGFGNFSYYYIRKDFSLARFIQMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV FCC+VL+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE+G VIA KI+ CFLVV Sbjct: 240 LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNERGIVIAAKIVVCFLVV 299 Query: 1210 ILIW----------EVPGVFE--LLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRY 1067 +L+W P FE S F GY DP SK K S LHEWHFRSGLDRY Sbjct: 300 VLLWGSARPICTTINKPFEFEKCTSLSQCEFVSGYTDP-ASKVKLSLLHEWHFRSGLDRY 358 Query: 1066 IWIIGMIYAYYHPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITY 887 IWIIGMIYAYYHPTVERW+EKLEEAE KRR IK IV I+SLTIGYLWVE++YKLPKITY Sbjct: 359 IWIIGMIYAYYHPTVERWLEKLEEAEIKRRILIKSIVGIVSLTIGYLWVEFVYKLPKITY 418 Query: 886 NKYHPYTSWIPITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQ 707 NKYHPYTSWIPITVYI LRNVTQHFR Y+LTLFAWLGKVTLETYISQIHIWLRS VP+GQ Sbjct: 419 NKYHPYTSWIPITVYICLRNVTQHFRCYSLTLFAWLGKVTLETYISQIHIWLRSSVPNGQ 478 Query: 706 PKXXXXXXXXXXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVI 527 PK NFMLT SIY+AVSYRLFELTNTLKSAFVPTKD+KRLGHNL+ AV+I Sbjct: 479 PKLLLSLIPNYPLLNFMLTASIYIAVSYRLFELTNTLKSAFVPTKDNKRLGHNLIAAVII 538 Query: 526 ASILYTLSFVLLKVPQMLV 470 ASILY L+ VL+KVPQM+V Sbjct: 
539 ASILYILAVVLIKVPQMIV 557 >ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Gossypium raimondii] gi|763805944|gb|KJB72882.1| hypothetical protein B456_011G202400 [Gossypium raimondii] Length = 545 Score = 890 bits (2300), Expect = 0.0 Identities = 428/548 (78%), Positives = 484/548 (88%), Gaps = 1/548 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+I+GP+TPGQVSFFLG+ P+ AWIY+EYL+++K S +SK R+SD++LVE+G VKE Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASK-ARHSDVSLVEIGNVAVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754 +DRAVLLEGG LQS SP+ R+S S S I++F+ +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 60 EDRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYY 119 Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574 YICDRT++ SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 120 YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 179 Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394 EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 180 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 239 Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214 RLNFLV FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV Sbjct: 240 RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 299 Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034 VIL+WEVPGVFELLWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 300 VILVWEVPGVFELLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 357 Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854 HPTVERWMEKLEE E KRR SIK+ V II+L +G+LW E+IYKL K+TYNKYHPYTSWIP Sbjct: 358 HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIP 417 Query: 853 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 418 
ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 477 Query: 673 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494 NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+ VV++SI+Y+LSFV Sbjct: 478 PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVF 537 Query: 493 LKVPQMLV 470 L++PQMLV Sbjct: 538 LRIPQMLV 545 >gb|KHG01507.1| CAS1 domain-containing 1 [Gossypium arboreum] Length = 544 Score = 889 bits (2297), Expect = 0.0 Identities = 429/548 (78%), Positives = 482/548 (87%), Gaps = 1/548 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+I+GP+TPGQVSFFLG+ P+ AWIY+EYL+++K S +SK +SD++LVE+G VKE Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKA--HSDVSLVEIGNGAVKE 58 Query: 1930 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754 +DRAVLLEGG LQS SP+ R+S S S IV+FL +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 59 EDRAVLLEGGGLQSGSPKARSSTSSVSPIVKFLMMDETFLIENRLTLRAISEFGVLLAYY 118 Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574 YICDRT++ SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 119 YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178 Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394 EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214 RLNFLV FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV Sbjct: 239 RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 298 Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034 VIL+WEVPGVFE LWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 299 VILVWEVPGVFEFLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 356 Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854 HPTVERWMEKLEE E KRR SIK+ V II+L +G+LW E+IYKL KITYNKYHPYTSWIP Sbjct: 357 
HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKITYNKYHPYTSWIP 416 Query: 853 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSG+PDGQPK Sbjct: 417 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPDY 476 Query: 673 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494 NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+ VV++SILY+LSFV Sbjct: 477 PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSILYSLSFVF 536 Query: 493 LKVPQMLV 470 L++PQMLV Sbjct: 537 LRIPQMLV 544 >ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X2 [Gossypium raimondii] gi|763805945|gb|KJB72883.1| hypothetical protein B456_011G202400 [Gossypium raimondii] Length = 544 Score = 887 bits (2293), Expect = 0.0 Identities = 427/548 (77%), Positives = 483/548 (88%), Gaps = 1/548 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+I+GP+TPGQVSFFLG+ P+ AWIY+EYL+++K S +SK +SD++LVE+G VKE Sbjct: 1 MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKA--HSDVSLVEIGNVAVKE 58 Query: 1930 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754 +DRAVLLEGG LQS SP+ R+S S S I++F+ +DE+FL+ENRLTLRAISEFG LL Y+ Sbjct: 59 EDRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYY 118 Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574 YICDRT++ SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE Sbjct: 119 YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178 Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394 EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW Sbjct: 179 EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238 Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214 RLNFLV FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV Sbjct: 239 RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 298 Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034 
VIL+WEVPGVFELLWSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYY Sbjct: 299 VILVWEVPGVFELLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 356 Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854 HPTVERWMEKLEE E KRR SIK+ V II+L +G+LW E+IYKL K+TYNKYHPYTSWIP Sbjct: 357 HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIP 416 Query: 853 ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674 ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 417 ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 476 Query: 673 XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494 NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+ VV++SI+Y+LSFV Sbjct: 477 PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVF 536 Query: 493 LKVPQMLV 470 L++PQMLV Sbjct: 537 LRIPQMLV 544 >ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris] gi|561027770|gb|ESW26410.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris] Length = 546 Score = 887 bits (2291), Expect = 0.0 Identities = 428/547 (78%), Positives = 475/547 (86%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+I P+TPGQVSF LGI P+ VAWIYSE LE+RK S SSK G +SDINLVE+G + VK+ Sbjct: 1 MLILSPVTPGQVSFLLGITPVVVAWIYSEILEYRKNSVSSKAG-HSDINLVEMGSDAVKD 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 +D+AVLLEGGALQS SPR R+ + + I+RFL +D FLLENRLTLRA+SEFG LL YFY Sbjct: 60 EDKAVLLEGGALQSGSPRARSLTASPSIIRFLLMDNYFLLENRLTLRAMSEFGLLLAYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRT+ SKKSYNRD+F FLYFLLIIVSA+TSFKIH DKSPFSGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFASSKKSYNRDIFLFLYFLLIIVSAMTSFKIHQDKSPFSGKSILYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1390 
LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNF V+FCCIVL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KIIACFLVV Sbjct: 240 LNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIAVKIIACFLVV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WE+PGVFELLWSPFTFFLGY DP+P+K+ S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILVWEIPGVFELLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYH 359 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEEAE KRR SIK +V+I +GYLW E+IYKL K+TYN YHPYTSWIPI Sbjct: 360 PTVERWMEKLEEAEIKRRISIKATIVLICSLVGYLWFEHIYKLDKLTYNTYHPYTSWIPI 419 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNVTQ FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDYP 479 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIYVA+S RLF+LTNTLK AFVP+KDDKRL HNL+ A I+ +LY+LSF L Sbjct: 480 MLNFMLTTSIYVAISCRLFDLTNTLKVAFVPSKDDKRLVHNLITATTISVVLYSLSFGFL 539 Query: 490 KVPQMLV 470 ++PQMLV Sbjct: 540 RLPQMLV 546 >ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1 [Vitis vinifera] gi|296083216|emb|CBI22852.3| unnamed protein product [Vitis vinifera] Length = 544 Score = 885 bits (2286), Expect = 0.0 Identities = 420/547 (76%), Positives = 477/547 (87%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+I GP+TPGQV+FF+G + + AWIY+E+LE++K + SK +SD+NLVEL E TVKE Sbjct: 1 MMIVGPITPGQVAFFIGFVSVFAAWIYAEFLEYKKNAFPSKT--HSDLNLVELNE-TVKE 57 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQS SP+ R+SSVTSHI RFL ++ESFL+E RLTLRA+ EFGALL YFY Sbjct: 58 DDRAVLLEGGGLQSVSPKARSSSVTSHIFRFLLMEESFLIEYRLTLRAMCEFGALLAYFY 117 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRTNL G+SKKSYNRDLF FLYFLLIIVSA+TSFK+H+DKS FSGK I+YLNRHQTEE Sbjct: 118 LCDRTNLFGDSKKSYNRDLFIFLYFLLIIVSAVTSFKVHHDKSSFSGKSILYLNRHQTEE 177 
Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 178 WKGWMQVLFLMYHYFAATEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 237 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLVLFCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE G VIA KI+ACFLVV Sbjct: 238 LNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGLVIAVKIVACFLVV 297 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 +L+WE+PGVFE +WSP TF LGY DPDPSK K S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 298 VLLWEIPGVFEFVWSPLTFILGYTDPDPSKQKFSRLHEWHFRSGLDRYIWIIGMIYAYYH 357 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E K R +IKM + ++LT+GYLW E+IYKL K+TYNKYHPYTSWIPI Sbjct: 358 PTVERWMEKLEETEVKLRVAIKMAIATVALTVGYLWFEHIYKLDKLTYNKYHPYTSWIPI 417 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 +VYI LRN+TQ FR Y+LTLFAWLGK+TLETYISQIHIWLRSG+PD QPK Sbjct: 418 SVYICLRNLTQQFRCYSLTLFAWLGKITLETYISQIHIWLRSGLPDAQPKLLLSLIPSYP 477 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIY+A+SYRLFELTNTLK AFVP+KD+K L HN++ + I+ ILYTLSF+ L Sbjct: 478 LLNFMLTTSIYIAISYRLFELTNTLKVAFVPSKDNKLLMHNIIAGIAISGILYTLSFIFL 537 Query: 490 KVPQMLV 470 +VPQ+LV Sbjct: 538 QVPQLLV 544 >ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 isoform X2 [Nelumbo nucifera] Length = 545 Score = 884 bits (2285), Expect = 0.0 Identities = 421/546 (77%), Positives = 478/546 (87%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M I P+TPGQVSF LGI+PI VAWIYSE+LE++K S SSK +SDINLVELG+ETVKE Sbjct: 1 MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKV--HSDINLVELGKETVKE 58 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DD+ L+E G LQSASP+ R+SSVTSH+ RF +DESFL ENRL LRAISEFG ++ YFY Sbjct: 59 DDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFY 118 Query: 1750 
ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 ICDRTN+ GESKK+YNRDLF FLYFLLIIVSA+TSFKIH+DKSPFSGK I+YLNRHQTEE Sbjct: 119 ICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEE 178 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF QMMWR Sbjct: 179 WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWR 238 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNF V FCCIVL+N+YMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KI ACF+VV Sbjct: 239 LNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVV 298 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WEVPGVF+++WSPFTFFLGY+DP+PSK K+ LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 299 ILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYH 358 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E +RR SIK+ V + +GYLW EYIYKL K+ YNKYHPYTSWIPI Sbjct: 359 PTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPI 418 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQP+ Sbjct: 419 TVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYP 478 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIYVA+S+R+FELTNTLKS+FVP+KD+KRL +N++ A VI+ +LY+LSFV L Sbjct: 479 MLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFL 538 Query: 490 KVPQML 473 ++P++L Sbjct: 539 QIPKIL 544 >ref|XP_012084122.1| PREDICTED: probable O-acetyltransferase CAS1 isoform X1 [Jatropha curcas] Length = 546 Score = 881 bits (2276), Expect = 0.0 Identities = 427/549 (77%), Positives = 475/549 (86%), Gaps = 2/549 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 MVI P+TPGQVSF LG++PI AWIYSE LE++K ++++K R+SDI L E+G + VKE Sbjct: 1 MVISSPITPGQVSFLLGVVPIIAAWIYSEILEYKKNAAAAK-ARHSDIGLAEMGNDAVKE 59 Query: 1930 
DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQSASPR + S +S I RFL ++E FL+ENRLTLRAISEFGALL YFY Sbjct: 60 DDRAVLLEGGGLQSASPRAKTSPASSPIFRFLLMEEQFLIENRLTLRAISEFGALLGYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRT+ S KS+NRDLFWFLY LLIIVSAITSFKIH+D+SPFSGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFNSSTKSFNRDLFWFLYSLLIIVSAITSFKIHHDRSPFSGKPILYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV+FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLVV Sbjct: 240 LNFLVIFCCVVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEKGSVIAVKIIACFLVV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 ILIWE+PGVFEL+WSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILIWEIPGVFELVWSPFTFFLGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 357 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E KRR SIK V ISL GYLW E+IYK+ K+TYNKYHPYTSWIPI Sbjct: 358 PTVERWMEKLEETEVKRRISIKTAVASISLLTGYLWFEHIYKMDKVTYNKYHPYTSWIPI 417 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK--XXXXXXXX 677 TVYISLRNV+Q+FRSYTLTLFAWLGK+TLETYISQ HIWLRS +PD QPK Sbjct: 418 TVYISLRNVSQYFRSYTLTLFAWLGKITLETYISQFHIWLRSSIPDAQPKLLLSIIPQSE 477 Query: 676 XXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFV 497 NFMLTTSIYVAVSYRLF+LTNTLK AFVP+KDDKRL HN++ A VIASILY++SF+ Sbjct: 478 FPLLNFMLTTSIYVAVSYRLFDLTNTLKIAFVPSKDDKRLAHNMITAAVIASILYSVSFI 537 Query: 496 LLKVPQMLV 470 LK+PQ+LV Sbjct: 538 FLKIPQILV 546 >gb|EYU43511.1| hypothetical protein MIMGU_mgv1a004398mg [Erythranthe guttata] Length = 529 Score = 880 bits (2274), Expect = 0.0 Identities = 440/547 (80%), Positives = 471/547 (86%) Frame = -3 Query: 2110 
MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M+IYGPLTPGQVSFFLG++P+ AW+YSEYLE+ K SSS P +NSDINLVEL TVKE Sbjct: 1 MLIYGPLTPGQVSFFLGLVPVFAAWLYSEYLENAK--SSSLPKQNSDINLVELAG-TVKE 57 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG L SASP +RNSSVTS + R L LDESFLLENRL LRA+SEFGALLIYFY Sbjct: 58 DDRAVLLEGGGLHSASPTLRNSSVTSQLTRLLMLDESFLLENRLILRAMSEFGALLIYFY 117 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 ICDRTNLLGE+KKSYNRDLF FLYFLLIIVSA TSFKIHNDKSP SGK IMYLNRHQTEE Sbjct: 118 ICDRTNLLGEAKKSYNRDLFLFLYFLLIIVSAKTSFKIHNDKSPLSGKSIMYLNRHQTEE 177 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATE YNAIRVFIA YVWMTGFGNFSYYYIRKDFSLARFIQMMWR Sbjct: 178 WKGWMQVLFLMYHYFAATEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 237 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV FCC+VL+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE+G VIA KI+ CFLVV Sbjct: 238 LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNERGIVIAAKIVVCFLVV 297 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 +L+W Y DP SK K S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 298 VLLWG----------------RYTDP-ASKVKLSLLHEWHFRSGLDRYIWIIGMIYAYYH 340 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERW+EKLEEAE KRR IK IV I+SLTIGYLWVE++YKLPKITYNKYHPYTSWIPI Sbjct: 341 PTVERWLEKLEEAEIKRRILIKSIVGIVSLTIGYLWVEFVYKLPKITYNKYHPYTSWIPI 400 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNVTQHFR Y+LTLFAWLGKVTLETYISQIHIWLRS VP+GQPK Sbjct: 401 TVYICLRNVTQHFRCYSLTLFAWLGKVTLETYISQIHIWLRSSVPNGQPKLLLSLIPNYP 460 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLT SIY+AVSYRLFELTNTLKSAFVPTKD+KRLGHNL+ AV+IASILY L+ VL+ Sbjct: 461 LLNFMLTASIYIAVSYRLFELTNTLKSAFVPTKDNKRLGHNLIAAVIIASILYILAVVLI 520 Query: 490 KVPQMLV 470 KVPQM+V Sbjct: 521 KVPQMIV 527 >ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 
isoform X1 [Eucalyptus grandis] gi|629123038|gb|KCW87528.1| hypothetical protein EUGRSUZ_B03976 [Eucalyptus grandis] Length = 546 Score = 880 bits (2273), Expect = 0.0 Identities = 423/545 (77%), Positives = 473/545 (86%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M I+GP+TPGQVSF +GI+P AWIYSE+LE+++ S SSK R SD+NLVE+G + VKE Sbjct: 1 MAIHGPVTPGQVSFLIGIIPTIAAWIYSEFLEYKRNSVSSKV-RRSDVNLVEMGNDVVKE 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQSASPR RNSS TS I RFL +DESFL+ENRLTLRAI+EF LL YF+ Sbjct: 60 DDRAVLLEGGGLQSASPRSRNSSATSPIFRFLVMDESFLVENRLTLRAIAEFSMLLAYFF 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRT+ SKKSYNRDLF FLYFLLIIVSA+TSF HN+KSP SGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFESSKKSYNRDLFLFLYFLLIIVSAMTSFTTHNEKSPISGKSILYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLVLFCC+VL+N+Y+LYYICPMHTLFTLMVYGALGI NKYNE G IA KIIACFLVV Sbjct: 240 LNFLVLFCCVVLNNNYVLYYICPMHTLFTLMVYGALGILNKYNEVGMAIALKIIACFLVV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WEVPGVFEL+WSPFTF LGY+DPDPSK K L EWHFRSGLDRYIWI+GMIYAYYH Sbjct: 300 ILVWEVPGVFELVWSPFTFLLGYSDPDPSKPKFPLLREWHFRSGLDRYIWIVGMIYAYYH 359 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEEAE KRR IK V+ SLT+GYLW EY+YKL KITYNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEEAEWKRRLLIKGAVISTSLTVGYLWFEYVYKLDKITYNKYHPYTSWIPI 419 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYISLRNVTQH RS +LTLFAWLGK+TLETYISQIHIWLRSG+PDGQPK Sbjct: 420 TVYISLRNVTQHLRSCSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPNYP 479 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 
NFMLTTSIY+ VS+RLF LTNTLK+AFVP+KD+KRL +N++ A VI+S+LY+LSFV L Sbjct: 480 MLNFMLTTSIYIVVSHRLFNLTNTLKNAFVPSKDNKRLLNNMISAAVISSVLYSLSFVFL 539 Query: 490 KVPQM 476 K P++ Sbjct: 540 KFPRV 544 >ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Glycine max] gi|734318066|gb|KHN02869.1| CAS1 domain-containing protein 1 [Glycine soja] Length = 552 Score = 879 bits (2272), Expect = 0.0 Identities = 424/547 (77%), Positives = 473/547 (86%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 M++ P+TPGQVSF LGI+P+ VAWIYSE LE+RK S SS+ R SDINLVE+G + VK+ Sbjct: 1 MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSR-ARQSDINLVEMGSDVVKD 59 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 +DRAVLLEGGALQS SP+ R+ + + I+RFL +DE FLLENRLTLRA+SEFG +L YFY Sbjct: 60 EDRAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYFY 119 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRT+ S KSYNRDLF FLYFLLIIVSA+TSFKIH+DKSP SGK I+YLNRHQTEE Sbjct: 120 LCDRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEE 179 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 180 WKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNF V+FCCIVL+NSYMLYYICPMHTLFTLMVYGALGI +KYNE G+VIA KIIACFLVV Sbjct: 240 LNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLVV 299 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 IL+WE+PGVFE +WSPFTFFLGY DP+P+K+ S LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 300 ILVWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYH 359 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEEAE KRR SIK VV+I +GYLW E+IYKL KI YNKYHPYTSWIPI Sbjct: 360 PTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIPI 419 Query: 850 
TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671 TVYI LRNVTQ FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK Sbjct: 420 TVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDFP 479 Query: 670 XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491 NFMLTTSIYVA+SYRLF+LTNTLK AFVP+KDDKR HNL+ A I+ +LY+LS L Sbjct: 480 MLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGFL 539 Query: 490 KVPQMLV 470 +VPQMLV Sbjct: 540 RVPQMLV 546 >ref|XP_012084124.1| PREDICTED: probable O-acetyltransferase CAS1 isoform X2 [Jatropha curcas] gi|643716182|gb|KDP27955.1| hypothetical protein JCGZ_19035 [Jatropha curcas] Length = 545 Score = 878 bits (2269), Expect = 0.0 Identities = 426/549 (77%), Positives = 474/549 (86%), Gaps = 2/549 (0%) Frame = -3 Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931 MVI P+TPGQVSF LG++PI AWIYSE LE++K ++++K +SDI L E+G + VKE Sbjct: 1 MVISSPITPGQVSFLLGVVPIIAAWIYSEILEYKKNAAAAKA--HSDIGLAEMGNDAVKE 58 Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751 DDRAVLLEGG LQSASPR + S +S I RFL ++E FL+ENRLTLRAISEFGALL YFY Sbjct: 59 DDRAVLLEGGGLQSASPRAKTSPASSPIFRFLLMEEQFLIENRLTLRAISEFGALLGYFY 118 Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571 +CDRT+ S KS+NRDLFWFLY LLIIVSAITSFKIH+D+SPFSGK I+YLNRHQTEE Sbjct: 119 LCDRTDFFNSSTKSFNRDLFWFLYSLLIIVSAITSFKIHHDRSPFSGKPILYLNRHQTEE 178 Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR Sbjct: 179 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 238 Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211 LNFLV+FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLVV Sbjct: 239 LNFLVIFCCVVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEKGSVIAVKIIACFLVV 298 Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031 ILIWE+PGVFEL+WSPFTFFLGY DP+K LHEWHFRSGLDRYIWIIGMIYAYYH Sbjct: 299 
ILIWEIPGVFELVWSPFTFFLGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 356 Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851 PTVERWMEKLEE E KRR SIK V ISL GYLW E+IYK+ K+TYNKYHPYTSWIPI Sbjct: 357 PTVERWMEKLEETEVKRRISIKTAVASISLLTGYLWFEHIYKMDKVTYNKYHPYTSWIPI 416 Query: 850 TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK--XXXXXXXX 677 TVYISLRNV+Q+FRSYTLTLFAWLGK+TLETYISQ HIWLRS +PD QPK Sbjct: 417 TVYISLRNVSQYFRSYTLTLFAWLGKITLETYISQFHIWLRSSIPDAQPKLLLSIIPQSE 476 Query: 676 XXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFV 497 NFMLTTSIYVAVSYRLF+LTNTLK AFVP+KDDKRL HN++ A VIASILY++SF+ Sbjct: 477 FPLLNFMLTTSIYVAVSYRLFDLTNTLKIAFVPSKDDKRLAHNMITAAVIASILYSVSFI 536 Query: 496 LLKVPQMLV 470 LK+PQ+LV Sbjct: 537 FLKIPQILV 545