BLASTX nr result

ID: Forsythia21_contig00022172 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00022172
         (2250 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011100735.1| PREDICTED: CAS1 domain-containing protein 1 ...   964   0.0  
ref|XP_006338003.1| PREDICTED: CAS1 domain-containing protein 1-...   937   0.0  
ref|XP_004229023.1| PREDICTED: CAS1 domain-containing protein 1 ...   936   0.0  
ref|XP_009803356.1| PREDICTED: CAS1 domain-containing protein 1 ...   925   0.0  
emb|CDO98736.1| unnamed protein product [Coffea canephora]            912   0.0  
ref|XP_007035864.1| O-acetyltransferase family protein isoform 1...   898   0.0  
ref|XP_007035865.1| O-acetyltransferase family protein isoform 2...   895   0.0  
ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 ...   894   0.0  
ref|XP_012829945.1| PREDICTED: CAS1 domain-containing protein 1-...   891   0.0  
ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-...   890   0.0  
gb|KHG01507.1| CAS1 domain-containing 1 [Gossypium arboreum]          889   0.0  
ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-...   887   0.0  
ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phas...   887   0.0  
ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1 ...   885   0.0  
ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 ...   884   0.0  
ref|XP_012084122.1| PREDICTED: probable O-acetyltransferase CAS1...   881   0.0  
gb|EYU43511.1| hypothetical protein MIMGU_mgv1a004398mg [Erythra...   880   0.0  
ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 ...   880   0.0  
ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-...   879   0.0  
ref|XP_012084124.1| PREDICTED: probable O-acetyltransferase CAS1...   878   0.0  

>ref|XP_011100735.1| PREDICTED: CAS1 domain-containing protein 1 [Sesamum indicum]
          Length = 545

 Score =  964 bits (2491), Expect = 0.0
 Identities = 471/547 (86%), Positives = 497/547 (90%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            MVIYGPLTPGQV+FFLGI+P+  AW+YSEYLE+RK SS SK GRNSD  LVEL   TVKE
Sbjct: 1    MVIYGPLTPGQVAFFLGIVPVFAAWLYSEYLEYRKNSSFSKHGRNSDSKLVELAG-TVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQSASPR RNSSVTSH++RFLTLDESFLLENRLTLRAISEFGALL+YFY
Sbjct: 60   DDRAVLLEGGGLQSASPRERNSSVTSHVIRFLTLDESFLLENRLTLRAISEFGALLVYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            ICDRTNLLGESKKSYNRDLF FLYFLLIIVSAITSFKIHNDKSPFSGK IMYLNRHQTEE
Sbjct: 120  ICDRTNLLGESKKSYNRDLFLFLYFLLIIVSAITSFKIHNDKSPFSGKSIMYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARF+QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFMQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV FCC+VL+N+YMLYYICPMHTLFTLMVYGALGIFNKYN++GTVIA K +ACFLVV
Sbjct: 240  LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNDRGTVIAAKFLACFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            ILIWEVPGVF+L W PFTF LGY DP  SK K   LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILIWEVPGVFDLFWGPFTFLLGYTDP-ASKVKFPLLHEWHFRSGLDRYIWIIGMIYAYYH 358

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEEAE KRR SIK I++IISLTIGYLWVE+IYKLPKITYNKYHPYTSWIPI
Sbjct: 359  PTVERWMEKLEEAETKRRISIKTIIIIISLTIGYLWVEFIYKLPKITYNKYHPYTSWIPI 418

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNVTQHFR YTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK          
Sbjct: 419  TVYICLRNVTQHFRCYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKLLLSLIPNYP 478

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIY+AVSYRLFELTNTLKS FVPTKD+KRLGHNL  AVVIA ILY L+ +L+
Sbjct: 479  LLNFMLTTSIYIAVSYRLFELTNTLKSTFVPTKDNKRLGHNLAAAVVIAGILYILAVILV 538

Query: 490  KVPQMLV 470
            KVPQ++V
Sbjct: 539  KVPQVMV 545


>ref|XP_006338003.1| PREDICTED: CAS1 domain-containing protein 1-like [Solanum tuberosum]
          Length = 546

 Score =  937 bits (2423), Expect = 0.0
 Identities = 452/547 (82%), Positives = 492/547 (89%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+IYGPL+PGQVSFFLGI+P+C AW+YSEYLE++K S+SSK  R+SDINLVELG E VKE
Sbjct: 1    MLIYGPLSPGQVSFFLGIVPVCAAWLYSEYLEYKKNSASSKV-RHSDINLVELGNEAVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQS SPR+R+SSVTS I RF  +DE+FLLENR TLRAISEFGALL YFY
Sbjct: 60   DDRAVLLEGGGLQSTSPRIRSSSVTSQIARFFLMDETFLLENRSTLRAISEFGALLTYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            + DRTNL GESKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK IMYLNRHQTEE
Sbjct: 120  LSDRTNLFGESKKSYNRDLFIFLYFLLIIVSAITSFKIHHDKSPFSGKSIMYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFTQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV F C++L+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE GTVIA K IACFLVV
Sbjct: 240  LNFLVFFSCVILNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTVIAVKFIACFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WEVPGVFE++WSPFTFFLGYADPDPSK K S LHEW FRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILMWEVPGVFEVVWSPFTFFLGYADPDPSKPKQSLLHEWEFRSGLDRYIWIIGMIYAYYH 359

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E KRR SIK  V ++SLT+GYLW EYIYKLPK+TYNKYHPYTSWIPI
Sbjct: 360  PTVERWMEKLEETEVKRRISIKAAVALMSLTMGYLWYEYIYKLPKVTYNKYHPYTSWIPI 419

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYISLRNVTQ+FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK          
Sbjct: 420  TVYISLRNVTQYFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPNYP 479

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLT +IYVAVS+RLFELTNTLKS F+P KDDKRLG+N+V A+V++ +LY LS V L
Sbjct: 480  LMNFMLTAAIYVAVSHRLFELTNTLKSTFIPMKDDKRLGYNIVAALVVSGLLYVLSSVFL 539

Query: 490  KVPQMLV 470
            +VPQMLV
Sbjct: 540  RVPQMLV 546


>ref|XP_004229023.1| PREDICTED: CAS1 domain-containing protein 1 [Solanum lycopersicum]
          Length = 546

 Score =  936 bits (2419), Expect = 0.0
 Identities = 452/547 (82%), Positives = 491/547 (89%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+IYGPL+PGQVSFFLGI+PIC AW+YSEYLE++K S+SSK  R+SDINLVELG+E VKE
Sbjct: 1    MLIYGPLSPGQVSFFLGIVPICAAWLYSEYLEYKKNSASSKV-RHSDINLVELGDEAVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQS SPR+R+SSVTS   RF  +DE+FLLENRLTLRAISEFG LLIYFY
Sbjct: 60   DDRAVLLEGGGLQSTSPRIRSSSVTSQFTRFFLMDETFLLENRLTLRAISEFGTLLIYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            I DRTNL GESKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK IMYLNRHQTEE
Sbjct: 120  ISDRTNLFGESKKSYNRDLFIFLYFLLIIVSAITSFKIHHDKSPFSGKSIMYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFAQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV F C++L+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE GT+IA K I CFL V
Sbjct: 240  LNFLVFFSCVILNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTIIAVKFIVCFLFV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WEVPGVFE++WSPFTFFLGYADPDPSK K S LHEW FRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILMWEVPGVFEVVWSPFTFFLGYADPDPSKPKQSLLHEWEFRSGLDRYIWIIGMIYAYYH 359

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVE+WMEKLEE E KRR SIK  V I+SLT+GYLW EYIYKLPK TYNKYHPYTSWIPI
Sbjct: 360  PTVEKWMEKLEETEVKRRISIKAAVAIMSLTMGYLWYEYIYKLPKETYNKYHPYTSWIPI 419

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYISLRNVTQ+FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK          
Sbjct: 420  TVYISLRNVTQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPGYP 479

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTT+IYVAVS+RLFELTNTLKS F+P K+DKRLG+N+V A+V++ +LY LS V L
Sbjct: 480  LMNFMLTTAIYVAVSHRLFELTNTLKSTFIPMKEDKRLGYNIVAALVVSGLLYVLSSVFL 539

Query: 490  KVPQMLV 470
            +VPQMLV
Sbjct: 540  RVPQMLV 546


>ref|XP_009803356.1| PREDICTED: CAS1 domain-containing protein 1 [Nicotiana sylvestris]
          Length = 546

 Score =  925 bits (2391), Expect = 0.0
 Identities = 447/547 (81%), Positives = 488/547 (89%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M++YGPLTP QVSFFLGI+ IC AW+YSEYLE+ K S+SSK  R+SDINLVELG E VKE
Sbjct: 1    MLLYGPLTPAQVSFFLGIVSICAAWLYSEYLEYEKNSASSKV-RHSDINLVELGNEAVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQSASPR+R+SSVTS I RF  +DESF LENRLTLRAISEFGALL YFY
Sbjct: 60   DDRAVLLEGGGLQSASPRMRSSSVTSQIARFCLMDESFFLENRLTLRAISEFGALLTYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            + DRTNL GESKKSYNRDLF FLYFLLII+SAITSF IH+DKSPFSG+ IMYLNRHQTEE
Sbjct: 120  LSDRTNLFGESKKSYNRDLFLFLYFLLIIISAITSFTIHHDKSPFSGRSIMYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFS+ARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSIARFTQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV F CIVL+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE GT+IAGKII CFLVV
Sbjct: 240  LNFLVFFSCIVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNENGTIIAGKIITCFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WEVPGVFE++WSPFT FLGYADPDP K K S LHEW FRSGLDRYIWI+GMIYAYYH
Sbjct: 300  ILMWEVPGVFEVVWSPFTCFLGYADPDPLKTKQSLLHEWQFRSGLDRYIWIVGMIYAYYH 359

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E KRR SIK +V IISL +GYLW E+IYKLPKITYNKYHPYTSWIPI
Sbjct: 360  PTVERWMEKLEETEVKRRISIKAVVAIISLAMGYLWYEHIYKLPKITYNKYHPYTSWIPI 419

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYISLRNVTQ+F SY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK          
Sbjct: 420  TVYISLRNVTQYFCSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKKLLCLIPNYP 479

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTT+IYVAVS+RLFELTNTLKS F+P KD+KRLG+N++ A+V++ +LY LS V L
Sbjct: 480  LLNFMLTTAIYVAVSHRLFELTNTLKSTFIPMKDNKRLGYNIIAALVVSGLLYVLSSVFL 539

Query: 490  KVPQMLV 470
            +VPQ+LV
Sbjct: 540  RVPQLLV 546


>emb|CDO98736.1| unnamed protein product [Coffea canephora]
          Length = 544

 Score =  912 bits (2357), Expect = 0.0
 Identities = 445/547 (81%), Positives = 480/547 (87%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            +VIYGPLTPGQVSFFLGI+P+  AWIY+E LE++K S S    R+SDI LVELG   VKE
Sbjct: 2    VVIYGPLTPGQVSFFLGIVPMFAAWIYAEILEYKKASVSKS--RHSDITLVELGNGGVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            +D AVLLEGG LQSASPRVR+SS  S I+RFL +DESFLLENRLTLRAISE GALLIYFY
Sbjct: 60   EDSAVLLEGGGLQSASPRVRSSSAASQILRFLMMDESFLLENRLTLRAISELGALLIYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRTN+ G+SKKSYNRDLF FLYFLLIIVSAITSFKIH DKSPFSGK IMYLNRHQTEE
Sbjct: 120  VCDRTNIFGQSKKSYNRDLFLFLYFLLIIVSAITSFKIHQDKSPFSGKSIMYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFS+ARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSIARFAQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLVL CCI+L N+Y LYYICPMHTLFTLMVYGALGI NKYNE GTVIA KI+ CFL V
Sbjct: 240  LNFLVLLCCIILDNNYTLYYICPMHTLFTLMVYGALGILNKYNESGTVIAAKIMTCFLAV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            ILIWE+PGVFEL+WSPFTF LGY+  DPSK     LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILIWEIPGVFELIWSPFTFLLGYS--DPSKPPQPRLHEWHFRSGLDRYIWIIGMIYAYYH 357

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E KRR SIK  VVIISL +GYLW+EYIYKLPKITYNKYHPYTSWIPI
Sbjct: 358  PTVERWMEKLEETEVKRRISIKTAVVIISLAVGYLWLEYIYKLPKITYNKYHPYTSWIPI 417

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNV+Q+FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK          
Sbjct: 418  TVYICLRNVSQYFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPEYP 477

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIY+AVSYRLFELTN LKS FVP+KD+KRLGHN+V AVVIAS LY LSFVLL
Sbjct: 478  LLNFMLTTSIYIAVSYRLFELTNMLKSTFVPSKDNKRLGHNIVAAVVIASGLYMLSFVLL 537

Query: 490  KVPQMLV 470
            ++P M+V
Sbjct: 538  RIPPMMV 544


>ref|XP_007035864.1| O-acetyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508714893|gb|EOY06790.1| O-acetyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 545

 Score =  898 bits (2320), Expect = 0.0
 Identities = 436/548 (79%), Positives = 480/548 (87%), Gaps = 1/548 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M I+GP+TPGQVSFFLGI P+  AWIY+EYLE++K S  SK  R+SD+NLVE+G   VKE
Sbjct: 1    MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESK-ARHSDVNLVEIGNGAVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVT-SHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754
            DDRAVLLEGG LQSASP+ R SS + S I +FL +DE+FL+ENRLTLRAISEFG LL Y+
Sbjct: 60   DDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYY 119

Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574
            YICDRT++   +KK+YNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE
Sbjct: 120  YICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 179

Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394
            EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW
Sbjct: 180  EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 239

Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214
            RLNFLV FCC++L+NSY+LYYICPMHTLFTLMVYG LGI NKYNE G+VIA KIIACFLV
Sbjct: 240  RLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLV 299

Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034
            VIL+WEVPGVFE+LWSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYY
Sbjct: 300  VILVWEVPGVFEILWSPFTFFLGYT--DPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYY 357

Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854
            HPTVERWMEKLEEAE KRR  IKM V  I+LT+GY W EYIYKL KITYNKYHPYTSWIP
Sbjct: 358  HPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIP 417

Query: 853  ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674
            ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK         
Sbjct: 418  ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 477

Query: 673  XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494
               NFMLTTSIYVA+SYRLF+LTN LK+AFVPTKDDKRL +NL+ AVVI+SILY+LSF L
Sbjct: 478  PMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFAL 537

Query: 493  LKVPQMLV 470
            L++PQMLV
Sbjct: 538  LRIPQMLV 545


>ref|XP_007035865.1| O-acetyltransferase family protein isoform 2 [Theobroma cacao]
            gi|508714894|gb|EOY06791.1| O-acetyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 544

 Score =  895 bits (2313), Expect = 0.0
 Identities = 435/548 (79%), Positives = 479/548 (87%), Gaps = 1/548 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M I+GP+TPGQVSFFLGI P+  AWIY+EYLE++K S  SK   +SD+NLVE+G   VKE
Sbjct: 1    MAIFGPITPGQVSFFLGIFPVISAWIYAEYLEYKKNSLESKA--HSDVNLVEIGNGAVKE 58

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVT-SHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754
            DDRAVLLEGG LQSASP+ R SS + S I +FL +DE+FL+ENRLTLRAISEFG LL Y+
Sbjct: 59   DDRAVLLEGGGLQSASPKARTSSSSLSPIFKFLMMDETFLVENRLTLRAISEFGGLLAYY 118

Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574
            YICDRT++   +KK+YNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE
Sbjct: 119  YICDRTDVFDSAKKNYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178

Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394
            EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW
Sbjct: 179  EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238

Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214
            RLNFLV FCC++L+NSY+LYYICPMHTLFTLMVYG LGI NKYNE G+VIA KIIACFLV
Sbjct: 239  RLNFLVFFCCVILNNSYVLYYICPMHTLFTLMVYGTLGILNKYNENGSVIAAKIIACFLV 298

Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034
            VIL+WEVPGVFE+LWSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYY
Sbjct: 299  VILVWEVPGVFEILWSPFTFFLGYT--DPAKPNFPRLHEWHFRSGLDRYIWIIGMIYAYY 356

Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854
            HPTVERWMEKLEEAE KRR  IKM V  I+LT+GY W EYIYKL KITYNKYHPYTSWIP
Sbjct: 357  HPTVERWMEKLEEAEVKRRVLIKMAVATIALTMGYFWFEYIYKLDKITYNKYHPYTSWIP 416

Query: 853  ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674
            ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK         
Sbjct: 417  ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 476

Query: 673  XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494
               NFMLTTSIYVA+SYRLF+LTN LK+AFVPTKDDKRL +NL+ AVVI+SILY+LSF L
Sbjct: 477  PMLNFMLTTSIYVAISYRLFDLTNILKTAFVPTKDDKRLINNLITAVVISSILYSLSFAL 536

Query: 493  LKVPQMLV 470
            L++PQMLV
Sbjct: 537  LRIPQMLV 544


>ref|XP_010270835.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Nelumbo
            nucifera]
          Length = 547

 Score =  894 bits (2309), Expect = 0.0
 Identities = 423/546 (77%), Positives = 480/546 (87%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M I  P+TPGQVSF LGI+PI VAWIYSE+LE++K S SSK GR+SDINLVELG+ETVKE
Sbjct: 1    MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKVGRHSDINLVELGKETVKE 60

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DD+  L+E G LQSASP+ R+SSVTSH+ RF  +DESFL ENRL LRAISEFG ++ YFY
Sbjct: 61   DDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFY 120

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            ICDRTN+ GESKK+YNRDLF FLYFLLIIVSA+TSFKIH+DKSPFSGK I+YLNRHQTEE
Sbjct: 121  ICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEE 180

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF QMMWR
Sbjct: 181  WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWR 240

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNF V FCCIVL+N+YMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KI ACF+VV
Sbjct: 241  LNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVV 300

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WEVPGVF+++WSPFTFFLGY+DP+PSK K+  LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 301  ILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYH 360

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E +RR SIK+ V  +   +GYLW EYIYKL K+ YNKYHPYTSWIPI
Sbjct: 361  PTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPI 420

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQP+          
Sbjct: 421  TVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYP 480

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIYVA+S+R+FELTNTLKS+FVP+KD+KRL +N++ A VI+ +LY+LSFV L
Sbjct: 481  MLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFL 540

Query: 490  KVPQML 473
            ++P++L
Sbjct: 541  QIPKIL 546


>ref|XP_012829945.1| PREDICTED: CAS1 domain-containing protein 1-like [Erythranthe
            guttatus]
          Length = 557

 Score =  891 bits (2302), Expect = 0.0
 Identities = 447/559 (79%), Positives = 477/559 (85%), Gaps = 12/559 (2%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+IYGPLTPGQVSFFLG++P+  AW+YSEYLE+ K SS  K GRNSDINLVEL   TVKE
Sbjct: 1    MLIYGPLTPGQVSFFLGLVPVFAAWLYSEYLENAKSSSLPKHGRNSDINLVELAG-TVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG L SASP +RNSSVTS + R L LDESFLLENRL LRA+SEFGALLIYFY
Sbjct: 60   DDRAVLLEGGGLHSASPTLRNSSVTSQLTRLLMLDESFLLENRLILRAMSEFGALLIYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            ICDRTNLLGE+KKSYNRDLF FLYFLLIIVSA TSFKIHNDKSP SGK IMYLNRHQTEE
Sbjct: 120  ICDRTNLLGEAKKSYNRDLFLFLYFLLIIVSAKTSFKIHNDKSPLSGKSIMYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATE YNAIRVFIA YVWMTGFGNFSYYYIRKDFSLARFIQMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAATEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV FCC+VL+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE+G VIA KI+ CFLVV
Sbjct: 240  LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNERGIVIAAKIVVCFLVV 299

Query: 1210 ILIW----------EVPGVFE--LLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRY 1067
            +L+W            P  FE     S   F  GY DP  SK K S LHEWHFRSGLDRY
Sbjct: 300  VLLWGSARPICTTINKPFEFEKCTSLSQCEFVSGYTDP-ASKVKLSLLHEWHFRSGLDRY 358

Query: 1066 IWIIGMIYAYYHPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITY 887
            IWIIGMIYAYYHPTVERW+EKLEEAE KRR  IK IV I+SLTIGYLWVE++YKLPKITY
Sbjct: 359  IWIIGMIYAYYHPTVERWLEKLEEAEIKRRILIKSIVGIVSLTIGYLWVEFVYKLPKITY 418

Query: 886  NKYHPYTSWIPITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQ 707
            NKYHPYTSWIPITVYI LRNVTQHFR Y+LTLFAWLGKVTLETYISQIHIWLRS VP+GQ
Sbjct: 419  NKYHPYTSWIPITVYICLRNVTQHFRCYSLTLFAWLGKVTLETYISQIHIWLRSSVPNGQ 478

Query: 706  PKXXXXXXXXXXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVI 527
            PK            NFMLT SIY+AVSYRLFELTNTLKSAFVPTKD+KRLGHNL+ AV+I
Sbjct: 479  PKLLLSLIPNYPLLNFMLTASIYIAVSYRLFELTNTLKSAFVPTKDNKRLGHNLIAAVII 538

Query: 526  ASILYTLSFVLLKVPQMLV 470
            ASILY L+ VL+KVPQM+V
Sbjct: 539  ASILYILAVVLIKVPQMIV 557


>ref|XP_012455818.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1
            [Gossypium raimondii] gi|763805944|gb|KJB72882.1|
            hypothetical protein B456_011G202400 [Gossypium
            raimondii]
          Length = 545

 Score =  890 bits (2300), Expect = 0.0
 Identities = 428/548 (78%), Positives = 484/548 (88%), Gaps = 1/548 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+I+GP+TPGQVSFFLG+ P+  AWIY+EYL+++K S +SK  R+SD++LVE+G   VKE
Sbjct: 1    MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASK-ARHSDVSLVEIGNVAVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754
            +DRAVLLEGG LQS SP+ R+S S  S I++F+ +DE+FL+ENRLTLRAISEFG LL Y+
Sbjct: 60   EDRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYY 119

Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574
            YICDRT++   SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE
Sbjct: 120  YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 179

Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394
            EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW
Sbjct: 180  EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 239

Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214
            RLNFLV FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV
Sbjct: 240  RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 299

Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034
            VIL+WEVPGVFELLWSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYY
Sbjct: 300  VILVWEVPGVFELLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 357

Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854
            HPTVERWMEKLEE E KRR SIK+ V II+L +G+LW E+IYKL K+TYNKYHPYTSWIP
Sbjct: 358  HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIP 417

Query: 853  ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674
            ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK         
Sbjct: 418  ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 477

Query: 673  XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494
               NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+  VV++SI+Y+LSFV 
Sbjct: 478  PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVF 537

Query: 493  LKVPQMLV 470
            L++PQMLV
Sbjct: 538  LRIPQMLV 545


>gb|KHG01507.1| CAS1 domain-containing 1 [Gossypium arboreum]
          Length = 544

 Score =  889 bits (2297), Expect = 0.0
 Identities = 429/548 (78%), Positives = 482/548 (87%), Gaps = 1/548 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+I+GP+TPGQVSFFLG+ P+  AWIY+EYL+++K S +SK   +SD++LVE+G   VKE
Sbjct: 1    MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKA--HSDVSLVEIGNGAVKE 58

Query: 1930 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754
            +DRAVLLEGG LQS SP+ R+S S  S IV+FL +DE+FL+ENRLTLRAISEFG LL Y+
Sbjct: 59   EDRAVLLEGGGLQSGSPKARSSTSSVSPIVKFLMMDETFLIENRLTLRAISEFGVLLAYY 118

Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574
            YICDRT++   SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE
Sbjct: 119  YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178

Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394
            EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW
Sbjct: 179  EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238

Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214
            RLNFLV FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV
Sbjct: 239  RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 298

Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034
            VIL+WEVPGVFE LWSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYY
Sbjct: 299  VILVWEVPGVFEFLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 356

Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854
            HPTVERWMEKLEE E KRR SIK+ V II+L +G+LW E+IYKL KITYNKYHPYTSWIP
Sbjct: 357  HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKITYNKYHPYTSWIP 416

Query: 853  ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674
            ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSG+PDGQPK         
Sbjct: 417  ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPDY 476

Query: 673  XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494
               NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+  VV++SILY+LSFV 
Sbjct: 477  PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSILYSLSFVF 536

Query: 493  LKVPQMLV 470
            L++PQMLV
Sbjct: 537  LRIPQMLV 544


>ref|XP_012455820.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X2
            [Gossypium raimondii] gi|763805945|gb|KJB72883.1|
            hypothetical protein B456_011G202400 [Gossypium
            raimondii]
          Length = 544

 Score =  887 bits (2293), Expect = 0.0
 Identities = 427/548 (77%), Positives = 483/548 (88%), Gaps = 1/548 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+I+GP+TPGQVSFFLG+ P+  AWIY+EYL+++K S +SK   +SD++LVE+G   VKE
Sbjct: 1    MMIFGPITPGQVSFFLGVFPVISAWIYAEYLQYKKNSLASKA--HSDVSLVEIGNVAVKE 58

Query: 1930 DDRAVLLEGGALQSASPRVRNS-SVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYF 1754
            +DRAVLLEGG LQS SP+ R+S S  S I++F+ +DE+FL+ENRLTLRAISEFG LL Y+
Sbjct: 59   EDRAVLLEGGGLQSGSPKARSSTSSVSPILKFIMMDETFLIENRLTLRAISEFGVLLAYY 118

Query: 1753 YICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTE 1574
            YICDRT++   SKKSYNRDLF FLYFLLIIVSAITSFKIH+DKSPFSGK I+YLNRHQTE
Sbjct: 119  YICDRTDVFASSKKSYNRDLFLFLYFLLIIVSAITSFKIHHDKSPFSGKSILYLNRHQTE 178

Query: 1573 EWKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMW 1394
            EWKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMW
Sbjct: 179  EWKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMW 238

Query: 1393 RLNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLV 1214
            RLNFLV FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLV
Sbjct: 239  RLNFLVFFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEKGSVIALKIIACFLV 298

Query: 1213 VILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYY 1034
            VIL+WEVPGVFELLWSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYY
Sbjct: 299  VILVWEVPGVFELLWSPFTFFLGYT--DPAKPNLPLLHEWHFRSGLDRYIWIIGMIYAYY 356

Query: 1033 HPTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIP 854
            HPTVERWMEKLEE E KRR SIK+ V II+L +G+LW E+IYKL K+TYNKYHPYTSWIP
Sbjct: 357  HPTVERWMEKLEETEVKRRVSIKIAVAIIALMVGFLWFEHIYKLDKVTYNKYHPYTSWIP 416

Query: 853  ITVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXX 674
            ITVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK         
Sbjct: 417  ITVYICLRNVTQSFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDY 476

Query: 673  XXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVL 494
               NFMLTTSIY+A+SYRLF+LTN LKSAFVPTKD+KRL HNL+  VV++SI+Y+LSFV 
Sbjct: 477  PMLNFMLTTSIYLAISYRLFDLTNILKSAFVPTKDNKRLLHNLITGVVVSSIVYSLSFVF 536

Query: 493  LKVPQMLV 470
            L++PQMLV
Sbjct: 537  LRIPQMLV 544


>ref|XP_007154416.1| hypothetical protein PHAVU_003G117800g [Phaseolus vulgaris]
            gi|561027770|gb|ESW26410.1| hypothetical protein
            PHAVU_003G117800g [Phaseolus vulgaris]
          Length = 546

 Score =  887 bits (2291), Expect = 0.0
 Identities = 428/547 (78%), Positives = 475/547 (86%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+I  P+TPGQVSF LGI P+ VAWIYSE LE+RK S SSK G +SDINLVE+G + VK+
Sbjct: 1    MLILSPVTPGQVSFLLGITPVVVAWIYSEILEYRKNSVSSKAG-HSDINLVEMGSDAVKD 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            +D+AVLLEGGALQS SPR R+ + +  I+RFL +D  FLLENRLTLRA+SEFG LL YFY
Sbjct: 60   EDKAVLLEGGALQSGSPRARSLTASPSIIRFLLMDNYFLLENRLTLRAMSEFGLLLAYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRT+    SKKSYNRD+F FLYFLLIIVSA+TSFKIH DKSPFSGK I+YLNRHQTEE
Sbjct: 120  LCDRTDFFASSKKSYNRDIFLFLYFLLIIVSAMTSFKIHQDKSPFSGKSILYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNF V+FCCIVL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KIIACFLVV
Sbjct: 240  LNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIAVKIIACFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WE+PGVFELLWSPFTFFLGY DP+P+K+  S LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILVWEIPGVFELLWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYH 359

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEEAE KRR SIK  +V+I   +GYLW E+IYKL K+TYN YHPYTSWIPI
Sbjct: 360  PTVERWMEKLEEAEIKRRISIKATIVLICSLVGYLWFEHIYKLDKLTYNTYHPYTSWIPI 419

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNVTQ FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK          
Sbjct: 420  TVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDYP 479

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIYVA+S RLF+LTNTLK AFVP+KDDKRL HNL+ A  I+ +LY+LSF  L
Sbjct: 480  MLNFMLTTSIYVAISCRLFDLTNTLKVAFVPSKDDKRLVHNLITATTISVVLYSLSFGFL 539

Query: 490  KVPQMLV 470
            ++PQMLV
Sbjct: 540  RLPQMLV 546


>ref|XP_003631938.1| PREDICTED: CAS1 domain-containing protein 1 [Vitis vinifera]
            gi|296083216|emb|CBI22852.3| unnamed protein product
            [Vitis vinifera]
          Length = 544

 Score =  885 bits (2286), Expect = 0.0
 Identities = 420/547 (76%), Positives = 477/547 (87%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+I GP+TPGQV+FF+G + +  AWIY+E+LE++K +  SK   +SD+NLVEL E TVKE
Sbjct: 1    MMIVGPITPGQVAFFIGFVSVFAAWIYAEFLEYKKNAFPSKT--HSDLNLVELNE-TVKE 57

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQS SP+ R+SSVTSHI RFL ++ESFL+E RLTLRA+ EFGALL YFY
Sbjct: 58   DDRAVLLEGGGLQSVSPKARSSSVTSHIFRFLLMEESFLIEYRLTLRAMCEFGALLAYFY 117

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRTNL G+SKKSYNRDLF FLYFLLIIVSA+TSFK+H+DKS FSGK I+YLNRHQTEE
Sbjct: 118  LCDRTNLFGDSKKSYNRDLFIFLYFLLIIVSAVTSFKVHHDKSSFSGKSILYLNRHQTEE 177

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR
Sbjct: 178  WKGWMQVLFLMYHYFAATEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 237

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLVLFCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE G VIA KI+ACFLVV
Sbjct: 238  LNFLVLFCCVVLNNSYMLYYICPMHTLFTLMVYGALGILNKYNEIGLVIAVKIVACFLVV 297

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            +L+WE+PGVFE +WSP TF LGY DPDPSK K S LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 298  VLLWEIPGVFEFVWSPLTFILGYTDPDPSKQKFSRLHEWHFRSGLDRYIWIIGMIYAYYH 357

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E K R +IKM +  ++LT+GYLW E+IYKL K+TYNKYHPYTSWIPI
Sbjct: 358  PTVERWMEKLEETEVKLRVAIKMAIATVALTVGYLWFEHIYKLDKLTYNKYHPYTSWIPI 417

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            +VYI LRN+TQ FR Y+LTLFAWLGK+TLETYISQIHIWLRSG+PD QPK          
Sbjct: 418  SVYICLRNLTQQFRCYSLTLFAWLGKITLETYISQIHIWLRSGLPDAQPKLLLSLIPSYP 477

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIY+A+SYRLFELTNTLK AFVP+KD+K L HN++  + I+ ILYTLSF+ L
Sbjct: 478  LLNFMLTTSIYIAISYRLFELTNTLKVAFVPSKDNKLLMHNIIAGIAISGILYTLSFIFL 537

Query: 490  KVPQMLV 470
            +VPQ+LV
Sbjct: 538  QVPQLLV 544


>ref|XP_010270836.1| PREDICTED: CAS1 domain-containing protein 1 isoform X2 [Nelumbo
            nucifera]
          Length = 545

 Score =  884 bits (2285), Expect = 0.0
 Identities = 421/546 (77%), Positives = 478/546 (87%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M I  P+TPGQVSF LGI+PI VAWIYSE+LE++K S SSK   +SDINLVELG+ETVKE
Sbjct: 1    MSITSPVTPGQVSFLLGIIPIIVAWIYSEFLEYKKTSISSKV--HSDINLVELGKETVKE 58

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DD+  L+E G LQSASP+ R+SSVTSH+ RF  +DESFL ENRL LRAISEFG ++ YFY
Sbjct: 59   DDKTALIESGNLQSASPKARSSSVTSHLARFFLMDESFLTENRLLLRAISEFGLIVFYFY 118

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            ICDRTN+ GESKK+YNRDLF FLYFLLIIVSA+TSFKIH+DKSPFSGK I+YLNRHQTEE
Sbjct: 119  ICDRTNVFGESKKTYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPFSGKSILYLNRHQTEE 178

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAA EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSL RF QMMWR
Sbjct: 179  WKGWMQVLFLMYHYFAAAEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLTRFAQMMWR 238

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNF V FCCIVL+N+YMLYYICPMHTLFTLMVYGALGI NKYNE G+VIA KI ACF+VV
Sbjct: 239  LNFFVAFCCIVLNNNYMLYYICPMHTLFTLMVYGALGILNKYNEIGSVIALKIAACFMVV 298

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WEVPGVF+++WSPFTFFLGY+DP+PSK K+  LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 299  ILVWEVPGVFDVVWSPFTFFLGYSDPNPSKPKYPLLHEWHFRSGLDRYIWIIGMIYAYYH 358

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E +RR SIK+ V  +   +GYLW EYIYKL K+ YNKYHPYTSWIPI
Sbjct: 359  PTVERWMEKLEETETRRRISIKIAVATVCSVMGYLWFEYIYKLDKVAYNKYHPYTSWIPI 418

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNVTQ FRSY+LTLFAWLGK+TLETYISQIHIWLRSGVPDGQP+          
Sbjct: 419  TVYICLRNVTQQFRSYSLTLFAWLGKITLETYISQIHIWLRSGVPDGQPRWLLSLIPDYP 478

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIYVA+S+R+FELTNTLKS+FVP+KD+KRL +N++ A VI+ +LY+LSFV L
Sbjct: 479  MLNFMLTTSIYVAISHRIFELTNTLKSSFVPSKDNKRLMYNMIAAAVISIMLYSLSFVFL 538

Query: 490  KVPQML 473
            ++P++L
Sbjct: 539  QIPKIL 544


>ref|XP_012084122.1| PREDICTED: probable O-acetyltransferase CAS1 isoform X1 [Jatropha
            curcas]
          Length = 546

 Score =  881 bits (2276), Expect = 0.0
 Identities = 427/549 (77%), Positives = 475/549 (86%), Gaps = 2/549 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            MVI  P+TPGQVSF LG++PI  AWIYSE LE++K ++++K  R+SDI L E+G + VKE
Sbjct: 1    MVISSPITPGQVSFLLGVVPIIAAWIYSEILEYKKNAAAAK-ARHSDIGLAEMGNDAVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQSASPR + S  +S I RFL ++E FL+ENRLTLRAISEFGALL YFY
Sbjct: 60   DDRAVLLEGGGLQSASPRAKTSPASSPIFRFLLMEEQFLIENRLTLRAISEFGALLGYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRT+    S KS+NRDLFWFLY LLIIVSAITSFKIH+D+SPFSGK I+YLNRHQTEE
Sbjct: 120  LCDRTDFFNSSTKSFNRDLFWFLYSLLIIVSAITSFKIHHDRSPFSGKPILYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV+FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLVV
Sbjct: 240  LNFLVIFCCVVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEKGSVIAVKIIACFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            ILIWE+PGVFEL+WSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILIWEIPGVFELVWSPFTFFLGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 357

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E KRR SIK  V  ISL  GYLW E+IYK+ K+TYNKYHPYTSWIPI
Sbjct: 358  PTVERWMEKLEETEVKRRISIKTAVASISLLTGYLWFEHIYKMDKVTYNKYHPYTSWIPI 417

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK--XXXXXXXX 677
            TVYISLRNV+Q+FRSYTLTLFAWLGK+TLETYISQ HIWLRS +PD QPK          
Sbjct: 418  TVYISLRNVSQYFRSYTLTLFAWLGKITLETYISQFHIWLRSSIPDAQPKLLLSIIPQSE 477

Query: 676  XXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFV 497
                NFMLTTSIYVAVSYRLF+LTNTLK AFVP+KDDKRL HN++ A VIASILY++SF+
Sbjct: 478  FPLLNFMLTTSIYVAVSYRLFDLTNTLKIAFVPSKDDKRLAHNMITAAVIASILYSVSFI 537

Query: 496  LLKVPQMLV 470
             LK+PQ+LV
Sbjct: 538  FLKIPQILV 546


>gb|EYU43511.1| hypothetical protein MIMGU_mgv1a004398mg [Erythranthe guttata]
          Length = 529

 Score =  880 bits (2274), Expect = 0.0
 Identities = 440/547 (80%), Positives = 471/547 (86%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M+IYGPLTPGQVSFFLG++P+  AW+YSEYLE+ K  SSS P +NSDINLVEL   TVKE
Sbjct: 1    MLIYGPLTPGQVSFFLGLVPVFAAWLYSEYLENAK--SSSLPKQNSDINLVELAG-TVKE 57

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG L SASP +RNSSVTS + R L LDESFLLENRL LRA+SEFGALLIYFY
Sbjct: 58   DDRAVLLEGGGLHSASPTLRNSSVTSQLTRLLMLDESFLLENRLILRAMSEFGALLIYFY 117

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            ICDRTNLLGE+KKSYNRDLF FLYFLLIIVSA TSFKIHNDKSP SGK IMYLNRHQTEE
Sbjct: 118  ICDRTNLLGEAKKSYNRDLFLFLYFLLIIVSAKTSFKIHNDKSPLSGKSIMYLNRHQTEE 177

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATE YNAIRVFIA YVWMTGFGNFSYYYIRKDFSLARFIQMMWR
Sbjct: 178  WKGWMQVLFLMYHYFAATEFYNAIRVFIAGYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 237

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV FCC+VL+N+YMLYYICPMHTLFTLMVYGALGIFNKYNE+G VIA KI+ CFLVV
Sbjct: 238  LNFLVFFCCVVLNNNYMLYYICPMHTLFTLMVYGALGIFNKYNERGIVIAAKIVVCFLVV 297

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            +L+W                  Y DP  SK K S LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 298  VLLWG----------------RYTDP-ASKVKLSLLHEWHFRSGLDRYIWIIGMIYAYYH 340

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERW+EKLEEAE KRR  IK IV I+SLTIGYLWVE++YKLPKITYNKYHPYTSWIPI
Sbjct: 341  PTVERWLEKLEEAEIKRRILIKSIVGIVSLTIGYLWVEFVYKLPKITYNKYHPYTSWIPI 400

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNVTQHFR Y+LTLFAWLGKVTLETYISQIHIWLRS VP+GQPK          
Sbjct: 401  TVYICLRNVTQHFRCYSLTLFAWLGKVTLETYISQIHIWLRSSVPNGQPKLLLSLIPNYP 460

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLT SIY+AVSYRLFELTNTLKSAFVPTKD+KRLGHNL+ AV+IASILY L+ VL+
Sbjct: 461  LLNFMLTASIYIAVSYRLFELTNTLKSAFVPTKDNKRLGHNLIAAVIIASILYILAVVLI 520

Query: 490  KVPQMLV 470
            KVPQM+V
Sbjct: 521  KVPQMIV 527


>ref|XP_010045355.1| PREDICTED: CAS1 domain-containing protein 1 isoform X1 [Eucalyptus
            grandis] gi|629123038|gb|KCW87528.1| hypothetical protein
            EUGRSUZ_B03976 [Eucalyptus grandis]
          Length = 546

 Score =  880 bits (2273), Expect = 0.0
 Identities = 423/545 (77%), Positives = 473/545 (86%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M I+GP+TPGQVSF +GI+P   AWIYSE+LE+++ S SSK  R SD+NLVE+G + VKE
Sbjct: 1    MAIHGPVTPGQVSFLIGIIPTIAAWIYSEFLEYKRNSVSSKV-RRSDVNLVEMGNDVVKE 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQSASPR RNSS TS I RFL +DESFL+ENRLTLRAI+EF  LL YF+
Sbjct: 60   DDRAVLLEGGGLQSASPRSRNSSATSPIFRFLVMDESFLVENRLTLRAIAEFSMLLAYFF 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRT+    SKKSYNRDLF FLYFLLIIVSA+TSF  HN+KSP SGK I+YLNRHQTEE
Sbjct: 120  LCDRTDFFESSKKSYNRDLFLFLYFLLIIVSAMTSFTTHNEKSPISGKSILYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAASEIYNAIRIFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLVLFCC+VL+N+Y+LYYICPMHTLFTLMVYGALGI NKYNE G  IA KIIACFLVV
Sbjct: 240  LNFLVLFCCVVLNNNYVLYYICPMHTLFTLMVYGALGILNKYNEVGMAIALKIIACFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WEVPGVFEL+WSPFTF LGY+DPDPSK K   L EWHFRSGLDRYIWI+GMIYAYYH
Sbjct: 300  ILVWEVPGVFELVWSPFTFLLGYSDPDPSKPKFPLLREWHFRSGLDRYIWIVGMIYAYYH 359

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEEAE KRR  IK  V+  SLT+GYLW EY+YKL KITYNKYHPYTSWIPI
Sbjct: 360  PTVERWMEKLEEAEWKRRLLIKGAVISTSLTVGYLWFEYVYKLDKITYNKYHPYTSWIPI 419

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYISLRNVTQH RS +LTLFAWLGK+TLETYISQIHIWLRSG+PDGQPK          
Sbjct: 420  TVYISLRNVTQHLRSCSLTLFAWLGKITLETYISQIHIWLRSGIPDGQPKLLLSLIPNYP 479

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIY+ VS+RLF LTNTLK+AFVP+KD+KRL +N++ A VI+S+LY+LSFV L
Sbjct: 480  MLNFMLTTSIYIVVSHRLFNLTNTLKNAFVPSKDNKRLLNNMISAAVISSVLYSLSFVFL 539

Query: 490  KVPQM 476
            K P++
Sbjct: 540  KFPRV 544


>ref|XP_006583993.1| PREDICTED: CAS1 domain-containing protein 1-like isoform X1 [Glycine
            max] gi|734318066|gb|KHN02869.1| CAS1 domain-containing
            protein 1 [Glycine soja]
          Length = 552

 Score =  879 bits (2272), Expect = 0.0
 Identities = 424/547 (77%), Positives = 473/547 (86%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            M++  P+TPGQVSF LGI+P+ VAWIYSE LE+RK S SS+  R SDINLVE+G + VK+
Sbjct: 1    MLLLSPVTPGQVSFLLGIIPVVVAWIYSEILEYRKNSVSSR-ARQSDINLVEMGSDVVKD 59

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            +DRAVLLEGGALQS SP+ R+ + +  I+RFL +DE FLLENRLTLRA+SEFG +L YFY
Sbjct: 60   EDRAVLLEGGALQSGSPKARSLTGSPSIIRFLLMDECFLLENRLTLRAMSEFGLILAYFY 119

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRT+    S KSYNRDLF FLYFLLIIVSA+TSFKIH+DKSP SGK I+YLNRHQTEE
Sbjct: 120  LCDRTDFFASSNKSYNRDLFLFLYFLLIIVSAMTSFKIHHDKSPLSGKSILYLNRHQTEE 179

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAA+EIYNAIR+FIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR
Sbjct: 180  WKGWMQVLFLMYHYFAASEIYNAIRLFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 239

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNF V+FCCIVL+NSYMLYYICPMHTLFTLMVYGALGI +KYNE G+VIA KIIACFLVV
Sbjct: 240  LNFFVVFCCIVLNNSYMLYYICPMHTLFTLMVYGALGILHKYNEIGSVIAVKIIACFLVV 299

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            IL+WE+PGVFE +WSPFTFFLGY DP+P+K+  S LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 300  ILVWEIPGVFEWVWSPFTFFLGYTDPNPAKSHLSRLHEWHFRSGLDRYIWIIGMIYAYYH 359

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEEAE KRR SIK  VV+I   +GYLW E+IYKL KI YNKYHPYTSWIPI
Sbjct: 360  PTVERWMEKLEEAEIKRRISIKATVVLICSLVGYLWFEHIYKLDKIAYNKYHPYTSWIPI 419

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPKXXXXXXXXXX 671
            TVYI LRNVTQ FRSYTLTLFAWLGK+TLETYISQIHIWLRSGVPDGQPK          
Sbjct: 420  TVYICLRNVTQSFRSYTLTLFAWLGKITLETYISQIHIWLRSGVPDGQPKLLLSLIPDFP 479

Query: 670  XXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFVLL 491
              NFMLTTSIYVA+SYRLF+LTNTLK AFVP+KDDKR  HNL+ A  I+ +LY+LS   L
Sbjct: 480  MLNFMLTTSIYVAISYRLFDLTNTLKMAFVPSKDDKRFIHNLITATTISVVLYSLSLGFL 539

Query: 490  KVPQMLV 470
            +VPQMLV
Sbjct: 540  RVPQMLV 546


>ref|XP_012084124.1| PREDICTED: probable O-acetyltransferase CAS1 isoform X2 [Jatropha
            curcas] gi|643716182|gb|KDP27955.1| hypothetical protein
            JCGZ_19035 [Jatropha curcas]
          Length = 545

 Score =  878 bits (2269), Expect = 0.0
 Identities = 426/549 (77%), Positives = 474/549 (86%), Gaps = 2/549 (0%)
 Frame = -3

Query: 2110 MVIYGPLTPGQVSFFLGIMPICVAWIYSEYLEHRKKSSSSKPGRNSDINLVELGEETVKE 1931
            MVI  P+TPGQVSF LG++PI  AWIYSE LE++K ++++K   +SDI L E+G + VKE
Sbjct: 1    MVISSPITPGQVSFLLGVVPIIAAWIYSEILEYKKNAAAAKA--HSDIGLAEMGNDAVKE 58

Query: 1930 DDRAVLLEGGALQSASPRVRNSSVTSHIVRFLTLDESFLLENRLTLRAISEFGALLIYFY 1751
            DDRAVLLEGG LQSASPR + S  +S I RFL ++E FL+ENRLTLRAISEFGALL YFY
Sbjct: 59   DDRAVLLEGGGLQSASPRAKTSPASSPIFRFLLMEEQFLIENRLTLRAISEFGALLGYFY 118

Query: 1750 ICDRTNLLGESKKSYNRDLFWFLYFLLIIVSAITSFKIHNDKSPFSGKMIMYLNRHQTEE 1571
            +CDRT+    S KS+NRDLFWFLY LLIIVSAITSFKIH+D+SPFSGK I+YLNRHQTEE
Sbjct: 119  LCDRTDFFNSSTKSFNRDLFWFLYSLLIIVSAITSFKIHHDRSPFSGKPILYLNRHQTEE 178

Query: 1570 WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYIRKDFSLARFIQMMWR 1391
            WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYY+RKDFSLARF QMMWR
Sbjct: 179  WKGWMQVLFLMYHYFAATEIYNAIRVFIAAYVWMTGFGNFSYYYVRKDFSLARFAQMMWR 238

Query: 1390 LNFLVLFCCIVLSNSYMLYYICPMHTLFTLMVYGALGIFNKYNEQGTVIAGKIIACFLVV 1211
            LNFLV+FCC+VL+NSYMLYYICPMHTLFTLMVYGALGI NKYNE+G+VIA KIIACFLVV
Sbjct: 239  LNFLVIFCCVVLNNSYMLYYICPMHTLFTLMVYGALGIMNKYNEKGSVIAVKIIACFLVV 298

Query: 1210 ILIWEVPGVFELLWSPFTFFLGYADPDPSKAKHSPLHEWHFRSGLDRYIWIIGMIYAYYH 1031
            ILIWE+PGVFEL+WSPFTFFLGY   DP+K     LHEWHFRSGLDRYIWIIGMIYAYYH
Sbjct: 299  ILIWEIPGVFELVWSPFTFFLGYT--DPAKPDLPRLHEWHFRSGLDRYIWIIGMIYAYYH 356

Query: 1030 PTVERWMEKLEEAEPKRRTSIKMIVVIISLTIGYLWVEYIYKLPKITYNKYHPYTSWIPI 851
            PTVERWMEKLEE E KRR SIK  V  ISL  GYLW E+IYK+ K+TYNKYHPYTSWIPI
Sbjct: 357  PTVERWMEKLEETEVKRRISIKTAVASISLLTGYLWFEHIYKMDKVTYNKYHPYTSWIPI 416

Query: 850  TVYISLRNVTQHFRSYTLTLFAWLGKVTLETYISQIHIWLRSGVPDGQPK--XXXXXXXX 677
            TVYISLRNV+Q+FRSYTLTLFAWLGK+TLETYISQ HIWLRS +PD QPK          
Sbjct: 417  TVYISLRNVSQYFRSYTLTLFAWLGKITLETYISQFHIWLRSSIPDAQPKLLLSIIPQSE 476

Query: 676  XXXXNFMLTTSIYVAVSYRLFELTNTLKSAFVPTKDDKRLGHNLVVAVVIASILYTLSFV 497
                NFMLTTSIYVAVSYRLF+LTNTLK AFVP+KDDKRL HN++ A VIASILY++SF+
Sbjct: 477  FPLLNFMLTTSIYVAVSYRLFDLTNTLKIAFVPSKDDKRLAHNMITAAVIASILYSVSFI 536

Query: 496  LLKVPQMLV 470
             LK+PQ+LV
Sbjct: 537  FLKIPQILV 545

