BLASTX nr result

ID: Catharanthus23_contig00001610 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00001610
         (1821 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006355958.1| PREDICTED: uncharacterized protein LOC102582...   312   3e-82
ref|XP_004238694.1| PREDICTED: uncharacterized protein LOC101259...   301   6e-79
ref|XP_006438457.1| hypothetical protein CICLE_v10032202mg [Citr...   293   2e-76
ref|XP_006438456.1| hypothetical protein CICLE_v10032202mg [Citr...   289   2e-75
ref|XP_002312509.1| GCN5-related N-acetyltransferase family prot...   287   9e-75
ref|XP_004135226.1| PREDICTED: uncharacterized protein LOC101210...   286   2e-74
ref|XP_002893318.1| hypothetical protein ARALYDRAFT_889946 [Arab...   286   2e-74
ref|XP_002266260.2| PREDICTED: uncharacterized protein LOC100246...   286   2e-74
gb|EOY00439.1| Acyl-CoA N-acyltransferases (NAT) superfamily pro...   286   3e-74
gb|EOY00438.1| Acyl-CoA N-acyltransferases superfamily protein i...   286   3e-74
gb|EOY00437.1| Acyl-CoA N-acyltransferases (NAT) superfamily pro...   286   3e-74
ref|XP_004297657.1| PREDICTED: uncharacterized protein LOC101290...   283   2e-73
gb|EXB82640.1| hypothetical protein L484_027821 [Morus notabilis]     281   9e-73
ref|XP_002531402.1| N-acetyltransferase, putative [Ricinus commu...   280   1e-72
ref|NP_173815.2| acyl-CoA N-acyltransferases-like protein [Arabi...   280   2e-72
ref|XP_006305379.1| hypothetical protein CARUB_v10009770mg [Caps...   275   4e-71
ref|XP_006416034.1| hypothetical protein EUTSA_v10008192mg [Eutr...   274   8e-71
ref|XP_002314731.1| hypothetical protein POPTR_0010s10580g [Popu...   268   8e-69
gb|ABK96063.1| unknown [Populus trichocarpa]                          268   8e-69
ref|XP_003613302.1| hypothetical protein MTR_5g035060 [Medicago ...   259   4e-66

>ref|XP_006355958.1| PREDICTED: uncharacterized protein LOC102582103 isoform X1 [Solanum
            tuberosum] gi|565379056|ref|XP_006355959.1| PREDICTED:
            uncharacterized protein LOC102582103 isoform X2 [Solanum
            tuberosum]
          Length = 311

 Score =  312 bits (800), Expect = 3e-82
 Identities = 173/302 (57%), Positives = 208/302 (68%), Gaps = 3/302 (0%)
 Frame = -1

Query: 1548 STAFLSCHSLDPQLH-NPHNNFSYNPQLITQQAPIFNQKRINHFVHIPR-YXXXXXXXXX 1375
            +TA     SLDPQ H N HN+F +N    + +A IF  K    FV   + +         
Sbjct: 2    ATAISLSFSLDPQSHHNYHNHFHHNTSFNSHKASIFTPKYTFPFVISSKSHTSNLIIPLS 61

Query: 1374 XXXXXXXXXXXXXXXXXXXSYQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHE 1195
                                 Q GRFLTN E+EKL+ L  +RYFQEL+SG L++R+M+ E
Sbjct: 62   TSQSSSSSSSTSPSSVLQTPLQTGRFLTNQELEKLESLGKYRYFQELESGSLWVRVMREE 121

Query: 1194 EIDMTVSLLAESFAESMLMPSGYTRLLEFLVKQYLIERRALMPHSATLLGFFKED-EEED 1018
            E+D+TV LLAESFAESMLMP GY + L +LVKQY+IERRALMPH+ATLLGF++E+ E+ D
Sbjct: 122  EMDVTVWLLAESFAESMLMPKGYVKFLAYLVKQYMIERRALMPHTATLLGFYRENGEDAD 181

Query: 1017 LQLAGTVEVSFNKRGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQ 838
            LQLAGTVEV F+KRG              PYICNMTV + LRRRGIGWHLLKASEELISQ
Sbjct: 182  LQLAGTVEVCFDKRGANANPPTPTPPKNSPYICNMTVDKLLRRRGIGWHLLKASEELISQ 241

Query: 837  MTSSRDVYLHCRMIDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYE 658
            M+SSR+VYLHCRMIDTAPL+MY KAGY IV+TDNI +LLTLQRRKHLM K +P S S +E
Sbjct: 242  MSSSREVYLHCRMIDTAPLNMYRKAGYTIVETDNIFILLTLQRRKHLMCKVLPDSESLFE 301

Query: 657  MD 652
            +D
Sbjct: 302  VD 303


>ref|XP_004238694.1| PREDICTED: uncharacterized protein LOC101259841 [Solanum
            lycopersicum]
          Length = 311

 Score =  301 bits (771), Expect = 6e-79
 Identities = 165/294 (56%), Positives = 203/294 (69%), Gaps = 3/294 (1%)
 Frame = -1

Query: 1524 SLDPQL-HNPHNNFSYNPQLITQQAPIFNQKRINHFVHIPR-YXXXXXXXXXXXXXXXXX 1351
            SLDPQ  HN H+ F +N    + +A IF  K    FV   + +                 
Sbjct: 10   SLDPQTQHNYHHRFHHNTSFNSHKASIFTPKYTFPFVISSKSHTSNLIIPLSTSQSSSSS 69

Query: 1350 XXXXXXXXXXXSYQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSL 1171
                         Q GRFLTN E+EKL+ L  +RYFQEL+SG L++R+M+ EE+D+TV L
Sbjct: 70   SSTSPSSVFQTPLQTGRFLTNQELEKLESLGKYRYFQELESGSLWVRVMREEEMDVTVWL 129

Query: 1170 LAESFAESMLMPSGYTRLLEFLVKQYLIERRALMPHSATLLGFFKED-EEEDLQLAGTVE 994
            LAESF++SMLMP GY + + +LVKQY+IERRALMP++ATLLGF++E+ E+ DLQLAGTVE
Sbjct: 130  LAESFSDSMLMPKGYVKFMAYLVKQYMIERRALMPYTATLLGFYRENGEDADLQLAGTVE 189

Query: 993  VSFNKRGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVY 814
            V F+KRG              PYICNMTV + LRRRGIGWHLLKASEELISQM+SSR+VY
Sbjct: 190  VCFDKRGANANSPTPTPPKNSPYICNMTVDKLLRRRGIGWHLLKASEELISQMSSSREVY 249

Query: 813  LHCRMIDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMD 652
            LHCRMIDTAPL+MY KAGY IV+TDNI +LL LQRRKHLM+K +P S S  E+D
Sbjct: 250  LHCRMIDTAPLNMYRKAGYTIVETDNIFILLALQRRKHLMWKVLPDSESLSEVD 303


>ref|XP_006438457.1| hypothetical protein CICLE_v10032202mg [Citrus clementina]
            gi|567891875|ref|XP_006438458.1| hypothetical protein
            CICLE_v10032202mg [Citrus clementina]
            gi|568860581|ref|XP_006483795.1| PREDICTED:
            uncharacterized protein LOC102624043 [Citrus sinensis]
            gi|557540653|gb|ESR51697.1| hypothetical protein
            CICLE_v10032202mg [Citrus clementina]
            gi|557540654|gb|ESR51698.1| hypothetical protein
            CICLE_v10032202mg [Citrus clementina]
          Length = 307

 Score =  293 bits (750), Expect = 2e-76
 Identities = 150/235 (63%), Positives = 184/235 (78%), Gaps = 6/235 (2%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            +  GRFLTN+E+EKL+ LE+F +FQEL+SG L++R+M+ EE+D TVSLLAESF+ESML+P
Sbjct: 73   FSTGRFLTNEELEKLKTLEHFVHFQELQSGFLWVRVMRPEEMDRTVSLLAESFSESMLLP 132

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEE----EDLQLAGTVEVSFNKRGXX 967
             GY +LL F VKQYLIERRA+MPH+ATL+GF++   E    ED+  AGTVEV F+KRG  
Sbjct: 133  VGYNKLLRFFVKQYLIERRAVMPHAATLIGFYRGKGESESGEDVDFAGTVEVCFDKRGAN 192

Query: 966  XXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTA 787
                        PYICNMTV++  RRRGIGWHLLKASEELISQM+SS++VYLHCRMID A
Sbjct: 193  ASPATPTPPKNSPYICNMTVRKERRRRGIGWHLLKASEELISQMSSSKEVYLHCRMIDEA 252

Query: 786  PLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDMDS--REFPS 628
            P +MYTKAGY++VKTDNI+VLLTLQRRKHLM K++PV   P E D+     E PS
Sbjct: 253  PFNMYTKAGYSVVKTDNIIVLLTLQRRKHLMCKKLPVVDHPSESDVSGSVEELPS 307


>ref|XP_006438456.1| hypothetical protein CICLE_v10032202mg [Citrus clementina]
            gi|557540652|gb|ESR51696.1| hypothetical protein
            CICLE_v10032202mg [Citrus clementina]
          Length = 306

 Score =  289 bits (740), Expect = 2e-75
 Identities = 150/235 (63%), Positives = 184/235 (78%), Gaps = 6/235 (2%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            +  GRFLTN+E+EKL+ LE+F +FQEL+SG L++R+M+ EE+D TVSLLAESF+ESML+P
Sbjct: 73   FSTGRFLTNEELEKLKTLEHFVHFQELQSGFLWVRVMRPEEMDRTVSLLAESFSESMLLP 132

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEE----EDLQLAGTVEVSFNKRGXX 967
             GY +LL F VKQYLIERRA+MPH+ATL+GF++   E    ED+  AGTVEV F+KRG  
Sbjct: 133  VGYNKLLRFFVKQYLIERRAVMPHAATLIGFYRGKGESESGEDVDFAGTVEVCFDKRGAN 192

Query: 966  XXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTA 787
                        PYICNMTV++  RRRGIGWHLLKASEELISQM+SS++VYLHCRMID A
Sbjct: 193  ASPATPTPPKNSPYICNMTVRKE-RRRGIGWHLLKASEELISQMSSSKEVYLHCRMIDEA 251

Query: 786  PLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDMDS--REFPS 628
            P +MYTKAGY++VKTDNI+VLLTLQRRKHLM K++PV   P E D+     E PS
Sbjct: 252  PFNMYTKAGYSVVKTDNIIVLLTLQRRKHLMCKKLPVVDHPSESDVSGSVEELPS 306


>ref|XP_002312509.1| GCN5-related N-acetyltransferase family protein [Populus trichocarpa]
            gi|222852329|gb|EEE89876.1| GCN5-related
            N-acetyltransferase family protein [Populus trichocarpa]
          Length = 334

 Score =  287 bits (735), Expect = 9e-75
 Identities = 149/234 (63%), Positives = 182/234 (77%), Gaps = 8/234 (3%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            Y+ GRFL+N+EIEKL  L+NFRY+Q+L++G + +RLM+ EE+D+TV LLAESF ESML+P
Sbjct: 89   YKTGRFLSNEEIEKLNALQNFRYYQQLETGSMCVRLMKPEEMDITVKLLAESFVESMLLP 148

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKE-------DEEEDLQ-LAGTVEVSFNK 979
             GY  LL +LVKQYLIERRA MPH+ TL+GF+K        +E+EDL+ LAGTVEV F+K
Sbjct: 149  VGYVSLLRYLVKQYLIERRAAMPHAVTLIGFYKGKQEMNTGEEKEDLEELAGTVEVCFDK 208

Query: 978  RGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRM 799
            RG              PYICNM VK+ LRRRGIGW+LLKASEELISQM+S RDVYLHCRM
Sbjct: 209  RGANTSPPTPTSPKNAPYICNMAVKQSLRRRGIGWNLLKASEELISQMSSMRDVYLHCRM 268

Query: 798  IDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDMDSRE 637
            ID APL+MYTKAGYNIVKTD+I VLL LQRRKHLM K++ V  +P E+D+   +
Sbjct: 269  IDLAPLNMYTKAGYNIVKTDSIRVLLMLQRRKHLMCKKLAVLKNPSELDISGSD 322


>ref|XP_004135226.1| PREDICTED: uncharacterized protein LOC101210740 [Cucumis sativus]
            gi|449478534|ref|XP_004155344.1| PREDICTED:
            uncharacterized LOC101210740 [Cucumis sativus]
          Length = 299

 Score =  286 bits (733), Expect = 2e-74
 Identities = 147/224 (65%), Positives = 176/224 (78%), Gaps = 5/224 (2%)
 Frame = -1

Query: 1305 GRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMPSGY 1126
            GRFLTNDE EKL+ L +F YF+EL+SG +++R+M+ +E+D TV LLAESFAESM  PS Y
Sbjct: 70   GRFLTNDEFEKLKLLGDFGYFKELESGFIWVRVMRDDELDATVGLLAESFAESMFWPSSY 129

Query: 1125 TRLLEFLVKQYLIERRALMPHSATLLGFFKE---DEEEDLQLAGTVEVSFNKRGXXXXXX 955
              LL FLVKQYLIERRALMPH+ATL+GF+K    DEEE  QLAGTVEV F+KRG      
Sbjct: 130  ISLLRFLVKQYLIERRALMPHTATLIGFYKRKDADEEEAEQLAGTVEVCFDKRGANASPP 189

Query: 954  XXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTAPLSM 775
                    PYICNMTV++ LRRRGIGWHLLKA EELISQM++SR+VYLHCRMID AP +M
Sbjct: 190  TPTPPKDSPYICNMTVQKELRRRGIGWHLLKAGEELISQMSTSREVYLHCRMIDNAPFNM 249

Query: 774  YTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVST--SPYEMDM 649
            YTKAGY++V+TD I++LL LQRRKHLM K++P  T  SP E D+
Sbjct: 250  YTKAGYSVVQTDTIIILLMLQRRKHLMRKKLPAMTRSSPSESDV 293


>ref|XP_002893318.1| hypothetical protein ARALYDRAFT_889946 [Arabidopsis lyrata subsp.
            lyrata] gi|297339160|gb|EFH69577.1| hypothetical protein
            ARALYDRAFT_889946 [Arabidopsis lyrata subsp. lyrata]
          Length = 318

 Score =  286 bits (733), Expect = 2e-74
 Identities = 141/224 (62%), Positives = 176/224 (78%), Gaps = 7/224 (3%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++ GRFL+NDE+EKL+ LE F YFQEL+SG +++R+M+HEE+D TV LLAESF ESML+P
Sbjct: 78   FRTGRFLSNDELEKLKTLEGFAYFQELESGSMWVRVMRHEEMDSTVHLLAESFGESMLLP 137

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFK-------EDEEEDLQLAGTVEVSFNKR 976
            SGY  +L FLVKQYLIERR ++PH+ TL+GFF+       +D EE+ ++AGTVEV  +KR
Sbjct: 138  SGYQSVLRFLVKQYLIERREVLPHAVTLVGFFRKKVDGFSDDGEEEAEMAGTVEVCLDKR 197

Query: 975  GXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMI 796
            G              PYICNMTVKE LRRRGIGWHLLKASEELISQ++ S+DVYLHCRM+
Sbjct: 198  GTNASPPSPTPPKESPYICNMTVKEDLRRRGIGWHLLKASEELISQISPSKDVYLHCRMV 257

Query: 795  DTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSP 664
            D AP +MY KAGY +VKTD +LVLL LQRRKHLM K++P+ T+P
Sbjct: 258  DEAPFNMYKKAGYEVVKTDTVLVLLMLQRRKHLMRKKLPLCTTP 301


>ref|XP_002266260.2| PREDICTED: uncharacterized protein LOC100246822 [Vitis vinifera]
          Length = 263

 Score =  286 bits (732), Expect = 2e-74
 Identities = 144/219 (65%), Positives = 176/219 (80%), Gaps = 2/219 (0%)
 Frame = -1

Query: 1305 GRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMPSGY 1126
            GRFL+N+E+EKL+ LENFRY  E + G +++R+M+ EEID+T +LLAESFA S+L+P  Y
Sbjct: 34   GRFLSNEELEKLRILENFRYSHEFEFGSMWVRVMRAEEIDITANLLAESFAVSLLLPIAY 93

Query: 1125 TRLLEFLVKQYLIERRALMPHSATLLGFFK--EDEEEDLQLAGTVEVSFNKRGXXXXXXX 952
             +LL +LVKQYLIE+RALMPH+ATL+GF+K  +  EE+ QLAGTVEVSFNKRG       
Sbjct: 94   VKLLAYLVKQYLIEKRALMPHTATLVGFYKGVDGGEEEEQLAGTVEVSFNKRGANASPPT 153

Query: 951  XXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTAPLSMY 772
                   PYICNMTV+EPLRRRGIGW+LLKASEELISQM+  RD+YLHCRMID AP +MY
Sbjct: 154  PTPPKNSPYICNMTVREPLRRRGIGWNLLKASEELISQMSLMRDIYLHCRMIDVAPFNMY 213

Query: 771  TKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEM 655
            TKAGY IVKTD+IL+LL LQRRKHLM K++PV   P E+
Sbjct: 214  TKAGYKIVKTDSILILLALQRRKHLMCKKLPVLDDPSEI 252


>gb|EOY00439.1| Acyl-CoA N-acyltransferases (NAT) superfamily protein, putative
            isoform 3 [Theobroma cacao]
          Length = 332

 Score =  286 bits (731), Expect = 3e-74
 Identities = 143/222 (64%), Positives = 177/222 (79%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++  RFLTN+E+EKL+ LE+F Y QEL+SG L++R M+ EE+D+TV LLAESFAESMLMP
Sbjct: 75   FRGSRFLTNEELEKLKALESFVYLQELESGSLWVRAMRAEEMDLTVGLLAESFAESMLMP 134

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEEEDLQLAGTVEVSFNKRGXXXXXX 955
             GY  LL FLVKQYLIERRA+MPH+ TL+GF++E+ +   +LAGTVEV F+KRG      
Sbjct: 135  LGYEALLRFLVKQYLIERRAVMPHAVTLVGFYRENGQRGEELAGTVEVCFDKRGANSSPP 194

Query: 954  XXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTAPLSM 775
                    PYICNMTV + LRRRGIGWHLLKASEELISQMTSS++VYLHCRMID AP +M
Sbjct: 195  SPTPPKNSPYICNMTVTKQLRRRGIGWHLLKASEELISQMTSSKEVYLHCRMIDEAPFNM 254

Query: 774  YTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDM 649
            Y KAGYN+++TD+I +LLTLQRRKHLM K++PV  +  E D+
Sbjct: 255  YIKAGYNVLQTDSIFILLTLQRRKHLMRKKLPVFNNLAESDI 296


>gb|EOY00438.1| Acyl-CoA N-acyltransferases superfamily protein isoform 2 [Theobroma
            cacao]
          Length = 347

 Score =  286 bits (731), Expect = 3e-74
 Identities = 143/222 (64%), Positives = 177/222 (79%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++  RFLTN+E+EKL+ LE+F Y QEL+SG L++R M+ EE+D+TV LLAESFAESMLMP
Sbjct: 75   FRGSRFLTNEELEKLKALESFVYLQELESGSLWVRAMRAEEMDLTVGLLAESFAESMLMP 134

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEEEDLQLAGTVEVSFNKRGXXXXXX 955
             GY  LL FLVKQYLIERRA+MPH+ TL+GF++E+ +   +LAGTVEV F+KRG      
Sbjct: 135  LGYEALLRFLVKQYLIERRAVMPHAVTLVGFYRENGQRGEELAGTVEVCFDKRGANSSPP 194

Query: 954  XXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTAPLSM 775
                    PYICNMTV + LRRRGIGWHLLKASEELISQMTSS++VYLHCRMID AP +M
Sbjct: 195  SPTPPKNSPYICNMTVTKQLRRRGIGWHLLKASEELISQMTSSKEVYLHCRMIDEAPFNM 254

Query: 774  YTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDM 649
            Y KAGYN+++TD+I +LLTLQRRKHLM K++PV  +  E D+
Sbjct: 255  YIKAGYNVLQTDSIFILLTLQRRKHLMRKKLPVFNNLAESDI 296


>gb|EOY00437.1| Acyl-CoA N-acyltransferases (NAT) superfamily protein, putative
            isoform 1 [Theobroma cacao]
          Length = 301

 Score =  286 bits (731), Expect = 3e-74
 Identities = 143/222 (64%), Positives = 177/222 (79%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++  RFLTN+E+EKL+ LE+F Y QEL+SG L++R M+ EE+D+TV LLAESFAESMLMP
Sbjct: 75   FRGSRFLTNEELEKLKALESFVYLQELESGSLWVRAMRAEEMDLTVGLLAESFAESMLMP 134

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEEEDLQLAGTVEVSFNKRGXXXXXX 955
             GY  LL FLVKQYLIERRA+MPH+ TL+GF++E+ +   +LAGTVEV F+KRG      
Sbjct: 135  LGYEALLRFLVKQYLIERRAVMPHAVTLVGFYRENGQRGEELAGTVEVCFDKRGANSSPP 194

Query: 954  XXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDTAPLSM 775
                    PYICNMTV + LRRRGIGWHLLKASEELISQMTSS++VYLHCRMID AP +M
Sbjct: 195  SPTPPKNSPYICNMTVTKQLRRRGIGWHLLKASEELISQMTSSKEVYLHCRMIDEAPFNM 254

Query: 774  YTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDM 649
            Y KAGYN+++TD+I +LLTLQRRKHLM K++PV  +  E D+
Sbjct: 255  YIKAGYNVLQTDSIFILLTLQRRKHLMRKKLPVFNNLAESDI 296


>ref|XP_004297657.1| PREDICTED: uncharacterized protein LOC101290948 [Fragaria vesca
            subsp. vesca]
          Length = 331

 Score =  283 bits (723), Expect = 2e-73
 Identities = 147/245 (60%), Positives = 179/245 (73%), Gaps = 32/245 (13%)
 Frame = -1

Query: 1305 GRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMPSGY 1126
            GRFL+N+++EKL+ LENF Y+QEL+SG +++R+M+ EE+D+TV LLAESFAESMLMPSGY
Sbjct: 71   GRFLSNEDLEKLKSLENFSYYQELESGSMWVRVMRSEELDITVGLLAESFAESMLMPSGY 130

Query: 1125 TRLLEFLVKQYLIERRALMPHSATLLGFFKEDEEEDLQ---------------------- 1012
              LL FLVKQYL+ERR LMPH+ATL+GF++ +++E                         
Sbjct: 131  VALLGFLVKQYLMERRELMPHTATLIGFYRSNKDEGKDESFEVKGESFEGKGERFEEKDG 190

Query: 1011 ----------LAGTVEVSFNKRGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLK 862
                      LAGTVEV F+K G              PYI NM V++PLRRRGIGWHLLK
Sbjct: 191  GFGGEDEGGVLAGTVEVCFDKMGANASTPTPTPPKNSPYISNMAVRKPLRRRGIGWHLLK 250

Query: 861  ASEELISQMTSSRDVYLHCRMIDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQI 682
            ASEELISQM+SSR+ YLHCRMIDTAP +MYTKAGYNIVKTD+IL+LLTLQRRKHLMYK++
Sbjct: 251  ASEELISQMSSSREAYLHCRMIDTAPFNMYTKAGYNIVKTDSILILLTLQRRKHLMYKKL 310

Query: 681  PVSTS 667
            PV TS
Sbjct: 311  PVLTS 315


>gb|EXB82640.1| hypothetical protein L484_027821 [Morus notabilis]
          Length = 325

 Score =  281 bits (718), Expect = 9e-73
 Identities = 144/228 (63%), Positives = 177/228 (77%), Gaps = 10/228 (4%)
 Frame = -1

Query: 1305 GRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMPSGY 1126
            GRFL+N+E+EKL+FLE+  Y +EL+SG L++R M+ +E+D+TV+LLAESFAESML+PS Y
Sbjct: 89   GRFLSNEELEKLKFLEDVTYSRELRSGSLWVRAMRADEMDITVALLAESFAESMLLPSAY 148

Query: 1125 TRLLEFLVKQYLIERRALMPHSATLLGFFKE----------DEEEDLQLAGTVEVSFNKR 976
              LL FLVKQYLIERRA+MPH+ TL+GFF+E          +EE + +LAGTVEV F+K 
Sbjct: 149  NSLLRFLVKQYLIERRAVMPHAVTLVGFFRERKENGDGDEEEEEGEEKLAGTVEVCFDKI 208

Query: 975  GXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMI 796
            G              PYICNMTVKE LRRRGIGWHLLKASEELISQM+S+ +VYLHCRM 
Sbjct: 209  GANASPPTPTPPKNSPYICNMTVKEQLRRRGIGWHLLKASEELISQMSSTSEVYLHCRMT 268

Query: 795  DTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMD 652
            D AP +MYTKAGY +VKTD++L+LL LQRRKHLM K++PV  SP E D
Sbjct: 269  DEAPFNMYTKAGYQVVKTDSLLILLMLQRRKHLMCKKLPVVKSPAESD 316


>ref|XP_002531402.1| N-acetyltransferase, putative [Ricinus communis]
            gi|223528995|gb|EEF30986.1| N-acetyltransferase, putative
            [Ricinus communis]
          Length = 315

 Score =  280 bits (717), Expect = 1e-72
 Identities = 143/230 (62%), Positives = 178/230 (77%), Gaps = 8/230 (3%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++ GRFL+N+E+EKL+ LE F YFQELK+G L +R+M+ EE+D+TV LLAESFAESML+P
Sbjct: 82   FKTGRFLSNEELEKLKTLEKFTYFQELKTGSLLVRVMRPEEMDITVKLLAESFAESMLLP 141

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFF--------KEDEEEDLQLAGTVEVSFNK 979
             GY  LL FLVKQYLIERRA+MPH+ TL+GF+         + EEE+  LAGTVEV F+K
Sbjct: 142  VGYVSLLRFLVKQYLIERRAVMPHAVTLVGFYIGKDEGNNGDGEEEEEMLAGTVEVCFDK 201

Query: 978  RGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRM 799
            RG              PYICNMTVK+ LRRRGIGW+LLKASEELISQM+   +VYLHCRM
Sbjct: 202  RGANASPPTPVPPKNSPYICNMTVKDSLRRRGIGWNLLKASEELISQMSCKGEVYLHCRM 261

Query: 798  IDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDM 649
            ID+AP +MY KAGY++VKTD+IL+LL LQRRKHLM K++PV   P E+++
Sbjct: 262  IDSAPFNMYIKAGYDVVKTDSILILLMLQRRKHLMCKKLPVLDDPSEVNL 311


>ref|NP_173815.2| acyl-CoA N-acyltransferases-like protein [Arabidopsis thaliana]
            gi|30688704|ref|NP_849703.1| acyl-CoA
            N-acyltransferases-like protein [Arabidopsis thaliana]
            gi|9369403|gb|AAF87151.1|AC002423_16 T23E23.19
            [Arabidopsis thaliana] gi|26450529|dbj|BAC42377.1|
            unknown protein [Arabidopsis thaliana]
            gi|38603846|gb|AAR24668.1| At1g24040 [Arabidopsis
            thaliana] gi|51969304|dbj|BAD43344.1| unknown protein
            [Arabidopsis thaliana] gi|51970058|dbj|BAD43721.1|
            unknown protein [Arabidopsis thaliana]
            gi|51970204|dbj|BAD43794.1| unknown protein [Arabidopsis
            thaliana] gi|110736204|dbj|BAF00073.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332192349|gb|AEE30470.1| acyl-CoA
            N-acyltransferases-like protein [Arabidopsis thaliana]
            gi|332192350|gb|AEE30471.1| acyl-CoA
            N-acyltransferases-like protein [Arabidopsis thaliana]
          Length = 319

 Score =  280 bits (715), Expect = 2e-72
 Identities = 140/225 (62%), Positives = 174/225 (77%), Gaps = 8/225 (3%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++ GRFL+NDE+EKL+ LE F YFQEL+SG +++R+M+HEE+D TV LLAESF ESML+P
Sbjct: 78   FRTGRFLSNDELEKLKTLEGFAYFQELESGSMWVRVMRHEEMDSTVHLLAESFGESMLLP 137

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFK-------EDEEEDLQLAGTVEVSFNKR 976
            SGY  +L FL+KQYLIERR ++PH+ TL+GFF+       +D EE+  +AGTVEV   KR
Sbjct: 138  SGYQSVLRFLIKQYLIERREVLPHAVTLVGFFRKKVDEFSDDGEEEAVMAGTVEVCLEKR 197

Query: 975  GXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMI 796
            G              PYICNMTVKE LRRRGIGWHLLKASEELISQ++ S+DVYLHCRM+
Sbjct: 198  GANASPPSPTPPKESPYICNMTVKEDLRRRGIGWHLLKASEELISQISPSKDVYLHCRMV 257

Query: 795  DTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQ-IPVSTSP 664
            D AP +MY KAGY +VKTD +LVLL LQRRKHLM K+ +P+ T+P
Sbjct: 258  DEAPFNMYKKAGYEVVKTDTVLVLLMLQRRKHLMRKKLLPLCTNP 302


>ref|XP_006305379.1| hypothetical protein CARUB_v10009770mg [Capsella rubella]
            gi|482574090|gb|EOA38277.1| hypothetical protein
            CARUB_v10009770mg [Capsella rubella]
          Length = 320

 Score =  275 bits (704), Expect = 4e-71
 Identities = 137/221 (61%), Positives = 172/221 (77%), Gaps = 5/221 (2%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++ GRFL+N+E+EKL+ LE F YFQEL+SG + +R+M+ +E+D TV LLAESF ESML+P
Sbjct: 80   FRTGRFLSNEELEKLKTLEGFAYFQELESGSMCVRVMRSDEMDSTVHLLAESFGESMLLP 139

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEE-----EDLQLAGTVEVSFNKRGX 970
            SGY  +L FLVKQYLIERR ++PH+ TL+GFF++  +     ++ ++AGTVEV  +KRG 
Sbjct: 140  SGYQSVLRFLVKQYLIERREVLPHAVTLVGFFRKKTDGYSDGDEAEMAGTVEVCLDKRGA 199

Query: 969  XXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDT 790
                         PYICNMTVKE LRRRGIGWHLLKASEELISQ++ S+DVYLHCRM+D 
Sbjct: 200  NASPPSPTPPKESPYICNMTVKEDLRRRGIGWHLLKASEELISQISPSKDVYLHCRMVDE 259

Query: 789  APLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTS 667
            AP SMY KAGY +VKTDN+LVLL LQRRKHLM K++P  TS
Sbjct: 260  APFSMYKKAGYEVVKTDNVLVLLMLQRRKHLMRKKLPPCTS 300


>ref|XP_006416034.1| hypothetical protein EUTSA_v10008192mg [Eutrema salsugineum]
            gi|557093805|gb|ESQ34387.1| hypothetical protein
            EUTSA_v10008192mg [Eutrema salsugineum]
          Length = 326

 Score =  274 bits (701), Expect = 8e-71
 Identities = 138/234 (58%), Positives = 175/234 (74%), Gaps = 5/234 (2%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            ++ GRFL+N E+EKL+ LE F YFQEL+SG +++R+M+  E+D TV+LLAESF ESML+P
Sbjct: 79   FRTGRFLSNAELEKLKALEGFAYFQELESGSMWVRVMRAGEMDSTVNLLAESFGESMLLP 138

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKEDEE-----EDLQLAGTVEVSFNKRGX 970
            SGY  +L FLVKQYLIERR ++PH+ TL+GF+++  +     E+ ++AGTVEV  +KRG 
Sbjct: 139  SGYQSVLRFLVKQYLIERREVLPHAVTLVGFYRKKSDSCSDGEEAEMAGTVEVCLDKRGA 198

Query: 969  XXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRMIDT 790
                         PY+CNMTVKE LRRRGIGWHLLKASEELISQ++ S+DVYLHCRM+D 
Sbjct: 199  NASPPSPTPPKESPYVCNMTVKEDLRRRGIGWHLLKASEELISQLSPSKDVYLHCRMVDE 258

Query: 789  APLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDMDSREFPS 628
            AP SMY KAGY +VKTD ILVLL LQRRKHLM K++P   SP ++     E  S
Sbjct: 259  APFSMYKKAGYEVVKTDTILVLLMLQRRKHLMRKKLPPCISPSDIVGSDNELTS 312


>ref|XP_002314731.1| hypothetical protein POPTR_0010s10580g [Populus trichocarpa]
            gi|222863771|gb|EEF00902.1| hypothetical protein
            POPTR_0010s10580g [Populus trichocarpa]
          Length = 333

 Score =  268 bits (684), Expect = 8e-69
 Identities = 144/242 (59%), Positives = 181/242 (74%), Gaps = 10/242 (4%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            Y+ GRFL+N+EIEKL+ L++FR +Q+L++G L +R+M+  E+D+TV LLAESF ESM +P
Sbjct: 88   YKTGRFLSNEEIEKLKALQDFRCYQQLETGSLLVRVMKPGEMDITVKLLAESFVESMSLP 147

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKE-------DEEEDLQ-LAGTVEVSFNK 979
             GY  L+ + V+QYL ERRA +PH+ TL+GF+K        +EEEDL+ LAGTVEV F+K
Sbjct: 148  VGYVSLVCYFVQQYLTERRAAIPHAVTLIGFYKGKQETNGGEEEEDLEELAGTVEVCFDK 207

Query: 978  RGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRM 799
            RG              PYICNM VK+  RRRGIGW+LLKASEELIS+M+S RDVYLHCRM
Sbjct: 208  RGANASPPTPTPPKNAPYICNMAVKQSHRRRGIGWNLLKASEELISKMSSMRDVYLHCRM 267

Query: 798  IDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDM--DSREFPS* 625
            ID+AP +MYTKAGYNIVKTD+I VLL LQRRKHLM K++ VS +P E+D      EF S 
Sbjct: 268  IDSAPFNMYTKAGYNIVKTDSIWVLLMLQRRKHLMCKKLLVSKNPSELDTSGSDMEFSSQ 327

Query: 624  MD 619
            MD
Sbjct: 328  MD 329


>gb|ABK96063.1| unknown [Populus trichocarpa]
          Length = 329

 Score =  268 bits (684), Expect = 8e-69
 Identities = 144/242 (59%), Positives = 181/242 (74%), Gaps = 10/242 (4%)
 Frame = -1

Query: 1314 YQRGRFLTNDEIEKLQFLENFRYFQELKSGLLYIRLMQHEEIDMTVSLLAESFAESMLMP 1135
            Y+ GRFL+N+EIEKL+ L++FR +Q+L++G L +R+M+  E+D+TV LLAESF ESM +P
Sbjct: 84   YKTGRFLSNEEIEKLKALQDFRCYQQLETGSLLVRVMKPGEMDITVKLLAESFVESMSLP 143

Query: 1134 SGYTRLLEFLVKQYLIERRALMPHSATLLGFFKE-------DEEEDLQ-LAGTVEVSFNK 979
             GY  L+ + V+QYL ERRA +PH+ TL+GF+K        +EEEDL+ LAGTVEV F+K
Sbjct: 144  VGYVSLVCYFVQQYLTERRAAIPHAVTLIGFYKGKQETNGGEEEEDLEELAGTVEVCFDK 203

Query: 978  RGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGIGWHLLKASEELISQMTSSRDVYLHCRM 799
            RG              PYICNM VK+  RRRGIGW+LLKASEELIS+M+S RDVYLHCRM
Sbjct: 204  RGANASPPTPTPPKNAPYICNMAVKQSHRRRGIGWNLLKASEELISKMSSMRDVYLHCRM 263

Query: 798  IDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKHLMYKQIPVSTSPYEMDM--DSREFPS* 625
            ID+AP +MYTKAGYNIVKTD+I VLL LQRRKHLM K++ VS +P E+D      EF S 
Sbjct: 264  IDSAPFNMYTKAGYNIVKTDSIWVLLMLQRRKHLMCKKLLVSKNPSELDTSGSDMEFSSQ 323

Query: 624  MD 619
            MD
Sbjct: 324  MD 325


>ref|XP_003613302.1| hypothetical protein MTR_5g035060 [Medicago truncatula]
            gi|355514637|gb|AES96260.1| hypothetical protein
            MTR_5g035060 [Medicago truncatula]
          Length = 299

 Score =  259 bits (661), Expect = 4e-66
 Identities = 150/317 (47%), Positives = 184/317 (58%), Gaps = 15/317 (4%)
 Frame = -1

Query: 1554 ISSTAFLSCHSLDPQLHNPHN---------NFSYNPQLITQQAPIFNQKRINHFVHIPRY 1402
            +++T  LS  SLDP  HN  N         NFS+ P L   + P F              
Sbjct: 1    MAATLSLSFSSLDPHNHNRFNIITAKTTRRNFSFTPPLPHNKPPHFK------------- 47

Query: 1401 XXXXXXXXXXXXXXXXXXXXXXXXXXXXSYQRGRFLTNDEIEKLQFLENFRYFQELKSGL 1222
                                          + G+FLTN E+  L  L  + Y   LKSG 
Sbjct: 48   ------------LFSTSSSSQSQTLTDSFLKPGKFLTNTELTTLHHLSTYLYTHTLKSGT 95

Query: 1221 LYIRLMQHEEIDMTVSLLAESFAESMLMPSGYTRLLEFLVKQYLIERRALMPHSATLLGF 1042
            +++R+M+  E+D  V LLA SFAESM+ P GY  +L FLVKQYLIERR+LMPH ATL+ F
Sbjct: 96   VWVRVMRDSEVDAIVCLLANSFAESMMFPKGYINVLRFLVKQYLIERRSLMPHMATLIAF 155

Query: 1041 FK------EDEEEDLQLAGTVEVSFNKRGXXXXXXXXXXXXXXPYICNMTVKEPLRRRGI 880
            +K      + EEE++QLAGTVE+SFN  G              PYICNM V + LRRRGI
Sbjct: 156  YKGSGVNGDGEEEEMQLAGTVEISFNVYGANSTLPSPDPPKDKPYICNMAVDKSLRRRGI 215

Query: 879  GWHLLKASEELISQMTSSRDVYLHCRMIDTAPLSMYTKAGYNIVKTDNILVLLTLQRRKH 700
            GWHLLKASEELIS+M+SS +VYLHCRMID AP +MYTKA Y IV TD+ILVLL LQRRKH
Sbjct: 216  GWHLLKASEELISRMSSSGEVYLHCRMIDEAPFNMYTKADYKIVTTDSILVLLLLQRRKH 275

Query: 699  LMYKQIPVSTSPYEMDM 649
            LM K++P+   P E D+
Sbjct: 276  LMCKKLPLINMPSETDV 292

