BLASTX nr result
ID: Angelica23_contig00017054
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00017054
         (1482 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|AFP19450.1| choline monooxygenase [Camellia sinensis]              622   e-176
ref|XP_002264997.1| PREDICTED: choline monooxygenase, chloroplas...   610   e-172
ref|XP_002308100.1| predicted protein [Populus trichocarpa] gi|2...   609   e-172
ref|XP_003549280.1| PREDICTED: choline monooxygenase, chloroplas...   593   e-167
gb|AER10510.1| choline monooxygenase [Pyrus betulifolia] gi|3535...   586   e-165

>gb|AFP19450.1| choline monooxygenase [Camellia sinensis]
          Length = 434

 Score = 622 bits (1605), Expect = e-176
 Identities = 298/401 (74%), Positives = 338/401 (84%), Gaps = 5/401 (1%)
 Frame = +2

Query: 74   RIKASPFQKSNVH---KSRFISKVSNSWVSLSEGRILVDEFNPKIPIEKAATPPSSWYTD 244
            R +P KS+ + S + S S+S +L E R LV +F+PKIP+E+A TPPSSWYT
Sbjct: 34   RRSINPIFKSSYNPSFSSSYSSPSSSSLETLDEARRLVYQFDPKIPLEEALTPPSSWYTQ 93

Query: 245  SSFLQLELDHVFYKGWQAVGYNEQIKEAGDFFTGRLGNIEYVVCRDDNGELHAFHNVCRH 424
            SFL LE D VF++GWQAVG EQIKE+G+FFTGRLGN+EYVVCRDDNG +HAFHNVCRH
Sbjct: 94   PSFLSLEFDRVFFRGWQAVGCTEQIKESGNFFTGRLGNVEYVVCRDDNGNVHAFHNVCRH 153

Query: 425  HASLLAFGSGKKACFTCPYHGWTYGLDGKLLKATRITGIQNFNVNEFGLVPLGVASWGPF 604
            HASLLA GSG +CF CPYHGWTY LDG LLKATRITGI+NFNVNEFGL+PL VA WGPF
Sbjct: 154  HASLLASGSGLLSCFVCPYHGWTYXLDGALLKATRITGIRNFNVNEFGLIPLKVAIWGPF 213

Query: 605  ILLNMEPENFPHQQS--LGNVGAEWLGSASEILSVNGVDSSLSYLCRREYFIECNWKVFC 778
            +LLN+ E+ P QQ+ + N+G EWLGS+SEILS NGVDSSLSY+CRREY IECNWKVFC
Sbjct: 214  VLLNL--EDIPPQQAADVNNIGKEWLGSSSEILSTNGVDSSLSYICRREYTIECNWKVFC 271

Query: 779  DNYLDGGYHVQYAHKGLASGLNLESYSTSVFEKVSIQRCDGGQVESQEDTDRLGSKALYA 958
            DNYLDGGYHV YAHKGL+SGLNLESYST+ FEKVSIQ+CDGG E++ + +RLGSKALYA
Sbjct: 272  DNYLDGGYHVPYAHKGLSSGLNLESYSTTTFEKVSIQQCDGGSAETENEYERLGSKALYA 331

Query: 959  FIYPNFMINRYGPWMDTNLVLPLGPRKCQVIFDYFLDPSLKDDKEFIESSLKDSESVQDE 1138
            FIYPNFMINRYGPWMDTNLVLP+G RKCQVIFDYFLD SLK+D FIESSL+DSE VQ E
Sbjct: 332  FIYPNFMINRYGPWMDTNLVLPIGSRKCQVIFDYFLDTSLKEDIAFIESSLEDSERVQME 391

Query: 1139 DIFLCEAVQKGLESPAYSSGRYAPSIEKAMHHFHCLLHKNL 1261
            DI LCE VQ+GLESPAY SGRYAPS+EKAMHHFHCLL+ NL
Sbjct: 392  DIILCEGVQRGLESPAYCSGRYAPSVEKAMHHFHCLLYHNL 432

>ref|XP_002264997.1| PREDICTED: choline monooxygenase, chloroplastic [Vitis vinifera]
 gi|297735449|emb|CBI17889.3| unnamed protein product [Vitis vinifera]
          Length = 441

 Score = 610 bits (1574), Expect = e-172
 Identities = 290/411 (70%), Positives = 338/411 (82%), Gaps = 3/411 (0%)
 Frame = +2

Query: 38   PLHSSQSFQENIRIKASP--FQKSNVHKSRFISKVSNSWVSLSEGRILVDEFNPKIPIEK 211
            PL SS S + + S FQK +R I NS + + L+ +FNP+IP+E+
Sbjct: 32   PLSSSSSSRSKFNSRNSHILFQKRCPFPNRTIV---NSSSAAGKAPTLLHKFNPRIPVEQ 88

Query: 212  AATPPSSWYTDSSFLQLELDHVFYKGWQAVGYNEQIKEAGDFFTGRLGNIEYVVCRDDNG 391
            A TPPSSWYTD SFL LELD VFY+GWQAVGY EQIK DFFTGRLGN+E+VVCRD+NG
Sbjct: 89   ALTPPSSWYTDPSFLALELDRVFYRGWQAVGYTEQIKNPRDFFTGRLGNVEFVVCRDNNG 148

Query: 392  ELHAFHNVCRHHASLLAFGSGKKACFTCPYHGWTYGLDGKLLKATRITGIQNFNVNEFGL 571
            +LHAFHNVCRHHASLLA+GSG+K+CF CPYH WTYGLDG LLKATRITGI++F++NEFGL
Sbjct: 149  KLHAFHNVCRHHASLLAYGSGQKSCFVCPYHAWTYGLDGALLKATRITGIKDFSINEFGL 208

Query: 572  VPLGVASWGPFILLNMEPENFP-HQQSLGNVGAEWLGSASEILSVNGVDSSLSYLCRREY 748
            +PL +A+WGPF+LLN+ + P H+ VG EWLGS+S+ILS G+D+SLSY+CRREY
Sbjct: 209  IPLRIATWGPFVLLNINNDVSPQHEADSKIVGKEWLGSSSDILSNGGIDTSLSYVCRREY 268

Query: 749  FIECNWKVFCDNYLDGGYHVQYAHKGLASGLNLESYSTSVFEKVSIQRCDGGQVESQEDT 928
            IECNWKVFCDNYLDGGYHV YAHKGLASGL LESYST+ FE+VSIQ C+GG ES++D
Sbjct: 269  TIECNWKVFCDNYLDGGYHVPYAHKGLASGLKLESYSTTTFERVSIQSCEGGPGESEDDF 328

Query: 929  DRLGSKALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCQVIFDYFLDPSLKDDKEFIESS 1108
            DRLG+KALYAFIYPNFMINRYGPWMDTNLVLPLGPR C+V+FDYFL+ SLKDDK FIE S
Sbjct: 329  DRLGTKALYAFIYPNFMINRYGPWMDTNLVLPLGPRTCKVVFDYFLEASLKDDKAFIERS 388

Query: 1109 LKDSESVQDEDIFLCEAVQKGLESPAYSSGRYAPSIEKAMHHFHCLLHKNL 1261
            L+DSE VQ EDI LCE VQ+GLESPAY SGRYAP++E AMHHFHCLLH+NL
Sbjct: 389  LEDSERVQMEDIILCEGVQRGLESPAYCSGRYAPTVEMAMHHFHCLLHENL 439

>ref|XP_002308100.1| predicted protein [Populus trichocarpa]
 gi|222854076|gb|EEE91623.1| predicted protein [Populus trichocarpa]
          Length = 409

 Score = 609 bits (1570), Expect = e-172
 Identities = 281/360 (78%), Positives = 310/360 (86%)
 Frame = +2

Query: 173  LVDEFNPKIPIEKAATPPSSWYTDSSFLQLELDHVFYKGWQAVGYNEQIKEAGDFFTGRL 352
            LVDEF+P IPIEKA TPPSSWYTD SF EL VFYKGWQAVGY EQIK DFFTGRL
Sbjct: 45   LVDEFDPNIPIEKALTPPSSWYTDPSFFDFELHRVFYKGWQAVGYTEQIKNPRDFFTGRL 104

Query: 353  GNIEYVVCRDDNGELHAFHNVCRHHASLLAFGSGKKACFTCPYHGWTYGLDGKLLKATRI 532
            GN+E++VCRDD+G++HAFHNVCRHHASL+A G+G+K+CF CPYHGWTYGLDG LLKATRI
Sbjct: 105  GNVEFLVCRDDDGKIHAFHNVCRHHASLVASGNGQKSCFVCPYHGWTYGLDGALLKATRI 164

Query: 533  TGIQNFNVNEFGLVPLGVASWGPFILLNMEPENFPHQQSLGNVGAEWLGSASEILSVNGV 712
            TGIQNF+VNEFGL PL VA+WGPF+LLN++ E P Q++ VG+EWLGS SE L+ NGV
Sbjct: 165  TGIQNFDVNEFGLKPLNVATWGPFVLLNLDKEILPQQEADNTVGSEWLGSCSEYLAANGV 224

Query: 713  DSSLSYLCRREYFIECNWKVFCDNYLDGGYHVQYAHKGLASGLNLESYSTSVFEKVSIQR 892
            DSSLSYLCRR Y IECNWKVFCDNYLDGGYHV YAHKGLASGL L SYST +EKVSIQ
Sbjct: 225  DSSLSYLCRRVYDIECNWKVFCDNYLDGGYHVPYAHKGLASGLKLNSYSTKTYEKVSIQS 284

Query: 893  CDGGQVESQEDTDRLGSKALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCQVIFDYFLDP 1072
            CDGG ES++D DRLGSKALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCQVIFDYF++
Sbjct: 285  CDGGSTESEDDIDRLGSKALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCQVIFDYFIEA 344

Query: 1073 SLKDDKEFIESSLKDSESVQDEDIFLCEAVQKGLESPAYSSGRYAPSIEKAMHHFHCLLH 1252
            LKDDK+FIE SL DSE VQ EDI LCE VQ+GLE+PAY SGRYAP +E AMHHFH LLH
Sbjct: 345  HLKDDKDFIERSLVDSERVQIEDIVLCEGVQRGLETPAYCSGRYAPMVEHAMHHFHQLLH 404

>ref|XP_003549280.1| PREDICTED: choline monooxygenase, chloroplastic-like [Glycine max]
          Length = 418

 Score = 593 bits (1530), Expect = e-167
 Identities = 273/388 (70%), Positives = 325/388 (83%), Gaps = 1/388 (0%)
 Frame = +2

Query: 104  NVHKSRFISKVSNSWVSLSEGRILVDEFNPKIPIEKAATPPSSWYTDSSFLQLELDHVFY 283
            N H + + NS + LS+ + LV FNPK PIE+A TPP+SWYT SF LELD VFY
Sbjct: 30   NKHSTLTCCAIRNSDLKLSQTQRLVHHFNPKTPIEEAVTPPTSWYTHPSFFHLELDRVFY 89

Query: 284  KGWQAVGYNEQIKEAGDFFTGRLGNIEYVVCRDDNGELHAFHNVCRHHASLLAFGSGKKA 463
            +GWQ VG EQIK+ D+FTGRLG++EYVVCRDD+G + AFHNVCRHHASLLA+GSGKK+
Sbjct: 90   RGWQVVGSTEQIKDPRDYFTGRLGDVEYVVCRDDSGIVRAFHNVCRHHASLLAYGSGKKS 149

Query: 464  CFTCPYHGWTYGLDGKLLKATRITGIQNFNVNEFGLVPLGVASWGPFILLNMEPENFPHQ 643
            CF CPYHGWTYG +G LLKATRI+G++NFNVN+FGL+P+ VA+WGPF+LLN+E EN +
Sbjct: 150  CFVCPYHGWTYGFNGALLKATRISGMRNFNVNDFGLLPMKVATWGPFVLLNLEKENLSKK 209

Query: 644  Q-SLGNVGAEWLGSASEILSVNGVDSSLSYLCRREYFIECNWKVFCDNYLDGGYHVQYAH 820
            + NV EWLGS+SEILS NGVDSSLSY+CRREY IECNWKVFCDNYLDGGYHV YAH
Sbjct: 210  EVDSHNVSKEWLGSSSEILSTNGVDSSLSYVCRREYTIECNWKVFCDNYLDGGYHVPYAH 269

Query: 821  KGLASGLNLESYSTSVFEKVSIQRCDGGQVESQEDTDRLGSKALYAFIYPNFMINRYGPW 1000
            KGLASGL L+SYS ++FE+VSIQ C+G +++ + DRLG KA+YAF+YPNFMINRYGPW
Sbjct: 270  KGLASGLKLDSYSITMFERVSIQSCEGSSEKNKGNYDRLGRKAIYAFVYPNFMINRYGPW 329

Query: 1001 MDTNLVLPLGPRKCQVIFDYFLDPSLKDDKEFIESSLKDSESVQDEDIFLCEAVQKGLES 1180
            MDTNLV+PLGP KCQVIFDY+L+ SLKDDK+FIE SL+DSE VQ EDI LCE VQKGL+S
Sbjct: 330  MDTNLVVPLGPNKCQVIFDYYLEHSLKDDKDFIEKSLQDSEKVQIEDIVLCEGVQKGLQS 389

Query: 1181 PAYSSGRYAPSIEKAMHHFHCLLHKNLA 1264
            PAY GRYAP++E+AMHHFHCLL++NLA
Sbjct: 390  PAYRVGRYAPTVEQAMHHFHCLLYENLA 417

>gb|AER10510.1| choline monooxygenase [Pyrus betulifolia]
 gi|353529378|gb|AER10511.1| choline monooxygenase [Pyrus betulifolia]
          Length = 408

 Score = 586 bits (1510), Expect = e-165
 Identities = 271/366 (74%), Positives = 311/366 (84%), Gaps = 3/366 (0%)
 Frame = +2

Query: 173  LVDEFNPKIPIEKAATPPSSWYTDSSFLQLELDHVFYKGWQAVGYNEQIKEAGDFFTGRL 352
            LV +F P IPIE+A TPPSSWYTD SF LELD +FY+GWQAVGY EQI+ AG+FFTGRL
Sbjct: 44   LVGQFEPTIPIERAGTPPSSWYTDPSFYSLELDTLFYRGWQAVGYTEQIRNAGEFFTGRL 103

Query: 353  GNIEYVVCRDDNGELHAFHNVCRHHASLLAFGSGKKACFTCPYH---GWTYGLDGKLLKA 523
            GN+E+VVC+D +G+L AFHNVCRHHA LLA+GSG+K+CF CPYH GWTYG DG LLKA
Sbjct: 104  GNVEFVVCQDGDGKLQAFHNVCRHHAMLLAYGSGRKSCFVCPYHVRQGWTYGFDGALLKA 163

Query: 524  TRITGIQNFNVNEFGLVPLGVASWGPFILLNMEPENFPHQQSLGNVGAEWLGSASEILSV 703
            TRITGIQNFN +EFGLVPL VA+WGPFILLNME G+V EWLGS++E+LS
Sbjct: 164  TRITGIQNFNEHEFGLVPLKVATWGPFILLNMEQNV---DSDPGSVQNEWLGSSAEVLSN 220

Query: 704  NGVDSSLSYLCRREYFIECNWKVFCDNYLDGGYHVQYAHKGLASGLNLESYSTSVFEKVS 883
            NG+D+SLS++CRR+Y IECNWKVFCDNYLDGGYHV YAHKGLASGLNL+ YST+V+EKVS
Sbjct: 221  NGIDTSLSFVCRRDYVIECNWKVFCDNYLDGGYHVPYAHKGLASGLNLDGYSTTVYEKVS 280

Query: 884  IQRCDGGQVESQEDTDRLGSKALYAFIYPNFMINRYGPWMDTNLVLPLGPRKCQVIFDYF 1063
            IQ+C+ G E + D DRLGSKALYAF+YPNFMINRYGPWMDTNLVLPLG RKCQV FDYF
Sbjct: 281  IQKCESGSTERKNDFDRLGSKALYAFVYPNFMINRYGPWMDTNLVLPLGQRKCQVRFDYF 340

Query: 1064 LDPSLKDDKEFIESSLKDSESVQDEDIFLCEAVQKGLESPAYSSGRYAPSIEKAMHHFHC 1243
            ++ SLKDD +FIE SLKDSE VQ ED+ LCE VQ+GLESPAY+ GRYAP++E AMHHFHC
Sbjct: 341  IEASLKDDTDFIERSLKDSERVQMEDVMLCEGVQRGLESPAYNIGRYAPTVENAMHHFHC 400

Query: 1244 LLHKNL 1261
            LLHK L
Sbjct: 401  LLHKGL 406