BLASTX nr result

ID: Coptis24_contig00016113

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00016113
         (2184 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arab...   659   0.0  
ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabido...   659   0.0  
ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779...   654   0.0  
ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arab...   652   0.0  
emb|CAB85554.1| putative protein [Arabidopsis thaliana]               651   0.0  

>ref|NP_001190226.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|332003368|gb|AED90751.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1035

 Score =  659 bits (1699), Expect = 0.0
 Identities = 317/569 (55%), Positives = 409/569 (71%), Gaps = 2/569 (0%)
 Frame = -2

Query: 2183 YVVDEVHGLIYQMRNPDTLMRAFSLLVTNGKLSKFAHLVASSGKMLAKNMLASESVYGFA 2004
            Y+ DEVHG+ ++  +PD L++AFS L+++G+LSKFA  +ASSG++L KN++A+E + G+A
Sbjct: 468  YMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYA 527

Query: 2003 KLLENVLHFPSDVFLPDPC-QLQPHTWEWTTFRKEMEQRGSEVSDFDVNNFMKKASVVYS 1827
            +LLEN+LHFPSD FLP    QLQ   WEW  FR E+EQ  S + D     F+ K+ +V+ 
Sbjct: 528  RLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILD-SAYAFIGKSGIVFQ 586

Query: 1826 VEEEYTALGNVSNMSKNETDVLTQETPTKLDWEILKEMEDSEDSERLEAEQLEERMEKTM 1647
            VEE++  +   +N   N T  ++ E P+KLDW++L+E+E +E+ E++E+E+LE+RME+ +
Sbjct: 587  VEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDV 646

Query: 1646 GSWEEIYRTARKAEKLKFEVNERDEGELERIGQQLCIYEIYTGAGAWPFLRHGSXXXXXX 1467
              WEEIYR ARK+EKLKFEVNERDEGELER G+ LCIYEIY GAGAWPFL HGS      
Sbjct: 647  EDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLS 706

Query: 1466 XXXXXXXXXXXXXXAVLRLPLLNDTYYQNLLCELGGMFSIANRVDNIHNVPWIGFQSWHA 1287
                          A  RLPLLNDTYY+++LCE+GGMFS+AN+VD+IH  PWIGFQSW A
Sbjct: 707  LSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRA 766

Query: 1286 AGRKVSLSGKSEIALEETIQAESEGDVIYYWARLELDNGNAGGDDTLTFWSLCDILNGGQ 1107
            AGRKVSLS K+E +LE  I+ E++G++IY+W RL++D    G  + LTFWS+CDILN G 
Sbjct: 767  AGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGN 826

Query: 1106 CRTAFADAFRQMYGLSPNIEALPPMPDDGGHWSALHSWVMPTPSFLEYMMFSRMFVDSLD 927
            CRT F DAFR MYGL  +IEALPPMP+DG HWS+LH+WVMPTPSFLE++MFSRMF +SLD
Sbjct: 827  CRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLD 886

Query: 926  SLNNERN-TTTCLLGSSELEKRHCYCRMLELLVNVWAYHSARRMVYMDPSSGLFEEQHLI 750
            +L+N  N + +C L SS LE++HCYCR+LELLVNVWAYHS R+MVY++P  G  EEQH +
Sbjct: 887  ALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPL 946

Query: 749  EQRKGFMWVKFFNFTLLKNMXXXXXXXXXXXDHIREGWLWPYTGEVHWKGIXXXXXXXXX 570
            +QRKG MW K+FNFTLLK+M           DH RE WLWP TGEVHWKG+         
Sbjct: 947  QQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERY 1006

Query: 569  RXXXXXXXXXXXXXXXXXKYGYRQKTLGG 483
            R                 K GY+QK+LGG
Sbjct: 1007 RLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1035


>ref|NP_568137.1| UDP-glycosyltransferase family protein [Arabidopsis thaliana]
            gi|15450503|gb|AAK96544.1| AT5g04480/T32M21_80
            [Arabidopsis thaliana] gi|24111433|gb|AAN46867.1|
            At5g04480/T32M21_80 [Arabidopsis thaliana]
            gi|332003367|gb|AED90750.1| UDP-glycosyltransferase
            family protein [Arabidopsis thaliana]
          Length = 1050

 Score =  659 bits (1699), Expect = 0.0
 Identities = 317/569 (55%), Positives = 409/569 (71%), Gaps = 2/569 (0%)
 Frame = -2

Query: 2183 YVVDEVHGLIYQMRNPDTLMRAFSLLVTNGKLSKFAHLVASSGKMLAKNMLASESVYGFA 2004
            Y+ DEVHG+ ++  +PD L++AFS L+++G+LSKFA  +ASSG++L KN++A+E + G+A
Sbjct: 483  YMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYA 542

Query: 2003 KLLENVLHFPSDVFLPDPC-QLQPHTWEWTTFRKEMEQRGSEVSDFDVNNFMKKASVVYS 1827
            +LLEN+LHFPSD FLP    QLQ   WEW  FR E+EQ  S + D     F+ K+ +V+ 
Sbjct: 543  RLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILD-SAYAFIGKSGIVFQ 601

Query: 1826 VEEEYTALGNVSNMSKNETDVLTQETPTKLDWEILKEMEDSEDSERLEAEQLEERMEKTM 1647
            VEE++  +   +N   N T  ++ E P+KLDW++L+E+E +E+ E++E+E+LE+RME+ +
Sbjct: 602  VEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESEELEDRMERDV 661

Query: 1646 GSWEEIYRTARKAEKLKFEVNERDEGELERIGQQLCIYEIYTGAGAWPFLRHGSXXXXXX 1467
              WEEIYR ARK+EKLKFEVNERDEGELER G+ LCIYEIY GAGAWPFL HGS      
Sbjct: 662  EDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLS 721

Query: 1466 XXXXXXXXXXXXXXAVLRLPLLNDTYYQNLLCELGGMFSIANRVDNIHNVPWIGFQSWHA 1287
                          A  RLPLLNDTYY+++LCE+GGMFS+AN+VD+IH  PWIGFQSW A
Sbjct: 722  LSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRA 781

Query: 1286 AGRKVSLSGKSEIALEETIQAESEGDVIYYWARLELDNGNAGGDDTLTFWSLCDILNGGQ 1107
            AGRKVSLS K+E +LE  I+ E++G++IY+W RL++D    G  + LTFWS+CDILN G 
Sbjct: 782  AGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGN 841

Query: 1106 CRTAFADAFRQMYGLSPNIEALPPMPDDGGHWSALHSWVMPTPSFLEYMMFSRMFVDSLD 927
            CRT F DAFR MYGL  +IEALPPMP+DG HWS+LH+WVMPTPSFLE++MFSRMF +SLD
Sbjct: 842  CRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLD 901

Query: 926  SLNNERN-TTTCLLGSSELEKRHCYCRMLELLVNVWAYHSARRMVYMDPSSGLFEEQHLI 750
            +L+N  N + +C L SS LE++HCYCR+LELLVNVWAYHS R+MVY++P  G  EEQH +
Sbjct: 902  ALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPL 961

Query: 749  EQRKGFMWVKFFNFTLLKNMXXXXXXXXXXXDHIREGWLWPYTGEVHWKGIXXXXXXXXX 570
            +QRKG MW K+FNFTLLK+M           DH RE WLWP TGEVHWKG+         
Sbjct: 962  QQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERY 1021

Query: 569  RXXXXXXXXXXXXXXXXXKYGYRQKTLGG 483
            R                 K GY+QK+LGG
Sbjct: 1022 RLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1050


>ref|XP_003535489.1| PREDICTED: uncharacterized protein LOC100779157 [Glycine max]
          Length = 1044

 Score =  654 bits (1688), Expect = 0.0
 Identities = 321/568 (56%), Positives = 405/568 (71%), Gaps = 2/568 (0%)
 Frame = -2

Query: 2183 YVVDEVHGLIYQMRNPDTLMRAFSLLVTNGKLSKFAHLVASSGKMLAKNMLASESVYGFA 2004
            Y+VD VHG+ +   NP+ LM AFSLL++NG+LSKFA  +ASSG+ LAKN+LA + + G+A
Sbjct: 482  YIVDGVHGIFFSKHNPEALMNAFSLLLSNGRLSKFAQAIASSGRQLAKNVLALDCITGYA 541

Query: 2003 KLLENVLHFPSDVFLPDPC-QLQPHTWEWTTFRKEMEQRGSEVSDFDVNNFMKKASVVYS 1827
            +LLENVL+FPSD  LP P  Q+Q  +WEW  FR E++     +S  D +   +K S+VY+
Sbjct: 542  RLLENVLNFPSDALLPGPVSQIQQGSWEWNLFRNEID-----LSKIDGDFSNRKVSIVYA 596

Query: 1826 VEEEYTALGNVSNMSKNETDVLTQETPTKLDWEILKEMEDSEDSERLEAEQLEERMEKTM 1647
            VE E  +L   +++ +N T+V  ++  T+LDW+IL+E+E SE++E  E E+ EER EK +
Sbjct: 597  VEHELASLNYSTSIFENGTEVPLRDELTQLDWDILREIEISEENEMFEVEEAEERREKGV 656

Query: 1646 GSWEEIYRTARKAEKLKFEVNERDEGELERIGQQLCIYEIYTGAGAWPFLRHGSXXXXXX 1467
            G W++IYR ARK+EKLKFEVNERDEGELER GQ +CIYEIY GAG WPFL HGS      
Sbjct: 657  GVWDDIYRNARKSEKLKFEVNERDEGELERTGQPVCIYEIYNGAGVWPFLHHGSLYRGLS 716

Query: 1466 XXXXXXXXXXXXXXAVLRLPLLNDTYYQNLLCELGGMFSIANRVDNIHNVPWIGFQSWHA 1287
                          AV RLPLLNDTYY+++LCE+GGMF+IANRVDNIH  PWIGFQSW A
Sbjct: 717  LSRRAQRQSSDDVDAVGRLPLLNDTYYRDILCEMGGMFAIANRVDNIHRRPWIGFQSWRA 776

Query: 1286 AGRKVSLSGKSEIALEETIQAESEGDVIYYWARLELDNGNAGGDDTLTFWSLCDILNGGQ 1107
            AGRKV+LS K+E  LEET+Q    GDVIY+W R ++D    G  +  +FW +CDILNGG 
Sbjct: 777  AGRKVALSAKAEKVLEETMQENFRGDVIYFWGRFDMDQSVIGNHNANSFWYMCDILNGGN 836

Query: 1106 CRTAFADAFRQMYGLSPNIEALPPMPDDGGHWSALHSWVMPTPSFLEYMMFSRMFVDSLD 927
            CR  F + FRQMY L P+ EALPPMP+D G+WSALHSWVMPTPSFLE++MFSRMFVDS+D
Sbjct: 837  CRIVFQEGFRQMYALPPHAEALPPMPED-GYWSALHSWVMPTPSFLEFIMFSRMFVDSID 895

Query: 926  SLNNERNT-TTCLLGSSELEKRHCYCRMLELLVNVWAYHSARRMVYMDPSSGLFEEQHLI 750
            +L+ +    + CLLGSSE+EK+HCYCR+LELL+NVWAYHSAR+MVY++P++G  EEQH I
Sbjct: 896  ALHRDSTKYSLCLLGSSEIEKKHCYCRVLELLINVWAYHSARKMVYINPNTGSMEEQHPI 955

Query: 749  EQRKGFMWVKFFNFTLLKNMXXXXXXXXXXXDHIREGWLWPYTGEVHWKGIXXXXXXXXX 570
            EQRKGFMW K+FN +LLK+M           DH RE WLWP TGEVHW+GI         
Sbjct: 956  EQRKGFMWAKYFNISLLKSMDEDLAEAADDGDHPREMWLWPMTGEVHWQGIYEREREERY 1015

Query: 569  RXXXXXXXXXXXXXXXXXKYGYRQKTLG 486
            R                 KYGY+QK+LG
Sbjct: 1016 RLKMDKKRKTKEKLFERMKYGYKQKSLG 1043


>ref|XP_002873152.1| hypothetical protein ARALYDRAFT_487229 [Arabidopsis lyrata subsp.
            lyrata] gi|297318989|gb|EFH49411.1| hypothetical protein
            ARALYDRAFT_487229 [Arabidopsis lyrata subsp. lyrata]
          Length = 1051

 Score =  652 bits (1681), Expect = 0.0
 Identities = 315/569 (55%), Positives = 409/569 (71%), Gaps = 2/569 (0%)
 Frame = -2

Query: 2183 YVVDEVHGLIYQMRNPDTLMRAFSLLVTNGKLSKFAHLVASSGKMLAKNMLASESVYGFA 2004
            Y+ DEVHG+ ++  +PD L++AFS L+++G+LS+FA  +ASSG++L KN++A+E + G+A
Sbjct: 484  YLADEVHGIFFRRNDPDALLKAFSPLISDGRLSEFAQTIASSGRLLTKNLMATECITGYA 543

Query: 2003 KLLENVLHFPSDVFLPDPC-QLQPHTWEWTTFRKEMEQRGSEVSDFDVNNFMKKASVVYS 1827
            +LLEN+LHFPSD FLP    QLQ  +WEW+ FR E+EQ  S + D    + + K+ +V+ 
Sbjct: 544  RLLENILHFPSDTFLPGSISQLQGASWEWSFFRSELEQPKSFILDSAYAS-IGKSGIVFQ 602

Query: 1826 VEEEYTALGNVSNMSKNETDVLTQETPTKLDWEILKEMEDSEDSERLEAEQLEERMEKTM 1647
            VEE+Y  +   +N   N T  ++ E P+KLDW++L+E+E +E+ E +E+E+LE+RME+ +
Sbjct: 603  VEEKYMGVIESTNPVDNSTLFVSDELPSKLDWDVLEEIEGAEEYENVESEELEDRMERDV 662

Query: 1646 GSWEEIYRTARKAEKLKFEVNERDEGELERIGQQLCIYEIYTGAGAWPFLRHGSXXXXXX 1467
              WEEIYR ARK+EKLKFEVNERDEGELER GQ +CIYEIY GAGAWPFL HGS      
Sbjct: 663  EDWEEIYRNARKSEKLKFEVNERDEGELERTGQPVCIYEIYDGAGAWPFLHHGSLYRGLS 722

Query: 1466 XXXXXXXXXXXXXXAVLRLPLLNDTYYQNLLCELGGMFSIANRVDNIHNVPWIGFQSWHA 1287
                          A  RLPLLNDTYY+++LCE+GGMFS+AN+VD+IH  PWIGFQSW A
Sbjct: 723  LSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRA 782

Query: 1286 AGRKVSLSGKSEIALEETIQAESEGDVIYYWARLELDNGNAGGDDTLTFWSLCDILNGGQ 1107
            AGRKVSLS K+E +LE  I+ E++G++IY+W RL++D    G  + LTFWS+CDILN G 
Sbjct: 783  AGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGRKNALTFWSMCDILNQGN 842

Query: 1106 CRTAFADAFRQMYGLSPNIEALPPMPDDGGHWSALHSWVMPTPSFLEYMMFSRMFVDSLD 927
            CRT F DAFR +YGL  +IEALPPMP+DG HWS+LH+WVMPTPSFLE++MFSRMF +SLD
Sbjct: 843  CRTTFEDAFRHIYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLD 902

Query: 926  SLNNERN-TTTCLLGSSELEKRHCYCRMLELLVNVWAYHSARRMVYMDPSSGLFEEQHLI 750
            +L+N  N + +C L SS LE++HCYCR+LELLVNVWAYHS R+MVY++P  G  EEQH +
Sbjct: 903  ALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPL 962

Query: 749  EQRKGFMWVKFFNFTLLKNMXXXXXXXXXXXDHIREGWLWPYTGEVHWKGIXXXXXXXXX 570
             QRKG MW K+FNFTLLK+M           DH RE WLWP TGEVHWKG+         
Sbjct: 963  LQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERY 1022

Query: 569  RXXXXXXXXXXXXXXXXXKYGYRQKTLGG 483
            R                 K GY+QK+LGG
Sbjct: 1023 RLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1051


>emb|CAB85554.1| putative protein [Arabidopsis thaliana]
          Length = 1091

 Score =  651 bits (1680), Expect = 0.0
 Identities = 316/569 (55%), Positives = 407/569 (71%), Gaps = 2/569 (0%)
 Frame = -2

Query: 2183 YVVDEVHGLIYQMRNPDTLMRAFSLLVTNGKLSKFAHLVASSGKMLAKNMLASESVYGFA 2004
            Y+ DEVHG+ ++  +PD L++AFS L+++G+LSKFA  +ASSG++L KN++A+E + G+A
Sbjct: 526  YMADEVHGIFFRRNDPDALLKAFSPLISDGRLSKFAQTIASSGRLLTKNLMATECITGYA 585

Query: 2003 KLLENVLHFPSDVFLPDPC-QLQPHTWEWTTFRKEMEQRGSEVSDFDVNNFMKKASVVYS 1827
            +LLEN+LHFPSD FLP    QLQ   WEW  FR E+EQ  S + D     F+ K+ +V+ 
Sbjct: 586  RLLENMLHFPSDTFLPGSISQLQVAAWEWNFFRSELEQPKSFILD-SAYAFIGKSGIVFQ 644

Query: 1826 VEEEYTALGNVSNMSKNETDVLTQETPTKLDWEILKEMEDSEDSERLEAEQLEERMEKTM 1647
            VEE++  +   +N   N T  ++ E P+KLDW++L+E+E +E+ E++E+E  E+RME+ +
Sbjct: 645  VEEKFMGVIESTNPVDNNTLFVSDELPSKLDWDVLEEIEGAEEYEKVESE--EDRMERDV 702

Query: 1646 GSWEEIYRTARKAEKLKFEVNERDEGELERIGQQLCIYEIYTGAGAWPFLRHGSXXXXXX 1467
              WEEIYR ARK+EKLKFEVNERDEGELER G+ LCIYEIY GAGAWPFL HGS      
Sbjct: 703  EDWEEIYRNARKSEKLKFEVNERDEGELERTGEPLCIYEIYNGAGAWPFLHHGSLYRGLS 762

Query: 1466 XXXXXXXXXXXXXXAVLRLPLLNDTYYQNLLCELGGMFSIANRVDNIHNVPWIGFQSWHA 1287
                          A  RLPLLNDTYY+++LCE+GGMFS+AN+VD+IH  PWIGFQSW A
Sbjct: 763  LSSKDRRLSSDDVDAADRLPLLNDTYYRDILCEIGGMFSVANKVDSIHMRPWIGFQSWRA 822

Query: 1286 AGRKVSLSGKSEIALEETIQAESEGDVIYYWARLELDNGNAGGDDTLTFWSLCDILNGGQ 1107
            AGRKVSLS K+E +LE  I+ E++G++IY+W RL++D    G  + LTFWS+CDILN G 
Sbjct: 823  AGRKVSLSSKAEESLENIIKQETKGEIIYFWTRLDIDGDAYGSKNALTFWSMCDILNQGN 882

Query: 1106 CRTAFADAFRQMYGLSPNIEALPPMPDDGGHWSALHSWVMPTPSFLEYMMFSRMFVDSLD 927
            CRT F DAFR MYGL  +IEALPPMP+DG HWS+LH+WVMPTPSFLE++MFSRMF +SLD
Sbjct: 883  CRTTFEDAFRHMYGLPEHIEALPPMPEDGHHWSSLHNWVMPTPSFLEFVMFSRMFSESLD 942

Query: 926  SLNNERN-TTTCLLGSSELEKRHCYCRMLELLVNVWAYHSARRMVYMDPSSGLFEEQHLI 750
            +L+N  N + +C L SS LE++HCYCR+LELLVNVWAYHS R+MVY++P  G  EEQH +
Sbjct: 943  ALHNNLNDSKSCSLASSLLERKHCYCRVLELLVNVWAYHSGRKMVYINPRDGSLEEQHPL 1002

Query: 749  EQRKGFMWVKFFNFTLLKNMXXXXXXXXXXXDHIREGWLWPYTGEVHWKGIXXXXXXXXX 570
            +QRKG MW K+FNFTLLK+M           DH RE WLWP TGEVHWKG+         
Sbjct: 1003 QQRKGLMWAKYFNFTLLKSMDEDLAEAADDKDHPRERWLWPLTGEVHWKGVYEREREERY 1062

Query: 569  RXXXXXXXXXXXXXXXXXKYGYRQKTLGG 483
            R                 K GY+QK+LGG
Sbjct: 1063 RLKMDKKRKTKEKLYDRIKNGYKQKSLGG 1091

