BLASTX nr result

ID: Coptis23_contig00024776

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00024776
         (1156 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   110   7e-22
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...   106   1e-20
emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulga...   100   5e-19
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...   100   7e-19
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...    92   2e-16

>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1363

 Score =  110 bits (275), Expect = 7e-22
 Identities = 100/343 (29%), Positives = 144/343 (41%), Gaps = 14/343 (4%)
 Frame = -3

Query: 1070 DKLHWNLTKNGKHSVKSAYQQLSLRYVTNENHKTSLTFWRKDLRPRETLFAWKCYHSIIP 891
            D   WN  KNG  SVKSAY  ++ R        +    WRK++  +  L  W   H+I+P
Sbjct: 1004 DDFTWNFEKNGTFSVKSAYYLINRREEETGGKGSWRGLWRKNIPFKYKLLIWNGIHNILP 1063

Query: 890  TRDHLSTLFYVADKTCLLCNSSIESVHHLLLVCPFTRSLWFS-----TPWNFRLIDFENL 726
            T   L+   +  +  C+ C+  IE + HL   C    S+W        P N  L  F NL
Sbjct: 1064 TALFLAKRIHNFNPQCVACDHPIEDMIHLFRDCCVASSVWIEILKHHKPNNQNL--FFNL 1121

Query: 725  QVKEWIDMWYKPPMDWPIEDHD-WTRFCLYINWIIWLER------CRVLHDNTKPNWTHV 567
            + +EWI        D+ +  HD W        W IW  R      C V H    P +T+ 
Sbjct: 1122 EWEEWI--------DFNLNQHDYWVTKFTTAFWHIWCSRNKTVFECAVNH----PKFTYN 1169

Query: 566  YSLVHGFVNTLVGGLQHNLVISQPVAFETWKPPRLGWVKMNIVTSFKSNKYYPIGIGYVI 387
              +   F N     + +       V    WKPP  G++K+N   ++K++ +   GIG V 
Sbjct: 1170 RVVADFFTNIRAFQVNNTQGNGSKVVLR-WKPPHQGFLKLNTDGAWKAD-WENAGIGGVF 1227

Query: 386  RDSKGCFIAAGTHRGRGRTAEEAECRGALKGLQ--WCRKFSSSVEVELDAKEVCNFLQNK 213
            RD+ G +      R    + E AE     +GLQ  W   +   +EVE DAK V   L   
Sbjct: 1228 RDAVGNWELGFAKRVDAGSPEAAELMAIREGLQVAWDCNY-HKLEVECDAKGVVQLLAKP 1286

Query: 212  PTSISWSSKTILEDVLEELVTFDNVNIKWINRSANCVAHKLAS 84
              + +     I+ D+   L    +V    I R  N VAH LA+
Sbjct: 1287 LEAENHPLGVIVMDICILLTRHWSVEFLHIKREGNKVAHCLAA 1329


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score =  106 bits (265), Expect = 1e-20
 Identities = 96/386 (24%), Positives = 159/386 (41%), Gaps = 12/386 (3%)
 Frame = -3

Query: 1154 NIGILNTIFDINTSLSIQNIYIPRNNPVDKLHWNLTKNGKHSVKSAYQQLSLR------- 996
            N+ +LNT+F    S +IQ I +      D+  W ++KNG+ +V+SAY    L        
Sbjct: 982  NVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAYYHELLEDRKTGPS 1041

Query: 995  YVTNENHKTSLTFWRKDLRPRETLFAWKCYHSIIPTRDHLSTLFYVADKTCLLCNSSIES 816
                 N K     W+  + P+  LF+WK  H+ +    ++       D  C  C    E+
Sbjct: 1042 TSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRCGEKEET 1101

Query: 815  VHHLLLVCPFTRSLWFSTPWNFRLIDFENLQVKEWIDMWYKPPMDWPIEDHDWTRFCLYI 636
              HL+  C  +   W+ +P      + E    + W++           +D +W      I
Sbjct: 1102 TEHLIWGCDESSRAWYISPLRIHTGNIEAGSFRIWVESLLDTH-----KDTEWWALFWMI 1156

Query: 635  NWIIWLERCRVLHDNTKPNWTHVYSLVHGFVNTLVGGLQHNLVISQPVAFET-WKPPRLG 459
             W IWL R + + +  K  +  V       V        H   +      E  W  P +G
Sbjct: 1157 CWNIWLGRNKWVFEKKKLAFQEVVERAVRGVMEFEEECAHTSPVETLNTHENGWSVPPVG 1216

Query: 458  WVKMNIVTSFKSNKYYPIGIGYVIRDSKGCFIAA----GTHRGRGRTAEEAECRGALKGL 291
             VK+N+  +    K+  IG+G V+RD++G  + A    G        AE    R  LK +
Sbjct: 1217 MVKLNVDAAV--FKHVGIGMGGVVRDAEGDVLLATCCGGWAMEDPAMAEACSLRYGLK-V 1273

Query: 290  QWCRKFSSSVEVELDAKEVCNFLQNKPTSISWSSKTILEDVLEELVTFDNVNIKWINRSA 111
             +   F + V VE+D K++   L+ K + ++   + +++D+L       NV  + + R  
Sbjct: 1274 AYEAGFRNLV-VEMDCKKLFLQLRGKASDVTPFGR-VVDDILYLASKCSNVVFEHVKRHC 1331

Query: 110  NCVAHKLASAGHFLLPSSEWLSHPPS 33
            N VAH LA      +    WL   PS
Sbjct: 1332 NKVAHLLAQMCKNAMEKRVWLEEYPS 1357


>emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1357

 Score =  100 bits (250), Expect = 5e-19
 Identities = 96/365 (26%), Positives = 148/365 (40%), Gaps = 9/365 (2%)
 Frame = -3

Query: 1070 DKLHWNLTKNGKHSVKSAYQQLSLRYVTNENHKTSLTFWRKDLRPRETLFAWKCYHSIIP 891
            D+L W  +K+G +SVK+AY  L      ++ H+     W  ++ P+   F W+   S +P
Sbjct: 1006 DELTWAYSKDGTYSVKTAYM-LGKGGNLDDFHRVWNILWSLNVSPKVRHFLWRACTSSLP 1064

Query: 890  TRDHLSTLFYVADKTCLLCNSSIESVHHLLLVCPFTRSLWFSTPWNFRLIDFENLQVKEW 711
             R  L     + +  C  C    E+  HL   CP +  LW        L   E+  + + 
Sbjct: 1065 VRKVLQRRHLIDEAGCPCCAREDETQFHLFYRCPMSLKLWEELGSYILLPGIEDEAMCDT 1124

Query: 710  IDMWYKPPMDWPIEDHDWTRFCLYINWIIWLERCRVLHDNTKPNWT----HVYSLVHGFV 543
            +  W +  MD  +          YI W +W+ER R + ++T    T     +   V  F 
Sbjct: 1125 LVRWSQ--MDAKVVQKG-----CYILWNVWVERNRRVFEHTSQPATVVGQRIMRQVEDFN 1177

Query: 542  NTLV---GGLQHNLVISQPVAFETWKPPRLGWVKMNIVTSFKSNKYYPIGIGYVIRDSKG 372
            N  V   GG++ +  +S       W  P +G +K+N   S     +  +G+G + RDS+G
Sbjct: 1178 NYAVKIYGGMRSSAALSP----SRWYAPPVGAIKLNTDASLAEEGW--VGLGVIARDSEG 1231

Query: 371  CFIAAGTHRGRGR-TAEEAECRGALKGLQWCRKFS-SSVEVELDAKEVCNFLQNKPTSIS 198
                A T R R     E AEC+      +  +      V  E D+      L       S
Sbjct: 1232 KVCFAATRRVRAYWPPEVAECKAIYMATRLAQAHGYGDVIFESDSLVATKRLTKAAIFFS 1291

Query: 197  WSSKTILEDVLEELVTFDNVNIKWINRSANCVAHKLASAGHFLLPSSEWLSHPPSWLMHY 18
                 IL D+L     F +V+   + R  N VAH LA    F +    W  H PS +  Y
Sbjct: 1292 -DLDAILGDILSMCNAFSSVSFSHVKRDGNTVAHNLARVVPFGVEQC-WEHHCPSSVTPY 1349

Query: 17   LEDDT 3
            +  DT
Sbjct: 1350 VLMDT 1354


>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  100 bits (249), Expect = 7e-19
 Identities = 98/391 (25%), Positives = 166/391 (42%), Gaps = 12/391 (3%)
 Frame = -3

Query: 1151 IGILNTIF---DINTSLSIQNIYIPRNNPVDKLHWNLTKNGKHSVKSAYQQLSLRYVTNE 981
            + ++ T+F   DI   LSI    +P     D+L W  TKN  +SVK+AY  L      + 
Sbjct: 976  VSLIETVFNERDIKCILSIPLSSLPLK---DELTWAFTKNAHYSVKTAYM-LGKGGNLDS 1031

Query: 980  NHKTSLTFWRKDLRPRETLFAWKCYHSIIPTRDHLSTLFYVADKTCLLCNSSIESVHHLL 801
             H+  +  W  ++ P+   F W+   + +P R  L     + D  C       ES  H +
Sbjct: 1032 FHQAWIDIWSMEVSPKVKHFLWRLGTNTLPVRSLLKHRHMLDDDLCPRGCGEPESQFHAI 1091

Query: 800  LVCPFTRSLWFSTPW-NFRLIDFENLQVKEWIDMWYKPPMDWPIEDHDWTRFCLYINWII 624
              CPF R LW  +   NFR +  +    +  ++      +D  +          ++ W++
Sbjct: 1092 FGCPFIRDLWVDSGCDNFRALTTDTAMTEALVN---SHGLDASVRTKG-----AFMAWVL 1143

Query: 623  WLERCRVL--HDNTKPN--WTHVYSLV--HGFVNTLVGGLQHNLVISQPVAFETWKPPRL 462
            W ER  ++    +T P+     V  LV  HG   T    +  N       +   W  P  
Sbjct: 1144 WSERNSIVFNQSSTPPHILLARVSRLVEEHG---TYTARIYPNRNCCAIPSARVWAAPPP 1200

Query: 461  GWVKMNIVTSFKSNKYYPIGIGYVIRDSKGCFIAAGTHRGRGR-TAEEAECRGALKGLQW 285
              +K+N+  S  S  +  +G+  + RDS G  + A   + R + +AE AE +     L+ 
Sbjct: 1201 EVIKLNVDASLASAGW--VGLSVIARDSHGTVLFAAVRKVRAQWSAEIAEAKAIEMALRL 1258

Query: 284  CRKFS-SSVEVELDAKEVCNFLQNKPTSISWSSKTILEDVLEELVTFDNVNIKWINRSAN 108
             R++  +++ VE D + V N L  +   ++     IL ++    + F +V    + R AN
Sbjct: 1259 GRRYGFAAIIVESDCQVVVNRLSKQALYLA-DLDIILHNIFSSCINFPSVLWSHVKRDAN 1317

Query: 107  CVAHKLASAGHFLLPSSEWLSHPPSWLMHYL 15
             VAH LA    F +    W +H P  +  Y+
Sbjct: 1318 SVAHHLAKLTPFGI-EQIWENHVPPEVAPYV 1347


>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1715

 Score = 92.0 bits (227), Expect = 2e-16
 Identities = 92/385 (23%), Positives = 161/385 (41%), Gaps = 22/385 (5%)
 Frame = -3

Query: 1103 QNIYIPRNNPVDKLHWNLTKNGKHSVKSAYQQLSLRYVTNENHKTSLT--------FWRK 948
            +++Y+      D   W  T+N +++V+S Y   +   +T E     L          WR 
Sbjct: 1332 KSLYLSNYAARDSYKWAYTRNTQYTVRSGYWVATHVNLTEEEIINPLEGDVPLKQEIWRL 1391

Query: 947  DLRPRETLFAWKCYHSIIPTRDHLSTLFYVADKTCLLCNSSIESVHHLLLVCPFTRSLWF 768
             + P+   F W+C    + T   L      AD TC  C ++ E+++H++  C + + +W 
Sbjct: 1392 KITPKIKHFIWRCLSGALSTTTQLRNRNIPADPTCQRCCNADETINHIIFTCSYAQVVWR 1451

Query: 767  STPW--NFRLIDFENLQVKEWIDMWYKPPMDWPIEDHDWTRFCLYINWIIWLERCRVLHD 594
            S  +  + RL   +NL+    + +  K   + PI +        +I W +W  R   L  
Sbjct: 1452 SANFSGSNRLCFTDNLEENIRLILQGKKNQNLPILN---GLMPFWIMWRLWKSRNEYLFQ 1508

Query: 593  N-TKPNWTHVYSL---VHGFVNTLVG--GLQHNLVIS--QPVA-FETWKPPRLGWVKMNI 441
               +  W            +V T+V    + HN   S  +P++  + W  P  G++K N 
Sbjct: 1509 QLDRFPWKVAQKAEQEATEWVETMVNDTAISHNTAQSNDRPLSRSKQWSSPPEGFLKCNF 1568

Query: 440  VTSFKSNKYYPIGIGYVIRDSKGCFIAAGTHR-GRGRTAEEAECRGALKGLQ--WCRKFS 270
             + +   + Y    G+++RD  G  + +G  +  +  +A +AE  G L  LQ  W R + 
Sbjct: 1569 DSGYVQGRDY-TSTGWILRDCNGRVLHSGCAKLQQSYSALQAEALGFLHALQMVWIRGY- 1626

Query: 269  SSVEVELDAKEVCNFLQNKPTSISWSSKTILEDVLEELVTFDNVNIKWINRSANCVAHKL 90
              V  E D  E+ N +    T      +T+L D+   +      +I ++NR  N  A KL
Sbjct: 1627 CYVWFEGDNLELTNLINK--TEDHHLLETLLYDIRFWMTKLPFSSIGYVNRERNLAADKL 1684

Query: 89   ASAGHFLLPSSEWLSHPPSWLMHYL 15
                + +    E    PP WL  YL
Sbjct: 1685 TKYANSMSSLYETFHVPPRWLQLYL 1709

