BLASTX nr result

ID: Coptis25_contig00028090 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00028090
         (1373 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002466618.1| hypothetical protein SORBIDRAFT_01g011130 [S...   116   2e-23
gb|EEC83784.1| hypothetical protein OsI_29682 [Oryza sativa Indi...   105   4e-20
emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulga...   102   2e-19
gb|EEE66057.1| hypothetical protein OsJ_22054 [Oryza sativa Japo...   102   2e-19
gb|AAD17398.1| putative non-LTR retroelement reverse transcripta...   102   2e-19

>ref|XP_002466618.1| hypothetical protein SORBIDRAFT_01g011130 [Sorghum bicolor]
            gi|241920472|gb|EER93616.1| hypothetical protein
            SORBIDRAFT_01g011130 [Sorghum bicolor]
          Length = 463

 Score =  116 bits (290), Expect = 2e-23
 Identities = 109/451 (24%), Positives = 188/451 (41%), Gaps = 13/451 (2%)
 Frame = -3

Query: 1371 SFFWKGIHKTFNAISKGLIWGIGTGDTTSVLQDPWIK--CNEQPMTLTELGVIDPLLQNL 1198
            S+ W+ I      +  G+IW +G G   ++  +PWI      +P TL    +I       
Sbjct: 2    SYSWRSILNGLKVVKMGMIWRVGDGVGLNIWSEPWIPRDSGRKPFTLRGHHIITE----- 56

Query: 1197 MVSDLFLPDKSGWDEAKIKALVNPLICQSILEIKVRPENAPADKMRWNKDKHGVLTVKST 1018
             V++L  P    WDE  ++ +      + IL + V      ++ + W+ DKHGV +VKS 
Sbjct: 57   -VAELINPVTRQWDELLVRDIFWEEDAEVILALPVY--QGRSNMVAWHYDKHGVFSVKSA 113

Query: 1017 Y-----SFLKSDEELGQSSTCNPLTL---KHIWKTPIPAYIQMFLWKMYMQMLPMGDVKA 862
            Y     +++++    GQ    NP  L   K +WK   P  I+ FLW+      P+ D   
Sbjct: 114  YKVARDTYIRNHTSQGQQGGSNPGPLSLWKRVWKLSCPNKIKHFLWRFLHNSHPLRDNLI 173

Query: 861  ERKLQGEFSCPFCYKEIESAEHLFFSCDWIRSLWFSSHIGMRMQKLPDQTLRERVDTFVA 682
             R ++    CP C +  E   HLFF C   R +W    +G+  ++         +D    
Sbjct: 174  RRGMEIVPRCPVCNQVGEDGGHLFFKCGMARQVW--ELLGLSTEREVLANFYTPIDVVEF 231

Query: 681  WSCSSNTDKSMVGIHSAFLLNQIWKARNECKFENRKPDRERMLRDSRKLADNCLQAHREE 502
               +S + K M+ +     L   W  RN      R+ DR R  +   +  +  +Q  R  
Sbjct: 232  ILRASESRKLMMIV----ALWYTWSERNAI----REEDRRRSPQTLARCVELYVQEMRTT 283

Query: 501  HLNSNPSSSLLDKCFEKVFPIIPNESIIIRFDVSYHRQTGWAGAGAIAVDSEGRIVGAAV 322
               +NP+++   + ++   P  P + + +  D S+  +T     G +  D EG ++ +  
Sbjct: 284  ETTANPTANQEQQQYKWSKP--PVDILKLNCDGSFSPETRAGSWGVLIRDHEGDVIMSGR 341

Query: 321  RRF-RXXXXXXXXXXXXXXXXXXAQHLRVKHFVFEGDNAEVIQALQGNSYRWGWTA-LIA 148
             R                     A +L +   + E D  EV++A++ ++Y +     L+ 
Sbjct: 342  GRVNHLMTPMQAELIACLQGVQLAANLGIGRLILETDALEVVKAIKTSAYNYAAVGYLVE 401

Query: 147  NITSLLS-SFNSAAFRWISRILNGDADSLAA 58
             I SL+  +F S    +  RI N  A  LAA
Sbjct: 402  EIKSLIELNFISVECVFACRICNRAAHELAA 432


>gb|EEC83784.1| hypothetical protein OsI_29682 [Oryza sativa Indica Group]
          Length = 666

 Score =  105 bits (261), Expect = 4e-20
 Identities = 87/351 (24%), Positives = 146/351 (41%), Gaps = 8/351 (2%)
 Frame = -3

Query: 1362 WKGIHKTFNAISKGLIWGIGTGDTTSVLQDPWIKCNEQPMTLTELGVIDPLLQNLMVSDL 1183
            W+ I      + +GL+W IG G    + +DPWI  +     +T  G      +   V+DL
Sbjct: 262  WRAIEHGLELLKEGLVWRIGNGTRVRIWRDPWIPRSSTRKVITSQG----RCRIKWVADL 317

Query: 1182 FLPDKSGWDEAKIKALVNPLICQSILEIKVRPENAPADKMRWNKDKHGVLTVKSTYSFL- 1006
             L   + W+E  ++ +  P+   +IL I+   +    D + W+ +K G+ TVK+ Y    
Sbjct: 318  -LDANTNWNEQLVRQIFLPMDADAILSIRTSRQGED-DFLAWHLEKSGIFTVKTAYRLAI 375

Query: 1005 ------KSDEELGQSSTCNPLTLKHIWKTPIPAYIQMFLWKMYMQMLPMGDVKAERKLQG 844
                  K+    G S   +      IW  P+P  +++F W++    L     K  R+L+ 
Sbjct: 376  ENKLNSKNSNASGSSIEGSKSLWNTIWSCPVPPKVRIFAWRVASDCLATRVNKKGRRLEA 435

Query: 843  EFSCPFCYKEIESAEHLFFSCDWIRSLWFSSHIGMRMQKLPDQTLRERVDTFVAWSCSSN 664
              +C  C  E E+A H    C + R+LW +      + ++PDQT      T   W   + 
Sbjct: 436  LDTCTLCGTESETAFHALCRCTYARALWAALR---EVWQIPDQTTWTYQGT--KWLLLTL 490

Query: 663  TDKS-MVGIHSAFLLNQIWKARNECKFENRKPDRERMLRDSRKLADNCLQAHREEHLNSN 487
               S M  +    LL +IW  RNE   + R    E   R      D+ L   +    + +
Sbjct: 491  VKLSEMERMFILMLLWRIWHVRNEVVHDKRHAPIEVSKRFLVSYVDSLLGIRQHPTKDIH 550

Query: 486  PSSSLLDKCFEKVFPIIPNESIIIRFDVSYHRQTGWAGAGAIAVDSEGRIV 334
                ++  C++     +PN S     +      +G AG G I  +SEG  +
Sbjct: 551  KGKGVI--CYQ-----LPNSSRQGGSERQTRLGSGQAGIGMILRNSEGEAI 594


>emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1357

 Score =  102 bits (255), Expect = 2e-19
 Identities = 109/444 (24%), Positives = 177/444 (39%), Gaps = 7/444 (1%)
 Frame = -3

Query: 1371 SFFWKGIHKTFNAISKGLIWGIGTGDTTSVLQDPWIKCNEQPMTLTELGVIDPLLQNL-M 1195
            S+ W+ I    + + +GLIW +G G    +   PW+   E         +    ++ L +
Sbjct: 912  SYSWRSIWGAKSLVLEGLIWRVGDGTKIDIWSAPWVGDEEGRF------IKSARVEGLEV 965

Query: 1194 VSDLFLPDKSGWDEAKIKALVNPLICQSILEIKVRPENAPADKMRWNKDKHGVLTVKSTY 1015
            V DL   ++  W+   I+   N    Q IL I +       D++ W   K G  +VK+ Y
Sbjct: 966  VGDLMDVERKEWNVELIERHFNERDQQCILAIPLSTR-CLQDELTWAYSKDGTYSVKTAY 1024

Query: 1014 SFLKSDEELGQSSTCNPLTLKHIWKTPIPAYIQMFLWKMYMQMLPMGDVKAERKLQGEFS 835
               K           N L     W   +   ++ FLW+     LP+  V   R L  E  
Sbjct: 1025 MLGKGGNLDDFHRVWNIL-----WSLNVSPKVRHFLWRACTSSLPVRKVLQRRHLIDEAG 1079

Query: 834  CPFCYKEIESAEHLFFSCDWIRSLWFSSHIGMRMQKLPDQTLRERVDTFVAWSCSSNTDK 655
            CP C +E E+  HLF+ C     LW      + +  + D+ +    DT V W   S  D 
Sbjct: 1080 CPCCAREDETQFHLFYRCPMSLKLWEELGSYILLPGIEDEAM---CDTLVRW---SQMDA 1133

Query: 654  SMVGIHSAFLLNQIWKARNECKFENRKPDR----ERMLRDSRKLADNCLQAHREEHLNSN 487
             +V     ++L  +W  RN   FE+         +R++R      +  ++ +        
Sbjct: 1134 KVVQ-KGCYILWNVWVERNRRVFEHTSQPATVVGQRIMRQVEDFNNYAVKIY-----GGM 1187

Query: 486  PSSSLLDKCFEKVFPIIPNESIIIRFDVSYHRQTGWAGAGAIAVDSEGRIVGAAVRRFR- 310
             SS+ L        P+    +I +  D S   + GW G G IA DSEG++  AA RR R 
Sbjct: 1188 RSSAALSPSRWYAPPV---GAIKLNTDASL-AEEGWVGLGVIARDSEGKVCFAATRRVRA 1243

Query: 309  XXXXXXXXXXXXXXXXXXAQHLRVKHFVFEGDNAEVIQAL-QGNSYRWGWTALIANITSL 133
                              AQ       +FE D+    + L +   +     A++ +I S+
Sbjct: 1244 YWPPEVAECKAIYMATRLAQAHGYGDVIFESDSLVATKRLTKAAIFFSDLDAILGDILSM 1303

Query: 132  LSSFNSAAFRWISRILNGDADSLA 61
             ++F+S +F  + R  N  A +LA
Sbjct: 1304 CNAFSSVSFSHVKRDGNTVAHNLA 1327


>gb|EEE66057.1| hypothetical protein OsJ_22054 [Oryza sativa Japonica Group]
          Length = 940

 Score =  102 bits (255), Expect = 2e-19
 Identities = 110/461 (23%), Positives = 185/461 (40%), Gaps = 24/461 (5%)
 Frame = -3

Query: 1371 SFFWKGIHKTFNAISKGLIWGIGTGDTTSVLQDPWIKCNEQPMTLTELGVIDPLLQNLMV 1192
            SF W+ I    + + KG+ WGIG G +  +L+D WI   +  M    L + D +  + +V
Sbjct: 478  SFTWRSILFGRDLLRKGVRWGIGNGSSVKILKDHWIPGIKPSMVRPLLPMPDDVTVDFLV 537

Query: 1191 SDLFLPDKSGWDEAKIKALVNPLICQSILEIKVRPENAPADKMRWNKDKHGVLTVKSTYS 1012
            +         WDE K+ +  +    Q IL+I V       D + W  DK GV +V+S Y+
Sbjct: 538  NAAI----GEWDEDKVFSFFDETTAQQILQIPVSAHGG-EDFISWPHDKRGVFSVRSAYN 592

Query: 1011 FLKSDEELGQSSTCNPLTL----------KHIWKTPIPAYIQMFLWKMYMQMLPMGDVKA 862
              +S+  +   S      L          K +W+   P  +   LW++    LP G    
Sbjct: 593  LARSEIFMAAQSENGRGMLSGLQESANRWKELWRINAPGKMLTNLWRIVHDCLPSGFQLR 652

Query: 861  ERKLQGEFSCPFCYKEIESAEHLFFSCDWIRSLWFS--SHIGMRMQKLPDQTLRERVDTF 688
             R +     C FC ++ +  EH+F  C +   +W S   H  +++       +++ V  F
Sbjct: 653  RRHIPATDGCCFCERD-DRIEHIFLLCPFAVCIWDSIKQHFDLKLCMTDLSNMKQWVFDF 711

Query: 687  VAWSCSSNTDKSMVGIHSAFLLNQIWKARNECK----FENRKPDRERMLRDSRKLADNC- 523
            +    SSN  K+ +    A  L  IW+ARN  +      N +   +++L     +  +C 
Sbjct: 712  L--GRSSNIQKTAL----AVTLWHIWEARNHSRNNPTLANPRQVIQKILAYVEMIEQHCC 765

Query: 522  --LQAHREEHLNSNPSSSLLDKCFEKVFPIIPNESIIIRFDVSYHRQTGWAGAGAIAVDS 349
              +QA R + L   P            +   P  +I+I  D +  +     G G +  D 
Sbjct: 766  CAVQAVRGDALRPVPR-----------WRPPPEGTILINTDAAVFQSVNSFGLGFLFRDH 814

Query: 348  EGRIVGAAVRR----FRXXXXXXXXXXXXXXXXXXAQHLRVKHFVFEGDNAEVIQALQ-G 184
             G  + AA  R     +                    H ++   V   D   +IQ +Q G
Sbjct: 815  SGLCLFAANERHSGCIQPEMAEALAIRCALRTAMEEGHQKI---VLASDCLAIIQKIQSG 871

Query: 183  NSYRWGWTALIANITSLLSSFNSAAFRWISRILNGDADSLA 61
               R    AL+++I  L + F   +F  ++R+ N  A  LA
Sbjct: 872  ARDRSMVGALVSDINFLAAGFLDCSFIHVNRVTNAAAHLLA 912


>gb|AAD17398.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1225

 Score =  102 bits (254), Expect = 2e-19
 Identities = 112/442 (25%), Positives = 173/442 (39%), Gaps = 5/442 (1%)
 Frame = -3

Query: 1371 SFFWKGIHKTFNAISKGLIWGIGTGDTTSVLQDPWIKCNEQPMTLTELGVIDPLLQNLMV 1192
            SF W+ I      +  GL   IG+G  T V +DPWI         + L + D    +L V
Sbjct: 785  SFGWRSIMAAKPLLLSGLRRTIGSGMLTRVWEDPWIPSFPPRPAKSILNIRDT---HLYV 841

Query: 1191 SDLFLPDKSGWDEAKIKALVNPLICQSILEIKVRPENA-PADKMRWNKDKHGVLTVKSTY 1015
            +DL  P    W   +++ LV+P     IL I  RP     +D   W+  K G  TVKS Y
Sbjct: 842  NDLIDPVTKQWKLGRLQELVDPSDIPLILGI--RPSRTYKSDDFSWSFTKSGNYTVKSGY 899

Query: 1014 ----SFLKSDEELGQSSTCNPLTLKHIWKTPIPAYIQMFLWKMYMQMLPMGDVKAERKLQ 847
                   +   +L             +WK       + F W+     L        R + 
Sbjct: 900  WAARDLSRPTCDLPFQGPSVSALQAQVWKIKTTRKFKHFEWQCLSGCLATNQRLFSRHIG 959

Query: 846  GEFSCPFCYKEIESAEHLFFSCDWIRSLWFSSHIGMRMQKLPDQTLRERVDTFVAWSCSS 667
             E  CP C  E ES  HL F C   R +W  S I       P  +L    D  ++     
Sbjct: 960  TEKVCPRCGAEEESINHLLFLCPPSRQIWALSPIPSSEYIFPRNSLFYNFDFLLSRGKEF 1019

Query: 666  NTDKSMVGIHSAFLLNQIWKARNECKFENRKPDRERMLRDSRKLADNCLQAHREEHLNSN 487
            +  + ++ I   ++L  IWK+RN   FEN     + +L  + + A+   QA+ +E     
Sbjct: 1020 DIAEDIMEIF-PWILWYIWKSRNRFIFENVIESPQVILDFAIQEANVWKQANSKEVATEY 1078

Query: 486  PSSSLLDKCFEKVFPIIPNESIIIRFDVSYHRQTGWAGAGAIAVDSEGRIVGAAVRRFRX 307
            P   +       V   +P    + +FD S+H +   +G G + VD +  ++       + 
Sbjct: 1079 PPPQV-------VPANLPPTRNVCQFDASWHLKDTLSGHGWVLVDQDIVLLLGLKSARKS 1131

Query: 306  XXXXXXXXXXXXXXXXXAQHLRVKHFVFEGDNAEVIQALQGNSYRWGWTALIANITSLLS 127
                                L V    F  D+A+ I  L+  S    + A +A  +SL+ 
Sbjct: 1132 LSPLHAEVDSLLWAMECMISLGVSDCSFASDSADFISLLENPSEWPTFVAELATFSSLVC 1191

Query: 126  SFNSAAFRWISRILNGDADSLA 61
             F S + ++ SRI N  AD L+
Sbjct: 1192 FFPSFSIKFFSRIYNVRADCLS 1213


Top