BLASTX nr result

ID: Coptis21_contig00000032 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00000032
         (1567 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276294.1| PREDICTED: uncharacterized protein LOC100250...   590   e-166
ref|XP_002510523.1| nucleic acid binding protein, putative [Rici...   559   e-157
ref|XP_002301980.1| predicted protein [Populus trichocarpa] gi|2...   543   e-152
ref|XP_003550423.1| PREDICTED: uncharacterized protein LOC100816...   525   e-146
ref|XP_003545035.1| PREDICTED: uncharacterized protein LOC100782...   513   e-143

>ref|XP_002276294.1| PREDICTED: uncharacterized protein LOC100250572 [Vitis vinifera]
          Length = 419

 Score =  590 bits (1520), Expect = e-166
 Identities = 304/419 (72%), Positives = 338/419 (80%), Gaps = 4/419 (0%)
 Frame = -1

Query: 1537 LFSLYPLFSAFMAVSSFFPKTRQTST----PTRKKKVQQTLAPSSWEQIKNLLTCKQVEG 1370
            L   Y L   FMA+ +F P+  +T+T    P+++K+ QQ   PSSW QIKNLLTCKQ+EG
Sbjct: 10   LLFFYLLGLLFMALLTFLPEASETNTKKQQPSKRKRKQQK-QPSSWNQIKNLLTCKQIEG 68

Query: 1369 TQVHDPSKNIVAGYSKLXXXXXXXXSVRDVVHGNTRVVHRADNSPESSKSSVGQETRLLS 1190
            +QVHDPSKN   GYSKL        S RDVVHGNTRVVHRADNSPESS  SVGQET LLS
Sbjct: 69   SQVHDPSKN-PGGYSKLGSSCGSICSFRDVVHGNTRVVHRADNSPESS--SVGQETGLLS 125

Query: 1189 RNKPLIGPSTLSTTSGSMRSNAACXXXXXXXXXXXGVQFRKLSGCYECHMIVDPSRYPVP 1010
            R       S+  + S S+RSNA+             +QFRKLSGCYECHMIVDP+RYP P
Sbjct: 126  RKTVSGSTSSTRSLSSSVRSNASATYTSSSRG----MQFRKLSGCYECHMIVDPNRYPSP 181

Query: 1009 RSTICRCTECGEIFPKIESLELHQAVRHAVKELGPEDSGRNIVEIIFKSSWLKKDNPICK 830
            R+TIC C+ECGE+FPK ESLELHQAVRHAV ELGPEDSGRNIVEIIFKSSWLKKDNPICK
Sbjct: 182  RTTICACSECGEVFPKTESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDNPICK 241

Query: 829  IERILKVHNTQRTIQRFEDCRDAVKARANMNTKKNPRCAADGNELLRFYCTTLTCALGAR 650
            IERILKVHNTQRTIQRFE+CRDAVK RAN NTKKNPRCAADGNELLRF+CTTLTCALG+R
Sbjct: 242  IERILKVHNTQRTIQRFEECRDAVKVRANNNTKKNPRCAADGNELLRFHCTTLTCALGSR 301

Query: 649  GSSNLCGSIPGCGVCTIIRHGFSSKAHDCKGVRTTASSGRAHDGLNCTDGRRAMLVCRVI 470
            GSS+LCGS+PGCGVCTIIRHGF  KA + KGVRTT SSGRAHD L CTDGRRAMLVCRVI
Sbjct: 302  GSSSLCGSVPGCGVCTIIRHGFQGKAGEAKGVRTTDSSGRAHDCLPCTDGRRAMLVCRVI 361

Query: 469  AGRVRRISDDAPLDEGILSVGSYDSIGGCAGIYSDLEELVIVNPRAILPCFVVIYKALD 293
            AGRV+R++DDAP DE   S GSYDS+ G +GIYS+LE+L + NPRAILPCFVVIYKALD
Sbjct: 362  AGRVKRMADDAP-DEDGASAGSYDSVAGYSGIYSNLEDLFVFNPRAILPCFVVIYKALD 419


>ref|XP_002510523.1| nucleic acid binding protein, putative [Ricinus communis]
            gi|223551224|gb|EEF52710.1| nucleic acid binding protein,
            putative [Ricinus communis]
          Length = 406

 Score =  559 bits (1440), Expect = e-157
 Identities = 292/412 (70%), Positives = 324/412 (78%), Gaps = 8/412 (1%)
 Frame = -1

Query: 1504 MAVSSFFPK-TRQTSTPTRKKKVQQTLA---PSSWEQIKNLLTCKQVEGTQVHDPSKNIV 1337
            MA+ +F P+ T     P+++K  QQ      PSSW+QIKNLLTCKQ+EG+ VHDPSKN  
Sbjct: 1    MALLTFLPEQTEHAKQPSKRKGKQQKQKQKQPSSWDQIKNLLTCKQIEGSSVHDPSKNSN 60

Query: 1336 -AGYSKLXXXXXXXXSVRDVVHGNTRVVHRADNSPESSKSSVGQETRLLSRNKPLIGPST 1160
              GYSKL        S RD+VHGNTRVVHRADNSPESS  +VGQET LLSR     G S+
Sbjct: 61   NIGYSKLGSSCSSICSFRDIVHGNTRVVHRADNSPESS--TVGQETGLLSRKAT--GGSS 116

Query: 1159 LSTTSGSMRSNAACXXXXXXXXXXXGVQFRKLSGCYECHMIVDPSRYPVPRSTICRCTEC 980
              T  GS RSN               +QFRKLSGCYECHMIVDPSRYP PR+TIC C +C
Sbjct: 117  TRTLGGSGRSNGGATYSSHVSSSRG-MQFRKLSGCYECHMIVDPSRYPAPRTTICTCAQC 175

Query: 979  GEIFPKIESLELHQAVRHAVKELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNT 800
            GE+FPK ESLELHQ VRHAV ELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNT
Sbjct: 176  GEVFPKTESLELHQKVRHAVSELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNT 235

Query: 799  QRTIQRFEDCRDAVKARANMNTKKNPRCAADGNELLRFYCTTLTCALGARGSSNLCGSIP 620
            QRTIQRFEDCRDAVK RA  +TKKNPRCAADGNELLRF+CTTL+C+LGARGSS+LCGSIP
Sbjct: 236  QRTIQRFEDCRDAVKTRALNSTKKNPRCAADGNELLRFHCTTLSCSLGARGSSSLCGSIP 295

Query: 619  GCGVCTIIRHGFSSKAHDCKGVRTTASSGRAHDG-LNCTDGRRAMLVCRVIAGRVRRISD 443
             CGVCTIIRHGF  K  +CKGVRTTASSGRAHD  L CTDGRRAMLVCRVIAGRV+R++D
Sbjct: 296  CCGVCTIIRHGFQGK--ECKGVRTTASSGRAHDSLLGCTDGRRAMLVCRVIAGRVKRVAD 353

Query: 442  DAP--LDEGILSVGSYDSIGGCAGIYSDLEELVIVNPRAILPCFVVIYKALD 293
            D P   +E + + GSYDS+ G AGIYS+LEEL + NPRAILPCFVVIY AL+
Sbjct: 354  DTPPAEEEALAAAGSYDSVAGYAGIYSNLEELFVFNPRAILPCFVVIYSALE 405


>ref|XP_002301980.1| predicted protein [Populus trichocarpa] gi|222843706|gb|EEE81253.1|
            predicted protein [Populus trichocarpa]
          Length = 405

 Score =  543 bits (1398), Expect = e-152
 Identities = 284/400 (71%), Positives = 320/400 (80%), Gaps = 4/400 (1%)
 Frame = -1

Query: 1480 KTRQTSTPTRK--KKVQQTLAPSSWEQIKNLLTCKQVEGTQVHDPSKNIVAGYSKLXXXX 1307
            + +Q S   RK  K+ ++   PSSW+QIKNLLTCKQ+EG++VHDPSKN + GYSKL    
Sbjct: 14   RQKQPSRCKRKLQKEKEKEKQPSSWDQIKNLLTCKQIEGSRVHDPSKNPI-GYSKLGSSC 72

Query: 1306 XXXXSVRDVVHGNTRVVHRADNSPESSKSSVGQETRLLSRNKPLIGPSTLSTTSGSMRSN 1127
                S +DVVHGNTRVVHRADNSPESS  ++GQET LLSR     G S+  + + S RSN
Sbjct: 73   SSICSFKDVVHGNTRVVHRADNSPESS--TLGQETGLLSRKGVSTGSSSTRSLTSSGRSN 130

Query: 1126 AACXXXXXXXXXXXGVQFRKLSGCYECHMIVDPSRYPVPRSTICRCTECGEIFPKIESLE 947
            +              +QFRKLSGCYECHMIVDPSRYP  R+TI  CT+CGE+FPKIESLE
Sbjct: 131  SGVTCSSSSRG----MQFRKLSGCYECHMIVDPSRYPSARTTISACTQCGEVFPKIESLE 186

Query: 946  LHQAVRHAVKELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCR 767
            LHQ VRHAV ELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCR
Sbjct: 187  LHQKVRHAVSELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCR 246

Query: 766  DAVKARANMNTKKNPRCAADGNELLRFYCTTLTCALGARGSSNLCGSIPGCGVCTIIRHG 587
            DAVK RA  +TKKNPRCAADGNELLRF+CTTLTC+LG+ GSS+LCGSIP CGVCTIIRHG
Sbjct: 247  DAVKTRALNSTKKNPRCAADGNELLRFHCTTLTCSLGSLGSSSLCGSIPVCGVCTIIRHG 306

Query: 586  FSSKAHDCKGVRTTASSGRAHDGL-NCTDGRRAMLVCRVIAGRVRRISDDA-PLDEGILS 413
            F  +  +CKGV TTASSGRAHD L  CTDGRRAMLVCRVIAGRV+R+++DA P +E   S
Sbjct: 307  F--QGIECKGVSTTASSGRAHDSLWGCTDGRRAMLVCRVIAGRVKRVAEDALPPEEDGAS 364

Query: 412  VGSYDSIGGCAGIYSDLEELVIVNPRAILPCFVVIYKALD 293
             GSYDS+ G AGIYS LEEL + NPRAILPCFVVIYKAL+
Sbjct: 365  AGSYDSVAGGAGIYSSLEELSVFNPRAILPCFVVIYKALE 404


>ref|XP_003550423.1| PREDICTED: uncharacterized protein LOC100816726 [Glycine max]
          Length = 417

 Score =  525 bits (1351), Expect = e-146
 Identities = 277/417 (66%), Positives = 319/417 (76%), Gaps = 11/417 (2%)
 Frame = -1

Query: 1519 LFSAFMAVSSFFPKTRQTSTPTRKKKVQQTLAPSSWEQIKNLLTCKQVEGTQVHDPSKNI 1340
            L ++    SS  PK      P ++++ Q+    SSW+QIKNLLTCKQ+EG++VHDPSK +
Sbjct: 3    LLTSLPEQSSSTPKRHHKRKPQQQQQQQKQKPASSWDQIKNLLTCKQIEGSRVHDPSK-V 61

Query: 1339 VAGYSKLXXXXXXXXSVRDVVHGNTRVVHRADNS-PESSKSSVGQETRLLSRNKPLIGPS 1163
            V+GYSKL        S RDVVHGNTRVVHR+DNS PESS  S+GQET  L   KP +  +
Sbjct: 62   VSGYSKLGSSCSSICSFRDVVHGNTRVVHRSDNSSPESS--SLGQETNGLLTRKP-VTTT 118

Query: 1162 TLSTTSGSMRSNAACXXXXXXXXXXXGVQFRKLSGCYECHMIVDPSRYPVPRSTICRCTE 983
            T +TT  S +S+               +QFRKLSGCYECHMI+DPSR P+ RST+C C+ 
Sbjct: 119  TTTTTRSSAKSHGGATYTSSSSSRG--MQFRKLSGCYECHMIIDPSRLPIARSTVCACSH 176

Query: 982  CGEIFPKIESLELHQAVRHAVKELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHN 803
            CGE+FPK+ESLELHQAVRHAV ELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHN
Sbjct: 177  CGEVFPKMESLELHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHN 236

Query: 802  TQRTIQRFEDCRDAVKARANMNTKKNPRCAADGNELLRFYCTTLTCALGARGSSNLCGSI 623
            TQRTIQRFE+CRD VK RA  +TKKNPRCAADGNELLRF+CTTLTCALGARGSS+LC S+
Sbjct: 237  TQRTIQRFEECRDTVKNRALGSTKKNPRCAADGNELLRFHCTTLTCALGARGSSSLCASV 296

Query: 622  PG-CGVCTIIRHGF--------SSKAHDCKGVRTTASSGRAHDGLNCTDG-RRAMLVCRV 473
             G CGVCTIIRHGF        S      KGVRTTASSGRAHD + C D  RRAMLVCRV
Sbjct: 297  HGSCGVCTIIRHGFQGGSCGGGSGDHGKAKGVRTTASSGRAHDSVVCGDATRRAMLVCRV 356

Query: 472  IAGRVRRISDDAPLDEGILSVGSYDSIGGCAGIYSDLEELVIVNPRAILPCFVVIYK 302
            IAGRV+R+ +DAP +E  +SV SYDS+ G AGIYS+LEELV+ NP+AILPCFVVIYK
Sbjct: 357  IAGRVKRVVEDAPSEEEHVSVASYDSVAGYAGIYSNLEELVVFNPKAILPCFVVIYK 413


>ref|XP_003545035.1| PREDICTED: uncharacterized protein LOC100782665 [Glycine max]
          Length = 414

 Score =  513 bits (1322), Expect = e-143
 Identities = 269/402 (66%), Positives = 307/402 (76%), Gaps = 8/402 (1%)
 Frame = -1

Query: 1483 PKTRQTSTPTRKKKVQQTLAPSSWEQIKNLLTCKQVEGTQVHDPSKNIVAGYSKLXXXXX 1304
            P+ +Q     ++K+ Q+    SSW+QIKNLLTCKQ+E ++VHDPSK  + GYSKL     
Sbjct: 22   PQQQQQKQKQKQKQKQKQKPASSWDQIKNLLTCKQMEESRVHDPSK--ITGYSKLGSSCS 79

Query: 1303 XXXSVRDVVHGNTRVVHRADNS-PESSKSSVGQETRLLSRNKPLIGPSTLSTTSGSMRSN 1127
               S RDVVHGNTRVVHR+DNS PESS  S+GQET  L   KP+   +T +T S      
Sbjct: 80   SICSFRDVVHGNTRVVHRSDNSSPESS--SLGQETNGLLTRKPV---TTTTTRSAKSNGG 134

Query: 1126 AACXXXXXXXXXXXGVQFRKLSGCYECHMIVDPSRYPVPRSTICRCTECGEIFPKIESLE 947
            A C            +QFRKLSGCYECHMI+DPSR P+ RST+C C+ CGE+FPK+ESLE
Sbjct: 135  ATCTSSSSSSRG---MQFRKLSGCYECHMIIDPSRLPIARSTVCACSHCGEVFPKMESLE 191

Query: 946  LHQAVRHAVKELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEDCR 767
            LHQAVRHAV ELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFE+CR
Sbjct: 192  LHQAVRHAVSELGPEDSGRNIVEIIFKSSWLKKDNPICKIERILKVHNTQRTIQRFEECR 251

Query: 766  DAVKARANMNTKKNPRCAADGNELLRFYCTTLTCALGARGSSNLCGSIPGCGVCTIIRHG 587
            D VK RA  +TKKNPRCAADGNELLRF+CTTLTCALGARGSS+LC S+PGC VCTIIRHG
Sbjct: 252  DTVKNRALGSTKKNPRCAADGNELLRFHCTTLTCALGARGSSSLCASVPGCSVCTIIRHG 311

Query: 586  FSSKAHD------CKGVRTTASSGRAHDGLNCTDG-RRAMLVCRVIAGRVRRISDDAPLD 428
            F             KGVRTTASSGRAHD + C D  RRAMLVCRVIAGRV+R+ +DAP +
Sbjct: 312  FQGGCGGGGDHARAKGVRTTASSGRAHDSVVCGDATRRAMLVCRVIAGRVKRVVEDAPSE 371

Query: 427  EGILSVGSYDSIGGCAGIYSDLEELVIVNPRAILPCFVVIYK 302
            E  +   SYDS+ G AGIYS+LEELV+ NP+AILPCFVVIYK
Sbjct: 372  EEHV---SYDSVAGYAGIYSNLEELVVFNPKAILPCFVVIYK 410
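
The summary table and the per-hit statistics above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output; the function names (`parse_evalue`, `parse_hit_line`, `identity_percent`) are hypothetical helpers introduced here for illustration. It assumes the legacy text layout shown in this report, where very small E-values are printed without a leading digit (`e-166`), and it reproduces the printed percentages by truncating, which is consistent with the values in this report (for example 338/419 is shown as 80%).

```python
import re

def parse_evalue(text: str) -> float:
    """Convert an E-value string from this report to float ("e-166" -> 1e-166)."""
    text = text.strip()
    if text.startswith("e"):
        # float("e-166") raises ValueError, so supply the implicit mantissa.
        text = "1" + text
    return float(text)

# One row of the "Sequences producing significant alignments" table:
# identifier, truncated description, bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")

def parse_hit_line(line: str):
    """Return a dict for one summary row, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    ident, description, bits, evalue = m.groups()
    return {
        "id": ident,
        "description": description,
        "bits": float(bits),
        "evalue": parse_evalue(evalue),
    }

IDENTITIES = re.compile(r"Identities = (\d+)/(\d+)")

def identity_percent(line: str) -> int:
    """Recompute the identity percentage from an 'Identities = a/b' line (truncated)."""
    m = IDENTITIES.search(line)
    if m is None:
        raise ValueError("no 'Identities = a/b' field found")
    matched, length = int(m.group(1)), int(m.group(2))
    return 100 * matched // length

if __name__ == "__main__":
    row = "ref|XP_002276294.1| PREDICTED: uncharacterized protein LOC100250...   590   e-166"
    print(parse_hit_line(row))
    print(identity_percent(" Identities = 304/419 (72%), Positives = 338/419 (80%), Gaps = 4/419 (0%)"))
```

The E-value handling is the main design point: passing `e-166` straight to `float` fails, so the sketch prefixes a `1` before conversion.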

