BLASTX nr result

ID: Coptis23_contig00010206 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00010206
         (1518 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFK37128.1| unknown [Medicago truncatula]                          325   1e-86
ref|XP_002523496.1| conserved hypothetical protein [Ricinus comm...   323   7e-86
ref|XP_002324550.1| predicted protein [Populus trichocarpa] gi|2...   323   7e-86
ref|XP_003610353.1| hypothetical protein MTR_4g131190 [Medicago ...   323   1e-85
ref|XP_004146693.1| PREDICTED: UPF0549 protein C20orf43 homolog ...   320   5e-85

>gb|AFK37128.1| unknown [Medicago truncatula]
          Length = 361

 Score =  325 bits (834), Expect = 1e-86
 Identities = 192/358 (53%), Positives = 219/358 (61%), Gaps = 43/358 (12%)
 Frame = +3

Query: 84   QIFIQSPDLKIPTLPLTITQNSXXXXXXXXXXXXXXXTLYFTLNGKPLSDFSH----PIP 251
            QI +QSPDL+I   P ++T +                + YFTLNGKPLSD ++     I 
Sbjct: 7    QILVQSPDLQIH--PKSVTGDETLSDLKHSIFPNSQSSFYFTLNGKPLSDDTNFSTSRIA 64

Query: 252  NFSTIILHIKTLXXXXXXXSTCAESRDCYLNMYAVKKPDKVDPNEARLSRWTNCALSFEP 431
              ST++L  +         STCAESRDCYL MYA KKPDKVDPNE RLS+W NCALS EP
Sbjct: 65   PLSTLVLQSRLRGGGGDGGSTCAESRDCYLKMYAEKKPDKVDPNEQRLSKWQNCALSNEP 124

Query: 432  LKPPCVLDKLGNLFNKQVLVHALLSKKLPKEFGYIKGLKDMITLSLD----ESGSGFFQC 599
            L+ PCV+DKLGN+FNK+ L  ALL KKLPKEFGYIKGLKDMI + L+    E     F+C
Sbjct: 125  LREPCVIDKLGNIFNKESLAEALLGKKLPKEFGYIKGLKDMIKIKLESVPGEDDGAKFRC 184

Query: 600  PVSGLEFNGKYKFYVLRTCGHVLSGKAFKEVKSSGCLVCRKEFSEEDKIVINGNVEEVAV 779
            PV+G EFNGKYKF+ LR CGHVLS KA KEVKSS CLVC +EF E DKIVINGN EEV V
Sbjct: 185  PVAGREFNGKYKFFALRNCGHVLSAKALKEVKSSACLVCHEEFGEGDKIVINGNEEEVEV 244

Query: 780  LRXXXXXXXXXXXXXXXXXXRH----CVDGESL-----------VEVKKGSVK------- 893
            LR                  ++     VDG SL           + V+K S K       
Sbjct: 245  LRERMEEEKAKVREKKTKKVKNNDSEAVDGLSLEASKLTGTKHGLNVEKASAKVDKNGKV 304

Query: 894  ----KLKTGGAL---------VAPVNATKEVYASIFTSSRKSEFKETYSCRSLPLGRN 1028
                K   GGA          +AP NAT EVYASIFTSSRKSEFKETYSCRSLPLGRN
Sbjct: 305  ANGNKGVNGGAAAAKRFKATDIAPANAT-EVYASIFTSSRKSEFKETYSCRSLPLGRN 361


>ref|XP_002523496.1| conserved hypothetical protein [Ricinus communis]
            gi|223537203|gb|EEF38835.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 375

 Score =  323 bits (828), Expect = 7e-86
 Identities = 194/370 (52%), Positives = 221/370 (59%), Gaps = 55/370 (14%)
 Frame = +3

Query: 84   QIFIQSPDLKIPTLPLTITQNSXXXXXXXXXXXXXXXTL---YFTLNGKPLSDFSHPIPN 254
            QIF+Q P+ K+ TL L  TQ                  L   +FTLNGKPL D S PIPN
Sbjct: 7    QIFLQLPNSKLQTLTLDSTQILTLHDLKLSLFPNNHQNLSSFFFTLNGKPLLD-STPIPN 65

Query: 255  -----FSTIILHIKTLXXXXXXXSTCAESRDCYLNMYAVKKPDKVDPNEARLSRWTNCAL 419
                  ST++LH +         +T AESRDCYLNMYA KKPDKVDPNE RLS+W NCAL
Sbjct: 66   PQITSLSTLVLHSRLPGGGGDGGATGAESRDCYLNMYAEKKPDKVDPNEQRLSKWLNCAL 125

Query: 420  SFEPLKPPCVLDKLGNLFNKQVLVHALLSKKLPKEFGYIKGLKDMITLSL-------DES 578
            S EPL  PCV+DKLGN+FNK+ LV AL+ KKLPKEFGYIKGLKDMI + L       +E 
Sbjct: 126  SNEPLMQPCVIDKLGNVFNKEALVEALIGKKLPKEFGYIKGLKDMINIKLEPIPGEKEEL 185

Query: 579  GSGFFQCPVSGLEFNGKYKFYVLRTCGHVLSGKAFKEVKSSGCLVCRKEFSEEDKIVING 758
             S  F CP+SGLEFNGKYKFY L+ CGHVLS KA KEVKSS CLVC KEF E DKIVING
Sbjct: 186  YSAKFHCPISGLEFNGKYKFYALKNCGHVLSAKALKEVKSSACLVCYKEFEEFDKIVING 245

Query: 759  NVEEVAVLR------------------------XXXXXXXXXXXXXXXXXXRHCVDGESL 866
            + EEVA LR                                          +H +  +  
Sbjct: 246  SDEEVADLRERLEEERLKVHDKKSKKVKKGEVGVNGGDDCVDLDTSRLIGKKHGISDDKG 305

Query: 867  VE------VKKGSVKKLK---TGGAL-------VAPVNATKEVYASIFTSSRKSEFKETY 998
            VE      V  G V+K+K    GGA+       +AP NATKEVYASIFTSS+KS+FKETY
Sbjct: 306  VEKVVSKVVGSGKVEKVKGVGNGGAVKKFKAADMAPANATKEVYASIFTSSKKSDFKETY 365

Query: 999  SCRSLPLGRN 1028
             CRSLPLGRN
Sbjct: 366  MCRSLPLGRN 375


>ref|XP_002324550.1| predicted protein [Populus trichocarpa] gi|222865984|gb|EEF03115.1|
            predicted protein [Populus trichocarpa]
          Length = 346

 Score =  323 bits (828), Expect = 7e-86
 Identities = 187/341 (54%), Positives = 217/341 (63%), Gaps = 26/341 (7%)
 Frame = +3

Query: 84   QIFIQS--PDLKIPTLPLTITQNSXXXXXXXXXXXXXXX--TLYFTLNGKPLSDFSHPIP 251
            QIFIQS  P  K  TL L  TQ                   + YFTLNGKPL D S  +P
Sbjct: 7    QIFIQSQNPQFKTQTLTLDPTQTLTLYNLKLSLITDNQNPSSFYFTLNGKPLKD-STCLP 65

Query: 252  N-----FSTIILHIKTLXXXXXXXSTCAESRDCYLNMYAVKKPDKVDPNEARLSRWTNCA 416
            N       T+IL ++         +T AESRDCYLNMYA KKPDKVDP+E RLS+W NC+
Sbjct: 66   NPQITPLCTLILQVRLSGGGGDGGATGAESRDCYLNMYADKKPDKVDPHELRLSKWLNCS 125

Query: 417  LSFEPLKPPCVLDKLGNLFNKQVLVHALLSKKLPKEFGYIKGLKDMITLSL-----DESG 581
            LS EPL+ PCV+D+LGN+FNK+ LV AL+ KKLPKEFGYIKGLKDMI + L     D SG
Sbjct: 126  LSNEPLRQPCVIDRLGNMFNKEALVEALIGKKLPKEFGYIKGLKDMIDIQLEVVPGDGSG 185

Query: 582  SGFFQCPVSGLEFNGKYKFYVLRTCGHVLSGKAFKEVKSSGCLVCRKEFSEEDKIVINGN 761
            +  FQCPV+GLEFNGKYKF+ L+ CGHVLS KA KEVKSS CLVC KEF E DKIVING 
Sbjct: 186  NARFQCPVTGLEFNGKYKFFALKNCGHVLSAKALKEVKSSECLVCYKEFEECDKIVINGG 245

Query: 762  VEEVAVLRXXXXXXXXXXXXXXXXXXRHCVDGESLVEVKKG-----SVKKLKTGGAL--- 917
             EEVAVLR                  ++    + +V   KG     +VK +  GG++   
Sbjct: 246  DEEVAVLRERMEEERSKMKEKKMKKVKNGEGVDKVVGKVKGNGKVENVKGVSHGGSVKRF 305

Query: 918  ----VAPVNATKEVYASIFTSSRKSEFKETYSCRSLPLGRN 1028
                + P NATKEVYASIFTSS+K  FKETYSCRSLPLGRN
Sbjct: 306  KATDMVPTNATKEVYASIFTSSKKQSFKETYSCRSLPLGRN 346


>ref|XP_003610353.1| hypothetical protein MTR_4g131190 [Medicago truncatula]
            gi|355511408|gb|AES92550.1| hypothetical protein
            MTR_4g131190 [Medicago truncatula]
          Length = 459

 Score =  323 bits (827), Expect = 1e-85
 Identities = 191/355 (53%), Positives = 218/355 (61%), Gaps = 43/355 (12%)
 Frame = +3

Query: 84   QIFIQSPDLKIPTLPLTITQNSXXXXXXXXXXXXXXXTLYFTLNGKPLSDFSH----PIP 251
            QI +QSPDL+I   P ++T +                + YFTLNGKPLSD ++     I 
Sbjct: 7    QILVQSPDLQIH--PKSVTGDETLSDLKHSIFPNSQSSFYFTLNGKPLSDDTNFSTSRIA 64

Query: 252  NFSTIILHIKTLXXXXXXXSTCAESRDCYLNMYAVKKPDKVDPNEARLSRWTNCALSFEP 431
              ST++L  +         STCAESRDCYL MYA KKPDKVDPNE RLS+W NCALS EP
Sbjct: 65   PLSTLVLQSRLRGGGGDGGSTCAESRDCYLKMYAEKKPDKVDPNEQRLSKWQNCALSNEP 124

Query: 432  LKPPCVLDKLGNLFNKQVLVHALLSKKLPKEFGYIKGLKDMITLSLD----ESGSGFFQC 599
            L+ PCV+DKLGN+FNK+ LV ALL KKLPKEFGYIKGLKDMI + L+    E     F+C
Sbjct: 125  LREPCVIDKLGNIFNKESLVEALLGKKLPKEFGYIKGLKDMIKIKLESVPGEDDGAKFRC 184

Query: 600  PVSGLEFNGKYKFYVLRTCGHVLSGKAFKEVKSSGCLVCRKEFSEEDKIVINGNVEEVAV 779
            PV+GLEFNGKYKF+ LR CGHVLS KA KEVKSS CLVC +EF E DKIVINGN EEV V
Sbjct: 185  PVAGLEFNGKYKFFALRNCGHVLSAKALKEVKSSACLVCHEEFGEGDKIVINGNEEEVEV 244

Query: 780  LRXXXXXXXXXXXXXXXXXXRH----CVDGESL-----------VEVKKGSVK------- 893
            LR                  ++     VDG SL           + V+K S K       
Sbjct: 245  LRERMEEEKAKVREKKTKKVKNNDSEAVDGLSLEASKLTGTKHGLNVEKASAKVDKNGKV 304

Query: 894  ----KLKTGGAL---------VAPVNATKEVYASIFTSSRKSEFKETYSCRSLPL 1019
                K   GGA          +AP NAT EVYASIFTSSRKSEFKETYSCRSLPL
Sbjct: 305  ANGNKGVNGGAAAAKRFKATDIAPANAT-EVYASIFTSSRKSEFKETYSCRSLPL 358



 Score = 58.5 bits (140), Expect = 4e-06
 Identities = 37/62 (59%), Positives = 40/62 (64%), Gaps = 8/62 (12%)
 Frame = +3

Query: 867  VEVKKGSVKKLKTGGAL--------VAPVNATKEVYASIFTSSRKSEFKETYSCRSLPLG 1022
            +  K  +  K  TGGA         +AP NAT EVYA IFTS RKSEFKET SCRSLPLG
Sbjct: 400  IRFKAANGNKSLTGGAAAKRFKATDIAPANAT-EVYALIFTS-RKSEFKETCSCRSLPLG 457

Query: 1023 RN 1028
            RN
Sbjct: 458  RN 459


>ref|XP_004146693.1| PREDICTED: UPF0549 protein C20orf43 homolog [Cucumis sativus]
            gi|449517820|ref|XP_004165942.1| PREDICTED: UPF0549
            protein C20orf43 homolog [Cucumis sativus]
          Length = 349

 Score =  320 bits (821), Expect = 5e-85
 Identities = 183/342 (53%), Positives = 216/342 (63%), Gaps = 27/342 (7%)
 Frame = +3

Query: 84   QIFIQSPDLKI-------PTLPLTITQNSXXXXXXXXXXXXXXXTLYFTLNGKPLSDFS- 239
            QIF+QSPDL+I       P  P    ++                + YFTLNGKPL D + 
Sbjct: 10   QIFLQSPDLQIESKIVNLPQTPAKTLEDLKFSLLTEILASRIASSFYFTLNGKPLLDSTT 69

Query: 240  -HPIPNFSTIILHIKTLXXXXXXXSTCAESRDCYLNMYAVKKPDKVDPNEARLSRWTNCA 416
               IP  ST+IL  + L       +T AESRDCYLNMYA KKPDKVDPNE RLS+W NCA
Sbjct: 70   ISLIPPLSTLILRTRVLGGGGDGGATGAESRDCYLNMYAEKKPDKVDPNEQRLSKWLNCA 129

Query: 417  LSFEPLKPPCVLDKLGNLFNKQVLVHALLSKKLPKEFGYIKGLKDMITLSL------DES 578
            LS EPL+ PCV+D LGN+FNK+ LV ALL KKLPK FG+IKGLKDMI ++       +  
Sbjct: 130  LSNEPLREPCVIDWLGNVFNKESLVQALLEKKLPKGFGHIKGLKDMIKINFSMIPGTESR 189

Query: 579  GSGF----FQCPVSGLEFNGKYKFYVLRTCGHVLSGKAFKEVKSSGCLVCRKEFSEEDKI 746
            G+      +QCPV+GLEFNGKYKF+ LRTCGHVLS KA KEVKSS CLVC  EF+E DK 
Sbjct: 190  GNAISEPRYQCPVTGLEFNGKYKFFALRTCGHVLSAKALKEVKSSSCLVCHAEFAERDKF 249

Query: 747  VINGNVEEVAVLRXXXXXXXXXXXXXXXXXXR------HCVDGESLVE--VKKGSVKKLK 902
            VING+ EEV  +R                  +        +DG + V+     G+VK+ K
Sbjct: 250  VINGSEEEVEEMRERMEEEKSKSKSKEKKTKKVRNGEVERLDGGAQVKDATSNGAVKRFK 309

Query: 903  TGGALVAPVNATKEVYASIFTSSRKSEFKETYSCRSLPLGRN 1028
               A + P NATKEVYASIFTSSRKS+FKETYSCRSLPLGRN
Sbjct: 310  --AADMVPANATKEVYASIFTSSRKSDFKETYSCRSLPLGRN 349

