BLASTX nr result

ID: Acanthopanax21_contig00023790 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Acanthopanax21_contig00023790
         (645 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_017248645.1| PREDICTED: von Willebrand factor A domain-co...   182   4e-51
ref|XP_017248644.1| PREDICTED: von Willebrand factor A domain-co...   182   5e-50
gb|KZM92864.1| hypothetical protein DCAR_016109 [Daucus carota s...   182   7e-50
ref|XP_017255564.1| PREDICTED: uncharacterized protein LOC108225...   147   3e-37
gb|KZM89953.1| hypothetical protein DCAR_022684 [Daucus carota s...   147   3e-37
ref|XP_019194485.1| PREDICTED: uncharacterized protein LOC109188...   132   4e-32
emb|CDO97834.1| unnamed protein product [Coffea canephora]            129   1e-30
ref|XP_022889529.1| uncharacterized protein LOC111405068 isoform...   128   2e-30
ref|XP_022889530.1| uncharacterized protein LOC111405068 isoform...   127   5e-30
ref|XP_010654417.1| PREDICTED: inter-alpha-trypsin inhibitor hea...   126   9e-30
ref|XP_021984557.1| uncharacterized protein LOC110880287 [Helian...   123   1e-28
ref|XP_010654418.1| PREDICTED: inter alpha-trypsin inhibitor, he...   122   2e-28
ref|XP_002271630.1| PREDICTED: inter-alpha-trypsin inhibitor hea...   122   3e-28
gb|PLY78632.1| hypothetical protein LSAT_4X92761 [Lactuca sativa]     121   5e-28
ref|XP_023772683.1| uncharacterized protein LOC111921359 [Lactuc...   121   5e-28
ref|XP_011072601.1| LOW QUALITY PROTEIN: uncharacterized protein...   120   9e-28
ref|XP_016504589.1| PREDICTED: uncharacterized protein LOC107822...   111   1e-27
gb|KCW71308.1| hypothetical protein EUGRSUZ_F04395 [Eucalyptus g...   110   1e-27
gb|PHT47276.1| hypothetical protein CQW23_11484 [Capsicum baccatum]   118   2e-27
ref|XP_015162217.1| PREDICTED: uncharacterized protein LOC102599...   119   3e-27

>ref|XP_017248645.1| PREDICTED: von Willebrand factor A domain-containing protein
           DDB_G0292028-like isoform X2 [Daucus carota subsp.
           sativus]
          Length = 552

 Score =  182 bits (463), Expect = 4e-51
 Identities = 81/113 (71%), Positives = 96/113 (84%)
 Frame = +2

Query: 2   PVLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWG 181
           PVLVQEM+G+NGRK+IFLRN G+GFG L AT+ENLP  SGEPKLH SKDF+MKAATN+ G
Sbjct: 439 PVLVQEMMGMNGRKVIFLRNTGIGFGDLMATSENLPLGSGEPKLHVSKDFMMKAATNFCG 498

Query: 182 RLVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
            L+DR      I+ACSQ+ND+CA+AFTQLCTA ACF+CL+CCCE+ DSC QCC
Sbjct: 499 SLMDRCCCMCFIQACSQVNDKCAVAFTQLCTALACFQCLSCCCEICDSCDQCC 551


>ref|XP_017248644.1| PREDICTED: von Willebrand factor A domain-containing protein
           DDB_G0292740-like isoform X1 [Daucus carota subsp.
           sativus]
          Length = 747

 Score =  182 bits (463), Expect = 5e-50
 Identities = 81/113 (71%), Positives = 96/113 (84%)
 Frame = +2

Query: 2   PVLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWG 181
           PVLVQEM+G+NGRK+IFLRN G+GFG L AT+ENLP  SGEPKLH SKDF+MKAATN+ G
Sbjct: 634 PVLVQEMMGMNGRKVIFLRNTGIGFGDLMATSENLPLGSGEPKLHVSKDFMMKAATNFCG 693

Query: 182 RLVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
            L+DR      I+ACSQ+ND+CA+AFTQLCTA ACF+CL+CCCE+ DSC QCC
Sbjct: 694 SLMDRCCCMCFIQACSQVNDKCAVAFTQLCTALACFQCLSCCCEICDSCDQCC 746


>gb|KZM92864.1| hypothetical protein DCAR_016109 [Daucus carota subsp. sativus]
          Length = 776

 Score =  182 bits (463), Expect = 7e-50
 Identities = 81/113 (71%), Positives = 96/113 (84%)
 Frame = +2

Query: 2    PVLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWG 181
            PVLVQEM+G+NGRK+IFLRN G+GFG L AT+ENLP  SGEPKLH SKDF+MKAATN+ G
Sbjct: 663  PVLVQEMMGMNGRKVIFLRNTGIGFGDLMATSENLPLGSGEPKLHVSKDFMMKAATNFCG 722

Query: 182  RLVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
             L+DR      I+ACSQ+ND+CA+AFTQLCTA ACF+CL+CCCE+ DSC QCC
Sbjct: 723  SLMDRCCCMCFIQACSQVNDKCAVAFTQLCTALACFQCLSCCCEICDSCDQCC 775


>ref|XP_017255564.1| PREDICTED: uncharacterized protein LOC108225246 isoform X1 [Daucus
           carota subsp. sativus]
          Length = 748

 Score =  147 bits (371), Expect = 3e-37
 Identities = 70/113 (61%), Positives = 87/113 (76%)
 Frame = +2

Query: 2   PVLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWG 181
           PVLV+E++GLNGRK+I L+  G GFG+LTAT ++LPP S E KLHESK  +MK A+N   
Sbjct: 637 PVLVKEIMGLNGRKVICLQTTGFGFGNLTATYKSLPPGSEEIKLHESK--LMKVASNIQE 694

Query: 182 RLVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
           RLVDR      I+ACSQ+N+RC I FTQLCTA ACF+CL+CCCE+ D+C Q C
Sbjct: 695 RLVDRCCCKCFIQACSQVNERCTIIFTQLCTALACFQCLSCCCEICDTCEQYC 747


>gb|KZM89953.1| hypothetical protein DCAR_022684 [Daucus carota subsp. sativus]
          Length = 770

 Score =  147 bits (371), Expect = 3e-37
 Identities = 70/113 (61%), Positives = 87/113 (76%)
 Frame = +2

Query: 2   PVLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWG 181
           PVLV+E++GLNGRK+I L+  G GFG+LTAT ++LPP S E KLHESK  +MK A+N   
Sbjct: 659 PVLVKEIMGLNGRKVICLQTTGFGFGNLTATYKSLPPGSEEIKLHESK--LMKVASNIQE 716

Query: 182 RLVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
           RLVDR      I+ACSQ+N+RC I FTQLCTA ACF+CL+CCCE+ D+C Q C
Sbjct: 717 RLVDRCCCKCFIQACSQVNERCTIIFTQLCTALACFQCLSCCCEICDTCEQYC 769


>ref|XP_019194485.1| PREDICTED: uncharacterized protein LOC109188315 [Ipomoea nil]
          Length = 749

 Score =  132 bits (333), Expect = 4e-32
 Identities = 59/100 (59%), Positives = 74/100 (74%)
 Frame = +2

Query: 38  RKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXXXXI 217
           RKI++LR LG+GFG+L ATA+NLPP + EPKLHE+ D I  AA N  G+LVD       I
Sbjct: 649 RKILYLRQLGLGFGNLKATADNLPPEAAEPKLHETSDAIFTAAANCCGKLVDCCCCMCFI 708

Query: 218 RACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQC 337
           + CS+L+DRCA+ FTQLCTA ACFEC+N CCE+ + C  C
Sbjct: 709 QFCSKLSDRCAVTFTQLCTALACFECINVCCEICECCDFC 748


>emb|CDO97834.1| unnamed protein product [Coffea canephora]
          Length = 736

 Score =  129 bits (323), Expect = 1e-30
 Identities = 58/102 (56%), Positives = 73/102 (71%)
 Frame = +2

Query: 32  NGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXXX 211
           +G+K+I LR +G GFGSL ATAENLPP + EPKLHE+ + I KAA N  GR++D      
Sbjct: 634 SGKKVIVLRRVGAGFGSLKATAENLPPETAEPKLHETSEMITKAARNLCGRMLDCCCCMC 693

Query: 212 XIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQC 337
            I+ CS+LND+CA+  TQLCTA ACF C+N CCE+  SC  C
Sbjct: 694 FIQFCSRLNDQCAVVLTQLCTALACFGCMNFCCEVCFSCDFC 735


>ref|XP_022889529.1| uncharacterized protein LOC111405068 isoform X1 [Olea europaea var.
           sylvestris]
          Length = 756

 Score =  128 bits (321), Expect = 2e-30
 Identities = 62/112 (55%), Positives = 75/112 (66%), Gaps = 3/112 (2%)
 Frame = +2

Query: 11  VQEMIG-LNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRL 187
           V+E  G   G KII LRN+G GFG L ATAENLPP   EP L+E+   + KAA+N  GR+
Sbjct: 645 VEERTGDFKGEKIIVLRNIGTGFGDLEATAENLPPEKAEPNLYETSTMLFKAASNIGGRV 704

Query: 188 VDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSD--SCSQC 337
            DR      ++ C++LND+CAIA TQLCTA ACF CL  CCE+ D  SC QC
Sbjct: 705 ADRCCCMCFLQFCNRLNDQCAIALTQLCTALACFGCLKVCCEICDLCSCDQC 756


>ref|XP_022889530.1| uncharacterized protein LOC111405068 isoform X2 [Olea europaea var.
           sylvestris]
          Length = 720

 Score =  127 bits (318), Expect = 5e-30
 Identities = 59/103 (57%), Positives = 71/103 (68%), Gaps = 2/103 (1%)
 Frame = +2

Query: 35  GRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXXXX 214
           G KII LRN+G GFG L ATAENLPP   EP L+E+   + KAA+N  GR+ DR      
Sbjct: 618 GEKIIVLRNIGTGFGDLEATAENLPPEKAEPNLYETSTMLFKAASNIGGRVADRCCCMCF 677

Query: 215 IRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSD--SCSQC 337
           ++ C++LND+CAIA TQLCTA ACF CL  CCE+ D  SC QC
Sbjct: 678 LQFCNRLNDQCAIALTQLCTALACFGCLKVCCEICDLCSCDQC 720


>ref|XP_010654417.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform X2
           [Vitis vinifera]
          Length = 746

 Score =  126 bits (316), Expect = 9e-30
 Identities = 61/108 (56%), Positives = 75/108 (69%)
 Frame = +2

Query: 5   VLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGR 184
           V++QEM+   G+KII L  LG+GFG LTAT+ENLP    EPK  E  D I KAATN    
Sbjct: 638 VMLQEMVDSIGQKIILLGTLGIGFGDLTATSENLPSGVEEPKPPEGTDVIFKAATNCCAM 697

Query: 185 LVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSC 328
           + DR      I+ACS+LN++CAIAFTQLC+A  CF C++CCCEL  SC
Sbjct: 698 VADRICCMCFIQACSKLNNQCAIAFTQLCSALTCFGCMDCCCELCVSC 745


>ref|XP_021984557.1| uncharacterized protein LOC110880287 [Helianthus annuus]
 gb|OTG16955.1| putative von Willebrand factor, type A [Helianthus annuus]
          Length = 798

 Score =  123 bits (308), Expect = 1e-28
 Identities = 55/104 (52%), Positives = 73/104 (70%)
 Frame = +2

Query: 29   LNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXX 208
            L  RK+I+LRN+ VGFG L AT +NLPP   + +++E    +M+AA++ WG  +DR    
Sbjct: 695  LENRKMIYLRNISVGFGDLKATIDNLPPGIVDLEVNEPAGIMMQAASSCWGMFLDRCCCM 754

Query: 209  XXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
              I++CS++ND+CAI  TQLCTA ACFECLN CCEL D CS  C
Sbjct: 755  CLIQSCSRMNDQCAIVLTQLCTALACFECLNWCCELCDCCSDFC 798


>ref|XP_010654418.1| PREDICTED: inter alpha-trypsin inhibitor, heavy chain 4 isoform X3
           [Vitis vinifera]
          Length = 624

 Score =  122 bits (305), Expect = 2e-28
 Identities = 59/108 (54%), Positives = 74/108 (68%)
 Frame = +2

Query: 5   VLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGR 184
           +L ++M+   G+KII L  LG+GFG LTAT+ENLP    EPK  E  D I KAATN    
Sbjct: 516 MLQEKMVDSIGQKIILLGTLGIGFGDLTATSENLPSGVEEPKPPEGTDVIFKAATNCCAM 575

Query: 185 LVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSC 328
           + DR      I+ACS+LN++CAIAFTQLC+A  CF C++CCCEL  SC
Sbjct: 576 VADRICCMCFIQACSKLNNQCAIAFTQLCSALTCFGCMDCCCELCVSC 623


>ref|XP_002271630.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform X1
           [Vitis vinifera]
 emb|CBI35850.3| unnamed protein product, partial [Vitis vinifera]
          Length = 747

 Score =  122 bits (305), Expect = 3e-28
 Identities = 59/108 (54%), Positives = 74/108 (68%)
 Frame = +2

Query: 5   VLVQEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGR 184
           +L ++M+   G+KII L  LG+GFG LTAT+ENLP    EPK  E  D I KAATN    
Sbjct: 639 MLQEKMVDSIGQKIILLGTLGIGFGDLTATSENLPSGVEEPKPPEGTDVIFKAATNCCAM 698

Query: 185 LVDRXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSC 328
           + DR      I+ACS+LN++CAIAFTQLC+A  CF C++CCCEL  SC
Sbjct: 699 VADRICCMCFIQACSKLNNQCAIAFTQLCSALTCFGCMDCCCELCVSC 746


>gb|PLY78632.1| hypothetical protein LSAT_4X92761 [Lactuca sativa]
          Length = 762

 Score =  121 bits (303), Expect = 5e-28
 Identities = 55/109 (50%), Positives = 74/109 (67%)
 Frame = +2

Query: 14  QEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVD 193
           +E   L   KI++L NL VGFG++ A+ +NL P   E KL E    +M+AA++ +G ++D
Sbjct: 653 EEYSKLENHKIMYLSNLSVGFGNVMASVDNLAPGIEEVKLSEPAGMMMQAASSCYGVVLD 712

Query: 194 RXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
           R      +R CS++ND+CAIA TQLCTA ACFECLNCCCE+ DSC   C
Sbjct: 713 RCCCVSVLRCCSRMNDQCAIALTQLCTALACFECLNCCCEVCDSCGDLC 761


>ref|XP_023772683.1| uncharacterized protein LOC111921359 [Lactuca sativa]
          Length = 812

 Score =  121 bits (303), Expect = 5e-28
 Identities = 55/109 (50%), Positives = 74/109 (67%)
 Frame = +2

Query: 14   QEMIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVD 193
            +E   L   KI++L NL VGFG++ A+ +NL P   E KL E    +M+AA++ +G ++D
Sbjct: 703  EEYSKLENHKIMYLSNLSVGFGNVMASVDNLAPGIEEVKLSEPAGMMMQAASSCYGVVLD 762

Query: 194  RXXXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCSQCC 340
            R      +R CS++ND+CAIA TQLCTA ACFECLNCCCE+ DSC   C
Sbjct: 763  RCCCVSVLRCCSRMNDQCAIALTQLCTALACFECLNCCCEVCDSCGDLC 811


>ref|XP_011072601.1| LOW QUALITY PROTEIN: uncharacterized protein LOC105157816 [Sesamum
           indicum]
          Length = 745

 Score =  120 bits (301), Expect = 9e-28
 Identities = 53/96 (55%), Positives = 72/96 (75%)
 Frame = +2

Query: 29  LNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXX 208
           L G+KI+FLR LG+GFGS+ ATAEN PP   EPKL+E+ + I KAA+++ GR+VD     
Sbjct: 646 LKGQKIVFLRGLGIGFGSVQATAENRPPEHAEPKLYETSEKIFKAASDFCGRVVDCCCCM 705

Query: 209 XXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCEL 316
             I+ C++LND+CA+A TQLCTA ACF C+  CC++
Sbjct: 706 CFIQFCNRLNDQCAVALTQLCTALACFGCMEVCCDI 741


>ref|XP_016504589.1| PREDICTED: uncharacterized protein LOC107822546 [Nicotiana tabacum]
          Length = 138

 Score =  111 bits (278), Expect = 1e-27
 Identities = 50/100 (50%), Positives = 67/100 (67%)
 Frame = +2

Query: 29  LNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXX 208
           LN +KII+LR LG GFG+L AT  NLP  + EPKLHE+ + +  AA++  G+L D     
Sbjct: 38  LNVKKIIYLRALGFGFGNLKATMNNLPVEAAEPKLHETSEMVFAAASSLCGKLCDCCCCM 97

Query: 209 XXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSC 328
             I+ CS++ND+CA+   QLCTA ACFEC + CCE+   C
Sbjct: 98  CFIQFCSRVNDQCAVTLAQLCTALACFECFSFCCEVCGEC 137


>gb|KCW71308.1| hypothetical protein EUGRSUZ_F04395 [Eucalyptus grandis]
          Length = 105

 Score =  110 bits (275), Expect = 1e-27
 Identities = 51/104 (49%), Positives = 72/104 (69%)
 Frame = +2

Query: 20  MIGLNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRX 199
           M+ L G+++I L +LG GFG+LTAT +N+PP   E K   + D ++KAA+N  G+L+DR 
Sbjct: 1   MLDLKGQRVILLGSLGKGFGNLTATVKNIPPGL-EEKTSYAADVLVKAASNCCGKLMDRI 59

Query: 200 XXXXXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCS 331
                I+ CS +NDRCA+A TQLCT  ACF C++CC +L + CS
Sbjct: 60  CCMCFIQTCSYVNDRCAVALTQLCTGLACFGCIDCCFDLCECCS 103


>gb|PHT47276.1| hypothetical protein CQW23_11484 [Capsicum baccatum]
          Length = 514

 Score =  118 bits (296), Expect = 2e-27
 Identities = 53/101 (52%), Positives = 70/101 (69%)
 Frame = +2

Query: 29  LNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXX 208
           LN +KII+LR LGVGFG+L ATA+NLP  + EPKLHE+ + +  AA++  G+L D     
Sbjct: 414 LNVKKIIYLRALGVGFGNLKATADNLPVEAAEPKLHETSEMVFAAASSLCGKLCDTCCCM 473

Query: 209 XXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSCS 331
             I+ CS+ NDRCA+   QLCTA ACFEC + CCE+   C+
Sbjct: 474 CFIQFCSRFNDRCAVTLAQLCTALACFECFSFCCEVCGECN 514


>ref|XP_015162217.1| PREDICTED: uncharacterized protein LOC102599336 isoform X2 [Solanum
           tuberosum]
          Length = 743

 Score =  119 bits (297), Expect = 3e-27
 Identities = 53/100 (53%), Positives = 71/100 (71%)
 Frame = +2

Query: 29  LNGRKIIFLRNLGVGFGSLTATAENLPPRSGEPKLHESKDFIMKAATNYWGRLVDRXXXX 208
           LN +KII+LR LGVGFG+L ATA+NLP  + EPKLHE+ + +  AA+N  G+L D     
Sbjct: 643 LNVKKIIYLRALGVGFGNLKATADNLPVEAAEPKLHETSEMVFSAASNLCGKLCDFCCCM 702

Query: 209 XXIRACSQLNDRCAIAFTQLCTAFACFECLNCCCELSDSC 328
             I+ CS+++DRCA+   QLCTA ACFEC++ CCE+   C
Sbjct: 703 CFIQFCSRVSDRCAVTLAQLCTALACFECISFCCEVCGEC 742

