BLASTX nr result

ID: Coptis23_contig00024360 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00024360
         (1130 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283542.1| PREDICTED: uncharacterized protein LOC100248...   296   8e-78
ref|XP_003535145.1| PREDICTED: uncharacterized protein LOC100818...   270   4e-70
ref|XP_002323209.1| predicted protein [Populus trichocarpa] gi|2...   267   3e-69
ref|XP_003518991.1| PREDICTED: uncharacterized protein LOC100786...   265   2e-68
ref|NP_001118848.1| hydroxyproline-rich glycoprotein family prot...   258   1e-66

>ref|XP_002283542.1| PREDICTED: uncharacterized protein LOC100248215 [Vitis vinifera]
            gi|297741707|emb|CBI32839.3| unnamed protein product
            [Vitis vinifera]
          Length = 529

 Score =  296 bits (757), Expect = 8e-78
 Identities = 170/316 (53%), Positives = 217/316 (68%), Gaps = 2/316 (0%)
 Frame = -1

Query: 1001 MGKFEQDNYLLPSSVLSQFSNSNQNGISLCCSKF-LGFRCILVFMFGLAVLLSAIFLLPP 825
            MGK E++  L  + V+S+   S+QN  S C  +  +GFRC+L  + G AV+LSAIF LPP
Sbjct: 1    MGKVEEEQPLPSAIVVSE--PSDQNVGSRCRIRGRVGFRCVLALLLGAAVMLSAIFWLPP 58

Query: 824  FSSRYKHRSDLDLKSLIQAHEVVASFKLQRPVSMLKANILQIELGIFDEIAVPNTTVVVI 645
            F  +Y  + DLDL S  + H++VASFK+++ +S+L+  +LQ+E  IF EI    + VVV+
Sbjct: 59   FL-QYADQRDLDLDSRFRGHDIVASFKVKKSISLLEDYLLQLENDIFVEIEGIESKVVVL 117

Query: 644  FLEPLAESNATSVVFAVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLHLTSSFGDPFSF 465
             LEP A +N T VVFAV                                  S FGDPF+F
Sbjct: 118  SLEPSAGTNITKVVFAVDLDAKSSRILTSQSLIRELFESLVTQQSSLRLTASLFGDPFTF 177

Query: 464  EVLKFKGGIKVIPEQTAFLLQR-QTFFNFTLNFSIHELLENFDELREQLKAGLHLTSYEN 288
            EVLKF GGI V P Q+AFLLQ+ Q  FNFTLNFSI ++LENF+EL  QLK+GLHL SYEN
Sbjct: 178  EVLKFPGGITVSPPQSAFLLQKVQILFNFTLNFSIEQILENFNELTSQLKSGLHLASYEN 237

Query: 287  IYVSLVNSYGSTVYPPTTVQAAVLLAIGNRSPPLPRLKQLARTITGPHAKNLGLNHTIFG 108
            +Y+SL NS GSTV PPTTVQ++VLLA+GN +P LPRLKQLA+TITG H++NLGLN+T+FG
Sbjct: 238  LYISLTNSKGSTVSPPTTVQSSVLLAVGN-TPSLPRLKQLAQTITGSHSRNLGLNNTVFG 296

Query: 107  RVKQVRLSSAMQHSLN 60
            RVKQVRLSS +QHSL+
Sbjct: 297  RVKQVRLSSILQHSLH 312


>ref|XP_003535145.1| PREDICTED: uncharacterized protein LOC100818532 [Glycine max]
          Length = 507

 Score =  270 bits (691), Expect = 4e-70
 Identities = 162/316 (51%), Positives = 205/316 (64%), Gaps = 2/316 (0%)
 Frame = -1

Query: 1001 MGKFEQDNYLLPSSVLSQFSNSNQNGISLCCSKFLGFRCILVFMFGLAVLLSAIFLLPPF 822
            MGK   +++LLPS V ++    N      C    +GFRC++V +F +AV LSA+F LPPF
Sbjct: 1    MGK-PGEHHLLPSGVAAEDPRRNAASPPGCA---VGFRCLVVLLFSVAVFLSALFWLPPF 56

Query: 821  SSRYKHRSDLDLKSLIQAHEVVASFKLQRPVSMLKANILQIELGIFDEIAVPNTTVVVIF 642
            +  +    DL + S  + H++VASF +Q+PVS+L+ NILQ+   IF+EI V +T VV++ 
Sbjct: 57   A-HFADPKDLHINSKYKDHDIVASFYVQKPVSLLEENILQLSNDIFEEIGVLSTKVVILS 115

Query: 641  LEPLAESNATSVVFAVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLHL-TSSFGDPFSF 465
            L+PL +SN T VVFAV                              L L TS FG P  F
Sbjct: 116  LDPLPQSNTTKVVFAVDPDSKYSEMSAAAISLIRASFKYLVIRQSYLQLSTSLFGVPSVF 175

Query: 464  EVLKFKGGIKVIPEQTAFLLQR-QTFFNFTLNFSIHELLENFDELREQLKAGLHLTSYEN 288
            EVLKFKGGI +IP+Q+ F LQ  QT FNFTLNFSI+E+  NFDEL  QLK+GLHL  YEN
Sbjct: 176  EVLKFKGGITIIPQQSVFPLQMVQTLFNFTLNFSIYEIQSNFDELTSQLKSGLHLAPYEN 235

Query: 287  IYVSLVNSYGSTVYPPTTVQAAVLLAIGNRSPPLPRLKQLARTITGPHAKNLGLNHTIFG 108
            +YV L NS GSTV  PT VQ++VLLA+G   P   RLKQLA+TI G H+ NLGLN+T FG
Sbjct: 236  LYVILSNSEGSTVTAPTVVQSSVLLAVG-IPPSKERLKQLAQTIMGHHSWNLGLNNTQFG 294

Query: 107  RVKQVRLSSAMQHSLN 60
            RVKQVRLSS +QHSL+
Sbjct: 295  RVKQVRLSSILQHSLH 310


>ref|XP_002323209.1| predicted protein [Populus trichocarpa] gi|222867839|gb|EEF04970.1|
           predicted protein [Populus trichocarpa]
          Length = 497

 Score =  267 bits (683), Expect = 3e-69
 Identities = 155/285 (54%), Positives = 194/285 (68%), Gaps = 2/285 (0%)
 Frame = -1

Query: 908 SKFLGFRCILVFMFGLAVLLSAIFLLPPFSSRYKHRSDLDLKSLIQAHEVVASFKLQRPV 729
           ++F+GFRC+ V +  +AV LSA+F LPPF   +  + DLDL   I+ H++VASF +++PV
Sbjct: 44  TRFIGFRCVFVLLLSVAVFLSAVFWLPPFL-HFADQGDLDLDYRIKDHDIVASFLVKKPV 102

Query: 728 SMLKANILQIELGIFDEIAVPNTTVVVIFLEPLAESNATSVVFAVVXXXXXXXXXXXXXX 549
            +L+ N L+++  IFDE+ VPNT VV++ LEPLA SN T VVF V               
Sbjct: 103 FLLEDNKLKLQGDIFDEMRVPNTKVVILSLEPLAGSNRTKVVFGVDPLENDSKISSTDQS 162

Query: 548 XXXXXXXXXXXXXXXLHLTSS-FGDPFSFEVLKFKGGIKVIPEQTAFLLQR-QTFFNFTL 375
                          L LT S FGD  SFEVLKF GGI +IP Q AFLLQ+ Q  FNFTL
Sbjct: 163 LIRGSFVSLVVNDSSLELTKSLFGDASSFEVLKFPGGITIIPPQRAFLLQKVQIPFNFTL 222

Query: 374 NFSIHELLENFDELREQLKAGLHLTSYENIYVSLVNSYGSTVYPPTTVQAAVLLAIGNRS 195
           NFSI ++ E F EL+ QLKAGLHLT  EN+Y+ L NS GSTV PPTTV+++VLL IGN  
Sbjct: 223 NFSILQIREKFAELKSQLKAGLHLTPIENLYIELWNSQGSTVSPPTTVKSSVLLVIGN-- 280

Query: 194 PPLPRLKQLARTITGPHAKNLGLNHTIFGRVKQVRLSSAMQHSLN 60
              PRLKQLA+TI G ++KNLGLN+TIFGRVKQVRLSS +QHSL+
Sbjct: 281 --TPRLKQLAQTIRG-NSKNLGLNNTIFGRVKQVRLSSILQHSLH 322


>ref|XP_003518991.1| PREDICTED: uncharacterized protein LOC100786981 [Glycine max]
          Length = 483

 Score =  265 bits (677), Expect = 2e-68
 Identities = 159/316 (50%), Positives = 201/316 (63%), Gaps = 2/316 (0%)
 Frame = -1

Query: 1001 MGKFEQDNYLLPSSVLSQFSNSNQNGISLCCSKFLGFRCILVFMFGLAVLLSAIFLLPPF 822
            MGK  +D+  LPS+   +    N    S C    +G RC++V +F +AV LSA+F LPPF
Sbjct: 1    MGKPGEDHLSLPSA---EDPRRNAAAASGCA---VGLRCLVVLLFSVAVFLSALFWLPPF 54

Query: 821  SSRYKHRSDLDLKSLIQAHEVVASFKLQRPVSMLKANILQIELGIFDEIAVPNTTVVVIF 642
            +  +    DL L S  + H++VASF +Q+PVS+L+ NILQ+   IF+EI  P+T V+++ 
Sbjct: 55   A-HFADPKDLYLNSKYKDHDIVASFYVQKPVSLLEDNILQLSNDIFEEIGAPSTKVIILS 113

Query: 641  LEPLAESNATSVVFAVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLHLTSS-FGDPFSF 465
            L+PL  SN T VVFAV                              L LT+  FG P  F
Sbjct: 114  LDPLPRSNTTKVVFAVDPDGKYSEMSAAAISLIRASFKYLVIRQSYLQLTTFLFGVPSVF 173

Query: 464  EVLKFKGGIKVIPEQTAFLLQR-QTFFNFTLNFSIHELLENFDELREQLKAGLHLTSYEN 288
            EVLKFKGGI +IP+Q+ F LQ  QT FNFTLNFSI+E+  NFDEL  QLK+GLHL  YEN
Sbjct: 174  EVLKFKGGITIIPQQSVFPLQTVQTLFNFTLNFSIYEIQSNFDELTSQLKSGLHLAPYEN 233

Query: 287  IYVSLVNSYGSTVYPPTTVQAAVLLAIGNRSPPLPRLKQLARTITGPHAKNLGLNHTIFG 108
            +YV L NS GSTV  PT VQ++VLLA+G   P   RLKQLA+TI G H+ NLGLN+T FG
Sbjct: 234  LYVILSNSEGSTVVAPTVVQSSVLLAVG-IPPSKERLKQLAQTIMGHHSWNLGLNNTQFG 292

Query: 107  RVKQVRLSSAMQHSLN 60
            R KQVRLSS +QHSL+
Sbjct: 293  RAKQVRLSSILQHSLH 308


>ref|NP_001118848.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|332646020|gb|AEE79541.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 489

 Score =  258 bits (660), Expect = 1e-66
 Identities = 151/320 (47%), Positives = 193/320 (60%), Gaps = 8/320 (2%)
 Frame = -1

Query: 1001 MGKFEQDNYLLP-SSVLSQFSNSNQNGISLCC-----SKFLGFRCILVFMFGLAVLLSAI 840
            MGK   +   LP S   +   N+   GIS CC     S +   RC+L+  F  AV LSA+
Sbjct: 1    MGKNTVEEQNLPVSDGAASARNNGGGGISTCCCCDWISSYFSLRCVLILAFSAAVFLSAL 60

Query: 839  FLLPPFSSRYKHRSDLDLKSLIQAHEVVASFKLQRPVSMLKANILQIELGIFDEIAVPNT 660
            F LPPF   +    DLDL    + H +VASF + +P+S ++ N++Q+E  I DEI+ P T
Sbjct: 61   FWLPPFLG-FADPGDLDLDPRFKDHRIVASFDVGKPISFMEDNLMQLENDITDEISFPMT 119

Query: 659  TVVVIFLEPLAESNATSVVFAVVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLHLTSS-F 483
             VVV+ LE L + N T V+FA+                                LT S F
Sbjct: 120  KVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAAFETLVQKQLSFRLTESLF 179

Query: 482  GDPFSFEVLKFKGGIKVIPEQTAFLLQR-QTFFNFTLNFSIHELLENFDELREQLKAGLH 306
            G+PF FEVLKF GGI VIP Q  F LQ+ Q  FNFTLNFSI+++  NF+EL  QLK G++
Sbjct: 180  GEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIYQIQSNFEELASQLKKGIN 239

Query: 305  LTSYENIYVSLVNSYGSTVYPPTTVQAAVLLAIGNRSPPLPRLKQLARTITGPHAKNLGL 126
            L SYEN+Y++L NS GSTV PPT V ++VLL  G+ S    RLKQLA+TIT  H+KNLGL
Sbjct: 240  LASYENLYITLSNSRGSTVAPPTIVHSSVLLTFGSSS----RLKQLAQTITSSHSKNLGL 295

Query: 125  NHTIFGRVKQVRLSSAMQHS 66
            NHT+FG+VKQVRLSS + HS
Sbjct: 296  NHTVFGKVKQVRLSSILPHS 315


Top