BLASTX nr result

ID: Coptis23_contig00006928 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00006928
         (1597 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI24209.3| unnamed protein product [Vitis vinifera]              281   5e-73
ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus c...   261   3e-67
ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791...   224   7e-56
ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800...   221   3e-55
ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [S...   192   2e-46

>emb|CBI24209.3| unnamed protein product [Vitis vinifera]
          Length = 1805

 Score =  281 bits (718), Expect = 5e-73
 Identities = 162/422 (38%), Positives = 220/422 (52%), Gaps = 32/422 (7%)
 Frame = +1

Query: 1    LDFGKRKAIPDTVICNGIMLEASTSERKKYWVAESLVPLHILKAYEEKKLARISSKMKPG 180
            LDFGKRK IPD V+ +G +LE S+SERKKYW+ ES VPLH+LKA+EEK++AR SS +  G
Sbjct: 1211 LDFGKRKIIPDVVVKHGSILEESSSERKKYWLDESHVPLHLLKAFEEKRIARKSSNINSG 1270

Query: 181  HIDXXXXXXXXXXXXXXXXXXXXXXXXXDYYKCGHCNKDVLIRDAVNCQDCEGFFHKRHV 360
             ++                         + Y+CGHC KDVL R+AV+CQ C+G+FHKRHV
Sbjct: 1271 KLNEGGREMKKPSKDKGFSYLFLKAERSENYQCGHCKKDVLTREAVSCQYCKGYFHKRHV 1330

Query: 361  RKSEHSSAAECTYTCQRCRD-EVLXXXXXXXXXXXXXXXXXXXXMAIANGRPKRLARRV- 534
            RKS  S +AECTYTC +C+D + +                    +    G+  +  R + 
Sbjct: 1331 RKSAGSISAECTYTCHKCQDGKPMKINAKIGNVQSQKGKKGSTDLYKKKGKAYKNCRLLG 1390

Query: 535  ---------KYMPVQ----RKKIRGAK----------------KRKQGKSRNQQCNKSKK 627
                     K  PV+    RK   G +                +R   K + +   K KK
Sbjct: 1391 SKSGKKIFTKEQPVRSCKGRKPSTGKRPVRSLVKREVSTVVPLRRSARKIKFRTPKKPKK 1450

Query: 628  GMCW-SKGRRTHVYYSYWRNGIRFTRSPNDGCAVQFRETKLLLPSEHPRSNIILPKCCLC 804
               W  K RRT V YSYW NG+  +R PND   +QFR  +L +PSEH    I  P C LC
Sbjct: 1451 ETSWKKKKRRTLVCYSYWLNGLLLSRMPNDDRVMQFRRERLFVPSEHLNVVIDKPTCHLC 1510

Query: 805  SEASYRSTLIYVNCESCQEWFHGKAFGVTTENSSTILGFRCHNCCKRNPPICPYLLDAQV 984
            +EA +   L Y+NCE C +WFHG AFG+  E    ++GFRCH CCKR PP CP+L     
Sbjct: 1511 AEAGHTPMLNYINCEICGDWFHGDAFGLDVETIGNLIGFRCHECCKRTPPACPHLQGMSR 1570

Query: 985  NEVGLRKGENDAGVECIVTDKFAGNNGTSEEQNTHPELDLTNIDQKIVADPNSFLERQNA 1164
            +E  L + ++D G++C+V    A     S+     P L +  +D+ I  +     E+  A
Sbjct: 1571 DEAQLDEVKSDVGIDCLVPQSEAYVRQESQSDEDSPGLFV--VDESIHKE-----EQVGA 1623

Query: 1165 VP 1170
            VP
Sbjct: 1624 VP 1625


>ref|XP_002513535.1| hypothetical protein RCOM_1578820 [Ricinus communis]
            gi|223547443|gb|EEF48938.1| hypothetical protein
            RCOM_1578820 [Ricinus communis]
          Length = 1915

 Score =  261 bits (668), Expect = 3e-67
 Identities = 147/379 (38%), Positives = 200/379 (52%), Gaps = 40/379 (10%)
 Frame = +1

Query: 1    LDFGKRKAIPDTVICNGIMLEASTSERKKYWVAESLVPLHILKAYEEKKLARISSKMKPG 180
            LDFGKRK IP+ V  NG ++E S+SERKKYW+ ES VPL++LK++E+K++AR SSKM  G
Sbjct: 1364 LDFGKRKCIPEIVSKNGSIVEESSSERKKYWLNESYVPLYLLKSFEQKRIARRSSKMTSG 1423

Query: 181  HIDXXXXXXXXXXXXXXXXXXXXXXXXXDYYKCGHCNKDVLIRDAVNCQDCEGFFHKRHV 360
             +                          ++++CGHCNKDV +R+AV CQ C+GFFHKRHV
Sbjct: 1424 KLSDASVSMKKPLKKRGFSYLFAKAERPEHHQCGHCNKDVPVREAVCCQYCKGFFHKRHV 1483

Query: 361  RKSEHSSAAECTYTCQRC-------------RDEVLXXXXXXXXXXXXXXXXXXXXMAIA 501
            RKS  S +AEC YTC RC             +++                      +  +
Sbjct: 1484 RKSAGSMSAECKYTCHRCVAGKYMKMDSKTGKNDEKRGKNKNRSTKTHNQKSKKTTVGSS 1543

Query: 502  NGRPK---------RLARRVK------YMPVQR------------KKIRGAKKRKQGKSR 600
            +  PK         RL R  K       +P++R            KK RG KK KQ K +
Sbjct: 1544 SVHPKNSKKTLRSSRLLRSQKNKKATVVVPLRRSPRKAKLNSLQNKKSRGRKKGKQAKPK 1603

Query: 601  NQQCNKSKKGMCWSKGRRTHVYYSYWRNGIRFTRSPNDGCAVQFRETKLLLPSEHPRSNI 780
                 K  K   W K +RT  Y+++W NG+  TR P+D   + FR  + L PSE    + 
Sbjct: 1604 KTTGKKPTKVTSWRK-KRTQAYHNFWLNGLFLTRKPDDERVMHFRRKRFLAPSESAIHD- 1661

Query: 781  ILPKCCLCSEASYRSTLIYVNCESCQEWFHGKAFGVTTENSSTILGFRCHNCCKRNPPIC 960
              PKC LCSEA   STL Y++CE C EW+HG AFG+  ENS+ ++GFRCH C    PP+C
Sbjct: 1662 -QPKCHLCSEAGNTSTLSYISCEICGEWYHGAAFGLDAENSNKLIGFRCHMCRNCKPPVC 1720

Query: 961  PYLLDAQVNEVGLRKGEND 1017
            P++   + +E  +   END
Sbjct: 1721 PFVAVTRNHESQMASAEND 1739


>ref|XP_003540620.1| PREDICTED: uncharacterized protein LOC100791832 [Glycine max]
          Length = 1702

 Score =  224 bits (570), Expect = 7e-56
 Identities = 137/396 (34%), Positives = 194/396 (48%), Gaps = 52/396 (13%)
 Frame = +1

Query: 1    LDFGKRKAIPDTVICNGIMLEASTSERKKYWVAESLVPLHILKAYEEKKLARISSKMKPG 180
            +DFGKR+AIPD VI +G +LE S SERKKYW+ ES VPLH+LK +EEK++ R S+  K G
Sbjct: 1298 IDFGKRRAIPDVVIKHGSLLEQSASERKKYWLEESYVPLHLLKNFEEKRIVRKSTDKKLG 1357

Query: 181  HIDXXXXXXXXXXXXXXXXXXXXXXXXXDYYKCGHCNKDVLIRDAVNCQDCEGFFHKRHV 360
             I                          D ++C HCNKDV +RDAV C  C+G+FHKRH 
Sbjct: 1358 KILEIGRVNKKIPQQRGFSYLFTRLERSDCHQCRHCNKDVAMRDAVRCLHCKGYFHKRHA 1417

Query: 361  RKSEHSSAAECTYTCQRCRD------------------EVLXXXXXXXXXXXXXXXXXXX 486
            RKS        +Y+C RC+D                  ++                    
Sbjct: 1418 RKSGGKRTTGSSYSCHRCQDGLHAKTNTNKRKVDSKLQKIQAKKRKTVPSVCKPVNLKGN 1477

Query: 487  XMAIANGRPKRL-ARRVKYMPVQ---RKKIRGAK----------------------KRKQ 588
              A++N + ++  +R  K +P     R+  R AK                      K KQ
Sbjct: 1478 KKALSNNKIRQARSRNSKNIPSSIPLRRSTRKAKSLYMQSQLNGGHKKGKKNVGRKKGKQ 1537

Query: 589  GKSRNQQCNKSKK--------GMCWSKGRRTHVYYSYWRNGIRFTRSPNDGCAVQFRETK 744
            GK++     KSK+         +  ++ +RT +  SYW NG++ +R PND   + F+E K
Sbjct: 1538 GKTKKVIPQKSKETTGQYKKSEVTTARKKRTKICNSYWLNGLQLSRKPNDERVMLFKEKK 1597

Query: 745  LLLPSEHPRSNIILPKCCLCSEASYRSTLIYVNCESCQEWFHGKAFGVTTENSSTILGFR 924
             +  S+    ++  PKCCLC       TL Y+ CE C +WFHG AFG+  EN+  ++GF+
Sbjct: 1598 RVASSKDFSGSLDHPKCCLC--CGNECTLNYIACEICGDWFHGDAFGLNVENARQLIGFK 1655

Query: 925  CHNCCKRNPPICPYLLDAQVNEVGLRKGENDAGVEC 1032
            CH C  R  PICP+L   +VN   L   E++A +EC
Sbjct: 1656 CHVCLDRTAPICPHL---KVN--ALSCTESNAAIEC 1686


>ref|XP_003538947.1| PREDICTED: uncharacterized protein LOC100800973 [Glycine max]
          Length = 1735

 Score =  221 bits (564), Expect = 3e-55
 Identities = 135/399 (33%), Positives = 193/399 (48%), Gaps = 55/399 (13%)
 Frame = +1

Query: 1    LDFGKRKAIPDTVICNGIMLEASTSERKKYWVAESLVPLHILKAYEEKKLARISSKMKPG 180
            +DFGKR+AIPD VI  G +LE S+SERKKYW+ E+ VPLH+LK +EEK++ R S+  K G
Sbjct: 1328 IDFGKRRAIPDVVIKQGSLLEQSSSERKKYWLEETYVPLHLLKNFEEKRIVRKSTDKKLG 1387

Query: 181  HIDXXXXXXXXXXXXXXXXXXXXXXXXXDYYKCGHCNKDVLIRDAVNCQDCEGFFHKRHV 360
             I                          D ++CGHCNKDV +RDAV C  C+G+FHKRHV
Sbjct: 1388 KILEIGRVNKKIPQQKGFSYLFTRLERSDCHQCGHCNKDVAMRDAVRCLHCKGYFHKRHV 1447

Query: 361  RKSEHSSAAECTYTCQRCRD------------------EVLXXXXXXXXXXXXXXXXXXX 486
            RKS  +     +Y+C RC+D                  ++                    
Sbjct: 1448 RKSSGTRTTGSSYSCHRCQDGLQAKTNTNKRKVDSKLQKIQAKKRKIVPSVCKSLNLKGN 1507

Query: 487  XMAIANGRPKRL-ARRVKYMPVQ---RKKIRGAKK----------RKQGKSRNQQCNKSK 624
              A +  + +++ +R  K +P     R+  R AK            K+GKS  +   + K
Sbjct: 1508 KKASSKNKIRQVRSRNSKNIPSSIPLRRSTRKAKSLYMHSQLNGGHKKGKSTKKNVGRKK 1567

Query: 625  ------KGMCWSKGRRTHVYY-----------------SYWRNGIRFTRSPNDGCAVQFR 735
                  K +   K + T   Y                 SYW NG++ +R  ND   + F+
Sbjct: 1568 GKQSQTKKVTPQKSKETTDQYKKLPVTTAHKKRTRTCNSYWLNGLQLSRKSNDERVMLFK 1627

Query: 736  ETKLLLPSEHPRSNIILPKCCLCSEASYRSTLIYVNCESCQEWFHGKAFGVTTENSSTIL 915
            E K ++ SE    ++  PKCCLC       TL Y+ CE C +WFHG AFG+  EN+  ++
Sbjct: 1628 EKKCVVSSEDFSGSVDYPKCCLC--CGNECTLNYIACEICGDWFHGDAFGLNVENTRQLI 1685

Query: 916  GFRCHNCCKRNPPICPYLLDAQVNEVGLRKGENDAGVEC 1032
            GF+CH C  R  PICP+L   ++N   L + E++A +EC
Sbjct: 1686 GFKCHVCLDRTAPICPHL---KIN--ALSRTESNAAIEC 1719


>ref|XP_002463336.1| hypothetical protein SORBIDRAFT_02g042000 [Sorghum bicolor]
            gi|241926713|gb|EER99857.1| hypothetical protein
            SORBIDRAFT_02g042000 [Sorghum bicolor]
          Length = 1688

 Score =  192 bits (488), Expect = 2e-46
 Identities = 143/487 (29%), Positives = 210/487 (43%), Gaps = 71/487 (14%)
 Frame = +1

Query: 1    LDFGKRKAIPDTVICNGIMLEASTSERKKYWVAESLVPLHILKAYEEKKLARISSKMK-- 174
            LDFGKR+ IP  +  +G  LE  +SER +YW++E  VPL++LKAYE K  AR+  K +  
Sbjct: 1018 LDFGKRENIPPVISKHGTKLEEPSSERNRYWLSEGHVPLNLLKAYEAKTFARLLKKKETD 1077

Query: 175  --PGHIDXXXXXXXXXXXXXXXXXXXXXXXXXDYYKCGHCNKDVLIRDAVNCQDCEGFFH 348
              P                                 CGHC+K+V+  +AVNCQ CE  FH
Sbjct: 1078 ELPKKTKKMRVPKPEMPRKTGFDYLFEKAEKRSTMFCGHCHKEVIASEAVNCQYCEVIFH 1137

Query: 349  KRHVRKSEHSSAAECTYTCQRCRD-EVLXXXXXXXXXXXXXXXXXXXXMAIANGRPKRLA 525
            ++H +      A    Y C +C D +VL                          + K+ +
Sbjct: 1138 RKHFKVPR--GAKNAVYVCNKCLDEKVLKVESPQKKAAPKKPSPRKKQKKQNKQKQKKQS 1195

Query: 526  RRVKYMPVQ-----RKKIRGAKKRKQGKSRNQ---------------------------- 606
            R+++    Q     +KKI   KK K+G+ R                              
Sbjct: 1196 RKIETRRNQIVLKYKKKI--GKKGKRGRPRKNPPDLSKNESSKILESEPSNVSKNEPVKR 1253

Query: 607  ------------QCNKSKKGMCWSKGRRTHVYYSYWRNGIRFTRSPNDGCAVQFRETKLL 750
                          N S+      K +RT + YSYW NG+R+T++P+D  A+ FR+ +++
Sbjct: 1254 ISKRLYDKYMKGNSNVSENAASSRKRKRTALQYSYWLNGLRWTQNPHDERAISFRKERVV 1313

Query: 751  LPSEHPRSNIILPKCCLCSEASYRSTLIYVNCESCQEWFHGKAFGVTTENSSTILGFRCH 930
             PSE    + + P CCLC E  Y    IY+ CE C++WFHG  + VT EN + ++GF+CH
Sbjct: 1314 FPSEEAEISEVSPVCCLC-EKCYCDEDIYIACEKCEDWFHGDIYSVTIENVNNLIGFKCH 1372

Query: 931  NCCKRNPPICPYLLDAQVNEVGLRKGENDA--GVECI--VTDKFA-----------GNNG 1065
             C  R+ P+CPY        V + KG++D   G++ +    DKF            G  G
Sbjct: 1373 RCRLRSLPVCPY-----AETVTILKGQSDKDHGIKFVDNSVDKFVEDEDPNCPKDLGALG 1427

Query: 1066 TSEEQNTHP-ELDLTNIDQKIVADPNSFLERQN-----AVPDIACDEKGLEHSGSYLRKQ 1227
            + +E + H  E  L     +I    N+ LE  N        D    EK LE   S  +  
Sbjct: 1428 SLQELHDHDIERRLNGHITEIEFSYNNCLEELNDHGSLKEFDAHSTEKDLEDDESLKKLD 1487

Query: 1228 RMDELSE 1248
              +EL E
Sbjct: 1488 THNELKE 1494

