BLASTX nr result

ID: Coptis21_contig00007780 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00007780
         (1329 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN68458.1| hypothetical protein VITISV_031449 [Vitis vinifera]   286   6e-75
ref|NP_181184.1| uncharacterized protein [Arabidopsis thaliana] ...   241   4e-61
ref|XP_002524913.1| conserved hypothetical protein [Ricinus comm...   238   3e-60
ref|XP_002327764.1| predicted protein [Populus trichocarpa] gi|2...   238   3e-60
ref|XP_002515544.1| conserved hypothetical protein [Ricinus comm...   237   4e-60

>emb|CAN68458.1| hypothetical protein VITISV_031449 [Vitis vinifera]
          Length = 439

 Score =  286 bits (733), Expect = 6e-75
 Identities = 171/432 (39%), Positives = 248/432 (57%), Gaps = 9/432 (2%)
 Frame = -3

Query: 1327 KKNEKEEAAPSEEQDKLLATAMEGNLGNIYXXXXXXXXXXSRVPQKLRKLNESCYEPAIA 1148
            +++E+ EA  +   D LLA ++   L ++            RVP  LR++NE  + P I 
Sbjct: 9    EQSERREA--NGNADSLLAASINEKLSSL--TSLPSQCCIYRVPDTLRRVNEEAFVPRIL 64

Query: 1147 SIGPLHRGKAALQFVENHKWHYLGKLISRKSGLTLEVCIKTVRELEERARCCYTEYIDLG 968
            SIGP+H GK  L+ +E HKW YL  L+ RK G  +E  +K +RELE R R CY E I   
Sbjct: 65   SIGPVHHGKKRLRDMEGHKWQYLKALLQRKPGTMVERYVKAMRELEARTRGCYAEIIKFD 124

Query: 967  SDAFVEMMVLDGCFIIETFFRFTHMRFA-EDDFLGTVGWWSCNLVRDLLLIENQIPFLIL 791
            SD FV MM+LDGCFIIE F +  + +   EDD +        +L RDL+L+ENQ+PF +L
Sbjct: 125  SDEFVTMMLLDGCFIIELFLKNKNKQLRDEDDPIFNRTMVLTDLHRDLILLENQLPFFVL 184

Query: 790  ECLFGLTVVPEVEEFTLSELAL-----YCLGYGSPVDKEEVRNLKGMHLLDLMRQSFLSP 626
            E LF L    + E  + S L L       LG    + ++   ++K  HLLDL+R  F+ P
Sbjct: 185  ETLFNLIENTDQEGPSTSVLELTYVFFKFLGLQEVLIRDSQPDVK--HLLDLLRLWFVPP 242

Query: 625  DTNMLADESNDDDKLFKLVHSLTELEEAGVKVKVNHNRHLVGIRFKDGVLEIPHLVVDEF 446
             +     +S      F+L+ S+TEL EAGVK ++     L+ I+F +GVLEIP L +++ 
Sbjct: 243  SSTKSTSKSK-----FELIRSVTELHEAGVKFRMGTVSCLMEIKFINGVLEIPPLTIEDT 297

Query: 445  TEPLFRNIIAYEQLFDCDRM---FVTNYIVLMDRLINTSKDACVLHGCGVIENCLGRNEK 275
            T+ L  N+IA+EQ   C+R    ++T+Y++LM+ LIN+ KD  +L   G+I N LG NE 
Sbjct: 298  TDSLLGNLIAFEQC--CNRFTPHYITDYVILMEYLINSPKDVALLSRYGIINNLLGDNEG 355

Query: 274  LSLLFNNLNQEVCLNRRHFQYTQICRELNAYLEQPWHLWKASLRRKCFNSPWXXXXXXXX 95
            +S LF  L +EV  N   FQ++ +CR++N Y +  WH+W+A+LRR  FN+PW        
Sbjct: 356  VSHLFKKLGKEVVFNSDKFQFSNLCRDVNKYHKTRWHIWRATLRRDYFNNPWAIISFIGA 415

Query: 94   XXXXISTFLQTV 59
                  T +QTV
Sbjct: 416  VLLLFFTLIQTV 427


>ref|NP_181184.1| uncharacterized protein [Arabidopsis thaliana]
            gi|4581143|gb|AAD24627.1| hypothetical protein
            [Arabidopsis thaliana] gi|18377795|gb|AAL67047.1| unknown
            protein [Arabidopsis thaliana] gi|21280851|gb|AAM45097.1|
            unknown protein [Arabidopsis thaliana]
            gi|330254159|gb|AEC09253.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 448

 Score =  241 bits (614), Expect = 4e-61
 Identities = 138/393 (35%), Positives = 211/393 (53%), Gaps = 10/393 (2%)
 Frame = -3

Query: 1204 RVPQKLRKLNESCYEPAIASIGPLHRGKAALQFVENHKWHYLGKLISRKSGLTLEVCIKT 1025
            RVPQ +   N  CYEP + SIGP HRG+  L+ +E HKW YL  L++R   LTLE  +K+
Sbjct: 49   RVPQSMIDCNGRCYEPRVVSIGPYHRGQTQLKMIEEHKWRYLNVLLTRTQNLTLEDYMKS 108

Query: 1024 VRELEERARCCYTEYIDLGSDAFVEMMVLDGCFIIETFFRFTHM-RFAEDDFLGTVGWWS 848
            V+ +EE AR CY+E I + S+ F EMMVLDGCF++E F +  ++  F  +D L  + W  
Sbjct: 109  VKNVEEVARECYSETIHMDSEEFNEMMVLDGCFLLELFRKVNNLVPFEPNDPLVAMAWVL 168

Query: 847  CNLVRDLLLIENQIPFLILECLFGLTVVPEVEEFTLSELALYCLGYGSPVDKEE-----V 683
                RD L +ENQIPF +LE LF LT      E   S  +L    + + + + E      
Sbjct: 169  PFFYRDFLCLENQIPFFVLETLFNLTRGDNENETNASLQSLAFAFFNNMMHRTEEDLARF 228

Query: 682  RNLKGMHLLDLMRQSFLSPDTNMLADESNDDDK---LFKLVHSLTELEEAGVKVK-VNHN 515
            + L+  HLLDL+R SF+ P++ +    + +  K      ++HS+++L  AG+K++ +   
Sbjct: 229  KELRAKHLLDLLRSSFI-PESELHTPPATNPGKEKMPSHIIHSISKLRRAGIKLRELKDA 287

Query: 514  RHLVGIRFKDGVLEIPHLVVDEFTEPLFRNIIAYEQLFDCDRMFVTNYIVLMDRLINTSK 335
               + +RF+ G +E+P + VD+F      N +AYEQ      M  T Y  L+D L NT K
Sbjct: 288  ESFLVVRFRHGTIEMPAITVDDFMSSFLENCVAYEQCHVACSMHFTTYATLLDCLTNTYK 347

Query: 334  DACVLHGCGVIENCLGRNEKLSLLFNNLNQEVCLNRRHFQYTQICRELNAYLEQPWHLWK 155
            D   L    +IEN  G + +L+   N+L ++V  +        +  E+N Y +  WH+  
Sbjct: 348  DVEYLCDQNIIENYFGTDTELAKFVNSLGRDVAFDITQCYLKDLFEEVNEYYKSSWHVEW 407

Query: 154  ASLRRKCFNSPWXXXXXXXXXXXXISTFLQTVY 56
            A+ +   FNSPW            + + +QT+Y
Sbjct: 408  ATFKFTYFNSPWSFVSALAALVLLVLSVIQTIY 440


>ref|XP_002524913.1| conserved hypothetical protein [Ricinus communis]
            gi|223535748|gb|EEF37410.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 531

 Score =  238 bits (607), Expect = 3e-60
 Identities = 146/417 (35%), Positives = 224/417 (53%), Gaps = 34/417 (8%)
 Frame = -3

Query: 1204 RVPQKLRKLNESCYEPAIASIGPLHRGKAALQFVENHKWHYLGKLISRKSGLTLEVCIKT 1025
            RVP  LR+ ++    P I S+GP H  K  L+ ++ HKW  L  ++ R +   +++ + +
Sbjct: 107  RVPHYLREGDDKAIVPQIVSLGPYHHAKRRLRQMDRHKWRSLQHVLKR-TNKDIKLFLDS 165

Query: 1024 VRELEERARCCYTEYIDLGSDAFVEMMVLDGCFIIETFFR----FTHMRFAEDDFLGTVG 857
            VRE+EERAR CY   I L S+ FVEMMVLDGCF++E F      F  + +A +D +  + 
Sbjct: 166  VREVEERARSCYEGTIGLSSNEFVEMMVLDGCFVLELFRGAAEGFKQLGYARNDPIFAMR 225

Query: 856  WWSCNLVRDLLLIENQIPFLILECLFGLTVVPEVEEFTLSELALY----CLGYGSPVDKE 689
                ++ RD++++ENQ+P  IL+ L GL      ++  +++LAL      +    P+ K 
Sbjct: 226  GSMHSIQRDMIMLENQLPLFILDLLLGLQFDNPDQKGFVAKLALTFFDPLMPTDEPLTKS 285

Query: 688  EVRNLK----------------GMHLLDLMRQSFLSPDT-----NMLADESND----DDK 584
            E   L+                G+H LD+ R+S L         N +   SN+    D +
Sbjct: 286  EKNKLESSLGYATTFDPLADQGGLHCLDVFRRSLLRSGPKPVPRNWIKRWSNNNRVADKR 345

Query: 583  LFKLVHSLTELEEAGVKVKVNHNRHLVGIRFKDGVLEIPHLVVDEFTEPLFRNIIAYEQL 404
              +L+H +TEL EAG+K K         I+FKDGVL IP L+V + T+ LF N+IA+EQ 
Sbjct: 346  RTQLIHCVTELREAGIKFKKRKTDRFWDIKFKDGVLRIPRLLVHDGTKSLFLNLIAFEQC 405

Query: 403  -FDCDRMFVTNYIVLMDRLINTSKDACVLHGCGVIENCLGRNEKLSLLFNNLNQEVCLNR 227
              DC    +T+Y+V MD LIN+ +D   LH CG++E+ LG + +++ LFN L QEV  + 
Sbjct: 406  HLDCGND-ITSYVVFMDNLINSPQDVAYLHYCGILEHWLGSDAEVADLFNRLCQEVVFDI 464

Query: 226  RHFQYTQICRELNAYLEQPWHLWKASLRRKCFNSPWXXXXXXXXXXXXISTFLQTVY 56
                 +Q+ +++N Y    W+ W+ASL+   F +PW              TF QT Y
Sbjct: 465  NDSYLSQLSQDVNQYYNHRWNTWRASLKHNYFGNPWAIISLVAAVVLLALTFAQTFY 521


>ref|XP_002327764.1| predicted protein [Populus trichocarpa] gi|222836849|gb|EEE75242.1|
            predicted protein [Populus trichocarpa]
          Length = 419

 Score =  238 bits (606), Expect = 3e-60
 Identities = 136/391 (34%), Positives = 214/391 (54%), Gaps = 8/391 (2%)
 Frame = -3

Query: 1204 RVPQKLRKLNESCYEPAIASIGPLHRGKAALQFVENHKWHYLGKLISR--KSGLTLEVCI 1031
            +VPQ+   +N   Y+P + SIGP H G+  L+ +E HKW YLG ++SR    GL LEV +
Sbjct: 25   KVPQRFIDINGKSYQPHVVSIGPYHHGEEHLKMIEEHKWRYLGSILSRTQNKGLDLEVLL 84

Query: 1030 KTVRELEERARCCYTEYIDLGSDAFVEMMVLDGCFIIETFFRFTHM-RFAEDDFLGTVGW 854
            K ++ LE++AR CY++ I   +D F+EMMV+DGCFIIE F +  ++  F  DD + T+ W
Sbjct: 85   KAIQPLEKKARECYSQIIHFDTDEFIEMMVVDGCFIIELFRKVGNVVEFEVDDPIVTMAW 144

Query: 853  WSCNLVRDLLLIENQIPFLILECLFGLTVVPEVEEF-TLSELALYCLGYGSPVDKEEV-- 683
                  RDLL +ENQIPF +LECLF +T +P  E   +L +L+L    Y        +  
Sbjct: 145  IIPFFYRDLLRLENQIPFFVLECLFDITRMPGEESGPSLCKLSLDFFNYALQRPDNIIAR 204

Query: 682  -RNLKGMHLLDLMRQSFLSPDTNMLADESNDDDKLFKLVHSLTELEEAGVKVKV-NHNRH 509
              +L   HLLDL+R SF+  +      +S   D    ++HS+++L  AG+++   N    
Sbjct: 205  HNDLNAKHLLDLVRSSFIDFEQG----QSLHVDTSTPMIHSVSKLRRAGIELSQGNPEDS 260

Query: 508  LVGIRFKDGVLEIPHLVVDEFTEPLFRNIIAYEQLFDCDRMFVTNYIVLMDRLINTSKDA 329
             + ++FK+GV+E+P + +DE       N +A+EQ  +      T Y  L+D L+NT KD 
Sbjct: 261  FLVVKFKNGVIEMPTITIDETVTSFLLNCVAFEQCHNGSSKHFTTYATLLDCLVNTFKDV 320

Query: 328  CVLHGCGVIENCLGRNEKLSLLFNNLNQEVCLNRRHFQYTQICRELNAYLEQPWHLWKAS 149
              L  C +IEN  G + +++   N+L +EV  +      + +  +++ Y +   H+  AS
Sbjct: 321  EHLSDCNIIENYFGTDSEVASFINDLGKEVAFDIERCYLSSLFHDVDQYYKNSRHVQWAS 380

Query: 148  LRRKCFNSPWXXXXXXXXXXXXISTFLQTVY 56
             +   F++PW            + T  QTVY
Sbjct: 381  FKYTYFSTPWSFVSALAALIILLLTVSQTVY 411


>ref|XP_002515544.1| conserved hypothetical protein [Ricinus communis]
            gi|223545488|gb|EEF46993.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 439

 Score =  237 bits (605), Expect = 4e-60
 Identities = 147/419 (35%), Positives = 225/419 (53%), Gaps = 12/419 (2%)
 Frame = -3

Query: 1279 LLATAMEGNLGNIYXXXXXXXXXXSRVPQKLRKLNESCYEPAIASIGPLHRGKAALQFVE 1100
            LL T+M+  L N+            RVP+++R +N + Y P + SIGP H GK  L+ +E
Sbjct: 22   LLLTSMKSRLENL--SPVSSERCIYRVPKRIRDVNHNAYTPRLVSIGPFHHGKPGLKAME 79

Query: 1099 NHKWHYLGKLISRKSGLTLEVCIKTVRELEERARCCYTEYIDLGSDAFVEMMVLDGCFII 920
             HKW +L   + R++ + L+  +K +++ EERAR CY E I L SD FV+++++D  F I
Sbjct: 80   EHKWRHLQNFL-RQTRVKLDDLVKFIKDREERARNCYAETIALTSDEFVQILIVDATFTI 138

Query: 919  ETFF-RFTHMRFAEDDFLGTVGWWSCNLVRDLLLIENQIPFLILECLF---------GLT 770
            +    +         + +        ++ RD+LLIENQ+P+ IL  +          G +
Sbjct: 139  DILLGKVIPQLTCAIECVYDRSSLMFDIYRDMLLIENQLPYFILGDILDFAKSIAASGSS 198

Query: 769  VVPEVEEFTLSELALYC-LGYGS-PVDKEEVRNLKGMHLLDLMRQSFLSPDTNMLADESN 596
              P + E T      Y  LG  S P+ + EV      H +D +R         +   ++ 
Sbjct: 199  QWPSILELTRVYFNSYMQLGRASHPMRRSEVN-----HFVDFLRLCHQP----IKPRQTP 249

Query: 595  DDDKLFKLVHSLTELEEAGVKVKVNHNRHLVGIRFKDGVLEIPHLVVDEFTEPLFRNIIA 416
             +++ F++  S TEL EAGVK KV    HL+ I+F DGVLEIP++ V E TE  FRN+IA
Sbjct: 250  RENRKFEMTRSSTELREAGVKFKVASTTHLLDIQFNDGVLEIPYIRVSEITEAFFRNLIA 309

Query: 415  YEQLFDCDRMFVTNYIVLMDRLINTSKDACVLHGCGVIENCLGRNEKLSLLFNNLNQEVC 236
            +EQ   C   ++++YIV+MD LINT  D  VL   G+++  L  N + S LFNNL +E+ 
Sbjct: 310  FEQCH-CHTSYISDYIVIMDSLINTPHDVEVLVKYGIMKVMLANNVEASTLFNNLAKEIL 368

Query: 235  LNRRHFQYTQICRELNAYLEQPWHLWKASLRRKCFNSPWXXXXXXXXXXXXISTFLQTV 59
             +   F Y+ +C +LN + +  WH WKA+L+   FN+PW            + TF+Q V
Sbjct: 369  YDSHVFYYSLLCEDLNTFCKVRWHRWKATLKHNYFNTPWTAISVIAGVILLVLTFIQAV 427


Top