BLASTX nr result

ID: Coptis25_contig00011868

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00011868
         (2397 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285507.2| PREDICTED: outer envelope protein of 80 kDa,...   785   0.0  
ref|XP_002513472.1| sorting and assembly machinery (sam50) prote...   782   0.0  
ref|XP_002871896.1| hypothetical protein ARALYDRAFT_909999 [Arab...   776   0.0  
ref|NP_568378.1| outer envelope protein [Arabidopsis thaliana] g...   768   0.0  
ref|XP_003542049.1| PREDICTED: outer envelope protein of 80 kDa,...   764   0.0  

>ref|XP_002285507.2| PREDICTED: outer envelope protein of 80 kDa, chloroplastic [Vitis
            vinifera]
          Length = 673

 Score =  785 bits (2028), Expect = 0.0
 Identities = 411/587 (70%), Positives = 459/587 (78%), Gaps = 12/587 (2%)
 Frame = +1

Query: 49   NENEDVKFTQSSIKLPTLNIYSPTTTLPFCSQTLTSNLLKTRQSFTNFISSITTRVKTNK 228
            ++NEDV+FT SS+K+P     SP +   F SQTL S+L +  +S  + ++S     K   
Sbjct: 2    SKNEDVRFTSSSLKIPL----SPPS---FFSQTLGSHLTEATKSVIHLVNSFRNFRKPLN 54

Query: 229  XXXXXXXXXXXXNK-----------QDXXXXXXXXXXXXXXXXXXXRVLISEVLIRNKDG 375
                        +            +                    RVLISEVL+RNKDG
Sbjct: 55   FLARPSPLLCSASLSLSQPAESTQLEVAATQPKGQTVARHPREDEERVLISEVLVRNKDG 114

Query: 376  EELERKDLEAEASMALKACRPNSALTVREVQEDVHRIMARGYFCSCMPVAVDTRDGIRLV 555
            EELERKDLEAEA  ALKACRPNSALTVREVQEDVHRI+  G F SCMPVAVDTRDGIRLV
Sbjct: 115  EELERKDLEAEAVAALKACRPNSALTVREVQEDVHRIIDSGLFWSCMPVAVDTRDGIRLV 174

Query: 556  FQVEPNQDFQGLVCEGANVLPSKFLEDAFRDGHGKIVNIRRLNEVVHSIDGWYRERGLFG 735
            FQVEPNQ+FQGLVCEGANVLPSKFLEDAFRDG+GK+VNIRRL++V+ SI+ WY ERGLFG
Sbjct: 175  FQVEPNQEFQGLVCEGANVLPSKFLEDAFRDGYGKVVNIRRLDDVITSINDWYNERGLFG 234

Query: 736  LVSDLEILSGGIIRLQVSEAEVNNITIRFLDKRTGEPTTGKTKPETILRQLTTKKGQVYS 915
            +VS +EILSGGIIRL+VSEAEVN+I++RFLD++TGEPT GKTKPETILRQLTTKKGQVYS
Sbjct: 235  MVSGVEILSGGIIRLKVSEAEVNDISVRFLDRKTGEPTIGKTKPETILRQLTTKKGQVYS 294

Query: 916  LLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVERV-XXXXXXXXXXXXXXXX 1092
            L+QGKRD ETVLTMGIMEDVSII Q  GD  K+DL+MNVVERV                 
Sbjct: 295  LIQGKRDAETVLTMGIMEDVSIIHQSVGDRDKIDLVMNVVERVSGGFSAGGGISRGITTS 354

Query: 1093 XXXXXLVGSFAYSHRNVFGRNQKLNVSWERGQIDSIFRINYTDPWIEGDDKRTSRSIMVQ 1272
                 L+GSFAYSHRNVFGRNQKLNVS ERGQ+DSIFRINYTDPWIEGDDKRTSRSIM+Q
Sbjct: 355  RPLSGLIGSFAYSHRNVFGRNQKLNVSLERGQVDSIFRINYTDPWIEGDDKRTSRSIMIQ 414

Query: 1273 NSRTPGTLVHGNQPDSSNVTIGRITAGIEFSRPIRPNWSGTAGIIYQRAGARDERGNPLR 1452
            NSRTPG LVHG QP +S++TIGR+TAGIEFSRP RPNWSGT G+I+Q AGA DE G P+ 
Sbjct: 415  NSRTPGILVHGGQPANSSLTIGRVTAGIEFSRPFRPNWSGTVGLIFQHAGAHDEHGKPII 474

Query: 1453 KDYYSSPLTASGNTHDDMLLAKLESVYTDSGDHGSSMLVFNMEQGLPVMAEWLSFNRVNA 1632
            KD+YSSPLTASGNTHDD LLAK ESVYT SGDHGSSM VFNMEQGLPV+ EWL FNRVNA
Sbjct: 475  KDFYSSPLTASGNTHDDALLAKFESVYTGSGDHGSSMFVFNMEQGLPVLPEWLFFNRVNA 534

Query: 1633 RARKGVELGPARFLLSLSGGHVVGSFSPHEAFAIGGTNSVRGYEEGA 1773
            RARKGVE+GPA  LLSLSGGHVVG+FSPHEAFAIGGTNSVRGYEEGA
Sbjct: 535  RARKGVEIGPACLLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGA 581


>ref|XP_002513472.1| sorting and assembly machinery (sam50) protein, putative [Ricinus
            communis] gi|223547380|gb|EEF48875.1| sorting and
            assembly machinery (sam50) protein, putative [Ricinus
            communis]
          Length = 700

 Score =  782 bits (2020), Expect = 0.0
 Identities = 385/480 (80%), Positives = 427/480 (88%)
 Frame = +1

Query: 334  RVLISEVLIRNKDGEELERKDLEAEASMALKACRPNSALTVREVQEDVHRIMARGYFCSC 513
            RVLISEVL+RNKDGEELERKDLEAEA  ALKACR NSALTVREVQEDVHRI+  GYFCSC
Sbjct: 129  RVLISEVLVRNKDGEELERKDLEAEAVAALKACRANSALTVREVQEDVHRIIDSGYFCSC 188

Query: 514  MPVAVDTRDGIRLVFQVEPNQDFQGLVCEGANVLPSKFLEDAFRDGHGKIVNIRRLNEVV 693
             PVAVDTRDGIRLVFQVEPNQ+F GLVCEGA+VLP+KFL+DAFR+G+GK+VNIR L++V+
Sbjct: 189  TPVAVDTRDGIRLVFQVEPNQEFHGLVCEGASVLPTKFLQDAFREGYGKVVNIRHLDDVI 248

Query: 694  HSIDGWYRERGLFGLVSDLEILSGGIIRLQVSEAEVNNITIRFLDKRTGEPTTGKTKPET 873
             SI+GWY ERGLFGLVS +EILSGGI+RLQV+EAEVNNI+IRFLD++TGEPT GKTKPET
Sbjct: 249  TSINGWYMERGLFGLVSGVEILSGGILRLQVAEAEVNNISIRFLDRKTGEPTKGKTKPET 308

Query: 874  ILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVERVXXX 1053
            ILRQLTTKKGQVYS+LQGKRDV+TVLTMGIMEDVSIIPQPAGDTGKVDL+MNVVER    
Sbjct: 309  ILRQLTTKKGQVYSMLQGKRDVDTVLTMGIMEDVSIIPQPAGDTGKVDLVMNVVERPSGG 368

Query: 1054 XXXXXXXXXXXXXXXXXXLVGSFAYSHRNVFGRNQKLNVSWERGQIDSIFRINYTDPWIE 1233
                              L+GSF YSHRNVFGRNQKLN+S ERGQIDSIFRINYTDPWI+
Sbjct: 369  FSAGGGISSGITSGPLSGLIGSFTYSHRNVFGRNQKLNISLERGQIDSIFRINYTDPWIQ 428

Query: 1234 GDDKRTSRSIMVQNSRTPGTLVHGNQPDSSNVTIGRITAGIEFSRPIRPNWSGTAGIIYQ 1413
            GDDKRTSR+IMVQNSRTPG LVH  QP +S++TIGR+TAG+EFSRP+RP WSGTAG+I+Q
Sbjct: 429  GDDKRTSRTIMVQNSRTPGNLVHSYQPGNSSLTIGRVTAGVEFSRPLRPKWSGTAGLIFQ 488

Query: 1414 RAGARDERGNPLRKDYYSSPLTASGNTHDDMLLAKLESVYTDSGDHGSSMLVFNMEQGLP 1593
             AGA DE+GNP+ KD+YSSPLTASG THD+MLLAK ESVYT SGDHGSSM V N+EQGLP
Sbjct: 489  HAGAHDEKGNPIIKDHYSSPLTASGKTHDNMLLAKFESVYTGSGDHGSSMFVLNVEQGLP 548

Query: 1594 VMAEWLSFNRVNARARKGVELGPARFLLSLSGGHVVGSFSPHEAFAIGGTNSVRGYEEGA 1773
            +  EWL FNRVNARARKGVE+GPA FLLSLSGGHVVG+FSPHEAFAIGGTNSVRGYEEGA
Sbjct: 549  LWPEWLFFNRVNARARKGVEIGPALFLLSLSGGHVVGNFSPHEAFAIGGTNSVRGYEEGA 608


>ref|XP_002871896.1| hypothetical protein ARALYDRAFT_909999 [Arabidopsis lyrata subsp.
            lyrata] gi|297317733|gb|EFH48155.1| hypothetical protein
            ARALYDRAFT_909999 [Arabidopsis lyrata subsp. lyrata]
          Length = 732

 Score =  776 bits (2004), Expect = 0.0
 Identities = 377/480 (78%), Positives = 420/480 (87%)
 Frame = +1

Query: 334  RVLISEVLIRNKDGEELERKDLEAEASMALKACRPNSALTVREVQEDVHRIMARGYFCSC 513
            RVLISEVL+R KDGEELERKDLE EA  ALKACR NSALT+REVQEDVHRI+  GYFCSC
Sbjct: 161  RVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFCSC 220

Query: 514  MPVAVDTRDGIRLVFQVEPNQDFQGLVCEGANVLPSKFLEDAFRDGHGKIVNIRRLNEVV 693
             PVAVDTRDGIRL+FQVEPNQ+F+GLVCE ANVLPSKF+++AFRDG GK++NI+RL E +
Sbjct: 221  TPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIQEAFRDGFGKVINIKRLEEAI 280

Query: 694  HSIDGWYRERGLFGLVSDLEILSGGIIRLQVSEAEVNNITIRFLDKRTGEPTTGKTKPET 873
             SI+GWY ERGLFG+VSD++ LSGGI+RLQV+EAEVNNI+IRFLD++TGEPT GKT PET
Sbjct: 281  TSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSPET 340

Query: 874  ILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVERVXXX 1053
            ILRQLTTKKGQVYS+LQGKRDV+TVL MGIMEDVSIIPQPAGDTGKVDLIMN VER    
Sbjct: 341  ILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDTGKVDLIMNCVERPSGG 400

Query: 1054 XXXXXXXXXXXXXXXXXXLVGSFAYSHRNVFGRNQKLNVSWERGQIDSIFRINYTDPWIE 1233
                              L+GSFAYSHRN+FGRNQKLNVS ERGQIDSIFRINYTDPWIE
Sbjct: 401  FSAGGGISSGITSGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDPWIE 460

Query: 1234 GDDKRTSRSIMVQNSRTPGTLVHGNQPDSSNVTIGRITAGIEFSRPIRPNWSGTAGIIYQ 1413
            GDDKRTSRSIMVQNSRTPG LVHGNQPD+S++TIGR+TAGIE+SRP RP WSGTAG+I+Q
Sbjct: 461  GDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGIEYSRPFRPKWSGTAGLIFQ 520

Query: 1414 RAGARDERGNPLRKDYYSSPLTASGNTHDDMLLAKLESVYTDSGDHGSSMLVFNMEQGLP 1593
             AGARDE+GNP+ KD+YSSPLTASG THDD LLAKLES+YT SGD GS+M  FNMEQGLP
Sbjct: 521  HAGARDEQGNPIIKDFYSSPLTASGKTHDDTLLAKLESIYTGSGDRGSTMFAFNMEQGLP 580

Query: 1594 VMAEWLSFNRVNARARKGVELGPARFLLSLSGGHVVGSFSPHEAFAIGGTNSVRGYEEGA 1773
            V+ EWL FNRV  RARKG+ +GPARFL SLSGGHVVG+FSPHEAF IGGTNS+RGYEEGA
Sbjct: 581  VLPEWLCFNRVTGRARKGIHIGPARFLFSLSGGHVVGNFSPHEAFVIGGTNSIRGYEEGA 640


>ref|NP_568378.1| outer envelope protein [Arabidopsis thaliana]
            gi|75168961|sp|Q9C5J8.1|OEP80_ARATH RecName: Full=Outer
            envelope protein 80, chloroplastic; AltName:
            Full=Chloroplastic outer envelope protein of 80 kDa;
            Short=AtOEP80; AltName: Full=Protein TOC75-V;
            Short=AtToc75-V gi|13430586|gb|AAK25915.1|AF360205_1
            unknown protein [Arabidopsis thaliana]
            gi|14532858|gb|AAK64111.1| unknown protein [Arabidopsis
            thaliana] gi|332005348|gb|AED92731.1| outer envelope
            protein [Arabidopsis thaliana]
          Length = 732

 Score =  768 bits (1982), Expect = 0.0
 Identities = 372/480 (77%), Positives = 417/480 (86%)
 Frame = +1

Query: 334  RVLISEVLIRNKDGEELERKDLEAEASMALKACRPNSALTVREVQEDVHRIMARGYFCSC 513
            RVLISEVL+R KDGEELERKDLE EA  ALKACR NSALT+REVQEDVHRI+  GYFCSC
Sbjct: 161  RVLISEVLVRTKDGEELERKDLEMEALAALKACRANSALTIREVQEDVHRIIESGYFCSC 220

Query: 514  MPVAVDTRDGIRLVFQVEPNQDFQGLVCEGANVLPSKFLEDAFRDGHGKIVNIRRLNEVV 693
             PVAVDTRDGIRL+FQVEPNQ+F+GLVCE ANVLPSKF+ +AFRDG GK++NI+RL E +
Sbjct: 221  TPVAVDTRDGIRLMFQVEPNQEFRGLVCENANVLPSKFIHEAFRDGFGKVINIKRLEEAI 280

Query: 694  HSIDGWYRERGLFGLVSDLEILSGGIIRLQVSEAEVNNITIRFLDKRTGEPTTGKTKPET 873
             SI+GWY ERGLFG+VSD++ LSGGI+RLQV+EAEVNNI+IRFLD++TGEPT GKT PET
Sbjct: 281  TSINGWYMERGLFGIVSDIDTLSGGIVRLQVAEAEVNNISIRFLDRKTGEPTKGKTSPET 340

Query: 874  ILRQLTTKKGQVYSLLQGKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVERVXXX 1053
            ILRQLTTKKGQVYS+LQGKRDV+TVL MGIMEDVSIIPQPAGD+GKVDLIMN VER    
Sbjct: 341  ILRQLTTKKGQVYSMLQGKRDVDTVLAMGIMEDVSIIPQPAGDSGKVDLIMNCVERPSGG 400

Query: 1054 XXXXXXXXXXXXXXXXXXLVGSFAYSHRNVFGRNQKLNVSWERGQIDSIFRINYTDPWIE 1233
                              L+GSFAYSHRN+FGRNQKLNVS ERGQIDSIFRINYTDPWIE
Sbjct: 401  FSAGGGISSGITSGPLSGLIGSFAYSHRNLFGRNQKLNVSLERGQIDSIFRINYTDPWIE 460

Query: 1234 GDDKRTSRSIMVQNSRTPGTLVHGNQPDSSNVTIGRITAGIEFSRPIRPNWSGTAGIIYQ 1413
            GDDKRTSRSIMVQNSRTPG LVHGNQPD+S++TIGR+TAG+E+SRP RP W+GTAG+I+Q
Sbjct: 461  GDDKRTSRSIMVQNSRTPGNLVHGNQPDNSSLTIGRVTAGVEYSRPFRPKWNGTAGLIFQ 520

Query: 1414 RAGARDERGNPLRKDYYSSPLTASGNTHDDMLLAKLESVYTDSGDHGSSMLVFNMEQGLP 1593
             AGARDE+GNP+ KD+YSSPLTASG  HD+ +LAKLES+YT SGD GS+M  FNMEQGLP
Sbjct: 521  HAGARDEQGNPIIKDFYSSPLTASGKPHDETMLAKLESIYTGSGDQGSTMFAFNMEQGLP 580

Query: 1594 VMAEWLSFNRVNARARKGVELGPARFLLSLSGGHVVGSFSPHEAFAIGGTNSVRGYEEGA 1773
            V+ EWL FNRV  RARKG+ +GPARFL SLSGGHVVG FSPHEAF IGGTNSVRGYEEGA
Sbjct: 581  VLPEWLCFNRVTGRARKGIHIGPARFLFSLSGGHVVGKFSPHEAFVIGGTNSVRGYEEGA 640


>ref|XP_003542049.1| PREDICTED: outer envelope protein of 80 kDa, chloroplastic-like
            [Glycine max]
          Length = 677

 Score =  764 bits (1974), Expect = 0.0
 Identities = 394/601 (65%), Positives = 458/601 (76%), Gaps = 10/601 (1%)
 Frame = +1

Query: 55   NEDVKFTQSSIKLPTLNIYS----PTTTLPFCSQTLTSNLLKTRQSFTNFISSITTRVKT 222
            N+DV+   SSIK+P  +I      P  T        T+++ +   SFT+  + +T  V  
Sbjct: 4    NDDVRIVSSSIKIPLPSISKHPTCPLRTAHSHIANATNSIAQLINSFTSHSAELTRSVIQ 63

Query: 223  NKXXXXXXXXXXXXNKQDXXXXXXXXXXXXXXXXXXX------RVLISEVLIRNKDGEEL 384
                          +++                          RVLISEVL+RNKDGEEL
Sbjct: 64   KSSLLCSATLSLTGDRKRKCPIRRLASLSLAEEAQQKARQNEERVLISEVLVRNKDGEEL 123

Query: 385  ERKDLEAEASMALKACRPNSALTVREVQEDVHRIMARGYFCSCMPVAVDTRDGIRLVFQV 564
            ERKDLEAEA+ ALKACRPNSALTVREVQEDVHRI+  GYF SCMPVAVDTRDGIRLVFQV
Sbjct: 124  ERKDLEAEAAQALKACRPNSALTVREVQEDVHRIINSGYFSSCMPVAVDTRDGIRLVFQV 183

Query: 565  EPNQDFQGLVCEGANVLPSKFLEDAFRDGHGKIVNIRRLNEVVHSIDGWYRERGLFGLVS 744
            EPNQ+FQGLVCEGANVLP+KFLED+ RDG+GKI+N+RRL+E + SI+ WY ERGLF +VS
Sbjct: 184  EPNQEFQGLVCEGANVLPAKFLEDSMRDGYGKIINLRRLDEAISSINNWYMERGLFAMVS 243

Query: 745  DLEILSGGIIRLQVSEAEVNNITIRFLDKRTGEPTTGKTKPETILRQLTTKKGQVYSLLQ 924
             +EILSGGI+RLQVSEAEV+NI+IRFLD++TGE T GKTKPETILRQ+TTKKGQVYS+L+
Sbjct: 244  AVEILSGGILRLQVSEAEVDNISIRFLDRKTGETTMGKTKPETILRQITTKKGQVYSMLE 303

Query: 925  GKRDVETVLTMGIMEDVSIIPQPAGDTGKVDLIMNVVERVXXXXXXXXXXXXXXXXXXXX 1104
            GKRDVETVLTMGIMEDVSIIPQPA DTGKVDL+MNVVER                     
Sbjct: 304  GKRDVETVLTMGIMEDVSIIPQPA-DTGKVDLVMNVVERPSGGFSAGGGISSGITNGPLR 362

Query: 1105 XLVGSFAYSHRNVFGRNQKLNVSWERGQIDSIFRINYTDPWIEGDDKRTSRSIMVQNSRT 1284
             L+GSFAYSHRNVFG+NQKLN+S ERGQIDS++RINYTDPWI+GDDKRTSR+IM+QNSRT
Sbjct: 363  GLIGSFAYSHRNVFGKNQKLNISLERGQIDSVYRINYTDPWIQGDDKRTSRTIMIQNSRT 422

Query: 1285 PGTLVHGNQPDSSNVTIGRITAGIEFSRPIRPNWSGTAGIIYQRAGARDERGNPLRKDYY 1464
            PGT+VHGN   + ++TIGRIT GIEFSRPIRP WSGTAG+++Q AG RDE+G P+ KD Y
Sbjct: 423  PGTIVHGNADGNGSLTIGRITGGIEFSRPIRPKWSGTAGLVFQHAGVRDEKGIPIIKDCY 482

Query: 1465 SSPLTASGNTHDDMLLAKLESVYTDSGDHGSSMLVFNMEQGLPVMAEWLSFNRVNARARK 1644
            SSPLTASGNTHDD LLAKLE+VYT SGDHGSS+ V NME+GLP++ EWLSF RVNARARK
Sbjct: 483  SSPLTASGNTHDDTLLAKLETVYTGSGDHGSSLFVLNMEKGLPLLPEWLSFTRVNARARK 542

Query: 1645 GVELGPARFLLSLSGGHVVGSFSPHEAFAIGGTNSVRGYEEGAXXXXXXXXXXXXXXXFP 1824
            GVE+GPAR  LS+SGGHVVG+FSP+EAFAIGGTNSVRGYEEG+               FP
Sbjct: 543  GVEIGPARLHLSISGGHVVGNFSPYEAFAIGGTNSVRGYEEGSVGSGRSYIVGSGEISFP 602

Query: 1825 M 1827
            M
Sbjct: 603  M 603

