BLASTX nr result

ID: Coptis23_contig00025647 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00025647
         (1151 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003634725.1| PREDICTED: uncharacterized protein LOC100264...   264   3e-68
emb|CAN83957.1| hypothetical protein VITISV_039906 [Vitis vinifera]   257   4e-66
ref|XP_002533083.1| conserved hypothetical protein [Ricinus comm...   245   2e-62
ref|NP_001154270.2| uncharacterized protein [Arabidopsis thalian...   203   8e-50
ref|NP_194431.3| uncharacterized protein [Arabidopsis thaliana] ...   203   8e-50

>ref|XP_003634725.1| PREDICTED: uncharacterized protein LOC100264016 [Vitis vinifera]
          Length = 2563

 Score =  264 bits (675), Expect = 3e-68
 Identities = 170/398 (42%), Positives = 224/398 (56%), Gaps = 15/398 (3%)
 Frame = +1

Query: 1    PDFSPLFICTLEKGIRLLDSDSGSFKLSEKSMISMYVSNTLSYILQTQVDGRXXXXXXXX 180
            P FSPL IC LEK  R+L S SG+F L+EKS+IS+YVSNTL+Y+LQTQVD          
Sbjct: 768  PHFSPLIICVLEKCQRVLKSGSGTFTLAEKSIISLYVSNTLTYLLQTQVDPGLLSSLLDL 827

Query: 181  XXXERFGTSAVCDNSATSFCEWRPLKNLLLFSRSISYQNDDSREFIHEREPLKNPLLLSG 360
               ER       ++      EWRPLKNLLLFS+ IS          H+R           
Sbjct: 828  VLSERL------EDQCLDSMEWRPLKNLLLFSQDIS----------HQRH---------- 861

Query: 361  IASYQEAGGSMCSTRKDPSA--TCRSFAKALSKTKKIIKSVCGGSLAGVAIAFCSSLVCA 534
                       C    D  A  T  SF   L++ ++I++S     L G+A  F SS+V  
Sbjct: 862  ----------YCIFSIDEKARHTDSSFNDTLAEVQRIVRSGHDSGLTGIAKMFSSSIVGT 911

Query: 535  SADEILENFPTVITVSAQVLGSRLPFLSTIFFHGENLLARVASLWPDIFCSSLELV-VVK 711
            + D+IL+NFP+VITVS  + G     LS+I FH  +LLAR + LWPDIF S L+ V ++ 
Sbjct: 912  TPDDILKNFPSVITVSQDLQGVPFALLSSISFHDRSLLARASKLWPDIFFSGLQRVGLMI 971

Query: 712  GSTSYIKDETLIRTTNSSLEELISSRDFDSKESAAVAFASYLKQESFHVLCPAIMGISSR 891
             S     D   I + + S EE+    DF   ESA+VAF+ +L+Q  FHVL PAIM I   
Sbjct: 972  HSKGKGDDNCRIPSHSLSAEEIFPKTDFGLSESASVAFSLFLQQAPFHVLFPAIMNIDGP 1031

Query: 892  RLLDSTKLIDFLQAKLSDCSMDSLVASLRLLLFWAHQIQSSYRVEPVGELEQLSRVCFIL 1071
             LL+ +K+   L AKLS+ + D L+ SLR +LFW HQI+S YR+ P+GELE L  VCFIL
Sbjct: 1032 YLLEPSKVQQLLLAKLSEQTTDYLILSLRHVLFWIHQIRSYYRIRPLGELEHLFEVCFIL 1091

Query: 1072 IKHIFG--LVAKPDTACT----------QEIAEIIFHH 1149
            ++ +    LV +PD+ C+          QE+AEIIF H
Sbjct: 1092 VERMLDELLVLRPDSDCSTTIGVPFSTVQEVAEIIFCH 1129


>emb|CAN83957.1| hypothetical protein VITISV_039906 [Vitis vinifera]
          Length = 2715

 Score =  257 bits (656), Expect = 4e-66
 Identities = 169/398 (42%), Positives = 222/398 (55%), Gaps = 15/398 (3%)
 Frame = +1

Query: 1    PDFSPLFICTLEKGIRLLDSDSGSFKLSEKSMISMYVSNTLSYILQTQVDGRXXXXXXXX 180
            P FSPL IC LEK  R+L S SG+F L+EKS+IS+YVSNTL+Y+LQTQ+           
Sbjct: 814  PHFSPLIICVLEKCQRVLKSGSGTFTLAEKSIISLYVSNTLTYLLQTQILD-CYLSLLDL 872

Query: 181  XXXERFGTSAVCDNSATSFCEWRPLKNLLLFSRSISYQNDDSREFIHEREPLKNPLLLSG 360
               ER       ++      EWRPLKNLLLFS+ IS          H R           
Sbjct: 873  VLSERL------EDQCLDSMEWRPLKNLLLFSQDIS----------HXRH---------- 906

Query: 361  IASYQEAGGSMCSTRKDPSA--TCRSFAKALSKTKKIIKSVCGGSLAGVAIAFCSSLVCA 534
                       C    D  A  T  SF   L++ ++I++S     L G+A  F SS+V  
Sbjct: 907  ----------YCIFSIDEKARHTDSSFNDTLAEVQRIVRSGHDSGLTGIAKMFSSSIVGT 956

Query: 535  SADEILENFPTVITVSAQVLGSRLPFLSTIFFHGENLLARVASLWPDIFCSSLELV-VVK 711
            + D+IL+NFP+VITVS  + G     LS+I FH  +LLAR + LWPDIF S L+ V ++ 
Sbjct: 957  TPDDILKNFPSVITVSQDLQGVPFALLSSISFHDRSLLARASKLWPDIFFSGLQRVGLMI 1016

Query: 712  GSTSYIKDETLIRTTNSSLEELISSRDFDSKESAAVAFASYLKQESFHVLCPAIMGISSR 891
             S     D   I + + S EE+    DF   ESA+VAF+ +L+Q  FHVL PAIM I   
Sbjct: 1017 HSKGKGDDNCRIPSHSLSAEEIFPKTDFGLSESASVAFSLFLQQAPFHVLFPAIMNIDGP 1076

Query: 892  RLLDSTKLIDFLQAKLSDCSMDSLVASLRLLLFWAHQIQSSYRVEPVGELEQLSRVCFIL 1071
             LL+ +K+   L AKLS+ + D L+ SLR +LFW HQIQS YR+ P+GELE L  VCFIL
Sbjct: 1077 YLLEPSKVQQLLLAKLSEQTTDYLILSLRHVLFWIHQIQSYYRIRPLGELEHLFEVCFIL 1136

Query: 1072 IKHIFG--LVAKPDTACT----------QEIAEIIFHH 1149
            ++ +    LV +PD+ C+          QE+AEIIF H
Sbjct: 1137 VERMLDELLVLRPDSDCSTTIGVPFSTVQEVAEIIFCH 1174


>ref|XP_002533083.1| conserved hypothetical protein [Ricinus communis]
            gi|223527122|gb|EEF29298.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 2587

 Score =  245 bits (625), Expect = 2e-62
 Identities = 155/395 (39%), Positives = 224/395 (56%), Gaps = 12/395 (3%)
 Frame = +1

Query: 1    PDFSPLFICTLEKGIRLLDSDSGSFKLSEKSMISMYVSNTLSYILQTQVDGRXXXXXXXX 180
            P FSPL IC L+K +RLL S+SG+F + EKSMIS YV NTL Y+LQTQVD R        
Sbjct: 809  PKFSPLIICVLQKCMRLLSSESGTFSIPEKSMISAYVCNTLKYLLQTQVDARLLAALIRS 868

Query: 181  XXXERFGTSAVCDNSATSFCEWRPLKNLLLFSRSISYQNDDSREFIHEREPLKNPLLLSG 360
               E        D    S CEW+PLKNLLL + S+  Q      F+ +++ L   +    
Sbjct: 869  VLSEGLEDHVSVD----SLCEWQPLKNLLLMAESLLNQKTCCL-FLTDQKDLPIDI---- 919

Query: 361  IASYQEAGGSMCSTRKDPSATCRSFAKALSKTKKIIKSVC-GGSLAGVAIAFCSSLVCAS 537
                                   SF KAL + +KIIKS   GG +AG+  AFCS+++C +
Sbjct: 920  -----------------------SFTKALGEIRKIIKSENDGGEIAGITKAFCSAIICTT 956

Query: 538  ADEILENFPTVITVSAQVLGSRLPF--LSTIFFHGENLLARVASLWPDIFCSSLELVVVK 711
            +D +L+NFP V+T+S Q+   R+P   LS+I F  ++ L+  + LWP +F   LE     
Sbjct: 957  SDVVLKNFPAVMTISQQI---RVPLSCLSSIVFQHQSSLSGASKLWPQVFFPGLEK---- 1009

Query: 712  GSTSYIKDETLIRTTNSSLEELISSRDFDSKES-AAVAFASYLKQESFHVLCPAIMGISS 888
             + S I  + +    ++  +E++ + DFD+ E+ AA AF  +L+Q  FHVL P I+  + 
Sbjct: 1010 -ACSMINPQGM--GNDAVAQEIMLNMDFDASEATAAAAFGLFLRQAPFHVLFPTIISSNG 1066

Query: 889  RRLLDSTKLIDFLQAKLSDCSMDSLVASLRLLLFWAHQIQSSYRVEPVGELEQLSRVCFI 1068
              LL+ +K  D L AKLS+C  D +V+ LRLLLFW +QIQ SYR++P+ +LE+ + +C+I
Sbjct: 1067 TCLLEPSKTKDLLMAKLSECKSDFVVSYLRLLLFWFYQIQVSYRIKPLVKLEEFAEICYI 1126

Query: 1069 LIKHIFG--LVAKPDTA------CTQEIAEIIFHH 1149
            L+KH+    LV K D+         +E AE IF+H
Sbjct: 1127 LVKHMLDQLLVLKADSGNPLSAELIREAAESIFYH 1161


>ref|NP_001154270.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332659884|gb|AEE85284.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 2402

 Score =  203 bits (516), Expect = 8e-50
 Identities = 129/367 (35%), Positives = 199/367 (54%)
 Frame = +1

Query: 7    FSPLFICTLEKGIRLLDSDSGSFKLSEKSMISMYVSNTLSYILQTQVDGRXXXXXXXXXX 186
            FSPL IC L+K +RLL+S+S +  L EKS IS+YV +TL Y+LQTQVD +          
Sbjct: 621  FSPLIICLLQKCVRLLNSESKT-SLPEKSAISLYVCSTLKYLLQTQVDSKLLSCLIQSVL 679

Query: 187  XERFGTSAVCDNSATSFCEWRPLKNLLLFSRSISYQNDDSREFIHEREPLKNPLLLSGIA 366
             E      V D S  S CEWRPL+ LL FS+S+S +                P++L    
Sbjct: 680  SE------VVDESKDSLCEWRPLRMLLCFSQSLSNE---------------KPIILH--- 715

Query: 367  SYQEAGGSMCSTRKDPSATCRSFAKALSKTKKIIKSVCGGSLAGVAIAFCSSLVCASADE 546
                      S R        SFA+ L + K++++S+    +AG+  AF S+L+CA+ + 
Sbjct: 716  ----------SRRTTGLPADSSFAETLDEIKRLVRSISPDEIAGIVKAFSSALICATPES 765

Query: 547  ILENFPTVITVSAQVLGSRLPFLSTIFFHGENLLARVASLWPDIFCSSLELVVVKGSTSY 726
            IL+NF +V+ VS    G+   FL +I F  EN L  ++ L PD+F S  E          
Sbjct: 766  ILQNFASVMDVSWAFYGTPFSFLQSITFLEENFLGNLSKLSPDLFASGSEFTGSGNLCEG 825

Query: 727  IKDETLIRTTNSSLEELISSRDFDSKESAAVAFASYLKQESFHVLCPAIMGISSRRLLDS 906
              D  +  + +SS+ E I S+  D+++  + AF+ +LKQ  F VL  AIM +    L + 
Sbjct: 826  TVDSEIDFSGHSSVTEEIRSK-MDNRDMESSAFSIFLKQAPFPVLLNAIMSMDISCLPEF 884

Query: 907  TKLIDFLQAKLSDCSMDSLVASLRLLLFWAHQIQSSYRVEPVGELEQLSRVCFILIKHIF 1086
             ++ + L  K+S     S+ ++++L+LFW  QI+SSY+V+P   L QLS +C  L+K++F
Sbjct: 885  PRISELLLLKVSQPKSGSIDSNIQLILFWLFQIRSSYKVQPAPVLHQLSEICLRLMKNLF 944

Query: 1087 GLVAKPD 1107
              +++P+
Sbjct: 945  SQISEPE 951


>ref|NP_194431.3| uncharacterized protein [Arabidopsis thaliana]
            gi|332659883|gb|AEE85283.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 2374

 Score =  203 bits (516), Expect = 8e-50
 Identities = 129/367 (35%), Positives = 199/367 (54%)
 Frame = +1

Query: 7    FSPLFICTLEKGIRLLDSDSGSFKLSEKSMISMYVSNTLSYILQTQVDGRXXXXXXXXXX 186
            FSPL IC L+K +RLL+S+S +  L EKS IS+YV +TL Y+LQTQVD +          
Sbjct: 621  FSPLIICLLQKCVRLLNSESKT-SLPEKSAISLYVCSTLKYLLQTQVDSKLLSCLIQSVL 679

Query: 187  XERFGTSAVCDNSATSFCEWRPLKNLLLFSRSISYQNDDSREFIHEREPLKNPLLLSGIA 366
             E      V D S  S CEWRPL+ LL FS+S+S +                P++L    
Sbjct: 680  SE------VVDESKDSLCEWRPLRMLLCFSQSLSNE---------------KPIILH--- 715

Query: 367  SYQEAGGSMCSTRKDPSATCRSFAKALSKTKKIIKSVCGGSLAGVAIAFCSSLVCASADE 546
                      S R        SFA+ L + K++++S+    +AG+  AF S+L+CA+ + 
Sbjct: 716  ----------SRRTTGLPADSSFAETLDEIKRLVRSISPDEIAGIVKAFSSALICATPES 765

Query: 547  ILENFPTVITVSAQVLGSRLPFLSTIFFHGENLLARVASLWPDIFCSSLELVVVKGSTSY 726
            IL+NF +V+ VS    G+   FL +I F  EN L  ++ L PD+F S  E          
Sbjct: 766  ILQNFASVMDVSWAFYGTPFSFLQSITFLEENFLGNLSKLSPDLFASGSEFTGSGNLCEG 825

Query: 727  IKDETLIRTTNSSLEELISSRDFDSKESAAVAFASYLKQESFHVLCPAIMGISSRRLLDS 906
              D  +  + +SS+ E I S+  D+++  + AF+ +LKQ  F VL  AIM +    L + 
Sbjct: 826  TVDSEIDFSGHSSVTEEIRSK-MDNRDMESSAFSIFLKQAPFPVLLNAIMSMDISCLPEF 884

Query: 907  TKLIDFLQAKLSDCSMDSLVASLRLLLFWAHQIQSSYRVEPVGELEQLSRVCFILIKHIF 1086
             ++ + L  K+S     S+ ++++L+LFW  QI+SSY+V+P   L QLS +C  L+K++F
Sbjct: 885  PRISELLLLKVSQPKSGSIDSNIQLILFWLFQIRSSYKVQPAPVLHQLSEICLRLMKNLF 944

Query: 1087 GLVAKPD 1107
              +++P+
Sbjct: 945  SQISEPE 951


Top