BLASTX nr result

ID: Rehmannia32_contig00021897 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00021897
         (1021 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PIN26064.1| hypothetical protein CDL12_01200 [Handroanthus im...   384   e-122
ref|XP_020550380.1| uncharacterized protein LOC105163355 [Sesamu...   346   e-107
ref|XP_012832877.1| PREDICTED: uncharacterized protein LOC105953...   191   5e-51
gb|EYU46650.1| hypothetical protein MIMGU_mgv1a022848mg, partial...   191   6e-51
ref|XP_022869089.1| uncharacterized protein LOC111388563 isoform...   149   4e-36
ref|XP_022869087.1| uncharacterized protein LOC111388563 isoform...   149   4e-36
gb|KZV32385.1| hypothetical protein F511_03668 [Dorcoceras hygro...   134   4e-31

>gb|PIN26064.1| hypothetical protein CDL12_01200 [Handroanthus impetiginosus]
          Length = 1018

 Score =  384 bits (986), Expect = e-122
 Identities = 202/341 (59%), Positives = 249/341 (73%), Gaps = 1/341 (0%)
 Frame = -1

Query: 1021 NKQNNVANPEEAFVGWRCKGKRSRCMKIYTVQDKNVSGFAAKRIENQRGTVVLLPVKKNI 842
            +K N VA  EEAF GWRC+ KRS+C+KI TVQDKN+S  AA R       V  LPV +N+
Sbjct: 435  SKPNYVAKLEEAFAGWRCRQKRSKCVKISTVQDKNISISAATR-----PAVAQLPVSENL 489

Query: 841  MTEKLMAGEVLNSRSPGNIRNSKCGGRSGITVQQEEYGSTEKESLKEFQRLHCVYGRRSK 662
            +TEK M  + L+SR+P ++R SKCGG + ITVQQ  YG TE++SLKEF ++HC YGRR K
Sbjct: 490  ITEKPMTRKNLDSRNPEDMRISKCGGATEITVQQPAYGFTERDSLKEFGKVHCSYGRRLK 549

Query: 661  ETANGLESDSPPQRRKMISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYLDVT 482
            +T  G++ DS P+RRK IS  DDV +IG  GTCLKLN +SSF++++DT KREEN  LDV 
Sbjct: 550  QTIEGIKPDSHPKRRKKISGHDDVRQIGAGGTCLKLNYDSSFRMRSDTEKREENSDLDVA 609

Query: 481  DQPLSELNPWIYPKRKRSRPCAPRHSIRSSNQCSPCAAIENNVEKNPTVNDETHNRVAKL 302
            D+ +  LN W +PK+KRSR  AP HS++S  Q SPC A+ENNVEK P VN+E +NRVAKL
Sbjct: 610  DKAIPMLNVWKFPKQKRSRQFAPHHSVKSGKQSSPCTAVENNVEKYPIVNEEANNRVAKL 669

Query: 301  LVYKRRCKVPLEREHIGCSLELAGDLSS-SVNHNATSDRQVQISVELDTANPSRQCGKLD 125
            LVY R+ K PLERE +G SL+LAGD SS SVN +A  DR+VQ S+E D A PS QC KLD
Sbjct: 670  LVYARQHKFPLEREQLGSSLKLAGDFSSPSVNASAILDREVQASIEFDQAKPSMQCSKLD 729

Query: 124  DRKPSSVDDFVNITSDVLSNGHCMTLQVFEGSDKSKQPFNN 2
            D KP  V+   N+ SD+ SNGH MT QVFE SDKSKQ FN+
Sbjct: 730  DGKPLVVNTPGNMKSDIFSNGHSMTSQVFEVSDKSKQTFNH 770


>ref|XP_020550380.1| uncharacterized protein LOC105163355 [Sesamum indicum]
          Length = 1048

 Score =  346 bits (887), Expect = e-107
 Identities = 190/339 (56%), Positives = 235/339 (69%), Gaps = 3/339 (0%)
 Frame = -1

Query: 1021 NKQNNVANPEEAFVGWRCKGKRSRCMKIYTVQDKNVSGFAAKRIENQRGTVVLLPVKKNI 842
            ++  + A PEEAFVGWRCK KRS+C+KIYTVQ KN S  +AK+ +  R TV  L    N+
Sbjct: 459  SESKDAAYPEEAFVGWRCKDKRSKCVKIYTVQGKNGSCSSAKQFKKHRRTVAQLSASTNV 518

Query: 841  MTEKLMAGEV--LNSRSPGNIRNSKCGGRSGITVQQEEYGSTEKESLKEFQRLHCVYGRR 668
            MTEK M G+   +N     N++  KCG    I+V Q+EYG TE+ESLK FQ+++    RR
Sbjct: 519  MTEKPMTGKYFSINQEDTRNLQTGKCGKPPEISVPQDEYGYTERESLKGFQKVY----RR 574

Query: 667  SKETANGLESDSPPQRRKMISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYLD 488
            S++T + ++SDSP +RRK++SF +DV +IG+ GTCLK+N +SSF+VKT+T K +EN YLD
Sbjct: 575  SEKTTHRVQSDSPTKRRKIVSFHNDVREIGLGGTCLKINSDSSFQVKTETSKIDENSYLD 634

Query: 487  VTDQPLSELNPWIYPKRKRSRPCAPRHSIRSSNQCSPCAAIENNVEKNPTVNDETHNRVA 308
            V D+ LS+ N WIYPK KRSR   P HSIRS  QCS  AA ENNVEK P VN+E H R  
Sbjct: 635  VADRSLSQFNLWIYPKGKRSRQRLPPHSIRSRVQCSGRAATENNVEKYPVVNEEAHKR-- 692

Query: 307  KLLVYKRRCKVPLEREHIGCSLELAGDLSS-SVNHNATSDRQVQISVELDTANPSRQCGK 131
            KLLVY R+ K  L+RE +G SL  AGDLSS SVN N  SD  V+ SVE D A P  Q  K
Sbjct: 693  KLLVYTRQHKFSLDREQVGSSLRFAGDLSSPSVNDNVVSDSLVENSVEFDRAKPLMQFDK 752

Query: 130  LDDRKPSSVDDFVNITSDVLSNGHCMTLQVFEGSDKSKQ 14
            LDD KPSSVD   N+ SD LSNG  MT QVFEG DK+K+
Sbjct: 753  LDDGKPSSVDPSGNMKSDALSNGCSMTSQVFEGCDKTKK 791


>ref|XP_012832877.1| PREDICTED: uncharacterized protein LOC105953732 [Erythranthe guttata]
          Length = 881

 Score =  191 bits (486), Expect = 5e-51
 Identities = 141/342 (41%), Positives = 175/342 (51%), Gaps = 3/342 (0%)
 Frame = -1

Query: 1021 NKQNN---VANPEEAFVGWRCKGKRSRCMKIYTVQDKNVSGFAAKRIENQRGTVVLLPVK 851
            N++NN   +A+ +EAFVGWRCKGKRS+C+KI TV   NVS                LP  
Sbjct: 416  NEKNNSKYLASYDEAFVGWRCKGKRSKCVKISTVDHTNVS----------------LPAA 459

Query: 850  KNIMTEKLMAGEVLNSRSPGNIRNSKCGGRSGITVQQEEYGSTEKESLKEFQRLHCVYGR 671
            K +  ++    ++L S+S                            S K+      + GR
Sbjct: 460  KQLKNKRRTVSKLLLSQSL---------------------------STKKLMTGKVLNGR 492

Query: 670  RSKETANGLESDSPPQRRKMISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYL 491
                            RRK            V G  LKL+ +SS +VKTDT KREEN  L
Sbjct: 493  N---------------RRK------------VEGAYLKLSSDSSSEVKTDTAKREENFNL 525

Query: 490  DVTDQPLSELNPWIYPKRKRSRPCAPRHSIRSSNQCSPCAAIENNVEKNPTVNDETHNRV 311
            ++ D+ LS++N W YPKRKRS  C P HSIRSSNQCSPC AIEN VEK+P VN+E +NRV
Sbjct: 526  ELADRTLSKMNMWNYPKRKRSGHCLPCHSIRSSNQCSPCTAIENIVEKHPVVNEEANNRV 585

Query: 310  AKLLVYKRRCKVPLEREHIGCSLELAGDLSSSVNHNATSDRQVQISVELDTANPSRQCGK 131
             +LLVYKRR K  +ERE IG +L+LAG L SS   NA SD  +Q SVE  T   S   GK
Sbjct: 586  KELLVYKRRRKFSVEREQIGSALKLAGHLYSSGKDNAVSD-GLQNSVEFHT---SVAPGK 641

Query: 130  LDDRKPSSVDDFVNITSDVLSNGHCMTLQVFEGSDKSKQPFN 5
            +   K  S+                 TLQV E +DKSKQ  N
Sbjct: 642  M---KSDSI-----------------TLQVLENTDKSKQACN 663


>gb|EYU46650.1| hypothetical protein MIMGU_mgv1a022848mg, partial [Erythranthe
            guttata]
          Length = 895

 Score =  191 bits (486), Expect = 6e-51
 Identities = 141/342 (41%), Positives = 175/342 (51%), Gaps = 3/342 (0%)
 Frame = -1

Query: 1021 NKQNN---VANPEEAFVGWRCKGKRSRCMKIYTVQDKNVSGFAAKRIENQRGTVVLLPVK 851
            N++NN   +A+ +EAFVGWRCKGKRS+C+KI TV   NVS                LP  
Sbjct: 429  NEKNNSKYLASYDEAFVGWRCKGKRSKCVKISTVDHTNVS----------------LPAA 472

Query: 850  KNIMTEKLMAGEVLNSRSPGNIRNSKCGGRSGITVQQEEYGSTEKESLKEFQRLHCVYGR 671
            K +  ++    ++L S+S                            S K+      + GR
Sbjct: 473  KQLKNKRRTVSKLLLSQSL---------------------------STKKLMTGKVLNGR 505

Query: 670  RSKETANGLESDSPPQRRKMISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYL 491
                            RRK            V G  LKL+ +SS +VKTDT KREEN  L
Sbjct: 506  N---------------RRK------------VEGAYLKLSSDSSSEVKTDTAKREENFNL 538

Query: 490  DVTDQPLSELNPWIYPKRKRSRPCAPRHSIRSSNQCSPCAAIENNVEKNPTVNDETHNRV 311
            ++ D+ LS++N W YPKRKRS  C P HSIRSSNQCSPC AIEN VEK+P VN+E +NRV
Sbjct: 539  ELADRTLSKMNMWNYPKRKRSGHCLPCHSIRSSNQCSPCTAIENIVEKHPVVNEEANNRV 598

Query: 310  AKLLVYKRRCKVPLEREHIGCSLELAGDLSSSVNHNATSDRQVQISVELDTANPSRQCGK 131
             +LLVYKRR K  +ERE IG +L+LAG L SS   NA SD  +Q SVE  T   S   GK
Sbjct: 599  KELLVYKRRRKFSVEREQIGSALKLAGHLYSSGKDNAVSD-GLQNSVEFHT---SVAPGK 654

Query: 130  LDDRKPSSVDDFVNITSDVLSNGHCMTLQVFEGSDKSKQPFN 5
            +   K  S+                 TLQV E +DKSKQ  N
Sbjct: 655  M---KSDSI-----------------TLQVLENTDKSKQACN 676


>ref|XP_022869089.1| uncharacterized protein LOC111388563 isoform X2 [Olea europaea var.
            sylvestris]
          Length = 1064

 Score =  149 bits (376), Expect = 4e-36
 Identities = 125/387 (32%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
 Frame = -1

Query: 1021 NKQNNVANPEEAFVGWRCKGKRSRCMKIYT-VQDKNVSGFAAKRIENQRGTVVLLPVKKN 845
            +K  N    E  F+G   K K ++ +K    +  +N+S    +  E Q G V  L  K+ 
Sbjct: 435  HKSKNCTYLENDFLGSADKRKPTKRVKRSAKMVYENISCSLDEHFEGQSGAVSPLLPKRK 494

Query: 844  IMTEKLMAGEVLNSRSPGNIRNSKCGGRSGITVQQEEYGSTEKESLKE------------ 701
            +M ++ +  + LN R+PGN +NS CG      VQ +E   TEK +LKE            
Sbjct: 495  LMMKEPLTEKDLNGRNPGNKKNSNCGRAPRTVVQHKEDEFTEKINLKEVRKNKFLLESIG 554

Query: 700  ------------------------------FQRLHCVYGRRSKETANGLESDSPPQRRKM 611
                                          F++         + T + ++S +P QRR M
Sbjct: 555  CNSDGQCSKDKSTNPSHAARVSGHYSSVISFEKAEDSIDSSEERTNDIMKSGAPLQRRNM 614

Query: 610  ISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYLDVTDQPLSELNPWIYPKRKR 431
             SF  DVCK GVRG CLKL+  S FK  +  LK +E   L V   PL +LNPWIYP+RKR
Sbjct: 615  TSFHKDVCKSGVRGNCLKLDSVSCFKDISPPLKHQEKLNL-VVGSPL-KLNPWIYPRRKR 672

Query: 430  SRPCAPRHSIRSSNQCSPCAAI-ENNVEKNPTVNDETHNRVAKLLVYKRRCKVPLEREHI 254
            +R   P HS  +SNQ  P   + E N      V+ +   +VA  LV+ R  +   ER H 
Sbjct: 673  TRHNVPCHS--NSNQLPPSMIVHEKNENTCHKVSRKACKKVAGHLVHSREFQFSPERNHF 730

Query: 253  -GCSLELAGDLS-SSVNHNATSDRQVQISVELDTANPSRQCGKLDDRKPSSVDDFVNITS 80
               + +  GD   +  N+N ++   V++ V+LD +   +Q GKLD   P        I  
Sbjct: 731  ASLTRQHIGDWDCTGFNNNVSATTMVRVPVDLDRSVTLKQSGKLDGGCPIPFVGTSGIMK 790

Query: 79   -DVLSNGHCMTLQVFEGSDKSKQPFNN 2
             D LSNG+    +V+E   KSK+P NN
Sbjct: 791  LDALSNGNSKGFKVYECGKKSKRPRNN 817


>ref|XP_022869087.1| uncharacterized protein LOC111388563 isoform X1 [Olea europaea var.
            sylvestris]
          Length = 1096

 Score =  149 bits (376), Expect = 4e-36
 Identities = 125/387 (32%), Positives = 179/387 (46%), Gaps = 47/387 (12%)
 Frame = -1

Query: 1021 NKQNNVANPEEAFVGWRCKGKRSRCMKIYT-VQDKNVSGFAAKRIENQRGTVVLLPVKKN 845
            +K  N    E  F+G   K K ++ +K    +  +N+S    +  E Q G V  L  K+ 
Sbjct: 435  HKSKNCTYLENDFLGSADKRKPTKRVKRSAKMVYENISCSLDEHFEGQSGAVSPLLPKRK 494

Query: 844  IMTEKLMAGEVLNSRSPGNIRNSKCGGRSGITVQQEEYGSTEKESLKE------------ 701
            +M ++ +  + LN R+PGN +NS CG      VQ +E   TEK +LKE            
Sbjct: 495  LMMKEPLTEKDLNGRNPGNKKNSNCGRAPRTVVQHKEDEFTEKINLKEVRKNKFLLESIG 554

Query: 700  ------------------------------FQRLHCVYGRRSKETANGLESDSPPQRRKM 611
                                          F++         + T + ++S +P QRR M
Sbjct: 555  CNSDGQCSKDKSTNPSHAARVSGHYSSVISFEKAEDSIDSSEERTNDIMKSGAPLQRRNM 614

Query: 610  ISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYLDVTDQPLSELNPWIYPKRKR 431
             SF  DVCK GVRG CLKL+  S FK  +  LK +E   L V   PL +LNPWIYP+RKR
Sbjct: 615  TSFHKDVCKSGVRGNCLKLDSVSCFKDISPPLKHQEKLNL-VVGSPL-KLNPWIYPRRKR 672

Query: 430  SRPCAPRHSIRSSNQCSPCAAI-ENNVEKNPTVNDETHNRVAKLLVYKRRCKVPLEREHI 254
            +R   P HS  +SNQ  P   + E N      V+ +   +VA  LV+ R  +   ER H 
Sbjct: 673  TRHNVPCHS--NSNQLPPSMIVHEKNENTCHKVSRKACKKVAGHLVHSREFQFSPERNHF 730

Query: 253  -GCSLELAGDLS-SSVNHNATSDRQVQISVELDTANPSRQCGKLDDRKPSSVDDFVNITS 80
               + +  GD   +  N+N ++   V++ V+LD +   +Q GKLD   P        I  
Sbjct: 731  ASLTRQHIGDWDCTGFNNNVSATTMVRVPVDLDRSVTLKQSGKLDGGCPIPFVGTSGIMK 790

Query: 79   -DVLSNGHCMTLQVFEGSDKSKQPFNN 2
             D LSNG+    +V+E   KSK+P NN
Sbjct: 791  LDALSNGNSKGFKVYECGKKSKRPRNN 817


>gb|KZV32385.1| hypothetical protein F511_03668 [Dorcoceras hygrometricum]
          Length = 908

 Score =  134 bits (338), Expect = 4e-31
 Identities = 91/220 (41%), Positives = 127/220 (57%), Gaps = 3/220 (1%)
 Frame = -1

Query: 661  ETANGLESDSPPQRRKMISFPDDVCKIGVRGTCLKLNPNSSFKVKTDTLKREENPYLDVT 482
            ET + ++SDSP +R+ +ISF D VC  GVRGT  +LN ++  +  + T KREE+  + VT
Sbjct: 467  ETNDAIKSDSPLKRKGIISFRDVVCNKGVRGT--RLNCSTYLRGTSKTAKREESQNIAVT 524

Query: 481  DQPLSELNPWIYPKRKRSRPCAPRHSIRSSNQCSPCAAIENNVEKNPTVNDETHNRVAKL 302
            D  LS+   WI+P+ +R    AP+ S +S  + S  A +E N EK   VND++++RV KL
Sbjct: 525  DVRLSKFKSWIHPRGRRFHSHAPQRSNKSIKKSSCDAGVEKNAEKCSVVNDKSYDRVGKL 584

Query: 301  LVYKRRCKVPLEREHIGCSLELAGDLSS-SVNHNATSDRQVQISVELDTANPSRQCGKLD 125
            LVY RR     ER+H G  + L GD SS SV++N  S RQ + S+E            LD
Sbjct: 585  LVYTRRRHFLAERKHSGNYINLTGDSSSPSVSYNECSGRQAESSIEF---------SNLD 635

Query: 124  DRKPSSVDDFVNITSDV-LSNGHCMTLQVFEG-SDKSKQP 11
            D   S+  D   +T  V LS+G+  T +   G SDK KQP
Sbjct: 636  DEGLSTSLDTSGVTKSVALSDGNSETSRGCMGRSDKRKQP 675


Top