BLASTX nr result

ID: Rehmannia22_contig00012194

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00012194
         (997 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS72654.1| hypothetical protein M569_02103, partial [Genlise...   108   3e-21
gb|EOY11311.1| Recovery protein 3 isoform 3 [Theobroma cacao] gi...    90   2e-15
gb|EOY11310.1| Recovery protein 3 isoform 2, partial [Theobroma ...    90   2e-15
gb|EOY11309.1| Recovery protein 3 isoform 1 [Theobroma cacao]          90   2e-15
gb|EMJ07643.1| hypothetical protein PRUPE_ppa000111mg [Prunus pe...    74   1e-10

>gb|EPS72654.1| hypothetical protein M569_02103, partial [Genlisea aurea]
          Length = 1762

 Score =  108 bits (270), Expect = 3e-21
 Identities = 98/318 (30%), Positives = 146/318 (45%), Gaps = 14/318 (4%)
 Frame = -3

Query: 995  KRPRWGSLPVSRQKVIDVLPPEKFCILSGNDSEKNEGLGTSCLGNESERRPTVKGDEQKD 816
            KRPRWGSLPV  + +ID   P+   +  G D+EK + L  S  GNE+E    V    ++ 
Sbjct: 550  KRPRWGSLPVFSKNLIDFGKPDTSSVPGGYDNEK-KALSISWSGNETEILSGVTPLYERK 608

Query: 815  VSYTNQASVIVECSARDLMRRKRSHRI---------ELSESISSPDAECGKGGINHDPKF 663
             S++ Q++  +ECS RDLMRRKRSH+          E++E ++S D E  +G I+ +P  
Sbjct: 609  FSHSKQSNTELECSIRDLMRRKRSHQSQNPEYGITEEMNEEVAS-DGETEEGDIDPEPCE 667

Query: 662  CSDIEDPKDSLSPMHKTPINNETKFHGRDINFLKTEIIGEDSRCRMLQTGAVG--SCAVP 489
             +D ED K       K PIN        ++  +   ++G   R        +G  S A  
Sbjct: 668  MNDSEDAK-----ALKLPIN--------EVPSMTRRLLGSPER-------GIGYTSKACD 707

Query: 488  LNSSPSNLVSKTDCISESGYQTSTSMCQLPGIPAKDDLNGMEGETTIVQFENSPCHGRYP 309
              ++  NL S+   +  +    ++  C L   P        + +T I          R  
Sbjct: 708  EGNNSKNLGSE---LKNNNLMPTSGSCTLNHAP--------DSKTIIFGEHCDSSVPRDV 756

Query: 308  GLSNPHDALENVKKESETVNLIGKTFSMKPPTIDWTDAPESEA---LGNGGSKVGTSLQG 138
            GL +P + L     ++E V L+  TFS KPP +   +  E +    L N   +VG SL  
Sbjct: 757  GLHSPTEILGETSNKAEIVGLVAMTFSKKPPDVVCINDAEGDVPFLLENESRQVGASLCH 816

Query: 137  RYADSCLPFFVRSCPEEE 84
             Y    LPFF R+  EEE
Sbjct: 817  NY----LPFFTRNGTEEE 830


>gb|EOY11311.1| Recovery protein 3 isoform 3 [Theobroma cacao]
            gi|508719415|gb|EOY11312.1| Recovery protein 3 isoform 3
            [Theobroma cacao]
          Length = 1590

 Score = 89.7 bits (221), Expect = 2e-15
 Identities = 118/440 (26%), Positives = 162/440 (36%), Gaps = 109/440 (24%)
 Frame = -3

Query: 995  KRPRWGSLPVS-RQKVIDVLPPEKFCILSGNDSEKNEGLGTSC-----LGNESERRPTVK 834
            K+  WGSLP+S   K  D      F I      E  E LGTS      LG  S+  P  K
Sbjct: 154  KKLLWGSLPLSVTGKGKDNSDSVSFNITEACADEIKECLGTSFSAENDLGKASD--PLNK 211

Query: 833  GDEQKDVSYTNQASVIVECSARDLMRRKRSHRIELSE--SISSPDAEC------------ 696
                 D     +A ++VEC+ RDLMRRKRS RIE ++  S+ S +               
Sbjct: 212  NAHASDDK--QEAGILVECTVRDLMRRKRSRRIEPADCGSVRSENVHLKMEKGKDSFFCP 269

Query: 695  ---------------GKGGINHDPKFCSDIEDPKDSLS--PMH----------------- 618
                           G G +NH P   ++ ++  +++   P H                 
Sbjct: 270  KQLNFHGSHNELDKKGPGSLNHSPSLANEQKEFPEAVGFKPTHSDSVYCTLPQLSGISNP 329

Query: 617  -------------KTPINNETKFH-------------GRDINFLKTEIIGEDSRC----- 531
                         K  +N   K H             G++ +F  T     +S       
Sbjct: 330  AQANTGHPEQMGKKLVLNFYPKKHDSAISIGHCETYKGKEFDFRVTSAESRNSDAHTSKA 389

Query: 530  ---------RMLQTGAVGSCAVPLNSSPSNLVSKTDCISESGYQTSTSMCQLPGIPAKDD 378
                     R+ QT   GS  +  +     ++     I E+ Y+   S+       + D 
Sbjct: 390  HKEIDSPDERLQQTDTNGSWCLSASPRTHKMLGMDGYIHETYYEGEISL-------SADK 442

Query: 377  LNGMEGETTIVQFENSPCHGRYPGLSNPHDALENVKKESETVNLIGKTFSMKPPTIDWTD 198
              G++  T     +N  C G   G          V  E++ V LIG TF  KPPT DW D
Sbjct: 443  PVGIDATTDKSYPQNEDCGGGKQGCITGLV----VDVEAKPVELIGMTFCKKPPTADWND 498

Query: 197  AP-----------ESEALGNGGSKVGTSLQGRYADSCLPFFVRSCPEEEELQGINPRKCK 51
                          S +L N  +  GTS  GR  D  LPFF R C EE+E+Q     KC 
Sbjct: 499  GATENVTHLPTTQHSPSLFNEENCQGTS--GRALDEVLPFFSRGCEEEKEVQ----NKCL 552

Query: 50   DNDN----QEPVMGVPILYQ 3
             N+N    QE  +GVPI YQ
Sbjct: 553  GNNNSNFHQEAALGVPIHYQ 572


>gb|EOY11310.1| Recovery protein 3 isoform 2, partial [Theobroma cacao]
          Length = 1425

 Score = 89.7 bits (221), Expect = 2e-15
 Identities = 118/440 (26%), Positives = 162/440 (36%), Gaps = 109/440 (24%)
 Frame = -3

Query: 995  KRPRWGSLPVS-RQKVIDVLPPEKFCILSGNDSEKNEGLGTSC-----LGNESERRPTVK 834
            K+  WGSLP+S   K  D      F I      E  E LGTS      LG  S+  P  K
Sbjct: 345  KKLLWGSLPLSVTGKGKDNSDSVSFNITEACADEIKECLGTSFSAENDLGKASD--PLNK 402

Query: 833  GDEQKDVSYTNQASVIVECSARDLMRRKRSHRIELSE--SISSPDAEC------------ 696
                 D     +A ++VEC+ RDLMRRKRS RIE ++  S+ S +               
Sbjct: 403  NAHASDDK--QEAGILVECTVRDLMRRKRSRRIEPADCGSVRSENVHLKMEKGKDSFFCP 460

Query: 695  ---------------GKGGINHDPKFCSDIEDPKDSLS--PMH----------------- 618
                           G G +NH P   ++ ++  +++   P H                 
Sbjct: 461  KQLNFHGSHNELDKKGPGSLNHSPSLANEQKEFPEAVGFKPTHSDSVYCTLPQLSGISNP 520

Query: 617  -------------KTPINNETKFH-------------GRDINFLKTEIIGEDSRC----- 531
                         K  +N   K H             G++ +F  T     +S       
Sbjct: 521  AQANTGHPEQMGKKLVLNFYPKKHDSAISIGHCETYKGKEFDFRVTSAESRNSDAHTSKA 580

Query: 530  ---------RMLQTGAVGSCAVPLNSSPSNLVSKTDCISESGYQTSTSMCQLPGIPAKDD 378
                     R+ QT   GS  +  +     ++     I E+ Y+   S+       + D 
Sbjct: 581  HKEIDSPDERLQQTDTNGSWCLSASPRTHKMLGMDGYIHETYYEGEISL-------SADK 633

Query: 377  LNGMEGETTIVQFENSPCHGRYPGLSNPHDALENVKKESETVNLIGKTFSMKPPTIDWTD 198
              G++  T     +N  C G   G          V  E++ V LIG TF  KPPT DW D
Sbjct: 634  PVGIDATTDKSYPQNEDCGGGKQGCITGLV----VDVEAKPVELIGMTFCKKPPTADWND 689

Query: 197  AP-----------ESEALGNGGSKVGTSLQGRYADSCLPFFVRSCPEEEELQGINPRKCK 51
                          S +L N  +  GTS  GR  D  LPFF R C EE+E+Q     KC 
Sbjct: 690  GATENVTHLPTTQHSPSLFNEENCQGTS--GRALDEVLPFFSRGCEEEKEVQ----NKCL 743

Query: 50   DNDN----QEPVMGVPILYQ 3
             N+N    QE  +GVPI YQ
Sbjct: 744  GNNNSNFHQEAALGVPIHYQ 763


>gb|EOY11309.1| Recovery protein 3 isoform 1 [Theobroma cacao]
          Length = 2035

 Score = 89.7 bits (221), Expect = 2e-15
 Identities = 118/440 (26%), Positives = 162/440 (36%), Gaps = 109/440 (24%)
 Frame = -3

Query: 995  KRPRWGSLPVS-RQKVIDVLPPEKFCILSGNDSEKNEGLGTSC-----LGNESERRPTVK 834
            K+  WGSLP+S   K  D      F I      E  E LGTS      LG  S+  P  K
Sbjct: 599  KKLLWGSLPLSVTGKGKDNSDSVSFNITEACADEIKECLGTSFSAENDLGKASD--PLNK 656

Query: 833  GDEQKDVSYTNQASVIVECSARDLMRRKRSHRIELSE--SISSPDAEC------------ 696
                 D     +A ++VEC+ RDLMRRKRS RIE ++  S+ S +               
Sbjct: 657  NAHASDDK--QEAGILVECTVRDLMRRKRSRRIEPADCGSVRSENVHLKMEKGKDSFFCP 714

Query: 695  ---------------GKGGINHDPKFCSDIEDPKDSLS--PMH----------------- 618
                           G G +NH P   ++ ++  +++   P H                 
Sbjct: 715  KQLNFHGSHNELDKKGPGSLNHSPSLANEQKEFPEAVGFKPTHSDSVYCTLPQLSGISNP 774

Query: 617  -------------KTPINNETKFH-------------GRDINFLKTEIIGEDSRC----- 531
                         K  +N   K H             G++ +F  T     +S       
Sbjct: 775  AQANTGHPEQMGKKLVLNFYPKKHDSAISIGHCETYKGKEFDFRVTSAESRNSDAHTSKA 834

Query: 530  ---------RMLQTGAVGSCAVPLNSSPSNLVSKTDCISESGYQTSTSMCQLPGIPAKDD 378
                     R+ QT   GS  +  +     ++     I E+ Y+   S+       + D 
Sbjct: 835  HKEIDSPDERLQQTDTNGSWCLSASPRTHKMLGMDGYIHETYYEGEISL-------SADK 887

Query: 377  LNGMEGETTIVQFENSPCHGRYPGLSNPHDALENVKKESETVNLIGKTFSMKPPTIDWTD 198
              G++  T     +N  C G   G          V  E++ V LIG TF  KPPT DW D
Sbjct: 888  PVGIDATTDKSYPQNEDCGGGKQGCITGLV----VDVEAKPVELIGMTFCKKPPTADWND 943

Query: 197  AP-----------ESEALGNGGSKVGTSLQGRYADSCLPFFVRSCPEEEELQGINPRKCK 51
                          S +L N  +  GTS  GR  D  LPFF R C EE+E+Q     KC 
Sbjct: 944  GATENVTHLPTTQHSPSLFNEENCQGTS--GRALDEVLPFFSRGCEEEKEVQ----NKCL 997

Query: 50   DNDN----QEPVMGVPILYQ 3
             N+N    QE  +GVPI YQ
Sbjct: 998  GNNNSNFHQEAALGVPIHYQ 1017


>gb|EMJ07643.1| hypothetical protein PRUPE_ppa000111mg [Prunus persica]
          Length = 1771

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 90/337 (26%), Positives = 128/337 (37%), Gaps = 6/337 (1%)
 Frame = -3

Query: 995  KRPRWGSLPVSRQKVIDVLPPEKFCILSGNDSE--KNEGLGTSCLGNESERRPTVKGDEQ 822
            K+  WGSLP+S  + +     E   I S ++ +  K  G+G   L               
Sbjct: 563  KKSLWGSLPLSATQKMKT---EGELINSSSEDQVGKRAGIGACDL--------------- 604

Query: 821  KDVSYTNQASVIVECSARDLMRRKRSHRIELSESISSPDAECGKGGINHDPKFCSDIEDP 642
                   ++S++  CS RDLMRRKRS+RIE          ECG  GI             
Sbjct: 605  ------KESSMLARCSVRDLMRRKRSYRIE--------PPECGSQGI------------- 637

Query: 641  KDSLSPMHKTPINNETKFHGRDINFLKTEIIGEDSRCRMLQTGAVGSCAVPLNSSPSNLV 462
            K+ L    +   N +T    + ++F                     SCA           
Sbjct: 638  KEVLLGREE---NEDTLLCAKRLDFQM-------------------SCA----------- 664

Query: 461  SKTDCISESGYQTSTSMCQLPGIPAKDDLNGMEGETTIVQFENSPCHGRYPGLSNPHDAL 282
               D  +  G  +   +C++P     ++  G+   T      N    G+  G+ +    L
Sbjct: 665  ---DATTFEGLSSKGGVCEMPF----ENPVGVNAITVATFLNNEGSGGQKLGVDSVLCGL 717

Query: 281  ENVKK---ESETVNLIGKTFSMKPPTIDWTDAPESEALGNGGSKVGTSL-QGRYADSCLP 114
             N       S+   LI  +F  KPP  DW           G SK  +SL  GR  D   P
Sbjct: 718  RNSPFGVIPSDDKGLIEMSFCRKPPVADWN---------YGESKNASSLYDGRATDEFCP 768

Query: 113  FFVRSCPEEEELQGINPRKCKDNDNQEPVMGVPILYQ 3
            FFVR C +E E+Q    R  + + +QE VMGVPI YQ
Sbjct: 769  FFVRDCQDEREIQNKCVRS-ESSSHQESVMGVPIHYQ 804

