BLASTX nr result

ID: Cephaelis21_contig00002610 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00002610
         (1598 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002518861.1| transcription factor, putative [Ricinus comm...   146   1e-32
ref|XP_002303671.1| predicted protein [Populus trichocarpa] gi|2...   133   1e-28
ref|XP_002299449.1| predicted protein [Populus trichocarpa] gi|2...   131   4e-28
ref|NP_001042530.1| Os01g0236700 [Oryza sativa Japonica Group] g...   126   2e-26
gb|EAY73195.1| hypothetical protein OsI_01067 [Oryza sativa Indi...   125   2e-26

>ref|XP_002518861.1| transcription factor, putative [Ricinus communis]
            gi|223541848|gb|EEF43394.1| transcription factor,
            putative [Ricinus communis]
          Length = 1003

 Score =  146 bits (369), Expect = 1e-32
 Identities = 145/498 (29%), Positives = 216/498 (43%), Gaps = 21/498 (4%)
 Frame = +2

Query: 5    LGPPGRVFKNGLPEVCYDIRDYTSIEYPPDLREKAVVCGIYAYVALPLSESSEYKRGSVL 184
            LG PGRVF+  LPE   +++ Y+S EY    R+ A+   +   +ALP+ E S      V+
Sbjct: 224  LGLPGRVFRQKLPEWTPNVQYYSSKEY--SRRDHALNYNVQGTLALPVFEPSGQSCVGVI 281

Query: 185  ECVFTD---DYKIPLFVIPHLISIVGC--PEFFVEPSL---------ANFDIRDSLRAIC 322
            E + T    +Y   +  +   +  V     E    PS          A  +I + L  +C
Sbjct: 282  ELIMTSQKINYAPEVDKVCKALEAVNLRSSEILDHPSTQICNEGRKNALAEILEILTVVC 341

Query: 323  KTHKLPHAQTWV---YSSTYKNETVRRHERSYFTNYDLVTNDDIINLMCFRGASL--YIK 487
            +T+KL  AQTW+   + S+  +     + +   +  DL +     ++  FR A L  +++
Sbjct: 342  ETYKLALAQTWIPCMHRSSCTSFDGSCNGQVCMSTTDLASYVVDPHMWGFRDACLEHHLQ 401

Query: 488  RGRGVVGKAFSSKGACFCKDITQLSIDEYSLVPSAKTAGLTGCFAICLK-HHTGWEDYLF 664
            +G+GV G+AF S  ACFC+DITQ    EY LV  A+  GLTGCFAICL+  +TG +DY  
Sbjct: 402  KGQGVAGRAFLSHNACFCQDITQFCKTEYPLVHYARLFGLTGCFAICLRSSYTGDDDY-- 459

Query: 665  IVEFFLPNKKTGSGDPRTSIKMLLATIKEQLKDFRVASGQDLGENLS-VEVIKTSPTEDL 841
            ++EFFLP   + S + ++ +  LLAT+K+  +   VASG DL E    VE+I+TS +  L
Sbjct: 460  VLEFFLPPTISDSYEQKSLLGSLLATMKQHFQSLNVASGMDLKEEEGFVEIIQTSTSGRL 519

Query: 842  DSFEICDLTGIDNTLSDLAIVLHGGAGRALDHESGANVPPETNNLNQHGGEVVHLVDGRS 1021
            D    C                       +      N PP TN   + G   V L     
Sbjct: 520  DLRLEC-----------------------IQIPQSPNSPPNTNTFPKDGH--VTLPHSSK 554

Query: 1022 HATNVNNVGQNNVKMEQCGINRMEEIVQRDDPIXXXXXXXXXXXXXXXXXIERRDANSNN 1201
            H   V+     +V      I   E       P+                           
Sbjct: 555  HPLMVD----LDVVDNGGNIGHAEGTHTSPPPV-------------------ENKGTRKP 591

Query: 1202 SENQRGIKRINREYGITHADLLQHSGKKQEEVAAHFCVSRSTFKRICRTHQILR*PPRKA 1381
            SE +RG      E  I+   L Q+     ++ A    V  +T KRICR H I R P RK 
Sbjct: 592  SEKKRG----KAEKSISLEVLQQYFAGSLKDAAKSLGVCPTTMKRICRQHGISRWPSRKI 647

Query: 1382 RHVDGPFNIAQEFVQSTK 1435
              V+      +  ++S +
Sbjct: 648  NKVNRSLTKLKRVIESVQ 665


>ref|XP_002303671.1| predicted protein [Populus trichocarpa] gi|222841103|gb|EEE78650.1|
            predicted protein [Populus trichocarpa]
          Length = 953

 Score =  133 bits (335), Expect = 1e-28
 Identities = 105/314 (33%), Positives = 156/314 (49%), Gaps = 30/314 (9%)
 Frame = +2

Query: 5    LGPPGRVFKNGLPEVCYDIRDYTSIEYPPDLREKAVVCGIYAYVALPLSESSEYKRGSVL 184
            LG PGRVF+   PE   +++ Y+S EY     + A+   +   +ALP+ E S      VL
Sbjct: 212  LGLPGRVFRQKSPEWTPNVQYYSSKEY--SRLDHALRYNVRGTLALPVFEPSGQSCVGVL 269

Query: 185  ECVFTD---DYKIPLFVIPHLISIVGCP--EFFVEPSL---------ANFDIRDSLRAIC 322
            E +      +Y   +  +   +  V     E    PS+         A  +I + L  +C
Sbjct: 270  ELIMNSQKINYAPEVDKVCKALEAVNLKSSEILDPPSIQICNEGRQNALSEILEILTMVC 329

Query: 323  KTHKLPHAQTWVYSSTYKNETVRRHERSYFTNYDLVTNDDII-------------NLMCF 463
            +THKLP AQTWV        T     +   T++D   N  +               +  F
Sbjct: 330  ETHKLPLAQTWVPCIHRSVLTYGGGLKKSCTSFDGNCNGQVCMSTTDVAFYVVDARMWGF 389

Query: 464  RGASL--YIKRGRGVVGKAFSSKGACFCKDITQLSIDEYSLVPSAKTAGLTGCFAICLK- 634
            R A L  ++++G+GV G+AF S+ +CFC DITQ    EY LV  A+  GLT CFAI L+ 
Sbjct: 390  REACLEHHLQKGQGVAGRAFLSQNSCFCPDITQFCKTEYPLVHYARMFGLTSCFAIFLRS 449

Query: 635  HHTGWEDYLFIVEFFLPNKKTGSGDPRTSIKMLLATIKEQLKDFRVASGQDLGENLSVEV 814
             +TG +DY  I+EFFLP   T S + +T +  +LAT+K+  +  +VASG DL E   VE+
Sbjct: 450  SYTGDDDY--ILEFFLPPSITDSHEQKTFLGSILATMKQDFQSLKVASGMDLEEEGFVEM 507

Query: 815  IKTSPTEDLDSFEI 856
            I+ +    L+  +I
Sbjct: 508  IEATTNGRLECIQI 521


>ref|XP_002299449.1| predicted protein [Populus trichocarpa] gi|222846707|gb|EEE84254.1|
            predicted protein [Populus trichocarpa]
          Length = 915

 Score =  131 bits (330), Expect = 4e-28
 Identities = 107/311 (34%), Positives = 153/311 (49%), Gaps = 31/311 (9%)
 Frame = +2

Query: 5    LGPPGRVFKNGLPEVCYDIRDYTSIEYPPDLREKAVVCGIYAYVALPLSESSEYKRGSVL 184
            LG PGRVF+  LPE   +++ Y+S EY     + A+   +   VALP+ E S      V+
Sbjct: 169  LGLPGRVFRQKLPEWTPNVQYYSSKEY--SRLDHALHYNVRGTVALPVFEPSGQSCVGVV 226

Query: 185  ECVFTD---DYKIPLFVIPHLISIVGCP--EFFVEPSL---------ANFDIRDSLRAIC 322
            E + T    +Y   +  +   +  V     E    PS          A  +I + L  +C
Sbjct: 227  ELIMTSQKINYAPEVDKVCKALEAVDLKSSEILDPPSTQICNEGRQNALAEILEILTMVC 286

Query: 323  KTHKLPHAQTWVYSSTYKNETVRRHERSYFTNYDLVTNDDII-------------NLMCF 463
            +THKLP AQTWV              +   T++D   N  +              ++  F
Sbjct: 287  ETHKLPLAQTWVPCMHRSVLAYGGGLKKSCTSFDGSCNGQVCMSTTDVAFYVVDAHMWGF 346

Query: 464  RGASL--YIKRGRGVVGKAFSSKGACFCKDITQLSIDEYSLVPSAKTAGLTGCFAICLK- 634
            R A L  ++++G+GV G+AF S   CFC DITQ    EY LV  A+  GLT CFAICL+ 
Sbjct: 347  REACLEHHLQKGQGVAGRAFFSHNLCFCPDITQFCKTEYPLVHYARMFGLTSCFAICLRS 406

Query: 635  HHTGWEDYLFIVEFFLPNKKTGSGDPRTSIKMLLATIKEQLKDFRVASGQDLGENLS-VE 811
             +TG +DY  I+EFFLP   T S + +T +  +LA +K+  +  +VASG DL E    VE
Sbjct: 407  SYTGDDDY--ILEFFLPPSFTDSREWKTLLGSILAIMKQDFQSLQVASGMDLEEEEGFVE 464

Query: 812  VIKTSPTEDLD 844
            +I+ S    LD
Sbjct: 465  MIQVSTNGRLD 475


>ref|NP_001042530.1| Os01g0236700 [Oryza sativa Japonica Group]
            gi|75107518|sp|Q5NB82.1|NLP3_ORYSJ RecName: Full=Protein
            NLP3; Short=AtNLP3; AltName: Full=NIN-like protein 3;
            AltName: Full=Nodule inception protein-like protein 3
            gi|56783862|dbj|BAD81274.1| nodule inception protein
            -like [Oryza sativa Japonica Group]
            gi|113532061|dbj|BAF04444.1| Os01g0236700 [Oryza sativa
            Japonica Group] gi|125569669|gb|EAZ11184.1| hypothetical
            protein OsJ_01032 [Oryza sativa Japonica Group]
          Length = 938

 Score =  126 bits (316), Expect = 2e-26
 Identities = 143/553 (25%), Positives = 228/553 (41%), Gaps = 47/553 (8%)
 Frame = +2

Query: 5    LGPPGRVFKNGLPEVCYDIRDYTSIEYPPDLREKAVVCGIYAYVALPLSESSEYKRGSVL 184
            LG PGRV+K  +PE   +++ Y+S EYP      A+   ++  VALP+ + S     +V+
Sbjct: 208  LGLPGRVYKQKVPEWTPNVQYYSSTEYPR--LNHAISYNVHGTVALPVFDPSVQNCIAVV 265

Query: 185  ECVFTD---DYKIPLFVIPHLISIVGCP--EFFVEPSL---------ANFDIRDSLRAIC 322
            E + T    +Y   +  +   +  V     E    P++         A  +I + L  +C
Sbjct: 266  ELIMTSKKINYAGEVDKVCKALEAVNLKSTEILDHPNVQICNEGRQSALVEILEILTVVC 325

Query: 323  KTHKLPHAQTWV---YSSTYKN-----ETVRRHERSYFTNYDLVTNDDIINLMC-----F 463
            + HKLP AQTWV   Y S   +     ++    + S      + T+D   +++      F
Sbjct: 326  EEHKLPLAQTWVPCKYRSVLAHGGGVKKSCLSFDGSCMGEVCMSTSDVAFHVIDAHMWGF 385

Query: 464  RGASL--YIKRGRGVVGKAFSSKGACFCKDITQLSIDEYSLVPSAKTAGLTGCFAICLKH 637
            R A +  ++++G+GV GKAF  +  CF KDI+Q    EY LV  A+  GL GCFAICL+ 
Sbjct: 386  RDACVEHHLQKGQGVSGKAFIYRRPCFSKDISQFCKLEYPLVHYARMFGLAGCFAICLQS 445

Query: 638  -HTGWEDYLFIVEFFLPNKKTGSGDPRTSIKMLLATIKEQLKDFRVASGQDLGE-NLSVE 811
             +TG +DY  I+EFFLP       D    ++ +LA +K+ L+  +V    D  E  L + 
Sbjct: 446  MYTGDDDY--ILEFFLPPNCRNEDDQNALLESILARMKKCLRTLKVVGNGDTNEVCLQIS 503

Query: 812  VIKTSPTEDLDSFEICDLTGIDNTLSDLAIVLHGGAGRALDHESGANVPPETNNLNQHGG 991
             +    TEDL +                   +H         ES     PE+N     G 
Sbjct: 504  NVLIIETEDLKT------------------NVHFENSEGCFRES-----PESN-----GS 535

Query: 992  EVVHLVDGRSHATNVNN----VGQNNVKMEQCGINRMEEIVQRDDPIXXXXXXXXXXXXX 1159
            + VH VD   +  ++ +    +  +N +     + R       D                
Sbjct: 536  QRVHEVDNDGNKVSIMSERHLLADDNSQNNGASVGRPNGSGASD---------------- 579

Query: 1160 XXXXIERRDANSNNSENQRGIKRINREYGITHADLLQHSGKKQEEVAAHFCVSRSTFKRI 1339
                      + +N   +R  +R   E  I+   L Q+     +  A    V  +T KRI
Sbjct: 580  --------SLHKSNKPPER--RRGKAEKTISLDVLQQYFSGSLKNAAKSLGVCPTTMKRI 629

Query: 1340 CRTHQILR*PPRKARHVDGPFNIAQEFVQSTKEE------------LPLPVHPVGQDMNS 1483
            CR H I R P RK   V+   +  ++ ++S +              LP+PV P     N 
Sbjct: 630  CRQHGISRWPSRKINKVNRSLSKLKQVIESVQGSDAAFNLTSITGPLPIPVGPSSDSQNL 689

Query: 1484 KPDSVQNATTVQN 1522
            +  S      + N
Sbjct: 690  EKASPNKVAELSN 702


>gb|EAY73195.1| hypothetical protein OsI_01067 [Oryza sativa Indica Group]
          Length = 866

 Score =  125 bits (315), Expect = 2e-26
 Identities = 145/554 (26%), Positives = 230/554 (41%), Gaps = 48/554 (8%)
 Frame = +2

Query: 5    LGPPGRVFKNGLPEVCYDIRDYTSIEYPPDLREKAVVCGIYAYVALPLSESSEYKRGSVL 184
            LG PGRV+K  +PE   +++ Y+S EYP      A+   ++  VALP+ + S     +V+
Sbjct: 133  LGLPGRVYKQKVPEWTPNVQYYSSTEYPR--LNHAISYNVHGTVALPVFDPSVQNCIAVV 190

Query: 185  ECVFTD---DYKIPLFVIPHLISIVGCP--EFFVEPSL---------ANFDIRDSLRAIC 322
            E + T    +Y   +  +   +  V     E    P++         A  +I + L  +C
Sbjct: 191  ELIMTSKKINYAGEVDKVCKALEAVNLKSTEILDHPNVQICNEGRQSALVEILEILTVVC 250

Query: 323  KTHKLPHAQTWV---YSSTYKN-----ETVRRHERSYFTNYDLVTNDDIINLMC-----F 463
            + HKLP AQTWV   Y S   +     ++    + S      + T+D   +++      F
Sbjct: 251  EEHKLPLAQTWVPCKYRSVLAHGGGVKKSCLSFDGSCMGEVCMSTSDVAFHVIDAHMWGF 310

Query: 464  RGASL--YIKRGRGVVGKAFSSKGACFCKDITQLSIDEYSLVPSAKTAGLTGCFAICLKH 637
            R A +  ++++G+GV GKAF  +  CF KDI+Q    EY LV  A+  GL GCFAICL+ 
Sbjct: 311  RDACVEHHLQKGQGVSGKAFIYRRPCFSKDISQFCKLEYPLVHYARMFGLAGCFAICLQS 370

Query: 638  -HTGWEDYLFIVEFFLPNKKTGSGDPRTSIKMLLATIKEQLKDFRVASGQDLGE-NLSVE 811
             +TG +DY  I+EFFLP       D    ++ +LA +K+ L+  +V    D  E  L + 
Sbjct: 371  MYTGDDDY--ILEFFLPPNCRNEDDQNALLESILARMKKCLRTLKVVGNGDTNEVCLQIS 428

Query: 812  VIKTSPTEDLDSFEICDLTGIDNTLSDLAIVLHGGAGRALDHESGANVPPETNNLNQHGG 991
             +    TEDL        T +    S+       G  R     +G+    E +N    G 
Sbjct: 429  NVLIIETEDLK-------TNVHFENSE-------GCFRESPESNGSQRAHEVDN---DGN 471

Query: 992  EVV-----HLVDGRSHATNVNNVGQNNVKMEQCGINRMEEIVQRDDPIXXXXXXXXXXXX 1156
            ++      HL+   +   N  +VG+ N      G    + + + + P             
Sbjct: 472  KISIMSERHLLADDNSQNNGASVGRPN------GSGASDSLHKSNKP------------- 512

Query: 1157 XXXXXIERRDANSNNSENQRGIKRINREYGITHADLLQHSGKKQEEVAAHFCVSRSTFKR 1336
                            E +RG      E  I+   L Q+     +  A    V  +T KR
Sbjct: 513  ---------------PERRRG----KAEKTISLDVLQQYFSGSLKNAAKSLGVCPTTMKR 553

Query: 1337 ICRTHQILR*PPRKARHVDGPFNIAQEFVQSTKEE------------LPLPVHPVGQDMN 1480
            ICR H I R P RK   V+   +  ++ ++S +              LP+PV P     N
Sbjct: 554  ICRQHGISRWPSRKINKVNRSLSKLKQVIESVQGSDASFNLTSITGPLPIPVGPSSDSQN 613

Query: 1481 SKPDSVQNATTVQN 1522
             +  S      + N
Sbjct: 614  LEKASPNKVAELSN 627

