BLASTX nr result

ID: Cephaelis21_contig00023132

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00023132
         (1695 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus c...   284   6e-74
ref|XP_002310176.1| predicted protein [Populus trichocarpa] gi|2...   278   3e-72
ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818...   248   3e-63
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   226   1e-56
emb|CAB62317.1| putative protein [Arabidopsis thaliana]               226   1e-56

>ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus communis]
            gi|223538452|gb|EEF40058.1| hypothetical protein
            RCOM_0603630 [Ricinus communis]
          Length = 1720

 Score =  284 bits (726), Expect = 6e-74
 Identities = 186/574 (32%), Positives = 291/574 (50%), Gaps = 28/574 (4%)
 Frame = +3

Query: 3    DERKVLAEIDPEGCALHDAIERI--SKINIGNWRDNMVATFLRYCHFQMDEIHILLLPPS 176
            +++K +A  DPEG ALHD +E+I  S  +   +  +++   L++CH Q+ +  + +  P 
Sbjct: 115  EKKKAVAGFDPEGGALHDVLEKILISTPSRKGFTTSLLNLILKHCHLQVFDTKLQVQVPI 174

Query: 177  SYESVSCLLEIKEVGAKSEYIQQTCFLGELISSFLVTSQICSFDLHMRDFEIGLQRESFI 356
              + + CLLE+KE   +SEY +  C L   +       +  S  ++ +   IG       
Sbjct: 175  LNDDLVCLLELKEFNGESEYFEHGCLLRGFLGVAFNPPKETSIVMNFKGLGIGYWMNDKE 234

Query: 357  SYVVSSTGLFVSIKMENLLFKHYFFSLPALCFSFSPADLSFISLFYSLAYKESKYTRSGR 536
            + VVSST LF  I++ +L        +P L    SP DL  +S+   L  KE K+ R+GR
Sbjct: 235  NSVVSSTDLFSCIRLNDLQLADISIRVPGLNLLLSPLDLLVLSVLGRLPLKEPKHVRNGR 294

Query: 537  QLWRIAASRISYLTPMPKLSWYRVTSSIKLWLYYLHTYENMLSIIGYPARDLVKQSCNKM 716
            QLWR+AA+R+ Y+T  P+LS + +   + +WL YL+ YE++LS IGY   +L+K+    M
Sbjct: 295  QLWRLAANRLGYVTSFPRLSLHNLADFVCMWLRYLNAYEHLLSFIGYTQVNLLKRPSIGM 354

Query: 717  LSDKKYSRSVKHHWDLISECEQKLPSEAIAQARRLIRYRASESAQLAKQAGPEFG----- 881
            L DK +  SVK HW+LIS  E++LP EAIAQARR+ RY+A+ S    + +  E+      
Sbjct: 355  LRDKMFHSSVKQHWELISRTEKELPPEAIAQARRIARYKATLSIPQGEDSYKEYSVRSQF 414

Query: 882  RIWWKVFHFLSLIWAL---TCXXXXXXXXXXXXXRVQPKIIQQSKGVCDDSFPKCCFYLS 1052
            +++ KV   L   W +                  R +PK       + +D  P+ CF L+
Sbjct: 415  QVFSKVLSLLVFTWNVIHRVVLSNIHAFLSIVFSRQEPKFDGHLGIISEDHCPQYCFLLN 474

Query: 1053 VGKFSIAVFPEKEVLPSVHGLPPSAISFSCDDLISFHISLDDFLLKYLQSFSDRHVMFAC 1232
             GK  I  F     + +V     S I  S  D+ SF +SLD  LL Y+    ++    +C
Sbjct: 475  FGKVLI-TFCSGNTIHNVIKKLESHIGISLPDIHSFCLSLDALLLVYVDDIFEQSFSLSC 533

Query: 1233 GCVKVASS-----VFEDGSDR------NNRRLREQWSKKIL--EPQDILW-----CEPAR 1358
            G +KV +S        +GS +      N  R+    SK +L  EP  I        + A 
Sbjct: 534  GKLKVKTSSVTGDTATEGSSKHHTVKGNRERMTANDSKTVLQGEPAQIFLPLQNSQKNAE 593

Query: 1359 SVEMVTSIPLLEFLVGQMWLNWKNACPNFERNDFKSSEAPCILFESKCYLIDQTANNPTS 1538
              +     P L+  +G+MWL W+ AC  ++ N+ + SE P +L E K  L+      P S
Sbjct: 594  GQDESAHGPFLKTFLGEMWLTWRRACKKYDDNEIEYSENPWLLCEIKNCLLHPGLKGPNS 653

Query: 1539 GLREEVMVVGRLNFVLENSAVLSLAVLFEQIQQA 1640
            GL +  + VG+LN  L   +++S+A+L EQ+Q A
Sbjct: 654  GLWKCNLTVGKLNITLGYLSMISMAILLEQMQHA 687


>ref|XP_002310176.1| predicted protein [Populus trichocarpa] gi|222853079|gb|EEE90626.1|
            predicted protein [Populus trichocarpa]
          Length = 868

 Score =  278 bits (711), Expect = 3e-72
 Identities = 189/578 (32%), Positives = 303/578 (52%), Gaps = 32/578 (5%)
 Frame = +3

Query: 3    DERKVLAEIDPEGCALHDAIERISKIN--IGNW-RDNMVATFLRYCHFQMDEIHILLLPP 173
            +++K +A  DPEG ALH+ +ERI  +N    NW + +++   L++CH Q+ + ++ +  P
Sbjct: 110  EKKKAVAGFDPEGSALHNVLERIL-LNPPSRNWFKTSLLNLLLKHCHLQISDTNLQVQFP 168

Query: 174  SSYESVSCLLEIKEVGAKSEYIQQTCFLGELISSFLVTSQICSFDLHMRDFEIGLQRESF 353
               ++V  LLE+K+   +SE+    C L  ++ +     ++ SF +  R F    + E  
Sbjct: 169  DLNDAVVFLLELKDFNGESEHSDPGCLLRGVVGAVFKPLKVVSFVMDFRGFGFAYKMEDQ 228

Query: 354  ISYVVSSTGLFVSIKMENLLFKHYFFSLPALCFSFSPADLSFISLFYSLAYKESKYTRSG 533
            I+++ S T L   IK+ +L    +   +P L   FSP DL  +S F  L+ KE K+ RSG
Sbjct: 229  INHISSFTDLLSCIKLNDLRVADFNIRVPKLSLLFSPLDLLVLSAFGKLSTKERKHVRSG 288

Query: 534  RQLWRIAASRISYLTPMPKLSWYRVTSSIKLWLYYLHTYENMLSIIGYPARDLVKQSCNK 713
            RQLW++AA+R+ Y+   P+LS +++   I LWL Y + YE +LS++GY A +L+K+S  K
Sbjct: 289  RQLWKLAANRLGYVPSSPRLSLHKLVDFICLWLRYQNAYEYLLSLLGYSADNLLKKSVIK 348

Query: 714  MLSDKKYSRSVKHHWDLISECEQKLPSEAIAQARRLIRYRASESAQLAKQAGPEFG---- 881
            +  DK +  SVKH+W  IS  E++LP+EAIAQARR+ RYRA  + Q  K +  E      
Sbjct: 349  LSEDKMFLNSVKHNWGEISGIEKELPAEAIAQARRIARYRAVSNIQNGKNSFKESSMDKQ 408

Query: 882  -RIWWKVFHFLSLIWALTCXXXXXXXXXXXXXRV---QPKIIQQSKGVCDDSFPKCCFYL 1049
              ++ K+     +IW +                +   +PK+        +D   + CF L
Sbjct: 409  VNVFSKILSVFIVIWNVMYKILLSILHCFFFIILFFQRPKLDWNPGNNSEDYSSRYCFLL 468

Query: 1050 SVGKFSIAVFPEKEVLPSVHGLPPSAISFSCDDLISFHISLDDFLLKYLQSFSDRHVMFA 1229
            + GK  +  F       +V     S    S  D+ SF +S+   LL Y+    ++ +  +
Sbjct: 469  NFGKI-LVTFSSTSKHKNVDERIESHTGISYSDIHSFSLSIHMLLLAYVDEVFEQSLSLS 527

Query: 1230 CGCVKV-ASSVFE----DGSDRN---NRRLREQWSKKILEPQDILWCEPAR----SVEMV 1373
            CG +KV +SSV E    D S +N   ++++R + S   L  + IL  +PA+    S    
Sbjct: 528  CGKLKVKSSSVMETAIVDRSVKNPFSSKKVRRKGSVDKL--KTILMGKPAQVFLPSQTSE 585

Query: 1374 TSI---------PLLEFLVGQMWLNWKNACPNFERNDFKSSEAPCILFESKCYLIDQTAN 1526
            TS+         P L+ L+G+MWL W+ +   ++ N+   SE P +L E K  L+D    
Sbjct: 586  TSVANPAEGTCNPYLQTLMGEMWLAWQKSSAGYKDNEIAYSETPWLLCEIKNCLMDPNLK 645

Query: 1527 NPTSGLREEVMVVGRLNFVLENSAVLSLAVLFEQIQQA 1640
             P SG  +  +  G+LN  L  S+VLSLA+L  QIQ A
Sbjct: 646  RPVSGFWKCSLTAGKLNLALGYSSVLSLAILLGQIQHA 683


>ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818143 [Glycine max]
          Length = 3602

 Score =  248 bits (634), Expect = 3e-63
 Identities = 172/570 (30%), Positives = 275/570 (48%), Gaps = 26/570 (4%)
 Frame = +3

Query: 9    RKVLAEIDPEGCALHDAIERISKINIGNWRDNMVATF----LRYCHFQMDEIHILLLPPS 176
            RK L+ +DPEGC+LHD +ERI  +     + +   +F    L+ CH     IH+ +  P 
Sbjct: 117  RKNLSALDPEGCSLHDILERI--LFAAPEKKDFTTSFWNLILKNCHLVAHCIHVEIQLPV 174

Query: 177  SYESVSCLLEIKEVGAKSEYIQQTCFLGELISSFLVTSQICSFDLHMRDFEIGLQRESFI 356
              +   C  EIKE+  +S+Y+ + C L   +SS  +  +  +  L    F   L  +   
Sbjct: 175  LNDEFMCFGEIKELSVRSKYVDKKCLLRGFLSSVFIPMKDSTLVLKGVGFRARLVGKDHT 234

Query: 357  SYVVSSTGLFVSIKMENLLFKHYFFSLPALCFSFSPADLSFISLFYSLAYKESKYTRSGR 536
              V+ S+ + + IK  +L         P L FSFSP  +S   LF  L       +R  R
Sbjct: 235  GNVLLSSDMQIDIKFRDLKLASCTLCFPELVFSFSPDGISVCLLFLKLVSNNYNQSRGAR 294

Query: 537  QLWRIAASRISYLTPMPKLSWYRVTSSIKLWLYYLHTYENMLSIIGYPARDLVKQSCNKM 716
            +LWRIAASRI ++T  P+LS++R+   I  W++Y + YEN+L +IGY      K+S +K+
Sbjct: 295  ELWRIAASRIGHVTVTPRLSFHRLVGVIGQWIHYANAYENILLLIGYSTSHTWKKSISKL 354

Query: 717  LSDKKYSRSVKHHWDLISECEQKLPSEAIAQARRLIRYRASESAQLAKQAGPEFGRIWWK 896
              +K    S   HW LIS+ E+KLP E I+ ARR+ R+RA+    +           +++
Sbjct: 355  TRNKLILSSASRHWKLISDIEKKLPVEGISLARRIARHRAALKDSINCHEDFVTTNKFFR 414

Query: 897  VFHF-LSLIWALTCXXXXXXXXXXXXXRVQPKIIQQS--KGVC-----DDSFPKCCFYLS 1052
             F F LS +W L                 + KI+Q     G C     +D    CCF L+
Sbjct: 415  PFIFLLSFMWKLISTIIHCLVNIFS----REKIVQDPDIDGCCLESLIEDPCQSCCFVLN 470

Query: 1053 VGKFSIAVFPEKEVLPSVHGLPPSAISFSCDDLISFHISLDDFLLKYLQSFSDRHVMFAC 1232
             GK  I V    E+ PSV+    S    +C   +S    +D  LL  ++   ++ +  +C
Sbjct: 471  FGKIIITVSQINEIDPSVYEKLQSLAGIACSAFLSICFCIDALLLISVKDIFEQRIFLSC 530

Query: 1233 GCVKVAS---SVFEDGSDRNNRRLREQWSKK-ILEPQDILWCEPARSVEMVTSI------ 1382
            G +KV S   ++ E+    +     +   K+ I   + I+W EPA+ + +++ I      
Sbjct: 531  GQMKVESAPLTMSEEACTMDPLSSAKGNEKEGINHMESIMWVEPAK-IFLLSEIDGGQAE 589

Query: 1383 ----PLLEFLVGQMWLNWKNACPNFERNDFKSSEAPCILFESKCYLIDQTANNPTSGLRE 1550
                  +E  + +  +NWK  C     N+ + SE PCIL + +    +    NP  G  E
Sbjct: 590  DCCDSHIEIFMKKFSVNWKRICRKLNENEIEFSENPCILSKIEISSTNPDPKNPDFGFCE 649

Query: 1551 EVMVVGRLNFVLENSAVLSLAVLFEQIQQA 1640
              +++G+LN VL +S+V SL+++  QIQ A
Sbjct: 650  CGLMLGKLNLVLTHSSVSSLSLILSQIQHA 679


>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332645140|gb|AEE78661.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 3072

 Score =  226 bits (577), Expect = 1e-56
 Identities = 163/579 (28%), Positives = 283/579 (48%), Gaps = 35/579 (6%)
 Frame = +3

Query: 9    RKVLAEIDPEGCALHDAIERI---SKINIGNWRDNMVATFLRYCHFQMDEIHILLLPPSS 179
            +KVL+ IDP+GC LHD +E++   S   I   + +     LR+   Q+  I++ +  P S
Sbjct: 116  KKVLSSIDPKGCVLHDILEKMLGRSTSQISKLKTSFSNLILRHFRIQIHGINVQVCLPGS 175

Query: 180  YESVSCLLEIKEVGAKSEYIQQTCFLGELISSFLVTSQICSFDLHMRDFEIGLQRESFIS 359
             + +SCL+EI E+ + SE       +    ++ L   +  SF L    F IG +R++ I 
Sbjct: 176  SD-LSCLMEINELRSDSENFGNLSLVRSSAAAVLFPLRRSSFTLSCFGFNIGYKRDNEIV 234

Query: 360  YVVSSTGLFVSIKMENLLFKHYFFSLPALCFSFSPADLSFISLFYSLAYKESKYTRSGRQ 539
             +     L + I + NL        +P L FSF P DL  +    +L+ K+S Y R+GR 
Sbjct: 235  DLCGFDSLVMLITLHNLQLVDLVVRVPELSFSFRPTDLPVLMGLANLSSKDSNYVRNGRY 294

Query: 540  LWRIAASRISYLTPMPKLSWYRVTSSIKLWLYYLHTYENMLSIIGYPARDLVKQSCNKML 719
            LW++AA R   +     +S+  + S + LWL Y++ YE +LS+ GY  +   K    K  
Sbjct: 295  LWKVAARRTGLMISPHSVSFQNLVSVVILWLRYVNAYEYLLSLAGYSRKMPEKSLLWKFS 354

Query: 720  SDKKYSRSVKHHWDLISECEQKLPSEAIAQARRLIRYRASESAQLAKQAGPE---FGRIW 890
             +K++  + +  W++I   E++LP+EAIA+ARR+ RYRA  ++Q A     E   +G   
Sbjct: 355  ENKRHFVTARRKWEMICNIEKELPAEAIARARRVARYRACLNSQDADDDYDESSLYGHFK 414

Query: 891  W--KVFHFLSLIWALTCXXXXXXXXXXXXXRVQPKIIQQSKGVCDDSFPKCC-----FYL 1049
            +  K    L+ IW L               ++  + +Q  +   DDS  +C        +
Sbjct: 415  YLSKTTWVLAYIWRLISRTFWSIACFLWLNKLLTQELQTDRNNEDDS--ECVSLEFHAVV 472

Query: 1050 SVGKFSIAVFPEKEVLPSVHGLPPSAISFSCDDLISFHISLDDFLLKYLQSFSDRHVMFA 1229
            ++GK S+  +PEK +  S       +      +++   +S+D+FL+ Y      +++  +
Sbjct: 473  NLGKLSVTCYPEKII--SSFMTSKDSTGHVDSNIVMLCLSVDEFLVLYTVGCLTQYLSAS 530

Query: 1230 CGCVKVASSVFED-------------GSDRNNRRLREQWSKKILEPQDILWCEPARSVEM 1370
            CG +KV SS F++              S+ N + +RE       + + IL  +PA+ +  
Sbjct: 531  CGKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKHMRE-------DVKTILDMDPAQQISK 583

Query: 1371 VTS---------IPLLEFLVGQMWLNWKNACPNFERNDFKSSEAPCILFESKCYLIDQTA 1523
              +         +  L+ L+ +MWLNW + C   +++ F  S+ PC+L + K  +  +  
Sbjct: 584  TVNNHGSDQHEGMLHLQNLLREMWLNWNSNCMKLDKSTFTISDKPCLLVDIKSCMAYEVV 643

Query: 1524 NNPTSGLREEVMVVGRLNFVLENSAVLSLAVLFEQIQQA 1640
             N  S   +  MV+G+L+ V E S++ SLA+L  QI+ A
Sbjct: 644  GNQDSEFWKCSMVLGKLDIVFEYSSLFSLALLIWQIEWA 682


>emb|CAB62317.1| putative protein [Arabidopsis thaliana]
          Length = 3071

 Score =  226 bits (577), Expect = 1e-56
 Identities = 163/579 (28%), Positives = 283/579 (48%), Gaps = 35/579 (6%)
 Frame = +3

Query: 9    RKVLAEIDPEGCALHDAIERI---SKINIGNWRDNMVATFLRYCHFQMDEIHILLLPPSS 179
            +KVL+ IDP+GC LHD +E++   S   I   + +     LR+   Q+  I++ +  P S
Sbjct: 116  KKVLSSIDPKGCVLHDILEKMLGRSTSQISKLKTSFSNLILRHFRIQIHGINVQVCLPGS 175

Query: 180  YESVSCLLEIKEVGAKSEYIQQTCFLGELISSFLVTSQICSFDLHMRDFEIGLQRESFIS 359
             + +SCL+EI E+ + SE       +    ++ L   +  SF L    F IG +R++ I 
Sbjct: 176  SD-LSCLMEINELRSDSENFGNLSLVRSSAAAVLFPLRRSSFTLSCFGFNIGYKRDNEIV 234

Query: 360  YVVSSTGLFVSIKMENLLFKHYFFSLPALCFSFSPADLSFISLFYSLAYKESKYTRSGRQ 539
             +     L + I + NL        +P L FSF P DL  +    +L+ K+S Y R+GR 
Sbjct: 235  DLCGFDSLVMLITLHNLQLVDLVVRVPELSFSFRPTDLPVLMGLANLSSKDSNYVRNGRY 294

Query: 540  LWRIAASRISYLTPMPKLSWYRVTSSIKLWLYYLHTYENMLSIIGYPARDLVKQSCNKML 719
            LW++AA R   +     +S+  + S + LWL Y++ YE +LS+ GY  +   K    K  
Sbjct: 295  LWKVAARRTGLMISPHSVSFQNLVSVVILWLRYVNAYEYLLSLAGYSRKMPEKSLLWKFS 354

Query: 720  SDKKYSRSVKHHWDLISECEQKLPSEAIAQARRLIRYRASESAQLAKQAGPE---FGRIW 890
             +K++  + +  W++I   E++LP+EAIA+ARR+ RYRA  ++Q A     E   +G   
Sbjct: 355  ENKRHFVTARRKWEMICNIEKELPAEAIARARRVARYRACLNSQDADDDYDESSLYGHFK 414

Query: 891  W--KVFHFLSLIWALTCXXXXXXXXXXXXXRVQPKIIQQSKGVCDDSFPKCC-----FYL 1049
            +  K    L+ IW L               ++  + +Q  +   DDS  +C        +
Sbjct: 415  YLSKTTWVLAYIWRLISRTFWSIACFLWLNKLLTQELQTDRNNEDDS--ECVSLEFHAVV 472

Query: 1050 SVGKFSIAVFPEKEVLPSVHGLPPSAISFSCDDLISFHISLDDFLLKYLQSFSDRHVMFA 1229
            ++GK S+  +PEK +  S       +      +++   +S+D+FL+ Y      +++  +
Sbjct: 473  NLGKLSVTCYPEKII--SSFMTSKDSTGHVDSNIVMLCLSVDEFLVLYTVGCLTQYLSAS 530

Query: 1230 CGCVKVASSVFED-------------GSDRNNRRLREQWSKKILEPQDILWCEPARSVEM 1370
            CG +KV SS F++              S+ N + +RE       + + IL  +PA+ +  
Sbjct: 531  CGKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKHMRE-------DVKTILDMDPAQQISK 583

Query: 1371 VTS---------IPLLEFLVGQMWLNWKNACPNFERNDFKSSEAPCILFESKCYLIDQTA 1523
              +         +  L+ L+ +MWLNW + C   +++ F  S+ PC+L + K  +  +  
Sbjct: 584  TVNNHGSDQHEGMLHLQNLLREMWLNWNSNCMKLDKSTFTISDKPCLLVDIKSCMAYEVV 643

Query: 1524 NNPTSGLREEVMVVGRLNFVLENSAVLSLAVLFEQIQQA 1640
             N  S   +  MV+G+L+ V E S++ SLA+L  QI+ A
Sbjct: 644  GNQDSEFWKCSMVLGKLDIVFEYSSLFSLALLIWQIEWA 682

