BLASTX nr result

ID: Cephaelis21_contig00037651 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00037651
         (2189 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15020.3| unnamed protein product [Vitis vinifera]              152   4e-34
ref|XP_002511393.1| hypothetical protein RCOM_1510520 [Ricinus c...   129   3e-27
ref|XP_004152555.1| PREDICTED: uncharacterized protein LOC101223...    90   2e-15
ref|XP_003544001.1| PREDICTED: uncharacterized protein LOC100820...    72   6e-10
ref|XP_002318649.1| predicted protein [Populus trichocarpa] gi|2...    67   3e-08

>emb|CBI15020.3| unnamed protein product [Vitis vinifera]
          Length = 1185

 Score =  152 bits (384), Expect = 4e-34
 Identities = 193/725 (26%), Positives = 266/725 (36%), Gaps = 161/725 (22%)
 Frame = -2

Query: 2113 CTDVKNSRKLKSSTTEAKVELLIPPKPIENDFL-IGENDWKVDGKSGKGFESRERRRSKY 1937
            C + ++  KL SS+ EA +     P  +EN    + + D     KS KG ESRER++SKY
Sbjct: 473  CHEAEDVGKLASSSEEAGLSR--SPTTMENKASNVRDGDSGTGTKSEKGVESRERKKSKY 530

Query: 1936 LSFPFIGPKKG-----LENASTSGENVPGQGHGRVDVNSKSSQ---------CNGXXXXX 1799
            LS P+I    G     LE++ T    VP      V +N  S Q         C+G     
Sbjct: 531  LSPPYINLNWGRKGPVLEDSETEDPKVPKVSCAGVGMNEASEQLGAPPPIVKCSG-KAQK 589

Query: 1798 XXXXXXSCCFDMFGKGKDIRSSSAEILTELHLAALDCLHTKGWKHSGSVGSFLYGFR--- 1628
                      +  G    I +SSA +L+EL  AALDCL+    K+  S+  F + FR   
Sbjct: 590  KRSRKSVSEGNTSGDVDSINASSAVMLSELRFAALDCLYPSERKNFVSIERFFHRFRCSM 649

Query: 1627 -----RFAFINSEIAGE--------------------------------------HIGRP 1577
                 +     + I+GE                                      H+   
Sbjct: 650  YSEASQCKMYENNISGEKEALAAEPSSLEKGPLEIKLPIKPEPKKRKKKEKVTLKHLAEL 709

Query: 1576 KEGTTMSSNYGLEES-----------NGLDAVRHTL---SCGMSKKRQKSKEDKTSAGT- 1442
              G   +S   ++ S            G++   H +   SC  SK +   K  K   GT 
Sbjct: 710  TAGIPDASGNHVKSSLLGKDSAGDELRGVNGHSHNMQEMSCQSSKGKPGRKMMKKKEGTN 769

Query: 1441 -----------VFSCVVENATNGSEVTKCQEAGSDTPKNEKVPRKRKKAGVNLGMPEKKP 1295
                       +    V   T+ S +    E     P  +  P KRKK G        K 
Sbjct: 770  SKRSKTKPTPGLLDVNVGIVTSSSLINDSGEVKPLAPNGKPEPNKRKKEGATSERLHMKF 829

Query: 1294 APGLPDLNGNIPASSDNQVTESTAFCHVLEFGTESSHTQTAGLLDPGRNNIKFVQLLKDV 1115
              G+PDLN N P  S +                E     +   LD   NN K    +KD+
Sbjct: 830  TAGIPDLNRNSPVPSPS---------------VEDLQVMSTVALDVNGNNAKPSPSMKDL 874

Query: 1114 QSMGPNFLHSMSQQSQHNYMMGETPFTLECKERQSEANLEFKERQLEANGNNTLSGSLMK 935
              MG   LHS+    + N   G         +  + +N EF     E NGN      +++
Sbjct: 875  PGMG---LHSLGVIPELNGREG---------KEGASSNGEFTVSLPEVNGNIAKFSLMVE 922

Query: 934  NMQSISF---STKCEPKKSKRKVK-MSNPTDTQVASSIPDLNGNVVDCVSSGNKTPDITS 767
            + Q  S      K +P+K KRK K M    +   A+SIPDLNGN  +  S+     +I  
Sbjct: 923  DSQVTSLLAPGGKLKPRKRKRKEKAMMECPEINCAASIPDLNGNSAEPSSTEKHLLEINC 982

Query: 766  ASSKGKVPKKRSRSK---STAIVS---DMNANHNKANNTVRVSALAST------------ 641
             SSK K  +K+ R K      IV    D+N N+NK  NT        T            
Sbjct: 983  LSSKVKPERKKRRRKGEVGNKIVGGMLDINMNYNKVANTAEALGTTLTLTFAQGSPMPSK 1042

Query: 640  -RLVQPLLPMDGLKT------------------------------------PAMPPLPGS 572
              LV+       LK                                     PA+P    S
Sbjct: 1043 EALVEAFFKFGPLKESETEVLKDSPGAQVVFIRYSDAREAFQSLEKCSPFGPALPAALAS 1102

Query: 571  TPQN---------------GDPPDLLFMKRNLEMMTSTLEKAGSSLSPEMRTKLESEIKG 437
             P                 G+   L F+++NLEMMTS LEK+G +LSPEMR KLE EIKG
Sbjct: 1103 QPVESLKTPARSSGSKPPIGEARPLFFIRQNLEMMTSMLEKSGDNLSPEMRAKLEGEIKG 1162

Query: 436  FMKKI 422
             +KK+
Sbjct: 1163 LLKKV 1167


>ref|XP_002511393.1| hypothetical protein RCOM_1510520 [Ricinus communis]
            gi|223550508|gb|EEF51995.1| hypothetical protein
            RCOM_1510520 [Ricinus communis]
          Length = 1097

 Score =  129 bits (324), Expect = 3e-27
 Identities = 159/634 (25%), Positives = 262/634 (41%), Gaps = 69/634 (10%)
 Frame = -2

Query: 2119 NYCTDVKNSRKLKSSTTEAKVE---LLIPPKPIENDFLIGENDWKVDGKSGKGFESRERR 1949
            N   D+ ++ + +     A+VE   + +P  P + +  I  +   ++    KG + RER+
Sbjct: 439  NVLNDLASNSRKRKRKKYAEVEGYDVSLPDSPPQVEASIFGSATMIE----KGSDLRERK 494

Query: 1948 RSKYLSFPFIGPK-KGL---------ENASTSGENVPGQGHGRVDVNSKSSQCNGXXXXX 1799
            +SKYLS+P++  + KGL         +  S   E+     H  +  +S S   +G     
Sbjct: 495  KSKYLSYPYVNLEHKGLPSEIEDPKSQKVSQGAEHEKAVSHQFIGSHSVSKS-SGKRFQK 553

Query: 1798 XXXXXXSCCFDMFGKGKDIRSSSAEILTELHLAALDCLHTKGWKHSGSVGSFLYGFRRFA 1619
                      D       I +S A++L+EL L A+DCL++   K+   +  F   FR  A
Sbjct: 554  KWFRKFIHNNDASNNPDLINASVADLLSELCLTAMDCLYSNESKNFDLIEWFFARFRISA 613

Query: 1618 FINSEIAGEHIGRPKEGTTMSSNYGLEESNGLDAVRHTLSCGMSKKRQKSKEDKTSAGTV 1439
            F +  I   H     +    SSN  L+  + L+  +  L     +K QK K++  SA T 
Sbjct: 614  FHDESIYEMHC----KNMIGSSNEALQGKDTLEPTQTLLDVKAEQKMQKKKKNGNSAPTK 669

Query: 1438 FSCV-------VENATNGSEVTKCQEAGSDTPKNEKVPRKRKKAGVNLGMPEKKPAPGLP 1280
               +       +  A +G+ V    + G  TP     P+K+KK        +     GLP
Sbjct: 670  IKSLRGLSDVNINIAADGTLVKDFCDMGPPTPNGRPGPKKKKKK-------QGTSPAGLP 722

Query: 1279 DLNGNIPASSDNQVTESTAFCHVLEFGTESSHTQTAGLLDPGRNNIKFVQLLKDVQSMGP 1100
            DLN +  A+S   V    +  HV      +   + AG  +   ++ +   LL D+Q  GP
Sbjct: 723  DLNSS-GATSSLLVESFESVSHVEH--EPNQREKKAGSENVNLSDAEPGSLLLDLQVTGP 779

Query: 1099 NFLHSMSQQSQHNYMMGETPFT--------LECKERQSEANLEF--------KERQLEAN 968
              ++++ ++          P +        L  KE  S + L          ++R+ ++ 
Sbjct: 780  FSVNTIPKEIMGEGSAPSIPTSDGNCAIPGLLAKEPPSISPLSAEGLPEPKKRKRKDKST 839

Query: 967  GNNT----LSGSLMKNMQSISFSTKCEPKKSKRKVKMSNPTDTQVASSIPDLN--GNVVD 806
               T    +   L   +   S   K E K++++K         + A  +PD+N   N++D
Sbjct: 840  AEQTTVAAIEAGLEGTLAESSMLVKPEKKRARKKEVKPRRPRRKSAVRLPDININYNIMD 899

Query: 805  CVSSGNKTPDITSASSKGKVPKKRSRSKSTAIVSDMNANH---NKANNTVRVSALAST-- 641
                G  T  I + +    +P K     +      +  +     K +NT +V  L ST  
Sbjct: 900  TNGEGLGTALILTFAQGVSLPSKEVLVATFCRFGPLKESEIHLMKDSNTAQVVFLKSTDA 959

Query: 640  ----------------------RLVQPLLPMDGLKTPAMPPLPGSTPQNGDPPDLLFMKR 527
                                   L+      +G   PAM    GS P   + P + F+++
Sbjct: 960  AEAARSLENCSPFGATLVNYRLHLLSAAGSKEGTTAPAMSY--GSMPSPAEAPPIDFIRQ 1017

Query: 526  NLEMMTSTLEKAGSSLSPEMRTKLESEIKGFMKK 425
            NLEMMTS LEKAG +LSPEMR KLE+EIKG +KK
Sbjct: 1018 NLEMMTSMLEKAGDNLSPEMRAKLETEIKGLLKK 1051


>ref|XP_004152555.1| PREDICTED: uncharacterized protein LOC101223078 [Cucumis sativus]
          Length = 723

 Score = 90.1 bits (222), Expect = 2e-15
 Identities = 132/528 (25%), Positives = 218/528 (41%), Gaps = 81/528 (15%)
 Frame = -2

Query: 1762 FGKGKDIRSSS-AEILTELHLAALDCLHTKGWKHSGSVGSFLYGFRRFAFINSEIAGEH- 1589
            F   +D+ S S AE L+ELH  A+DCL+     + G+V  F   FR   F+  +++ +  
Sbjct: 202  FVDNQDLMSGSPAEFLSELHFTAVDCLYPNVNNNFGTVAQFFSIFRILMFLGEKVSEDKQ 261

Query: 1588 ---------IGRPKEGTTMSSNYGLEESNG--------LDAVRHTLSCGMSKKRQ----- 1475
                      G  K     SS   +EE           L         G ++K+      
Sbjct: 262  QQQPSSAAKSGIRKRKGQSSSIKKMEEMKSKPVSGDVDLTGNAEISPAGDAQKKTPSTSK 321

Query: 1474 -KSKEDKTSAG-------TVFSCVVENATNGSEVTK-CQEAGSDTPKNEKVPRKRKKAGV 1322
             KSK+DK S G       +  S V    ++ S + K   EAG  +P      RKR+  GV
Sbjct: 322  VKSKKDKESLGRLKTKSLSALSDVNITLSSCSLLAKDSPEAGPLSPNGLPKRRKRRNNGV 381

Query: 1321 NLGMPEKKPAPGLPDLNGNIPASSDNQVTESTAFCHVL-EFGTESSHTQTAGLL-DPGRN 1148
            +   P+ KP   +PDLNG+  A +   V +  A  HV  +   E    +  G+  +  + 
Sbjct: 382  H---PQSKPTTEIPDLNGS-GAVAGLLVEDQQAVSHVAAQLKREPKRRRKRGVSKENSKA 437

Query: 1147 NIKFVQL-LKDVQSMG-PNFLHSMSQQSQHNYMMGETPFTLECKERQSEANLEFKERQLE 974
            + +F+ + + D    G PN       QS ++  +G+       K+R+ +      +    
Sbjct: 438  STEFINVNVNDSNKPGAPN-------QSVNDQTIGQDQSKSGGKKRKRKEKPPLADPDAV 490

Query: 973  ---ANGNNTLSGSLMKNMQSISFSTKCEPKKSKRK-----VKMSNPTDTQ------VASS 836
               +NG  T +     +  + +   + +PK+ +R+     +   NP+D++      V + 
Sbjct: 491  LSYSNGVGTDTSQGKDSQLTNNLPPQPKPKRRRRRKGQASLNHPNPSDSRSYIYNRVETD 550

Query: 835  IPDLNGNVVDCVSSGNKTPD----ITSASSKGKVPKKRSRSKSTAI-------------V 707
               L   ++   SS    P     IT+ S  G + +   + K + +             V
Sbjct: 551  GEGLGSLLLLTFSSEAPLPPREQVITTFSQFGSLKESEIQLKDSTVEIVFLRSADAMEAV 610

Query: 706  SDMNAN-------------HNKANNTVRVSALASTRLVQPLLPMDGLKTPAMPPLPGSTP 566
              +  N             H  A      S  A T L  P    +G   P+     G+  
Sbjct: 611  RSLKKNNIFGPTLLKYQLYHLSAPPKTSDSDRACTALAYPA--SEGTLNPSKSAESGN-- 666

Query: 565  QNGDPPDLLFMKRNLEMMTSTLEKAGSSLSPEMRTKLESEIKGFMKKI 422
            Q GD P + F+++NL+MMTS LEK+G +LSP+MR KLE +I+G +KK+
Sbjct: 667  QAGDAPPIEFIRKNLQMMTSMLEKSGDNLSPDMRAKLECDIEGLLKKV 714


>ref|XP_003544001.1| PREDICTED: uncharacterized protein LOC100820046 [Glycine max]
          Length = 935

 Score = 72.0 bits (175), Expect = 6e-10
 Identities = 38/73 (52%), Positives = 48/73 (65%), Gaps = 3/73 (4%)
 Frame = -2

Query: 634 VQPLLPMDGLKTPAMPPLP--GSTPQNGD-PPDLLFMKRNLEMMTSTLEKAGSSLSPEMR 464
           V P  P   +  P + P P  GS    G+ PP L F+K+NL+MMTSTLE +GSSLSP MR
Sbjct: 682 VMPTQPTGSMAVPGVTPTPPTGSMAMPGETPPSLQFIKQNLQMMTSTLENSGSSLSPRMR 741

Query: 463 TKLESEIKGFMKK 425
            KL+SEIK  ++K
Sbjct: 742 AKLDSEIKNLLRK 754


>ref|XP_002318649.1| predicted protein [Populus trichocarpa] gi|222859322|gb|EEE96869.1|
           predicted protein [Populus trichocarpa]
          Length = 171

 Score = 66.6 bits (161), Expect = 3e-08
 Identities = 31/52 (59%), Positives = 41/52 (78%)
 Frame = -2

Query: 577 GSTPQNGDPPDLLFMKRNLEMMTSTLEKAGSSLSPEMRTKLESEIKGFMKKI 422
           GS P+  + P + F+++NLEMMTS LEK+G +LSPEMR KLE EIKG +KK+
Sbjct: 112 GSMPKLAEAPPIDFIRQNLEMMTSMLEKSGDNLSPEMRAKLEIEIKGLLKKV 163

