BLASTX nr result

ID: Cephaelis21_contig00007370

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00007370
         (1949 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI22504.3| unnamed protein product [Vitis vinifera]              220   9e-55
ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit...   220   9e-55
ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Gly...   215   4e-53
ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306...   210   1e-51
ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|2...   201   4e-49

>emb|CBI22504.3| unnamed protein product [Vitis vinifera]
          Length = 977

 Score =  220 bits (561), Expect = 9e-55
 Identities = 153/446 (34%), Positives = 212/446 (47%), Gaps = 19/446 (4%)
 Frame = +3

Query: 3    PTDDEGWLCPGCDCKVDCMEVLSDFQGSNFSVLDQWEVVFPXXXXXXX-----SGMKMDD 167
            P DDEGWLCP CDCKVDCM++L+D QG+  SV+D WE VFP            SG   DD
Sbjct: 351  PPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDD 410

Query: 168  YSGXXXXXXXXXXXXXXXXXXX------HMEEESGSEGSDYFSACDDHVTTLDDRQILGL 329
                                          +E   S+ SD+ SA DD V + ++ Q LGL
Sbjct: 411  SEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGL 470

Query: 330  XXXXXXXXXXXXXXXXXXXNVKPXXXXXXXXXXXXXXXLDTILDGNESLGDEERRASSIS 509
                               +  P                D   D  +     +RR  S +
Sbjct: 471  PSDDSEDD-----------DFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519

Query: 510  DQSLPRVGSVGEKVNVGRANRQSLSNELEYLLQSN----DVPISAKRHVERLDYQKLHQE 677
            +  L       E+   GR  + +L +EL  +L+SN    + P+SAKRHVERLDY+KLH E
Sbjct: 520  EDGLD------EQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDE 573

Query: 678  TYGDTSSDSSD-EDYGETASPKRRKNCAKKATPLSINAPPTIN-NGGDSKDGNHS--QSE 845
             YG+ SSDSSD ED+ E   P++RKN +     +S N   +I  NG ++KD  H    + 
Sbjct: 574  AYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNTKDIKHDLEAAG 633

Query: 846  CENKASKEINKNTQVGSLKYFTDLESATAEGGSNGKSSIRSHKRLDEAAVQRLLQSFREN 1025
            C  K       N +  +       + + + G +  KS   S+K+L EA  +RL +SF+EN
Sbjct: 634  CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693

Query: 1026 QYPKHTVKECLAKELGLRIQQVSKWFENARWSSRHSSRMESKMAGGAFANGTSSPEMSET 1205
            QYP   +KE LA+ELG+  +QVSKWFENARWS RH    E+  AG +     +S   ++ 
Sbjct: 694  QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEAS-AGKSAVKKDASTSQTDQ 752

Query: 1206 APELRPKFLAENASSNGPELTPLPQA 1283
             PE   + +   +S NG      P+A
Sbjct: 753  KPE--QEVVLRESSHNGVGKKESPKA 776


>ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera]
          Length = 968

 Score =  220 bits (561), Expect = 9e-55
 Identities = 153/446 (34%), Positives = 212/446 (47%), Gaps = 19/446 (4%)
 Frame = +3

Query: 3    PTDDEGWLCPGCDCKVDCMEVLSDFQGSNFSVLDQWEVVFPXXXXXXX-----SGMKMDD 167
            P DDEGWLCP CDCKVDCM++L+D QG+  SV+D WE VFP            SG   DD
Sbjct: 351  PPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDD 410

Query: 168  YSGXXXXXXXXXXXXXXXXXXX------HMEEESGSEGSDYFSACDDHVTTLDDRQILGL 329
                                          +E   S+ SD+ SA DD V + ++ Q LGL
Sbjct: 411  SEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGL 470

Query: 330  XXXXXXXXXXXXXXXXXXXNVKPXXXXXXXXXXXXXXXLDTILDGNESLGDEERRASSIS 509
                               +  P                D   D  +     +RR  S +
Sbjct: 471  PSDDSEDD-----------DFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519

Query: 510  DQSLPRVGSVGEKVNVGRANRQSLSNELEYLLQSN----DVPISAKRHVERLDYQKLHQE 677
            +  L       E+   GR  + +L +EL  +L+SN    + P+SAKRHVERLDY+KLH E
Sbjct: 520  EDGLD------EQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDE 573

Query: 678  TYGDTSSDSSD-EDYGETASPKRRKNCAKKATPLSINAPPTIN-NGGDSKDGNHS--QSE 845
             YG+ SSDSSD ED+ E   P++RKN +     +S N   +I  NG ++KD  H    + 
Sbjct: 574  AYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNTKDIKHDLEAAG 633

Query: 846  CENKASKEINKNTQVGSLKYFTDLESATAEGGSNGKSSIRSHKRLDEAAVQRLLQSFREN 1025
            C  K       N +  +       + + + G +  KS   S+K+L EA  +RL +SF+EN
Sbjct: 634  CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693

Query: 1026 QYPKHTVKECLAKELGLRIQQVSKWFENARWSSRHSSRMESKMAGGAFANGTSSPEMSET 1205
            QYP   +KE LA+ELG+  +QVSKWFENARWS RH    E+  AG +     +S   ++ 
Sbjct: 694  QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEAS-AGKSAVKKDASTSQTDQ 752

Query: 1206 APELRPKFLAENASSNGPELTPLPQA 1283
             PE   + +   +S NG      P+A
Sbjct: 753  KPE--QEVVLRESSHNGVGKKESPKA 776


>ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Glycine max]
          Length = 820

 Score =  215 bits (547), Expect = 4e-53
 Identities = 150/458 (32%), Positives = 213/458 (46%), Gaps = 25/458 (5%)
 Frame = +3

Query: 3    PTDDEGWLCPGCDCKVDCMEVLSDFQGSNFSVLDQWEVVFPXXXXXXXSGMKMDDYSGXX 182
            P  DEGWLCPGCDCK DCM++++D  G++ S+ D WE VFP       +G  MD+  G  
Sbjct: 385  PPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEAASF--AGNNMDNNLGLP 442

Query: 183  XXXXXXXXXXXXXXXXXHME-EESGSEGSDYFSACDDHVTTLDDRQILGLXXXXXXXXXX 359
                              +E +ES S+ S+Y SA +       + Q LGL          
Sbjct: 443  SDDSDDDDYNPNGSDDVKIEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDY 502

Query: 360  XXXXXXXXXNVKPXXXXXXXXXXXXXXXLDTILDGNESLGDEERRASSISDQSLP-RVGS 536
                      V                        ++   D E  A++  D + P + G 
Sbjct: 503  DPDAPDVDCKVNEES------------------SSSDFTSDSEDLAAAFEDNTSPGQDGG 544

Query: 537  VGEKVNVGRANRQSLSNELEYLL-----QSNDVPISAKRHVERLDYQKLHQETYGDTSSD 701
            +      G+  + S+++EL  LL     Q    P+S KRHVERLDY+KL++ETY   +SD
Sbjct: 545  INSSKKKGKVGKLSMADELSSLLEPDSGQGGPTPVSGKRHVERLDYKKLYEETYHSDTSD 604

Query: 702  SSDEDYGETASPKRRKNCAKKATPLSINAPPTINNGGDSKDGNHSQSECENKASKEINKN 881
              DED+ + A+P R+K      TP+S NA  + NN   +   N  Q++ EN  S      
Sbjct: 605  --DEDWNDAAAPSRKKKLTGNVTPVSPNANAS-NNSIHTLKRNAHQNKVENTNSSPTKS- 660

Query: 882  TQVGSLKYFTDLESATAEGGSNGKSSIRSHKRLDEAAVQRLLQSFRENQYPKHTVKECLA 1061
                       L+  +  G  + +S   +HKRL EA VQRL +SF+ENQYP  + KE LA
Sbjct: 661  -----------LDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKSFKENQYPDRSTKESLA 709

Query: 1062 KELGLRIQQVSKWFENARWSSRHSSRMESKMAGGAFANGTS--------------SPEM- 1196
            +ELGL  QQV+KWF+N RWS RHSS+ME+     A    T               SPE+ 
Sbjct: 710  QELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDGRAENEGEKQCESMSPEVS 769

Query: 1197 ---SETAPELRPKFLAENASSNGPELTPLPQASPCIEE 1301
               S+T    + K L+E  S    ++  L  +SP + +
Sbjct: 770  GKNSKTTSSRKRKHLSEPLSEAQLDINGLATSSPNVHQ 807


>ref|XP_003535696.1| PREDICTED: uncharacterized protein LOC100306715 [Glycine max]
          Length = 963

 Score =  210 bits (534), Expect = 1e-51
 Identities = 139/390 (35%), Positives = 192/390 (49%), Gaps = 8/390 (2%)
 Frame = +3

Query: 3    PTDDEGWLCPGCDCKVDCMEVLSDFQGSNFSVLDQWEVVFPXXXXXXXSGMKMDDYSGXX 182
            P  DEGWLCPGCDCK DCM++++D  G++ S+ D WE VFP       +G  MD+ SG  
Sbjct: 527  PPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEAASF--AGNNMDNNSGVP 584

Query: 183  XXXXXXXXXXXXXXXXXHME-EESGSEGSDYFSACDDHVTTLDDRQILGLXXXXXXXXXX 359
                              +E +ES S+ S+Y SA +       + Q LGL          
Sbjct: 585  SDDSDDDDYNPNGPDDVKVEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDY 644

Query: 360  XXXXXXXXXNVKPXXXXXXXXXXXXXXXLDTILDGNESLGDEERRASSISDQSLP-RVGS 536
                      V                        ++   D E  A++I D + P + G 
Sbjct: 645  DPDAPDVECKVNEES------------------SSSDFTSDSEDLAAAIEDNTSPGQDGG 686

Query: 537  VGEKVNVGRANRQ-SLSNELEYLLQSND-----VPISAKRHVERLDYQKLHQETYGDTSS 698
            +      G+  ++ SL +EL  LL+ +       P+S KRHVERLDY+KL++ETY   +S
Sbjct: 687  ISSSKKKGKVGKKLSLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTS 746

Query: 699  DSSDEDYGETASPKRRKNCAKKATPLSINAPPTINNGGDSKDGNHSQSECENKASKEINK 878
            D  DED+ +TA+P  +K      TP+S        NG  S +  H+     ++ + E   
Sbjct: 747  D--DEDWNDTAAPSGKKKLTGNVTPVS-------PNGNASNNSIHTPKRNAHQNNVENTN 797

Query: 879  NTQVGSLKYFTDLESATAEGGSNGKSSIRSHKRLDEAAVQRLLQSFRENQYPKHTVKECL 1058
            N+   SL      E  +  G  + KS   +HKRL EA VQRL +SF+ENQYP  T KE L
Sbjct: 798  NSPTKSL------EGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESL 851

Query: 1059 AKELGLRIQQVSKWFENARWSSRHSSRMES 1148
            A+ELGL  QQV+KWF N RWS RHSS+ME+
Sbjct: 852  AQELGLTYQQVAKWFGNTRWSFRHSSQMET 881


>ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|222847505|gb|EEE85052.1|
            predicted protein [Populus trichocarpa]
          Length = 930

 Score =  201 bits (512), Expect = 4e-49
 Identities = 142/407 (34%), Positives = 197/407 (48%), Gaps = 14/407 (3%)
 Frame = +3

Query: 3    PTDDEGWLCPGCDCKVDCMEVLSDFQGSNFSVLDQWEVVFPXXXXXXXSGMKMDDYSGXX 182
            P  DEGWLCPGCDCKVDC+++L+D QG+N S+ D+W+ VFP       SG K+D   G  
Sbjct: 512  PPGDEGWLCPGCDCKVDCIDLLNDSQGTNISISDRWDNVFPEAAAVA-SGQKLDYNFGLS 570

Query: 183  XXXXXXXXXXXXXXXXXHM-EEESGSEGSDYFSACDDHVTTLDDRQILGLXXXXXXXXXX 359
                                +EES S+ SD+ SA D+     DD+Q LGL          
Sbjct: 571  SDDSDDNDYDPDGPDIDEKSQEESSSDESDFSSASDEFEAPPDDKQYLGLPSDDSEDDDY 630

Query: 360  XXXXXXXXXNVKPXXXXXXXXXXXXXXXLDTILDGNE-SLGDEERRASSISDQSLPRVGS 536
                      +K                LD  L+G+  SLGDE            P   S
Sbjct: 631  DPDAPVLEEKLKQESSSSDFTSDSED--LDATLNGDGLSLGDEYHMPIE------PHEDS 682

Query: 537  VGEKVNVGRANRQSLSNELEYLL-----QSNDVPISAKRHVERLDYQKLHQETYGDTSSD 701
             G +   G     SL+++L  +L     Q    P+S KR++ERLDY+KL+ ETYG+  + 
Sbjct: 683  NGRRSRFGGKKNHSLNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGNICT- 741

Query: 702  SSDEDYGETASP-KRRKNCAKKATPLSINAPPTINNGGDSKDGNHSQSECENKASKEINK 878
            SSD+D+ +T +P KRRKN    A  ++        NG +SK+ N      E K ++  + 
Sbjct: 742  SSDDDFTDTVAPRKRRKNTGDVAMGIANGDASVTENGLNSKNMNQ-----ELKKNEHTSG 796

Query: 879  NTQVGSLKYFTDLESATAEGGSN--GKSSIR----SHKRLDEAAVQRLLQSFRENQYPKH 1040
             T   S    T++  A    G +  G SS R    ++K+L EA  Q+L   F+EN+YP  
Sbjct: 797  RTHQNSSFQDTNVSPAKTHVGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQ 856

Query: 1041 TVKECLAKELGLRIQQVSKWFENARWSSRHSSRMESKMAGGAFANGT 1181
              K  LA+ELG+  +QV+KWF NARWS  HSS   +  A  A   G+
Sbjct: 857  AAKASLAEELGITFEQVNKWFMNARWSFNHSSPEGTSKAESASGKGS 903

