BLASTX nr result

ID: Cephaelis21_contig00023224 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00023224
         (1214 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   403   e-110
ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatul...   369   e-100
ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   365   1e-98
ref|XP_002305239.1| SET domain protein [Populus trichocarpa] gi|...   365   1e-98
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   364   3e-98

>ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score =  403 bits (1035), Expect = e-110
 Identities = 204/364 (56%), Positives = 255/364 (70%), Gaps = 15/364 (4%)
 Frame = -1

Query: 1214 VDDAIWDAEKATGKLKSEWEEASVVMCQLKFKLALRNFKAWLWASATISSRTMHIPWDDA 1035
            VDDAIW  E+A  K + EW++A  +M +LK K  L+NF+AWLWAS+T+SSRTMHIPWDDA
Sbjct: 142  VDDAIWVTERAILKAELEWKKAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDA 201

Query: 1034 GCLCPVGDFFNYAAPTEEQCDCSTLETSGNGMSVQT---------------EFDANAQRL 900
            GCLCPVGDF+NYAAP EE C    L+ S N  S+Q                + D  +QRL
Sbjct: 202  GCLCPVGDFYNYAAPGEEPCGWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDVLSQRL 261

Query: 899  IDAGFEEDVAAYCFFARRNYREGEQVLLSYGMYTNLELLEHYGFLLNDNPNDKVFIPLEP 720
             D G++ED+AAYCF+AR+NY++GEQVLLSYG YTNLELLEHYGFLL++NPNDK FIPLEP
Sbjct: 262  TDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEP 321

Query: 719  DMYSLCSWPKELLYIDQEGKPSFALLSTMRLWATPSNKRRSIGHLAYSGKQLSSENEIIV 540
            ++Y+  SWPK+ LYI Q GKPSFALLS +RLWATP+++RRS+GHL YSG QLSSENEI V
Sbjct: 322  EVYASSSWPKDSLYIHQNGKPSFALLSALRLWATPASQRRSVGHLVYSGTQLSSENEIFV 381

Query: 539  MAWIVKKCQDILCNFMTSVEEDKSLLGIIEKIQDNNLPREDLEKLTPTCKRELRAFLQRH 360
            M WI K C  +L N  TSVEED  LL  ++K+QD +LP E +     +   E  AFL+ H
Sbjct: 382  MEWIAKSCHVVLENLPTSVEEDSLLLCALDKMQDPDLPME-VGNALRSSGVEFSAFLEAH 440

Query: 359  DVTTAEDFTVLCKSRKARQSISTWKLAVEWRLNYKRKLQSCITG*TQRISKFLDKYCATI 180
            D+   +    L  S KAR+S+  WKLAV+WRL +KR L  CI+    R ++ +     T 
Sbjct: 441  DLKIGDGNVGLLLSEKARRSMERWKLAVQWRLRHKRILVDCIS----RCTEIISSLSPTF 496

Query: 179  LGKR 168
            LG R
Sbjct: 497  LGHR 500


>ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatula]
            gi|355517485|gb|AES99108.1| Protein SET DOMAIN GROUP
            [Medicago truncatula]
          Length = 532

 Score =  369 bits (948), Expect = e-100
 Identities = 192/346 (55%), Positives = 235/346 (67%), Gaps = 18/346 (5%)
 Frame = -1

Query: 1214 VDDAIWDAEKATGKLKSEWEEASVVMCQLKFKLALRNFKAWLWASAT------------- 1074
            VD+A+W  EKA  K KSEW+EA  +M  L FK  L  FKAW+WA+AT             
Sbjct: 178  VDEAMWVTEKAVQKAKSEWKEAHALMEDLMFKPQLLTFKAWVWAAATGRTVPETFHLPGL 237

Query: 1073 ISSRTMHIPWDDAGCLCPVGDFFNYAAPTEEQCDCSTLE--TSGNGMSV---QTEFDANA 909
            ISSRT+HIPWD+AGCLCPVGD FNY AP EE      ++   S   M+V   + + D N+
Sbjct: 238  ISSRTLHIPWDEAGCLCPVGDLFNYDAPGEELSGVEDVDHFLSNGDMNVVIDEGQIDFNS 297

Query: 908  QRLIDAGFEEDVAAYCFFARRNYREGEQVLLSYGMYTNLELLEHYGFLLNDNPNDKVFIP 729
            QRL D GFEED  AYCF+AR NY++G+QVLL YG YTNLELLEHYGFLL +NPNDK+FIP
Sbjct: 298  QRLTDGGFEEDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKIFIP 357

Query: 728  LEPDMYSLCSWPKELLYIDQEGKPSFALLSTMRLWATPSNKRRSIGHLAYSGKQLSSENE 549
            LEP MY+  SW KE LYI   GKPSFALL+ +RLWATP NKRRSIGHLAYSG QLS++NE
Sbjct: 358  LEPAMYTSTSWSKESLYIHPNGKPSFALLAALRLWATPHNKRRSIGHLAYSGSQLSADNE 417

Query: 548  IIVMAWIVKKCQDILCNFMTSVEEDKSLLGIIEKIQDNNLPREDLEKLTPTCKRELRAFL 369
            IIVM W+ K C  +L N  TS+E+D  LL  ++  QD  +    + KL  + + E+  FL
Sbjct: 418  IIVMKWLSKTCDAVLKNMPTSIEDDTLLLNALDCSQD-FITFMKIVKLM-SSRDEVYTFL 475

Query: 368  QRHDVTTAEDFTVLCKSRKARQSISTWKLAVEWRLNYKRKLQSCIT 231
            + H++T A  F     S+K R+S+  WKLAV WRL YKR L  CI+
Sbjct: 476  EAHNITDALSFCDTISSKKTRRSMDRWKLAVLWRLRYKRVLVDCIS 521


>ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
          Length = 475

 Score =  365 bits (937), Expect = 1e-98
 Identities = 180/328 (54%), Positives = 229/328 (69%), Gaps = 9/328 (2%)
 Frame = -1

Query: 1214 VDDAIWDAEKATGKLKSEWEEASVVMCQLKFKLALRNFKAWLWASATISSRTMHIPWDDA 1035
            VD+A+W  EKA  K KSEW+EA  +M  L FK     FKAW+WA+ATISSRT+HIPWD+A
Sbjct: 146  VDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATISSRTLHIPWDEA 205

Query: 1034 GCLCPVGDFFNYAAPTEEQCDCSTLETSGNGMSVQTEFDANAQRLIDAGFEEDVAAYCFF 855
            GCLCPVGD FNY AP  E      L+ +        + D+++ RL D GFEED  AYCF+
Sbjct: 206  GCLCPVGDLFNYDAPGIEPSGIEDLDHA-------EQLDSHSWRLTDGGFEEDANAYCFY 258

Query: 854  ARRNYREGEQVLLSYGMYTNLELLEHYGFLLNDNPNDKVFIPLEPDMYSLCSWPKELLYI 675
            AR +Y++G+QVLL YG YTNLELLEHYGFLL +NPNDKVFIPLEP +YS  SW KE LYI
Sbjct: 259  AREHYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEPALYSSTSWSKESLYI 318

Query: 674  DQEGKPSFALLSTMRLWATPSNKRRSIGHLAYSGKQLSSENEIIVMAWIVKKCQDILCNF 495
               GKPSFALL+ +RLWATP N+RRS+GHL YSG ++S++NEI +M W+ K C  +L N 
Sbjct: 319  HHNGKPSFALLAALRLWATPQNRRRSVGHLVYSGSRVSTDNEIFIMKWLSKTCDAVLRNL 378

Query: 494  MTSVEEDKSLLGIIEKIQDNNLPREDLEKLTPTCKRELRAFLQRHDVTTAEDFTVLCKSR 315
             TS+EED  LL  ++  QD +   E + KL  + + E   FL+ H++     FT +  SR
Sbjct: 379  PTSLEEDTLLLNAMDNSQDFSTFME-ITKLV-SSREETYTFLETHNMKDTHSFTDVILSR 436

Query: 314  KARQSISTWKLAVEWRLNYKRKLQSCIT 231
            KAR+S+  WKLAV+WRL YK+ +  CI+
Sbjct: 437  KARRSMDRWKLAVQWRLKYKKVIFDCIS 464


>ref|XP_002305239.1| SET domain protein [Populus trichocarpa] gi|222848203|gb|EEE85750.1|
            SET domain protein [Populus trichocarpa]
          Length = 518

 Score =  365 bits (937), Expect = 1e-98
 Identities = 189/341 (55%), Positives = 233/341 (68%), Gaps = 15/341 (4%)
 Frame = -1

Query: 1208 DAIWDAEKATGKLKSEWEEASVVMCQLKFKLALRNFKAWLWASATISSRTMHIPWDDAGC 1029
            D +   +KA  K KSEW+EA+ +M  LK K  L  F+AW+WASATISSR +HIPWD+AGC
Sbjct: 162  DVLASFKKAVSKAKSEWKEANSLMDALKLKPQLLTFRAWIWASATISSRALHIPWDEAGC 221

Query: 1028 LCPVGDFFNYAAPTEEQCD---------CSTLETSG--NGMS----VQTEFDANAQRLID 894
            LCPVGD FNYAAP EE  D          S+LE S   NG +    +  + D   +RL D
Sbjct: 222  LCPVGDLFNYAAPGEESNDLENVVHWMNASSLEDSSLSNGETTDDFIGDQPDIGLERLTD 281

Query: 893  AGFEEDVAAYCFFARRNYREGEQVLLSYGMYTNLELLEHYGFLLNDNPNDKVFIPLEPDM 714
             GF+E++AAYCF+AR+NY++G QVLL YG YTNLELLEHYGFLLN+NPNDKVFIPLEP M
Sbjct: 282  GGFDENMAAYCFYARKNYKKGTQVLLGYGTYTNLELLEHYGFLLNENPNDKVFIPLEPSM 341

Query: 713  YSLCSWPKELLYIDQEGKPSFALLSTMRLWATPSNKRRSIGHLAYSGKQLSSENEIIVMA 534
            YS  SWPK  +YI Q+GKPSFALLS +RLWATP N+RRSI HL YSG +LS  NEI V+ 
Sbjct: 342  YSFISWPKVSMYIHQDGKPSFALLSALRLWATPPNQRRSISHLVYSGSRLSVYNEISVLK 401

Query: 533  WIVKKCQDILCNFMTSVEEDKSLLGIIEKIQDNNLPREDLEKLTPTCKRELRAFLQRHDV 354
            WI K C  IL N  T +EED  LL  I KI++ + P E    L  T   E RAFL+  D+
Sbjct: 402  WISKNCAMILSNLPTVIEEDSLLLSTINKIENFDKPTE----LVCTSGGEARAFLEASDL 457

Query: 353  TTAEDFTVLCKSRKARQSISTWKLAVEWRLNYKRKLQSCIT 231
               ++ + L  S K ++ I  WKLAV+WR++YK+ L  CI+
Sbjct: 458  QKGKNGSELMFSGKTKRVIERWKLAVQWRISYKKTLIDCIS 498


>ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
            gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP,
            putative [Ricinus communis]
          Length = 510

 Score =  364 bits (934), Expect = 3e-98
 Identities = 190/352 (53%), Positives = 240/352 (68%), Gaps = 18/352 (5%)
 Frame = -1

Query: 1214 VDDAIWDAEKATGKLKSEWEEASVVMCQLKFKLALRNFKAWLWASATISSRTMHIPWDDA 1035
            VDDAIW AEKA  K + + +EA  +M +L+ K      +AW+WA ATISSRTMHIPWD+A
Sbjct: 148  VDDAIWTAEKAISKAELDRKEAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIPWDEA 207

Query: 1034 GCLCPVGDFFNYAAPTEEQCD---------CSTLETSGNGMSVQTE------FDANAQRL 900
            GCLCPVGDFFNYAAP EE             S LE +       T       FD   + L
Sbjct: 208  GCLCPVGDFFNYAAPGEESSSPENDESWKPASCLEDASLSSERSTSNFCSETFDVQLKSL 267

Query: 899  IDAGFEEDVAAYCFFARRNYREGEQVLLSYGMYTNLELLEHYGFLLNDNPNDKVFIPLEP 720
             D GF+ED AAYCF+AR+NY++G QVLLSYG YTNLELLEHYGFLLN+NPNDKVFIPLE 
Sbjct: 268  TDGGFDEDKAAYCFYARQNYKKGAQVLLSYGTYTNLELLEHYGFLLNENPNDKVFIPLEL 327

Query: 719  DMYSLCSWPKELLYIDQEGKPSFALLSTMRLWATPSNKRRSIGHLAYSGKQLSSENEIIV 540
             M S  +WPKE +YI Q+GKPSF+LL  +RLWATPSN+RRS+GHLAYSG QLS ENE+ +
Sbjct: 328  SMQSSNTWPKESMYIHQDGKPSFSLLCALRLWATPSNRRRSMGHLAYSGSQLSVENEVSI 387

Query: 539  MAWIVKKCQDILCNFMTSVEEDKSLLGIIEKIQDNNLPREDLEKLTPTCKRELRAFLQRH 360
            + WI +KC  +L    T+VEED  LL  I+KIQ+ + P E L K+    + +  AF++ H
Sbjct: 388  LKWISRKCHAVLKKLPTTVEEDSLLLSAIDKIQNCHSPLE-LGKMLHGFEGQASAFVEAH 446

Query: 359  ---DVTTAEDFTVLCKSRKARQSISTWKLAVEWRLNYKRKLQSCITG*TQRI 213
               ++    + T+LC   KA++S+  WKLAV+WRL+YK+ L  CI+  T+ I
Sbjct: 447  NLLNIKIGTESTMLC--GKAKRSMERWKLAVKWRLSYKKTLIDCISYCTEVI 496

