BLASTX nr result

ID: Cephaelis21_contig00026922 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00026922
         (1747 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001235560.1| uncharacterized protein LOC100527145 [Glycin...   220   8e-55
emb|CAN73381.1| hypothetical protein VITISV_003162 [Vitis vinifera]   215   3e-53
ref|XP_002304561.1| predicted protein [Populus trichocarpa] gi|2...   211   4e-52
ref|XP_002297973.1| predicted protein [Populus trichocarpa] gi|2...   211   5e-52
ref|NP_194120.1| uncharacterized protein [Arabidopsis thaliana] ...   209   2e-51

>ref|NP_001235560.1| uncharacterized protein LOC100527145 [Glycine max]
            gi|255631654|gb|ACU16194.1| unknown [Glycine max]
          Length = 243

 Score =  220 bits (561), Expect = 8e-55
 Identities = 133/247 (53%), Positives = 161/247 (65%), Gaps = 12/247 (4%)
 Frame = -1

Query: 1699 MASSF-----LQGALLQKSNFLGHTILSNSTHKNCSLTFPKQTPSVHFSPVAKFNLFEML 1535
            M SSF     L G+LL +S FLG   L++   +N + T   +  +    P AKF++ ++L
Sbjct: 1    MMSSFVVLHGLHGSLL-RSQFLGQDTLTHLYPRNKASTIHNKPTTAQ--PRAKFDMLQVL 57

Query: 1534 GGRGLCNGEAGIQEELKKNVSVE----TSKTKVEISEIPEVS--TAAGASIPAK-GFEKE 1376
            GGRGLCNGEAG+++ELK+ + V+     S T  +  E+ E S  T + AS+ A+ GFEKE
Sbjct: 58   GGRGLCNGEAGLKQELKRELGVDEKAPASATSDKEQELEEESSTTQSLASVAAEDGFEKE 117

Query: 1375 LQGLTGGFPGGEKGLLKFIQENXXXXXXXXPETPLSLGFNQSLVXXXXXXXXXXXXPGMI 1196
            L GLTGGFPGGEKGL KFIQEN            L L    +L             PGMI
Sbjct: 118  LMGLTGGFPGGEKGLKKFIQENPPPLKPSQGSKSLKL----ALPKKPKPPELPLLLPGMI 173

Query: 1195 VIVKNPNSPYYMYCGNVQRITDGKAGVLFEGGVWDRLITFRLEELERREKGPPMVNPRSA 1016
             IVKNPN+P+YMYCG VQRITDGKAGVLFEGG WDRLITFRLEELERREKGPPM NP+SA
Sbjct: 174  AIVKNPNNPFYMYCGIVQRITDGKAGVLFEGGNWDRLITFRLEELERREKGPPMKNPKSA 233

Query: 1015 ILETMLE 995
            +LE  LE
Sbjct: 234  VLEPFLE 240


>emb|CAN73381.1| hypothetical protein VITISV_003162 [Vitis vinifera]
          Length = 231

 Score =  215 bits (548), Expect = 3e-53
 Identities = 127/239 (53%), Positives = 154/239 (64%), Gaps = 4/239 (1%)
 Frame = -1

Query: 1699 MASSF----LQGALLQKSNFLGHTILSNSTHKNCSLTFPKQTPSVHFSPVAKFNLFEMLG 1532
            MA SF    LQ  L  KS+FLG     N+  K  SL+  +    V  S  AKF+LF ++G
Sbjct: 1    MAXSFTVPSLQRPLPHKSHFLGQGHFPNNIQK-ASLSRTRTPLPVKAS--AKFDLFGIMG 57

Query: 1531 GRGLCNGEAGIQEELKKNVSVETSKTKVEISEIPEVSTAAGASIPAKGFEKELQGLTGGF 1352
            GRGLCNGE G+Q+ELK+N+    S   V+  E P +  AA   +P  GF+KEL GLTGGF
Sbjct: 58   GRGLCNGEEGLQQELKRNIEPAPSPDSVKDEEKPAL--AAVDDVPEDGFDKELLGLTGGF 115

Query: 1351 PGGEKGLLKFIQENXXXXXXXXPETPLSLGFNQSLVXXXXXXXXXXXXPGMIVIVKNPNS 1172
            PGGEKGL +F+++N         E         + +            PGMI IVKNPN+
Sbjct: 116  PGGEKGLKQFLEKNPPP------EKTSGNIIENARLRKPKPPELPLLMPGMIAIVKNPNN 169

Query: 1171 PYYMYCGNVQRITDGKAGVLFEGGVWDRLITFRLEELERREKGPPMVNPRSAILETMLE 995
            P+YMYCG VQRITDGKAGVLFEGG WDRLITFRLEEL+RR+KGPPM NP+SAILET+LE
Sbjct: 170  PFYMYCGIVQRITDGKAGVLFEGGNWDRLITFRLEELQRRDKGPPMKNPKSAILETLLE 228


>ref|XP_002304561.1| predicted protein [Populus trichocarpa] gi|222841993|gb|EEE79540.1|
            predicted protein [Populus trichocarpa]
          Length = 229

 Score =  211 bits (538), Expect = 4e-52
 Identities = 121/235 (51%), Positives = 145/235 (61%)
 Frame = -1

Query: 1699 MASSFLQGALLQKSNFLGHTILSNSTHKNCSLTFPKQTPSVHFSPVAKFNLFEMLGGRGL 1520
            MASS    + L +SNFLG     N  HK  SL  PK    +     AK +LFE+LGGRGL
Sbjct: 1    MASSITLQSTLLRSNFLGQNNCFNHPHKPYSL-IPKDH-RLKLKTCAKLDLFEILGGRGL 58

Query: 1519 CNGEAGIQEELKKNVSVETSKTKVEISEIPEVSTAAGASIPAKGFEKELQGLTGGFPGGE 1340
            CNGE G+Q+ELK+N+  + S T           +   +S+P   FEKEL GLTGGFPGGE
Sbjct: 59   CNGEKGVQQELKRNIEEQASSTA---GREENSGSLEKSSVPDDAFEKELMGLTGGFPGGE 115

Query: 1339 KGLLKFIQENXXXXXXXXPETPLSLGFNQSLVXXXXXXXXXXXXPGMIVIVKNPNSPYYM 1160
            KGL +FI+EN        P+         ++             PGMI IVKNPN+P+YM
Sbjct: 116  KGLKRFIEENPSPKKQSVPKL--------TITSRPKPPELPLLLPGMIAIVKNPNNPFYM 167

Query: 1159 YCGNVQRITDGKAGVLFEGGVWDRLITFRLEELERREKGPPMVNPRSAILETMLE 995
            Y G VQRITDGKAGV+FEGG WD+L+TFRLEELERREKGPP  NPRSAI+E   E
Sbjct: 168  YTGIVQRITDGKAGVIFEGGNWDKLVTFRLEELERREKGPPGKNPRSAIIEAFYE 222


>ref|XP_002297973.1| predicted protein [Populus trichocarpa] gi|222845231|gb|EEE82778.1|
            predicted protein [Populus trichocarpa]
          Length = 225

 Score =  211 bits (537), Expect = 5e-52
 Identities = 123/233 (52%), Positives = 147/233 (63%), Gaps = 2/233 (0%)
 Frame = -1

Query: 1699 MASSFLQGALLQKSNFLGHTILSNSTHKNCSLTFPKQTPSVHFSPVAKFNLFEMLGGRGL 1520
            MASS    + L +S+FLG     N  HK  SL  PK+   +     AKF+ FE+LGGRGL
Sbjct: 1    MASSITLQSTLLRSSFLGQNNFPNHPHKPYSL-IPKEH-RLKIKTCAKFDPFEILGGRGL 58

Query: 1519 CNGEAGIQEELKKNVSVETSKT--KVEISEIPEVSTAAGASIPAKGFEKELQGLTGGFPG 1346
            CNGE G+Q+EL++N+  E      + E S   E+S     S+P  GFEKEL GLTGGFPG
Sbjct: 59   CNGEKGVQQELQRNIEEEAPPAAGEEEYSGNLEIS-----SVPEDGFEKELMGLTGGFPG 113

Query: 1345 GEKGLLKFIQENXXXXXXXXPETPLSLGFNQSLVXXXXXXXXXXXXPGMIVIVKNPNSPY 1166
            GEKGL KFI+EN         +         ++             PGMI IVKNPN+P+
Sbjct: 114  GEKGLEKFIEENPPPKKQPAAKL--------TITNKPKPPELPLLLPGMIAIVKNPNNPF 165

Query: 1165 YMYCGNVQRITDGKAGVLFEGGVWDRLITFRLEELERREKGPPMVNPRSAILE 1007
            YMY G VQRITDGKAGV+FEGG WDRL+TFRLEELERREKGPP  NPRSAI+E
Sbjct: 166  YMYTGIVQRITDGKAGVIFEGGNWDRLVTFRLEELERREKGPPGKNPRSAIIE 218


>ref|NP_194120.1| uncharacterized protein [Arabidopsis thaliana]
            gi|4972093|emb|CAB43889.1| putative protein [Arabidopsis
            thaliana] gi|7269238|emb|CAB81307.1| putative protein
            [Arabidopsis thaliana] gi|21592640|gb|AAM64589.1| unknown
            [Arabidopsis thaliana] gi|22135940|gb|AAM91552.1|
            putative protein [Arabidopsis thaliana]
            gi|23197588|gb|AAN15321.1| putative protein [Arabidopsis
            thaliana] gi|332659420|gb|AEE84820.1| uncharacterized
            protein [Arabidopsis thaliana]
          Length = 250

 Score =  209 bits (532), Expect = 2e-51
 Identities = 115/230 (50%), Positives = 145/230 (63%), Gaps = 5/230 (2%)
 Frame = -1

Query: 1669 LQKSNFLGHTILSNSTHKNCSLTFPKQTPSVHFSPVAKFNLFEMLGGRGLCNGEAGIQEE 1490
            + +S FLG T   ++ +++      +Q+       + KFNL+E++GGRGLCNGE GI++E
Sbjct: 15   IHRSKFLGQTHQFSTVNRSVFPPPKQQSKLYQVKAMGKFNLWEVMGGRGLCNGEKGIEKE 74

Query: 1489 LKKNVSVETSKTKVEISEIPEVSTAAGAS--IPAKGFEKELQGLTGGFPGGEKGLLKFIQ 1316
            L++N+  E   +K E +E    S  +  S  +P  GFEKE+ GLTGGFPGGEKGL  FI+
Sbjct: 75   LQRNIEDEQETSKAENNETERESDDSNLSFKVPEDGFEKEMMGLTGGFPGGEKGLKTFIE 134

Query: 1315 ENXXXXXXXXPETPLSLGFNQSLVXXXXXXXXXXXXP---GMIVIVKNPNSPYYMYCGNV 1145
            +N           P   G + S V                GMI IVKN NSPY+MYCG V
Sbjct: 135  KNPPPPPPPP---PAKQGSDASAVATDKKPKAPKLPLLMPGMIAIVKNQNSPYHMYCGIV 191

Query: 1144 QRITDGKAGVLFEGGVWDRLITFRLEELERREKGPPMVNPRSAILETMLE 995
            QRITDGKAGVLFEGG WDRLITFRLEELERREKGPP  NP+S ILE ++E
Sbjct: 192  QRITDGKAGVLFEGGNWDRLITFRLEELERREKGPPGKNPKSCILEPLIE 241


Top