BLASTX nr result

ID: Angelica22_contig00004862 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00004862
         (1413 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI40787.3| unnamed protein product [Vitis vinifera]              270   9e-70
ref|XP_003530323.1| PREDICTED: uncharacterized protein LOC100820...   239   1e-60
ref|XP_002510745.1| conserved hypothetical protein [Ricinus comm...   229   1e-57
ref|XP_003556620.1| PREDICTED: uncharacterized protein LOC100798...   227   7e-57
ref|XP_002308370.1| predicted protein [Populus trichocarpa] gi|2...   226   1e-56

>emb|CBI40787.3| unnamed protein product [Vitis vinifera]
          Length = 1477

 Score =  270 bits (689), Expect = 9e-70
 Identities = 169/386 (43%), Positives = 225/386 (58%), Gaps = 20/386 (5%)
 Frame = +2

Query: 20   KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199
            K+  DE++K+ ELQARF+AA+DIRQ+AY  L  L++++ EK KYF +             
Sbjct: 1085 KKYYDENEKLNELQARFKAADDIRQEAYTHLQSLRKKLSEKNKYFRMYKDNLKAANDYAS 1144

Query: 200  NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379
             GD EAL  LC  +V+T M+LWN NDEFR++YVRCN +STLRRL+TLDGRSLGPDEE  V
Sbjct: 1145 AGDKEALQRLCVNEVETIMELWNNNDEFRKEYVRCNTRSTLRRLRTLDGRSLGPDEEPPV 1204

Query: 380  FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLV-MTEPKSKMLKS 556
             P ++ ER  R L  P++ ++    + +++E  + P   E  D KS+V +T  K++  K+
Sbjct: 1205 IPNFLNERIGRSLFAPTKDSSVLIVSTVEREKQMVPATAESADDKSVVNVTNQKNRTAKN 1264

Query: 557  KISVNPIPESGLHIGLR---QLEVEETKEVAKQKTEEELELARKDEILRKEEIDAKLKEQ 727
            K   NP   +   +      + E+EETKE  KQ  EEE ELARK E LRKEE  AKLKEQ
Sbjct: 1265 K---NPTKSATGAVSATISGRDEIEETKEEHKQTKEEE-ELARKAEELRKEEEAAKLKEQ 1320

Query: 728  LRQEEKVKAQEALERKKRNADKAQVRALXXXXXXXXXXXXXXXXXXXXXXXXTTDGENGL 907
             R EEK KA+EALERKKRNA+KAQ RA                          +    G 
Sbjct: 1321 RRLEEKAKAKEALERKKRNAEKAQARAELRAQKEAEQKQREREKKARKKERRKSSSAEGT 1380

Query: 908  E--------------LQTNHVKE--ESKDSPTTKPIKTLHFNRYNKTKATIPPALRNRGK 1039
            E               +T    E  E   + T KP K+  F +  K+K+ IPP LR+RGK
Sbjct: 1381 EGCNEAESAPSSETSFETTLDSEIIEKPRAITKKPHKSSQFTKQPKSKS-IPPPLRSRGK 1439

Query: 1040 RRLKQFMWWIFGALIVLFIFLVGNSG 1117
            RR++ +MW +  AL+VL +FL+GNSG
Sbjct: 1440 RRIQSWMWVVLIALLVLALFLLGNSG 1465


>ref|XP_003530323.1| PREDICTED: uncharacterized protein LOC100820077 [Glycine max]
          Length = 1296

 Score =  239 bits (610), Expect = 1e-60
 Identities = 159/389 (40%), Positives = 205/389 (52%), Gaps = 19/389 (4%)
 Frame = +2

Query: 20   KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199
            K+ NDE  K+ EL ARFRAA+D RQ+AY  L+ LK+Q+HEK K FW              
Sbjct: 907  KKYNDECDKLNELLARFRAADDTRQEAYAKLLALKKQLHEKSKNFWEYRDAATKAQELAA 966

Query: 200  NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379
             G  E L   C  +V+  M+LWNKNDEFR DYVRCN +STLRRL+TLDGRSLGPDEE  V
Sbjct: 967  GGKKEELQCFCVDEVERIMELWNKNDEFRRDYVRCNTRSTLRRLQTLDGRSLGPDEEPLV 1026

Query: 380  FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLVMTEPKSKMLKSK 559
             P  + ER S+ +   S TT        K       V+ EP D K +      S+  K+K
Sbjct: 1027 MPNAITERASKNIPMVSNTTMEQEK---KSPRESVNVKDEP-DSKVVAQRTETSQTTKAK 1082

Query: 560  ISVNPIPESGLHIGLRQLEVEETKEVAKQ-----KTEEELELARKDEILRKEEIDAKLKE 724
                P P    H+     E +E ++  K      +T+EE EL  K E  RKEE +AKLKE
Sbjct: 1083 KPTKPAPLEK-HVARWGDESDEDEDKDKNEEEPVRTKEEEELILKAEKARKEEEEAKLKE 1141

Query: 725  QLRQEEKVKAQEALERKKRNADKAQVRA---------LXXXXXXXXXXXXXXXXXXXXXX 877
            + R EE  KA+EAL+RKKRNA+KAQ RA         L                      
Sbjct: 1142 KRRLEEIEKAKEALQRKKRNAEKAQQRAALKAQKEAELKEKEREKRAKKKERRKTSSAVT 1201

Query: 878  XXTTDGENGLELQTNHVKEESK--DSP---TTKPIKTLHFNRYNKTKATIPPALRNRGKR 1042
               T+ E+    +T    EES   + P   T KP K   F R  K K+ +P ALRNR KR
Sbjct: 1202 AENTEQESAHTTETLTSVEESDLTEKPAEVTKKPQKPSQFTRQTKVKS-VPAALRNRAKR 1260

Query: 1043 RLKQFMWWIFGALIVLFIFLVGNSGAFKS 1129
            R++ +MW +   ++V+ +F VGNS + +S
Sbjct: 1261 RIQPWMWVLIAVVVVVALFYVGNSSSLRS 1289


>ref|XP_002510745.1| conserved hypothetical protein [Ricinus communis]
            gi|223551446|gb|EEF52932.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1553

 Score =  229 bits (585), Expect = 1e-57
 Identities = 151/385 (39%), Positives = 210/385 (54%), Gaps = 13/385 (3%)
 Frame = +2

Query: 20   KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199
            K+  +E  K+ EL  RFRAA+DIRQ+A+  L  L++++++K K F+              
Sbjct: 1177 KKYQEEKAKLGELIGRFRAADDIRQEAFAHLQSLRKRLYDKHKNFYKYKEDAKAASDLAS 1236

Query: 200  NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379
             GD   L + C  QV+  M+LWN NDEFR+DY+RCN++ST+RRL+TLDGRSLGPDEE  V
Sbjct: 1237 KGDQGELQYHCVNQVERVMELWNNNDEFRKDYIRCNLRSTVRRLRTLDGRSLGPDEEPPV 1296

Query: 380  FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLVMTEPKSKMLKSK 559
             P +V ER +R+   PS +T       +++E  + P E E  D KS+   + K+   KSK
Sbjct: 1297 IPNFVSERFARRNVVPSIST-------LQEEKIIAPTETENKDDKSI--AKVKNPTAKSK 1347

Query: 560  ISVNPIPESGLHIGLRQLEVEETKEVAKQKTEEELELARKDEILRKEEIDAKLKEQLRQE 739
                    + +     ++E+EE      + T+EE ELARK E LRKEE  A LKE+   E
Sbjct: 1348 KPAKHALGNSMATVSNRVEIEEEGVEEHKLTKEEEELARKAEELRKEEEAATLKERQLLE 1407

Query: 740  EKVKAQEALERKKRNADKAQ----VRALXXXXXXXXXXXXXXXXXXXXXXXXTTDGEN-G 904
             K KA EALERKKR+A+KAQ    VRA                           +G N G
Sbjct: 1408 AKTKANEALERKKRSANKAQARAEVRARKEAEQKEKEKEKRARKKEKRRALEAANGSNEG 1467

Query: 905  LELQTNHVKEESKDSPT-TKPI-------KTLHFNRYNKTKATIPPALRNRGKRRLKQFM 1060
                ++    ++K+S T  KP+       K LHF +  K K   PP LRNRGKRR++ +M
Sbjct: 1468 ESAPSSETPTDTKESETIEKPVALRKRSQKPLHFAKQTKPKIK-PPPLRNRGKRRMQTWM 1526

Query: 1061 WWIFGALIVLFIFLVGNSGAFKSLR 1135
            W +    I+  +FL+GN G+F   R
Sbjct: 1527 WVLLTITIIFALFLIGN-GSFSLQR 1550


>ref|XP_003556620.1| PREDICTED: uncharacterized protein LOC100798700 [Glycine max]
          Length = 1501

 Score =  227 bits (578), Expect = 7e-57
 Identities = 155/391 (39%), Positives = 206/391 (52%), Gaps = 21/391 (5%)
 Frame = +2

Query: 20   KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199
            K+ NDE  K+ EL ARFRAA+D RQ+AY  L+ LK+Q+HEK K FW              
Sbjct: 1113 KKYNDECDKLNELLARFRAADDSRQEAYAKLLALKKQLHEKSKNFWEYRDAANKAQELAA 1172

Query: 200  NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379
             G  E L   C  QV+  M+LWNKND FR DYVRCN +STLRRL+TLDGRSLGPDEE  V
Sbjct: 1173 GGKKEELQCFCVDQVERIMELWNKNDGFRRDYVRCNTRSTLRRLQTLDGRSLGPDEEPPV 1232

Query: 380  FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVER-----EPIDGKSLVMTEPKSK 544
             P  + ER S+ +    ++T       ++QE   TP E      EP+  K +V     S+
Sbjct: 1233 IPNVITERASKNIPMVLQST-------LEQEKKSTPTESVNVKDEPVS-KVVVQRTETSQ 1284

Query: 545  MLKSKISVNPIP-ESGLHIGLRQLEVEETKEVAKQKTEEELELARKDEILRKEEIDAKLK 721
              K+K    P P E  +     + + +E K+    +T+EE EL  K E  R EE +AKLK
Sbjct: 1285 TTKAKKPTKPAPLEKHVARWGDESDEDEVKKEEPVRTKEEEELILKAEKARMEEEEAKLK 1344

Query: 722  EQLRQEEKVKAQEALERKKRNADKAQVRA---------LXXXXXXXXXXXXXXXXXXXXX 874
            E+ R EE  KA+EAL RKKRNA+KAQ RA         L                     
Sbjct: 1345 EKRRLEEIEKAKEALLRKKRNAEKAQQRAALKAQKEAELKEKEREKRAKKKERRKAGSAV 1404

Query: 875  XXXTTDGENGL--ELQTNHVKE----ESKDSPTTKPIKTLHFNRYNKTKATIPPALRNRG 1036
                T+ E+    E  T  V+E    E     T KP KT  F R  K K+ +P ALRNRG
Sbjct: 1405 TAENTEQESAPIPETLTRSVEEFEQTEKTAEVTKKPQKTSQFTRQTKVKS-VPAALRNRG 1463

Query: 1037 KRRLKQFMWWIFGALIVLFIFLVGNSGAFKS 1129
            KRR++ ++  +   ++ + +F VG++ + +S
Sbjct: 1464 KRRIQPWVCVLIALVVAVALFYVGHNCSLRS 1494


>ref|XP_002308370.1| predicted protein [Populus trichocarpa] gi|222854346|gb|EEE91893.1|
            predicted protein [Populus trichocarpa]
          Length = 485

 Score =  226 bits (576), Expect = 1e-56
 Identities = 148/401 (36%), Positives = 220/401 (54%), Gaps = 33/401 (8%)
 Frame = +2

Query: 20   KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199
            K+ NDE +KI +L  + RAANDIRQ+A+  L  L++Q++EK K+F+             L
Sbjct: 82   KKYNDEHEKINQLLFQHRAANDIRQEAFAHLQSLRKQLYEKSKFFYKYKDDLTAATNLAL 141

Query: 200  NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379
             GD E L   CA QV+  M+LWN NDEFR++Y+  NM++TLRRL+TLDGR+LGPDE+  +
Sbjct: 142  KGDKEELQRHCANQVERVMELWNNNDEFRKEYMSSNMRNTLRRLRTLDGRALGPDEQPPI 201

Query: 380  FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLV-MTEPKSKMLKS 556
             P  V +R ++          PS+P  ++ E  VTPVE + ID KS   + + K++ +K+
Sbjct: 202  IPNVVSQRATK------HNVAPSAPA-LEVEKPVTPVETQRIDEKSTAKLGDKKNQTVKT 254

Query: 557  KISVNPIP-ESGLHIGLRQLEVEETK---------EVAKQK---------------TEEE 661
            K    P   E+GL     + ++EE++         E ++Q+               T+EE
Sbjct: 255  KRQAKPASLENGLPTVSGRDQIEESRQEENKLPKEEESRQENKLTKEEESRQENKLTKEE 314

Query: 662  LELARKDEILRKEEIDAKLKEQLRQEEKVKAQEALERKKRNADKAQVRALXXXXXXXXXX 841
            +ELARK E LRKE+  A LKEQ R EEK KA+EA+ERKKRNA+KAQ RA           
Sbjct: 315  VELARKIEELRKEKEAAMLKEQRRLEEKAKAKEAMERKKRNAEKAQARASLRAQREAEQK 374

Query: 842  XXXXXXXXXXXXXXTTDGENGLELQ------TNHVKEESKDSPTTKPIKTLHFNRYNKTK 1003
                              E+  ++       ++    E+ +S  T+   T+      +TK
Sbjct: 375  EKEKEKKAKKKEKRKAAAEDTKDIDEVESAPSSETPTETNESERTEKPVTVAKRPQKQTK 434

Query: 1004 A-TIPPALRNRGKRRLKQFMWWIFGALIVLFIFLVGNSGAF 1123
            A ++P  LRN+GKR+++ +MW +   L V+ +F +GNS  F
Sbjct: 435  AKSMPLPLRNKGKRKMQTWMWALITLLAVVALFFMGNSSFF 475


Top