BLASTX nr result

ID: Angelica22_contig00004863 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00004863
         (1435 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI40787.3| unnamed protein product [Vitis vinifera]              239   1e-60
ref|XP_003530323.1| PREDICTED: uncharacterized protein LOC100820...   223   7e-56
ref|XP_003556620.1| PREDICTED: uncharacterized protein LOC100798...   213   1e-52
ref|XP_002510745.1| conserved hypothetical protein [Ricinus comm...   209   2e-51
ref|XP_004145608.1| PREDICTED: uncharacterized protein LOC101219...   204   5e-50

>emb|CBI40787.3| unnamed protein product [Vitis vinifera]
          Length = 1477

 Score =  239 bits (610), Expect = 1e-60
 Identities = 158/392 (40%), Positives = 214/392 (54%), Gaps = 18/392 (4%)
 Frame = -2

Query: 1416 KKYNDESKKLRELQARFRAANDIRQDAYKELLGLKKQLHEKGKHFWIXXXXXXXXXXXAL 1237
            KKY DE++KL ELQARF+AA+DIRQ+AY  L  L+K+L EK K+F +           A 
Sbjct: 1085 KKYYDENEKLNELQARFKAADDIRQEAYTHLQSLRKKLSEKNKYFRMYKDNLKAANDYAS 1144

Query: 1236 NGNKEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMRSTLRRLKTLDGRSLGPDEEVHV 1057
             G+KEAL  LC  +V+T M+LWN NDEFR++YVRCN RSTLRRL+TLDGRSLGPDEE  V
Sbjct: 1145 AGDKEALQRLCVNEVETIMELWNNNDEFRKEYVRCNTRSTLRRLRTLDGRSLGPDEEPPV 1204

Query: 1056 FPVYVGERESRQLNDPSRTTNPSSPTILKQGNTVQPVEREPVDGKSLV-MAEPKSKMSKS 880
             P ++ ER  R L  P++ ++    + +++   + P   E  D KS+V +   K++ +K+
Sbjct: 1205 IPNFLNERIGRSLFAPTKDSSVLIVSTVEREKQMVPATAESADDKSVVNVTNQKNRTAKN 1264

Query: 879  KTSVNPILESGLHVGSRQFXXXXXXXXXXXXXXXXXLARKDEMLRKEEIDAKLKEQLRQE 700
            K        +     S +                  LARK E LRKEE  AKLKEQ R E
Sbjct: 1265 KNPTKSATGAVSATISGRDEIEETKEEHKQTKEEEELARKAEELRKEEEAAKLKEQRRLE 1324

Query: 699  EKVKAQEALKRKQRNADKAQMRAVXXXXXXXXXXXXXXXXXXXXXXXKTTDGENGLE--- 529
            EK KA+EAL+RK+RNA+KAQ RA                        + +    G E   
Sbjct: 1325 EKAKAKEALERKKRNAEKAQARAELRAQKEAEQKQREREKKARKKERRKSSSAEGTEGCN 1384

Query: 528  -----------LQTNHIKDESKDSP---TTKPSKTSHFNRYNKTKATIPPALRNRGKRRL 391
                        +T  +  E  + P   T KP K+S F +  K+K+ IPP LR+RGKRR+
Sbjct: 1385 EAESAPSSETSFETT-LDSEIIEKPRAITKKPHKSSQFTKQPKSKS-IPPPLRSRGKRRI 1442

Query: 390  KQFMWWIFGGLMIVLFILLVVNGGASKNLRSR 295
            + +MW +   L +VL + L+ N G S  L  R
Sbjct: 1443 QSWMWVVLIAL-LVLALFLLGNSGFSYGLGLR 1473


>ref|XP_003530323.1| PREDICTED: uncharacterized protein LOC100820077 [Glycine max]
          Length = 1296

 Score =  223 bits (569), Expect = 7e-56
 Identities = 153/394 (38%), Positives = 197/394 (50%), Gaps = 21/394 (5%)
 Frame = -2

Query: 1416 KKYNDESKKLRELQARFRAANDIRQDAYKELLGLKKQLHEKGKHFWIXXXXXXXXXXXAL 1237
            KKYNDE  KL EL ARFRAA+D RQ+AY +LL LKKQLHEK K+FW            A 
Sbjct: 907  KKYNDECDKLNELLARFRAADDTRQEAYAKLLALKKQLHEKSKNFWEYRDAATKAQELAA 966

Query: 1236 NGNKEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMRSTLRRLKTLDGRSLGPDEEVHV 1057
             G KE L   C  +V+  M+LWNKNDEFR DYVRCN RSTLRRL+TLDGRSLGPDEE  V
Sbjct: 967  GGKKEELQCFCVDEVERIMELWNKNDEFRRDYVRCNTRSTLRRLQTLDGRSLGPDEEPLV 1026

Query: 1056 FPVYVGERESRQLNDPSRTTNPSSPTILKQGNTVQPVEREPVDGKSLVMAEPKSKMSKSK 877
             P  + ER S+ +   S TT        ++   V    ++  D K +      S+ +K+K
Sbjct: 1027 MPNAITERASKNIPMVSNTTMEQEKKSPRESVNV----KDEPDSKVVAQRTETSQTTKAK 1082

Query: 876  TSVNPI-LESGLHVGSRQFXXXXXXXXXXXXXXXXXLARKDEML------RKEEIDAKLK 718
                P  LE   HV                         ++E++      RKEE +AKLK
Sbjct: 1083 KPTKPAPLEK--HVARWGDESDEDEDKDKNEEEPVRTKEEEELILKAEKARKEEEEAKLK 1140

Query: 717  EQLRQEEKVKAQEALKRKQRNADKAQMRAVXXXXXXXXXXXXXXXXXXXXXXXKTTDG-- 544
            E+ R EE  KA+EAL+RK+RNA+KAQ RA                        + T    
Sbjct: 1141 EKRRLEEIEKAKEALQRKKRNAEKAQQRAALKAQKEAELKEKEREKRAKKKERRKTSSAV 1200

Query: 543  -ENGLELQTNHIKD-----------ESKDSPTTKPSKTSHFNRYNKTKATIPPALRNRGK 400
                 E ++ H  +           E     T KP K S F R  K K ++P ALRNR K
Sbjct: 1201 TAENTEQESAHTTETLTSVEESDLTEKPAEVTKKPQKPSQFTRQTKVK-SVPAALRNRAK 1259

Query: 399  RRLKQFMWWIFGGLMIVLFILLVVNGGASKNLRS 298
            RR++ +MW +   +++V    +    G S +LRS
Sbjct: 1260 RRIQPWMWVLIAVVVVVALFYV----GNSSSLRS 1289


>ref|XP_003556620.1| PREDICTED: uncharacterized protein LOC100798700 [Glycine max]
          Length = 1501

 Score =  213 bits (541), Expect = 1e-52
 Identities = 156/386 (40%), Positives = 198/386 (51%), Gaps = 21/386 (5%)
 Frame = -2

Query: 1416 KKYNDESKKLRELQARFRAANDIRQDAYKELLGLKKQLHEKGKHFWIXXXXXXXXXXXAL 1237
            KKYNDE  KL EL ARFRAA+D RQ+AY +LL LKKQLHEK K+FW            A 
Sbjct: 1113 KKYNDECDKLNELLARFRAADDSRQEAYAKLLALKKQLHEKSKNFWEYRDAANKAQELAA 1172

Query: 1236 NGNKEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMRSTLRRLKTLDGRSLGPDEEVHV 1057
             G KE L   C  QV+  M+LWNKND FR DYVRCN RSTLRRL+TLDGRSLGPDEE  V
Sbjct: 1173 GGKKEELQCFCVDQVERIMELWNKNDGFRRDYVRCNTRSTLRRLQTLDGRSLGPDEEPPV 1232

Query: 1056 FPVYVGERESRQLNDPSRTT----NPSSPTILKQGNTVQPVEREPVDGKSLVMAEPKSKM 889
             P  + ER S+ +    ++T      S+PT      +V  V+ EPV  K +V     S+ 
Sbjct: 1233 IPNVITERASKNIPMVLQSTLEQEKKSTPT-----ESVN-VKDEPV-SKVVVQRTETSQT 1285

Query: 888  SKSK--TSVNPILESGLHVGSRQFXXXXXXXXXXXXXXXXXLARKDEMLRKEEIDAKLKE 715
            +K+K  T   P+ +     G                     L  K E  R EE +AKLKE
Sbjct: 1286 TKAKKPTKPAPLEKHVARWGDESDEDEVKKEEPVRTKEEEELILKAEKARMEEEEAKLKE 1345

Query: 714  QLRQEEKVKAQEALKRKQRNADKAQMRA---------VXXXXXXXXXXXXXXXXXXXXXX 562
            + R EE  KA+EAL RK+RNA+KAQ RA         +                      
Sbjct: 1346 KRRLEEIEKAKEALLRKKRNAEKAQQRAALKAQKEAELKEKEREKRAKKKERRKAGSAVT 1405

Query: 561  XKTTDGENG--LELQTNHIKD----ESKDSPTTKPSKTSHFNRYNKTKATIPPALRNRGK 400
             + T+ E+    E  T  +++    E     T KP KTS F R  K K ++P ALRNRGK
Sbjct: 1406 AENTEQESAPIPETLTRSVEEFEQTEKTAEVTKKPQKTSQFTRQTKVK-SVPAALRNRGK 1464

Query: 399  RRLKQFMWWIFGGLMIVLFILLVVNG 322
            RR++    W+   + +V+ + L   G
Sbjct: 1465 RRIQP---WVCVLIALVVAVALFYVG 1487


>ref|XP_002510745.1| conserved hypothetical protein [Ricinus communis]
            gi|223551446|gb|EEF52932.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1553

 Score =  209 bits (531), Expect = 2e-51
 Identities = 146/382 (38%), Positives = 201/382 (52%), Gaps = 14/382 (3%)
 Frame = -2

Query: 1416 KKYNDESKKLRELQARFRAANDIRQDAYKELLGLKKQLHEKGKHFWIXXXXXXXXXXXAL 1237
            KKY +E  KL EL  RFRAA+DIRQ+A+  L  L+K+L++K K+F+            A 
Sbjct: 1177 KKYQEEKAKLGELIGRFRAADDIRQEAFAHLQSLRKRLYDKHKNFYKYKEDAKAASDLAS 1236

Query: 1236 NGNKEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMRSTLRRLKTLDGRSLGPDEEVHV 1057
             G++  L + C  QV+  M+LWN NDEFR+DY+RCN+RST+RRL+TLDGRSLGPDEE  V
Sbjct: 1237 KGDQGELQYHCVNQVERVMELWNNNDEFRKDYIRCNLRSTVRRLRTLDGRSLGPDEEPPV 1296

Query: 1056 FPVYVGERESRQLNDPSRTTNPSSPTILKQGNTVQPVEREPVDGKSLVMAEPKSKMSKSK 877
             P +V ER +R+   PS +T       L++   + P E E  D KS+  A+ K+  +KSK
Sbjct: 1297 IPNFVSERFARRNVVPSIST-------LQEEKIIAPTETENKDDKSI--AKVKNPTAKSK 1347

Query: 876  TSVNPILESGLH-VGSRQFXXXXXXXXXXXXXXXXXLARKDEMLRKEEIDAKLKEQLRQE 700
                  L + +  V +R                   LARK E LRKEE  A LKE+   E
Sbjct: 1348 KPAKHALGNSMATVSNRVEIEEEGVEEHKLTKEEEELARKAEELRKEEEAATLKERQLLE 1407

Query: 699  EKVKAQEALKRKQRNADKAQMRA-VXXXXXXXXXXXXXXXXXXXXXXXKTTDGENGL--- 532
             K KA EAL+RK+R+A+KAQ RA V                       +  +  NG    
Sbjct: 1408 AKTKANEALERKKRSANKAQARAEVRARKEAEQKEKEKEKRARKKEKRRALEAANGSNEG 1467

Query: 531  ------ELQTNHIKDESKDSPTT---KPSKTSHFNRYNKTKATIPPALRNRGKRRLKQFM 379
                  E  T+  + E+ + P     +  K  HF +  K K   PP LRNRGKRR++ +M
Sbjct: 1468 ESAPSSETPTDTKESETIEKPVALRKRSQKPLHFAKQTKPKIK-PPPLRNRGKRRMQTWM 1526

Query: 378  WWIFGGLMIVLFILLVVNGGAS 313
             W+   + I+  + L+ NG  S
Sbjct: 1527 -WVLLTITIIFALFLIGNGSFS 1547


>ref|XP_004145608.1| PREDICTED: uncharacterized protein LOC101219495 [Cucumis sativus]
          Length = 1463

 Score =  204 bits (519), Expect = 5e-50
 Identities = 145/388 (37%), Positives = 197/388 (50%), Gaps = 25/388 (6%)
 Frame = -2

Query: 1416 KKYNDESKKLRELQARFRAANDIRQDAYKELLGLKKQLHEKGKHFWIXXXXXXXXXXXAL 1237
            KKYNDES KL ELQ++F+AA+ IRQ+AY  L  ++KQL+EK K+ W            A 
Sbjct: 1085 KKYNDESIKLDELQSQFKAADKIRQEAYANLQSMRKQLYEKNKYCWKYRDDAKEASEIAS 1144

Query: 1236 NGNKEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMRSTLRRLKTLDGRSLGPDEEVHV 1057
            + + E + H C  QV+  M+LWN N EFRE+Y++ NMRST+RRLKTLDGRSLGP+EE HV
Sbjct: 1145 SRDIEKVQHFCVNQVERMMELWNTNAEFREEYIKSNMRSTVRRLKTLDGRSLGPNEEPHV 1204

Query: 1056 FPVYVGERESR--QLNDPSRTTNPSSPTILKQGNTVQPVEREPVDGKSLVMAEPKSKMSK 883
              + V E  +R   L+  S T     P      +  +P  +         +AE K++M+K
Sbjct: 1205 LNLIVKEGSARDNSLSTVSTTEESGKPISAYDASDNKPETK---------VAEEKNQMTK 1255

Query: 882  SKTSVNPILESGLHVGSR------QFXXXXXXXXXXXXXXXXXLARKDEMLRKEEIDAKL 721
             K    P+   GL    R      +                  LA K E LRKEE   KL
Sbjct: 1256 KK----PVTVVGLVTAPRNISRENEVEEPPRPEEIKRTREEEELAAKVEELRKEEEAMKL 1311

Query: 720  KEQLRQEEKVKAQEALKRKQRNADKAQMRAVXXXXXXXXXXXXXXXXXXXXXXXKTT--- 550
            KEQ + EE+ KA+EAL+RK+RNA+KAQ RAV                       K     
Sbjct: 1312 KEQRKLEERAKAKEALERKKRNAEKAQARAVIKARKEAEEREKLREKRAKKKERKMAAET 1371

Query: 549  ---------DGENGLELQTNHIKDESKDS-----PTTKPSKTSHFNRYNKTKATIPPALR 412
                     D     E  +   K+ES+++        KP K   + + +KTK +IPP LR
Sbjct: 1372 EAGNDWDERDSALVTETPSETQKEESENTGKPGMAAKKPQKALQYTKQSKTK-SIPPPLR 1430

Query: 411  NRGKRRLKQFMWWIFGGLMIVLFILLVV 328
            NRGKRR++ +MW +     +V+F L  V
Sbjct: 1431 NRGKRRMQPWMWVLLS--TVVVFALFFV 1456


Top