BLASTX nr result
ID: Angelica22_contig00004862
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00004862 (1413 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI40787.3| unnamed protein product [Vitis vinifera] 270 9e-70 ref|XP_003530323.1| PREDICTED: uncharacterized protein LOC100820... 239 1e-60 ref|XP_002510745.1| conserved hypothetical protein [Ricinus comm... 229 1e-57 ref|XP_003556620.1| PREDICTED: uncharacterized protein LOC100798... 227 7e-57 ref|XP_002308370.1| predicted protein [Populus trichocarpa] gi|2... 226 1e-56 >emb|CBI40787.3| unnamed protein product [Vitis vinifera] Length = 1477 Score = 270 bits (689), Expect = 9e-70 Identities = 169/386 (43%), Positives = 225/386 (58%), Gaps = 20/386 (5%) Frame = +2 Query: 20 KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199 K+ DE++K+ ELQARF+AA+DIRQ+AY L L++++ EK KYF + Sbjct: 1085 KKYYDENEKLNELQARFKAADDIRQEAYTHLQSLRKKLSEKNKYFRMYKDNLKAANDYAS 1144 Query: 200 NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379 GD EAL LC +V+T M+LWN NDEFR++YVRCN +STLRRL+TLDGRSLGPDEE V Sbjct: 1145 AGDKEALQRLCVNEVETIMELWNNNDEFRKEYVRCNTRSTLRRLRTLDGRSLGPDEEPPV 1204 Query: 380 FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLV-MTEPKSKMLKS 556 P ++ ER R L P++ ++ + +++E + P E D KS+V +T K++ K+ Sbjct: 1205 IPNFLNERIGRSLFAPTKDSSVLIVSTVEREKQMVPATAESADDKSVVNVTNQKNRTAKN 1264 Query: 557 KISVNPIPESGLHIGLR---QLEVEETKEVAKQKTEEELELARKDEILRKEEIDAKLKEQ 727 K NP + + + E+EETKE KQ EEE ELARK E LRKEE AKLKEQ Sbjct: 1265 K---NPTKSATGAVSATISGRDEIEETKEEHKQTKEEE-ELARKAEELRKEEEAAKLKEQ 1320 Query: 728 LRQEEKVKAQEALERKKRNADKAQVRALXXXXXXXXXXXXXXXXXXXXXXXXTTDGENGL 907 R EEK KA+EALERKKRNA+KAQ RA + G Sbjct: 1321 RRLEEKAKAKEALERKKRNAEKAQARAELRAQKEAEQKQREREKKARKKERRKSSSAEGT 1380 Query: 908 E--------------LQTNHVKE--ESKDSPTTKPIKTLHFNRYNKTKATIPPALRNRGK 1039 E +T E E + T KP K+ F + K+K+ IPP LR+RGK Sbjct: 1381 EGCNEAESAPSSETSFETTLDSEIIEKPRAITKKPHKSSQFTKQPKSKS-IPPPLRSRGK 1439 Query: 1040 RRLKQFMWWIFGALIVLFIFLVGNSG 1117 RR++ +MW + AL+VL +FL+GNSG Sbjct: 1440 RRIQSWMWVVLIALLVLALFLLGNSG 1465 >ref|XP_003530323.1| PREDICTED: uncharacterized protein LOC100820077 [Glycine max] Length = 1296 Score = 239 bits (610), Expect = 1e-60 Identities = 159/389 (40%), Positives = 205/389 (52%), Gaps = 19/389 (4%) Frame = +2 Query: 20 KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199 K+ NDE K+ EL ARFRAA+D RQ+AY L+ LK+Q+HEK K FW Sbjct: 907 KKYNDECDKLNELLARFRAADDTRQEAYAKLLALKKQLHEKSKNFWEYRDAATKAQELAA 966 Query: 200 NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379 G E L C +V+ M+LWNKNDEFR DYVRCN +STLRRL+TLDGRSLGPDEE V Sbjct: 967 GGKKEELQCFCVDEVERIMELWNKNDEFRRDYVRCNTRSTLRRLQTLDGRSLGPDEEPLV 1026 Query: 380 FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLVMTEPKSKMLKSK 559 P + ER S+ + S TT K V+ EP D K + S+ K+K Sbjct: 1027 MPNAITERASKNIPMVSNTTMEQEK---KSPRESVNVKDEP-DSKVVAQRTETSQTTKAK 1082 Query: 560 ISVNPIPESGLHIGLRQLEVEETKEVAKQ-----KTEEELELARKDEILRKEEIDAKLKE 724 P P H+ E +E ++ K +T+EE EL K E RKEE +AKLKE Sbjct: 1083 KPTKPAPLEK-HVARWGDESDEDEDKDKNEEEPVRTKEEEELILKAEKARKEEEEAKLKE 1141 Query: 725 QLRQEEKVKAQEALERKKRNADKAQVRA---------LXXXXXXXXXXXXXXXXXXXXXX 877 + R EE KA+EAL+RKKRNA+KAQ RA L Sbjct: 1142 KRRLEEIEKAKEALQRKKRNAEKAQQRAALKAQKEAELKEKEREKRAKKKERRKTSSAVT 1201 Query: 878 XXTTDGENGLELQTNHVKEESK--DSP---TTKPIKTLHFNRYNKTKATIPPALRNRGKR 1042 T+ E+ +T EES + P T KP K F R K K+ +P ALRNR KR Sbjct: 1202 AENTEQESAHTTETLTSVEESDLTEKPAEVTKKPQKPSQFTRQTKVKS-VPAALRNRAKR 1260 Query: 1043 RLKQFMWWIFGALIVLFIFLVGNSGAFKS 1129 R++ +MW + ++V+ +F VGNS + +S Sbjct: 1261 RIQPWMWVLIAVVVVVALFYVGNSSSLRS 1289 >ref|XP_002510745.1| conserved hypothetical protein [Ricinus communis] gi|223551446|gb|EEF52932.1| conserved hypothetical protein [Ricinus communis] Length = 1553 Score = 229 bits (585), Expect = 1e-57 Identities = 151/385 (39%), Positives = 210/385 (54%), Gaps = 13/385 (3%) Frame = +2 Query: 20 KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199 K+ +E K+ EL RFRAA+DIRQ+A+ L L++++++K K F+ Sbjct: 1177 KKYQEEKAKLGELIGRFRAADDIRQEAFAHLQSLRKRLYDKHKNFYKYKEDAKAASDLAS 1236 Query: 200 NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379 GD L + C QV+ M+LWN NDEFR+DY+RCN++ST+RRL+TLDGRSLGPDEE V Sbjct: 1237 KGDQGELQYHCVNQVERVMELWNNNDEFRKDYIRCNLRSTVRRLRTLDGRSLGPDEEPPV 1296 Query: 380 FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLVMTEPKSKMLKSK 559 P +V ER +R+ PS +T +++E + P E E D KS+ + K+ KSK Sbjct: 1297 IPNFVSERFARRNVVPSIST-------LQEEKIIAPTETENKDDKSI--AKVKNPTAKSK 1347 Query: 560 ISVNPIPESGLHIGLRQLEVEETKEVAKQKTEEELELARKDEILRKEEIDAKLKEQLRQE 739 + + ++E+EE + T+EE ELARK E LRKEE A LKE+ E Sbjct: 1348 KPAKHALGNSMATVSNRVEIEEEGVEEHKLTKEEEELARKAEELRKEEEAATLKERQLLE 1407 Query: 740 EKVKAQEALERKKRNADKAQ----VRALXXXXXXXXXXXXXXXXXXXXXXXXTTDGEN-G 904 K KA EALERKKR+A+KAQ VRA +G N G Sbjct: 1408 AKTKANEALERKKRSANKAQARAEVRARKEAEQKEKEKEKRARKKEKRRALEAANGSNEG 1467 Query: 905 LELQTNHVKEESKDSPT-TKPI-------KTLHFNRYNKTKATIPPALRNRGKRRLKQFM 1060 ++ ++K+S T KP+ K LHF + K K PP LRNRGKRR++ +M Sbjct: 1468 ESAPSSETPTDTKESETIEKPVALRKRSQKPLHFAKQTKPKIK-PPPLRNRGKRRMQTWM 1526 Query: 1061 WWIFGALIVLFIFLVGNSGAFKSLR 1135 W + I+ +FL+GN G+F R Sbjct: 1527 WVLLTITIIFALFLIGN-GSFSLQR 1550 >ref|XP_003556620.1| PREDICTED: uncharacterized protein LOC100798700 [Glycine max] Length = 1501 Score = 227 bits (578), Expect = 7e-57 Identities = 155/391 (39%), Positives = 206/391 (52%), Gaps = 21/391 (5%) Frame = +2 Query: 20 KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199 K+ NDE K+ EL ARFRAA+D RQ+AY L+ LK+Q+HEK K FW Sbjct: 1113 KKYNDECDKLNELLARFRAADDSRQEAYAKLLALKKQLHEKSKNFWEYRDAANKAQELAA 1172 Query: 200 NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379 G E L C QV+ M+LWNKND FR DYVRCN +STLRRL+TLDGRSLGPDEE V Sbjct: 1173 GGKKEELQCFCVDQVERIMELWNKNDGFRRDYVRCNTRSTLRRLQTLDGRSLGPDEEPPV 1232 Query: 380 FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVER-----EPIDGKSLVMTEPKSK 544 P + ER S+ + ++T ++QE TP E EP+ K +V S+ Sbjct: 1233 IPNVITERASKNIPMVLQST-------LEQEKKSTPTESVNVKDEPVS-KVVVQRTETSQ 1284 Query: 545 MLKSKISVNPIP-ESGLHIGLRQLEVEETKEVAKQKTEEELELARKDEILRKEEIDAKLK 721 K+K P P E + + + +E K+ +T+EE EL K E R EE +AKLK Sbjct: 1285 TTKAKKPTKPAPLEKHVARWGDESDEDEVKKEEPVRTKEEEELILKAEKARMEEEEAKLK 1344 Query: 722 EQLRQEEKVKAQEALERKKRNADKAQVRA---------LXXXXXXXXXXXXXXXXXXXXX 874 E+ R EE KA+EAL RKKRNA+KAQ RA L Sbjct: 1345 EKRRLEEIEKAKEALLRKKRNAEKAQQRAALKAQKEAELKEKEREKRAKKKERRKAGSAV 1404 Query: 875 XXXTTDGENGL--ELQTNHVKE----ESKDSPTTKPIKTLHFNRYNKTKATIPPALRNRG 1036 T+ E+ E T V+E E T KP KT F R K K+ +P ALRNRG Sbjct: 1405 TAENTEQESAPIPETLTRSVEEFEQTEKTAEVTKKPQKTSQFTRQTKVKS-VPAALRNRG 1463 Query: 1037 KRRLKQFMWWIFGALIVLFIFLVGNSGAFKS 1129 KRR++ ++ + ++ + +F VG++ + +S Sbjct: 1464 KRRIQPWVCVLIALVVAVALFYVGHNCSLRS 1494 >ref|XP_002308370.1| predicted protein [Populus trichocarpa] gi|222854346|gb|EEE91893.1| predicted protein [Populus trichocarpa] Length = 485 Score = 226 bits (576), Expect = 1e-56 Identities = 148/401 (36%), Positives = 220/401 (54%), Gaps = 33/401 (8%) Frame = +2 Query: 20 KECNDESKKIRELQARFRAANDIRQDAYKDLIGLKRQMHEKGKYFWIXXXXXXXXXXXXL 199 K+ NDE +KI +L + RAANDIRQ+A+ L L++Q++EK K+F+ L Sbjct: 82 KKYNDEHEKINQLLFQHRAANDIRQEAFAHLQSLRKQLYEKSKFFYKYKDDLTAATNLAL 141 Query: 200 NGDNEALYHLCAKQVDTFMDLWNKNDEFREDYVRCNMKSTLRRLKTLDGRSLGPDEEVHV 379 GD E L CA QV+ M+LWN NDEFR++Y+ NM++TLRRL+TLDGR+LGPDE+ + Sbjct: 142 KGDKEELQRHCANQVERVMELWNNNDEFRKEYMSSNMRNTLRRLRTLDGRALGPDEQPPI 201 Query: 380 FPVYVGERESRQLNNPSRTTNPSSPTIMKQENTVTPVEREPIDGKSLV-MTEPKSKMLKS 556 P V +R ++ PS+P ++ E VTPVE + ID KS + + K++ +K+ Sbjct: 202 IPNVVSQRATK------HNVAPSAPA-LEVEKPVTPVETQRIDEKSTAKLGDKKNQTVKT 254 Query: 557 KISVNPIP-ESGLHIGLRQLEVEETK---------EVAKQK---------------TEEE 661 K P E+GL + ++EE++ E ++Q+ T+EE Sbjct: 255 KRQAKPASLENGLPTVSGRDQIEESRQEENKLPKEEESRQENKLTKEEESRQENKLTKEE 314 Query: 662 LELARKDEILRKEEIDAKLKEQLRQEEKVKAQEALERKKRNADKAQVRALXXXXXXXXXX 841 +ELARK E LRKE+ A LKEQ R EEK KA+EA+ERKKRNA+KAQ RA Sbjct: 315 VELARKIEELRKEKEAAMLKEQRRLEEKAKAKEAMERKKRNAEKAQARASLRAQREAEQK 374 Query: 842 XXXXXXXXXXXXXXTTDGENGLELQ------TNHVKEESKDSPTTKPIKTLHFNRYNKTK 1003 E+ ++ ++ E+ +S T+ T+ +TK Sbjct: 375 EKEKEKKAKKKEKRKAAAEDTKDIDEVESAPSSETPTETNESERTEKPVTVAKRPQKQTK 434 Query: 1004 A-TIPPALRNRGKRRLKQFMWWIFGALIVLFIFLVGNSGAF 1123 A ++P LRN+GKR+++ +MW + L V+ +F +GNS F Sbjct: 435 AKSMPLPLRNKGKRKMQTWMWALITLLAVVALFFMGNSSFF 475