BLASTX nr result
ID: Angelica22_contig00014165
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00014165 (1296 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI18955.3| unnamed protein product [Vitis vinifera] 236 1e-59 ref|XP_002520293.1| DNA binding protein, putative [Ricinus commu... 216 8e-54 ref|XP_002302212.1| predicted protein [Populus trichocarpa] gi|2... 216 1e-53 ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810... 208 2e-51 ref|XP_003527955.1| PREDICTED: uncharacterized protein LOC100795... 207 5e-51 >emb|CBI18955.3| unnamed protein product [Vitis vinifera] Length = 795 Score = 236 bits (601), Expect = 1e-59 Identities = 135/349 (38%), Positives = 195/349 (55%), Gaps = 36/349 (10%) Frame = +1 Query: 352 ASRVSLVRMDSHDSRNLSIDSSVKDCRKIFLEQMYQSLK-TEGGLQDCIQDALLSHPEGD 528 + R + +D H N DS + R I L+QMY+SL +EGG++ C++ ALLS PE D Sbjct: 288 SKRAVIPMIDHHAIVNGLDDSPQQHWRNIVLDQMYRSLSDSEGGIRGCVRAALLSCPEVD 347 Query: 529 FTSTKRESLHFGEGI------------NSFKDNVGVMLNGSRNNSSPSTNTDCCKRALFN 672 T+T ++ +HF + + ++ + +VGV NGS + S T T+ C+R+ F Sbjct: 348 HTTTIKKPVHFHKDVRCPPHTGLLPNESASRSHVGVTSNGSLSESDHHTITELCRRSFFK 407 Query: 673 VLTSAKFAELCDLLHGNFQGLKMSSLFDINTIQSRIKEGAYESSPLLFHHDIQQVWSKLQ 852 ++ S KFA LC L+ NFQG+K+ + FD + I SR+ EGAYE SP+LF D+QQVW KLQ Sbjct: 408 LIMSEKFASLCKLMLENFQGIKVDNFFDFSLIHSRMIEGAYERSPMLFSSDVQQVWKKLQ 467 Query: 853 NVGTEMXXXXXXXXXXXXXXXXD--EGFVL-------------ESSAHPKVELTEGSCQL 987 +GTE+ + EG VL ES +H K+E Sbjct: 468 RIGTEIVSLGTTLSEMSRTSYSELVEGAVLSASEDGKNEVCTRESDSHTKLEQLVACGVF 527 Query: 988 RASTCRQCKEKAESQNCLVCDYCEDSYHILCIRPALQEISLKSWYCTSCTAKGIGSPHEN 1167 + +CR C EKA+ ++CLVCD CE+ YHI C+ PA++ I KSWYC C A + PHEN Sbjct: 528 KVCSCRHCGEKADGRDCLVCDSCEEVYHISCVEPAVKVIPHKSWYCVDCIASRL--PHEN 585 Query: 1168 CLLCESKNVPRSLSTG--------DDEEELELEKCSNDIEEDVIQNRAE 1290 C++C+ N R+L G ++E ++ELE+ SN I E IQ + E Sbjct: 586 CVVCKKLNAQRTLINGVGDDIISMNEETDMELEESSNCITEVGIQQQKE 634 >ref|XP_002520293.1| DNA binding protein, putative [Ricinus communis] gi|223540512|gb|EEF42079.1| DNA binding protein, putative [Ricinus communis] Length = 510 Score = 216 bits (551), Expect = 8e-54 Identities = 125/319 (39%), Positives = 174/319 (54%), Gaps = 16/319 (5%) Frame = +1 Query: 379 DSHDSRNLSIDSSVKDCRKIFLEQMYQSLK-TEGGLQDCIQDA-LLSHPEGDFTSTKRE- 549 DSH S + S D K+ R LE +YQSL G+Q CIQD +++ + D R Sbjct: 29 DSHASLDGSNDVLHKESRNFVLENIYQSLTDNHDGIQGCIQDTHMMTIKDSDAADKDRNT 88 Query: 550 -SLHFGEGIN----SFKDNVGVMLNGSRNNSSPSTNTDCCKRALFNVLTSAKFAELCDLL 714 S G N + + N+ V LN S ++S S T+ C+ A N++ S KF+ LC LL Sbjct: 89 WSSQLGWMPNGTHYAARGNIDVTLNKSLDDSQRSV-TEMCQHAFANIIISEKFSLLCKLL 147 Query: 715 HGNFQGLKMSSLFDINTIQSRIKEGAYESSPLLFHHDIQQVWSKLQNVGTEMXXXXXXXX 894 NFQ +K + ++ I+ ++K+G YE SP+LF+ DIQ+VW KLQ +G E+ Sbjct: 148 SENFQEMKPDNFLSLSRIKIKMKDGVYERSPMLFYEDIQRVWKKLQGIGNELISLAKSLS 207 Query: 895 XXXXXXXXDEGFVLESSAHPKVELTEGSCQLRASTCRQCKEKAESQNCLVCDYCEDSYHI 1074 ++ ES H K E E TCR+C KA+ +NCLVCD CE+ YH+ Sbjct: 208 DVSSTSYDEQFHPQESHFHGKPEQIEACGAYSVCTCRRCGGKADGRNCLVCDSCEEMYHV 267 Query: 1075 LCIRPALQEISLKSWYCTSCTAKGIGSPHENCLLCESKNVPRSLSTGDDEEE-------- 1230 CI P ++EI KSWYC SC+A G+GSPHENC +CE N PR+L T +E+ Sbjct: 268 SCIEPVVKEIPSKSWYCASCSAAGMGSPHENCAVCERLNAPRNLCTQASDEKGSPTIENG 327 Query: 1231 LELEKCSNDIEEDVIQNRA 1287 E E+ SN IE+ Q+ A Sbjct: 328 SEFEEASNHIEDGFHQSPA 346 >ref|XP_002302212.1| predicted protein [Populus trichocarpa] gi|222843938|gb|EEE81485.1| predicted protein [Populus trichocarpa] Length = 604 Score = 216 bits (550), Expect = 1e-53 Identities = 144/427 (33%), Positives = 213/427 (49%), Gaps = 27/427 (6%) Frame = +1 Query: 67 MVGEEENVEIGSDMVLDNVGMNNVSVTETENKATGDVDKSSAGDCVQKYSRKKFRRISRS 246 MVGEE +G+ + + + + + +N D ++S+G + + K RR +RS Sbjct: 1 MVGEEG---MGNGEGTEEI-VQPLKIEAMDNGFGNDGVEASSGSS-EGFRTYKRRRNTRS 55 Query: 247 CSMGSVLQDSRVSTNITTQITDKNLKEPPNVCLPDASRVSLVRMDSHDSRNLSIDSSVKD 426 G QD + +++ D+ +K L ++H S N S D S + Sbjct: 56 SLDGKGQQDGKSFMEAASRLADQTIKNDSQDHL----------RENHASLNHSSDVSQRQ 105 Query: 427 CRKIFLEQMYQSLKT-EGGLQDCIQDALLS----------HPEGDFTSTKRESLHFGEGI 573 RK L+ MYQS E G+Q CI+DAL+ + G+ + +S G Sbjct: 106 WRKFVLDYMYQSSSNDEHGIQRCIRDALMMAVKIYAAIKLNESGNCNADWHKSPSMGRMA 165 Query: 574 N----SFKDNVGVMLNGSRNNSSPSTNTDCCKRALFNVLTSAKFAELCDLLHGNFQGLKM 741 N + K +VGV+ NG+ S + TD C+ A N L S KF LC LL NF+G+ Sbjct: 166 NGTHSTAKGHVGVISNGTLEESQHHSVTDLCQHAFLNTLLSEKFTSLCKLLFENFKGMTT 225 Query: 742 SSLFDINTIQSRIKEGAYESSPLLFHHDIQQVWSKLQNVGTEMXXXXXXXXXXXXXXXXD 921 S+ +N I R+KEGAY+ P+LF DI+Q W KLQ G E+ + Sbjct: 226 DSILSLNFIDKRMKEGAYDRLPVLFCEDIEQFWRKLQGFGAELISLAKSLSNISKTCYNE 285 Query: 922 E--GFV---------LESSAHPKVELTEGSCQLRASTCRQCKEKAESQNCLVCDYCEDSY 1068 + G V +S++H K E T+ R +CR+C EKA+ ++CLVCD CE+ Y Sbjct: 286 QVGGLVDCTFEDKKHEDSNSHGKPEQTDACYVYRVCSCRRCGEKADGRDCLVCDSCEEMY 345 Query: 1069 HILCIRPALQEISLKSWYCTSCTAKGIGSPHENCLLCESKNVPR-SLSTGDDEEELELEK 1245 H+ CI PA++EI KSWYC +CT G+GSPH+NC+ CE + R + DDE L ++ Sbjct: 346 HVSCIVPAVREIPPKSWYCHNCTTSGMGSPHKNCVACERLSCCRIQNNQADDEIGLSTQE 405 Query: 1246 CSNDIEE 1266 ND EE Sbjct: 406 PFNDFEE 412 >ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810450 [Glycine max] Length = 832 Score = 208 bits (530), Expect = 2e-51 Identities = 128/393 (32%), Positives = 204/393 (51%), Gaps = 19/393 (4%) Frame = +1 Query: 148 ETENKATGDVDKSSAG--DCVQKYSRKKFRRISRSCSMGSVLQDSRVSTNITTQITDKNL 321 ET N + D+ ++G +C Q Y R+K ++S S V ++SR +Q+ + + Sbjct: 267 ETVNNVVANADEGNSGAVECFQTYKRRKH---AKSSSEFKVQENSRKHMGAASQLLVQAV 323 Query: 322 KEPPNVCLPDASRVSLVRMDSHDSRNLSIDSSVKDCRKIFLEQMYQSLKTE-GGLQDCIQ 498 K+P ++ + + S+ D S + L+ +Y SL + GG++ CI+ Sbjct: 324 KKPFDLAVGNTSK----------------DHSHDHWGNVVLKHLYHSLGNDNGGMKWCIR 367 Query: 499 DALLSHPEGDFTSTKRESLHF---GEGINSFKDNV------------GVMLNGSRNNSSP 633 +AL+S P+ T +E+L G+ + +++ V+ NG + S+ Sbjct: 368 EALMSCPKISCAPTMKETLKIVKDGQECSPQLESLFYRLQSEANGHENVVHNGFSSESNG 427 Query: 634 STNTDCCKRALFNVLTSAKFAELCDLLHGNFQGLKMSSLFDINTIQSRIKEGAYESSPLL 813 T+ C+R ++L S KF+ LC +L NFQG K ++FD + I SR+K AYE SP L Sbjct: 428 RDTTEGCQRVFRDILASEKFSSLCKVLLENFQGTKPETVFDFSLINSRMKGQAYEQSPTL 487 Query: 814 FHHDIQQVWSKLQNVGTEMXXXXXXXXXXXXXXXXDEGFVLESSAHPKVELTEGSCQLRA 993 F D+QQVW KLQ+ G ++ ++ ES +H K E T R Sbjct: 488 FLSDVQQVWRKLQSTGNQIVAMARSLSNMSKASFCEQLCNQESISHMKPEQTVECVAFRL 547 Query: 994 STCRQCKEKAESQNCLVCDYCEDSYHILCIRPALQEISLKSWYCTSCTAKGIGSPHENCL 1173 TC C +KA+ +CLVCD CE+ YH+ CI PA++EI KSW+C +CTA GIG H+NC+ Sbjct: 548 GTCWHCGDKADGTDCLVCDSCEEMYHLSCIEPAVKEIPYKSWFCANCTANGIGCRHKNCV 607 Query: 1174 LCESKNVPRSLSTGDDEEELEL-EKCSNDIEED 1269 +CE N ++L EE + E+ N++EE+ Sbjct: 608 VCERLNALKTLDDIVGEENIPTNEETLNELEEN 640 >ref|XP_003527955.1| PREDICTED: uncharacterized protein LOC100795906 [Glycine max] Length = 646 Score = 207 bits (527), Expect = 5e-51 Identities = 140/423 (33%), Positives = 199/423 (47%), Gaps = 29/423 (6%) Frame = +1 Query: 88 VEIGSDMVLDNVGMNNVSVTETENKATGDVDKSSAGDCVQKYSRKKFRRISRSCSMGSVL 267 V I + + G S T A D S +C+Q Y R+K +S S G V Sbjct: 55 VAIADENGVAEEGRIGKSETFCNRVAVADKGDSGGVECLQTYKRRK-----KSSSKGEVQ 109 Query: 268 QDSRVSTNITTQITDKNLKEPPNVCLPDASRVSLVRMDSHDSRNLSIDSSVKDCRKIFLE 447 + R + +T I D+++ +P +V L N S D S I L+ Sbjct: 110 EQCRKNVETSTHIADQDVTKPCDVALC----------------NTSDDCSHGQWGNIVLK 153 Query: 448 QMYQSLKT-EGGLQDCIQDALLSHPEGDFTSTKRESLHFGEG---------------INS 579 +YQSL GG++ CI++AL+ +P+ + T+T E+ + Sbjct: 154 HLYQSLGDGNGGIEGCIREALIHYPKHNHTTTVMETFKIDKDGQECSLQFEPLSHRTEKE 213 Query: 580 FKDNVGVMLNGSRNNSSPSTNTDCCKRALFNVLTSAKFAELCDLLHGNFQGLKMSSLFDI 759 + VM NG + S T+ C+R L NVLTS KF+ LC L NFQG+K S+ D Sbjct: 214 ANGHADVMCNGGSSESPDHGVTEMCQRVLCNVLTSEKFSSLCKALLENFQGMKPESVLDF 273 Query: 760 NTIQSRIKEGAYESSPLLFHHDIQQVWSKLQNVGTEMXXXXXXXXXXXXXXXXD------ 921 + SR+KE AYE SP LF DIQQVW KLQ+ G E+ + Sbjct: 274 TVMNSRMKEQAYEQSPTLFLSDIQQVWRKLQDAGNEIVALAKSLSNMSRTSYSELVGIPA 333 Query: 922 -----EGFVLESSAHPKVELTEGSCQLRASTCRQCKEKAESQNCLVCDYCEDSYHILCIR 1086 + +E K E T+ + +C+ C EKA+ +CLVCD CE+ YH+ CI Sbjct: 334 QSTFQDEKQVEFDCCMKPEQTQACAMYKICSCKCCGEKADDTDCLVCDSCEEIYHVSCIE 393 Query: 1087 PALQE-ISLKSWYCTSCTAKGIGSPHENCLLCESKNVPRSL-STGDDEEELELEKCSNDI 1260 PA++E I KSWYC +CTA I S HENC+LCE N ++L D +E+ N+ Sbjct: 394 PAVKEIIPHKSWYCANCTANVIESLHENCVLCERLNDAKTLDDVIGDGSFPTIEETQNEF 453 Query: 1261 EED 1269 EE+ Sbjct: 454 EEN 456