BLASTX nr result
ID: Angelica23_contig00011503
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00011503 (2266 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI18955.3| unnamed protein product [Vitis vinifera] 409 e-111 ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810... 395 e-107 ref|XP_003543969.1| PREDICTED: uncharacterized protein LOC100786... 394 e-107 ref|XP_002520293.1| DNA binding protein, putative [Ricinus commu... 388 e-105 ref|XP_002302212.1| predicted protein [Populus trichocarpa] gi|2... 388 e-105 >emb|CBI18955.3| unnamed protein product [Vitis vinifera] Length = 795 Score = 409 bits (1051), Expect = e-111 Identities = 225/543 (41%), Positives = 325/543 (59%), Gaps = 33/543 (6%) Frame = +3 Query: 300 DSQVTNKVSSFFGLLQSIKEPDLRSSCKQVSHVRMDSHTSTNVSHECSVKDWRKVVLEQM 479 D VT K S GLL + K+ +D H N + + WR +VL+QM Sbjct: 270 DQLVTVKQSMDVGLLNNFS--------KRAVIPMIDHHAIVNGLDDSPQQHWRNIVLDQM 321 Query: 480 HQSLSETEGGLQDCIRDALVFNHERSCTSSIKE---FRFGERM--HSGAY---NAFKDHV 635 ++SLS++EGG++ C+R AL+ E T++IK+ F R H+G +A + HV Sbjct: 322 YRSLSDSEGGIRGCVRAALLSCPEVDHTTTIKKPVHFHKDVRCPPHTGLLPNESASRSHV 381 Query: 636 GATSNVSTNESNSCLDSDMWKQALFSVLISPKFAELCDLICGNFHGMNVSSLFNLGLINS 815 G TSN S +ES+ +++ +++ F +++S KFA LC L+ NF G+ V + F+ LI+S Sbjct: 382 GVTSNGSLSESDHHTITELCRRSFFKLIMSEKFASLCKLMLENFQGIKVDNFFDFSLIHS 441 Query: 816 RINEGAYKNSPVLFHSDIQQVWTKLRKVGTEMVTVAMSLSETSKASC------------- 956 R+ EGAY+ SP+LF SD+QQVW KL+++GTE+V++ +LSE S+ S Sbjct: 442 RMIEGAYERSPMLFSSDVQQVWKKLQRIGTEIVSLGTTLSEMSRTSYSELVEGAVLSASE 501 Query: 957 --QNTFYMQETNGLSNVEQREGC-CNRACTCRHCEGKTEGRDCLVCENCKDIYHVSCIEP 1127 +N +E++ + +EQ C + C+CRHC K +GRDCLVC++C+++YH+SC+EP Sbjct: 502 DGKNEVCTRESDSHTKLEQLVACGVFKVCSCRHCGEKADGRDCLVCDSCEEVYHISCVEP 561 Query: 1128 -VQEASRKSWHCAKCRSNGIGPRHEFCVICESINATRSSCTGV--DGLAANGK---EVEE 1289 V+ KSW+C C ++ + HE CV+C+ +NA R+ GV D ++ N + E+EE Sbjct: 562 AVKVIPHKSWYCVDCIASRLP--HENCVVCKKLNAQRTLINGVGDDIISMNEETDMELEE 619 Query: 1290 SFDDLEQDGLVNDGAS--LPCCKICKIDIGVEDFRI-CGHPFCPNKYYHTRCLTVEQLNT 1460 S + + + G+ + CKIC D+ + + CGHPFCPNKYYH CLT +L Sbjct: 620 SSNCITEVGIQQQKETKYFQLCKICGSDMEFGEHLLECGHPFCPNKYYHKSCLTSTELRM 679 Query: 1461 YGSCWYCPSCLCRGCLTDRDDDKIVLCDGCDQAFHIYCMQPPRTSIPKGKWFCTICEEGI 1640 YG CWYCPSCLCR CLTDRDD+KI+LCDGCD A+HIYCM PPRTSIP+GKWFC C+ I Sbjct: 680 YGPCWYCPSCLCRACLTDRDDEKIILCDGCDHAYHIYCMNPPRTSIPRGKWFCRKCDADI 739 Query: 1641 QLIRKAKRAHANSECKMIKKVERGNGTYENGPVGVSKEMEGEVDTSGGVDMLLTAAQTLN 1820 Q IRKAK + E + +K E+ E GP +D+LL AAQTLN Sbjct: 740 QKIRKAKMVFEDLERERKQKGEQVIDKDEEGP----------------MDILLNAAQTLN 783 Query: 1821 NED 1829 ++ Sbjct: 784 LQE 786 >ref|XP_003545667.1| PREDICTED: uncharacterized protein LOC100810450 [Glycine max] Length = 832 Score = 395 bits (1015), Expect = e-107 Identities = 225/561 (40%), Positives = 326/561 (58%), Gaps = 27/561 (4%) Frame = +3 Query: 228 TYRRRKRAKMNSD--PEEKLLIAWNSDSQVTNKVSSFFGLLQSIKEP-DLRSSCKQVSHV 398 TY+RRK AK +S+ +E + SQ+ L+Q++K+P DL Sbjct: 289 TYKRRKHAKSSSEFKVQENSRKHMGAASQL---------LVQAVKKPFDLAVG------- 332 Query: 399 RMDSHTSTNVSHECSVKDWRKVVLEQMHQSLSETEGGLQDCIRDALVFNHERSCTSSIKE 578 +TS + SH+ W VVL+ ++ SL GG++ CIR+AL+ + SC ++KE Sbjct: 333 ----NTSKDHSHD----HWGNVVLKHLYHSLGNDNGGMKWCIREALMSCPKISCAPTMKE 384 Query: 579 ----FRFGERMHSGAYNAF-------KDHVGATSNVSTNESNSCLDSDMWKQALFSVLIS 725 + G+ + F H N ++ESN ++ ++ +L S Sbjct: 385 TLKIVKDGQECSPQLESLFYRLQSEANGHENVVHNGFSSESNGRDTTEGCQRVFRDILAS 444 Query: 726 PKFAELCDLICGNFHGMNVSSLFNLGLINSRINEGAYKNSPVLFHSDIQQVWTKLRKVGT 905 KF+ LC ++ NF G ++F+ LINSR+ AY+ SP LF SD+QQVW KL+ G Sbjct: 445 EKFSSLCKVLLENFQGTKPETVFDFSLINSRMKGQAYEQSPTLFLSDVQQVWRKLQSTGN 504 Query: 906 EMVTVAMSLSETSKASCQNTFYMQETNGLSNVEQREGCCN-RACTCRHCEGKTEGRDCLV 1082 ++V +A SLS SKAS QE+ EQ C R TC HC K +G DCLV Sbjct: 505 QIVAMARSLSNMSKASFCEQLCNQESISHMKPEQTVECVAFRLGTCWHCGDKADGTDCLV 564 Query: 1083 CENCKDIYHVSCIEP-VQEASRKSWHCAKCRSNGIGPRHEFCVICESINA--TRSSCTGV 1253 C++C+++YH+SCIEP V+E KSW CA C +NGIG RH+ CV+CE +NA T G Sbjct: 565 CDSCEEMYHLSCIEPAVKEIPYKSWFCANCTANGIGCRHKNCVVCERLNALKTLDDIVGE 624 Query: 1254 DGLAANGKEVEESFDDLEQDG--------LVNDGASLPCCKICKIDIGVEDFRICGHPFC 1409 + + N EE+ ++LE++ + D + CKICK+ + E +ICGH FC Sbjct: 625 ENIPTN----EETLNELEENSNCTYDGIQISTDRRNSSDCKICKMAVDGEKVKICGHSFC 680 Query: 1410 PNKYYHTRCLTVEQLNTYGSCWYCPSCLCRGCLTDRDDDKIVLCDGCDQAFHIYCMQPPR 1589 P+KYYH CL+ +QL +YG CWYCPSC+C+ CLTD+DD+KIVLCD CD A+H+YCM+PP+ Sbjct: 681 PSKYYHVSCLSSKQLKSYGHCWYCPSCICQVCLTDKDDNKIVLCDACDHAYHVYCMKPPQ 740 Query: 1590 TSIPKGKWFCTICEEGIQLIRKAKRAHANSECKMIKKVERGNGTYENGPVGVSKEMEGEV 1769 SIPKGKWFC CE GIQ IR+A++A+ +++ K+ + + N E+ +K+ E+ Sbjct: 741 NSIPKGKWFCIKCEAGIQAIRQARKAYESNKGKVGQNDSKPN---EDIDKKWNKKRGREL 797 Query: 1770 DTSGG-VDMLLTAAQTLNNED 1829 D GG +DML+TAA TLN+E+ Sbjct: 798 DNVGGMMDMLITAANTLNSEE 818 >ref|XP_003543969.1| PREDICTED: uncharacterized protein LOC100786712 [Glycine max] Length = 525 Score = 394 bits (1011), Expect = e-107 Identities = 211/497 (42%), Positives = 297/497 (59%), Gaps = 28/497 (5%) Frame = +3 Query: 423 NVSHECSVKDWRKVVLEQMHQSLSETEGGLQDCIRDALVFNHERSCTSS--------IKE 578 N S + S W VVL+Q++ SL GG++ CIR+AL+ + + SC ++ + Sbjct: 22 NTSKDHSHDHWGNVVLKQLYHSLGNDNGGMEWCIREALMSHPKISCATTMTVGSAETLNI 81 Query: 579 FRFGERMHSGAYNAF-------KDHVGATSNVSTNESNSCLDSDMWKQALFSVLISPKFA 737 + G+ + F H +N ++ESN + ++ +L S KF+ Sbjct: 82 VKDGQECSPQLESLFYRLQSEANGHENVVNNGFSSESNGHGATGRCQRVFRDILASEKFS 141 Query: 738 ELCDLICGNFHGMNVSSLFNLGLINSRINEGAYKNSPVLFHSDIQQVWTKLRKVGTEMVT 917 LC ++ NF GM ++F+ LINSR+ AY+ SP LF SD QQVW KL+ G ++V Sbjct: 142 SLCKVLLENFRGMKPETVFDFSLINSRMKGQAYEQSPTLFLSDFQQVWRKLQNTGNQIVA 201 Query: 918 VAMSLSETSKASCQNTFYMQETNGLSNVEQREGCCN-RACTCRHCEGKTEGRDCLVCENC 1094 +A SLS SKAS QE+ EQ C + C HC K +G DCLVC++C Sbjct: 202 MARSLSNMSKASFCEQLCNQESISHMKPEQTVECVAFKVGNCWHCGDKADGIDCLVCDSC 261 Query: 1095 KDIYHVSCIEP-VQEASRKSWHCAKCRSNGIGPRHEFCVICESINA--TRSSCTGVDGLA 1265 +++YH+SCIEP V+E RKSW CA C +NGIG RH+ CV+CE +N T G + Sbjct: 262 EEMYHLSCIEPAVKEIPRKSWFCANCTANGIGCRHKNCVVCEQLNVLKTLDDFVGEENFP 321 Query: 1266 ANGKEVEESFDDLEQ------DGLV--NDGASLPCCKICKIDIGVEDFRICGHPFCPNKY 1421 N EE+ ++LE+ DG+ DG + CKICK+ + E +ICGH FCP+KY Sbjct: 322 TN----EETLNELEEYSNCTYDGIQVSTDGRNSSNCKICKMAVDGEKVKICGHSFCPSKY 377 Query: 1422 YHTRCLTVEQLNTYGSCWYCPSCLCRGCLTDRDDDKIVLCDGCDQAFHIYCMQPPRTSIP 1601 YH RCL+ +QL +YG+CWYCPSC+C+ CLTD+DDDKIVLCDGCD A+HIYCM+PP+ SIP Sbjct: 378 YHVRCLSSKQLKSYGNCWYCPSCICQVCLTDKDDDKIVLCDGCDHAYHIYCMKPPQNSIP 437 Query: 1602 KGKWFCTICEEGIQLIRKAKRAHANSECKMIKKVERGNGTYENGPVGVSKEMEGEVDTSG 1781 KGKWFC CE GIQ IR+A++A+ + + K+ + + N E+ +K+ E D G Sbjct: 438 KGKWFCIKCEAGIQAIRQARKAYESKKGKVGQNDSKPN---EDIDKKWNKKRGRESDKVG 494 Query: 1782 G-VDMLLTAAQTLNNED 1829 G +DML+ AA TLN+E+ Sbjct: 495 GMMDMLINAANTLNSEE 511 >ref|XP_002520293.1| DNA binding protein, putative [Ricinus communis] gi|223540512|gb|EEF42079.1| DNA binding protein, putative [Ricinus communis] Length = 510 Score = 388 bits (997), Expect = e-105 Identities = 204/495 (41%), Positives = 297/495 (60%), Gaps = 14/495 (2%) Frame = +3 Query: 375 SCKQVSHVRMDSHTSTNVSHECSVKDWRKVVLEQMHQSLSETEGGLQDCIRDALVFNHER 554 + K+ + DSH S + S++ K+ R VLE ++QSL++ G+Q CI+D + + Sbjct: 19 TAKEAPYGIPDSHASLDGSNDVLHKESRNFVLENIYQSLTDNHDGIQGCIQDTHMMTIKD 78 Query: 555 SCTSSIKEFRFGER---MHSGAYNAFKDHVGATSNVSTNESNSCLDSDMWKQALFSVLIS 725 S + + + M +G + A + ++ T N S ++S + ++M + A +++IS Sbjct: 79 SDAADKDRNTWSSQLGWMPNGTHYAARGNIDVTLNKSLDDSQRSV-TEMCQHAFANIIIS 137 Query: 726 PKFAELCDLICGNFHGMNVSSLFNLGLINSRINEGAYKNSPVLFHSDIQQVWTKLRKVGT 905 KF+ LC L+ NF M + +L I ++ +G Y+ SP+LF+ DIQ+VW KL+ +G Sbjct: 138 EKFSLLCKLLSENFQEMKPDNFLSLSRIKIKMKDGVYERSPMLFYEDIQRVWKKLQGIGN 197 Query: 906 EMVTVAMSLSETSKASCQNTFYMQETNGLSNVEQREGC-CNRACTCRHCEGKTEGRDCLV 1082 E++++A SLS+ S S F+ QE++ EQ E C CTCR C GK +GR+CLV Sbjct: 198 ELISLAKSLSDVSSTSYDEQFHPQESHFHGKPEQIEACGAYSVCTCRRCGGKADGRNCLV 257 Query: 1083 CENCKDIYHVSCIEPV-QEASRKSWHCAKCRSNGIGPRHEFCVICESINATRSSCTGVDG 1259 C++C+++YHVSCIEPV +E KSW+CA C + G+G HE C +CE +NA R+ CT Sbjct: 258 CDSCEEMYHVSCIEPVVKEIPSKSWYCASCSAAGMGSPHENCAVCERLNAPRNLCTQASD 317 Query: 1260 -----LAANGKEVEESFDDLEQDGLVND---GASLPCCKICKIDI-GVEDFRICGHPFCP 1412 NG E EE+ + +E DG G ++ CK+C ++ E +IC H CP Sbjct: 318 EKGSPTIENGSEFEEASNHIE-DGFHQSPAGGKNVCFCKMCGSEVENGEKVKICEHILCP 376 Query: 1413 NKYYHTRCLTVEQLNTYGSCWYCPSCLCRGCLTDRDDDKIVLCDGCDQAFHIYCMQPPRT 1592 KYYH RCLT L +YG WYCPSCLCR C DRDDD+IVLCDGCD A+H+YCM PPRT Sbjct: 377 YKYYHVRCLTNNLLKSYGPRWYCPSCLCRTCFVDRDDDQIVLCDGCDHAYHMYCMSPPRT 436 Query: 1593 SIPKGKWFCTICEEGIQLIRKAKRAHANSECKMIKKVERGNGTYENGPVGVSKEMEGEVD 1772 SIP+GKWFC C+ I+ IR+AKRA+ E ++ KK E EN + ++ E E Sbjct: 437 SIPRGKWFCRQCDVKIKEIRRAKRAYEKREKRLEKKAEADKRACENLEKKLDEKCEKE-S 495 Query: 1773 TSGGVDMLLTAAQTL 1817 +G +D+LLTAA L Sbjct: 496 GNGRLDILLTAAFNL 510 >ref|XP_002302212.1| predicted protein [Populus trichocarpa] gi|222843938|gb|EEE81485.1| predicted protein [Populus trichocarpa] Length = 604 Score = 388 bits (997), Expect = e-105 Identities = 222/562 (39%), Positives = 309/562 (54%), Gaps = 35/562 (6%) Frame = +3 Query: 228 TYRRRKRAKMNSDPEEKLLIAWNSDSQVTNKVSSFFGLLQSIKEPDLRSSCKQVSHVRMD 407 TY+RR+ + + D + Q K SF + + +++ + H+R + Sbjct: 46 TYKRRRNTRSSLDGK----------GQQDGK--SFMEAASRLADQTIKNDSQD--HLR-E 90 Query: 408 SHTSTNVSHECSVKDWRKVVLEQMHQSLSETEGGLQDCIRDALVF----------NHERS 557 +H S N S + S + WRK VL+ M+QS S E G+Q CIRDAL+ N + Sbjct: 91 NHASLNHSSDVSQRQWRKFVLDYMYQSSSNDEHGIQRCIRDALMMAVKIYAAIKLNESGN 150 Query: 558 CTSSIKEFRFGERMHSGAYNAFKDHVGATSNVSTNESNSCLDSDMWKQALFSVLISPKFA 737 C + + RM +G ++ K HVG SN + ES +D+ + A + L+S KF Sbjct: 151 CNADWHKSPSMGRMANGTHSTAKGHVGVISNGTLEESQHHSVTDLCQHAFLNTLLSEKFT 210 Query: 738 ELCDLICGNFHGMNVSSLFNLGLINSRINEGAYKNSPVLFHSDIQQVWTKLRKVGTEMVT 917 LC L+ NF GM S+ +L I+ R+ EGAY PVLF DI+Q W KL+ G E+++ Sbjct: 211 SLCKLLFENFKGMTTDSILSLNFIDKRMKEGAYDRLPVLFCEDIEQFWRKLQGFGAELIS 270 Query: 918 VAMSLSETSKASCQN---------TFY---MQETNGLSNVEQREGC-CNRACTCRHCEGK 1058 +A SLS SK +C N TF +++N EQ + C R C+CR C K Sbjct: 271 LAKSLSNISK-TCYNEQVGGLVDCTFEDKKHEDSNSHGKPEQTDACYVYRVCSCRRCGEK 329 Query: 1059 TEGRDCLVCENCKDIYHVSCIEP-VQEASRKSWHCAKCRSNGIGPRHEFCVICESINATR 1235 +GRDCLVC++C+++YHVSCI P V+E KSW+C C ++G+G H+ CV CE ++ R Sbjct: 330 ADGRDCLVCDSCEEMYHVSCIVPAVREIPPKSWYCHNCTTSGMGSPHKNCVACERLSCCR 389 Query: 1236 SSCTGVDGLAANGKEVEESFDDLEQDG---------LVNDGASLPC-CKICKIDIGV-ED 1382 D G +E F+D E+ L ++G C CKIC +G E Sbjct: 390 IQNNQADDEI--GLSTQEPFNDFEEASNFSANNEVKLSSEGTGNVCTCKICGSPVGNGEK 447 Query: 1383 FRICGHPFCPNKYYHTRCLTVEQLNTYGSCWYCPSCLCRGCLTDRDDDKIVLCDGCDQAF 1562 +IC H CP KYYH RCLT Q+++ G WYCPSCLCR C+TDRDDDKIVLCDGCD A+ Sbjct: 448 IKICDHSECPGKYYHVRCLTTRQIDSCGHRWYCPSCLCRVCITDRDDDKIVLCDGCDHAY 507 Query: 1563 HIYCMQPPRTSIPKGKWFCTICEEGIQLIRKAKRAHANSECKMIKKVERGNGTYENGPVG 1742 H+YCM PPR S+PKGKWFC C+ IQ +R+ +RA+ SE KK + G Sbjct: 508 HLYCMIPPRISVPKGKWFCRQCDVKIQRLRRVRRAYEKSESHR-KKNDEGVKKESENLKK 566 Query: 1743 VSKEMEGEVDTSGGVDMLLTAA 1808 + +E E D G+DML+TAA Sbjct: 567 LYEEGGEESDKGRGMDMLITAA 588