BLASTX nr result
ID: Angelica22_contig00028554
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00028554 (1936 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264... 343 1e-91 ref|XP_002318209.1| predicted protein [Populus trichocarpa] gi|2... 325 2e-86 ref|XP_003533172.1| PREDICTED: uncharacterized protein LOC100818... 300 7e-79 emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera] 284 7e-74 ref|NP_200056.1| hydroxyproline-rich glycoprotein family protein... 249 2e-63 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 343 bits (879), Expect = 1e-91 Identities = 210/493 (42%), Positives = 261/493 (52%), Gaps = 8/493 (1%) Frame = +3 Query: 195 NNSFEXXXXXXXXXXXXQTRVQPSSARKRRWRGCWSLYWCFGSSKHSKRIGRATLVPEPT 374 NNS E ++RVQP++ +KRRW C SLYWCFGS +HSKRIG A LVPEP Sbjct: 5 NNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPM 64 Query: 375 EAVTAAPFTETINYXXXXXXXXXXXXXXXXXXXXXXXXXGIHXXXXXXXXXXXXXXXXXP 554 AP +E +N P Sbjct: 65 VPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSP 124 Query: 555 GGTASIFTIGPYAYETQLVSPPVFSTYTTEPSTASFTPPPESAQLTTPSSPEVPFAQLLA 734 G AS+F IGPYA+ETQLVSPPVFST+ TEPSTA FTPPPES QLTTPSSPEVPFAQLL Sbjct: 125 SGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLT 184 Query: 735 SSLARSQRNTGPNQKFPSSQYEFQPYRLNXXXXXXXXXXXXXXXXNSGTSSPFPDKQPII 914 SSL RS+RN+G NQK S YEFQPY+L NSGTSSPFPD++PI+ Sbjct: 185 SSLDRSRRNSGTNQKLSLSNYEFQPYQL---YPESPVGHLISPISNSGTSSPFPDRRPIV 241 Query: 915 KFRVEEIPKFLGYEYFSSQKWXXXXXXXXXXXXXXXXXXXXXXXXXYGCVSRLGSGASTP 1094 E PK 
LG+E+FS+++W SRLGSG+ TP Sbjct: 242 -----EAPKLLGFEHFSTRRWG----------------------------SRLGSGSLTP 268 Query: 1095 NXXXXXXXXXXXXPTNREPVMKVTPVESQISELETLARSDKIS-GEEVVFDHRVSFELPT 1271 + P +R+ + +E+QISE+ +LA S+ S E V DHRVSFEL Sbjct: 269 D---------GAGPASRDSFL----LENQISEVASLANSESGSQNGETVIDHRVSFELAG 315 Query: 1272 EYVSASLKEDLL---ETVSECQKKVTTEG--ATLSNNISEKANNYCD-CEGCTEPILPGE 1433 E V+ +++ + ETV + + EG + ISE N C+ C G + Sbjct: 316 EDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEK 375 Query: 1434 DKRDNQEHQCRCLHQTISLGLSKEFIFDNTKGDASHKPT-LGVEWWTSEKVVGKNLVPRT 1610 + +E QC H I G KEF FDNTKG+ S KP +G EWW +EKVVGK P+T Sbjct: 376 ASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQT 435 Query: 1611 SWTFFPMLQPEVS 1649 +WTFFP+LQP +S Sbjct: 436 NWTFFPLLQPGIS 448 >ref|XP_002318209.1| predicted protein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| predicted protein [Populus trichocarpa] Length = 507 Score = 325 bits (834), Expect = 2e-86 Identities = 204/517 (39%), Positives = 259/517 (50%), Gaps = 27/517 (5%) Frame = +3 Query: 180 MSNLWNNSFEXXXXXXXXXXXXQTRVQPSSA--RKRRWRGCWSLYWCFGSS---KHSKRI 344 M ++ N+S E ++RVQPSS+ +KRRW GCWSLYWCFGS K+SKRI Sbjct: 1 MRSVNNSSIETVNAAATAIVSAESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRI 60 Query: 345 GRATLVPEPTEAVTAAPFTETINYXXXXXXXXXXXXXXXXXXXXXXXXXGIHXXXXXXXX 524 G A LVPEP + TE Sbjct: 61 GHAVLVPEPEVPGAVSSSTENQTQSTPILLPFIAPPSSPASFLQSDPPSSTQSPAGLLSL 120 Query: 525 XXXXXXXXXPGGTASIFTIGPYAYETQLVSPPVFSTYTTEPSTASFTPPPESAQLTTPSS 704 P G ASIF IGPYA+ETQLV+PPVFS +TTEPSTA FTPPPES QLTTPSS Sbjct: 121 TSLSANAYSPRGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSS 180 Query: 705 PEVPFAQLLASSLARSQRNTGPNQKFPSSQYEFQPYRLNXXXXXXXXXXXXXXXXNSGTS 884 PEVPFAQLL SSL R++RN+GPNQKF S YEFQ Y L NSGTS Sbjct: 181 PEVPFAQLLTSSLERARRNSGPNQKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTS 240 Query: 885 SPFPDKQPIIKFRVEEIPKFLGYEYFSSQKWXXXXXXXXXXXXXXXXXXXXXXXXXYGCV 1064 SPFPD+ P+++FR+ E PK LG+E+FS++KW + Sbjct: 241 SPFPDRHPMLEFRMGEAPKLLGFEHFSTRKWGSRLGSGSLTPDATPDGMG---------L 291 Query: 1065 
SRLGSGASTPNXXXXXXXXXXXXPTNREPVMK------VTP------------VESQISE 1190 SRLGSG TP+ + + +TP +E+QISE Sbjct: 292 SRLGSGTVTPDGMGLSRLCSGTATPDGAGLRSRLGSGTLTPDCFVPASQIGFLLENQISE 351 Query: 1191 LETLARSDKIS-GEEVVFDHRVSFELPTEYVSASLKEDLL---ETVSECQKKVTTEGATL 1358 + +L S+ S EE V HRVSFEL E V+ L+ + T E + E Sbjct: 352 VASLTNSENGSKTEENVVHHRVSFELSGEEVARCLEIKSVASTRTFPEYPQDTMPEDPVR 411 Query: 1359 SNNISEKANNYCDCEGCTEPILPGEDKRDNQEHQCRCLHQTISLGLSKEFIFDNTKGDAS 1538 + ++ C G +P ++ + +E H++I+LG KEF FDN+KG+ S Sbjct: 412 GDRLAMNGER-CLQNGEASSEMPEKNSEETEEDHVYRKHRSITLGSIKEFNFDNSKGEVS 470 Query: 1539 HKPTLGVEWWTSEKVVGKNLVPRTSWTFFPMLQPEVS 1649 KP + EWW +E + GK P SWTFFP+LQPEVS Sbjct: 471 DKPAISSEWWANETIAGKEARPANSWTFFPLLQPEVS 507 >ref|XP_003533172.1| PREDICTED: uncharacterized protein LOC100818313 [Glycine max] Length = 499 Score = 300 bits (769), Expect = 7e-79 Identities = 195/484 (40%), Positives = 241/484 (49%), Gaps = 16/484 (3%) Frame = +3 Query: 246 QTRVQPSSARKRRWRGCWSLYWCFGS---SKHSKRIGRATLVPEPTEAVTAAPFTETINY 416 ++RVQP+ A K+RW GCWS YWCFGS SK SKRIG A LVPEP A N Sbjct: 26 ESRVQPTDAPKKRWGGCWSQYWCFGSCKSSKSSKRIGHAVLVPEPAAPTGPAAAAAAPNP 85 Query: 417 XXXXXXXXXXXXXXXXXXXXXXXXXGIHXXXXXXXXXXXXXXXXXPGGTASIFTIGPYAY 596 GI GG AS+FTIGPYAY Sbjct: 86 SAAIVMPFIAPPSSPASFLQSDPPSGIQSPPGLLSLSALAANAYSSGGPASMFTIGPYAY 145 Query: 597 ETQLVSPPVFSTYTTEPSTASFTPPPESAQLTTPSSPEVPFAQLLASSLARSQRNTGPNQ 776 ETQLVSPPVFS +TTEPSTA +TPPPES Q TTPSSP+VPFAQLLASSL R++++ G N Sbjct: 146 ETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLLASSLDRARKSNG-NH 204 Query: 777 KFPSSQYEFQPYRLNXXXXXXXXXXXXXXXXNSGTSSPFPDKQPIIKFRVE--EIPKFLG 950 KFP YEF PY+ SGTS+PFPD+ P ++F E P+ LG Sbjct: 205 KFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFPFPKGETPRILG 264 Query: 951 YEYFSSQKW----XXXXXXXXXXXXXXXXXXXXXXXXXYGCVSRLGSGASTPN--XXXXX 1112 +E+FS+++W G SRLGSG TP+ Sbjct: 265 FEHFSTRRWGSRLGSGSLTPDGAWQGSRLGSGSLTPDGIGLASRLGSGCVTPDGLGLESR 324 Query: 1113 XXXXXXXPTNREPV-MKVTPVESQISELETLARSDK-ISGEEVVFDHRVSFELPTEYVSA 1286 P + P+ V++QIS+ TLA +D S + DHRVSFEL E V+ Sbjct: 325 
LGSGCLTPDSAGPINQNNISVQNQISKEATLADTDNGHSSNATLIDHRVSFELTGEDVAR 384 Query: 1287 SLKED---LLETVSECQKKVTTEGATLSNNISEKANNYCDCEGCTEPILPGEDKRDNQEH 1457 L LL +S + + LS + ++ D + CTE +DK DN Sbjct: 385 CLANKTGVLLRNMSGSSQGI------LSKDPVDRERVQKDTDTCTEKT---DDKPDNSVG 435 Query: 1458 QCRCLHQTISLGLSKEFIFDNTKGDASHKPTLGVEWWTSEKVVGKNLVPRTSWTFFPMLQ 1637 +CLH+ S+ SKEF FDN KGD S G EWWT+ KV GK SW FFPMLQ Sbjct: 436 GEQCLHKQNSVNSSKEFNFDNRKGDVSVTAGSGSEWWTNRKVAGKEGRSANSWAFFPMLQ 495 Query: 1638 PEVS 1649 E++ Sbjct: 496 SEMN 499 >emb|CAN63074.1| hypothetical protein VITISV_026979 [Vitis vinifera] Length = 385 Score = 284 bits (726), Expect = 7e-74 Identities = 173/374 (46%), Positives = 216/374 (57%), Gaps = 8/374 (2%) Frame = +3 Query: 552 PGGTASIFTIGPYAYETQLVSPPVFSTYTTEPSTASFTPPPESAQLTTPSSPEVPFAQLL 731 P G AS+F IGPYA+ETQLVSPPVFST+ TEPSTA FTPPPES QLTTPSSPEVPFAQLL Sbjct: 61 PSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 120 Query: 732 ASSLARSQRNTGPNQKFPSSQYEFQPYRLNXXXXXXXXXXXXXXXXNSGTSSPFPDKQPI 911 SSL RS+RN+G NQK S YEFQPY+L NSGTSSPFPD++PI Sbjct: 121 TSSLDRSRRNSGTNQKLSLSNYEFQPYQL---YPESPVGHLISPISNSGTSSPFPDRRPI 177 Query: 912 IKFRVEEIPKFLGYEYFSSQKWXXXXXXXXXXXXXXXXXXXXXXXXXYGCVSRLGSGAST 1091 + E PK LG+E+FS+++W SRLGSG+ T Sbjct: 178 V-----EAPKLLGFEHFSTRRWG----------------------------SRLGSGSLT 204 Query: 1092 PNXXXXXXXXXXXXPTNREPVMKVTPVESQISELETLARSDKIS-GEEVVFDHRVSFELP 1268 P+ P +R+ + +E+QISE+ +LA S+ S E V DHRVSFEL Sbjct: 205 PD---------GAGPASRDSFL----LENQISEVASLANSESGSQNGETVIDHRVSFELA 251 Query: 1269 TEYVSASLKEDLL---ETVSECQKKVTTEG--ATLSNNISEKANNYCD-CEGCTEPILPG 1430 E V+ +++ + ETV + + EG + ISE N C+ C G Sbjct: 252 GEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASE 311 Query: 1431 EDKRDNQEHQCRCLHQTISLGLSKEFIFDNTKGDASHKPT-LGVEWWTSEKVVGKNLVPR 1607 + + +E QC H I G KEF FDNTKG+ S KP +G EWW +EKVVGK P+ Sbjct: 312 KASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQ 371 Query: 1608 TSWTFFPMLQPEVS 1649 T+WTFFP+LQP +S Sbjct: 372 TNWTFFPLLQPGIS 385 >ref|NP_200056.1| 
hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|10177409|dbj|BAB10540.1| unnamed protein product [Arabidopsis thaliana] gi|40823427|gb|AAR92282.1| At5g52430 [Arabidopsis thaliana] gi|56381929|gb|AAV85683.1| At5g52430 [Arabidopsis thaliana] gi|110738650|dbj|BAF01250.1| hypothetical protein [Arabidopsis thaliana] gi|332008830|gb|AED96213.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 438 Score = 249 bits (636), Expect = 2e-63 Identities = 181/499 (36%), Positives = 242/499 (48%), Gaps = 9/499 (1%) Frame = +3 Query: 180 MSNLWNNSFEXXXXXXXXXXXXQTRVQPSSARKRRWRGCWSLYWCFGSSKHSKRIGRATL 359 M N+ NNS E ++RVQPSS++K RW CWSLY CFG+ K++KRIG A L Sbjct: 1 MRNVVNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVL 60 Query: 360 VPEPTEAVTAAPFTETINYXXXXXXXXXXXXXXXXXXXXXXXXXGIHXXXXXXXXXXXXX 539 VPEP VT+ T+ Sbjct: 61 VPEP---VTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTS 117 Query: 540 XXXXPGGTASIFTIGPYAYETQLVSPPVFSTYTTEPSTASFTPPPESA-QLTTPSSPEVP 716 P S+FT+GPYA ETQ V+PPVFS + TEPSTA +TPPPES+ +TTPSSPEVP Sbjct: 118 NTFSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVP 177 Query: 717 FAQLLASSLARSQRN--TGPNQKFPSSQYEFQPYRL-NXXXXXXXXXXXXXXXXNSGTSS 887 FAQLL SSL ++R+ +G NQKF SS YEF+ ++ NSGTSS Sbjct: 178 FAQLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSS 237 Query: 888 PFPDKQPIIKFRVEEIPKFLGYEYFSSQKWXXXXXXXXXXXXXXXXXXXXXXXXXYGCVS 1067 P+P K P+++FR+ E PKFLG+E+F+++KW G S Sbjct: 238 PYPGKSPMVEFRIGEPPKFLGFEHFTARKW--------------GSRFGSGSITPVGHGS 283 Query: 1068 RLGSGASTPNXXXXXXXXXXXXPTNREPVMKVTPVESQISELETLARSDKISGEEVVFDH 1247 L SGA TPN N P P+++QISE+ +LA SD E +V DH Sbjct: 284 GLASGALTPNGPEIVSG-------NLTPNNTTWPLQNQISEVASLANSDH-GSEVMVADH 335 Query: 1248 RVSFELPTEYVSASLKEDLLETVSECQK--KVTTEGAT---LSNNISEKANNYCDCEGCT 1412 RVSFEL E V+ L L + ++ TE ++ + NI +++ + Sbjct: 336 RVSFELTGEDVARCLASKLNRSHDRMNNNDRIETEESSSTDIRRNIEKRSGD-------- 387 Query: 1413 EPILPGEDKRDNQEHQCRCLHQTISLGLSKEFIFDNTKGDASHKPTLGVEWWTSEKVVGK 1592 R+N++H+ + L + 
S+G SKEF FDNTK + EKV G Sbjct: 388 ---------RENEQHRIQKLSSS-SIGSSKEFKFDNTKDE------------NIEKVAG- 424 Query: 1593 NLVPRTSWTFFPMLQPEVS 1649 SW+FFP L+ VS Sbjct: 425 -----NSWSFFPGLRSGVS 438
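The per-hit statistics in the report above (hit identifier, subject length, bit score, E-value, identities) can be pulled out programmatically. The following is a minimal sketch, assuming the report is available as a single string in the flattened pairwise text layout shown here. The names `HIT_PATTERN` and `parse_hit_stats` are hypothetical and not part of BLAST or any library; the regular expression is written against this one report and may need adjusting for other BLAST versions or output formats.

```python
import re

# Hypothetical pattern, written against the pairwise text layout above:
# ">id description ... Length = N Score = B bits (R), Expect = E Identities = a/b (p%)"
HIT_PATTERN = re.compile(
    r">(?P<id>\S+)\s.*?Length\s*=\s*(?P<length>\d+)\s+"
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aln>\d+)",
    re.DOTALL,  # hit headers may wrap across line breaks
)


def parse_hit_stats(report_text):
    """Return one dict per hit found in a pairwise BLAST text report."""
    hits = []
    for m in HIT_PATTERN.finditer(report_text):
        evalue_text = m.group("evalue").rstrip(",")
        # Some BLAST versions print very small E-values as "e-100";
        # prefix a 1 so that float() accepts the string.
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        ident = int(m.group("ident"))
        aln = int(m.group("aln"))
        hits.append({
            "id": m.group("id"),
            "subject_length": int(m.group("length")),
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue_text),
            "identities": ident,
            "alignment_length": aln,
            # Recomputed from the counts rather than read from the rounded "(42%)"
            "percent_identity": round(100.0 * ident / aln, 1),
        })
    return hits
```

Calling `parse_hit_stats` on the full report text should yield one entry per `>` hit block, which can then be sorted or filtered by `evalue` or `percent_identity`.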