BLASTX nr result
ID: Angelica23_contig00008324
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00008324
         (641 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score       E
Sequences producing significant alignments:                        (bits)   Value

ref|XP_003554214.1| PREDICTED: uncharacterized protein LOC100810...   135   5e-30
ref|XP_002323327.1| predicted protein [Populus trichocarpa] gi|2...   135   5e-30
ref|XP_004147878.1| PREDICTED: uncharacterized protein LOC101206...   132   7e-29
ref|NP_187968.1| cysteine/histidine-rich C1 domain-containing pr...   125   7e-27
dbj|BAB02601.1| unnamed protein product [Arabidopsis thaliana]        125   7e-27

>ref|XP_003554214.1| PREDICTED: uncharacterized protein LOC100810028 [Glycine max]
          Length = 567

 Score = 135 bits (341), Expect = 5e-30
 Identities = 70/211 (33%), Positives = 107/211 (50%), Gaps = 8/211 (3%)
 Frame = +3

Query: 33  INVVETESSDHM-----ICDGCVQPILSPPDSFYGCLD--CNFFLHNICAAELPREFEHA 191
           + E ES D + +CD CV+PI+ P F+ C + C FFLH CA ELPR +H
Sbjct: 259 LKFTEEESDDDVKYFDKLCDACVRPIMPP---FFSCEEENCGFFLHQSCA-ELPRTKQHP 314

Query: 192 SHPQHKLIRCDQIGKPNAFFLCWSCDKLCNGIFYRCESCEFYFDLVCAAMPSEIKHDSHK 371
           H ++ + + C C +L NG YRC+ C+F D+ C + I+H+SH
Sbjct: 315 FHTHPLTLQSK--APYDGIYKCDGCRRLSNGFVYRCDVCQFDLDVCCGTLEERIEHESHV 372

Query: 372 HKLKYANDPKIMPCDGCGYPSRRNFICTTC-NYTINTSCVRNPGRIKHRWDEHQLCLVYP 548
           H L C GC S+ ++C C +++++ C P + H +D+H L L Y
Sbjct: 373 HPLFLKKTSVSRQCKGCHLWSKLVYVCDVCEDFSVDCGCATLPSKTWHMYDKHPLSLNY- 431

Query: 549 PVKGHPHDFNCELCSEDINPNHWFYHCRKCD 641
           V+G + C +C+E ++P WFY+C CD
Sbjct: 432 FVEGSLREHECGICNEKMSPKQWFYYCDDCD 462

 Score = 98.2 bits (243), Expect = 1e-18
 Identities = 56/201 (27%), Positives = 92/201 (45%), Gaps = 3/201 (1%)
 Frame = +3

Query: 45  ETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFEHASHPQHKLIRCD 224
           ++ ++ ++C GC + I P + C+ CN+ LH CA ++ R +H HPQH L+
Sbjct: 19  DSNYNNQVLCSGCEKLISGP---VFCCVQCNYVLHKKCA-QIARHVKHPFHPQHPLVLLS 74

Query: 225 QIGKPNAFFLCWSCDKLCNGIFYRCESCEFYFDLVCAAMPSEIKHDSHKHKLKYANDPKI 404
           + +C SC L Y C C + D+ C + + D HKH+ +P+
Sbjct: 75  TTPYEGPY-ICDSCRGLFINFVYHCYHCNYDLDVSCGSQFNP--DDGHKHEFLSVTNPQS 131

Query: 405 MPCDGCGYPSRRNF--ICTTCNYTINTSCVRNPGRIKHRWDEHQLCLVYP-PVKGHPHDF 575
           C CG + +CT C +++SC P I+ + +H L L Y V G +
Sbjct: 132 FVCYACGMHANEGLACLCTICQIWVHSSCAELPLAIRIKGHDHPLKLTYTLHVYGFWGNL 191

Query: 576 NCELCSEDINPNHWFYHCRKC 638
           C +C+E ++P Y C C
Sbjct: 192 TCGVCNEAMSPAFAGYFCSTC 212

>ref|XP_002323327.1| predicted protein [Populus trichocarpa]
 gi|222867957|gb|EEF05088.1| predicted protein [Populus trichocarpa]
          Length = 675

 Score = 135 bits (341), Expect = 5e-30
 Identities = 70/215 (32%), Positives = 112/215 (52%), Gaps = 2/215 (0%)
 Frame = +3

Query: 3   SLILLDNFNDINVVETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREF 182
           +LIL+D FN+ CDGC+ PI +P FY C +CNFFL C LPR+
Sbjct: 404 NLILIDEFNN----------DPKCDGCMLPIFTP---FYSCTECNFFLDKACIG-LPRK- 448

Query: 183 EHASHPQHKLIRCDQIGKPNAFFLCWSCDKLCNGIFYRCESCEFYFDLVC-AAMPSEIKH 359
           ++ + +H LI + + F C C++ C+G Y C+ C + D+ C + I+H
Sbjct: 449 KYWQYDRHPLILILNTWEEDPF-QCAICEQYCHGFSYNCDRCHRFLDVRCFKSTKDSIEH 507

Query: 360 DSHKHKLKYANDPKIMPCDGCGYPSR-RNFICTTCNYTINTSCVRNPGRIKHRWDEHQLC 536
           H+H L A + + C GCG + F C C++ ++ C P + +HR+DEH L
Sbjct: 508 GGHEHPLYLAVESENRHCSGCGVSGESQTFRCVVCDFNLDFKCATLPDKARHRYDEHPLF 567

Query: 537 LVYPPVKGHPHDFNCELCSEDINPNHWFYHCRKCD 641
           L Y + + + + C++C ++ +P WFY C +CD
Sbjct: 568 LTY--IDPNDYQYVCQICEKERDPKLWFYRCEECD 600

 Score = 117 bits (294), Expect = 1e-24
 Identities = 68/207 (32%), Positives = 103/207 (49%), Gaps = 15/207 (7%)
 Frame = +3

Query: 66  MICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFEHASHPQHKLIRCDQIGKPNA 245
           ++C GC +PI P Y C CNFFLH C ELP+E + HPQH L +GKP A
Sbjct: 148 VMCSGCDEPISGPS---YRCTSCNFFLHKKCT-ELPQEIKRCLHPQHPL---HLLGKPPA 200

Query: 246 F----FLCWSCDKLCNGIFYRCESCEFYFDLVCAAMP--SEIKHDSHKHKLKYANDPKIM 407
           ++C C+K C YRC C FY D+ CA P SE++ H ++ PK +
Sbjct: 201 HHTGKWMCDLCNKTCKSFVYRCSFCNFYLDIKCALPPCLSEVEGQEH----QFICLPKSL 256

Query: 408 P-------CDGCGYPSRRN-FICTTCNYTINTSCVRNPGRIKHRWDEH-QLCLVYPPVKG 560
           P C+ CG + F+CT C ++ +C+ P IK +H ++ Y +
Sbjct: 257 PLKIVSFTCNACGTDGDDSPFVCTMCQLIVHKTCISFPRSIKLCIHQHPRIIHTYHLQQC 316

Query: 561 HPHDFNCELCSEDINPNHWFYHCRKCD 641
           + + C +C + ++ N+ Y+C+ CD
Sbjct: 317 NSRNKYCGICRDGVDTNYGVYYCQDCD 343

 Score = 103 bits (258), Expect = 2e-20
 Identities = 70/223 (31%), Positives = 98/223 (43%), Gaps = 20/223 (8%)
 Frame = +3

Query: 33  INVVETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFE---HASHPQ 203
           IN V S + +IC GC +PI P Y C C FFLH C AELPRE + H HP
Sbjct: 16  INQVLEYSGELVICSGCDEPIWGP---CYSCTSCYFFLHKTC-AELPREIKRHIHFKHPL 71

Query: 204 HKLIRCDQIGKPNAFF----LCWSCDKLCNGIFYRCESCEFYFDLVCAAMPSEIKHDSHK 371
           H L KP + + +C C K C Y C C+F + CA + DS
Sbjct: 72  HLL------AKPPSHYILGCICDLCKKTCESFVYHCSVCKFDLHIKCAFQQCFFEVDSQA 125

Query: 372 HKLKYANDPKI-----------MPCDGCGYP-SRRNFICTTCNYTINTSCVRNPGRIKH- 512
           H+ + + I + C GC P S ++ CT+CN+ ++ C P IK
Sbjct: 126 HQFAHIDHSLISNEEQEFHVEGVMCSGCDEPISGPSYRCTSCNFFLHKKCTELPQEIKRC 185

Query: 513 RWDEHQLCLVYPPVKGHPHDFNCELCSEDINPNHWFYHCRKCD 641
           +H L L+ P H + C+LC++ + Y C C+
Sbjct: 186 LHPQHPLHLLGKPPAHHTGKWMCDLCNKTC--KSFVYRCSFCN 226

 Score = 65.5 bits (158), Expect = 8e-09
 Identities = 59/225 (26%), Positives = 92/225 (40%), Gaps = 48/225 (21%)
 Frame = +3

Query: 108 DSFYGCLDCNFFLHNICAAELPREFEHASHPQHKLIRCDQIGKPNA-------------- 245
           DS + C C +H C + PR + H ++I + + N+
Sbjct: 274 DSPFVCTMCQLIVHKTCIS-FPRSIKLCIHQHPRIIHTYHLQQCNSRNKYCGICRDGVDT 332

Query: 246 ---FFLCWSCDKLCN---GIFYRCESCEF----------------YFDLVCAA------M 341
           + C CD + + GI YR + E FD+V +
Sbjct: 333 NYGVYYCQDCDFVAHVNCGIQYRLSNTESDGDGRSITMNDEFKESSFDIVREIKHGDERI 392

Query: 342 PSEIKHDSHKHKL----KYANDPKIMPCDGCGYPSRRNFI-CTTCNYTINTSCVRNPGRI 506
           +EIKH SH+H L ++ NDPK CDGC P F CT CN+ ++ +C+ P +
Sbjct: 393 IAEIKHFSHQHNLILIDEFNNDPK---CDGCMLPIFTPFYSCTECNFFLDKACIGLPRKK 449

Query: 507 KHRWDEHQLCLVYPPVKGHPHDFNCELCSEDINPNHWF-YHCRKC 638
           ++D H L L+ + P F C +C + H F Y+C +C
Sbjct: 450 YWQYDRHPLILILNTWEEDP--FQCAICEQYC---HGFSYNCDRC 489

 Score = 59.3 bits (142), Expect = 6e-07
 Identities = 48/171 (28%), Positives = 72/171 (42%), Gaps = 16/171 (9%)
 Frame = +3

Query: 177 EFEHASHPQHKLIRCDQIGKPNA-FFLCWSCDKLCNGIFYRCESCEFYFDLVCAAMPSEI 353
           E EH SHP H LI +Q+ + + +C CD+ G Y C SC F+ CA +P EI
Sbjct: 2   EVEHFSHPVHPLILINQVLEYSGELVICSGCDEPIWGPCYSCTSCYFFLHKTCAELPREI 61

Query: 354 KHDSH-KHKLKYANDPKI-----MPCDGCGYPSRRNFI--CTTCNYTINTSCVRNPGRIK 509
           K H KH L P CD C + +F+ C+ C + ++ C +
Sbjct: 62  KRHIHFKHPLHLLAKPPSHYILGCICDLC-KKTCESFVYHCSVCKFDLHIKCAFQQCFFE 120

Query: 510 HRWDEHQLCLV-YPPVKGHPHDFN-----CELCSEDIN-PNHWFYHCRKCD 641
           HQ + + + +F+ C C E I+ P+ Y C C+
Sbjct: 121 VDSQAHQFAHIDHSLISNEEQEFHVEGVMCSGCDEPISGPS---YRCTSCN 168

>ref|XP_004147878.1| PREDICTED: uncharacterized protein LOC101206314 [Cucumis sativus]
          Length = 829

 Score = 132 bits (331), Expect = 7e-29
 Identities = 69/201 (34%), Positives = 103/201 (51%), Gaps = 2/201 (0%)
 Frame = +3

Query: 45  ETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFEHASHPQHKLIRCD 224
           E E +CDGC++ + P YGC +C+FF+H C ELPR+ + H QH L D
Sbjct: 387 EEELGQDRVCDGCMKRLSGPS---YGCEECDFFVHKECL-ELPRKKRNFIH-QHSL---D 438

Query: 225 QIGKPNAFFLCWSCDKLCNGIFYRCESCEFYFDLVCAAMPSEIKHDSHKHKLKYANDPKI 404
           I PN F C +C K NG Y C++C FD C ++ KH +H+H L +
Sbjct: 439 LISIPNFVFQCQACLKYFNGFAYHCKTCLSTFDTRCTSIKIPFKHPAHQHPLSLDRTNED 498

Query: 405 MPCDGCGYPSRRN--FICTTCNYTINTSCVRNPGRIKHRWDEHQLCLVYPPVKGHPHDFN 578
           C+GCG + F C C++ ++ C P +++R+D H L L + ++
Sbjct: 499 HKCEGCGEGVKHKVAFRCVDCDFHLDAGCATLPLGVRYRFDPHPLDLTFFE-NEEEEEYC 557

Query: 579 CELCSEDINPNHWFYHCRKCD 641
           CE+C E +P WFY C+KC+
Sbjct: 558 CEICEEKRDPGPWFYGCQKCN 578

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 51/202 (25%), Positives = 81/202 (40%), Gaps = 5/202 (2%)
 Frame = +3

Query: 45  ETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFEHASHPQHKLIRCD 224
           ET+ D ++C + S+Y C + H CA ELPRE ++ +H L
Sbjct: 143 ETDEIDCLVCG----LFIKSGSSYYLCSFGDSRFHQQCA-ELPREMLNSDFHEHPLFLLP 197

Query: 225 QIGKPNAFFLCWSCDKLCNGIFYRCESCEFYFDLVCAAMPSEIKHDSHKHKL-KYANDPK 401
           P +C SC C Y C C+F + C ++ HKH KY N +
Sbjct: 198 S-SPPQT--ICNSCKNDCGEFVYNCSLCDFNLHIAC------LQSFKHKHSFTKYRNRTQ 248

Query: 402 IMPCDGCGYPSRR-NFICTTCNYTINTSCVRNPGRIK---HRWDEHQLCLVYPPVKGHPH 569
           C CG ++ C C+ +++ C + P ++ HR + L V +
Sbjct: 249 FF-CRACGEKGDGFSWYCIICHLSVHEKCAKMPLTLRIFGHRLHDLSLTYFRDRVDFVGN 307

Query: 570 DFNCELCSEDINPNHWFYHCRK 635
           +C++C E I + Y C K
Sbjct: 308 KIDCKICGEKIRTKYAAYGCYK 329

 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 52/221 (23%), Positives = 85/221 (38%), Gaps = 22/221 (9%)
 Frame = +3

Query: 45  ETESSDHMICDGCVQPILSPPDSFYGCLD--CNFFLHNICAAELPREFEHASHPQHKLIR 218
           ++++ + + C C + + P + C D CNF +H C +LP + + HPQH L R
Sbjct: 19  QSKNDEVVFCTRCRRQLRPPA---FTCSDSLCNFHIHQSCI-DLPPQIHNRFHPQHLLSR 74

Query: 219 CDQIGKPNAFFLCWSCDKLCNGIFYRCESCEFYFDLVCAAMP-----------SEIKHDS 365
           + C C ++ +G Y C C F D+ CA +E +H S
Sbjct: 75  TTNN------YHCVPCSQMPSGDVYICSQCCFQIDVKCAIADTKASGLRRMNGNEFRHFS 128

Query: 366 HKHKL-----KYANDPKIMPCDGCGY---PSRRNFICTTCNYTINTSCVRNPGR-IKHRW 518
           H H L + + + C CG ++C+ + + C P + +
Sbjct: 129 HPHTLTLLQPEQNRETDEIDCLVCGLFIKSGSSYYLCSFGDSRFHQQCAELPREMLNSDF 188

Query: 519 DEHQLCLVYPPVKGHPHDFNCELCSEDINPNHWFYHCRKCD 641
           EH L L + P C C D + Y+C CD
Sbjct: 189 HEHPLFL----LPSSPPQTICNSCKNDC--GEFVYNCSLCD 223

>ref|NP_187968.1| cysteine/histidine-rich C1 domain-containing protein [Arabidopsis thaliana]
 gi|332641858|gb|AEE75379.1| cysteine/histidine-rich C1 domain-containing protein [Arabidopsis thaliana]
          Length = 513

 Score = 125 bits (314), Expect = 7e-27
 Identities = 74/209 (35%), Positives = 102/209 (48%), Gaps = 7/209 (3%)
 Frame = +3

Query: 33  INVVETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFEHASHPQHKL 212
           +++ E E S +C CV PI SF GC C+F LH++CA+ LPR+ EH H
Sbjct: 236 LDMGEKEESGQ-VCQACVLPIEF--GSFLGCKQCDFALHDVCAS-LPRKMEHGIHIHPLT 291

Query: 213 IRCDQIGKPNAFFLCWSCDKLCNGIFYRC--ESCEFYFDLVCAAMPSEIKHDSHKHKLKY 386
           I D + N FF C C++ G Y+C E CEF D+ CA+ H HKH L
Sbjct: 292 IHVDAMNNENGFFTCSVCNQHSCGFMYKCCQEDCEFKIDVKCASFAEPFDHSMHKHPLYL 351

Query: 387 A--NDPKIMPCDGCGYPSRRNFIC-TTCNYTINTSCVRNPGRIKHRWDEHQLCLVYPPV- 554
           A D I C+GC SR C C + + C+ P +K+++D H L L
Sbjct: 352 AIYFDEFIYRCEGCTQSSRFAAKCYKGCWFPLEFKCLNLPKLVKYKYDSHSLTLYSDKYY 411

Query: 555 -KGHPHDFNCELCSEDINPNHWFYHCRKC 638
           K ++ CE+C E+IN FY C +C
Sbjct: 412 SKSFQLEWWCEICEENINKKKLFYTCHEC 440

>dbj|BAB02601.1| unnamed protein product [Arabidopsis thaliana]
          Length = 486

 Score = 125 bits (314), Expect = 7e-27
 Identities = 74/209 (35%), Positives = 102/209 (48%), Gaps = 7/209 (3%)
 Frame = +3

Query: 33  INVVETESSDHMICDGCVQPILSPPDSFYGCLDCNFFLHNICAAELPREFEHASHPQHKL 212
           +++ E E S +C CV PI SF GC C+F LH++CA+ LPR+ EH H
Sbjct: 209 LDMGEKEESGQ-VCQACVLPIEF--GSFLGCKQCDFALHDVCAS-LPRKMEHGIHIHPLT 264

Query: 213 IRCDQIGKPNAFFLCWSCDKLCNGIFYRC--ESCEFYFDLVCAAMPSEIKHDSHKHKLKY 386
           I D + N FF C C++ G Y+C E CEF D+ CA+ H HKH L
Sbjct: 265 IHVDAMNNENGFFTCSVCNQHSCGFMYKCCQEDCEFKIDVKCASFAEPFDHSMHKHPLYL 324

Query: 387 A--NDPKIMPCDGCGYPSRRNFIC-TTCNYTINTSCVRNPGRIKHRWDEHQLCLVYPPV- 554
           A D I C+GC SR C C + + C+ P +K+++D H L L
Sbjct: 325 AIYFDEFIYRCEGCTQSSRFAAKCYKGCWFPLEFKCLNLPKLVKYKYDSHSLTLYSDKYY 384

Query: 555 -KGHPHDFNCELCSEDINPNHWFYHCRKC 638
           K ++ CE+C E+IN FY C +C
Sbjct: 385 SKSFQLEWWCEICEENINKKKLFYTCHEC 413