BLASTX nr result
ID: Angelica22_contig00001426
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00001426
         (1653 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_002521962.1| conserved hypothetical protein [Ricinus comm...   439   e-120
ref|XP_002319651.1| predicted protein [Populus trichocarpa] gi|2...   439   e-120
gb|ADZ55303.1| hypothetical protein MA17P03.10 [Coffea arabica]       402   e-109
gb|ABZ89185.1| putative protein [Coffea canephora]                    402   e-109
ref|XP_003536143.1| PREDICTED: uncharacterized protein LOC100793...   400   e-109

>ref|XP_002521962.1| conserved hypothetical protein [Ricinus communis]
 gi|223538766|gb|EEF40366.1| conserved hypothetical protein [Ricinus communis]
          Length = 450

 Score = 439 bits (1129), Expect = e-120
 Identities = 224/408 (54%), Positives = 281/408 (68%), Gaps = 9/408 (2%)
 Frame = +1

Query: 184  RPVIDATIPSTPAPKTQ-VYXXXXXXXXXXXXXXXXLDANGRLEILANRLGLWFEYAPLI 360
            +P+ A IPSTP P Q +Y LD GRLE+LANRLGLW+EYAPLI
Sbjct: 43   KPISAALIPSTPPPSNQQLYQPFRPPPSPIPSQFSSLDTAGRLEVLANRLGLWYEYAPLI 102

Query: 361  PSLTQEGFAAPTIEEVTGLTGVEQNQLIVGCQVRDSLVQSKLEEEVLAFFDLGGAQLLYE 540
            PSL QEGF+ P+IEE TG++GVEQN+L+V +VR+SL QS+ E+++ FD GGA+LLYE
Sbjct: 103  PSLIQEGFSPPSIEESTGISGVEQNRLVVAAKVRESLTQSQTAAEIVSEFDTGGAELLYE 162

Query: 541  IRLLSVSQRAEAARLIANEKFDVKSAQELARAIKDYPRRKKEKGWECFEYTSPRDCLAFM 720
            IRLLS QRA AAR I + D K A++LARA+KD+PRR+ +KGWE F+YT P DCL+FM
Sbjct: 163  IRLLSAPQRAAAARFIVENRLDAKGAEDLARAMKDFPRRRGDKGWESFDYTLPGDCLSFM 222

Query: 721  YYRLALEHQS-LEPRELGLEKALEMAETERAKNRILKDLRGKSGHGVGEEESVAKAIKVP 897
            YYR + EH++ EPR LE+AL++AE+E+AKN +LK+L G S +E V A +VP
Sbjct: 223  YYRQSREHKTPSEPRTNALERALDVAESEKAKNEVLKELEGDSEGKEEKEGEVGDATRVP 282

Query: 898  VVRMKFGEVSEATSVAVLPVCRSEERDKEVVDAPWECGTAGEFGVVVAEKPWSRWVVLPG 1077
            VVR++ GEV+EATSV VLPVCR+ +++KE+ +APWEC + GEFGVVVAEK W RWVVLPG
Sbjct: 283  VVRLRIGEVAEATSVVVLPVCRALQKEKEIWEAPWECKSEGEFGVVVAEKGWERWVVLPG 342

Query: 1078 WEPXXXXXXXXXXXXFSDARALPWKVNRWYKEESVLVVADRKVKEVTIDDGFYLVCKD-- 1251
            WEP F DARALPWKVNRWYKEE++LVVADR KEV +DGFYLV D
Sbjct: 343  WEPVVGLEKGGVVVAFPDARALPWKVNRWYKEEAILVVADRGSKEVNANDGFYLVAVDGS 402

Query: 1252 -----DGLKVEIGSALKXXXXXXXXXXXXXXXRPPREDAENQLEEDWE 1380
            GL+VE GS LK RPP+E + +E+WE
Sbjct: 403  GDGRSGGLEVERGSILKERGVEESLGTVVLVVRPPKEQTDQLSDENWE 450

>ref|XP_002319651.1| predicted protein [Populus trichocarpa]
 gi|222858027|gb|EEE95574.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 439 bits (1129), Expect = e-120
 Identities = 228/397 (57%), Positives = 274/397 (69%), Gaps = 5/397 (1%)
 Frame = +1

Query: 205  IPSTPAPKTQVYXXXXXXXXXXXXXXXXLDANGRLEILANRLGLWFEYAPLIPSLTQEGF 384
            IPS+P P Q+Y LDA RLEIL+NRLGLW+EYAPLIPSL QEGF
Sbjct: 55   IPSSPPPYQQLYQPFRPPPSPIPSQYKSLDAPSRLEILSNRLGLWYEYAPLIPSLFQEGF 114

Query: 385  AAPTIEEVTGLTGVEQNQLIVGCQVRDSLVQSKLEEEVLAFFDLGGAQLLYEIRLLSVSQ 564
            P+IEE TG++GVEQN+L+VG QVRDSLVQS + E++A FDLGGA+LLYEIRLLS +Q
Sbjct: 115  TPPSIEEATGISGVEQNRLVVGAQVRDSLVQSNTDPEIVASFDLGGAELLYEIRLLSATQ 174

Query: 565  RAEAARLIANEKFDVKSAQELARAIKDYPRRKKEKGWECFEYTSPRDCLAFMYYRLALEH 744
            R+ AAR I K D K AQ+LARA+KD+PRR+ +K WE F+Y P DCL+FMYYR + EH
Sbjct: 175  RSAAARFIVVNKMDTKGAQDLARAMKDFPRRRGDKFWESFDYVLPGDCLSFMYYRQSREH 234

Query: 745  QS-LEPRELGLEKALEMAETERAKNRILKDLRGKSGHGVGEEESVAKAIKVPVVRMKFGE 921
            ++ E R L+ ALE+AE+E+AK+ ILK+L G E A ++VPVVR+K GE
Sbjct: 235  KNPSESRTNALQMALEVAESEKAKSAILKELEGGGERKERAEGETADGVRVPVVRLKIGE 294

Query: 922  VSEATSVAVLPVCRSEERDKEVVDAPWECGTAGEFGVVVAEKPWSRWVVLPGWEPXXXXX 1101
            V+EATSV VLPVCRSE+ ++++V+APWEC GEFGVVVAEK W RWVVLPGWEP
Sbjct: 295  VAEATSVVVLPVCRSEDGERKIVEAPWECKGQGEFGVVVAEKAWERWVVLPGWEPVLGLG 354

Query: 1102 XXXXXXXFSDARALPWKVNRWYKEESVLVVADRKVKEVTIDDGFYLVCKDDG---LKVEI 1272
            F DAR LPWK NRWYKEES+LVVADR KEV DDGFYLV D KVE
Sbjct: 355  RGGVAVAFPDARVLPWKANRWYKEESILVVADRGSKEVKADDGFYLVTLDGAGGDFKVER 414

Query: 1273 GSALKXXXXXXXXXXXXXXXRPPREDAENQL-EEDWE 1380
            GSALK RPPR + ++QL +EDWE
Sbjct: 415  GSALKERNVVECLGTVLLVVRPPRYETDDQLSDEDWE 451

>gb|ADZ55303.1| hypothetical protein MA17P03.10 [Coffea arabica]
          Length = 451

 Score = 402 bits (1034), Expect = e-109
 Identities = 215/413 (52%), Positives = 272/413 (65%), Gaps = 13/413 (3%)
 Frame = +1

Query: 181  PRPVIDATIP--STPAPKTQVYXXXXXXXXXXXXXXXXLDANGRLEILANRLGLWFEYAP 354
            P V+ IP S+ A + Q+Y LD NGRLEIL+NRLG WFEYAP
Sbjct: 39   PNSVVALIIPPKSSAAQQQQLYQPFRPPPSPLPPQYRNLDTNGRLEILSNRLGPWFEYAP 98

Query: 355  LIPSLTQEGFAAPTIEEVTGLTGVEQNQLIVGCQVRDSLVQSKLEEEVLAFFDLGGAQLL 534
            LI +L QEGF PT+EE+TG++GVEQN+L+V QVR+SLVQS+++ ++L+FFD GGA+LL
Sbjct: 99   LISALFQEGFTPPTLEEITGISGVEQNRLVVAAQVRESLVQSEIDPDILSFFDTGGAELL 158

Query: 535  YEIRLLSVSQRAEAARLIANEKFDVKSAQELARAIKDYPRRKKEKGWECFEYTSPRDCLA 714
            YEIRLLS SQRA AA+ + KFD + ELARAIKD PRRK EKGWE F+ P DCLA
Sbjct: 159  YEIRLLSASQRASAAKYLVLNKFDARMTLELARAIKDKPRRKGEKGWESFDGDLPGDCLA 218

Query: 715  FMYYRLALEHQSLEPREL---GLEKALEMAETERAKNRILKDLRG-KSGHGVGEEESVAK 882
            FMY+R A EH++ EL LE+AL+ E+E + R+L++L G K G +E + A
Sbjct: 219  FMYFRQAQEHRTASSPELWRSALERALQAVESENGRERVLEELEGEKDGEDKDKEGAAAD 278

Query: 883  AIKVPVVRMKFGEVSEATSVAVLPVCRSEERDKEVVDAPWECGTAGEFGVVVAEKPWSRW 1062
            + VPVVRM+ GEV+E++ VAVLPVCR+EER+ EV +APWEC G+FGVV AEK W RW
Sbjct: 279  RVVVPVVRMQTGEVAESSVVAVLPVCRAEEREVEVEEAPWECAGVGDFGVVEAEKGWGRW 338

Query: 1063 VVLPGWEPXXXXXXXXXXXXFSDARALPWKVNRWYKEESVLVVADRKVKEVTIDDGFYLV 1242
            VVLPGWEP F +AR LP + +W +EE++LVVADR KEV DD FYLV
Sbjct: 339  VVLPGWEPVAGLKRGGVAVAFKNARVLPGRAKKWNREEAILVVADRGRKEVVTDDNFYLV 398

Query: 1243 ------CKDDGLKVEIGSALKXXXXXXXXXXXXXXXRPPREDAENQL-EEDWE 1380
            ++GLKVE G LK RPPRE+ ++QL +EDWE
Sbjct: 399  VGGGNGSVEEGLKVERGLELKEIGVKESLGTVVLVVRPPREEYDDQLSDEDWE 451

>gb|ABZ89185.1| putative protein [Coffea canephora]
          Length = 451

 Score = 402 bits (1034), Expect = e-109
 Identities = 215/413 (52%), Positives = 272/413 (65%), Gaps = 13/413 (3%)
 Frame = +1

Query: 181  PRPVIDATIP--STPAPKTQVYXXXXXXXXXXXXXXXXLDANGRLEILANRLGLWFEYAP 354
            P V+ IP S+ A + Q+Y LD NGRLEIL+NRLG WFEYAP
Sbjct: 39   PNSVVALIIPPKSSAAQQQQLYQPFRPPPSPLPPQYRNLDTNGRLEILSNRLGPWFEYAP 98

Query: 355  LIPSLTQEGFAAPTIEEVTGLTGVEQNQLIVGCQVRDSLVQSKLEEEVLAFFDLGGAQLL 534
            LI +L QEGF PT+EE+TG++GVEQN+L+V QVR+SLVQS+++ ++L+FFD GGA+LL
Sbjct: 99   LISALFQEGFTPPTLEEITGISGVEQNRLVVAAQVRESLVQSEIDPDILSFFDTGGAELL 158

Query: 535  YEIRLLSVSQRAEAARLIANEKFDVKSAQELARAIKDYPRRKKEKGWECFEYTSPRDCLA 714
            YEIRLLS SQRA AA+ + KFD + ELARAIKD PRRK EKGWE F+ P DCLA
Sbjct: 159  YEIRLLSASQRASAAKYLVLNKFDARMTLELARAIKDKPRRKGEKGWESFDGDLPGDCLA 218

Query: 715  FMYYRLALEHQSLEPREL---GLEKALEMAETERAKNRILKDLRG-KSGHGVGEEESVAK 882
            FMY+R A EH++ EL LE+AL+ E+E + R+L++L G K G +E + A
Sbjct: 219  FMYFRQAQEHRTASSPELWRSALERALQAVESENGRERVLEELEGKKDGEDKDKEGAAAD 278

Query: 883  AIKVPVVRMKFGEVSEATSVAVLPVCRSEERDKEVVDAPWECGTAGEFGVVVAEKPWSRW 1062
            + VPVVRM+ GEV+E++ VAVLPVCR+EER+ EV +APWEC G+FGVV AEK W RW
Sbjct: 279  RVVVPVVRMQTGEVAESSVVAVLPVCRAEEREVEVEEAPWECAGVGDFGVVEAEKGWGRW 338

Query: 1063 VVLPGWEPXXXXXXXXXXXXFSDARALPWKVNRWYKEESVLVVADRKVKEVTIDDGFYLV 1242
            VVLPGWEP F +AR LP + +W +EE++LVVADR KEV DD FYLV
Sbjct: 339  VVLPGWEPVAGLKRGGVAVAFKNARVLPGRAKKWNREEAILVVADRGRKEVVTDDNFYLV 398

Query: 1243 ------CKDDGLKVEIGSALKXXXXXXXXXXXXXXXRPPREDAENQL-EEDWE 1380
            ++GLKVE G LK RPPRE+ ++QL +EDWE
Sbjct: 399  VGGGNGSVEEGLKVERGLELKEIGVKESLGTVVLVVRPPREEYDDQLSDEDWE 451

>ref|XP_003536143.1| PREDICTED: uncharacterized protein LOC100793519 [Glycine max]
          Length = 439

 Score = 400 bits (1028), Expect = e-109
 Identities = 211/388 (54%), Positives = 266/388 (68%), Gaps = 2/388 (0%)
 Frame = +1

Query: 223  PKTQVYXXXXXXXXXXXXXXXXLDANGRLEILANRLGLWFEYAPLIPSLTQEGFAAPTIE 402
            P+ QVY LD GR++ILANRLGLW+EYAPLI SL +EGF+ PTIE
Sbjct: 55   PQHQVYQPFRPPPSPLPSQFGTLDIAGRIDILANRLGLWYEYAPLINSLIREGFSPPTIE 114

Query: 403  EVTGLTGVEQNQLIVGCQVRDSLVQSKLEEEVLAFFDLGGAQLLYEIRLLSVSQRAEAAR 582
            E TG++GVEQN+LIVG QVRDSLV SK + ++LA F+ GGA+LLYEIRLLS SQR AAR
Sbjct: 115  ETTGISGVEQNRLIVGAQVRDSLVHSKADPDLLAAFETGGAELLYEIRLLSASQRVAAAR 174

Query: 583  LIANEKFDVKSAQELARAIKDYPRRKKEKGWECFEYTSPRDCLAFMYYRLALEHQS-LEP 759
            + + D K+AQELAR++KD+P R+ +KGW F+YT P DCL+FMYYR + EH++ E
Sbjct: 175  FLVENRCDGKAAQELARSMKDFPSRRGDKGWARFDYTLPGDCLSFMYYRQSREHRNPSEQ 234

Query: 760  RELGLEKALEMAETERAKNRILKDLRGKSGHGVGEEESVAKAIKVPVVRMKFGEVSEATS 939
            R LE+AL +AETE A+N IL++L G G + ++ A++VPVVR++ GEV+EA+S
Sbjct: 235  RTSALEQALRVAETEAARNMILEELEGNGEEG-DKVDAGEGAVRVPVVRLRIGEVAEASS 293

Query: 940  VAVLPVCRSEERDKEVVDAPWECGTAGEFGVVVAEKPWSRWVVLPGWEPXXXXXXXXXXX 1119
            V VLPV +EER E+++AP+E + G FGVVVAEK W +WVVLP W+P
Sbjct: 294  VVVLPVSAAEER--EILEAPYESRSQGVFGVVVAEKGWGKWVVLPSWDPVVGLGKGGVVV 351

Query: 1120 XFSDARALPWKVNRWYKEESVLVVADRKVKEVTIDDGFYLVCKD-DGLKVEIGSALKXXX 1296
            F DAR LPWKVNRWYKEE +LVVADR KEV DDGFYLV D +GLKVE GS LK
Sbjct: 352  SFPDARVLPWKVNRWYKEEPILVVADRSKKEVGADDGFYLVNADGEGLKVERGSGLKEKG 411

Query: 1297 XXXXXXXXXXXXRPPREDAENQLEEDWE 1380
            RPP+E+ + +EDWE
Sbjct: 412  FTQSLGTVLLVVRPPKEENDELSDEDWE 439