BLASTX nr result

ID: Angelica22_contig00020944

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00020944
         (2543 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004145843.1| PREDICTED: uncharacterized protein LOC101220...   448   e-123
ref|XP_002526587.1| conserved hypothetical protein [Ricinus comm...   429   e-117
ref|NP_001240050.1| uncharacterized protein LOC100789581 [Glycin...   421   e-115
ref|XP_003519313.1| PREDICTED: uncharacterized protein LOC100783...   417   e-114
ref|XP_002305906.1| predicted protein [Populus trichocarpa] gi|2...   415   e-113

>ref|XP_004145843.1| PREDICTED: uncharacterized protein LOC101220526 [Cucumis sativus]
            gi|449521649|ref|XP_004167842.1| PREDICTED:
            uncharacterized LOC101220526 [Cucumis sativus]
          Length = 364

 Score =  448 bits (1153), Expect = e-123
 Identities = 219/369 (59%), Positives = 276/369 (74%)
 Frame = -3

Query: 2295 MFGAVHFGIMAALVVLFVPMGLAGWHLSRNKVLFFSGILFITLFVGVHLSPYFTSVSSFV 2116
            M G V  GI+AA +VLFVPMG+AGWHLSRNK+LFFSG LFITL +GVHL+PY  SVS FV
Sbjct: 1    MLGGVQLGILAACIVLFVPMGMAGWHLSRNKMLFFSGALFITLAIGVHLTPYIPSVSDFV 60

Query: 2115 NTFPTASSSVPVLYEQEDRDICMSVLQLHHVIYKFESGNKDGPLNDSSVHEDLSWNWDYR 1936
             T     SSV V    + R  C+S  QLH +++  +  +   PL+++SV+ + SW W   
Sbjct: 61   TTV----SSVVVF---DSRASCVS--QLHEIVWDVKQSDGFNPLSNNSVNYEKSWKWGRS 111

Query: 1935 APSAGCDFQKLSKVDASDLLNGSWVVVAGDSQARLFVVSLLDLVLGPEKMESVRGDLFKR 1756
            AP   CDFQKL+  D +DLLNGSWVVVAGDSQARL  +SLLDL L  ++ME+VRGDLFKR
Sbjct: 112  APVIACDFQKLAPTDVADLLNGSWVVVAGDSQARLMALSLLDLTLDSQRMEAVRGDLFKR 171

Query: 1755 HSNYQTVIDSIGLKLDFLWAPYARNLTDIVMEFMGNKTYPDVLVLGAGLWDMLRITNSTE 1576
            HSNYQ +I   G+KLDF+WAPYA NLTD++ EF  N++YPDV+++G+GLW ML  TN+++
Sbjct: 172  HSNYQILIGETGMKLDFIWAPYASNLTDLMGEFKKNRSYPDVIIMGSGLWHMLHFTNASD 231

Query: 1575 YGTSLQQLNTYLVSSLPVSPEYSTSDPITGAVSVPTPHFFWLGSPTLINRMLNTEEKKVK 1396
            +G SL+ L + +VS +P++PE  +  P+TG+VS+ TPH FW+G PTLIN MLNTEEK+ K
Sbjct: 232  FGLSLESLRSSVVSLIPLTPELGSDGPLTGSVSIRTPHLFWIGMPTLINSMLNTEEKRKK 291

Query: 1395 MTDAMYAAYEREMYKSKLLRQSGGPAFLLDIKLLSQKCGDRCTEDGMHYNNAVYETAVHI 1216
            MTD M AAY+  +  SKLLR SGGP  LLDI+ LS  CG RCT DGMHY+  VYE A+HI
Sbjct: 292  MTDTMRAAYDAALGDSKLLRSSGGPLLLLDIETLSWNCGVRCTVDGMHYDGVVYEAAIHI 351

Query: 1215 MLNALIIES 1189
            MLNAL+IES
Sbjct: 352  MLNALLIES 360


>ref|XP_002526587.1| conserved hypothetical protein [Ricinus communis]
            gi|223534081|gb|EEF35799.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 377

 Score =  429 bits (1104), Expect = e-117
 Identities = 218/376 (57%), Positives = 275/376 (73%), Gaps = 7/376 (1%)
 Frame = -3

Query: 2295 MFGAVHFGIMAALVVLFVPMGLAGWHLSRNKVLFFSGILFITLFVGVHLSPYFTSVSSFV 2116
            MFG V  G++AA VVLFVPMG+AGWHLSRNK+LFFSG LFITL VGVHL+PYF SVS FV
Sbjct: 1    MFGGVQLGVLAACVVLFVPMGMAGWHLSRNKMLFFSGALFITLAVGVHLTPYFPSVSDFV 60

Query: 2115 NTFPTASSSVPVLYEQEDRDICMSVLQ------LHHVIYKFESGNKD-GPLNDSSVHEDL 1957
                T+  SV V   +ED   C++++          VI    SG+ D G + + S+  D 
Sbjct: 61   ----TSVQSVVVFDNREDS--CINLVNEVVWDVKPRVINDSSSGSSDSGGIKNGSLSYDK 114

Query: 1956 SWNWDYRAPSAGCDFQKLSKVDASDLLNGSWVVVAGDSQARLFVVSLLDLVLGPEKMESV 1777
             W+W        C+FQ+L + DASDLLNGSWVVVAGDSQARL V SLL L+L  ++MES+
Sbjct: 115  IWDWAKTGKVKACEFQRLDRFDASDLLNGSWVVVAGDSQARLIVQSLLKLILDSKRMESI 174

Query: 1776 RGDLFKRHSNYQTVIDSIGLKLDFLWAPYARNLTDIVMEFMGNKTYPDVLVLGAGLWDML 1597
            +GDLFKRHS+YQ VI+ IGLKLDF+WAPY  NLTD+++ F  N++YPDVLV+G+GLW ML
Sbjct: 175  KGDLFKRHSDYQIVIEEIGLKLDFIWAPYVVNLTDLMIGFKQNRSYPDVLVMGSGLWHML 234

Query: 1596 RITNSTEYGTSLQQLNTYLVSSLPVSPEYSTSDPITGAVSVPTPHFFWLGSPTLINRMLN 1417
             + N+++YG +LQ L + +VS LP SP+     P+TG+VSV +PH FWLG P LIN MLN
Sbjct: 235  HMNNASDYGFALQSLRSSVVSLLPFSPQLGADGPVTGSVSVRSPHLFWLGMPMLINGMLN 294

Query: 1416 TEEKKVKMTDAMYAAYEREMYKSKLLRQSGGPAFLLDIKLLSQKCGDRCTEDGMHYNNAV 1237
            TEEK+ KM+D M+ AY+R +  S+LLR+ GGP  LLDI+ +S  CG RCT DGMHY+ AV
Sbjct: 295  TEEKREKMSDEMWHAYDRALRNSRLLRRYGGPLLLLDIQSMSWNCGPRCTVDGMHYDGAV 354

Query: 1236 YETAVHIMLNALIIES 1189
            YE AVHI+LNAL+IES
Sbjct: 355  YEAAVHILLNALLIES 370


>ref|NP_001240050.1| uncharacterized protein LOC100789581 [Glycine max]
            gi|255636963|gb|ACU18814.1| unknown [Glycine max]
          Length = 370

 Score =  421 bits (1083), Expect = e-115
 Identities = 213/372 (57%), Positives = 269/372 (72%), Gaps = 3/372 (0%)
 Frame = -3

Query: 2295 MFGAVHFGIMAALVVLFVPMGLAGWHLSRNKVLFFSGILFITLFVGVHLSPYFTSVSSFV 2116
            M GAV  G++AA VVLFVPMG+AGWHLSRNKVLFFSG LFITL VGVHL+PYF SVS FV
Sbjct: 1    MLGAVQLGLLAACVVLFVPMGMAGWHLSRNKVLFFSGALFITLAVGVHLTPYFPSVSDFV 60

Query: 2115 NTFPTASSSVPVLYEQEDRDICMSVLQLHHVIYKFESGNK-DGPLNDS--SVHEDLSWNW 1945
             +  ++S +V V    +DRD C+S+L  H ++++       D  LND+  SV+ D SW+W
Sbjct: 61   TSVSSSSVNVVV----DDRDSCVSLL--HEIVWEVRPRRVFDFELNDNNNSVNYDKSWSW 114

Query: 1944 DYRAPSAGCDFQKLSKVDASDLLNGSWVVVAGDSQARLFVVSLLDLVLGPEKMESVRGDL 1765
                    C+FQ+L + D S LLNGSWVV+AGDSQAR+F +SLL LVL PE MESV+G L
Sbjct: 115  KRSGSVDSCEFQRLKRYDVSVLLNGSWVVIAGDSQARIFALSLLSLVLEPEGMESVKGSL 174

Query: 1764 FKRHSNYQTVIDSIGLKLDFLWAPYARNLTDIVMEFMGNKTYPDVLVLGAGLWDMLRITN 1585
            FKRHS+Y TV+D IG+KLDF+WAPY  NLT +V  F  N+ YPD+LV+G+GLW ML  TN
Sbjct: 175  FKRHSDYHTVVDEIGMKLDFMWAPYVTNLTSLVAGFKRNRVYPDLLVMGSGLWHMLHFTN 234

Query: 1584 STEYGTSLQQLNTYLVSSLPVSPEYSTSDPITGAVSVPTPHFFWLGSPTLINRMLNTEEK 1405
            +++YG SL  L + + S LPVS E+   + +  + SV +PH FWLG PTL+N MLNT+EK
Sbjct: 235  ASDYGFSLGLLRSSVTSLLPVSSEFGNDEAVAVSASVRSPHLFWLGMPTLVNSMLNTKEK 294

Query: 1404 KVKMTDAMYAAYEREMYKSKLLRQSGGPAFLLDIKLLSQKCGDRCTEDGMHYNNAVYETA 1225
            + KMTD M+  YERE+  S +LRQ GGP  L+DI  LS+ CG +CT DGMHY+  VYE  
Sbjct: 295  REKMTDLMWGEYEREVQGSGMLRQFGGPLQLVDIGSLSRNCGIKCTVDGMHYDGVVYEAG 354

Query: 1224 VHIMLNALIIES 1189
            V I+LNAL+IES
Sbjct: 355  VQILLNALLIES 366


>ref|XP_003519313.1| PREDICTED: uncharacterized protein LOC100783987 [Glycine max]
          Length = 374

 Score =  417 bits (1071), Expect = e-114
 Identities = 214/380 (56%), Positives = 270/380 (71%), Gaps = 11/380 (2%)
 Frame = -3

Query: 2295 MFGAVHFGIMAALVVLFVPMGLAGWHLSRNKVLFFSGILFITLFVGVHLSPYFTSVSSFV 2116
            M GAV  G++AA VVLFVPMG+AGWHLSRNKVLFFSG LFITL VGVHL+PYF SVS FV
Sbjct: 1    MLGAVQLGLLAACVVLFVPMGMAGWHLSRNKVLFFSGALFITLAVGVHLTPYFPSVSDFV 60

Query: 2115 NTFPTASSSVPVLYEQEDRDICMSVLQLHHVIYK--------FESGNKDGPLNDSSVHED 1960
             +  ++S +V V    +DRD+C+S+L  H ++++        FE  N     N++SV+ D
Sbjct: 61   TSVSSSSVNVVV----DDRDLCVSLL--HDIVWEVRPRRVFDFELNNN----NNNSVNYD 110

Query: 1959 LSWNWDYRAPSAGCDFQKLSKVDASDLLNGSWVVVAGDSQARLFVVSLLDLVLGPEKMES 1780
             SW+W        C+FQ+L + D S LLNGSWVVVAGDSQAR+F +SLL LVL  E MES
Sbjct: 111  KSWSWKRSGSVESCEFQRLKRHDVSVLLNGSWVVVAGDSQARIFTLSLLSLVLDSEGMES 170

Query: 1779 VRGDLFKRHSNYQTVIDSIGLKLDFLWAPYARNLTDIVMEFMGNKTYPDVLVLGAGLWDM 1600
            V+G LFKRHS+Y TV+D IG+KLDF+WAPY  NLT +V  F  N+ YPD+LV+G+GLW M
Sbjct: 171  VKGSLFKRHSDYHTVVDEIGMKLDFMWAPYVTNLTSLVAGFKRNRVYPDLLVMGSGLWHM 230

Query: 1599 LRITNSTEYGTSLQQLNTYLVSSLPVSPEYSTSD---PITGAVSVPTPHFFWLGSPTLIN 1429
            L  TN+++YG SL  L + + S LPVS E+   +    ++ +V  P+PH FWLG PTL+N
Sbjct: 231  LHFTNASDYGFSLGVLRSSVNSLLPVSSEFGNDEADVAVSASVRSPSPHLFWLGMPTLVN 290

Query: 1428 RMLNTEEKKVKMTDAMYAAYEREMYKSKLLRQSGGPAFLLDIKLLSQKCGDRCTEDGMHY 1249
             MLNT EK+ KMTD M+  YERE+  S +LRQ GGP  L+DI  LS  CG +CT+DGMHY
Sbjct: 291  SMLNTNEKREKMTDLMWGEYEREVQGSGMLRQFGGPLQLVDIGSLSWTCGIKCTDDGMHY 350

Query: 1248 NNAVYETAVHIMLNALIIES 1189
            +  VYE  V IMLNAL+IES
Sbjct: 351  DGVVYEAGVQIMLNALLIES 370


>ref|XP_002305906.1| predicted protein [Populus trichocarpa] gi|222848870|gb|EEE86417.1|
            predicted protein [Populus trichocarpa]
          Length = 371

 Score =  415 bits (1066), Expect = e-113
 Identities = 209/369 (56%), Positives = 263/369 (71%)
 Frame = -3

Query: 2295 MFGAVHFGIMAALVVLFVPMGLAGWHLSRNKVLFFSGILFITLFVGVHLSPYFTSVSSFV 2116
            MFG +  G++AA VVLFVPMG+AG+HLSRNK+LFFSG LFITL VGVHL+PYF SVS FV
Sbjct: 1    MFGGIRIGVLAACVVLFVPMGMAGYHLSRNKMLFFSGALFITLAVGVHLTPYFPSVSDFV 60

Query: 2115 NTFPTASSSVPVLYEQEDRDICMSVLQLHHVIYKFESGNKDGPLNDSSVHEDLSWNWDYR 1936
                T+  SV V   +ED  I +    + +V  +  S N     NDS  H+ + W+W   
Sbjct: 61   ----TSVQSVVVFDNREDSCINLVNEVIWNVKPRVISSNGSDRSNDSVGHDKI-WDWSKN 115

Query: 1935 APSAGCDFQKLSKVDASDLLNGSWVVVAGDSQARLFVVSLLDLVLGPEKMESVRGDLFKR 1756
                GCDF+KL + D  DLLNGSWVVVAGDSQARL V SLL L+L  ++M  + GDLFKR
Sbjct: 116  GMVKGCDFEKLGRGDVKDLLNGSWVVVAGDSQARLLVQSLLSLILDEKRMGMIMGDLFKR 175

Query: 1755 HSNYQTVIDSIGLKLDFLWAPYARNLTDIVMEFMGNKTYPDVLVLGAGLWDMLRITNSTE 1576
            HS+Y+ V+D IG+KLDF+WAPY  NLT++++ F  N+TYPDVLV+GAGLW ML + N+++
Sbjct: 176  HSDYEIVVDDIGMKLDFVWAPYVVNLTNLMVGFKQNRTYPDVLVIGAGLWHMLHVNNASD 235

Query: 1575 YGTSLQQLNTYLVSSLPVSPEYSTSDPITGAVSVPTPHFFWLGSPTLINRMLNTEEKKVK 1396
            Y  +L+ L + +VS LP SPE  T  P+TG+VSV +PH FWLG P LIN MLNTEEK+ K
Sbjct: 236  YDIALENLRSSVVSLLPFSPELGTDGPVTGSVSVRSPHLFWLGMPMLINEMLNTEEKREK 295

Query: 1395 MTDAMYAAYEREMYKSKLLRQSGGPAFLLDIKLLSQKCGDRCTEDGMHYNNAVYETAVHI 1216
            M D +  AY   ++ S++LR  GGP  LLDI+ LS  CG RCT DGMHY+  VYE AVHI
Sbjct: 296  MNDKIRHAYYGALHDSRILRSYGGPLLLLDIQSLSWNCGPRCTNDGMHYDGTVYEAAVHI 355

Query: 1215 MLNALIIES 1189
            +LNAL+IES
Sbjct: 356  LLNALLIES 364

