BLASTX nr result

ID: Angelica22_contig00008566 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00008566
         (1968 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   130   1e-27
emb|CBI27399.3| unnamed protein product [Vitis vinifera]              128   5e-27
ref|XP_003520134.1| PREDICTED: uncharacterized protein LOC100778...   114   7e-23
ref|XP_002311412.1| predicted protein [Populus trichocarpa] gi|2...   111   6e-22
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   102   4e-19

>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  130 bits (328), Expect = 1e-27
 Identities = 110/387 (28%), Positives = 176/387 (45%), Gaps = 33/387 (8%)
 Frame = -2

Query: 1379 SSNEEAKESVAASGVHTEMSHADSIVYTDKNILESDLPELIVCYKDSVFHAVKDICVDEG 1200
            S +++ +E    + +  E    DS+ Y DKN++E +LPEL++CYK++ +H VKDICVDEG
Sbjct: 87   SIHDKEEEVRNFTSLKIESFDKDSVFYIDKNVMEPELPELVLCYKENTYHVVKDICVDEG 146

Query: 1199 MPAENKCLTENVVGGLS----TLPSNNYKHRDVTEEADADISYEDDFKS-SSYK-DFREN 1038
            +P++   L +  V         +P  + K     E  D D+S +   K+ +S+K D +E+
Sbjct: 147  VPSQENFLFDTSVDQEKLCPYLIPEKDIKSEIQKERVDLDMSTQYLSKNDNSFKCDSKES 206

Query: 1037 GAAHGVNNGEV-NLDKDTAYDSDSV-ELESSPESDKYQNSASIRFPKTGEEECNAVEKIT 864
             A   + +  +  +   T+ ++ S+ EL   PE     + +      T E E  ++++ +
Sbjct: 207  MAIAEIEDDAMEEIANYTSKETFSLGELLLMPEVVAELSHSKSLLNSTDEAEQLSIQRPS 266

Query: 863  EIVCDST-----------------------LLGKDLHSETSLESLLKSARCGENNSSQQS 753
            E +  +T                       L+ +  H E  L +L   +      S    
Sbjct: 267  ENIVLATASACEESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSDS--SPKASDHGH 324

Query: 752  DQISVARAQSPVVTEESTTSNSDNLLDSGVISLNVDSSKLAPTARDEIHGNATG--PQLK 579
            D++ +A       TEE           S  +    D +  APTA     G+  G    L+
Sbjct: 325  DEVILASLAPSYATEEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEGSQVGGSEHLE 384

Query: 578  SEKKLIHDDGISDNRLVINNQIKRDQGESSFSVAGPLPDVVPYSGHIPFXXXXXXXXXXX 399
            S     H+D  +      + Q++   GESSFS AGPL  ++ YSG I +           
Sbjct: 385  SRNSSRHED--TSITEPFSGQLQYSHGESSFSAAGPLSGLISYSGPIAYSGSLSLRSDSS 442

Query: 398  XXXXXSFAFPVLPNEWNSSPVRMAKAD 318
                 SFAFP+L +EWNSSPVRMAKAD
Sbjct: 443  TTSTRSFAFPILQSEWNSSPVRMAKAD 469


>emb|CBI27399.3| unnamed protein product [Vitis vinifera]
          Length = 435

 Score =  128 bits (322), Expect = 5e-27
 Identities = 123/409 (30%), Positives = 183/409 (44%), Gaps = 11/409 (2%)
 Frame = -2

Query: 1511 DQNGISSHPNGYERIAPLSTELKNNNGYRKAPEDDVYADVDDFTSS-NEEAKESVAASGV 1335
            +Q  IS    G+ER A     L   + +    E D   +VDD  ++   E + SVA   V
Sbjct: 45   NQKVISCDLKGHERDAD---PLDGEDRFWNTSERDCSINVDDIANACGNEVRNSVATCVV 101

Query: 1334 HTEMSHA---DSIVYTDKNILESDLPELIVCYKDSVFHAVKDICVDEGMPAENKCLTEN- 1167
             +E   +   D  + TDK++ + +LP   VC ++S +HAVKDIC+DEGM +  K L EN 
Sbjct: 102  SSEKLESFEKDGDMCTDKSVTKHELP---VCCEESTYHAVKDICIDEGMLSPEKILVENG 158

Query: 1166 ---VVGGLSTLPSNNYKHRDVTEE-ADADISYEDDFKSSSYKDFRENGAAHGVNNGEVNL 999
                 G    LP +  K+ D T+E AD ++   D  K+S+     EN     +   E N 
Sbjct: 159  KEEHEGFCPFLPPDTDKNVDPTKETADKELPLPDGQKASA-----ENDCGKDLMQEEENY 213

Query: 998  DKDTAYDSDSVELESSPESDKYQNSASIRFPKTGEEECNAVEKITEIVCDSTLLGKDLHS 819
            D      SD+ E +  PE        S         E N +E   + +       ++ + 
Sbjct: 214  DARDKIISDTSEEKIVPEDIFLIPELSKANSMPESSEFNGMEIEHQCI-------QNPNG 266

Query: 818  ETSLESLLKSARCGENNSSQQSDQISVARAQSPVVTEESTTSNSDNLLDSGVISLNVDSS 639
            E  LE+    +   E++ +   +++S                  ++ L+SG I+ +  SS
Sbjct: 267  EAVLENPALVSEAEESDKNSFPNELSY-----------------NSKLESGTITFDFGSS 309

Query: 638  KLAPTARDEIHGNATG--PQLKSEKKLIHDDGISDNRLVINNQIKRDQGESSFSVAGPLP 465
              +  +  E+     G  P L+S+     +DG     L  + QI+R  GESSFS AGP  
Sbjct: 310  TTSMDSGREVSPQNDGCEPPLESQNLSKLEDG--SESLPFSGQIQRGLGESSFSAAGPSS 367

Query: 464  DVVPYSGHIPFXXXXXXXXXXXXXXXXSFAFPVLPNEWNSSPVRMAKAD 318
             ++ YSG I                  SFAFPVL  EWNSSPVRMAKA+
Sbjct: 368  ALISYSGQITHSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAE 416


>ref|XP_003520134.1| PREDICTED: uncharacterized protein LOC100778990 [Glycine max]
          Length = 485

 Score =  114 bits (286), Expect = 7e-23
 Identities = 93/344 (27%), Positives = 159/344 (46%), Gaps = 16/344 (4%)
 Frame = -2

Query: 1301 YTDKNILESDLPELIVCYKDSVFHAVKDICVDEGMPAENKCLTENVVGGLSTLPSNNYKH 1122
            Y DK + + + P L VCYK+S +H VKDIC+DEG+  ++K +  N        P +   H
Sbjct: 134  YMDKTVTQCE-PHLEVCYKESNYHVVKDICIDEGVLKKDKVMFLN--------PDDEKAH 184

Query: 1121 RDVTEEADADISYEDDFKSSSYKDFRENG-AAHGVNNGEVNLDKDTAYDSDSVE-LESSP 948
                 ++  +   + D  S         G  AH     E   +K+   D+ S+  L  +P
Sbjct: 185  NFFPSDSYENKEKQKDNTSIGVLSLIPTGEKAHNFFPSESYENKEKQKDNTSINVLSLTP 244

Query: 947  ESDKYQNSASIRFPK--------TGEEECNAVEKITEIVCDSTLLGKDLHSETSLESLLK 792
              +  ++ A+   PK          E+  + V K T +  D  LL +DL ++ S+ S  K
Sbjct: 245  TKESDKDPANHDQPKDLMHKDEDATEKVSSNVNKETPLPEDKVLL-QDLLAQDSVSSDDK 303

Query: 791  SARCG-----ENNSSQQSDQISVARAQSPVVTEESTTSNSDNLL-DSGVISLNVDSSKLA 630
              +        +   +  + +  A  ++P +  E   SN+DN+L + G  +  +D S  +
Sbjct: 304  GEQISNEPELHSQPEESKNTVEEAILETPSLALEDDESNNDNVLSEKGSFTHQLDPSVPS 363

Query: 629  PTARDEIHGNATGPQLKSEKKLIHDDGISDNRLVINNQIKRDQGESSFSVAGPLPDVVPY 450
               +++ H        + ++ +   +G SD++  +   ++   GE+SFS  GP+   + Y
Sbjct: 364  DCGKEDCHQAGVCKCDEIQQTMKPVEGKSDDQ-AVTGTVRHSLGEASFSAIGPMSGRISY 422

Query: 449  SGHIPFXXXXXXXXXXXXXXXXSFAFPVLPNEWNSSPVRMAKAD 318
            SG +P+                SFAFP++ +EWNSSPVRMAKAD
Sbjct: 423  SGPVPYSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKAD 466


>ref|XP_002311412.1| predicted protein [Populus trichocarpa] gi|222851232|gb|EEE88779.1|
            predicted protein [Populus trichocarpa]
          Length = 486

 Score =  111 bits (278), Expect = 6e-22
 Identities = 118/465 (25%), Positives = 187/465 (40%), Gaps = 73/465 (15%)
 Frame = -2

Query: 1493 SHPNGYERIAPLSTELKNNNGYRKAPEDDVYADVDDFTSSNEEAKESVAASGVHTEMS-- 1320
            S P  Y   A  S  LK+ NG  K  E+  ++D++      +           H+ +   
Sbjct: 7    SRPVEYNDNALDSIGLKSGNGSVKEIENGKFSDLNGMEGDADRLPNVAPVPSPHSSLKME 66

Query: 1319 -HADSIVYTDKNILESDLPELIVCYKDSVFHAVKDICVDEGMPAENKCLTENVVGGLST- 1146
               +S+ Y DK+++  ++PELIVCYK++ +H VKDICVDEG+P ++K L +      +  
Sbjct: 67   PFEESVFYMDKSVMVREVPELIVCYKENTYH-VKDICVDEGVPLQDKFLFDTDAHKKNMC 125

Query: 1145 --LPSNNYKHRD-VTEEADADISYEDDFKSSSYKD--------------FRENGAAHGVN 1017
              LPS    + + V E++D D+   +  KSSS K                 E G+ H ++
Sbjct: 126  EFLPSERDMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDLHLPVPDVLISSEEKGSKHDLS 185

Query: 1016 NG---------EVNLDKDTAYDSDSVELE----------------SSPESDKYQNSASIR 912
                       E  +D  T   +D+   E                 +P +  Y N   + 
Sbjct: 186  LDCDPKHLMPTEEVMDYGTKKVTDNASKEILSLRDLLSMSELGAKCTPANASYHNMDKVE 245

Query: 911  FPKTGEEECNAVEKITEIVCDSTLLGKDLHSETSLES----------------------L 798
                     NA+ +      +S   G++  S+  LES                      +
Sbjct: 246  QQSLLCPRENAILETDSASEESEHCGEETISDNGLESATLAIPTQDPAYQEGDHGHTEAV 305

Query: 797  LKSARCGENNSSQQSDQISVARAQSPVVTEESTTSNSDNL-----LDSGVISLNVDSSKL 633
            L S           S +  +A       +E ST+   D L      ++  IS + DSS  
Sbjct: 306  LVSPTLTSAAEESDSKETKLASHALDSFSEGSTSRIEDELPYNSKTETRSISFDNDSSAP 365

Query: 632  APTARDEIHGNATGPQLKSEKKLIHDDGISDNRLVINNQIKRDQGESSFSVAGPLPDVVP 453
            A +AR+       G   +   +++      +   +   Q++   GESSFS +GPL  +  
Sbjct: 366  AASARESPQN---GESQRLGTRIVSRFEDPNAERLSGGQLQYADGESSFSSSGPLFGLTS 422

Query: 452  YSGHIPFXXXXXXXXXXXXXXXXSFAFPVLPNEWNSSPVRMAKAD 318
            +SG I +                SFAFP+L +EWNSSP RMAKAD
Sbjct: 423  HSGPIAYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPARMAKAD 467


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  102 bits (254), Expect = 4e-19
 Identities = 98/371 (26%), Positives = 151/371 (40%), Gaps = 39/371 (10%)
 Frame = -2

Query: 1313 DSIVYTDKNILESDLPELIVCYKDSVFHAVKDICVDEGMPAENKCL-----------TEN 1167
            D + Y DKN+   DLPE++VCYK++ +H VKDICVDEG+P + K L           TE+
Sbjct: 86   DPVFYMDKNVTACDLPEIVVCYKENTYHVVKDICVDEGVPVQEKFLFGEKDSVKSSSTED 145

Query: 1166 VVGGLST--LPSNNYKHRDVTEEAD-----ADISYEDDFKSSSYKDFRE----------- 1041
            +     T   PS +    D   + D      +   + D + SS +DF +           
Sbjct: 146  LTKADKTNVNPSESKSAEDSNTKVDDSEFCNNCKTDRDVEESSREDFADAEGSSAYNQEH 205

Query: 1040 --------NGAAHGVNNGEVNLDKDTAYDSDSVELESSPESDKYQNSASIRFPKTGEEEC 885
                       +HG+N  E+  D+++   +D V + S  +S +      I   +  ++  
Sbjct: 206  LIVTEEAKASPSHGLNPSEIEPDENS---NDEVAISSETDSKESLTLGDILSREDEQKSL 262

Query: 884  NAVEKITEIVCDSTLLGKDLHSETSLESLLKSARCGENNSSQQSDQISVARAQSPVVTEE 705
            N              +  D H E S   L    +     ++ +++   + + + P   EE
Sbjct: 263  NHGN-----------ISSDSHEEQSPSQLQDKEKRSLETAAIETE---LEKTEEPKPVEE 308

Query: 704  STTSNSDNLLDSGVISLNVDSSKLAPTARDEIHGNATGPQLKSEKKLIHDDGISDNRLVI 525
               S S   L     + N D  K  P   +    N+        +    DD +S +R   
Sbjct: 309  KLPSASTTTLQEPNKTCN-DPEK--PETENHHQQNSL------VENSYEDDKLSSSRF-- 357

Query: 524  NNQIKRDQGESSFSVAG--PLPDVVPYSGHIPFXXXXXXXXXXXXXXXXSFAFPVLPNEW 351
                    GE+SFS A    +   + YSG I +                SFAFP+L +EW
Sbjct: 358  --------GETSFSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEW 409

Query: 350  NSSPVRMAKAD 318
            NSSPVRMAKAD
Sbjct: 410  NSSPVRMAKAD 420


Top