BLASTX nr result

ID: Angelica23_contig00002959 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00002959
         (2379 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arab...   267   1e-68
ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arab...   261   5e-67
ref|NP_192197.2| uncharacterized protein [Arabidopsis thaliana] ...   255   4e-65
ref|NP_001190665.1| uncharacterized protein [Arabidopsis thalian...   253   2e-64
ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|2...   246   3e-62

>ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arabidopsis lyrata subsp.
            lyrata] gi|297335293|gb|EFH65710.1| hypothetical protein
            ARALYDRAFT_470307 [Arabidopsis lyrata subsp. lyrata]
          Length = 573

 Score =  267 bits (682), Expect = 1e-68
 Identities = 214/643 (33%), Positives = 315/643 (48%), Gaps = 9/643 (1%)
 Frame = -1

Query: 2262 MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 2083
            MGF SVY+ L E+FPQ+D+RILRAVAIE+ KDAD A  VVLSE++PSFS       T  +
Sbjct: 1    MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNLSHNLTQSS 60

Query: 2082 EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDIPYANF 1903
               S   S  D EV    +   S    +     + AS   PS     S  S   +P    
Sbjct: 61   NKSS--GSISDREVERGLEDVVSRCRPF-----LGASGSKPSTSSSCSSSSSETLPLVVV 113

Query: 1902 ISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLGNRLI 1723
                 + ++ ++     ++       N+ L   H+   +EE       +  + K   +  
Sbjct: 114  RDHNTRALSTDLVSN--MNEPTNLQPNVGLDVCHKDLESEE-------VQSLKKARGKEH 164

Query: 1722 NDTDWFTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXVQIISPNNSEGNSQLLTXXX 1543
             + D+F     + +  + L  LV ++              V  IS +N +  S       
Sbjct: 165  GNYDFFGRCFDV-KSNAKLGLLVPED---------DIASVVSAISLDNIKLTSDFWEDLC 214

Query: 1542 XXXXXXXXXXXXXPSLTASGGD-IGSSEMADCQNVD--MVNIVSDISVKDLEKHDPSGVN 1372
                           + ++ GD   +++   C  VD    N+V + S            N
Sbjct: 215  FGMTWNQAENAVSKLVDSTPGDTTTTTQQGSCFEVDSGSTNLVDETS------------N 262

Query: 1371 SLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSFINE 1192
                S+ GD E   +      S S  + SV+ LE IIE+ KS K  L +  +++ + + E
Sbjct: 263  RSLVSENGDTEIGDTF-----STSTHVCSVDHLEEIIEDAKSNKKTLLTEMETVTNLMRE 317

Query: 1191 VEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTEMRE 1012
            VE++EK  EK++EEAA GGLDTL KVE+LKKML+ AKEANDMHAGEVYGEK+IL+TE++E
Sbjct: 318  VELQEKDAEKSKEEAARGGLDTLQKVEELKKMLEHAKEANDMHAGEVYGEKSILATEVKE 377

Query: 1011 LQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXAHEES 832
            L++RL +LS+ER K+L ILDE+  +LE+RL +                         +E+
Sbjct: 378  LENRLLNLSEERNKSLTILDEMRGSLEIRLATALEMKKTAEQEKKNKEDSALQALVEQEA 437

Query: 831  RMQKVVEESKILEQQAEENAKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQGLPX 652
             M+KVV+ESK+L+Q+AEEN+KLR+FL+DRG+ VD LQGEISVICQDVK +K KF+     
Sbjct: 438  NMEKVVQESKLLQQEAEENSKLREFLMDRGQIVDSLQGEISVICQDVKLLKEKFE----- 492

Query: 651  XXXXXXXXXXXXXXXXXXXXLDASFEKLPETVVVASPTFSS------SSLLGASFEKLPE 490
                                      ++  T +++S   SS      S +LG   E+L  
Sbjct: 493  -------------------------NRVQLTNLISSSLTSSCGSSMRSLVLGNPSERLNG 527

Query: 489  PEVVASPRNGKAALNSTDDELLYRNEILGTGMELVDDGWELFD 361
                +S +N   A  S      + N+      +L++DGW++FD
Sbjct: 528  VPETSSNKNFPEAAAS------FMNKEKDDCRDLLEDGWDIFD 564


>ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arabidopsis lyrata subsp.
            lyrata] gi|297320727|gb|EFH51149.1| hypothetical protein
            ARALYDRAFT_490272 [Arabidopsis lyrata subsp. lyrata]
          Length = 559

 Score =  261 bits (668), Expect = 5e-67
 Identities = 201/640 (31%), Positives = 310/640 (48%), Gaps = 6/640 (0%)
 Frame = -1

Query: 2262 MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 2083
            MG+ +VY+ L E+FPQ+D+R+L+AVAIE+ KDA+ A  VV+SE++P F       +T P 
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDANEAAAVVVSEIVPFFYPNLADNSTQPE 60

Query: 2082 EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDI----- 1918
                           N   G     V   +   V++  +  S     SI    D      
Sbjct: 61   ---------------NRTPGNVPNKVERAMQNGVLSGSETGSSSSSGSIPLAVDCDHESR 105

Query: 1917 -PYANFISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTK 1741
             P    ISS  ++    V   V +D        I    K  +S +EE  +      +  +
Sbjct: 106  APITESISSRNQLT--HVMPNVDLD--------IQSNAKIGLSGSEESGVVSSENPVSFQ 155

Query: 1740 LGNRLINDTDWFTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXVQIISPNNSEGNSQ 1561
             G +           S+ H  Q     +   N    +          +++ P ++   +Q
Sbjct: 156  AGAK-----------STSHGCQGVGFHITGSNQAEASTSSESEDAVHKLVYPADNSAMTQ 204

Query: 1560 LLTXXXXXXXXXXXXXXXXPSLTASGGDIGSSEMADCQNVDMVNIVSDISVKDLEKHDPS 1381
                                + T+SG     +  A+    ++V++ S  S+  +E  DP 
Sbjct: 205  -----KSPPLQIRFGSIDIVNETSSGSLAVENSDAELSGSNLVDVTSKGSLA-VENGDPE 258

Query: 1380 GVNSLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSF 1201
             V + F+S              V SRS Q  ++  LE IIE+ KS K  L +  +S+++ 
Sbjct: 259  LVGA-FSS--------------VVSRSTQGCNIVHLEQIIEDAKSNKKTLFTVMESIMNL 303

Query: 1200 INEVEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTE 1021
            + EVE++EK  EKA+E+A+ GG DTL KVE+LKKML+ AKEANDM AGEVYGE++IL+TE
Sbjct: 304  MREVELQEKDAEKAKEDASRGGFDTLDKVEELKKMLEHAKEANDMDAGEVYGERSILTTE 363

Query: 1020 MRELQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXAH 841
            + EL++RL +LS+ER K+L++LDE+ + LE+RL +                       A 
Sbjct: 364  VNELENRLLNLSEERDKSLSVLDEMREVLEIRLAAALEIKNAAEQEKQEKEGSARMAFAE 423

Query: 840  EESRMQKVVEESKILEQQAEENAKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQG 661
            +E+ M+KVV+ESK+L+Q+AEEN+KLR+FL+D GR VD LQGEISVICQD++ +K KFD  
Sbjct: 424  QEAIMEKVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNR 483

Query: 660  LPXXXXXXXXXXXXXXXXXXXXXLDASFEKLPETVVVASPTFSSSSLLGASFEKLPEPEV 481
            +P                            L +++  +  +   +S   +    L E  +
Sbjct: 484  VP----------------------------LSQSITSSQTSCKLASSASSMKSLLLEKPL 515

Query: 480  VASPRNGKAALNSTDDELLYRNEILGTGMELVDDGWELFD 361
             AS    +A+ N+T  + L  NE      EL+++GW+ FD
Sbjct: 516  EASYETAEASSNNTSPKALV-NEGKDDRKELLEEGWDFFD 554


>ref|NP_192197.2| uncharacterized protein [Arabidopsis thaliana]
            gi|18377666|gb|AAL66983.1| unknown protein [Arabidopsis
            thaliana] gi|20465977|gb|AAM20210.1| unknown protein
            [Arabidopsis thaliana] gi|332656842|gb|AEE82242.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 552

 Score =  255 bits (651), Expect = 4e-65
 Identities = 203/647 (31%), Positives = 314/647 (48%), Gaps = 13/647 (2%)
 Frame = -1

Query: 2262 MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 2083
            MG+ +VY+ L E+FPQ+D+R+L+AVAIE+ KD + A  VV+SE++P F       +T P 
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFFYPNLADSSTQPE 60

Query: 2082 EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESI----FSEFDIP 1915
                           N   G   T V   +   V++  ++     G +     + E   P
Sbjct: 61   ---------------NKTPGNVPTEVERDMPFSVLSGSEMGGSYSGSASMAFEYHETRAP 105

Query: 1914 YANFISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLG 1735
                +S   ++    V   V VD        I   GK  +S ++E  +      +  + G
Sbjct: 106  VTESVSKRNQLT--HVMPNVVVD--------IQRKGKIGLSGSDESGVVSSEPPVSCQAG 155

Query: 1734 NRLINDTDW----FTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXVQIISPNNSEGN 1567
             +   D DW    F ++ +  E  +S D                     +++ P ++   
Sbjct: 156  AKSTGD-DWQGVEFHSTGNQAEASTSADS---------------EDAVHKLVYPADNLAI 199

Query: 1566 SQLLTXXXXXXXXXXXXXXXXPSLTASGGDIGSSEMADCQNVDMVNIVSDISVKDLEKHD 1387
            +Q                    + T+SG     +  A+    ++V+ +S  S+ D     
Sbjct: 200  TQ-----NSHPLQIRFGSIDVVNETSSGSLAVENSDAELSGSNLVDEISKGSLAD----- 249

Query: 1386 PSGVNSLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLL 1207
                      + GD E + ++ + V +RS Q  ++  LE IIE+ KS K  L +  +S++
Sbjct: 250  ----------ENGDPELDGAVSS-VGNRSTQGCNMVHLEQIIEDAKSNKRTLFTVMESIM 298

Query: 1206 SFINEVEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILS 1027
            + + EVE++EK  EKA+E+A+ GG DTL KVE+LKKML+ AKEANDM AGEVYGE++IL+
Sbjct: 299  NLMREVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEHAKEANDMAAGEVYGERSILT 358

Query: 1026 TEMRELQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXX 847
            TE+ EL++RL SLS+ER  +L++LDE+   LE+RL +                       
Sbjct: 359  TEVNELENRLISLSEERDNSLSVLDEMRVDLEIRLATALGIKNAAEQEKQEKEGSARKAF 418

Query: 846  AHEESRMQKVVEESKILEQQAEENAKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFD 667
            A +E+ M++VV+ESK+L+Q+AEEN+KLR+FL+D GR VD LQGEISVICQD++ +K KFD
Sbjct: 419  AEQEAIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFD 478

Query: 666  QGLPXXXXXXXXXXXXXXXXXXXXXLDASFEKLPETVVVASPTFSSSSL-----LGASFE 502
              +P                     L  S      +  +AS   S  SL     L AS+E
Sbjct: 479  NRVP---------------------LSQSISSSQTSCKLASSASSMKSLLTEKPLEASYE 517

Query: 501  KLPEPEVVASPRNGKAALNSTDDELLYRNEILGTGMELVDDGWELFD 361
                PE  ++ ++ KA++N                 EL+DDGW+ FD
Sbjct: 518  ---TPEASSNNKSPKASVNER--------------KELLDDGWDFFD 547


>ref|NP_001190665.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332656843|gb|AEE82243.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 556

 Score =  253 bits (646), Expect = 2e-64
 Identities = 204/643 (31%), Positives = 315/643 (48%), Gaps = 9/643 (1%)
 Frame = -1

Query: 2262 MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 2083
            MG+ +VY+ L E+FPQ+D+R+L+AVAIE+ KD + A  VV+SE++P F       +T P 
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFFYPNLADSSTQP- 59

Query: 2082 EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDIPYANF 1903
            E  +  N   +VE +   D  +S      +      S  +         + E   P    
Sbjct: 60   ENKTPGNVPTEVENAVERDMPFSVLSGSEMGGSYSGSASMAFE------YHETRAPVTES 113

Query: 1902 ISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLGNRLI 1723
            +S   ++    V   V VD        I   GK  +S ++E  +      +  + G +  
Sbjct: 114  VSKRNQLT--HVMPNVVVD--------IQRKGKIGLSGSDESGVVSSEPPVSCQAGAKST 163

Query: 1722 NDTDW----FTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXVQIISPNNSEGNSQLL 1555
             D DW    F ++ +  E  +S D                     +++ P ++   +Q  
Sbjct: 164  GD-DWQGVEFHSTGNQAEASTSADS---------------EDAVHKLVYPADNLAITQ-- 205

Query: 1554 TXXXXXXXXXXXXXXXXPSLTASGGDIGSSEMADCQNVDMVNIVSDISVKDLEKHDPSGV 1375
                              + T+SG     +  A+    ++V+ +S  S+ D         
Sbjct: 206  ---NSHPLQIRFGSIDVVNETSSGSLAVENSDAELSGSNLVDEISKGSLAD--------- 253

Query: 1374 NSLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSFIN 1195
                  + GD E + ++ + V +RS Q  ++  LE IIE+ KS K  L +  +S+++ + 
Sbjct: 254  ------ENGDPELDGAVSS-VGNRSTQGCNMVHLEQIIEDAKSNKRTLFTVMESIMNLMR 306

Query: 1194 EVEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTEMR 1015
            EVE++EK  EKA+E+A+ GG DTL KVE+LKKML+ AKEANDM AGEVYGE++IL+TE+ 
Sbjct: 307  EVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEHAKEANDMAAGEVYGERSILTTEVN 366

Query: 1014 ELQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXAHEE 835
            EL++RL SLS+ER  +L++LDE+   LE+RL +                       A +E
Sbjct: 367  ELENRLISLSEERDNSLSVLDEMRVDLEIRLATALGIKNAAEQEKQEKEGSARKAFAEQE 426

Query: 834  SRMQKVVEESKILEQQAEENAKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQGLP 655
            + M++VV+ESK+L+Q+AEEN+KLR+FL+D GR VD LQGEISVICQD++ +K KFD  +P
Sbjct: 427  AIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNRVP 486

Query: 654  XXXXXXXXXXXXXXXXXXXXXLDASFEKLPETVVVASPTFSSSSL-----LGASFEKLPE 490
                                 L  S      +  +AS   S  SL     L AS+E    
Sbjct: 487  ---------------------LSQSISSSQTSCKLASSASSMKSLLTEKPLEASYE---T 522

Query: 489  PEVVASPRNGKAALNSTDDELLYRNEILGTGMELVDDGWELFD 361
            PE  ++ ++ KA++N                 EL+DDGW+ FD
Sbjct: 523  PEASSNNKSPKASVNER--------------KELLDDGWDFFD 551


>ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|222861833|gb|EEE99375.1|
            predicted protein [Populus trichocarpa]
          Length = 549

 Score =  246 bits (627), Expect = 3e-62
 Identities = 190/537 (35%), Positives = 280/537 (52%), Gaps = 1/537 (0%)
 Frame = -1

Query: 2262 MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 2083
            MGF++VYK L +VFPQVD+RIL+AVAIE+SKDAD     + +EV+ S  + S    +  +
Sbjct: 1    MGFSTVYKCLTDVFPQVDARILKAVAIEHSKDAD-----IAAEVVLSEVIPS---LSRHS 52

Query: 2082 EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDIPYANF 1903
              PS P    D   S   DG        GL    ++  +                   + 
Sbjct: 53   AAPSPPCE--DTSPSLPLDGQTEQEEETGLRHRQVSLVK-------------------SV 91

Query: 1902 ISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLGNRLI 1723
             SS P ++A E   +  + +    GD+      HQ +  ++  + P   +  T   N+L 
Sbjct: 92   RSSEPGLIAEEDDGKTELTSGVNDGDST-----HQENRQDQPIVVPSGANADT---NQLQ 143

Query: 1722 NDTDWFTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXVQIISPNNSEGNSQLLTXXX 1543
               +      +  EE++ L       HR V+           +I+  + +G ++L     
Sbjct: 144  GHIE------TEQEEETGL------RHRQVSLVKSVRSSEPGLIAEED-DGKTELTGGVN 190

Query: 1542 XXXXXXXXXXXXXPSLTASGGDIGSSEM-ADCQNVDMVNIVSDISVKDLEKHDPSGVNSL 1366
                         P +  SG +  ++++    ++ +++ +      + + +   S    L
Sbjct: 191  DGDSTHQEIRQDQPVVVPSGANADTNQLQGHIESDELILLGKPQHQEGISQPGSSQTLIL 250

Query: 1365 FASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSFINEVE 1186
             ++D+       +M       S Q   +ELLE I+E  K  K  L SA +S+++ + EVE
Sbjct: 251  VSNDLLLGVNAENMN------SKQYRQIELLEEIVEAAKDNKKTLFSAMESVMNMMKEVE 304

Query: 1185 MKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTEMRELQ 1006
            ++E S E+A+EEAA GGLD L +VE LK+ML  AKEANDMHAGEVYGEKAIL+TE+RELQ
Sbjct: 305  LQEISAEQAKEEAARGGLDILVEVEKLKQMLVHAKEANDMHAGEVYGEKAILATEVRELQ 364

Query: 1005 SRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXAHEESRM 826
            +RL SLSDER  ALAILDE+ Q LE RL +                       A +E  M
Sbjct: 365  ARLLSLSDERDNALAILDEMRQTLESRLAAAEELRKTAELEKLEKEETARNALAEQEIIM 424

Query: 825  QKVVEESKILEQQAEENAKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQGLP 655
            +KVV+ESKIL+++AEENAKL++FL+DRG  VD LQGEISVICQDV+ +K +FD+ +P
Sbjct: 425  EKVVQESKILQKEAEENAKLQEFLMDRGCVVDTLQGEISVICQDVRLLKERFDERVP 481

