BLASTX nr result

ID: Angelica22_contig00004747

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00004747
         (2002 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arab...   264   8e-68
ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arab...   253   2e-64
ref|NP_192197.2| uncharacterized protein [Arabidopsis thaliana] ...   246   2e-62
ref|NP_001190665.1| uncharacterized protein [Arabidopsis thalian...   244   8e-62
ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|2...   243   2e-61

>ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arabidopsis lyrata subsp.
            lyrata] gi|297335293|gb|EFH65710.1| hypothetical protein
            ARALYDRAFT_470307 [Arabidopsis lyrata subsp. lyrata]
          Length = 573

 Score =  264 bits (674), Expect = 8e-68
 Identities = 193/535 (36%), Positives = 277/535 (51%), Gaps = 3/535 (0%)
 Frame = +2

Query: 233  MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 412
            MGF SVY+ L E+FPQ+D+RILRAVAIE+ KDAD A  VVLSE++PSFS       T  +
Sbjct: 1    MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNLSHNLTQSS 60

Query: 413  EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDIPYANF 592
               S   S  D EV    +   S    +     + AS   PS     S  S   +P    
Sbjct: 61   NKSS--GSISDREVERGLEDVVSRCRPF-----LGASGSKPSTSSSCSSSSSETLPLVVV 113

Query: 593  ISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLGNRLI 772
                 + ++ ++     ++       N+ L   H+   +EE       +  + K   +  
Sbjct: 114  RDHNTRALSTDLVSN--MNEPTNLQPNVGLDVCHKDLESEE-------VQSLKKARGKEH 164

Query: 773  NDTDWFTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXXQIISPNNSEGNSQLLTXXX 952
             + D+F     + +  + L  LV ++                 IS +N +  S       
Sbjct: 165  GNYDFFGRCFDV-KSNAKLGLLVPED---------DIASVVSAISLDNIKLTSDFWEDLC 214

Query: 953  XXXXXXXXXXXXXXSLTASGGD-IGSSEMADCQNVD--MVNIVSDISVKDLEKHDPSGVN 1123
                           + ++ GD   +++   C  VD    N+V + S            N
Sbjct: 215  FGMTWNQAENAVSKLVDSTPGDTTTTTQQGSCFEVDSGSTNLVDETS------------N 262

Query: 1124 SLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSFINE 1303
                S+ GD E   +      S S  + SV+ LE IIE+ KS K  L +  +++ + + E
Sbjct: 263  RSLVSENGDTEIGDTF-----STSTHVCSVDHLEEIIEDAKSNKKTLLTEMETVTNLMRE 317

Query: 1304 VEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTEMRE 1483
            VE++EK  EK++EEAA GGLDTL KVE+LKKML+ AKEANDMHAGEVYGEK+IL+TE++E
Sbjct: 318  VELQEKDAEKSKEEAARGGLDTLQKVEELKKMLEHAKEANDMHAGEVYGEKSILATEVKE 377

Query: 1484 LQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXXHEES 1663
            L++RL +LS+ER K+L ILDE+  +LE+RL +                         +E+
Sbjct: 378  LENRLLNLSEERNKSLTILDEMRGSLEIRLATALEMKKTAEQEKKNKEDSALQALVEQEA 437

Query: 1664 RMQKVFEESKILEQQAEENSKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFD 1828
             M+KV +ESK+L+Q+AEENSKLR+FL+DRG+ VD LQGEISVICQDVK +K KF+
Sbjct: 438  NMEKVVQESKLLQQEAEENSKLREFLMDRGQIVDSLQGEISVICQDVKLLKEKFE 492


>ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arabidopsis lyrata subsp.
            lyrata] gi|297320727|gb|EFH51149.1| hypothetical protein
            ARALYDRAFT_490272 [Arabidopsis lyrata subsp. lyrata]
          Length = 559

 Score =  253 bits (645), Expect = 2e-64
 Identities = 182/542 (33%), Positives = 274/542 (50%), Gaps = 6/542 (1%)
 Frame = +2

Query: 233  MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 412
            MG+ +VY+ L E+FPQ+D+R+L+AVAIE+ KDA+ A  VV+SE++P F       +T P 
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDANEAAAVVVSEIVPFFYPNLADNSTQPE 60

Query: 413  EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDI----- 577
                           N   G     V   +   V++  +  S     SI    D      
Sbjct: 61   ---------------NRTPGNVPNKVERAMQNGVLSGSETGSSSSSGSIPLAVDCDHESR 105

Query: 578  -PYANFISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTK 754
             P    ISS  ++    V   V +D        I    K  +S +EE  +      +  +
Sbjct: 106  APITESISSRNQLT--HVMPNVDLD--------IQSNAKIGLSGSEESGVVSSENPVSFQ 155

Query: 755  LGNRLINDTDWFTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXXQIISPNNSEGNSQ 934
             G +           S+ H  Q     +   N    +          +++ P ++   +Q
Sbjct: 156  AGAK-----------STSHGCQGVGFHITGSNQAEASTSSESEDAVHKLVYPADNSAMTQ 204

Query: 935  LLTXXXXXXXXXXXXXXXXXSLTASGGDIGSSEMADCQNVDMVNIVSDISVKDLEKHDPS 1114
                                + T+SG     +  A+    ++V++ S  S+  +E  DP 
Sbjct: 205  -----KSPPLQIRFGSIDIVNETSSGSLAVENSDAELSGSNLVDVTSKGSLA-VENGDPE 258

Query: 1115 GVNSLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSF 1294
             V + F+S              V SRS Q  ++  LE IIE+ KS K  L +  +S+++ 
Sbjct: 259  LVGA-FSS--------------VVSRSTQGCNIVHLEQIIEDAKSNKKTLFTVMESIMNL 303

Query: 1295 INEVEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTE 1474
            + EVE++EK  EKA+E+A+ GG DTL KVE+LKKML+ AKEANDM AGEVYGE++IL+TE
Sbjct: 304  MREVELQEKDAEKAKEDASRGGFDTLDKVEELKKMLEHAKEANDMDAGEVYGERSILTTE 363

Query: 1475 MRELQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXXH 1654
            + EL++RL +LS+ER K+L++LDE+ + LE+RL +                         
Sbjct: 364  VNELENRLLNLSEERDKSLSVLDEMREVLEIRLAAALEIKNAAEQEKQEKEGSARMAFAE 423

Query: 1655 EESRMQKVFEESKILEQQAEENSKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQG 1834
            +E+ M+KV +ESK+L+Q+AEENSKLR+FL+D GR VD LQGEISVICQD++ +K KFD  
Sbjct: 424  QEAIMEKVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNR 483

Query: 1835 LP 1840
            +P
Sbjct: 484  VP 485


>ref|NP_192197.2| uncharacterized protein [Arabidopsis thaliana]
            gi|18377666|gb|AAL66983.1| unknown protein [Arabidopsis
            thaliana] gi|20465977|gb|AAM20210.1| unknown protein
            [Arabidopsis thaliana] gi|332656842|gb|AEE82242.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 552

 Score =  246 bits (627), Expect = 2e-62
 Identities = 178/544 (32%), Positives = 277/544 (50%), Gaps = 8/544 (1%)
 Frame = +2

Query: 233  MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 412
            MG+ +VY+ L E+FPQ+D+R+L+AVAIE+ KD + A  VV+SE++P F       +T P 
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFFYPNLADSSTQPE 60

Query: 413  EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESI----FSEFDIP 580
                           N   G   T V   +   V++  ++     G +     + E   P
Sbjct: 61   ---------------NKTPGNVPTEVERDMPFSVLSGSEMGGSYSGSASMAFEYHETRAP 105

Query: 581  YANFISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLG 760
                +S   ++    V   V VD        I   GK  +S ++E  +      +  + G
Sbjct: 106  VTESVSKRNQLT--HVMPNVVVD--------IQRKGKIGLSGSDESGVVSSEPPVSCQAG 155

Query: 761  NRLINDTDW----FTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXXQIISPNNSEGN 928
             +   D DW    F ++ +  E  +S D                     +++ P ++   
Sbjct: 156  AKSTGD-DWQGVEFHSTGNQAEASTSADS---------------EDAVHKLVYPADNLAI 199

Query: 929  SQLLTXXXXXXXXXXXXXXXXXSLTASGGDIGSSEMADCQNVDMVNIVSDISVKDLEKHD 1108
            +Q                    + T+SG     +  A+    ++V+ +S  S+ D     
Sbjct: 200  TQ-----NSHPLQIRFGSIDVVNETSSGSLAVENSDAELSGSNLVDEISKGSLAD----- 249

Query: 1109 PSGVNSLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLL 1288
                      + GD E + ++ + V +RS Q  ++  LE IIE+ KS K  L +  +S++
Sbjct: 250  ----------ENGDPELDGAVSS-VGNRSTQGCNMVHLEQIIEDAKSNKRTLFTVMESIM 298

Query: 1289 SFINEVEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILS 1468
            + + EVE++EK  EKA+E+A+ GG DTL KVE+LKKML+ AKEANDM AGEVYGE++IL+
Sbjct: 299  NLMREVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEHAKEANDMAAGEVYGERSILT 358

Query: 1469 TEMRELQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXX 1648
            TE+ EL++RL SLS+ER  +L++LDE+   LE+RL +                       
Sbjct: 359  TEVNELENRLISLSEERDNSLSVLDEMRVDLEIRLATALGIKNAAEQEKQEKEGSARKAF 418

Query: 1649 XHEESRMQKVFEESKILEQQAEENSKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFD 1828
              +E+ M++V +ESK+L+Q+AEENSKLR+FL+D GR VD LQGEISVICQD++ +K KFD
Sbjct: 419  AEQEAIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFD 478

Query: 1829 QGLP 1840
              +P
Sbjct: 479  NRVP 482


>ref|NP_001190665.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332656843|gb|AEE82243.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 556

 Score =  244 bits (622), Expect = 8e-62
 Identities = 179/540 (33%), Positives = 278/540 (51%), Gaps = 4/540 (0%)
 Frame = +2

Query: 233  MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 412
            MG+ +VY+ L E+FPQ+D+R+L+AVAIE+ KD + A  VV+SE++P F       +T P 
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFFYPNLADSSTQP- 59

Query: 413  EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDIPYANF 592
            E  +  N   +VE +   D  +S      +      S  +         + E   P    
Sbjct: 60   ENKTPGNVPTEVENAVERDMPFSVLSGSEMGGSYSGSASMAFE------YHETRAPVTES 113

Query: 593  ISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLGNRLI 772
            +S   ++    V   V VD        I   GK  +S ++E  +      +  + G +  
Sbjct: 114  VSKRNQLT--HVMPNVVVD--------IQRKGKIGLSGSDESGVVSSEPPVSCQAGAKST 163

Query: 773  NDTDW----FTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXXQIISPNNSEGNSQLL 940
             D DW    F ++ +  E  +S D                     +++ P ++   +Q  
Sbjct: 164  GD-DWQGVEFHSTGNQAEASTSADS---------------EDAVHKLVYPADNLAITQ-- 205

Query: 941  TXXXXXXXXXXXXXXXXXSLTASGGDIGSSEMADCQNVDMVNIVSDISVKDLEKHDPSGV 1120
                              + T+SG     +  A+    ++V+ +S  S+ D         
Sbjct: 206  ---NSHPLQIRFGSIDVVNETSSGSLAVENSDAELSGSNLVDEISKGSLAD--------- 253

Query: 1121 NSLFASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSFIN 1300
                  + GD E + ++ + V +RS Q  ++  LE IIE+ KS K  L +  +S+++ + 
Sbjct: 254  ------ENGDPELDGAVSS-VGNRSTQGCNMVHLEQIIEDAKSNKRTLFTVMESIMNLMR 306

Query: 1301 EVEMKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTEMR 1480
            EVE++EK  EKA+E+A+ GG DTL KVE+LKKML+ AKEANDM AGEVYGE++IL+TE+ 
Sbjct: 307  EVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEHAKEANDMAAGEVYGERSILTTEVN 366

Query: 1481 ELQSRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXXHEE 1660
            EL++RL SLS+ER  +L++LDE+   LE+RL +                         +E
Sbjct: 367  ELENRLISLSEERDNSLSVLDEMRVDLEIRLATALGIKNAAEQEKQEKEGSARKAFAEQE 426

Query: 1661 SRMQKVFEESKILEQQAEENSKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQGLP 1840
            + M++V +ESK+L+Q+AEENSKLR+FL+D GR VD LQGEISVICQD++ +K KFD  +P
Sbjct: 427  AIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNRVP 486


>ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|222861833|gb|EEE99375.1|
            predicted protein [Populus trichocarpa]
          Length = 549

 Score =  243 bits (619), Expect = 2e-61
 Identities = 186/537 (34%), Positives = 277/537 (51%), Gaps = 1/537 (0%)
 Frame = +2

Query: 233  MGFNSVYKVLQEVFPQVDSRILRAVAIENSKDADVAVGVVLSEVMPSFSVKSGKVATSPN 412
            MGF++VYK L +VFPQVD+RIL+AVAIE+SKDAD     + +EV+ S  + S    +  +
Sbjct: 1    MGFSTVYKCLTDVFPQVDARILKAVAIEHSKDAD-----IAAEVVLSEVIPS---LSRHS 52

Query: 413  EGPSLPNSYGDVEVSNTADGTYSTAVNYGLSEEVIASQQIPSRKPGESIFSEFDIPYANF 592
              PS P    D   S   DG        GL    ++  +                   + 
Sbjct: 53   AAPSPPCE--DTSPSLPLDGQTEQEEETGLRHRQVSLVK-------------------SV 91

Query: 593  ISSGPKIVANEVQDQVFVDAVDEFGDNISLPGKHQVSSTEEGPIFPRSLHMMTKLGNRLI 772
             SS P ++A E   +  + +    GD+      HQ +  ++  + P   +  T   N+L 
Sbjct: 92   RSSEPGLIAEEDDGKTELTSGVNDGDST-----HQENRQDQPIVVPSGANADT---NQLQ 143

Query: 773  NDTDWFTNSSSLHEEQSSLDELVRKNHRSVTXXXXXXXXXXQIISPNNSEGNSQLLTXXX 952
               +      +  EE++ L       HR V+           +I+  + +G ++L     
Sbjct: 144  GHIE------TEQEEETGL------RHRQVSLVKSVRSSEPGLIAEED-DGKTELTGGVN 190

Query: 953  XXXXXXXXXXXXXXSLTASGGDIGSSEM-ADCQNVDMVNIVSDISVKDLEKHDPSGVNSL 1129
                           +  SG +  ++++    ++ +++ +      + + +   S    L
Sbjct: 191  DGDSTHQEIRQDQPVVVPSGANADTNQLQGHIESDELILLGKPQHQEGISQPGSSQTLIL 250

Query: 1130 FASDMGDFEEETSMKAIVTSRSGQMYSVELLEGIIENEKSEKIALRSAKDSLLSFINEVE 1309
             ++D+       +M       S Q   +ELLE I+E  K  K  L SA +S+++ + EVE
Sbjct: 251  VSNDLLLGVNAENMN------SKQYRQIELLEEIVEAAKDNKKTLFSAMESVMNMMKEVE 304

Query: 1310 MKEKSVEKAREEAAHGGLDTLAKVEDLKKMLQFAKEANDMHAGEVYGEKAILSTEMRELQ 1489
            ++E S E+A+EEAA GGLD L +VE LK+ML  AKEANDMHAGEVYGEKAIL+TE+RELQ
Sbjct: 305  LQEISAEQAKEEAARGGLDILVEVEKLKQMLVHAKEANDMHAGEVYGEKAILATEVRELQ 364

Query: 1490 SRLHSLSDERVKALAILDEIHQALEVRLVSXXXXXXXXXXXXXXXXXXXXXXXXHEESRM 1669
            +RL SLSDER  ALAILDE+ Q LE RL +                         +E  M
Sbjct: 365  ARLLSLSDERDNALAILDEMRQTLESRLAAAEELRKTAELEKLEKEETARNALAEQEIIM 424

Query: 1670 QKVFEESKILEQQAEENSKLRKFLIDRGRQVDILQGEISVICQDVKSMKTKFDQGLP 1840
            +KV +ESKIL+++AEEN+KL++FL+DRG  VD LQGEISVICQDV+ +K +FD+ +P
Sbjct: 425  EKVVQESKILQKEAEENAKLQEFLMDRGCVVDTLQGEISVICQDVRLLKERFDERVP 481

