BLASTX nr result

ID: Angelica22_contig00003147

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00003147
         (1806 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002276508.1| PREDICTED: uncharacterized protein LOC100264...   284   5e-74
ref|NP_171828.1| uncharacterized protein [Arabidopsis thaliana] ...   266   2e-68
ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arab...   260   1e-66
ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|2...   254   4e-65
ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arab...   244   4e-62

>ref|XP_002276508.1| PREDICTED: uncharacterized protein LOC100264786 [Vitis vinifera]
            gi|296086718|emb|CBI32353.3| unnamed protein product
            [Vitis vinifera]
          Length = 667

 Score =  284 bits (727), Expect = 5e-74
 Identities = 216/596 (36%), Positives = 312/596 (52%), Gaps = 63/596 (10%)
 Frame = +1

Query: 148  MGFTAVFMALLEVFPQVDPRILRAVAIENSKDADVAVSVVLAEVLPSLSAQSGIGSGGCS 327
            MGF AV+ AL +VFPQVD R+L+AVAIE+SKDAD AV  VL +VLP +S   G  SG C 
Sbjct: 1    MGFKAVYRALQDVFPQVDARLLKAVAIEHSKDADAAVEFVLHDVLPFMSQHPG-SSGSCY 59

Query: 328  FHQGLSLLNSYGDVQASDIAVGM-HSVGPNNGVSEEIDIDKQIPSTKPVESSRVEFGNFG 504
             +Q L   +S G V+  + ++   H        +  +D+  +  S      +  E     
Sbjct: 60   ENQLLEDSSS-GMVEGEEESIPTDHQHVVEEAKAANVDLSTKSGSVADENPNDDE----A 114

Query: 505  LNSNVSGPDTTANESHNQVYVNGSGKSVSLLERH---STEAGPKDISRAMV--------- 648
            ++ + +     AN+ H++VY N   + +  LE+    S++ GP  IS  +          
Sbjct: 115  MDGSTALDFYDANDGHDEVYENTESEELIPLEQGQDISSKVGPGRISNVIAITPLHADDG 174

Query: 649  --------------DELFNNVLYDTDVSVISSYLDE--------EEQSNLDEL------- 741
                          D   +    D+ V+ I  +L +        E+ S +D L       
Sbjct: 175  CGDLELALEKYKTKDLTLSQDFGDSGVTSIIDFLFQITPKTLIHEDDSKVDGLNDPHADS 234

Query: 742  -------VRKNHRSATQVADVDPSPVQIISVNVPESSSKPLTEPVS----PVTKNGEPEA 888
                      N  S T  A +    + I S+N   + S     PV+     VT     EA
Sbjct: 235  KDLDLQDTSVNASSVTSNASMHGDGI-ISSLNDQHADSDSFNGPVACDFDTVTHKKGQEA 293

Query: 889  ----GVPLLMISGEDVTSSEMVDSENVDIINIVSDLSIKGLEKGEPNGIKDLYTSEVNDF 1056
                G+ + MI   D  + E +    +D I+ ++       EK E +   D    + + F
Sbjct: 294  SGLDGIQVEMIQVPDTDAPERLLQAEIDSISCITHC-----EKEESSVSFDHDAKQEDAF 348

Query: 1057 EDESC------MKATLTSRSGQMCTTKLLEDIIENEKTEKFALHSARDSLFSLISEVQLK 1218
            + E        +  T+ ++SG +C+T  LE++IE+ K  K  L S+ DS+ +++ EV+L+
Sbjct: 349  DIEMVGDVVEPVLNTIVTQSGHICSTDFLEEMIEDAKNNKKTLFSSMDSVMNIMREVELQ 408

Query: 1219 EKYVQKAREEAAQGGLDTFARAEDLTKMLQRAKEANDMYAGEVYGEKAILATETRELQSR 1398
            EK  Q+AREEAA+GGL+   R E+L +MLQ AKEAN M+AGEVYGEKAILATE RELQSR
Sbjct: 409  EKAAQQAREEAARGGLEILTRVEELKEMLQHAKEANGMHAGEVYGEKAILATEARELQSR 468

Query: 1399 LQSLSEERVKALAILNEIHQXXXXXXXXXXXXXXXXXXXXXKQEESALKYFAQEESNMEK 1578
            L SLS+ER K+L IL+E+                       ++EESA K  A++E+ MEK
Sbjct: 469  LLSLSDERDKSLKILDEMRHALEARLAAAEEDIKAAEQVKFEKEESARKALAEQEAIMEK 528

Query: 1579 VVQESKILEQQAAENSKLREFLMDCGRLVDILQGEVSVIFQDVEFVKSKFEQGVPL 1746
            VVQES +L+Q+A ENSKL+EFLMD G +VD+LQGE+SVI QDV+F+K KF+  VPL
Sbjct: 529  VVQESMMLKQEAEENSKLQEFLMDRGHIVDMLQGEISVICQDVKFLKVKFDDRVPL 584


>ref|NP_171828.1| uncharacterized protein [Arabidopsis thaliana]
            gi|334182264|ref|NP_001184898.1| uncharacterized protein
            [Arabidopsis thaliana] gi|3850585|gb|AAC72125.1| ESTs
            gb|H36966, gb|R65511, gb|T42324 and gb|T20569 come from
            this gene [Arabidopsis thaliana]
            gi|332189433|gb|AEE27554.1| uncharacterized protein
            [Arabidopsis thaliana] gi|332189434|gb|AEE27555.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 571

 Score =  266 bits (679), Expect = 2e-68
 Identities = 194/542 (35%), Positives = 289/542 (53%), Gaps = 9/542 (1%)
 Frame = +1

Query: 148  MGFTAVFMALLEVFPQVDPRILRAVAIENSKDADVAVSVVLAEVLPSLSAQSGIGSGGCS 327
            MGF +V+ +L E+FPQ+D RILRAVAIE+ KDAD A +VVL+E++PS S+          
Sbjct: 1    MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSNL-------- 52

Query: 328  FHQGL-SLLNSYGDVQASDIAVGMHSVGPNNGVSEEIDIDKQIPSTKPVESSRVEFGNFG 504
            FH    S   S G +   ++  G+  V                   +P            
Sbjct: 53   FHNFTQSSYKSSGSISEREVEHGLEDVASR---------------CRPF----------- 86

Query: 505  LNSNVSGPDTTANESHNQVYVNGSGKSVSLLERHSTEAGPKDISRAMVDELFNNVLYDTD 684
            L ++ S   T+++ S ++         + +   H+T A   D+   M +     +  + D
Sbjct: 87   LGASGSKASTSSSSSSSETL------PLVVTRDHNTRALSTDLVSNMNE--LTTLQPNVD 138

Query: 685  VSVISSYLDEEEQSNLDELVRK---NHRSATQVADVDPSPVQIISVNVPESSSKPLTEPV 855
              V    L+ EE  ++ +   K   N+    +  DV  +    I ++VPE     +    
Sbjct: 139  PDVCHKDLESEEIQSVKKARGKENGNYDLFGRCFDVTSNAK--IGLDVPEDDIASVVSLF 196

Query: 856  S----PVTKNGEPEAGVPLLMISGEDVTSSEMVDSENVDIINIVSDLSIKGLEKGEPNGI 1023
            S     +  N   + G  +     E+  S ++VDS   D +      S   +  G  N +
Sbjct: 197  SLDNVKLASNFWEDLGFDITWNQAENAVS-KLVDSTPGDTMTTTQQGSCFEVGHGSTNLV 255

Query: 1024 KDLYTSEVNDFEDESCMK-ATLTSRSGQMCTTKLLEDIIENEKTEKFALHSARDSLFSLI 1200
             +  TS  + F +    +     S S  +C+   LEDIIE+ K+ K  L +  +++ +++
Sbjct: 256  DE--TSNRSLFSENGDTEIGDAFSTSTHVCSVDQLEDIIEDAKSNKKNLLTEMETVTNIM 313

Query: 1201 SEVQLKEKYVQKAREEAAQGGLDTFARAEDLTKMLQRAKEANDMYAGEVYGEKAILATET 1380
             EV+LKEK  +K++EEAA+GGLDT  + E+L KML+ AKEANDM+AGEVYGEK+ILATE 
Sbjct: 314  REVELKEKDAEKSKEEAARGGLDTLQKVEELKKMLEHAKEANDMHAGEVYGEKSILATEV 373

Query: 1381 RELQSRLQSLSEERVKALAILNEIHQXXXXXXXXXXXXXXXXXXXXXKQEESALKYFAQE 1560
            +EL++RL +LSEER K+LAIL+E+                        +E+SALK  A++
Sbjct: 374  KELENRLLNLSEERNKSLAILDEMRGSLEIRLAAALELKKTAEKEKKDKEDSALKALAEQ 433

Query: 1561 ESNMEKVVQESKILEQQAAENSKLREFLMDCGRLVDILQGEVSVIFQDVEFVKSKFEQGV 1740
            E+NMEKVVQESK+L+Q+A ENSKLR+FLMD G++VD LQGE+SVI QDV+ +K KFE  V
Sbjct: 434  EANMEKVVQESKLLQQEAEENSKLRDFLMDRGQIVDTLQGEISVICQDVKLLKEKFENRV 493

Query: 1741 PL 1746
            PL
Sbjct: 494  PL 495


>ref|XP_002889451.1| hypothetical protein ARALYDRAFT_470307 [Arabidopsis lyrata subsp.
            lyrata] gi|297335293|gb|EFH65710.1| hypothetical protein
            ARALYDRAFT_470307 [Arabidopsis lyrata subsp. lyrata]
          Length = 573

 Score =  260 bits (664), Expect = 1e-66
 Identities = 186/538 (34%), Positives = 286/538 (53%), Gaps = 5/538 (0%)
 Frame = +1

Query: 148  MGFTAVFMALLEVFPQVDPRILRAVAIENSKDADVAVSVVLAEVLPSLSAQSGIGSGGCS 327
            MGF +V+ +L E+FPQ+D RILRAVAIE+ KDAD A +VVL+E++PS S+         S
Sbjct: 1    MGFGSVYRSLTEIFPQIDARILRAVAIEHPKDADEAAAVVLSEIIPSFSSN-------LS 53

Query: 328  FHQGLSLLNSYGDVQASDIAVGMHSVGPNNGVSEEIDIDKQIPSTKPVESSRVEFGNFGL 507
             +   S   S G +   ++  G+  V               +   +P            L
Sbjct: 54   HNLTQSSNKSSGSISDREVERGLEDV---------------VSRCRPF-----------L 87

Query: 508  NSNVSGPDTTANESHNQVYVNGSGKSVSLLERHSTEAGPKDISRAMVDELFNNVLYDTDV 687
             ++ S P T+++ S +    +     + ++  H+T A   D+   M +    N+  +  +
Sbjct: 88   GASGSKPSTSSSCSSS----SSETLPLVVVRDHNTRALSTDLVSNMNEP--TNLQPNVGL 141

Query: 688  SVISSYLDEEEQSNLDELVRKNHRSATQVADV-DPSPVQIISVNVPESSSKPLTEPVS-- 858
             V    L+ EE  +L +   K H +        D      + + VPE     +   +S  
Sbjct: 142  DVCHKDLESEEVQSLKKARGKEHGNYDFFGRCFDVKSNAKLGLLVPEDDIASVVSAISLD 201

Query: 859  --PVTKNGEPEAGVPLLMISGEDVTSSEMVDSENVDIINIVSDLSIKGLEKGEPNGIKDL 1032
               +T +   +    +     E+  S ++VDS   D        S   ++ G  N + + 
Sbjct: 202  NIKLTSDFWEDLCFGMTWNQAENAVS-KLVDSTPGDTTTTTQQGSCFEVDSGSTNLVDET 260

Query: 1033 YTSEVNDFEDESCMKATLTSRSGQMCTTKLLEDIIENEKTEKFALHSARDSLFSLISEVQ 1212
                +     ++ +  T  S S  +C+   LE+IIE+ K+ K  L +  +++ +L+ EV+
Sbjct: 261  SNRSLVSENGDTEIGDTF-STSTHVCSVDHLEEIIEDAKSNKKTLLTEMETVTNLMREVE 319

Query: 1213 LKEKYVQKAREEAAQGGLDTFARAEDLTKMLQRAKEANDMYAGEVYGEKAILATETRELQ 1392
            L+EK  +K++EEAA+GGLDT  + E+L KML+ AKEANDM+AGEVYGEK+ILATE +EL+
Sbjct: 320  LQEKDAEKSKEEAARGGLDTLQKVEELKKMLEHAKEANDMHAGEVYGEKSILATEVKELE 379

Query: 1393 SRLQSLSEERVKALAILNEIHQXXXXXXXXXXXXXXXXXXXXXKQEESALKYFAQEESNM 1572
            +RL +LSEER K+L IL+E+                        +E+SAL+   ++E+NM
Sbjct: 380  NRLLNLSEERNKSLTILDEMRGSLEIRLATALEMKKTAEQEKKNKEDSALQALVEQEANM 439

Query: 1573 EKVVQESKILEQQAAENSKLREFLMDCGRLVDILQGEVSVIFQDVEFVKSKFEQGVPL 1746
            EKVVQESK+L+Q+A ENSKLREFLMD G++VD LQGE+SVI QDV+ +K KFE  V L
Sbjct: 440  EKVVQESKLLQQEAEENSKLREFLMDRGQIVDSLQGEISVICQDVKLLKEKFENRVQL 497


>ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|222861833|gb|EEE99375.1|
            predicted protein [Populus trichocarpa]
          Length = 549

 Score =  254 bits (650), Expect = 4e-65
 Identities = 191/540 (35%), Positives = 275/540 (50%), Gaps = 7/540 (1%)
 Frame = +1

Query: 148  MGFTAVFMALLEVFPQVDPRILRAVAIENSKDADVAVSVVLAEVLPSLSAQSGIGSGGCS 327
            MGF+ V+  L +VFPQVD RIL+AVAIE+SKDAD+A  VVL+EV+PSLS  S   S  C 
Sbjct: 1    MGFSTVYKCLTDVFPQVDARILKAVAIEHSKDADIAAEVVLSEVIPSLSRHSAAPSPPCE 60

Query: 328  FHQGLSLLNSYGDVQASDIAVGMHSVGPNNGVSE-EIDIDKQIPSTKPVESSRVEFGNFG 504
                        D   S    G        G+   ++ + K + S++P   +  + G   
Sbjct: 61   ------------DTSPSLPLDGQTEQEEETGLRHRQVSLVKSVRSSEPGLIAEEDDGKTE 108

Query: 505  LNSNVSGPDTTANESHNQ---VYVNGSGKSVSLLERHSTEAGPKDISRAMVDELFNNVLY 675
            L S V+  D+T  E+      V  +G+    + L+ H                       
Sbjct: 109  LTSGVNDGDSTHQENRQDQPIVVPSGANADTNQLQGHIET-------------------- 148

Query: 676  DTDVSVISSYLDEEEQSNLDELVRKNHRSATQVADVDPSPVQIISVNVPESSSKPLTEPV 855
                       ++EE++ L       HR  + V  V  S   +I+    +     LT  V
Sbjct: 149  -----------EQEEETGL------RHRQVSLVKSVRSSEPGLIAEE--DDGKTELTGGV 189

Query: 856  SPV-TKNGEPEAGVPLLMISGEDVTSSEMVDSENVDIINIVSDLSIKGLEKGEPNGIKDL 1032
            +   + + E     P+++ SG +  ++++         +I SD  I   +     GI   
Sbjct: 190  NDGDSTHQEIRQDQPVVVPSGANADTNQLQG-------HIESDELILLGKPQHQEGISQP 242

Query: 1033 YTSEVNDFEDESCMKATLTSR--SGQMCTTKLLEDIIENEKTEKFALHSARDSLFSLISE 1206
             +S+         +         S Q    +LLE+I+E  K  K  L SA +S+ +++ E
Sbjct: 243  GSSQTLILVSNDLLLGVNAENMNSKQYRQIELLEEIVEAAKDNKKTLFSAMESVMNMMKE 302

Query: 1207 VQLKEKYVQKAREEAAQGGLDTFARAEDLTKMLQRAKEANDMYAGEVYGEKAILATETRE 1386
            V+L+E   ++A+EEAA+GGLD     E L +ML  AKEANDM+AGEVYGEKAILATE RE
Sbjct: 303  VELQEISAEQAKEEAARGGLDILVEVEKLKQMLVHAKEANDMHAGEVYGEKAILATEVRE 362

Query: 1387 LQSRLQSLSEERVKALAILNEIHQXXXXXXXXXXXXXXXXXXXXXKQEESALKYFAQEES 1566
            LQ+RL SLS+ER  ALAIL+E+ Q                     ++EE+A    A++E 
Sbjct: 363  LQARLLSLSDERDNALAILDEMRQTLESRLAAAEELRKTAELEKLEKEETARNALAEQEI 422

Query: 1567 NMEKVVQESKILEQQAAENSKLREFLMDCGRLVDILQGEVSVIFQDVEFVKSKFEQGVPL 1746
             MEKVVQESKIL+++A EN+KL+EFLMD G +VD LQGE+SVI QDV  +K +F++ VPL
Sbjct: 423  IMEKVVQESKILQKEAEENAKLQEFLMDRGCVVDTLQGEISVICQDVRLLKERFDERVPL 482


>ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arabidopsis lyrata subsp.
            lyrata] gi|297320727|gb|EFH51149.1| hypothetical protein
            ARALYDRAFT_490272 [Arabidopsis lyrata subsp. lyrata]
          Length = 559

 Score =  244 bits (624), Expect = 4e-62
 Identities = 180/538 (33%), Positives = 276/538 (51%), Gaps = 5/538 (0%)
 Frame = +1

Query: 148  MGFTAVFMALLEVFPQVDPRILRAVAIENSKDADVAVSVVLAEVLPSLSAQSGIGSGGCS 327
            MG+ AV+ +L E+FPQ+D R+L+AVAIE+ KDA+ A +VV++E++P         S    
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDANEAAAVVVSEIVPFFYPNLADNSTQPE 60

Query: 328  FHQGLSLLNSYGDVQASDIAVGMHSVGPNNGVSEEIDIDKQIPSTKPVE---SSRVEFGN 498
                 ++ N       + +  G  +   ++  S  + +D    S  P+    SSR +  +
Sbjct: 61   NRTPGNVPNKVERAMQNGVLSGSETGSSSSSGSIPLAVDCDHESRAPITESISSRNQLTH 120

Query: 499  FGLNSNVSGPDTTANESHNQVYVNGSGKS--VSLLERHSTEAGPKDISRAMVDELFNNVL 672
               N ++        +S+ ++ ++GS +S  VS     S +AG K  S       F+   
Sbjct: 121  VMPNVDLD------IQSNAKIGLSGSEESGVVSSENPVSFQAGAKSTSHGCQGVGFH--- 171

Query: 673  YDTDVSVISSYLDEEEQSNLDELVRKNHRSATQVADVDPSPVQIISVNVPESSSKPLTEP 852
                    S+  +    S  ++ V K    A   A    SP              PL   
Sbjct: 172  -----ITGSNQAEASTSSESEDAVHKLVYPADNSAMTQKSP--------------PLQIR 212

Query: 853  VSPVTKNGEPEAGVPLLMISGEDVTSSEMVDSENVDIINIVSDLSIKGLEKGEPNGIKDL 1032
               +    E  +G   +  S  +++ S +VD         V+      +E G+P  +   
Sbjct: 213  FGSIDIVNETSSGSLAVENSDAELSGSNLVD---------VTSKGSLAVENGDPELVGAF 263

Query: 1033 YTSEVNDFEDESCMKATLTSRSGQMCTTKLLEDIIENEKTEKFALHSARDSLFSLISEVQ 1212
                           +++ SRS Q C    LE IIE+ K+ K  L +  +S+ +L+ EV+
Sbjct: 264  ---------------SSVVSRSTQGCNIVHLEQIIEDAKSNKKTLFTVMESIMNLMREVE 308

Query: 1213 LKEKYVQKAREEAAQGGLDTFARAEDLTKMLQRAKEANDMYAGEVYGEKAILATETRELQ 1392
            L+EK  +KA+E+A++GG DT  + E+L KML+ AKEANDM AGEVYGE++IL TE  EL+
Sbjct: 309  LQEKDAEKAKEDASRGGFDTLDKVEELKKMLEHAKEANDMDAGEVYGERSILTTEVNELE 368

Query: 1393 SRLQSLSEERVKALAILNEIHQXXXXXXXXXXXXXXXXXXXXXKQEESALKYFAQEESNM 1572
            +RL +LSEER K+L++L+E+ +                     ++E SA   FA++E+ M
Sbjct: 369  NRLLNLSEERDKSLSVLDEMREVLEIRLAAALEIKNAAEQEKQEKEGSARMAFAEQEAIM 428

Query: 1573 EKVVQESKILEQQAAENSKLREFLMDCGRLVDILQGEVSVIFQDVEFVKSKFEQGVPL 1746
            EKVVQESK+L+Q+A ENSKLREFLMD GR+VD LQGE+SVI QD+  +K KF+  VPL
Sbjct: 429  EKVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNRVPL 486

