BLASTX nr result

ID: Akebia24_contig00044928 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00044928
         (382 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007012730.1| Hydroxyproline-rich glycoprotein family prot...    80   2e-13
gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis]      78   1e-12
ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...    76   4e-12
emb|CBI35923.3| unnamed protein product [Vitis vinifera]               70   3e-10
ref|XP_006369280.1| hydroxyproline-rich glycoprotein [Populus tr...    70   3e-10
ref|XP_006342342.1| PREDICTED: pollen-specific leucine-rich repe...    69   5e-10
ref|XP_007024556.1| Hydroxyproline-rich glycoprotein family prot...    69   5e-10
ref|XP_004243732.1| PREDICTED: uncharacterized protein LOC101260...    66   4e-09
ref|XP_007215011.1| hypothetical protein PRUPE_ppa002494mg [Prun...    66   6e-09
gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]      64   2e-08
ref|XP_007203847.1| hypothetical protein PRUPE_ppa004367m1g, par...    64   2e-08
ref|XP_003597554.1| hypothetical protein MTR_2g099520 [Medicago ...    62   8e-08
ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutr...    59   5e-07
ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm...    59   7e-07
ref|XP_004487232.1| PREDICTED: uncharacterized protein LOC101499...    58   1e-06
ref|XP_003546756.1| PREDICTED: uncharacterized protein LOC100811...    58   1e-06
gb|ACU21434.1| unknown [Glycine max]                                   58   1e-06
ref|XP_002514089.1| conserved hypothetical protein [Ricinus comm...    57   2e-06

>ref|XP_007012730.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508783093|gb|EOY30349.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 610

 Score = 80.5 bits (197), Expect = 2e-13
 Identities = 48/109 (44%), Positives = 58/109 (53%), Gaps = 10/109 (9%)
 Frame = -3

Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANV----------DENTNV 150
           S+TSQ FRPN V+KSWDSLN+ LVLFAILC            N           D N N 
Sbjct: 54  SITSQIFRPNGVRKSWDSLNIFLVLFAILCGVFARRNDDDDNNSGSSGNNNVRNDNNNNK 113

Query: 149 SKEENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
           ++  +H  ++Q QWF Y  R IY            V +L+RSSSSYPDL
Sbjct: 114 NEASSHPVNSQ-QWFGYPGRKIYDDDPPMNASGTSVRRLKRSSSSYPDL 161


>gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis]
          Length = 530

 Score = 78.2 bits (191), Expect = 1e-12
 Identities = 48/126 (38%), Positives = 63/126 (50%), Gaps = 2/126 (1%)
 Frame = -3

Query: 374 NSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXX 195
           NSG +++                  S TSQ FRP+ VKKSWDSLNL+LVLFAI+C     
Sbjct: 39  NSGAVLIALIVTALAFIFVIIPSFLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVCGFLSR 98

Query: 194 XXXXXGANVDENTNVSKE--ENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSS 21
                 ++  ++  VS E  +    ST  QW+EYSDR            +  + +  RSS
Sbjct: 99  NSTENTSSNHDDQRVSNEGGQKSNPSTPHQWYEYSDR------TQSDSFNSRIYRRMRSS 152

Query: 20  SSYPDL 3
           SSYPDL
Sbjct: 153 SSYPDL 158


>ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera]
          Length = 555

 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 52/128 (40%), Positives = 61/128 (47%), Gaps = 3/128 (2%)
 Frame = -3

Query: 377 LNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXX 198
           LN  VLI++                 + TSQF RPN V+KSWDSLN+LLVLFAILC    
Sbjct: 30  LNPAVLIILLPILAMIVVFFAVPSFLNFTSQFLRPNSVRKSWDSLNVLLVLFAILCGVFA 89

Query: 197 XXXXXXGANVDENTNVSKE---ENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRR 27
                   +V EN   S         +S     FE+SDR IY              +LRR
Sbjct: 90  RKNDEKNDDVLENHGSSGSVVMGKSHESISHSLFEFSDRKIYDPPIQSGSV-----RLRR 144

Query: 26  SSSSYPDL 3
           SSSSYPDL
Sbjct: 145 SSSSYPDL 152


>emb|CBI35923.3| unnamed protein product [Vitis vinifera]
          Length = 628

 Score = 70.1 bits (170), Expect = 3e-10
 Identities = 46/128 (35%), Positives = 62/128 (48%), Gaps = 2/128 (1%)
 Frame = -3

Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201
           FL+SG LI+                  S TS  F+PN+VKKSWDSLNL+LVLFAI+C   
Sbjct: 69  FLSSGFLIIFLPLTALLFIVFVLPPILSFTSYIFKPNMVKKSWDSLNLVLVLFAIICGFL 128

Query: 200 XXXXXXXGANVDENTNVSKEENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLR--R 27
                   ++++ + +   EE+ Q+S     +E    G               G +R  R
Sbjct: 129 SRGGGGGSSDMESSVSEVPEESTQRSNHGHCYEERISG--------------YGGMRRMR 174

Query: 26  SSSSYPDL 3
           SSSSYPDL
Sbjct: 175 SSSSYPDL 182


>ref|XP_006369280.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|550347738|gb|ERP65849.1| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 560

 Score = 70.1 bits (170), Expect = 3e-10
 Identities = 50/142 (35%), Positives = 64/142 (45%), Gaps = 16/142 (11%)
 Frame = -3

Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201
           FLNSGV +V+                 SLTSQ  RP  +KKSWDSLNL+LVLFAI+C   
Sbjct: 32  FLNSGVFLVILLVVALAFVLVVVSSIGSLTSQILRPQSIKKSWDSLNLVLVLFAIVCGFL 91

Query: 200 XXXXXXXGANV-------DENT---------NVSKEENHQQSTQDQWFEYSDRGIYXXXX 69
                   +         +ENT         NV K  +   +   +WFE+ DR +     
Sbjct: 92  SSNNSSGSSGSGSGSGGDNENTSYYEDQSLSNVQKPSHPSSTPSHRWFEHQDRTV----- 146

Query: 68  XXXXXSKMVGKLRRSSSSYPDL 3
                   + +L RS SSYPDL
Sbjct: 147 ----SYNTLNRL-RSFSSYPDL 163


>ref|XP_006342342.1| PREDICTED: pollen-specific leucine-rich repeat extensin-like
           protein 1-like [Solanum tuberosum]
          Length = 642

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 46/115 (40%), Positives = 55/115 (47%), Gaps = 18/115 (15%)
 Frame = -3

Query: 293 TSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEEN------- 135
           T+Q  RPN VKK WDS N+LLV+FAILC           A V+ N NVS  E+       
Sbjct: 60  TTQILRPNSVKKGWDSFNILLVVFAILCGIFARKNDDNSA-VERNRNVSTTESSNFNDGS 118

Query: 134 -----------HQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
                       +  + D+WFE SD   Y            V +LRRSSSSYPDL
Sbjct: 119 ASADVDVDHDMRRPVSNDRWFEASDEKTY----HFGVPETSVNRLRRSSSSYPDL 169


>ref|XP_007024556.1| Hydroxyproline-rich glycoprotein family protein, putative
           [Theobroma cacao] gi|508779922|gb|EOY27178.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative [Theobroma cacao]
          Length = 553

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 50/144 (34%), Positives = 65/144 (45%), Gaps = 18/144 (12%)
 Frame = -3

Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201
           F N+G+LI++                 S TSQ F+P+LVKKSWDSLNL+LVLFAI+C   
Sbjct: 30  FFNTGILIILLLVVALAFIFVIIPSFLSFTSQIFKPHLVKKSWDSLNLVLVLFAIIC--- 86

Query: 200 XXXXXXXGANVDENTNVSKEE---------------NHQQSTQDQWFEY---SDRGIYXX 75
                    N D +T  + E+                   ST  QW++Y   SDR  Y  
Sbjct: 87  -GFLGKNNGNNDSDTRSTYEDYKFSTTPKHDRDHVGRSNPSTPRQWYDYSSSSDRTAYNS 145

Query: 74  XXXXXXXSKMVGKLRRSSSSYPDL 3
                       +  RSS+SYPDL
Sbjct: 146 L-----------QRLRSSNSYPDL 158


>ref|XP_004243732.1| PREDICTED: uncharacterized protein LOC101260449 [Solanum
           lycopersicum]
          Length = 608

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 45/106 (42%), Positives = 52/106 (49%), Gaps = 9/106 (8%)
 Frame = -3

Query: 293 TSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEE-------N 135
           T+   RPN VKK WDS N+LLV+FAILC           A  + N NVS  E       +
Sbjct: 60  TTHILRPNSVKKGWDSFNILLVVFAILCGIFARKNDDNSA-AERNRNVSTTESSSNFNDH 118

Query: 134 HQQST--QDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
           H   T   D+WFE S    Y            V +LRRSSSSYPDL
Sbjct: 119 HMPPTVSNDRWFETSHDKTY----NFGVPETSVNRLRRSSSSYPDL 160


>ref|XP_007215011.1| hypothetical protein PRUPE_ppa002494mg [Prunus persica]
           gi|462411161|gb|EMJ16210.1| hypothetical protein
           PRUPE_ppa002494mg [Prunus persica]
          Length = 666

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 51/146 (34%), Positives = 64/146 (43%), Gaps = 20/146 (13%)
 Frame = -3

Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201
           F +SG  I+                  S TSQ FRP+ VKKSWDSLNL+LVLFAI+C   
Sbjct: 116 FFSSGAFILALLAIALVFIFFIIPSVLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVC--- 172

Query: 200 XXXXXXXGANVDENTNVSKEENHQQ-------------------STQDQWF-EYSDRGIY 81
                    N + + N+S   ++ Q                   ST  QWF +YSDR  Y
Sbjct: 173 ----GFLSRNTNNDGNLSSPSSYDQVHNQTVFNSSSPQAPKSNPSTPRQWFDQYSDRTGY 228

Query: 80  XXXXXXXXXSKMVGKLRRSSSSYPDL 3
                    +   G   R+SSSYPDL
Sbjct: 229 NQSSSSTSAAMNRGV--RTSSSYPDL 252


>gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]
          Length = 509

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 43/106 (40%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
 Frame = -3

Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKE---ENHQ 129
           S TS  FRP  VKKSWD LN+ LVLFAILC           AN D      +    E  +
Sbjct: 59  SFTSLIFRPIAVKKSWDLLNIFLVLFAILCGIFARRNDDESANNDVVPTARRSGGVEESE 118

Query: 128 QSTQDQWFEYSD----RGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
            +   +WF +SD      IY           +  +LRRSSSSYPDL
Sbjct: 119 PANPQRWFAFSDDRRSEKIYDSVDRTAESGSL-RRLRRSSSSYPDL 163


>ref|XP_007203847.1| hypothetical protein PRUPE_ppa004367m1g, partial [Prunus persica]
           gi|462399378|gb|EMJ05046.1| hypothetical protein
           PRUPE_ppa004367m1g, partial [Prunus persica]
          Length = 339

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 50/140 (35%), Positives = 63/140 (45%), Gaps = 15/140 (10%)
 Frame = -3

Query: 377 LNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNL-VKKSWDSLNLLLVLFAILC--- 210
           L+  VLI++                 SLTSQ  RP + VKKSWDSLN+LLV+FAILC   
Sbjct: 12  LSPPVLIILLPIITFLFLFCTIPPFLSLTSQILRPTISVKKSWDSLNVLLVVFAILCGIF 71

Query: 209 ----XXXXXXXXXXGANVDENTNVSKEENHQQSTQD-------QWFEYSDRGIYXXXXXX 63
                           N  +  N S   N+  +T +       QWF +S+R         
Sbjct: 72  AKRNDDGSPAEEDPIQNASDPLNNSIAANNTTNTSEAEVLLPQQWFGFSER--------- 122

Query: 62  XXXSKMVGKLRRSSSSYPDL 3
                  G+LRRSSSSYPDL
Sbjct: 123 -PPETRGGRLRRSSSSYPDL 141


>ref|XP_003597554.1| hypothetical protein MTR_2g099520 [Medicago truncatula]
           gi|355486602|gb|AES67805.1| hypothetical protein
           MTR_2g099520 [Medicago truncatula]
          Length = 485

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 46/138 (33%), Positives = 57/138 (41%), Gaps = 12/138 (8%)
 Frame = -3

Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201
           FL+SG +I+V                 S  S  F PN VKKSWDSLNLLLVLFAI C   
Sbjct: 30  FLSSGTVIIVLLVIALAFILVIVPTLHSFASHIFNPNSVKKSWDSLNLLLVLFAIFCGFL 89

Query: 200 XXXXXXXGANVDENTNVSKEENHQQSTQDQ-----------WFEYS-DRGIYXXXXXXXX 57
                       E+ N +  + + Q   ++           W+EYS DR  Y        
Sbjct: 90  SKNNNNESPRSYEDQNQTFSDTNTQQEYEKPNPEPETAPRFWYEYSEDRTSYNRL----- 144

Query: 56  XSKMVGKLRRSSSSYPDL 3
                    RS +SYPDL
Sbjct: 145 ---------RSFNSYPDL 153


>ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutrema salsugineum]
           gi|557102337|gb|ESQ42700.1| hypothetical protein
           EUTSA_v10013114mg [Eutrema salsugineum]
          Length = 570

 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 42/118 (35%), Positives = 61/118 (51%), Gaps = 19/118 (16%)
 Frame = -3

Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEENHQQST 120
           S+TSQ F+P  VKK WDS+N++LV+FAILC           ++  ++++V +EE+   + 
Sbjct: 57  SITSQIFQPASVKKGWDSINVVLVVFAILCGVLARQNDDGLSSSSQSSHVEEEEDDVTNG 116

Query: 119 QD-----------QWFE---YSDR-GIYXXXXXXXXXSKM----VGKLRRSSSSYPDL 3
           +D           QWF+    +DR  IY           +       LRRSSSSYPDL
Sbjct: 117 EDSKISSSPVVSQQWFDDVYDADRLKIYESLSNRSFSPGLPVTGTLPLRRSSSSYPDL 174


>ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis]
           gi|223539444|gb|EEF41034.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 553

 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 27/57 (47%), Positives = 36/57 (63%)
 Frame = -3

Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILC 210
           FLNSGV++++                 + TSQ F+PNL+KK WDSLN +LVLFAI+C
Sbjct: 32  FLNSGVILIMLLVIAFVFVFVVVPSVVTFTSQVFKPNLIKKGWDSLNFVLVLFAIVC 88


>ref|XP_004487232.1| PREDICTED: uncharacterized protein LOC101499728 [Cicer arietinum]
          Length = 435

 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 46/147 (31%), Positives = 57/147 (38%), Gaps = 22/147 (14%)
 Frame = -3

Query: 377 LNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXX 198
           + SG ++++                 S TS  FRPN VKKSWDSLN+LLVLFAI C    
Sbjct: 31  VTSGTVVIILLVTTIAFALVVVPTLQSFTSHIFRPNSVKKSWDSLNILLVLFAIFC---- 86

Query: 197 XXXXXXGANVDENTNVSKEENHQQSTQD---------------------QWFEYS-DRGI 84
                   + + NTN S      Q+  D                      W+EYS DR  
Sbjct: 87  -----GFLSRNNNTNESPRSYEDQTFSDTNTRQEYEKPNLEPETEMPPFSWYEYSEDRTS 141

Query: 83  YXXXXXXXXXSKMVGKLRRSSSSYPDL 3
           Y                 RS +SYPDL
Sbjct: 142 YNRL--------------RSFNSYPDL 154


>ref|XP_003546756.1| PREDICTED: uncharacterized protein LOC100811539 [Glycine max]
          Length = 419

 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 41/105 (39%), Positives = 53/105 (50%), Gaps = 6/105 (5%)
 Frame = -3

Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEENHQQST 120
           S T+Q F+PN VKKSWDSLN +L+LFAILC           +  + NTN S  +   +  
Sbjct: 57  SFTTQIFKPNSVKKSWDSLNFVLILFAILC--------GFLSRNNSNTNESFSDTPLEYD 108

Query: 119 QDQ------WFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
           +        W+E SDR  Y          K   +L RS SS+PDL
Sbjct: 109 KPNPPLPRPWYEESDRTPY----------KSYNRL-RSFSSHPDL 142


>gb|ACU21434.1| unknown [Glycine max]
          Length = 419

 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 41/105 (39%), Positives = 53/105 (50%), Gaps = 6/105 (5%)
 Frame = -3

Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEENHQQST 120
           S T+Q F+PN VKKSWDSLN +L+LFAILC           +  + NTN S  +   +  
Sbjct: 57  SFTTQIFKPNSVKKSWDSLNFVLILFAILC--------GFLSRNNSNTNESFSDTPLEYD 108

Query: 119 QDQ------WFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
           +        W+E SDR  Y          K   +L RS SS+PDL
Sbjct: 109 KPNPPLPRPWYEESDRTPY----------KSYNRL-RSFSSHPDL 142


>ref|XP_002514089.1| conserved hypothetical protein [Ricinus communis]
           gi|223546545|gb|EEF48043.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 831

 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 37/102 (36%), Positives = 49/102 (48%), Gaps = 10/102 (9%)
 Frame = -3

Query: 278 RPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGAN----------VDENTNVSKEENHQ 129
           RP+ VKKSWDSLN+ LVLFAILC           A           +  N+N +KE +H 
Sbjct: 46  RPSTVKKSWDSLNVFLVLFAILCGIFARRNDDDSAPSGDHSNSSSVLHNNSNNNKERDHA 105

Query: 128 QSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3
            S    W + +              +  + +L+RSSSSYPDL
Sbjct: 106 VSNHSHWLDDNQ----------FASATPMRRLKRSSSSYPDL 137


Top