BLASTX nr result
ID: Akebia24_contig00044928
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00044928
         (382 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007012730.1| Hydroxyproline-rich glycoprotein family prot...    80   2e-13
gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis]      78   1e-12
ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261...    76   4e-12
emb|CBI35923.3| unnamed protein product [Vitis vinifera]               70   3e-10
ref|XP_006369280.1| hydroxyproline-rich glycoprotein [Populus tr...    70   3e-10
ref|XP_006342342.1| PREDICTED: pollen-specific leucine-rich repe...    69   5e-10
ref|XP_007024556.1| Hydroxyproline-rich glycoprotein family prot...    69   5e-10
ref|XP_004243732.1| PREDICTED: uncharacterized protein LOC101260...    66   4e-09
ref|XP_007215011.1| hypothetical protein PRUPE_ppa002494mg [Prun...    66   6e-09
gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis]      64   2e-08
ref|XP_007203847.1| hypothetical protein PRUPE_ppa004367m1g, par...    64   2e-08
ref|XP_003597554.1| hypothetical protein MTR_2g099520 [Medicago ...    62   8e-08
ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutr...    59   5e-07
ref|XP_002521366.1| conserved hypothetical protein [Ricinus comm...    59   7e-07
ref|XP_004487232.1| PREDICTED: uncharacterized protein LOC101499...    58   1e-06
ref|XP_003546756.1| PREDICTED: uncharacterized protein LOC100811...    58   1e-06
gb|ACU21434.1| unknown [Glycine max]                                   58   1e-06
ref|XP_002514089.1| conserved hypothetical protein [Ricinus comm...
57 2e-06 >ref|XP_007012730.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] gi|508783093|gb|EOY30349.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 610 Score = 80.5 bits (197), Expect = 2e-13 Identities = 48/109 (44%), Positives = 58/109 (53%), Gaps = 10/109 (9%) Frame = -3 Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANV----------DENTNV 150 S+TSQ FRPN V+KSWDSLN+ LVLFAILC N D N N Sbjct: 54 SITSQIFRPNGVRKSWDSLNIFLVLFAILCGVFARRNDDDDNNSGSSGNNNVRNDNNNNK 113 Query: 149 SKEENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 ++ +H ++Q QWF Y R IY V +L+RSSSSYPDL Sbjct: 114 NEASSHPVNSQ-QWFGYPGRKIYDDDPPMNASGTSVRRLKRSSSSYPDL 161 >gb|EXB29688.1| hypothetical protein L484_013462 [Morus notabilis] Length = 530 Score = 78.2 bits (191), Expect = 1e-12 Identities = 48/126 (38%), Positives = 63/126 (50%), Gaps = 2/126 (1%) Frame = -3 Query: 374 NSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXX 195 NSG +++ S TSQ FRP+ VKKSWDSLNL+LVLFAI+C Sbjct: 39 NSGAVLIALIVTALAFIFVIIPSFLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVCGFLSR 98 Query: 194 XXXXXGANVDENTNVSKE--ENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSS 21 ++ ++ VS E + ST QW+EYSDR + + + RSS Sbjct: 99 NSTENTSSNHDDQRVSNEGGQKSNPSTPHQWYEYSDR------TQSDSFNSRIYRRMRSS 152 Query: 20 SSYPDL 3 SSYPDL Sbjct: 153 SSYPDL 158 >ref|XP_002279061.1| PREDICTED: uncharacterized protein LOC100261010 [Vitis vinifera] Length = 555 Score = 76.3 bits (186), Expect = 4e-12 Identities = 52/128 (40%), Positives = 61/128 (47%), Gaps = 3/128 (2%) Frame = -3 Query: 377 LNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXX 198 LN VLI++ + TSQF RPN V+KSWDSLN+LLVLFAILC Sbjct: 30 LNPAVLIILLPILAMIVVFFAVPSFLNFTSQFLRPNSVRKSWDSLNVLLVLFAILCGVFA 89 Query: 197 XXXXXXGANVDENTNVSKE---ENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRR 27 +V EN S +S FE+SDR IY +LRR Sbjct: 90 RKNDEKNDDVLENHGSSGSVVMGKSHESISHSLFEFSDRKIYDPPIQSGSV-----RLRR 144 Query: 26 SSSSYPDL 3 SSSSYPDL Sbjct: 145 SSSSYPDL 152 >emb|CBI35923.3| unnamed protein product [Vitis 
vinifera] Length = 628 Score = 70.1 bits (170), Expect = 3e-10 Identities = 46/128 (35%), Positives = 62/128 (48%), Gaps = 2/128 (1%) Frame = -3 Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201 FL+SG LI+ S TS F+PN+VKKSWDSLNL+LVLFAI+C Sbjct: 69 FLSSGFLIIFLPLTALLFIVFVLPPILSFTSYIFKPNMVKKSWDSLNLVLVLFAIICGFL 128 Query: 200 XXXXXXXGANVDENTNVSKEENHQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLR--R 27 ++++ + + EE+ Q+S +E G G +R R Sbjct: 129 SRGGGGGSSDMESSVSEVPEESTQRSNHGHCYEERISG--------------YGGMRRMR 174 Query: 26 SSSSYPDL 3 SSSSYPDL Sbjct: 175 SSSSYPDL 182 >ref|XP_006369280.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550347738|gb|ERP65849.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 560 Score = 70.1 bits (170), Expect = 3e-10 Identities = 50/142 (35%), Positives = 64/142 (45%), Gaps = 16/142 (11%) Frame = -3 Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201 FLNSGV +V+ SLTSQ RP +KKSWDSLNL+LVLFAI+C Sbjct: 32 FLNSGVFLVILLVVALAFVLVVVSSIGSLTSQILRPQSIKKSWDSLNLVLVLFAIVCGFL 91 Query: 200 XXXXXXXGANV-------DENT---------NVSKEENHQQSTQDQWFEYSDRGIYXXXX 69 + +ENT NV K + + +WFE+ DR + Sbjct: 92 SSNNSSGSSGSGSGSGGDNENTSYYEDQSLSNVQKPSHPSSTPSHRWFEHQDRTV----- 146 Query: 68 XXXXXSKMVGKLRRSSSSYPDL 3 + +L RS SSYPDL Sbjct: 147 ----SYNTLNRL-RSFSSYPDL 163 >ref|XP_006342342.1| PREDICTED: pollen-specific leucine-rich repeat extensin-like protein 1-like [Solanum tuberosum] Length = 642 Score = 69.3 bits (168), Expect = 5e-10 Identities = 46/115 (40%), Positives = 55/115 (47%), Gaps = 18/115 (15%) Frame = -3 Query: 293 TSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEEN------- 135 T+Q RPN VKK WDS N+LLV+FAILC A V+ N NVS E+ Sbjct: 60 TTQILRPNSVKKGWDSFNILLVVFAILCGIFARKNDDNSA-VERNRNVSTTESSNFNDGS 118 Query: 134 -----------HQQSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 + + D+WFE SD Y V +LRRSSSSYPDL Sbjct: 119 ASADVDVDHDMRRPVSNDRWFEASDEKTY----HFGVPETSVNRLRRSSSSYPDL 169 >ref|XP_007024556.1| Hydroxyproline-rich glycoprotein family 
protein, putative [Theobroma cacao] gi|508779922|gb|EOY27178.1| Hydroxyproline-rich glycoprotein family protein, putative [Theobroma cacao] Length = 553 Score = 69.3 bits (168), Expect = 5e-10 Identities = 50/144 (34%), Positives = 65/144 (45%), Gaps = 18/144 (12%) Frame = -3 Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201 F N+G+LI++ S TSQ F+P+LVKKSWDSLNL+LVLFAI+C Sbjct: 30 FFNTGILIILLLVVALAFIFVIIPSFLSFTSQIFKPHLVKKSWDSLNLVLVLFAIIC--- 86 Query: 200 XXXXXXXGANVDENTNVSKEE---------------NHQQSTQDQWFEY---SDRGIYXX 75 N D +T + E+ ST QW++Y SDR Y Sbjct: 87 -GFLGKNNGNNDSDTRSTYEDYKFSTTPKHDRDHVGRSNPSTPRQWYDYSSSSDRTAYNS 145 Query: 74 XXXXXXXSKMVGKLRRSSSSYPDL 3 + RSS+SYPDL Sbjct: 146 L-----------QRLRSSNSYPDL 158 >ref|XP_004243732.1| PREDICTED: uncharacterized protein LOC101260449 [Solanum lycopersicum] Length = 608 Score = 66.2 bits (160), Expect = 4e-09 Identities = 45/106 (42%), Positives = 52/106 (49%), Gaps = 9/106 (8%) Frame = -3 Query: 293 TSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEE-------N 135 T+ RPN VKK WDS N+LLV+FAILC A + N NVS E + Sbjct: 60 TTHILRPNSVKKGWDSFNILLVVFAILCGIFARKNDDNSA-AERNRNVSTTESSSNFNDH 118 Query: 134 HQQST--QDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 H T D+WFE S Y V +LRRSSSSYPDL Sbjct: 119 HMPPTVSNDRWFETSHDKTY----NFGVPETSVNRLRRSSSSYPDL 160 >ref|XP_007215011.1| hypothetical protein PRUPE_ppa002494mg [Prunus persica] gi|462411161|gb|EMJ16210.1| hypothetical protein PRUPE_ppa002494mg [Prunus persica] Length = 666 Score = 65.9 bits (159), Expect = 6e-09 Identities = 51/146 (34%), Positives = 64/146 (43%), Gaps = 20/146 (13%) Frame = -3 Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201 F +SG I+ S TSQ FRP+ VKKSWDSLNL+LVLFAI+C Sbjct: 116 FFSSGAFILALLAIALVFIFFIIPSVLSFTSQIFRPHSVKKSWDSLNLVLVLFAIVC--- 172 Query: 200 XXXXXXXGANVDENTNVSKEENHQQ-------------------STQDQWF-EYSDRGIY 81 N + + N+S ++ Q ST QWF +YSDR Y Sbjct: 173 ----GFLSRNTNNDGNLSSPSSYDQVHNQTVFNSSSPQAPKSNPSTPRQWFDQYSDRTGY 228 Query: 80 
XXXXXXXXXSKMVGKLRRSSSSYPDL 3 + G R+SSSYPDL Sbjct: 229 NQSSSSTSAAMNRGV--RTSSSYPDL 252 >gb|EXB38898.1| hypothetical protein L484_027333 [Morus notabilis] Length = 509 Score = 64.3 bits (155), Expect = 2e-08 Identities = 43/106 (40%), Positives = 51/106 (48%), Gaps = 7/106 (6%) Frame = -3 Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKE---ENHQ 129 S TS FRP VKKSWD LN+ LVLFAILC AN D + E + Sbjct: 59 SFTSLIFRPIAVKKSWDLLNIFLVLFAILCGIFARRNDDESANNDVVPTARRSGGVEESE 118 Query: 128 QSTQDQWFEYSD----RGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 + +WF +SD IY + +LRRSSSSYPDL Sbjct: 119 PANPQRWFAFSDDRRSEKIYDSVDRTAESGSL-RRLRRSSSSYPDL 163 >ref|XP_007203847.1| hypothetical protein PRUPE_ppa004367m1g, partial [Prunus persica] gi|462399378|gb|EMJ05046.1| hypothetical protein PRUPE_ppa004367m1g, partial [Prunus persica] Length = 339 Score = 63.9 bits (154), Expect = 2e-08 Identities = 50/140 (35%), Positives = 63/140 (45%), Gaps = 15/140 (10%) Frame = -3 Query: 377 LNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNL-VKKSWDSLNLLLVLFAILC--- 210 L+ VLI++ SLTSQ RP + VKKSWDSLN+LLV+FAILC Sbjct: 12 LSPPVLIILLPIITFLFLFCTIPPFLSLTSQILRPTISVKKSWDSLNVLLVVFAILCGIF 71 Query: 209 ----XXXXXXXXXXGANVDENTNVSKEENHQQSTQD-------QWFEYSDRGIYXXXXXX 63 N + N S N+ +T + QWF +S+R Sbjct: 72 AKRNDDGSPAEEDPIQNASDPLNNSIAANNTTNTSEAEVLLPQQWFGFSER--------- 122 Query: 62 XXXSKMVGKLRRSSSSYPDL 3 G+LRRSSSSYPDL Sbjct: 123 -PPETRGGRLRRSSSSYPDL 141 >ref|XP_003597554.1| hypothetical protein MTR_2g099520 [Medicago truncatula] gi|355486602|gb|AES67805.1| hypothetical protein MTR_2g099520 [Medicago truncatula] Length = 485 Score = 62.0 bits (149), Expect = 8e-08 Identities = 46/138 (33%), Positives = 57/138 (41%), Gaps = 12/138 (8%) Frame = -3 Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXX 201 FL+SG +I+V S S F PN VKKSWDSLNLLLVLFAI C Sbjct: 30 FLSSGTVIIVLLVIALAFILVIVPTLHSFASHIFNPNSVKKSWDSLNLLLVLFAIFCGFL 89 Query: 200 XXXXXXXGANVDENTNVSKEENHQQSTQDQ-----------WFEYS-DRGIYXXXXXXXX 57 E+ N + + + Q ++ W+EYS DR Y Sbjct: 90 
SKNNNNESPRSYEDQNQTFSDTNTQQEYEKPNPEPETAPRFWYEYSEDRTSYNRL----- 144 Query: 56 XSKMVGKLRRSSSSYPDL 3 RS +SYPDL Sbjct: 145 ---------RSFNSYPDL 153 >ref|XP_006401247.1| hypothetical protein EUTSA_v10013114mg [Eutrema salsugineum] gi|557102337|gb|ESQ42700.1| hypothetical protein EUTSA_v10013114mg [Eutrema salsugineum] Length = 570 Score = 59.3 bits (142), Expect = 5e-07 Identities = 42/118 (35%), Positives = 61/118 (51%), Gaps = 19/118 (16%) Frame = -3 Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEENHQQST 120 S+TSQ F+P VKK WDS+N++LV+FAILC ++ ++++V +EE+ + Sbjct: 57 SITSQIFQPASVKKGWDSINVVLVVFAILCGVLARQNDDGLSSSSQSSHVEEEEDDVTNG 116 Query: 119 QD-----------QWFE---YSDR-GIYXXXXXXXXXSKM----VGKLRRSSSSYPDL 3 +D QWF+ +DR IY + LRRSSSSYPDL Sbjct: 117 EDSKISSSPVVSQQWFDDVYDADRLKIYESLSNRSFSPGLPVTGTLPLRRSSSSYPDL 174 >ref|XP_002521366.1| conserved hypothetical protein [Ricinus communis] gi|223539444|gb|EEF41034.1| conserved hypothetical protein [Ricinus communis] Length = 553 Score = 58.9 bits (141), Expect = 7e-07 Identities = 27/57 (47%), Positives = 36/57 (63%) Frame = -3 Query: 380 FLNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILC 210 FLNSGV++++ + TSQ F+PNL+KK WDSLN +LVLFAI+C Sbjct: 32 FLNSGVILIMLLVIAFVFVFVVVPSVVTFTSQVFKPNLIKKGWDSLNFVLVLFAIVC 88 >ref|XP_004487232.1| PREDICTED: uncharacterized protein LOC101499728 [Cicer arietinum] Length = 435 Score = 58.2 bits (139), Expect = 1e-06 Identities = 46/147 (31%), Positives = 57/147 (38%), Gaps = 22/147 (14%) Frame = -3 Query: 377 LNSGVLIVVXXXXXXXXXXXXXXXXFSLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXX 198 + SG ++++ S TS FRPN VKKSWDSLN+LLVLFAI C Sbjct: 31 VTSGTVVIILLVTTIAFALVVVPTLQSFTSHIFRPNSVKKSWDSLNILLVLFAIFC---- 86 Query: 197 XXXXXXGANVDENTNVSKEENHQQSTQD---------------------QWFEYS-DRGI 84 + + NTN S Q+ D W+EYS DR Sbjct: 87 -----GFLSRNNNTNESPRSYEDQTFSDTNTRQEYEKPNLEPETEMPPFSWYEYSEDRTS 141 Query: 83 YXXXXXXXXXSKMVGKLRRSSSSYPDL 3 Y RS +SYPDL Sbjct: 142 YNRL--------------RSFNSYPDL 154 >ref|XP_003546756.1| PREDICTED: uncharacterized 
protein LOC100811539 [Glycine max] Length = 419 Score = 58.2 bits (139), Expect = 1e-06 Identities = 41/105 (39%), Positives = 53/105 (50%), Gaps = 6/105 (5%) Frame = -3 Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEENHQQST 120 S T+Q F+PN VKKSWDSLN +L+LFAILC + + NTN S + + Sbjct: 57 SFTTQIFKPNSVKKSWDSLNFVLILFAILC--------GFLSRNNSNTNESFSDTPLEYD 108 Query: 119 QDQ------WFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 + W+E SDR Y K +L RS SS+PDL Sbjct: 109 KPNPPLPRPWYEESDRTPY----------KSYNRL-RSFSSHPDL 142 >gb|ACU21434.1| unknown [Glycine max] Length = 419 Score = 58.2 bits (139), Expect = 1e-06 Identities = 41/105 (39%), Positives = 53/105 (50%), Gaps = 6/105 (5%) Frame = -3 Query: 299 SLTSQFFRPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGANVDENTNVSKEENHQQST 120 S T+Q F+PN VKKSWDSLN +L+LFAILC + + NTN S + + Sbjct: 57 SFTTQIFKPNSVKKSWDSLNFVLILFAILC--------GFLSRNNSNTNESFSDTPLEYD 108 Query: 119 QDQ------WFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 + W+E SDR Y K +L RS SS+PDL Sbjct: 109 KPNPPLPRPWYEESDRTPY----------KSYNRL-RSFSSHPDL 142 >ref|XP_002514089.1| conserved hypothetical protein [Ricinus communis] gi|223546545|gb|EEF48043.1| conserved hypothetical protein [Ricinus communis] Length = 831 Score = 57.4 bits (137), Expect = 2e-06 Identities = 37/102 (36%), Positives = 49/102 (48%), Gaps = 10/102 (9%) Frame = -3 Query: 278 RPNLVKKSWDSLNLLLVLFAILCXXXXXXXXXXGAN----------VDENTNVSKEENHQ 129 RP+ VKKSWDSLN+ LVLFAILC A + N+N +KE +H Sbjct: 46 RPSTVKKSWDSLNVFLVLFAILCGIFARRNDDDSAPSGDHSNSSSVLHNNSNNNKERDHA 105 Query: 128 QSTQDQWFEYSDRGIYXXXXXXXXXSKMVGKLRRSSSSYPDL 3 S W + + + + +L+RSSSSYPDL Sbjct: 106 VSNHSHWLDDNQ----------FASATPMRRLKRSSSSYPDL 137
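
The per-hit statistics in the report above (Score, Expect, Identities, Positives, Gaps) follow a fixed textual pattern, so they can be pulled out of the flattened text with a regular expression. The following is a minimal sketch, not part of the BLAST distribution: the function name `parse_hit_stats`, the dictionary keys, and the regular expression are assumptions made for illustration, written against the wording seen in this report. It also recomputes percent identity from the raw counts (for example 48/109 for the first hit), which BLAST prints as a whole number.

```python
import re

# Hypothetical pattern (an assumption, based only on the wording of this
# report) for the statistics BLAST prints above each alignment. The
# "Gaps = ..." field is optional because ungapped hits omit it.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
)


def parse_hit_stats(text):
    """Return one dict per 'Score = ... Identities = ...' block in text."""
    hits = []
    for m in STATS_RE.finditer(text):
        evalue = m.group("evalue")
        # Some legacy reports abbreviate 1e-100 as "e-100"; make it parseable.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m.group("pos")),
            "align_len": length,
            "gaps": int(m.group("gaps") or 0),
            # Recomputed from the counts rather than read from "(44%)".
            "pct_identity": 100.0 * ident / length,
        })
    return hits


if __name__ == "__main__":
    # Statistics line of the top hit, copied from the report above.
    line = ("Score = 80.5 bits (197), Expect = 2e-13 Identities = 48/109 (44%), "
            "Positives = 58/109 (53%), Gaps = 10/109 (9%) Frame = -3")
    for hit in parse_hit_stats(line):
        print(hit)
```

Because the pattern tolerates arbitrary whitespace between fields, the same function should work on the original line-wrapped BLAST layout as well as on the flattened text shown here. For production use, a maintained parser for BLAST output is a safer choice than a hand-written regular expression.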