BLASTX nr result
ID: Akebia27_contig00019433
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00019433
         (520 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r...    72   8e-11
ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r...    70   4e-10
ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r...    68   1e-09
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]    68   1e-09
ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r...    67   2e-09
gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus...    67   3e-09
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]      65   1e-08
ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r...    64   3e-08
ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prun...    64   3e-08
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...    63   4e-08
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]      62   6e-08
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...    62   6e-08
ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxypro...    62   1e-07
ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-r...    61   1e-07
ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294...    61   1e-07
emb|CBI22611.3| unnamed protein product [Vitis vinifera]               61   1e-07
ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306...    61   2e-07
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...    61   2e-07
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...    60   2e-07
ref|XP_004309174.1| PREDICTED: uncharacterized protein LOC101303...    60   3e-07

>ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508777614|gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 188

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 37/105 (35%), Positives = 54/105 (51%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           ++FDNTT  + +    VG  EI K +A+ARKT+ + + VDV
Sbjct: 83  FRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSAVSDEDELRTKLSSGT 142

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISC 318
                 ARLRGK T+M +MKKRK+ EMNC + +  N  ++  + C
Sbjct: 143 LTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDC 187

>ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 215

 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 39/107 (36%), Positives = 54/107 (50%), Gaps = 1/107 (0%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDV-XXXXXXXXXXXXXXXXXX 180
           YKFDN TM+  Y    VG A I K +A+AR T+ L + V+V
Sbjct: 109 YKFDNATMSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSS 168

Query: 181 XXXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
                S A+L+GK  +M VMKK+KS EMNC +I   +  S+  + C+
Sbjct: 169 VLTLNSQAKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215

>ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 191

 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 37/106 (34%), Positives = 54/106 (50%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +KF N+T+T+ Y+ T VG A I K +A+AR T  L + V V
Sbjct: 86  FKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDKMSRNSALSSDVGSGT 145

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               SHA+L GK  +  V KK+KS EMNC + + T+   I ++ C+
Sbjct: 146 INLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMCQ 191

>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 37/106 (34%), Positives = 55/106 (51%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +KFDN+T T+ Y  TAVG A I K +A++R T+   I V +
Sbjct: 81  FKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSKVNNHRQLRRDLNSGV 140

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               S A+L GK  +  + KK+KS EM+C + + TN  SI ++SC+
Sbjct: 141 LNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186

>ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 201

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 34/106 (32%), Positives = 55/106 (51%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +KFDNTT++  Y    VG A ++KG+AKAR T+ + + VD+
Sbjct: 96  FKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSNNIPANSNLASDISSGF 155

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               +H +L GK  +M ++KK+KS +MNC + +     +I  I C+
Sbjct: 156 LTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIKCQ 201

>gb|EYU25168.1| hypothetical protein MIMGU_mgv1a018620mg [Mimulus guttatus]
          Length = 214

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 36/107 (33%), Positives = 53/107 (49%)
 Frame = +1

Query: 1   EYKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXX 180
           +YK+ N+T+  +++ T VG A I + +A AR TR     VD+
Sbjct: 108 QYKYQNSTVEFFFRGTKVGEARIVRSRANARSTRRFLATVDLSSAGVPTEVLANEFRTHA 167

Query: 181 XXXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
                S + LRGK  +M +MKK KS  MNC + I  +   +G+ISCR
Sbjct: 168 LIPLTSRSTLRGKVEIMKLMKKNKSTNMNCTMEIMISSKQLGNISCR 214

>gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]
          Length = 213

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 36/109 (33%), Positives = 54/109 (49%)
 Frame = +1

Query: 1   EYKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXX 180
           E+KFD +++T  Y+ T VG A + KGKAKAR T+ + +  +V
Sbjct: 108 EFKFDESSITFVYKGTEVGDASVEKGKAKARSTKKMNVTAEV-----NANSNLANDVRSG 162

Query: 181 XXXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR*N 327
                S ++L GK  +M V+KK+K+ EMNC I I      +    C+ N
Sbjct: 163 FLTLTSQSKLNGKVHLMKVIKKKKTAEMNCTITINLENKVVQDFKCKSN 211

>ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao]
 gi|508777613|gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao]
          Length = 226

 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 33/89 (37%), Positives = 50/89 (56%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +K++N+T+T+ Y    VG A I KG+A+AR+T+   I VD+
Sbjct: 80  FKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSSRLSSNSNLGNDINAGV 139

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNC 270
               S A+L+GK  +M V+KK+KSGEM+C
Sbjct: 140 LPLSSQAKLKGKVHLMKVIKKKKSGEMSC 168

>ref|XP_007210661.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
 gi|462406396|gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 34/109 (31%), Positives = 58/109 (53%), Gaps = 2/109 (1%)
 Frame = +1

Query: 1   EYKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXX 180
           EYKF+ ++ +L+Y    VG A+I KG+ KAR TR + +++DV
Sbjct: 105 EYKFEGSSASLWYGGFKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQEAKNGFEGEMN 164

Query: 181 XXXXR--SHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               +  S+A+L GK  +M +MKKRK+ + NC +++     ++  + CR
Sbjct: 165 SGYLKISSYAKLTGKVNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLFCR 213

>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. vesca]
          Length = 211

 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 34/106 (32%), Positives = 52/106 (49%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           YKFD +T+T  YQ  AVG   + KGKA  R T+ + + V +
Sbjct: 106 YKFDASTVTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGV 165

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               S A+L GK  +M +MKK+KS  M+C+I    +  ++ S+ C+
Sbjct: 166 LTLNSQAKLSGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211

>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 34/93 (36%), Positives = 45/93 (48%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           YKFDNTT    Y+   VG   I KGKA  R T+ +P++V +
Sbjct: 109 YKFDNTTAAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGI 168

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIII 282
              R  A++ GK  +M +MKK+KS  MNC I I
Sbjct: 169 LTLRCTAKMTGKVKLMLIMKKKKSANMNCTINI 201

>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca]
          Length = 211

 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 34/106 (32%), Positives = 51/106 (48%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           YKFD +T T  YQ  AVG   I K KA+ R T+ + ++V +
Sbjct: 106 YKFDASTATFMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGI 165

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               S A+L GK  +M +MKK+KS  M+C I    +  ++ S+ C+
Sbjct: 166 LTLTSQAKLTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211

>ref|XP_007051691.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao]
 gi|590721708|ref|XP_007051692.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao]
 gi|508703952|gb|EOX95848.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao]
 gi|508703953|gb|EOX95849.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative isoform 1 [Theobroma cacao]
          Length = 220

 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 32/111 (28%), Positives = 57/111 (51%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +K+ NTT TLYY  T VG A    G+AKAR+T  + I+VD+
Sbjct: 110 FKYKNTTTTLYYYGTIVGEARGPAGRAKARRTMRMNISVDIITDRLLASPNLVADVNSGT 169

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR*NVEI 336
               S++R+ G+  +++++KK  + +MNC + +  +  +I    C+  V++
Sbjct: 170 LTMSSYSRIGGRVNMLNIIKKHVTVKMNCSMTVNISSQAIQEQKCKRKVDL 220

>ref|XP_007040371.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
 gi|508777616|gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao]
          Length = 213

 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 31/94 (32%), Positives = 49/94 (52%)
 Frame = +1

Query: 1   EYKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXX 180
           ++KF+NTT T++     VG  +I  G+A+AR T  L ++VDV
Sbjct: 107 DFKFENTTGTVWCGSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSG 166

Query: 181 XXXXRSHARLRGKSTVMHVMKKRKSGEMNCIIII 282
                SH +L GK ++M+ MK+R+  EMNC + +
Sbjct: 167 LLELNSHVKLSGKVSIMNFMKRRRHPEMNCFMTL 200

>ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca subsp. vesca]
          Length = 182

 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 28/106 (26%), Positives = 52/106 (49%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +K+ N+T+++ Y+   +G A +   KAKAR TR   I + +
Sbjct: 77  FKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGTTNLTTAIGAGV 136

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               S + L+GK  VM ++KK KSG+M+C +++     ++  + C+
Sbjct: 137 VPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKCK 182

>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 34/93 (36%), Positives = 48/93 (51%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +KF N+T+TL Y+   VG A+ISK +A+AR T+ + + VDV
Sbjct: 192 FKFKNSTITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDINSGF 251

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIII 282
                  +L GK  +M V KK+KS +MNC I I
Sbjct: 252 LTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKI 284

>ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca subsp. vesca]
          Length = 219

 Score = 60.8 bits (146), Expect = 2e-07
 Identities = 32/111 (28%), Positives = 54/111 (48%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           +K+ NTT TLYY  T VG A    G+AKAR+T  + I VD+
Sbjct: 109 FKYSNTTTTLYYHGTVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGL 168

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR*NVEI 336
               S++R+ G+  +++++KK    +MNC + +  +  +I    C+  V +
Sbjct: 169 LTMSSYSRIPGRVNMLNIVKKHVVVKMNCTMTVNISSQAIQEQKCKRKVSL 219

>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca subsp. vesca]
          Length = 211

 Score = 60.8 bits (146), Expect = 2e-07
 Identities = 34/106 (32%), Positives = 49/106 (46%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           YKFD   +T  YQ   VG   + KGKA  R T+ + + V +
Sbjct: 106 YKFDQGVVTFMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGV 165

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               S A+L GK  +M +MKK+KS  MNC I I  +  ++ S+ C+
Sbjct: 166 LTLTSEAKLTGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211

>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca subsp. vesca]
          Length = 200

 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 36/106 (33%), Positives = 51/106 (48%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           YKFD   +T  YQ T VG   + KGKA  R T+ +  +V +
Sbjct: 104 YKFDEGVVTFKYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLT----- 158

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               S A+L GK T+M +MKK+KS  MNC I I  +  ++ S+ C+
Sbjct: 159 ----SEAKLTGKVTLMFIMKKKKSASMNCTIQIDVSGQTVKSVVCK 200

>ref|XP_004309174.1| PREDICTED: uncharacterized protein LOC101303468 [Fragaria vesca subsp. vesca]
          Length = 178

 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 32/106 (30%), Positives = 56/106 (52%)
 Frame = +1

Query: 4   YKFDNTTMTLYYQDTAVGAAEISKGKAKARKTRTLPIAVDVXXXXXXXXXXXXXXXXXXX 183
           YKF++   T  Y+ T +G   ISK KAKA+KT+ + + V +
Sbjct: 78  YKFESAKATFSYKGTNIGEGTISKDKAKAKKTKKINVTVSL-----NSDKITASDISSGN 132

Query: 184 XXXRSHARLRGKSTVMHVMKKRKSGEMNCIIIIRTNPISIGSISCR 321
               ++A+L GK  +++++KK+KS E+NC I + T   ++  +SC+
Sbjct: 133 VTLTAYAKLDGKVHLLNIIKKKKSSELNCTIHVDTKAKAVHVLSCK 178
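
The one-line hit summaries near the top of this report (sequence ID, truncated description, bit score, E-value) are regular enough to split with a regular expression. The following is a minimal Python sketch, assuming each summary sits on its own line as in standard BLAST text output. The names Hit, SUMMARY_RE and parse_summary_line are illustrative only and are not part of BLAST or of any library.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' list."""
    accession: str
    description: str
    bits: float
    evalue: float


# Assumed layout: <id> <description> <bit score> <E-value>, whitespace separated.
# The E-value alternative "e-100" (no leading digit) is accepted as a precaution.
SUMMARY_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?|[eE][-+]?\d+)\s*$"
)


def parse_summary_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    match = SUMMARY_RE.match(line.strip())
    if match is None:
        return None
    evalue_text = match.group("evalue")
    if evalue_text[0] in "eE":
        # Make a bare exponent such as "e-100" parseable by float().
        evalue_text = "1" + evalue_text
    return Hit(
        accession=match.group("acc"),
        description=match.group("desc"),
        bits=float(match.group("bits")),
        evalue=float(evalue_text),
    )


if __name__ == "__main__":
    sample = "emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]    68   1e-09"
    print(parse_summary_line(sample))
```

Lines that are not hit summaries (the column headers, for example) return None, so the function can be mapped over every line of the report and the None results dropped.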