BLASTX nr result
ID: Akebia27_contig00024544
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia27_contig00024544 (800 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI22611.3| unnamed protein product [Vitis vinifera] 150 7e-34 ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-r... 149 2e-33 ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-r... 146 8e-33 ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-r... 145 2e-32 ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-r... 142 1e-31 ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prun... 139 1e-30 ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-r... 137 6e-30 ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prun... 135 1e-29 ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-r... 135 2e-29 ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobrom... 132 1e-28 ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295... 132 1e-28 ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294... 132 2e-28 ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-r... 129 1e-27 ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296... 129 2e-27 ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296... 128 3e-27 ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-r... 127 4e-27 gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] 127 5e-27 gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus... 125 1e-26 ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296... 125 1e-26 gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] 125 2e-26 >emb|CBI22611.3| unnamed protein product [Vitis vinifera] Length = 297 Score = 150 bits (378), Expect = 7e-34 Identities = 82/190 (43%), Positives = 120/190 (63%), Gaps = 1/190 (0%) Frame = -1 Query: 713 RVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNST 534 + ++ S+E R K + I + +F+T+ ++ P + +V+IENLN T Sbjct: 107 KTDVESEELRRMKCTRYIAYLSAFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYT 166 Query: 533 S-ATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHM 357 S T+ SFN+R KV VKN N+G +KF N +TL+Y G VGDA+I + +A+ RST+ M Sbjct: 167 SDTTSPSFNIRFNAKVAVKNTNFGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKM 226 Query: 356 DVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINT 177 +VT+DVTS+ +S NSNLASD++ G LT KLNG+V +M + KK KS QMNCTI IN Sbjct: 227 NVTVDVTSNNVSSNSNLASDINSGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINL 286 Query: 176 RSQTVQQIIC 147 ++ +Q+ C Sbjct: 287 ENKVIQEWKC 296 >ref|XP_007038868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776113|gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 201 Score = 149 bits (375), Expect = 2e-33 Identities = 77/183 (42%), Positives = 111/183 (60%), Gaps = 1/183 (0%) Frame = -1 Query: 692 ERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNT-S 516 E RKK KL +VFQT+ +K P ++ S+ +E++ TS N S Sbjct: 18 ELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPS 77 Query: 515 FNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVT 336 FNM+ +V VKN N+G +KF N ++ YGG VG+A + + +AK RST+ M+VT+D+ Sbjct: 78 FNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLN 137 Query: 335 SDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQ 156 S+ + NSNLASD+ G LT + KL+G+V +M LIKK KS QMNCT+ +N S+ +Q Sbjct: 138 SNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQD 197 Query: 155 IIC 147 I C Sbjct: 198 IKC 200 >ref|XP_007040370.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777615|gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 215 Score = 146 bits (369), Expect = 8e-33 Identities = 80/195 (41%), Positives = 116/195 (59%), Gaps = 2/195 (1%) Frame = -1 Query: 725 NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIEN 546 +DEE +L S+E RKK K + I VFQT+ VK P ++I V +E Sbjct: 20 SDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFALTVMRVKNPKVRIGKVTVET 79 Query: 545 LN-STSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 369 + S + SFN+R IT+VTVKN N+G YKF N M+ Y G VG+A IP+ +A+ RS Sbjct: 80 METSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLYDGVMVGEAIIPKARARARS 139 Query: 368 TRHMDVTIDVTSDWL-SRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCT 192 T+ +DVT++V S L S + L S+L LT N+ KL G+V +M ++KK KS +MNCT Sbjct: 140 TKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKGKVELMKVMKKKKSPEMNCT 199 Query: 191 IAINTRSQTVQQIIC 147 + N ++++Q + C Sbjct: 200 LIFNVSTRSLQDLKC 214 >ref|XP_007038863.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508776108|gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 191 Score = 145 bits (365), Expect = 2e-32 Identities = 69/180 (38%), Positives = 118/180 (65%), Gaps = 1/180 (0%) Frame = -1 Query: 683 RKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLN-STSATNTSFNM 507 RK+N K + IV ++ QT+ ++ P +++ V +ENLN ++S+++ SF+M Sbjct: 11 RKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFSM 70 Query: 506 RLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDW 327 L +VTVKN N+G +KF N +T+SY G VG+A I + +A+ RST ++VT+ V+SD Sbjct: 71 NLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSDK 130 Query: 326 LSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQIIC 147 +SRNS L+SD+ G + +++ KL+G++ + + KK KS +MNCT+ + T S+ +Q ++C Sbjct: 131 MSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLMC 190 >ref|XP_007040367.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777612|gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 185 Score = 142 bits (358), Expect = 1e-31 Identities = 69/179 (38%), Positives = 114/179 (63%) Frame = -1 Query: 683 RKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNTSFNMR 504 ++ N K + + V +VFQT +K P ++ +V +EN ++ ++++ F+MR Sbjct: 6 KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPFFDMR 65 Query: 503 LITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDWL 324 L+ +VTVKN N+G +K+ N + + YGG VG+A I + +A+ R T+ DVTID++S L Sbjct: 66 LMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSKL 125 Query: 323 SRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQIIC 147 S NSNL +D+ G L ++ KL+G+V +M +IKK KS +M+CT+ IN ++TVQ + C Sbjct: 126 STNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184 >ref|XP_007210712.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica] gi|462406447|gb|EMJ11911.1| hypothetical protein PRUPE_ppa022983mg [Prunus persica] Length = 209 Score = 139 bits (350), Expect = 1e-30 Identities = 73/192 (38%), Positives = 113/192 (58%) Frame = -1 Query: 722 DEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENL 543 DEE L S+E R+K K+ IV+ IV Q + VK P ++ ++ ++NL Sbjct: 18 DEESTALQSEELKRQKRIKMYKYIVIFIVVQLIVLPVFGLTVMKVKTPKFRLGNIKVQNL 77 Query: 542 NSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTR 363 +S +T SF T++ VKN NWG YKF G +T Y G TVG +P+ KAK+RST+ Sbjct: 78 SSVPST-PSFEASFATQIRVKNTNWGPYKFDAGTVTFMYKGVTVGQVVVPKSKAKMRSTK 136 Query: 362 HMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAI 183 +DVT+ + S L +SNL ++L G LT ++ KL G+V +M ++KK KS M+CT+ Sbjct: 137 KIDVTVSLNSYGLPSSSNLGTELKSGVLTLSSKGKLTGKVVLMLMMKKRKSATMDCTMTF 196 Query: 182 NTRSQTVQQIIC 147 + ++T++ + C Sbjct: 197 DLSTKTLKTLQC 208 >ref|XP_007040369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508777614|gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 188 Score = 137 bits (344), Expect = 6e-30 Identities = 68/186 (36%), Positives = 110/186 (59%) Frame = -1 Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525 L S++ R +N K I+ +VFQT+ +K P+ ++ SV +++LN ++ Sbjct: 2 LESEKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASG 61 Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 FNMRLI ++ VKNKN+G ++F N +++G VGD +I + +A+ R T+ M+VT+ Sbjct: 62 VPHFNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTV 121 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 DV+S +S L + L G LT +L G+VT+M L+KK K+ +MNCT+ +N S Sbjct: 122 DVSSSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHA 181 Query: 164 VQQIIC 147 VQ + C Sbjct: 182 VQDLDC 187 >ref|XP_007219159.1| hypothetical protein PRUPE_ppa016330mg [Prunus persica] gi|462415621|gb|EMJ20358.1| hypothetical protein PRUPE_ppa016330mg [Prunus persica] Length = 189 Score = 135 bits (341), Expect = 1e-29 Identities = 69/186 (37%), Positives = 117/186 (62%), Gaps = 1/186 (0%) Frame = -1 Query: 701 GSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATN 522 G+++ RK+N + + I I+ QT+ +K P +++ SVA+++L + S+ + Sbjct: 4 GNEDSRRKRN-RCFLYIAAGIILQTIIIVLFVVFVMRIKTPKVRLDSVAVDSLTANSSPS 62 Query: 521 T-SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 + SF +++ VTVKNKN+G YKF +T SY G VG+ I + KAK + T+ ++VT+ Sbjct: 63 SPSFKVQINALVTVKNKNFGHYKFEGSKVTFSYKGTAVGEGTIAKAKAKAKRTKKINVTV 122 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 + S+ +S +S L+SDL GNLT AY KL+G+V + +IKK KS +NCT+ ++T+++ Sbjct: 123 SLNSNKVSSHSQLSSDLSSGNLTLTAYAKLDGKVHLFKVIKKKKSANLNCTVHVDTKAKV 182 Query: 164 VQQIIC 147 V + C Sbjct: 183 VHVLTC 188 >ref|XP_007022216.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721844|gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 259 Score = 135 bits (339), Expect = 2e-29 Identities = 72/186 (38%), Positives = 115/186 (61%), Gaps = 2/186 (1%) Frame = -1 Query: 698 SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNT 519 S+E RKK K + + ++FQT +K P +I SV +++L +++ Sbjct: 13 SKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSS-P 71 Query: 518 SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPE--RKAKLRSTRHMDVTI 345 SFNM+ I +VTVKN N+G YKF N +T +Y G+ VG+A + + +A+ RST+ M+VT+ Sbjct: 72 SFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVTM 131 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 D+ S+ ++ +S+L SDL+ G LT + LNG+V +M +IKK KS +MNCT+ +N + Sbjct: 132 DLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQKL 191 Query: 164 VQQIIC 147 V+ I C Sbjct: 192 VRDIKC 197 >ref|XP_007038869.1| Uncharacterized protein TCM_015287 [Theobroma cacao] gi|508776114|gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao] Length = 214 Score = 132 bits (333), Expect = 1e-28 Identities = 68/193 (35%), Positives = 115/193 (59%) Frame = -1 Query: 725 NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIEN 546 +DEE V S+E +KK K ++ IV+ VFQT ++ P ++ S + Sbjct: 21 SDEESVAAHSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVMRIRNPKFRVRSGSFTT 80 Query: 545 LNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRST 366 N + + SF++++ T+ TVKN N+G +K+ G++T +Y G VG A I + +A+ RST Sbjct: 81 FNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTPVGRATIQKARARARST 140 Query: 365 RHMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIA 186 + +DV ++++S+ L + L D+ G LT + KL+G++ +M +IKK KS QMNCT+ Sbjct: 141 KKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLMKVIKKKKSTQMNCTMD 200 Query: 185 INTRSQTVQQIIC 147 + ++TV+ IIC Sbjct: 201 VAIDTRTVRNIIC 213 >ref|XP_004300827.1| PREDICTED: uncharacterized protein LOC101295341 [Fragaria vesca subsp. vesca] Length = 211 Score = 132 bits (333), Expect = 1e-28 Identities = 72/186 (38%), Positives = 105/186 (56%) Frame = -1 Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525 + E RKK KL I + IVFQ + VK P + S+ +E LN AT Sbjct: 26 VSEDELKRKKRIKLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKARWGSIDVETLNYVPAT 85 Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 SF+ T++ +KN NWG YKF G T Y G T+G IP+ KA +RST+ +DV + Sbjct: 86 -PSFDTTFETQIRIKNTNWGPYKFDAGTATFLYQGVTIGKVDIPKSKAGMRSTKKIDVEV 144 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 + ++ L +S L ++L G LT + V+L G+V +M ++KKNK+ M+CTIA + S+T Sbjct: 145 SLNTNALPNSSALGTELSSGVLTLTSQVQLKGKVELMLIMKKNKNASMDCTIAFDLSSKT 204 Query: 164 VQQIIC 147 VQ + C Sbjct: 205 VQSLQC 210 >ref|XP_004300825.1| PREDICTED: uncharacterized protein LOC101294764 [Fragaria vesca subsp. vesca] Length = 182 Score = 132 bits (331), Expect = 2e-28 Identities = 65/176 (36%), Positives = 103/176 (58%) Frame = -1 Query: 674 NFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNTSFNMRLIT 495 N K + + + IVFQ + +K P ++ + + N NS S+T SF+ L+T Sbjct: 6 NKKCLAYVAIFIVFQIIVITIFALTVMKIKGPKVRFQTATVSNFNSDSSTAASFSGDLVT 65 Query: 494 KVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSDWLSRN 315 K VKN N+G +K+ N +++ Y G +G A +P +KAK RSTR D+TI + S LS Sbjct: 66 KFAVKNTNFGHFKYPNSTVSILYEGQVIGTAAVPSQKAKARSTRRTDITISIDSSKLSGT 125 Query: 314 SNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQTVQQIIC 147 +NL + + G + + L G+V VM +IKKNKSG+M+CT+ +N +++TV + C Sbjct: 126 TNLTTAIGAGVVPLTSESTLKGKVEVMKIIKKNKSGKMSCTMLLNLKTRTVDDLKC 181 >ref|XP_007040368.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] gi|508777613|gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein [Theobroma cacao] Length = 226 Score = 129 bits (324), Expect = 1e-27 Identities = 60/170 (35%), Positives = 106/170 (62%) Frame = -1 Query: 689 RARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNTSFN 510 R N K + + +VFQT ++ P ++ +V +E+ ++ ++++ SF+ Sbjct: 4 RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSSPSFD 63 Query: 509 MRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTIDVTSD 330 M+L+ +V VKN N+G +K+ N +T+ YGG VG+A I + +A+ R T+ ++ +D++S Sbjct: 64 MKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSS 123 Query: 329 WLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAIN 180 LS NSNL +D++ G L ++ KL G+V +M +IKK KSG+M+CT+ IN Sbjct: 124 RLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGIN 173 >ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca subsp. vesca] Length = 211 Score = 129 bits (323), Expect = 2e-27 Identities = 66/186 (35%), Positives = 106/186 (56%) Frame = -1 Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525 + E R+K +L I + IVFQ + VK P +++ + +++ NS AT Sbjct: 26 VSKDELKRRKRIRLFTYIGIFIVFQIIVMTVFGLTVMKVKTPKVRLGEINVQDFNSVPAT 85 Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 SF+ T++ VKN NWG YKF +T Y G VG +P+ KA +RST+ M+V + Sbjct: 86 -PSFDTTFTTQIRVKNTNWGPYKFDASTVTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEV 144 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 + ++ L +SNL S+L+ G LT N+ KL+G+V +M ++KK KS M+C I + ++T Sbjct: 145 SLNANGLPSSSNLGSELNSGVLTLNSQAKLSGKVELMLIMKKKKSSTMDCMIGFDLSTKT 204 Query: 164 VQQIIC 147 V+ + C Sbjct: 205 VKSLQC 210 >ref|XP_004300832.1| PREDICTED: uncharacterized protein LOC101296778 [Fragaria vesca subsp. vesca] Length = 237 Score = 128 bits (321), Expect = 3e-27 Identities = 71/194 (36%), Positives = 108/194 (55%) Frame = -1 Query: 728 GNDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIE 549 G D + E R+K KL I +LIVFQ + VK P +++ ++ I+ Sbjct: 44 GEDHSTTFQFNDEIKRQKRMKLYKCIGILIVFQIIILTVFALTVMKVKTPKVRLGAINIQ 103 Query: 548 NLNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 369 +LNS AT SF+ T++ VKN NWG +KF T SY G VG IP+ KA++RS Sbjct: 104 SLNSVPAT-PSFDASFTTQIRVKNPNWGPFKFDASTATFSYQGVPVGQVVIPKSKARMRS 162 Query: 368 TRHMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTI 189 T+ + VT+ V S L +S+L S+L G LT ++ K++G+V +M + KK K+ QM C I Sbjct: 163 TKKIGVTVSVNSKALPSSSDLGSELKNGVLTLSSQAKVSGKVEIMSVTKKRKTAQMYCAI 222 Query: 188 AINTRSQTVQQIIC 147 + ++ +Q + C Sbjct: 223 VFDLSTKAIQTLQC 236 >ref|XP_007022217.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] gi|508721845|gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein, putative [Theobroma cacao] Length = 192 Score = 127 bits (320), Expect = 4e-27 Identities = 67/188 (35%), Positives = 118/188 (62%), Gaps = 3/188 (1%) Frame = -1 Query: 701 GSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLN-STSAT 525 G Q K+N K ++V ++ +T+ ++ P +++ V +ENL S+S++ Sbjct: 4 GDQTSRGKRNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSS 63 Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 + SF+ +L +V+VKN N+G +KF N +T+SY G+ VG A I E A+ RST+ +VTI Sbjct: 64 SPSFSTKLNAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTI 123 Query: 344 DVTS-DWLSRNSN-LASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRS 171 V+S + +SRNS+ L+SD++ G + +++ KL G++ + + KK KS +MNCT+ +NT Sbjct: 124 LVSSNNKISRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSL 183 Query: 170 QTVQQIIC 147 + +Q++ C Sbjct: 184 KQIQKLTC 191 >gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis] Length = 212 Score = 127 bits (319), Expect = 5e-27 Identities = 68/189 (35%), Positives = 105/189 (55%), Gaps = 1/189 (0%) Frame = -1 Query: 725 NDEERVNLGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISS-VAIE 549 +DEE NL ++E R+K KL + + Q + VK P +++S + Sbjct: 20 SDEESSNLDAKELKRRKRIKLAIYAFIFTASQIIVTLVFVLVVMRVKSPKLRLSDKFEFQ 79 Query: 548 NLNSTSATNTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRS 369 + + S + SF++ T++ VKN NWG YKF N +Y G TVG IP+ KA +RS Sbjct: 80 TIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDNTTAAFAYEGETVGQVVIPKGKAGMRS 139 Query: 368 TRHMDVTIDVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTI 189 T+ + V++ ++S L N+NL S+L G LT K+ G+V +M ++KK KS MNCTI Sbjct: 140 TKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRCTAKMTGKVKLMLIMKKKKSANMNCTI 199 Query: 188 AINTRSQTV 162 I+ + +TV Sbjct: 200 NIHVKEKTV 208 >gb|EYU40056.1| hypothetical protein MIMGU_mgv1a022222mg [Mimulus guttatus] Length = 192 Score = 125 bits (315), Expect = 1e-26 Identities = 63/190 (33%), Positives = 114/190 (60%), Gaps = 6/190 (3%) Frame = -1 Query: 698 SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSATNT 519 S+ +K + K + + V +VFQ +K P I+++++A+E+ +S++ N Sbjct: 2 SKGDGKKSSKKCLAYVAVFVVFQAAVIMVLALTVMKIKSPKIRLNAIAVESFSSSNNGNN 61 Query: 518 -----SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMD 354 S NM+L+T++T+KN N+G++K+ N + + Y G +G+A IP + K R T + Sbjct: 62 AGPTPSINMKLLTQLTIKNTNFGQFKYDNATLAILYNGVPLGEAVIPRGRVKARKTLKFN 121 Query: 353 VTIDVTSDWLS-RNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINT 177 V+ D+ SD L+ N+NL +D++ G L ++ ++NG+V +M +IKKNKSG MNC +N Sbjct: 122 VSFDLNSDRLNGNNTNLGNDINSGVLRLSSQARVNGKVHLMKIIKKNKSGNMNCDWIVNL 181 Query: 176 RSQTVQQIIC 147 ++ V+ + C Sbjct: 182 ATRMVENLNC 191 >ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca subsp. vesca] Length = 211 Score = 125 bits (315), Expect = 1e-26 Identities = 66/186 (35%), Positives = 106/186 (56%) Frame = -1 Query: 704 LGSQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLNSTSAT 525 + E R+K KL M I + IV Q + VK P +++ + +++LNS AT Sbjct: 26 VSEDELKRQKRRKLFMYIGIFIVVQIIVMTVFGLTVMKVKTPKVRLGGINVQSLNSVPAT 85 Query: 524 NTSFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 SF+ T++ VKN NWG YKF T Y G VG IP+ KA++RST+ + V++ Sbjct: 86 -PSFDTSFTTQIRVKNTNWGPYKFDASTATFMYQGVAVGQVSIPKSKARMRSTKKISVSV 144 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 + ++ L +S + ++L+ G LT + KL G+V +M ++KK KS M+CTIA + ++T Sbjct: 145 ILNTNALPSSSTIGTELNSGILTLTSQAKLTGKVELMLIMKKKKSATMDCTIAFDLSTKT 204 Query: 164 VQQIIC 147 V+ + C Sbjct: 205 VKSLQC 210 >gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis] Length = 213 Score = 125 bits (314), Expect = 2e-26 Identities = 74/186 (39%), Positives = 111/186 (59%), Gaps = 2/186 (1%) Frame = -1 Query: 698 SQERARKKNFKLIMLIVVLIVFQTMXXXXXXXXXXXVKRPNIKISSVAIENLN-STSATN 522 ++E KK + + + ++V T+ +K P ++I SVAIE+L S S TN Sbjct: 28 AKELQHKKRMRRLGGVTAIVVLLTVVILVFPQTVMRIKGPELRIRSVAIEDLTISNSDTN 87 Query: 521 T-SFNMRLITKVTVKNKNWGEYKFYNGIMTLSYGGATVGDAQIPERKAKLRSTRHMDVTI 345 + S +M+ +++ VKN N+GE+KF +T Y G VGDA + + KAK RST+ M+VT Sbjct: 88 SPSLSMKFDSEIGVKNTNFGEFKFDESSITFVYKGTEVGDASVEKGKAKARSTKKMNVTA 147 Query: 344 DVTSDWLSRNSNLASDLDLGNLTFNAYVKLNGQVTVMFLIKKNKSGQMNCTIAINTRSQT 165 +V + NSNLA+D+ G LT + KLNG+V +M +IKK K+ +MNCTI IN ++ Sbjct: 148 EVNA-----NSNLANDVRSGFLTLTSQSKLNGKVHLMKVIKKKKTAEMNCTITINLENKV 202 Query: 164 VQQIIC 147 VQ C Sbjct: 203 VQDFKC 208