BLASTX nr result
ID: Cornus23_contig00025067
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00025067
         (734 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_008220943.1| PREDICTED: uncharacterized protein LOC103320...  261   3e-67
ref|XP_010648423.1| PREDICTED: uncharacterized protein LOC100252...  259   1e-66
ref|XP_010648352.1| PREDICTED: uncharacterized protein LOC100252...  259   1e-66
emb|CBI26785.3| unnamed protein product [Vitis vinifera]             259   1e-66
ref|XP_009351588.1| PREDICTED: uncharacterized protein LOC103943...  253   9e-65
ref|XP_010105545.1| hypothetical protein L484_019288 [Morus nota...  251   5e-64
ref|XP_010648465.1| PREDICTED: uncharacterized protein LOC100252...  249   1e-63
ref|XP_008360022.1| PREDICTED: uncharacterized protein LOC103423...  248   4e-63
ref|XP_008365818.1| PREDICTED: uncharacterized protein LOC103429...  247   5e-63
ref|XP_008381778.1| PREDICTED: uncharacterized protein LOC103444...  247   5e-63
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...  229   2e-57
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]  229   2e-57
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...  224   6e-56
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...  223   1e-55
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...  223   1e-55
gb|KHN28877.1| hypothetical protein glysoja_025671 [Glycine soja]    219   1e-54
ref|XP_010271173.1| PREDICTED: uncharacterized protein LOC104607...  219   1e-54
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...
218 3e-54 gb|KHN25979.1| hypothetical protein glysoja_018833 [Glycine soja] 217 6e-54 ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809... 217 6e-54 >ref|XP_008220943.1| PREDICTED: uncharacterized protein LOC103320980 [Prunus mume] Length = 691 Score = 261 bits (667), Expect = 3e-67 Identities = 146/261 (55%), Positives = 179/261 (68%), Gaps = 24/261 (9%) Frame = -2 Query: 715 DIHYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRC 542 +I HH+QW D DGF+ WLRGEFAAANAIID LC HLR +GEPGEYD V+GCIQQRRC Sbjct: 29 EIPQHHRQWFPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRC 88 Query: 541 NWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVA-SRQAQRVEIV 365 NW+PVLHMQ YFSVAEV+YALQ VAWRRQQ ++DPVK KEFKRSGV ++ QR E Sbjct: 89 NWNPVLHMQQYFSVAEVIYALQHVAWRRQQRYYDPVKAGAKEFKRSGVGFNKGQQRAEAF 148 Query: 364 RDGHNSNVEYYSQAGNSTG------SEKGGEVEKDEEP-----KLRDEVV--------KL 242 ++GHNS +E +S GNS+G E+G EV ++ EP KL D+ + L Sbjct: 149 KEGHNSTLESHSNDGNSSGVVAPEKFERGSEVGEEVEPGGEVGKLNDKGLAPAGEKKDAL 208 Query: 241 XNPQEDSYIRSSGNSHET-TGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQ-THEKK 68 PQEDS +RS GNS T + NSEPEV E+ DGCT +SK N+SH +Q ++K+ Sbjct: 209 TKPQEDSNLRSFGNSQGTISENSEPEVVEV-DGCTPSSK-------VNESHSIQIQNQKQ 260 Query: 67 NLKITPKTFVGTEMFEGKAIN 5 NL I PKTF+G E +GK +N Sbjct: 261 NLSIVPKTFIGNETSDGKTVN 281 >ref|XP_010648423.1| PREDICTED: uncharacterized protein LOC100252594 isoform X2 [Vitis vinifera] Length = 704 Score = 259 bits (662), Expect = 1e-66 Identities = 143/262 (54%), Positives = 173/262 (66%), Gaps = 27/262 (10%) Frame = -2 Query: 706 YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWH 533 +HH+QW D DGF+ WLRGEFAAANAIID LC HLRLIGEPGEYD V+GCIQQRR NW Sbjct: 32 HHHRQWFPDERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWS 91 Query: 532 PVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDGH 353 VLHMQ YFSVAEV+YALQQV WRRQQ H DPVK KE+KR GVA RQ QR E +D H Sbjct: 92 SVLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSH 151 Query: 352 
NSNVEYYSQAGNSTGS-EKGGEVEK-----------DEEPKLRDEVV-----------KL 242 NSN E +S NS+G+ EKG V + D KL D+ + + Sbjct: 152 NSNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAV 211 Query: 241 XNPQEDSYIRSSGNSH-ETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEKK 68 P +S +SS NS G SE E ++DDG T N KG CN++++N++H +Q +EK Sbjct: 212 AKPNANSCSKSSENSEGSRCGISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKP 271 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 N +PKTFVGTE+F+GKA+NV Sbjct: 272 NPTTSPKTFVGTEIFDGKAVNV 293 >ref|XP_010648352.1| PREDICTED: uncharacterized protein LOC100252594 isoform X1 [Vitis vinifera] gi|731369021|ref|XP_010648386.1| PREDICTED: uncharacterized protein LOC100252594 isoform X1 [Vitis vinifera] Length = 705 Score = 259 bits (662), Expect = 1e-66 Identities = 143/262 (54%), Positives = 173/262 (66%), Gaps = 27/262 (10%) Frame = -2 Query: 706 YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWH 533 +HH+QW D DGF+ WLRGEFAAANAIID LC HLRLIGEPGEYD V+GCIQQRR NW Sbjct: 32 HHHRQWFPDERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWS 91 Query: 532 PVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDGH 353 VLHMQ YFSVAEV+YALQQV WRRQQ H DPVK KE+KR GVA RQ QR E +D H Sbjct: 92 SVLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSH 151 Query: 352 NSNVEYYSQAGNSTGS-EKGGEVEK-----------DEEPKLRDEVV-----------KL 242 NSN E +S NS+G+ EKG V + D KL D+ + + Sbjct: 152 NSNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAV 211 Query: 241 XNPQEDSYIRSSGNSH-ETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEKK 68 P +S +SS NS G SE E ++DDG T N KG CN++++N++H +Q +EK Sbjct: 212 AKPNANSCSKSSENSEGSRCGISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKP 271 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 N +PKTFVGTE+F+GKA+NV Sbjct: 272 NPTTSPKTFVGTEIFDGKAVNV 293 >emb|CBI26785.3| unnamed protein product [Vitis vinifera] Length = 672 Score = 259 bits (662), Expect = 1e-66 Identities = 143/262 (54%), Positives = 173/262 (66%), Gaps = 27/262 (10%) Frame = -2 Query: 706 YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWH 533 +HH+QW D DGF+ 
WLRGEFAAANAIID LC HLRLIGEPGEYD V+GCIQQRR NW Sbjct: 32 HHHRQWFPDERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWS 91 Query: 532 PVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDGH 353 VLHMQ YFSVAEV+YALQQV WRRQQ H DPVK KE+KR GVA RQ QR E +D H Sbjct: 92 SVLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSH 151 Query: 352 NSNVEYYSQAGNSTGS-EKGGEVEK-----------DEEPKLRDEVV-----------KL 242 NSN E +S NS+G+ EKG V + D KL D+ + + Sbjct: 152 NSNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAV 211 Query: 241 XNPQEDSYIRSSGNSH-ETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEKK 68 P +S +SS NS G SE E ++DDG T N KG CN++++N++H +Q +EK Sbjct: 212 AKPNANSCSKSSENSEGSRCGISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQNEKP 271 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 N +PKTFVGTE+F+GKA+NV Sbjct: 272 NPTTSPKTFVGTEIFDGKAVNV 293 >ref|XP_009351588.1| PREDICTED: uncharacterized protein LOC103943111 [Pyrus x bretschneideri] gi|694320826|ref|XP_009351589.1| PREDICTED: uncharacterized protein LOC103943111 [Pyrus x bretschneideri] Length = 690 Score = 253 bits (646), Expect = 9e-65 Identities = 141/262 (53%), Positives = 173/262 (66%), Gaps = 24/262 (9%) Frame = -2 Query: 715 DIHYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRC 542 +IH H +QW D DGF+ WLRGEFAAAN IID LC HLR +G+PGEYD V+GCIQQRRC Sbjct: 29 EIHQHPRQWFPDERDGFISWLRGEFAAANTIIDSLCHHLRAVGDPGEYDVVIGCIQQRRC 88 Query: 541 NWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVA-SRQAQRVEIV 365 NW+PVLHMQ YFSVAEV+YALQ VAWRRQQ +DPVKV KE+KRS ++ QR E Sbjct: 89 NWNPVLHMQQYFSVAEVIYALQHVAWRRQQMQYDPVKVGTKEYKRSASGFNKDQQRAEHF 148 Query: 364 RDGHNSNVEYYSQAGNSTGSEKGGEVEKD----EEPKLRDEVVKLXN------------- 236 ++GHN E +S GNS+G +VE+ EE K R EV KL + Sbjct: 149 KEGHNFRTEVHSYDGNSSGLVASEKVERGSDVAEEVKPRGEVGKLDDNGLAPAGEKKDAL 208 Query: 235 --PQEDSYIRSSGNSHETT-GNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQ-THEKK 68 PQEDS +RSSGNS +T N EPEV + DGCT +SK +N+SH +Q + K+ Sbjct: 209 TKPQEDSRLRSSGNSQQTIYCNLEPEV-AVGDGCTSSSK-------ENESHSIQIQNAKQ 260 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 NL + PKTFVG E+ 
+GK +NV Sbjct: 261 NLPVVPKTFVGNELIDGKTVNV 282 >ref|XP_010105545.1| hypothetical protein L484_019288 [Morus notabilis] gi|587917472|gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis] Length = 681 Score = 251 bits (640), Expect = 5e-64 Identities = 133/253 (52%), Positives = 164/253 (64%), Gaps = 17/253 (6%) Frame = -2 Query: 709 HYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNW 536 H++++QW D DGF+ WLRGEFAAANA+ID LC HLR +GEPGEYD V+ CIQ RRCNW Sbjct: 28 HHNNRQWFPDERDGFISWLRGEFAAANAMIDSLCHHLRAVGEPGEYDAVIACIQLRRCNW 87 Query: 535 HPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDG 356 +PVLHMQ YFSVAEVM+ALQQVAWRRQQ +DPVK+ KEFKRSGV +Q QR + +DG Sbjct: 88 NPVLHMQQYFSVAEVMFALQQVAWRRQQRFYDPVKMGNKEFKRSGVGFKQWQRNDSFKDG 147 Query: 355 HNSNVEYYSQAGNST----GSEKGGEVEKDEE----------PKLRDEVVKLXNPQEDSY 218 NS E + GNS+ SEKGG + +E P +++ QED Sbjct: 148 RNSAAESHCLDGNSSFGNAASEKGGSDKSGDEVGNSDDRGSMPAAKEKNDSAAKSQEDGN 207 Query: 217 IRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFM-QTHEKKNLKITPKTF 41 ++S GN SEPEV +DDGCT +SK +NDSH + +E NL PKTF Sbjct: 208 VKSLGNFEGVVSGSEPEVHAVDDGCTSSSK-------ENDSHSTPKQNENSNLANVPKTF 260 Query: 40 VGTEMFEGKAINV 2 G EMF+GK +NV Sbjct: 261 SGNEMFDGKPVNV 273 >ref|XP_010648465.1| PREDICTED: uncharacterized protein LOC100252594 isoform X3 [Vitis vinifera] Length = 699 Score = 249 bits (636), Expect = 1e-63 Identities = 140/262 (53%), Positives = 170/262 (64%), Gaps = 27/262 (10%) Frame = -2 Query: 706 YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWH 533 +HH+QW D DGF+ WLRGEFAAANAIID LC HLRLIGEPGEYD V+GCIQQRR NW Sbjct: 32 HHHRQWFPDERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWS 91 Query: 532 PVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDGH 353 VLHMQ YFSVAEV+YALQQV WRRQQ H DPVK KE+KR GVA RQ QR E +D H Sbjct: 92 SVLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSH 151 Query: 352 NSNVEYYSQAGNSTGS-EKGGEVEK-----------DEEPKLRDEVV-----------KL 242 NSN E +S NS+G+ EKG V + D KL D+ + + Sbjct: 152 
NSNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLAAAEEKKAGTDAV 211 Query: 241 XNPQEDSYIRSSGNSH-ETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEKK 68 P +S +SS NS G SE E ++DDG G CN++++N++H +Q +EK Sbjct: 212 AKPNANSCSKSSENSEGSRCGISETEANDMDDG------GSCNMIMENNAHPVQNQNEKP 265 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 N +PKTFVGTE+F+GKA+NV Sbjct: 266 NPTTSPKTFVGTEIFDGKAVNV 287 >ref|XP_008360022.1| PREDICTED: uncharacterized protein LOC103423718 [Malus domestica] Length = 687 Score = 248 bits (632), Expect = 4e-63 Identities = 141/252 (55%), Positives = 169/252 (67%), Gaps = 20/252 (7%) Frame = -2 Query: 697 QQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWHPVL 524 +QW D DGF+ WLRGEFAAANAIID LC HLR++GEPGEYDGV+ CIQQRRCNW+PVL Sbjct: 35 RQWFPDERDGFISWLRGEFAAANAIIDSLCHHLRVVGEPGEYDGVISCIQQRRCNWNPVL 94 Query: 523 HMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQ-RVEIVRDGHNS 347 HMQ YFSVAEV+YALQ VAWRRQQ +D VKV KE+KRSG + Q R E ++GHN Sbjct: 95 HMQQYFSVAEVIYALQHVAWRRQQRQYDHVKVGAKEYKRSGSGFNKGQHRAEHFKEGHNF 154 Query: 346 NVEYYSQAGNSTGSEKGGEVEKD----EEPKLRDEVVKL-----------XNPQEDSYIR 212 + E +S GNS+G +VE+ EE K EV KL PQEDS +R Sbjct: 155 STEVHSYDGNSSGLXASEKVERGSEVAEELKPGGEVGKLDGNGLAAAGEKTEPQEDSRLR 214 Query: 211 SSGNSHETT-GNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQ-THEKKNLKITPKTFV 38 SS NS T GNSEPEV + DGCT +SK +N+SH +Q + K+NL I PKTFV Sbjct: 215 SSENSQLTIYGNSEPEV-AVGDGCTSSSK-------ENESHSIQIQNAKQNLSIVPKTFV 266 Query: 37 GTEMFEGKAINV 2 G E+ +GK +NV Sbjct: 267 GNELLDGKTVNV 278 >ref|XP_008365818.1| PREDICTED: uncharacterized protein LOC103429447, partial [Malus domestica] Length = 640 Score = 247 bits (631), Expect = 5e-63 Identities = 138/262 (52%), Positives = 173/262 (66%), Gaps = 24/262 (9%) Frame = -2 Query: 715 DIHYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRC 542 +IH H +QWL D DGF+ WLRGEFAAAN IID LC HLR +G+PGEYD V+GCIQQRRC Sbjct: 29 EIHQHPRQWLPDERDGFISWLRGEFAAANTIIDSLCHHLRAVGDPGEYDVVIGCIQQRRC 88 Query: 541 NWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVA-SRQAQRVEIV 365 NW+PVLHMQ YFSVAEV+YALQ VAWRRQ 
+DPVKV KE+KRS ++ QR E Sbjct: 89 NWNPVLHMQQYFSVAEVIYALQHVAWRRQHMQYDPVKVGTKEYKRSASGFNKDQQRAEHF 148 Query: 364 RDGHNSNVEYYSQAGNSTG------SEKGGEVEKDEEP-----KLRDEVV--------KL 242 ++GHN E +S GNS+G E+G +V ++ +P KL D+ + L Sbjct: 149 KEGHNFRTEVHSYDGNSSGLVASEKVERGSDVAEEVKPHGEVGKLDDKGLAPAGEKKDAL 208 Query: 241 XNPQEDSYIRSSGNSHETT-GNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQTH-EKK 68 PQEDS +RSSGNS +T N EPEV + DGCT SK +N+SH +Q ++ Sbjct: 209 TKPQEDSRLRSSGNSQQTIYCNLEPEV-AVGDGCTSISK-------ENESHSIQIQIAQQ 260 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 NL + PKTFVG E+ +GK +NV Sbjct: 261 NLPVVPKTFVGNELIDGKTVNV 282 >ref|XP_008381778.1| PREDICTED: uncharacterized protein LOC103444603 [Malus domestica] Length = 690 Score = 247 bits (631), Expect = 5e-63 Identities = 138/262 (52%), Positives = 173/262 (66%), Gaps = 24/262 (9%) Frame = -2 Query: 715 DIHYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRC 542 +IH H +QWL D DGF+ WLRGEFAAAN IID LC HLR +G+PGEYD V+GCIQQRRC Sbjct: 29 EIHQHPRQWLPDERDGFISWLRGEFAAANTIIDSLCHHLRAVGDPGEYDVVIGCIQQRRC 88 Query: 541 NWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVA-SRQAQRVEIV 365 NW+PVLHMQ YFSVAEV+YALQ VAWRRQ +DPVKV KE+KRS ++ QR E Sbjct: 89 NWNPVLHMQQYFSVAEVIYALQHVAWRRQHMQYDPVKVGTKEYKRSASGFNKDQQRAEHF 148 Query: 364 RDGHNSNVEYYSQAGNSTG------SEKGGEVEKDEEP-----KLRDEVV--------KL 242 ++GHN E +S GNS+G E+G +V ++ +P KL D+ + L Sbjct: 149 KEGHNFRTEVHSYDGNSSGLVASEKVERGSDVAEEVKPHGEVGKLDDKGLAPAGEKKDAL 208 Query: 241 XNPQEDSYIRSSGNSHETT-GNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQTH-EKK 68 PQEDS +RSSGNS +T N EPEV + DGCT SK +N+SH +Q ++ Sbjct: 209 TKPQEDSRLRSSGNSQQTIYCNLEPEV-AVGDGCTSISK-------ENESHSIQIQIAQQ 260 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 NL + PKTFVG E+ +GK +NV Sbjct: 261 NLPVVPKTFVGNELIDGKTVNV 282 >ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca subsp. 
vesca] Length = 682 Score = 229 bits (583), Expect = 2e-57 Identities = 131/254 (51%), Positives = 165/254 (64%), Gaps = 16/254 (6%) Frame = -2 Query: 715 DIHYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRC 542 +IH +QW D DGF+ WLRGEFAAANAIID LC HLR +GEP EYD V+GC+QQRRC Sbjct: 28 EIHQQPRQWFPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPSEYDMVIGCVQQRRC 87 Query: 541 NWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVR 362 NW PVLHMQ YFSVAEV+YALQQVAWRRQQ +++PVK+ K++KRS R E V+ Sbjct: 88 NWTPVLHMQQYFSVAEVIYALQQVAWRRQQRYYEPVKMGNKDYKRSNSGVGFKPRNEPVK 147 Query: 361 DGHNSNVEYYSQAGN---STGSE------KGGEVEK-DEEPKLRDEVVK--LXNPQEDSY 218 + H ++VEY S G+ GSE GGE K D++ V K L P E Sbjct: 148 EWHTASVEYRSYDGSGLEKVGSEMREEVKPGGEAGKVDDKGSAAGAVTKGVLTKPHEYIS 207 Query: 217 IRSSGNSHET-TGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQ-THEKKNLKITPKT 44 RSS NS T +GNSE E +++GCT + K +N+S+ +Q +EK+NL + PKT Sbjct: 208 SRSSANSQGTISGNSESEDAVVNEGCTSSIK-------ENESNSIQIQNEKQNLSLIPKT 260 Query: 43 FVGTEMFEGKAINV 2 FVG E F+GK +NV Sbjct: 261 FVGNETFDGKTVNV 274 >emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera] Length = 1145 Score = 229 bits (583), Expect = 2e-57 Identities = 133/271 (49%), Positives = 163/271 (60%), Gaps = 36/271 (13%) Frame = -2 Query: 706 YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWH 533 +HH+QW D DGF+ WLRGEFAAANAIID LC HLRLIGEPGEYD V+GCIQQRR NW Sbjct: 30 HHHRQWFPDERDGFISWLRGEFAAANAIIDSLCNHLRLIGEPGEYDAVIGCIQQRRYNWS 89 Query: 532 PVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDGH 353 VLHMQ YFSVAEV+YALQQV WRRQQ H DPVK KE+KR GVA RQ QR E +D H Sbjct: 90 SVLHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKRYGVAYRQGQRGETAKDSH 149 Query: 352 NSNVEYYSQAGNSTGS-EKGGEVEK-----------DEEPKLRD---------------- 257 NSN E +S NS+G+ EKG V + D KL D Sbjct: 150 NSNFENHSHDANSSGTLEKGERVSEIYDDVKGGDKGDVVGKLEDKDLSAAAEKKEVMNFV 209 Query: 256 -----EVVKLXNPQEDSYIRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSH 92 E + L NP + + R +T + + + L CN++++N++H Sbjct: 210 
IFGQLEQMLLQNPMQIAVRRVQ----KTQKDPDVAFQRLRPMTWMMEARSCNMIMENNAH 265 Query: 91 FMQT-HEKKNLKITPKTFVGTEMFEGKAINV 2 +Q +EK N +PKTFVGTE+F+GKA+NV Sbjct: 266 PVQNQNEKPNPTTSPKTFVGTEIFDGKAVNV 296 >ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] gi|462422058|gb|EMJ26321.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica] Length = 650 Score = 224 bits (570), Expect = 6e-56 Identities = 127/250 (50%), Positives = 155/250 (62%), Gaps = 12/250 (4%) Frame = -2 Query: 715 DIHYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRC 542 +I HH+QW D DGF+ WLRGEFAAANAIID LC HLR +GEPGEYD V+GCIQQRRC Sbjct: 29 EIAQHHRQWFPDERDGFISWLRGEFAAANAIIDSLCHHLRAVGEPGEYDVVIGCIQQRRC 88 Query: 541 NWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVA-SRQAQRVEIV 365 NW+PVLHMQ YFSVAEV+YALQ VAWRRQQ ++DPVK KEFKRSGV ++ QR E Sbjct: 89 NWNPVLHMQQYFSVAEVIYALQHVAWRRQQRYYDPVKAGAKEFKRSGVGFNKGQQRAEAF 148 Query: 364 RDGHNSNVEYYSQAGNSTG------SEKGGEVEKDEEPKLRDEVVKLXNPQEDSYIRSSG 203 ++GHNS +E +S GNS+G E+G EV ++ EP EV KL Sbjct: 149 KEGHNSTLESHSNDGNSSGVVAPEKFERGSEVGEEVEP--GGEVGKL------------- 193 Query: 202 NSHETTGNSEPEVEELDDGCTENSKG--PCNVLLDNDSHFMQ-THEKKNLKITPKTFVGT 32 N KG P N+SH +Q ++K+NL I PKTF+G Sbjct: 194 ----------------------NDKGLAPAGEKKVNESHSIQIQNQKQNLSIVPKTFIGN 231 Query: 31 EMFEGKAINV 2 E+ +GK +NV Sbjct: 232 EISDGKTVNV 241 >ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|590697545|ref|XP_007045470.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2 [Theobroma cacao] Length = 680 Score = 223 bits (567), Expect = 1e-55 Identities = 127/253 (50%), Positives = 165/253 (65%), Gaps = 15/253 (5%) Frame = -2 Query: 715 DIH-YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRR 545 +IH 
+HH+QWL D DGF++WLRGEFAA+NAIID LC HLR +GE GEY+ V+ CIQQRR Sbjct: 42 EIHQHHHRQWLPDERDGFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRR 101 Query: 544 CNWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIV 365 CNW+PVLHMQ YFSVAEV YALQQVAWRR+Q H++ KV KEFKRSG+ + QR+E+ Sbjct: 102 CNWNPVLHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK-GQRMEVA 160 Query: 364 RDGHNSNVEYYSQAGNST----------GSEKGGEVEK-DEEPKLRDEVVKLXNPQEDSY 218 ++G NS V+ GNST GSEK EV+ E K+ D+ ++D+ Sbjct: 161 KEGQNSGVD---SDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDT- 216 Query: 217 IRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEKKNLKITPKTF 41 S G++E E+++ GCT + K +ND +Q +EK+NL PKTF Sbjct: 217 -----GSKPHAGDAESVTEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTF 264 Query: 40 VGTEMFEGKAINV 2 VG EMF+GK +NV Sbjct: 265 VGNEMFDGKMVNV 277 >ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|590697542|ref|XP_007045469.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 681 Score = 223 bits (567), Expect = 1e-55 Identities = 127/253 (50%), Positives = 165/253 (65%), Gaps = 15/253 (5%) Frame = -2 Query: 715 DIH-YHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRR 545 +IH +HH+QWL D DGF++WLRGEFAA+NAIID LC HLR +GE GEY+ V+ CIQQRR Sbjct: 42 EIHQHHHRQWLPDERDGFIYWLRGEFAASNAIIDSLCHHLREVGEVGEYEAVIACIQQRR 101 Query: 544 CNWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIV 365 CNW+PVLHMQ YFSVAEV YALQQVAWRR+Q H++ KV KEFKRSG+ + QR+E+ Sbjct: 102 CNWNPVLHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFK-GQRMEVA 160 Query: 364 RDGHNSNVEYYSQAGNST----------GSEKGGEVEK-DEEPKLRDEVVKLXNPQEDSY 218 ++G NS V+ GNST GSEK EV+ E K+ D+ ++D+ Sbjct: 161 KEGQNSGVD---SDGNSTVTAVSERNERGSEKREEVKSCGEVGKVEDKCSTFTEDKKDT- 216 Query: 217 
IRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEKKNLKITPKTF 41 S G++E E+++ GCT + K +ND +Q +EK+NL PKTF Sbjct: 217 -----GSKPHAGDAESVTEDVNGGCTSSYK-------ENDLCSIQNQNEKQNLAAGPKTF 264 Query: 40 VGTEMFEGKAINV 2 VG EMF+GK +NV Sbjct: 265 VGNEMFDGKMVNV 277 >gb|KHN28877.1| hypothetical protein glysoja_025671 [Glycine soja] Length = 679 Score = 219 bits (559), Expect = 1e-54 Identities = 128/262 (48%), Positives = 162/262 (61%), Gaps = 24/262 (9%) Frame = -2 Query: 715 DIHYHH--QQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQR 548 +IH H QQW D DG + WLR EFAAANAIID LC HLR++G+PGEYD V+G IQQR Sbjct: 29 EIHQPHYCQQWFVDERDGLIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQR 88 Query: 547 RCNWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEI 368 RCNW+ VL MQ YFSVA+V YALQQVAWRRQQ DP+KV KE ++SG R QR E Sbjct: 89 RCNWNQVLMMQQYFSVADVAYALQQVAWRRQQRPLDPMKVGAKEVRKSGSGYRHGQRFES 148 Query: 367 VRDGHNSNVEYYSQAGN---STGSEKGGE-VEKDEEPKLRDEVVK--------------- 245 V++G+NS+VE YS N + G+EKG VEK EE K +V K Sbjct: 149 VKEGYNSSVESYSHDANVAVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDA 208 Query: 244 LXNPQEDSYIRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQTH-EKK 68 + N Q D ++S+ ++ + N E E ++DGC NSKG ND H +Q + + Sbjct: 209 ITNHQSDGSLKSARSTEGSLSNLESEA-VVNDGCISNSKG-------NDLHSVQNQSQSQ 260 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 +L KTF+G EMF+GK +NV Sbjct: 261 SLSNIAKTFIGNEMFDGKTVNV 282 >ref|XP_010271173.1| PREDICTED: uncharacterized protein LOC104607267 [Nelumbo nucifera] Length = 698 Score = 219 bits (558), Expect = 1e-54 Identities = 127/262 (48%), Positives = 153/262 (58%), Gaps = 28/262 (10%) Frame = -2 Query: 703 HHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNWHP 530 HH+QW D DGF+ WLRGEFAAANAIID LC HLR IGEP EYD V+ CIQQRRCNW+P Sbjct: 29 HHRQWFPDERDGFISWLRGEFAAANAIIDSLCHHLRSIGEPREYDVVISCIQQRRCNWNP 88 Query: 529 VLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRS---GVASRQAQRVEIVRD 359 VLHMQ YFS+AEVMYALQQVAWR+QQ HFD +K+ K+FK++ G+ SRQ R E V++ Sbjct: 89 VLHMQQYFSIAEVMYALQQVAWRKQQRHFDQMKITEKDFKKNGPQGIGSRQGHRAENVKE 148 Query: 358 
GHNSNVEYYSQAGNSTGSEKGGEVEKDEEPKLRDEVVKLXNPQEDSYIRSSGNSHETTG- 182 H SN E + N++ E EK EE + E VK E S + S E G Sbjct: 149 NHKSNSETHYLDANTSPQPVNMESEKTEEEPEKGEAVKQGAKVERSDDKGSALGEEREGG 208 Query: 181 -------------NSEP---------EVEELDDGCTENSKGPCNVLLDNDSHFMQTHEKK 68 NSE E+E +DDGC SKG N L + +Q Sbjct: 209 DSVEKSHSGSGLKNSENPERSEHENLEIEVVDDGCI--SKGTSNALQKGATDTIQVP--- 263 Query: 67 NLKITPKTFVGTEMFEGKAINV 2 PKTFVGTE+F+G +NV Sbjct: 264 ----IPKTFVGTEIFDGNVVNV 281 >ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max] gi|947093927|gb|KRH42512.1| hypothetical protein GLYMA_08G093800 [Glycine max] Length = 683 Score = 218 bits (555), Expect = 3e-54 Identities = 124/258 (48%), Positives = 160/258 (62%), Gaps = 22/258 (8%) Frame = -2 Query: 709 HYHHQQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQRRCNW 536 H++ QW D DG + WLR EFAAANAIID LC HLR++G+PGEYD V+G IQQRRCNW Sbjct: 37 HHYRPQWFVDERDGLIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVVGAIQQRRCNW 96 Query: 535 HPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEIVRDG 356 + VL MQ YFSVA+V YALQQVAWRRQQ DP+KV KE ++SG R QR E V++G Sbjct: 97 NQVLMMQQYFSVADVAYALQQVAWRRQQRPLDPMKVGAKEVRKSGSGYRHGQRFESVKEG 156 Query: 355 HNSNVEYYSQAGN---STGSEKGGE-VEKDEEPKLRDEVVK---------------LXNP 233 +NS+VE YS N + G+EKG VEK EE K +V K + N Sbjct: 157 YNSSVESYSHDANVAVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKKDAITNH 216 Query: 232 QEDSYIRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQTH-EKKNLKI 56 Q + ++S+ ++ + N E E ++DGC NSKG ND H +Q + ++L Sbjct: 217 QSEGSLKSARSTEGSLSNLESEA-VVNDGCISNSKG-------NDLHSVQNQSQSQSLSN 268 Query: 55 TPKTFVGTEMFEGKAINV 2 KTF+G EMF+GK +NV Sbjct: 269 IAKTFIGNEMFDGKTVNV 286 >gb|KHN25979.1| hypothetical protein glysoja_018833 [Glycine soja] Length = 685 Score = 217 bits (553), Expect = 6e-54 Identities = 127/263 (48%), Positives = 164/263 (62%), Gaps = 25/263 (9%) Frame = -2 Query: 715 DIHYHH--QQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQR 548 +IH H QQW D DG + WLR EFAAANAIID LC HLR++G+PGEYD V+G IQQR Sbjct: 29 
EIHQPHYCQQWFVDERDGLIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQR 88 Query: 547 RCNWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEI 368 RCNW+ VL MQ YFSVA+V +ALQQVAWRRQQ DPVKV KEF++SG R QR E Sbjct: 89 RCNWNQVLMMQQYFSVADVAHALQQVAWRRQQRPLDPVKVGAKEFRKSGSGYRHGQRFEP 148 Query: 367 VRDGHNSNVEYYSQAGNST----GSEKGGE-VEKDEEPKLRDEVVKLXNP---------- 233 V++G+NS+VE Y+Q + G+EKG VEK EE K +V K+ + Sbjct: 149 VKEGYNSSVESYNQYDANVTVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASAEDKKD 208 Query: 232 -----QEDSYIRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEK 71 Q D ++S+ ++ + N E E ++D C NSKG +DSH +Q H+ Sbjct: 209 AITKHQTDGSLKSTRSTEGSLSNLESEA-VVNDECISNSKG-------DDSHSVQNQHQS 260 Query: 70 KNLKITPKTFVGTEMFEGKAINV 2 ++L KTF+G EMF+GK +NV Sbjct: 261 QSLSTKAKTFIGNEMFDGKMVNV 283 >ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine max] gi|947110281|gb|KRH58607.1| hypothetical protein GLYMA_05G138600 [Glycine max] Length = 681 Score = 217 bits (553), Expect = 6e-54 Identities = 127/263 (48%), Positives = 164/263 (62%), Gaps = 25/263 (9%) Frame = -2 Query: 715 DIHYHH--QQWLTD--DGFMWWLRGEFAAANAIIDLLCQHLRLIGEPGEYDGVMGCIQQR 548 +IH H QQW D DG + WLR EFAAANAIID LC HLR++G+PGEYD V+G IQQR Sbjct: 29 EIHQPHYCQQWFVDERDGLIGWLRSEFAAANAIIDSLCHHLRVVGDPGEYDMVIGAIQQR 88 Query: 547 RCNWHPVLHMQHYFSVAEVMYALQQVAWRRQQWHFDPVKVPRKEFKRSGVASRQAQRVEI 368 RCNW+ VL MQ YFSVA+V +ALQQVAWRRQQ DPVKV KEF++SG R QR E Sbjct: 89 RCNWNQVLMMQQYFSVADVAHALQQVAWRRQQRPLDPVKVGAKEFRKSGSGYRHGQRFEP 148 Query: 367 VRDGHNSNVEYYSQAGNST----GSEKGGE-VEKDEEPKLRDEVVKLXNP---------- 233 V++G+NS+VE Y+Q + G+EKG VEK EE K +V K+ + Sbjct: 149 VKEGYNSSVESYNQYDANVTVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASAEDKKD 208 Query: 232 -----QEDSYIRSSGNSHETTGNSEPEVEELDDGCTENSKGPCNVLLDNDSHFMQT-HEK 71 Q D ++S+ ++ + N E E ++D C NSKG +DSH +Q H+ Sbjct: 209 AITKHQTDGSLKSTRSTEGSLSNLESEA-VVNDECISNSKG-------DDSHSVQNQHQS 260 Query: 70 KNLKITPKTFVGTEMFEGKAINV 2 ++L KTF+G EMF+GK +NV Sbjct: 261 QSLSTKAKTFIGNEMFDGKMVNV 283