BLASTX nr result
ID: Rauwolfia21_contig00002338
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00002338 (3023 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Score E
Sequences producing significant alignments: (bits) Value

ref|XP_004233344.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 462 e-127
ref|XP_006357128.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 461 e-127
dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum] 461 e-126
gb|EPS61415.1| type 2 proly 4-hydroxylase [Genlisea aurea] 445 e-122
gb|EMJ16894.1| hypothetical protein PRUPE_ppa008787mg [Prunus pe... 445 e-122
gb|EOY03276.1| Oxoglutarate/iron-dependent oxygenase [Theobroma ... 441 e-120
ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative... 436 e-119
ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 434 e-118
gb|ESW23170.1| hypothetical protein PHAVU_004G024300g [Phaseolus... 432 e-118
ref|XP_002324024.1| hypothetical protein POPTR_0017s11150g [Popu... 432 e-118
ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 431 e-117
gb|EXB47702.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notab... 429 e-117
ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 428 e-117
gb|ACU19077.1| unknown [Glycine max] 427 e-116
ref|XP_006482512.1| PREDICTED: prolyl 4-hydroxylase subunit alph... 427 e-116
ref|XP_006431045.1| hypothetical protein CICLE_v10012224mg [Citr... 426 e-116
ref|XP_006395360.1| hypothetical protein EUTSA_v10004627mg [Eutr... 422 e-115
ref|XP_006291518.1| hypothetical protein CARUB_v10017667mg [Caps... 
422 e-115 gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus] 421 e-114 ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. ly... 421 e-114 >ref|XP_004233344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum lycopersicum] Length = 317 Score = 462 bits (1188), Expect = e-127 Identities = 222/282 (78%), Positives = 243/282 (86%), Gaps = 4/282 (1%) Frame = -1 Query: 3023 VSGRSADKKT----MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAK 2856 VSG D+KT +K TG SS DPTRVTQISWRPRAFIYR FLT +ECDHLI LAK Sbjct: 30 VSGGHGDRKTKLSVLKLVTGRSSATIDPTRVTQISWRPRAFIYRNFLTDEECDHLITLAK 89 Query: 2855 DKLEKSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQI 2676 DKLEKSMVADNESGKS+ESEVRTSSGMFL K QDEVVA +E RIAAWTFLP ENGE+IQI Sbjct: 90 DKLEKSMVADNESGKSVESEVRTSSGMFLSKGQDEVVANVEARIAAWTFLPKENGESIQI 149 Query: 2675 LHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKAD 2496 LHYEHGQKYEPH+DYFHDK NQELGGHRVATVLMYLSDV+KGGET+FP+SEA ++QPK D Sbjct: 150 LHYEHGQKYEPHYDYFHDKVNQELGGHRVATVLMYLSDVEKGGETIFPNSEAKKSQPKGD 209 Query: 2495 DWSECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE 2316 DWS+CAK GYAVKP KGDALLFFSLH +ATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE Sbjct: 210 DWSDCAKNGYAVKPRKGDALLFFSLHLNATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE 269 Query: 2315 LPSSGESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 + E C D+NP+CS WA GEC+KNPLYMVGSE++ G+ Sbjct: 270 TVFNNE---CQDQNPSCSQWAVNGECDKNPLYMVGSENSVGH 308 >ref|XP_006357128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Solanum tuberosum] Length = 317 Score = 461 bits (1187), Expect = e-127 Identities = 222/282 (78%), Positives = 242/282 (85%), Gaps = 4/282 (1%) Frame = -1 Query: 3023 VSGRSADKKT----MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAK 2856 VSG D+KT +K TG SS DPTRVTQISWRPRAF+YR FLT +ECDHL+ LAK Sbjct: 30 VSGGHGDRKTKLSVLKLVTGGSSSTIDPTRVTQISWRPRAFVYRNFLTDEECDHLVTLAK 89 Query: 2855 DKLEKSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQI 2676 DKLEKSMVADNESGKSIESEVRTSSGMFL K QDEVVA +E RIA+WTFLP ENGE+IQI Sbjct: 90 
DKLEKSMVADNESGKSIESEVRTSSGMFLSKGQDEVVANVEARIASWTFLPKENGESIQI 149 Query: 2675 LHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKAD 2496 LHYEHGQKYEPH+DYFHDK NQELGGHRVATVLMYLSDV+KGGET+FP+SEA ++QPK D Sbjct: 150 LHYEHGQKYEPHYDYFHDKVNQELGGHRVATVLMYLSDVEKGGETIFPNSEAKKSQPKGD 209 Query: 2495 DWSECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE 2316 DWS+CAK GYAVKP KGDALLFFSLH DATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE Sbjct: 210 DWSDCAKNGYAVKPRKGDALLFFSLHLDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE 269 Query: 2315 LPSSGESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 + E C D+NPNCS WA GECEKNPLYMVGS ++ G+ Sbjct: 270 TLFNNE---CQDQNPNCSQWAINGECEKNPLYMVGSGNSVGH 308 >dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum] Length = 318 Score = 461 bits (1185), Expect = e-126 Identities = 224/283 (79%), Positives = 239/283 (84%), Gaps = 6/283 (2%) Frame = -1 Query: 3020 SGRSADKKTMKSTT------GASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLA 2859 SG DKKT S +SS DPTRVTQISWRPRAF+YR FLT +ECDH I LA Sbjct: 31 SGWHNDKKTKSSVLKLLTDRSSSSPTIDPTRVTQISWRPRAFVYRNFLTDEECDHFITLA 90 Query: 2858 KDKLEKSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQ 2679 K KLEKSMVADNESGKS+ESEVRTSSGMF RK QD+VVA +E RIAAWTFLP ENGE+IQ Sbjct: 91 KHKLEKSMVADNESGKSVESEVRTSSGMFFRKAQDQVVANVEARIAAWTFLPEENGESIQ 150 Query: 2678 ILHYEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKA 2499 ILHYEHGQKYEPHFDYFHDK NQELGGHRVATVLMYLSDV+KGGETVFP+SEA +TQ K Sbjct: 151 ILHYEHGQKYEPHFDYFHDKVNQELGGHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKG 210 Query: 2498 DDWSECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSF 2319 DDWS+CAKKGYAVKP KGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSF Sbjct: 211 DDWSDCAKKGYAVKPRKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSF 270 Query: 2318 ELPSSGESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 E SS C D+NPNC WA AGECEKNPLYM+GSED+ G+ Sbjct: 271 ETTSS----VCKDQNPNCPQWATAGECEKNPLYMMGSEDSVGH 309 >gb|EPS61415.1| type 2 proly 4-hydroxylase [Genlisea aurea] Length = 308 Score = 445 bits (1144), Expect = e-122 Identities = 208/274 (75%), 
Positives = 238/274 (86%), Gaps = 3/274 (1%) Frame = -1 Query: 3005 DKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVAD 2826 ++ +K TG+SS DPTRV+QISW+PRAF+YR FL+ +ECDHLI LAKDKLEKSMVAD Sbjct: 22 NESVLKLITGSSSPSLDPTRVSQISWKPRAFLYRWFLSDEECDHLIILAKDKLEKSMVAD 81 Query: 2825 NESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYE 2646 NESGKS+ESEVRTSSGMF++K QDE+VA IE RIA+WTFLP ENGEA+QILHYEHGQKYE Sbjct: 82 NESGKSVESEVRTSSGMFIQKAQDEIVAGIEARIASWTFLPIENGEAMQILHYEHGQKYE 141 Query: 2645 PHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVE---TQPKADDWSECAK 2475 PHFDYFHD+ NQ+LGGHR+ATVLMYLSDV+KGGETVFP+SE QPK +DW++CAK Sbjct: 142 PHFDYFHDEVNQKLGGHRIATVLMYLSDVEKGGETVFPTSEVEHEGIRQPKGEDWTDCAK 201 Query: 2474 KGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELPSSGES 2295 +GYAVKP KGDALLFFSLHPDATTDP+SLHGSCPV+EGEKWSATKWIHVRSF+ S + Sbjct: 202 QGYAVKPRKGDALLFFSLHPDATTDPMSLHGSCPVVEGEKWSATKWIHVRSFDRTSQKQP 261 Query: 2294 GDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATG 2193 DCVD +PNC+ WA AGECEKNPLYMVGSE G Sbjct: 262 TDCVDDHPNCAAWALAGECEKNPLYMVGSEGMAG 295 >gb|EMJ16894.1| hypothetical protein PRUPE_ppa008787mg [Prunus persica] Length = 319 Score = 445 bits (1144), Expect = e-122 Identities = 212/279 (75%), Positives = 239/279 (85%), Gaps = 1/279 (0%) Frame = -1 Query: 3023 VSGRSADKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLE 2844 + + + ++ GASS FDPTRVTQ+SW PRAF+Y+GFL+ +ECDHLI++AK+KLE Sbjct: 31 IEEKKTEGSVLRLRRGASSATFDPTRVTQLSWHPRAFLYKGFLSEEECDHLIEIAKNKLE 90 Query: 2843 KSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYE 2664 KSMVADNESGKSIESEVRTSSGMFL+K+QDEVVA IE RIAAWTFLP ENGE+IQILHYE Sbjct: 91 KSMVADNESGKSIESEVRTSSGMFLQKSQDEVVANIEARIAAWTFLPIENGESIQILHYE 150 Query: 2663 HGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSE 2484 HGQKYEPHFDYFHDKANQELGGHRVATVLMYLS+V+KGGETVFP++EA +Q K DD S+ Sbjct: 151 HGQKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNTEAQMSQSKDDDASD 210 Query: 2483 CAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-S 2307 CAK+GY+VKP KGDALLFFSLHPDATTDP 
SLHGSCPVIEGEKWSATKWIHVRSFE Sbjct: 211 CAKQGYSVKPYKGDALLFFSLHPDATTDPSSLHGSCPVIEGEKWSATKWIHVRSFEKSLK 270 Query: 2306 SGESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 SGDC D+N NC WA AGECEKNP YMVGS+ G+ Sbjct: 271 HAVSGDCADENDNCPLWAKAGECEKNPTYMVGSKGLPGF 309 >gb|EOY03276.1| Oxoglutarate/iron-dependent oxygenase [Theobroma cacao] Length = 307 Score = 441 bits (1133), Expect = e-120 Identities = 210/269 (78%), Positives = 232/269 (86%), Gaps = 1/269 (0%) Frame = -1 Query: 2993 MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESG 2814 +K G SSV FDP+RVTQ+SW PRAFIY GFL+ +ECDHLI LAKDKLEKSMVADNESG Sbjct: 29 LKMKRGTSSVLFDPSRVTQLSWHPRAFIYEGFLSAEECDHLITLAKDKLEKSMVADNESG 88 Query: 2813 KSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFD 2634 +S+ESEVRTSSGMFL+K QDEV+A IE RIAAWTFLP ENGE++QILHYE GQKYEPHFD Sbjct: 89 QSLESEVRTSSGMFLQKAQDEVIADIEARIAAWTFLPVENGESMQILHYEQGQKYEPHFD 148 Query: 2633 YFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKP 2454 YFHDKANQELGGHR+ATVLMYLSDV+ GGETVFP+SE QPK D WS CAK GYAVKP Sbjct: 149 YFHDKANQELGGHRIATVLMYLSDVESGGETVFPNSEGKLAQPKDDSWSACAKNGYAVKP 208 Query: 2453 MKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFE-LPSSGESGDCVDK 2277 KGDALLFFSLHPDATTD SLHGSCPVI+GEKWSATKWIHVRSF+ L E+GDCVD+ Sbjct: 209 RKGDALLFFSLHPDATTDTNSLHGSCPVIKGEKWSATKWIHVRSFDKLERRSENGDCVDE 268 Query: 2276 NPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 + NC WA AGECEKNP YMVGSE++ G+ Sbjct: 269 SENCPVWAKAGECEKNPTYMVGSEESYGF 297 >ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis] Length = 311 Score = 436 bits (1120), Expect = e-119 Identities = 208/262 (79%), Positives = 229/262 (87%), Gaps = 1/262 (0%) Frame = -1 Query: 2972 SSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESGKSIESEV 2793 SS FDPTRVTQ+SW PRAF+Y+GFL+++ECDHLI LA+DKLEKSMVADNESGKSIESEV Sbjct: 40 SSRIFDPTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEV 99 Query: 2792 
RTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFDYFHDKAN 2613 RTSSGMF+ K QDE+VA IE RIAAWTFLP ENGE++QILHYEHGQKYEPHFDYFHDKAN Sbjct: 100 RTSSGMFIAKAQDEIVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKAN 159 Query: 2612 QELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKPMKGDALL 2433 QELGGHRVATVLMYLS+V+KGGETVFP++E +QPK D WS+CAK GYAVKP KGDALL Sbjct: 160 QELGGHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALL 219 Query: 2432 FFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-SSGESGDCVDKNPNCSHW 2256 FFSLHPDATTD SLHGSCPVIEGEKWSATKWIHVRSFE GDCVD+N +C W Sbjct: 220 FFSLHPDATTDSDSLHGSCPVIEGEKWSATKWIHVRSFEKSFKQLGKGDCVDENDHCPLW 279 Query: 2255 AAAGECEKNPLYMVGSEDATGY 2190 A AGEC+KNPLYM+GS A GY Sbjct: 280 AKAGECKKNPLYMIGSGGANGY 301 >ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera] gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera] Length = 316 Score = 434 bits (1116), Expect = e-118 Identities = 209/281 (74%), Positives = 236/281 (83%), Gaps = 5/281 (1%) Frame = -1 Query: 3017 GRSADKKTMKSTTGAS----SVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDK 2850 G +KKT S G + FDPTRVTQ+SWRPRAF+Y+GFL+ +ECDHLI LAKDK Sbjct: 26 GWVGEKKTGGSVLGLKPRGFASGFDPTRVTQLSWRPRAFLYKGFLSEEECDHLITLAKDK 85 Query: 2849 LEKSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILH 2670 LEKSMVADNESGKSI SEVRTSSGMFL K QDE+VA IE RIAAWTFLP ENGE+IQILH Sbjct: 86 LEKSMVADNESGKSIMSEVRTSSGMFLLKAQDEIVADIEARIAAWTFLPVENGESIQILH 145 Query: 2669 YEHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDW 2490 YE+G+KYEPHFDYFHDK NQ LGGHR+ATVLMYL+ V++GGETVFP+SE +QPK D W Sbjct: 146 YENGEKYEPHFDYFHDKVNQLLGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSW 205 Query: 2489 SECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP 2310 S+CAKKGYAV P KGDALLFFSLHPDATTDP SLHGSCPVI GEKWSATKWIHVRSF+ P Sbjct: 206 SDCAKKGYAVNPKKGDALLFFSLHPDATTDPSSLHGSCPVIAGEKWSATKWIHVRSFDKP 265 Query: 2309 SS-GESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 S G G+CVD++ +C WAA GECEKNP+YMVGSE++ G+ Sbjct: 266 
SKRGAQGECVDEDEHCPKWAAVGECEKNPVYMVGSENSDGF 306 >gb|ESW23170.1| hypothetical protein PHAVU_004G024300g [Phaseolus vulgaris] Length = 318 Score = 432 bits (1110), Expect = e-118 Identities = 208/275 (75%), Positives = 229/275 (83%), Gaps = 1/275 (0%) Frame = -1 Query: 3014 RSADKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSM 2835 +S ++ TG SSV+FDPTRVTQ+SW PRAF+Y+GFL+ +ECDHLI LAKDKLE SM Sbjct: 33 KSTHGSVLRMKTGVSSVKFDPTRVTQLSWNPRAFLYKGFLSEEECDHLITLAKDKLEISM 92 Query: 2834 VADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQ 2655 VADNESGKS+ SEVRTSSGMFL K QD++VA IE RI+AWTFLP ENGE++Q+LHYE+GQ Sbjct: 93 VADNESGKSVMSEVRTSSGMFLNKAQDKIVADIEARISAWTFLPIENGESMQVLHYENGQ 152 Query: 2654 KYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAK 2475 KYEPHFDYFHDKANQ +GGHRVATVLMYLS+V KGGET+FP+SEA QPK D WSECA Sbjct: 153 KYEPHFDYFHDKANQIMGGHRVATVLMYLSNVGKGGETIFPNSEAKLLQPKDDTWSECAH 212 Query: 2474 KGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-SSGE 2298 KGYAVKP KGDALLFFSLH DATTD SLHGSCPVIEGEKWSATKWIHV FE P S E Sbjct: 213 KGYAVKPEKGDALLFFSLHLDATTDANSLHGSCPVIEGEKWSATKWIHVSDFEKPVISVE 272 Query: 2297 SGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATG 2193 GDCVD N NCS WA GECEKNPLYMVGS G Sbjct: 273 GGDCVDDNENCSRWAKIGECEKNPLYMVGSAGVRG 307 >ref|XP_002324024.1| hypothetical protein POPTR_0017s11150g [Populus trichocarpa] gi|222867026|gb|EEF04157.1| hypothetical protein POPTR_0017s11150g [Populus trichocarpa] Length = 308 Score = 432 bits (1110), Expect = e-118 Identities = 204/257 (79%), Positives = 223/257 (86%), Gaps = 1/257 (0%) Frame = -1 Query: 2960 FDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESGKSIESEVRTSS 2781 FDPTRVTQ+SW PRAF+Y+GFL+ +ECDHL+ LA+DKLEKSMVADNESGKSIESEVRTSS Sbjct: 41 FDPTRVTQLSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSS 100 Query: 2780 GMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFDYFHDKANQELG 2601 GMF+ K+QDE+V IE RIAAWTFLP ENGE+IQILHYEHGQKYEPHFDYFHDKANQELG Sbjct: 101 GMFIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELG 160 Query: 2600 
GHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKPMKGDALLFFSL 2421 GHRV TVLMYLS+V KGGETVFP+SE QPK D WS+CAK GYAVKP KGDALLFFSL Sbjct: 161 GHRVVTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAKNGYAVKPQKGDALLFFSL 220 Query: 2420 HPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-SSGESGDCVDKNPNCSHWAAAG 2244 HPDATTD SLHGSCPVIEGEKWSATKWIHVRSFE SG C+D+N NC WA AG Sbjct: 221 HPDATTDTNSLHGSCPVIEGEKWSATKWIHVRSFEKSLKHAASGGCIDENENCPLWAKAG 280 Query: 2243 ECEKNPLYMVGSEDATG 2193 EC+KNP+YMVGSE + G Sbjct: 281 ECQKNPVYMVGSEGSYG 297 >ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max] Length = 318 Score = 431 bits (1107), Expect = e-117 Identities = 204/268 (76%), Positives = 228/268 (85%), Gaps = 1/268 (0%) Frame = -1 Query: 2993 MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESG 2814 ++ G SSV+FDPTRVTQ+SW PRAF+Y+GFL+ +ECDHLI LAKDKLEKSMVADNESG Sbjct: 40 LRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESG 99 Query: 2813 KSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFD 2634 KSI SEVRTSSGMFL K QDE+VA IE RIAAWTFLP ENGE++QILHYE+GQKYEPHFD Sbjct: 100 KSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFD 159 Query: 2633 YFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKP 2454 YFHDKANQ +GGHR+ATVLMYLSDV+KGGET+FP+++A QPK + WSECA KGYAVKP Sbjct: 160 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAVKP 219 Query: 2453 MKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-SSGESGDCVDK 2277 KGDALLFFSLH DA+TD SLHGSCPVIEGEKWSATKWIHV F+ P +SGDCVD+ Sbjct: 220 RKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQVDSGDCVDE 279 Query: 2276 NPNCSHWAAAGECEKNPLYMVGSEDATG 2193 N NC WA GECEKNPLYMVG E G Sbjct: 280 NENCPRWAKVGECEKNPLYMVGGEGVKG 307 >gb|EXB47702.1| Prolyl 4-hydroxylase subunit alpha-1 [Morus notabilis] Length = 328 Score = 429 bits (1102), Expect = e-117 Identities = 206/287 (71%), Positives = 234/287 (81%), Gaps = 10/287 (3%) Frame = -1 Query: 3023 VSGRSADKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLE 2844 +S + ++ ++ TGASSV FDPTRVTQ+SW 
PRAF+Y+GFL+ +ECDHLI LAKDKLE Sbjct: 31 LSEKKTEESVLRLKTGASSVTFDPTRVTQLSWHPRAFLYKGFLSEEECDHLITLAKDKLE 90 Query: 2843 KSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYE 2664 KSMVADN+SGKSI SEVRTSSGMFL+K QD++V IE RIAAWTFLP ENGE++QILHYE Sbjct: 91 KSMVADNDSGKSIMSEVRTSSGMFLQKAQDQIVTDIEARIAAWTFLPEENGESMQILHYE 150 Query: 2663 HGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVE---------T 2511 HG+KYEPHFDYFHDKANQELGGHRVATVLMYLS+V+KGGET+FP++E + Sbjct: 151 HGEKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETIFPNAEVTPDDELLAQKMS 210 Query: 2510 QPKADDWSECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIH 2331 QPK +WS+CAK GYAVKP KGDALLFFSLH DATTD SLHGSCPVIEGEKWSATKWIH Sbjct: 211 QPKGANWSDCAKSGYAVKPYKGDALLFFSLHLDATTDTNSLHGSCPVIEGEKWSATKWIH 270 Query: 2330 VRSFELPSSGESGD-CVDKNPNCSHWAAAGECEKNPLYMVGSEDATG 2193 VRSF+ P S D C D N NC WA AGEC KNP+YMVGSE G Sbjct: 271 VRSFDKPVKRSSSDECTDDNDNCPLWAKAGECAKNPVYMVGSEGFPG 317 >ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max] Length = 319 Score = 428 bits (1100), Expect = e-117 Identities = 200/268 (74%), Positives = 228/268 (85%), Gaps = 1/268 (0%) Frame = -1 Query: 2993 MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESG 2814 ++ G SSV+FDPTRVTQ+SW PRAF+Y+GFL+ +ECDHLI LAKDKLEKSMVADN+SG Sbjct: 41 LRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSG 100 Query: 2813 KSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFD 2634 KSI S++RTSSGMFL K QDE+VA IE RIAAWTFLP ENGE++QILHYE+GQKYEPHFD Sbjct: 101 KSIMSDIRTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFD 160 Query: 2633 YFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKP 2454 YFHDKANQ +GGHR+ATVLMYLSDV+KGGET+FP++EA QPK + WSECA KGYAVKP Sbjct: 161 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKP 220 Query: 2453 MKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-SSGESGDCVDK 2277 KGDALLFFSLH DA+TD SLHGSCPVIEGEKWSATKWIHV FE P ++G+CVD+ Sbjct: 221 QKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVSDFEKPFKQVDNGECVDE 280 Query: 2276 
NPNCSHWAAAGECEKNPLYMVGSEDATG 2193 N NC WA GEC+KNPLYMVG E G Sbjct: 281 NENCPRWAKVGECDKNPLYMVGGEGVRG 308 >gb|ACU19077.1| unknown [Glycine max] Length = 318 Score = 427 bits (1099), Expect = e-116 Identities = 203/268 (75%), Positives = 227/268 (84%), Gaps = 1/268 (0%) Frame = -1 Query: 2993 MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESG 2814 ++ G SSV+FDPTRVTQ+SW PRAF+Y+GFL+ +ECDHLI LAKDKLEKSMVADNESG Sbjct: 40 LRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESG 99 Query: 2813 KSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFD 2634 KSI SEVRTSSGMFL K QDE+VA IE RIAAWTFLP ENGE++QILHYE+GQKYEPHFD Sbjct: 100 KSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFD 159 Query: 2633 YFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKP 2454 YFHDKANQ +GGHR+ATVLMYLSDV+KGGET+F +++A QPK + WSECA KGYAVKP Sbjct: 160 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGYAVKP 219 Query: 2453 MKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP-SSGESGDCVDK 2277 KGDALLFFSLH DA+TD SLHGSCPVIEGEKWSATKWIHV F+ P +SGDCVD+ Sbjct: 220 RKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQVDSGDCVDE 279 Query: 2276 NPNCSHWAAAGECEKNPLYMVGSEDATG 2193 N NC WA GECEKNPLYMVG E G Sbjct: 280 NENCPRWAKVGECEKNPLYMVGGEGVKG 307 >ref|XP_006482512.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Citrus sinensis] Length = 318 Score = 427 bits (1098), Expect = e-116 Identities = 207/280 (73%), Positives = 237/280 (84%), Gaps = 2/280 (0%) Frame = -1 Query: 3023 VSGRSADKKTMKSTTGA-SSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKL 2847 V G +DK+ S +S FDP+RVTQ+SW PRAFIY+GFL+ +ECDHLI LAKDKL Sbjct: 26 VPGWLSDKEKKTSVLRLKTSTTFDPSRVTQLSWNPRAFIYKGFLSDEECDHLIDLAKDKL 85 Query: 2846 EKSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHY 2667 EKSMVADNESGKSI SEVRTSSGMFL K QDE+VA+IE RIAAWTFLPPENGEA+QILHY Sbjct: 86 EKSMVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY 145 Query: 2666 EHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWS 2487 EHGQKYEPHFD+F DK 
NQ+LGGHR+ATVLMYLS V+KGGETVFP+SE +Q + +WS Sbjct: 146 EHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWS 203 Query: 2486 ECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP- 2310 ECA++GYAVKPMKGDALLFFSLHPDA+TD SLHGSCPVIEGEKWSATKWIHVR+F+ P Sbjct: 204 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 263 Query: 2309 SSGESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 E+ DCVD++ NC WA AGEC+KNPLYMVGS+ + GY Sbjct: 264 KEPENDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGY 303 >ref|XP_006431045.1| hypothetical protein CICLE_v10012224mg [Citrus clementina] gi|557533102|gb|ESR44285.1| hypothetical protein CICLE_v10012224mg [Citrus clementina] Length = 318 Score = 426 bits (1095), Expect = e-116 Identities = 206/280 (73%), Positives = 235/280 (83%), Gaps = 2/280 (0%) Frame = -1 Query: 3023 VSGRSADKKTMKSTTGA-SSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKL 2847 V G +DK+ S +S FDP+RVTQ+SW PRAFIY+GFL+ +ECDHLI LAKDKL Sbjct: 26 VPGWLSDKEKKTSVLRLKTSTTFDPSRVTQLSWNPRAFIYKGFLSDEECDHLIDLAKDKL 85 Query: 2846 EKSMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHY 2667 E SMVADNESGKSI SEVRTSSGMFL K QDE+VA+IE RIAAWTFLPPENGEA+QILHY Sbjct: 86 ETSMVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY 145 Query: 2666 EHGQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWS 2487 EHGQKYEPHFD+F DK NQ+LGGHR+ATVLMYLS+V+KGGET+FP+SE +Q + +WS Sbjct: 146 EHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSNVEKGGETIFPNSEV--SQSRDGNWS 203 Query: 2486 ECAKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELP- 2310 ECA++GYAVKPMKGDALLFFSLHPDA+TD SLHGSCPVIEGEKWSATKWIHVR+F+ P Sbjct: 204 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 263 Query: 2309 SSGESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 E DCVD++ NC WA AGECEKNPLYMVGS+ GY Sbjct: 264 KEPEDDDCVDEDLNCVVWAKAGECEKNPLYMVGSKSTRGY 303 >ref|XP_006395360.1| hypothetical protein EUTSA_v10004627mg [Eutrema salsugineum] gi|557091999|gb|ESQ32646.1| hypothetical protein EUTSA_v10004627mg [Eutrema salsugineum] Length = 315 Score = 422 bits (1086), Expect = 
e-115 Identities = 196/272 (72%), Positives = 229/272 (84%) Frame = -1 Query: 3005 DKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVAD 2826 D +K T ASS+ FDPTRVTQ+SW PR F+Y+G L+ +ECDH I LAK KLEKSMVAD Sbjct: 35 DGSVIKQKTSASSLAFDPTRVTQLSWTPRVFLYKGLLSDEECDHFINLAKGKLEKSMVAD 94 Query: 2825 NESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYE 2646 N+SG+S+ESEVRTSSGMFL K QD+VVA +E+++AAWTFLP ENGE++QILHYE+GQKYE Sbjct: 95 NDSGESVESEVRTSSGMFLSKRQDDVVANVESKLAAWTFLPEENGESMQILHYENGQKYE 154 Query: 2645 PHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGY 2466 PHFDYFHD+ N ELGGHR+ATVLMYLS+V+KGGETVFP + QPK D W+ECAK+GY Sbjct: 155 PHFDYFHDQVNLELGGHRIATVLMYLSNVKKGGETVFPMWKGETIQPKDDSWTECAKQGY 214 Query: 2465 AVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELPSSGESGDC 2286 AVKPMKGDALLFF+LHP+ATTDP SLHGSCPVIEGEKWSAT+WIHV+SFE P +G C Sbjct: 215 AVKPMKGDALLFFNLHPNATTDPSSLHGSCPVIEGEKWSATRWIHVKSFERPIQRPTG-C 273 Query: 2285 VDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 VD+N +C WA +GECEKNP YMVGSE GY Sbjct: 274 VDENESCPKWAKSGECEKNPTYMVGSETDHGY 305 >ref|XP_006291518.1| hypothetical protein CARUB_v10017667mg [Capsella rubella] gi|482560225|gb|EOA24416.1| hypothetical protein CARUB_v10017667mg [Capsella rubella] Length = 315 Score = 422 bits (1084), Expect = e-115 Identities = 196/272 (72%), Positives = 230/272 (84%) Frame = -1 Query: 3005 DKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVAD 2826 D +K T ASS FDPTRVTQ+SW PRAF+Y+GFL+ +ECDH IKLAK KLEKSMVAD Sbjct: 35 DGSVIKMKTSASSFGFDPTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVAD 94 Query: 2825 NESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYE 2646 NESG+S+ESEVRTSSGMFL K QD++V+ +E ++AAWTFLP ENGE++QILHYE+GQKYE Sbjct: 95 NESGESVESEVRTSSGMFLSKRQDDIVSNVEKKLAAWTFLPEENGESMQILHYENGQKYE 154 Query: 2645 PHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGY 2466 PHFDYFHD+AN ELGGHR+ATVLMYLS+V+KGGETVFP + TQ K D W+ECAK+GY Sbjct: 155 PHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDTWTECAKQGY 214 Query: 2465 
AVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELPSSGESGDC 2286 AVKP KGDALLFF+LHP+ATTDP SLHGSCPV+EGEKWSAT+WIHVRSFE +SG C Sbjct: 215 AVKPRKGDALLFFNLHPNATTDPTSLHGSCPVVEGEKWSATRWIHVRSFEKAFKKQSG-C 273 Query: 2285 VDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 VD+N +C WA +GEC+KNP YMVGS+ GY Sbjct: 274 VDENDSCEKWAKSGECQKNPTYMVGSDKDHGY 305 >gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus] Length = 316 Score = 421 bits (1081), Expect = e-114 Identities = 202/270 (74%), Positives = 228/270 (84%), Gaps = 2/270 (0%) Frame = -1 Query: 2993 MKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEKSMVADNESG 2814 +KS SSV DP+ VTQ+SW+PRAF+Y GFLTH+ECDHLI +AKDKLEKSMVADNESG Sbjct: 39 LKSENVPSSVGVDPSHVTQLSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESG 98 Query: 2813 KSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEHGQKYEPHFD 2634 KSI SEVRTSSGMFL+K QD+VVA IE RIAAWTFLP ENGEA+QILHYE GQKYEPHFD Sbjct: 99 KSIPSEVRTSSGMFLQKAQDDVVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFD 158 Query: 2633 YFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSECAKKGYAVKP 2454 YFHDK NQ+LGGHR+ATVLMYLS+V++GGETVFP++EA + S+CAK GY+VKP Sbjct: 159 YFHDKVNQQLGGHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGGYSVKP 218 Query: 2453 MKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELPSSGE--SGDCVD 2280 KGDALLFFSLHPDA+TD LSLHGSCPVIEGEKWSATKWIHVRSF+ + SGDCVD Sbjct: 219 KKGDALLFFSLHPDASTDSLSLHGSCPVIEGEKWSATKWIHVRSFDRIRKDDPPSGDCVD 278 Query: 2279 KNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 N C+ WA AGEC+KNPLYMVGS+D GY Sbjct: 279 DNALCAQWALAGECKKNPLYMVGSKDMKGY 308 >ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata] gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. 
lyrata] Length = 316 Score = 421 bits (1081), Expect = e-114 Identities = 196/277 (70%), Positives = 232/277 (83%) Frame = -1 Query: 3020 SGRSADKKTMKSTTGASSVRFDPTRVTQISWRPRAFIYRGFLTHDECDHLIKLAKDKLEK 2841 S + D +K T ASS FDPTRVTQ+SW PRAF+Y+GFL+ +ECDH IKLAK KLEK Sbjct: 31 SSNNRDGSVIKMKTSASSFGFDPTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEK 90 Query: 2840 SMVADNESGKSIESEVRTSSGMFLRKNQDEVVATIETRIAAWTFLPPENGEAIQILHYEH 2661 SMVADN+SG+S+ESEVRTSSGMFL K QD++VA +E ++AAWTF+P ENGE++QILHYE+ Sbjct: 91 SMVADNDSGESVESEVRTSSGMFLSKRQDDIVANVEAKLAAWTFIPEENGESMQILHYEN 150 Query: 2660 GQKYEPHFDYFHDKANQELGGHRVATVLMYLSDVQKGGETVFPSSEAVETQPKADDWSEC 2481 GQKYEPHFDYFHD+AN ELGGHR+ATVLMYLS+V+KGGETVFP + TQ K D W+EC Sbjct: 151 GQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTEC 210 Query: 2480 AKKGYAVKPMKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFELPSSG 2301 AK+GYAVKP KGDALLFF+LHP+ATTD SLHGSCPV+EGEKWSAT+WIHVRSF+ S Sbjct: 211 AKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVRSFDRAFSK 270 Query: 2300 ESGDCVDKNPNCSHWAAAGECEKNPLYMVGSEDATGY 2190 +SG CVD+N +C WA AGEC+KNP YMVGS+ GY Sbjct: 271 QSG-CVDENVSCEKWAKAGECQKNPTYMVGSDKDHGY 306
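
Each alignment above opens with a statistics line of the same shape (Score, Expect, Identities, Positives, an optional Gaps field, and Frame), so the per-hit numbers can be pulled out with a regular expression. The following is only a minimal sketch, assuming the legacy BLAST 2.2.x text layout seen in this report; the names `HSP_STATS` and `parse_hsp_stats` are hypothetical and not part of any BLAST tool. It also assumes that an E-value printed as `e-127` is shorthand for `1e-127`.

```python
import re

# One legacy-BLAST statistics line, e.g.
#   Score = 462 bits (1188), Expect = e-127 Identities = 222/282 (78%),
#   Positives = 243/282 (86%), Gaps = 4/282 (1%) Frame = -1
# The Gaps field is optional: it is absent for alignments without gaps.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsp_stats(text):
    """Return one dict per statistics line found in a BLAST text report."""
    hits = []
    for m in HSP_STATS.finditer(text):
        evalue = m.group("evalue")
        # Assumption: a bare exponent such as "e-127" stands for 1e-127.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": int(m.group("ident")),
            "align_len": int(m.group("length")),
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),  # 0 when the field is absent
            "frame": int(m.group("frame")),
        })
    return hits


# Statistics line of the first hit in this report (XP_004233344.1).
sample = (
    "Score = 462 bits (1188), Expect = e-127 Identities = 222/282 (78%), "
    "Positives = 243/282 (86%), Gaps = 4/282 (1%) Frame = -1"
)
for hit in parse_hsp_stats(sample):
    pct = 100.0 * hit["identities"] / hit["align_len"]
    print(hit["bits"], hit["evalue"], round(pct, 1), hit["frame"])
```

The pattern matches across line breaks, so it can be run over the whole report at once. For anything beyond a quick summary, a maintained BLAST parser is likely to be more robust than a hand-written expression.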