BLASTX nr result
ID: Catharanthus22_contig00021982
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00021982 (1136 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:                        Score (bits)  E Value

ref|XP_004232457.1| PREDICTED: GATA transcription factor 5-like ...   257   5e-66
ref|XP_006340696.1| PREDICTED: GATA transcription factor 5-like ...   253   8e-65
gb|EOX90924.1| GATA transcription factor 5, putative [Theobroma ...   232   2e-58
gb|EMJ05895.1| hypothetical protein PRUPE_ppa008278mg [Prunus pe...   224   5e-56
ref|XP_002310287.2| hypothetical protein POPTR_0007s13700g [Popu...   223   1e-55
ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Viti...   223   1e-55
emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera]   221   3e-55
ref|XP_002327771.1| predicted protein [Populus trichocarpa] gi|5...   221   6e-55
ref|XP_004512096.1| PREDICTED: GATA transcription factor 5-like ...   220   7e-55
ref|XP_002865076.1| zinc finger family protein [Arabidopsis lyra...   219   2e-54
ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citr...   217   6e-54
ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citr...   217   6e-54
gb|ADL36694.1| GATA domain class transcription factor [Malus dom...   216   1e-53
ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like ...   215   2e-53
gb|EXC35403.1| GATA transcription factor 5 [Morus notabilis]          214   5e-53
ref|NP_001242253.1| uncharacterized protein LOC100783966 [Glycin...   214   7e-53
emb|CBI17417.3| unnamed protein product [Vitis vinifera]              213   9e-53
ref|NP_201433.1| GATA transcription factor 5 [Arabidopsis thalia...   212   2e-52
ref|XP_006572850.1| PREDICTED: uncharacterized protein LOC100783...
211 6e-52 ref|XP_006280758.1| hypothetical protein CARUB_v10026725mg [Caps... 210 1e-51 >ref|XP_004232457.1| PREDICTED: GATA transcription factor 5-like [Solanum lycopersicum] Length = 342 Score = 257 bits (657), Expect = 5e-66 Identities = 153/331 (46%), Positives = 177/331 (53%), Gaps = 16/331 (4%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSA-GNGQNAXXXXXXXXXXXXXXSNAV 367 E AL++SF P+ P K Q Q F DD SA G GQN SN Sbjct: 5 EWALRNSFVPETPLKMT-------QNQTFGDDFSAAGAGQNGVSGDDFFVDDLLDFSNGF 57 Query: 368 VEDPEEQKQEE---------------LLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPAS 502 VE ++++EE + ++DF SLP S Sbjct: 58 VEGEGDEEEEEGKNQGGEGISVQKPCSVSIAVSPLKKTEIDDKGKVTISVNEDFASLPVS 117 Query: 503 ELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATP 682 E++VP+DDL+SLEWLSHFV++SF YSL YP KLP K D E+ V++K CFATP Sbjct: 118 EISVPTDDLDSLEWLSHFVEESFSGYSLAYPAGKLPV--EKKTGDGEIPVEEKKPCFATP 175 Query: 683 VQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQTAE 862 VQTKAR+KR R+ V W P W Y P +AE Sbjct: 176 VQTKARTKRGRSSVRVWPVCSGSLTESSSSSTSSSSTTTMSSSPPTGSWFLYPTPVHSAE 235 Query: 863 SLFXXXXXXXXXXXXATESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKS 1042 S A+ G QQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKS Sbjct: 236 SP-GKPLAKKLKKKPASHGGNGPQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKS 294 Query: 1043 GRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 GRLLPEYRPACSPTFS+ELHSNNHRKVLEMR Sbjct: 295 GRLLPEYRPACSPTFSTELHSNNHRKVLEMR 325 >ref|XP_006340696.1| PREDICTED: GATA transcription factor 5-like [Solanum tuberosum] Length = 339 Score = 253 bits (647), Expect = 8e-65 Identities = 154/327 (47%), Positives = 174/327 (53%), Gaps = 14/327 (4%) Frame = +2 Query: 197 ALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSA-GNGQNAXXXXXXXXXXXXXXSNAVVE 373 AL++SF P+ P K Q Q F DD SA G GQN SN VE Sbjct: 7 ALRNSFVPETPLKMT-------QNQTFGDDLSAAGAGQNGVSGDDFFVDDLLDFSNGFVE 59 Query: 374 DPEEQKQ------EELLENDXXXXXXXXXXXXXXXXXXKD-------DDFGSLPASELTV 514 E+++ E++ KD +DF SLP SE++V Sbjct: 60 GEGEEEEGKNQGGEDISVQKPCSVSISVSPLKKTEIDDKDKVTISVKEDFSSLPVSEISV 119 Query: 515 
PSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATPVQTK 694 P+DDL+SLEWLSHFV+DSF YSL YP KL K D E+ V++K CFATPVQTK Sbjct: 120 PTDDLDSLEWLSHFVEDSFSGYSLAYPAGKLEV--EKKTGDGEIPVEEKKPCFATPVQTK 177 Query: 695 ARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQTAESLFX 874 AR+KR R V W P W Y P +AES Sbjct: 178 ARTKRGRTSVRFWP-ACSGSLTDSSSSSTSSSSTTTMSSSPTASWFLYPTPVHSAESP-G 235 Query: 875 XXXXXXXXXXXATESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLL 1054 A G QQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLL Sbjct: 236 KPLAKKLKKKPAPHGGNGPQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLL 295 Query: 1055 PEYRPACSPTFSSELHSNNHRKVLEMR 1135 PEYRPACSPTFS+ELHSNNHRKVLEMR Sbjct: 296 PEYRPACSPTFSTELHSNNHRKVLEMR 322 >gb|EOX90924.1| GATA transcription factor 5, putative [Theobroma cacao] Length = 389 Score = 232 bits (592), Expect = 2e-58 Identities = 150/339 (44%), Positives = 177/339 (52%), Gaps = 24/339 (7%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNA-- 364 E ALK+SF ++ K++ Q F +D NGQN +N Sbjct: 43 EAALKTSFRKEMALKSSP--------QAFLEDIWLANGQNGVSSDDFSVDDLFDFTNEEG 94 Query: 365 -VVEDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDD----------DFGSLPASELT 511 + + + Q +EE E D +++ D+GSLP SEL Sbjct: 95 FLEQQQQPQHEEEEEEEDEGAPISSSSSSPKRQKLSQEEHLSNDTTTNFDYGSLPTSELA 154 Query: 512 VPSDDLESLEWLSHFVDDSFPEYSLTYP---VTKLPPMPAKSKTDPEVLVQKKPNCFATP 682 VP+DD+ +LEWLSHFV+DSF E+S YP +T+ P + A +PE V CF TP Sbjct: 155 VPADDVANLEWLSHFVEDSFSEHSTAYPTGTLTENPKLQADILAEPEKPVIT--TCFKTP 212 Query: 683 VQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFY--SGPGQT 856 V KARSKRTR G WS P PWL Y SG G T Sbjct: 213 VPAKARSKRTRTGGRVWSL----VASPSLTESSSSSTSSSSSSSPSSPWLLYPNSGSGST 268 Query: 857 AESL----FXXXXXXXXXXXXATESSGG-GQQP-RRCSHCGVQKTPQWRAGPMGAKTLCN 1018 E AT+S+GG G QP RRCSHCGV KTPQWRAGPMGAKTLCN Sbjct: 269 FEPSEPLSVEKPPAKKHKKRPATDSTGGNGTQPTRRCSHCGVTKTPQWRAGPMGAKTLCN 328 Query: 1019 ACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 ACGVRFKSGRLLPEYRPACSPTFSSELHSN+HRKVLEMR Sbjct: 329 ACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMR 367 
>gb|EMJ05895.1| hypothetical protein PRUPE_ppa008278mg [Prunus persica] Length = 338 Score = 224 bits (571), Expect = 5e-56 Identities = 137/324 (42%), Positives = 163/324 (50%), Gaps = 9/324 (2%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAG-NGQNAXXXXXXXXXXXXXXSN-- 361 E ALK+S ++ K ++ Q VF D G NGQN SN Sbjct: 5 EAALKTSIRKEMAVKASS-------QAVFDDLLWGGVNGQNGVACDDFSVDDLLDFSNED 57 Query: 362 AVVEDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESLE 541 VE E+ ++ ++ + ++ G P SEL+VP+DDLE+LE Sbjct: 58 GFVETEAEEDDKDKVKGFASVPPQKQPQDPENSDLSEKNELGPEPTSELSVPADDLENLE 117 Query: 542 WLSHFVDDSFPEYSLTYPVTKLPPMPAKSKT-DPEVLVQKKPNCFATPVQTKARSKRTRA 718 WLSHFV+DSF E++ + P +P P K DP + +KP CF TPV KARSKRTR Sbjct: 118 WLSHFVEDSFTEFTTSLPAGFIPEKPKTEKRPDPAAPLPEKP-CFKTPVPAKARSKRTRT 176 Query: 719 GVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSG-----PGQTAESLFXXXX 883 G WS G P PWL Y P + Sbjct: 177 GGRVWSLGSPSLTETSSSSSSSSSSSS-----PSSPWLIYPTTQNREPAEAGGEPVGSVE 231 Query: 884 XXXXXXXXATESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLLPEY 1063 Q PRRCSHCGVQKTPQWR GP GAKTLCNACGVR+KSGRLLPEY Sbjct: 232 KPPKKPKRRLVDGSSSQPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLLPEY 291 Query: 1064 RPACSPTFSSELHSNNHRKVLEMR 1135 RPACSPTFSSELHSN+HRKVLEMR Sbjct: 292 RPACSPTFSSELHSNHHRKVLEMR 315 >ref|XP_002310287.2| hypothetical protein POPTR_0007s13700g [Populus trichocarpa] gi|550334822|gb|EEE90737.2| hypothetical protein POPTR_0007s13700g [Populus trichocarpa] Length = 376 Score = 223 bits (568), Expect = 1e-55 Identities = 121/224 (54%), Positives = 136/224 (60%), Gaps = 4/224 (1%) Frame = +2 Query: 476 DDFGSLPASELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSK-TDPEVLV 652 DDF S+P SEL VP+DD SLEWLSHFV+DS EY+ +P PP P K + E LV Sbjct: 138 DDFFSVPTSELCVPTDDFASLEWLSHFVEDSNSEYAAPFPTNVSPPEPKKENPVEQEKLV 197 Query: 653 QKKPNCFATPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWL 832 ++P F TPV KARSKRTR GV W G P PWL Sbjct: 198 LEEP-LFKTPVPGKARSKRTRNGVRVWPLGSPSLTESSSSSSSTSSSS------PSSPWL 250 Query: 833 
FYSGPGQTAESLFXXXXXXXXXXXXATESSG---GGQQPRRCSHCGVQKTPQWRAGPMGA 1003 YS P E ++ A E++ G RRCSHCGVQKTPQWRAGP G+ Sbjct: 251 VYSKPCLKVEPVWFEKPVAKKMKKPAVEAAAKGCGSNSSRRCSHCGVQKTPQWRAGPNGS 310 Query: 1004 KTLCNACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 KTLCNACGVR+KSGRLLPEYRPACSPTFS ELHSN+HRKVLEMR Sbjct: 311 KTLCNACGVRYKSGRLLPEYRPACSPTFSKELHSNHHRKVLEMR 354 >ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Vitis vinifera] Length = 338 Score = 223 bits (568), Expect = 1e-55 Identities = 145/336 (43%), Positives = 171/336 (50%), Gaps = 21/336 (6%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNAVV 370 E ALKSS + P+ A L QQP DD GNGQ+ +N + Sbjct: 5 EKALKSSV---VRPELAFKLTQQP---ACMDDMCMGNGQSGVSGDDFSIDDLLDFTNGGI 58 Query: 371 ------EDPEEQKQE---------ELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASE 505 E+ EE + + EL END D+F S+PA+E Sbjct: 59 GEGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTTTTFSVK--------DEFPSVPATE 110 Query: 506 LTVPSDDLESLEWLSHFVDDSFPEYSLTYP---VTKLPPMPAKSKTDPEVLVQKKPNCFA 676 LTVP+DDL LEWLSHFV+DSF EYS +P +T+ ++ +PE +Q K +C Sbjct: 111 LTVPADDLADLEWLSHFVEDSFSEYSAPFPHGTLTEKAQNQTENPPEPETPLQIK-SCLK 169 Query: 677 TPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQT 856 TP KARSKR R G WS G PWL Y Q Sbjct: 170 TPFPAKARSKRARTGGRVWSMGSPSLTESSSSSSSSSSSSLSS------PWLIYPNTCQN 223 Query: 857 AESLFXXXXXXXXXXXXAT--ESSGGGQQ-PRRCSHCGVQKTPQWRAGPMGAKTLCNACG 1027 ES E+SG Q P RCSHCGVQKTPQWR GP+GAKTLCNACG Sbjct: 224 VESFHSAVKPPAKKHKKRLDPEASGSAQPTPHRCSHCGVQKTPQWRTGPLGAKTLCNACG 283 Query: 1028 VRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 VR+KSGRLLPEYRPACSPTFSSE+HSN+HRKVLEMR Sbjct: 284 VRYKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMR 319 >emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera] Length = 338 Score = 221 bits (564), Expect = 3e-55 Identities = 145/336 (43%), Positives = 170/336 (50%), Gaps = 21/336 (6%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNAVV 370 E ALKSS + P+ A L QQP DD GNGQ+ +N + Sbjct: 5 
EKALKSSV---VRPELAFKLTQQP---ACXDDICMGNGQSGVSGDDFSIDDLLDFTNGGI 58 Query: 371 -------EDPEEQKQ--------EELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASE 505 ED E++ + EL END D+F S+PA+E Sbjct: 59 GEGLFQEEDEEDEDKGCGSLSPRRELTENDNSNLTTTTFSVK--------DEFPSVPATE 110 Query: 506 LTVPSDDLESLEWLSHFVDDSFPEYSLTYP---VTKLPPMPAKSKTDPEVLVQKKPNCFA 676 LTVP+DDL LEWLSHFV+DSF EYS +P +T+ ++ +PE +Q K +C Sbjct: 111 LTVPADDLADLEWLSHFVEDSFSEYSAPFPPGTLTEKAQNQTENPPEPETPLQIK-SCLK 169 Query: 677 TPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQT 856 TP KARSKR R G WS G PWL Y Q Sbjct: 170 TPFPAKARSKRARTGGRVWSMGSPSLTESSSSSSSSSSSSLSS------PWLIYPNTCQN 223 Query: 857 AESLFXXXXXXXXXXXXAT--ESSGGGQQ-PRRCSHCGVQKTPQWRAGPMGAKTLCNACG 1027 ES E+SG Q P RCSHCGVQKT QWR GP+GAKTLCNACG Sbjct: 224 VESFHSAVKPPAKKHKKRLDPEASGSAQXTPHRCSHCGVQKTXQWRTGPLGAKTLCNACG 283 Query: 1028 VRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 VRFKSGRLLPEYRPACSPTFSSE+HSN+HRKVLEMR Sbjct: 284 VRFKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMR 319 >ref|XP_002327771.1| predicted protein [Populus trichocarpa] gi|566170906|ref|XP_006383142.1| zinc finger family protein [Populus trichocarpa] gi|550338722|gb|ERP60939.1| zinc finger family protein [Populus trichocarpa] Length = 333 Score = 221 bits (562), Expect = 6e-55 Identities = 120/227 (52%), Positives = 135/227 (59%), Gaps = 7/227 (3%) Frame = +2 Query: 476 DDFGSLPASELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSK-TDPEVLV 652 +DF S P SEL VP+DDL SLEWLSHFV+DS EY+ +P PP P K + E V Sbjct: 91 EDFVSGPTSELCVPTDDLASLEWLSHFVEDSNSEYAAPFPAIVSPPEPEKENFAEQEKSV 150 Query: 653 QKKPNCFATPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWL 832 +P CF TPV KARSKRTR GV W G P PWL Sbjct: 151 LTEP-CFKTPVPAKARSKRTRTGVRVWPLGSPTLTESSTSSSSSTSSSS-----PSSPWL 204 Query: 833 FYSGPGQTAESLFXXXXXXXXXXXX------ATESSGGGQQPRRCSHCGVQKTPQWRAGP 994 ++ P AE L+ A+ GG RRCSHCG+QKTPQWRAGP Sbjct: 205 IHTKPLLNAEPLWFEKPVVKRMKKKPSFHAAASGGGGGSHSSRRCSHCGIQKTPQWRAGP 264 Query: 995 MGAKTLCNACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 G+KTLCNACGVR+KSGRLLPEYRPACSPTFS ELHSN+HRKVLEMR 
Sbjct: 265 NGSKTLCNACGVRYKSGRLLPEYRPACSPTFSKELHSNHHRKVLEMR 311 >ref|XP_004512096.1| PREDICTED: GATA transcription factor 5-like [Cicer arietinum] Length = 380 Score = 220 bits (561), Expect = 7e-55 Identities = 141/331 (42%), Positives = 161/331 (48%), Gaps = 16/331 (4%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNAVV 370 E ALK+S D+ K Q F D+ S N QN S+ + Sbjct: 39 ETALKTSLRKDMTVKL--------NPQTFVDELSCLNAQNGTSCDDFFVDDLLDFSHVIE 90 Query: 371 EDP--EEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESLEW 544 E EE+K + + DDF SLP ++L VPSDD+ LEW Sbjct: 91 EQQQQEEEKDSSICVSLKQHNQNHEISNLNSTSFSLKDDFCSLPTTDLNVPSDDVADLEW 150 Query: 545 LSHFVDDS--FPEYSLTYPVTKLPPMPAKSKT---DPEVLVQKKPN-------CFATPVQ 688 LSHFV+DS F E+S PV L KS + E + KP CF TPVQ Sbjct: 151 LSHFVEDSDSFSEFSAALPVVTLTEKNPKSVVVVNESEPKPENKPKSPVFSQPCFKTPVQ 210 Query: 689 TKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQTAESL 868 TKARSKRTR V W FG P L Y+ Q E + Sbjct: 211 TKARSKRTRTSVRVWPFGSNSLTESSSSSTTTSSSTSSS---PTSTLLIYTNLAQNLEKV 267 Query: 869 FXXXXXXXXXXXXATESSGG--GQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKS 1042 + S G PRRCSHCGVQKTPQWR GP+GAKTLCNACGVRFKS Sbjct: 268 YSVPEKKPKKIASFNGSGHGTVALAPRRCSHCGVQKTPQWRTGPLGAKTLCNACGVRFKS 327 Query: 1043 GRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 GRLLPEYRPACSPTFSSELHSN+HRKVLEMR Sbjct: 328 GRLLPEYRPACSPTFSSELHSNHHRKVLEMR 358 >ref|XP_002865076.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297310911|gb|EFH41335.1| zinc finger family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 339 Score = 219 bits (558), Expect = 2e-54 Identities = 144/330 (43%), Positives = 174/330 (52%), Gaps = 13/330 (3%) Frame = +2 Query: 185 VEEVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNA 364 +E+ ALKSS ++ KT ++++ F +A NG +A Sbjct: 1 MEQTALKSSIRKEMAFKTTPPVYEE-----FLAVTTAPNGFSADDFSVDDLLDLSNDDVF 55 Query: 365 VVEDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESLEW 544 ED + + Q++++ DDFGSLP SEL+VP+DDL +LEW Sbjct: 56 ADEDTDPKAQQDMVRVSSEEPNDDGDALRRSSDLSGCDDFGSLPTSELSVPADDLANLEW 115 Query: 545 LSHFVDDSFPEYS---LTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATPVQTKARSKRTR 715 LSHFVDDSF EYS LT T+ P + P V + +CF +PV KARSKR R Sbjct: 116 LSHFVDDSFTEYSGPNLTGTPTEKPSWLTGDRKHP-VTPATEESCFKSPVPAKARSKRNR 174 Query: 716 AGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSG-----PGQTAESLFXXX 880 GV WS G P PW +SG P T+E Sbjct: 175 NGVKVWSLGSSSSSGPSSSGSTSSSSSR-----PSSPW--FSGAEMLEPVVTSER----P 223 Query: 881 XXXXXXXXXATESSGGGQ----QP-RRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSG 1045 + ES GQ QP RRCSHCGVQKTPQWRAGPMGAKTLCNACGVR+KSG Sbjct: 224 PFPKKHKKRSAESVFCGQLQQLQPQRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSG 283 Query: 1046 RLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 RLLPEYRPACSPTFSSELHSN+HRKV+EMR Sbjct: 284 RLLPEYRPACSPTFSSELHSNHHRKVMEMR 313 >ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] gi|557527549|gb|ESR38799.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] Length = 340 Score = 217 bits (553), Expect = 6e-54 Identities = 116/231 (50%), Positives = 137/231 (59%), Gaps = 11/231 (4%) Frame = +2 Query: 476 DDFGSLPASELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQ 655 DD G +P SEL VP+DD+ +LEWLSHFV+DSF EYS +P LP ++ +PE Sbjct: 95 DDLGPIPTSELAVPTDDVANLEWLSHFVEDSFAEYSSPFPAGTLPVKAKENGAEPEHKPA 154 Query: 656 KKPNCFATPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLF 835 +CF TP+ KARSKR+R G+ WS G P PW Sbjct: 155 LAIHCFKTPIPAKARSKRSRTGLRIWSLGSPSLSDSSSTSSASSSSS------PSSPWPV 208 Query: 836 YSGPG-----QTAESLFXXXXXXXXXXXXATE--SSGG----GQQPRRCSHCGVQKTPQW 982 + PG + AE E ++GG GQ 
RRCSHCGVQKTPQW Sbjct: 209 STNPGSLASLRPAEPFIVKPPKKKLKKKSPPEGYNAGGNISWGQFTRRCSHCGVQKTPQW 268 Query: 983 RAGPMGAKTLCNACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 R GP+GAKTLCNACGVR+KSGRL PEYRPACSPTFSSELHSN+HRKV+EMR Sbjct: 269 RTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSSELHSNHHRKVMEMR 319 >ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] gi|568825030|ref|XP_006466892.1| PREDICTED: GATA transcription factor 5-like [Citrus sinensis] gi|557527548|gb|ESR38798.1| hypothetical protein CICLE_v10025844mg [Citrus clementina] Length = 381 Score = 217 bits (553), Expect = 6e-54 Identities = 116/231 (50%), Positives = 137/231 (59%), Gaps = 11/231 (4%) Frame = +2 Query: 476 DDFGSLPASELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQ 655 DD G +P SEL VP+DD+ +LEWLSHFV+DSF EYS +P LP ++ +PE Sbjct: 136 DDLGPIPTSELAVPTDDVANLEWLSHFVEDSFAEYSSPFPAGTLPVKAKENGAEPEHKPA 195 Query: 656 KKPNCFATPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLF 835 +CF TP+ KARSKR+R G+ WS G P PW Sbjct: 196 LAIHCFKTPIPAKARSKRSRTGLRIWSLGSPSLSDSSSTSSASSSSS------PSSPWPV 249 Query: 836 YSGPG-----QTAESLFXXXXXXXXXXXXATE--SSGG----GQQPRRCSHCGVQKTPQW 982 + PG + AE E ++GG GQ RRCSHCGVQKTPQW Sbjct: 250 STNPGSLASLRPAEPFIVKPPKKKLKKKSPPEGYNAGGNISWGQFTRRCSHCGVQKTPQW 309 Query: 983 RAGPMGAKTLCNACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 R GP+GAKTLCNACGVR+KSGRL PEYRPACSPTFSSELHSN+HRKV+EMR Sbjct: 310 RTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSSELHSNHHRKVMEMR 360 >gb|ADL36694.1| GATA domain class transcription factor [Malus domestica] Length = 331 Score = 216 bits (551), Expect = 1e-53 Identities = 139/322 (43%), Positives = 164/322 (50%), Gaps = 7/322 (2%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAG---NGQNAXXXXXXXXXXXXXXSN 361 E ALK+S ++ K PQ VF D G NGQNA + Sbjct: 5 EAALKTSIRKEMAVKATG-----PQVVVFDDFLWGGAVVNGQNACDDFSVDDLLDFSNED 59 Query: 362 AVVE-DPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESL 538 VE + EE+ +E ++ + + PASEL+VP+DDLE+L Sbjct: 60 
GFVETEAEEEGDKEKVKGFVSVSLQKQNQETEKSNLSEKIE----PASELSVPADDLENL 115 Query: 539 EWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKT-DPEVLVQKKPNCFATPVQTKARSKRTR 715 EWLSHFV+DSF E++ P LP P K D E +KP CF TPV KARSKR R Sbjct: 116 EWLSHFVEDSFSEFTTALPAGFLPEKPKSEKRPDLETPFPEKP-CFKTPVPAKARSKRRR 174 Query: 716 AGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPG--QTAESLFXXXXXX 889 G WS G P PW Y ++AE + Sbjct: 175 TGGRVWSLGSPSLTESSSSSSSSSSSS------PSSPWTIYPATQNQESAEPVSSVEKPP 228 Query: 890 XXXXXXATESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLLPEYRP 1069 + S Q PRRCSHCGVQKTPQWR GP GAKTLCNACGVR+KSGRLLPEYRP Sbjct: 229 RKPKRRLVDGSSS-QPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLLPEYRP 287 Query: 1070 ACSPTFSSELHSNNHRKVLEMR 1135 ACSPTFSSELHSN+HRKV+EMR Sbjct: 288 ACSPTFSSELHSNHHRKVIEMR 309 >ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp. vesca] Length = 333 Score = 215 bits (548), Expect = 2e-53 Identities = 118/220 (53%), Positives = 134/220 (60%), Gaps = 6/220 (2%) Frame = +2 Query: 494 PASELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQKKPNCF 673 P SELTVP+DDLE+LEWLSHFV+DSF ++ + P + P K + +PE L KP CF Sbjct: 102 PTSELTVPADDLENLEWLSHFVEDSFSGFNASLPAGFMAVKPEK-RPEPEAL---KP-CF 156 Query: 674 ATPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYS---- 841 TPV KARSKRTR G WS G P PWL Y+ Sbjct: 157 KTPVPAKARSKRTRTGGRVWSLGSPSFTETSSSSSSSSSTSSC----PSSPWLIYNPTQG 212 Query: 842 --GPGQTAESLFXXXXXXXXXXXXATESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLC 1015 G G + E TE G Q PRRCSHCGVQKTPQWR GP GAKTLC Sbjct: 213 LGGFGSSVEK-----PQKKPKRPATTEGGGSSQPPRRCSHCGVQKTPQWRTGPNGAKTLC 267 Query: 1016 NACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 NACGVR+KSGRL+PEYRPACSPTFSSELHSN+HRKV+E+R Sbjct: 268 NACGVRYKSGRLVPEYRPACSPTFSSELHSNHHRKVMEIR 307 >gb|EXC35403.1| GATA transcription factor 5 [Morus notabilis] Length = 393 Score = 214 bits (545), Expect = 5e-53 Identities = 114/225 (50%), Positives = 135/225 (60%), Gaps = 9/225 (4%) Frame = +2 Query: 488 SLPASELTVPSDDLESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKT---DPEVLVQK 658 S+P 
+ELT+P+++LE+LEWLSHFV++SF E+S +Y P + +T +P+ + Sbjct: 152 SVPTTELTLPAEELENLEWLSHFVEESFSEFSTSYLAGVSAEKPPEDETFLPEPKRFAPE 211 Query: 659 KPNCFATPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFY 838 KP CF TP+ KARSKR R G WS G P PWL Y Sbjct: 212 KP-CFTTPIPAKARSKRPRTGGRVWSLGSPSFIESSSSSTTSSSSSSS----PTSPWLIY 266 Query: 839 SGPGQTAESLFXXXXXXXXXXXXATESSGGG------QQPRRCSHCGVQKTPQWRAGPMG 1000 + A ES G G Q PRRCSHCGVQKTPQWR GP+G Sbjct: 267 ATHSHEPACSVQKPAPKKAKKRQAVESFGSGSGPASAQPPRRCSHCGVQKTPQWRTGPLG 326 Query: 1001 AKTLCNACGVRFKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 AKTLCNACGVRFKSGRLLPEYRPACSPTFSS+LHSN+HRKVLEMR Sbjct: 327 AKTLCNACGVRFKSGRLLPEYRPACSPTFSSDLHSNHHRKVLEMR 371 >ref|NP_001242253.1| uncharacterized protein LOC100783966 [Glycine max] gi|255637027|gb|ACU18846.1| unknown [Glycine max] Length = 352 Score = 214 bits (544), Expect = 7e-53 Identities = 134/323 (41%), Positives = 169/323 (52%) Frame = +2 Query: 167 EKGMDRVEEVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXX 346 EK M+ VE ALKS++ ++ K + + F+++ S NG Sbjct: 42 EKEMECVE-AALKSNYRKEMTLKLSP--------RTFTEEVSVQNGTTCDDFFVNDLLDF 92 Query: 347 XXXSNAVVEDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDD 526 + V E+PE+Q+ + DD+ S+P SEL+V +DD Sbjct: 93 ----SHVEEEPEQQEDTPCVSLQHENPSHEPCTFK--------DDYASVPTSELSVLADD 140 Query: 527 LESLEWLSHFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATPVQTKARSK 706 L LEWLSHFV+DSF E+S +P P + +PE + P F TPVQTKARSK Sbjct: 141 LADLEWLSHFVEDSFSEFSAAFPTVTENPTACLKEAEPEPEIPVFP--FKTPVQTKARSK 198 Query: 707 RTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQTAESLFXXXXX 886 RTR G+ W FG P P L Y+ Q+ + L Sbjct: 199 RTRNGLRVWPFGSPSFTDSSSSSTTSSFSFFS----PSSPLLIYT---QSLDHLCSEPNT 251 Query: 887 XXXXXXXATESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLLPEYR 1066 ++++ PRRCSHCGVQKTPQWR GP+G KTLCNACGVRFKSGRLLPEYR Sbjct: 252 KKMKKKPSSDTLA----PRRCSHCGVQKTPQWRTGPLGPKTLCNACGVRFKSGRLLPEYR 307 Query: 1067 PACSPTFSSELHSNNHRKVLEMR 1135 PACSPTFSSELHSN+HRKVLEMR Sbjct: 308 PACSPTFSSELHSNHHRKVLEMR 330 >emb|CBI17417.3| unnamed protein product 
[Vitis vinifera] Length = 305 Score = 213 bits (543), Expect = 9e-53 Identities = 142/334 (42%), Positives = 170/334 (50%), Gaps = 19/334 (5%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNAVV 370 E ALKSS + P+ A L QQP DD GNGQ+ +N + Sbjct: 5 EKALKSSV---VRPELAFKLTQQP---ACMDDMCMGNGQSGVSGDDFSIDDLLDFTNGGI 58 Query: 371 ------EDPEEQKQE---------ELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASE 505 E+ EE + + EL END D+F S+PA+E Sbjct: 59 GEGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTTTTFSVK--------DEFPSVPATE 110 Query: 506 LTVPSDDLESLEWLSHFVDDSFPEYSLTYP---VTKLPPMPAKSKTDPEVLVQKKPNCFA 676 LTVP+DDL LEWLSHFV+DSF EYS +P +T+ ++ +PE +Q K +C Sbjct: 111 LTVPADDLADLEWLSHFVEDSFSEYSAPFPHGTLTEKAQNQTENPPEPETPLQIK-SCLK 169 Query: 677 TPVQTKARSKRTRAGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQT 856 TP KARSKR R G WS G P L S + Sbjct: 170 TPFPAKARSKRARTGGRVWSMGS--------------------------PSLTESSSSSS 203 Query: 857 AESLFXXXXXXXXXXXXATESSGGGQQ-PRRCSHCGVQKTPQWRAGPMGAKTLCNACGVR 1033 + S E+SG Q P RCSHCGVQKTPQWR GP+GAKTLCNACGVR Sbjct: 204 SSS-----------SSLDPEASGSAQPTPHRCSHCGVQKTPQWRTGPLGAKTLCNACGVR 252 Query: 1034 FKSGRLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 +KSGRLLPEYRPACSPTFSSE+HSN+HRKVLEMR Sbjct: 253 YKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMR 286 >ref|NP_201433.1| GATA transcription factor 5 [Arabidopsis thaliana] gi|42573812|ref|NP_975002.1| GATA transcription factor 5 [Arabidopsis thaliana] gi|71660777|sp|Q9FH57.1|GATA5_ARATH RecName: Full=GATA transcription factor 5 gi|10177426|dbj|BAB10711.1| GATA-binding transcription factor-like protein [Arabidopsis thaliana] gi|22531223|gb|AAM97115.1| GATA-binding transcription factor-like protein [Arabidopsis thaliana] gi|34098855|gb|AAQ56810.1| At5g66320 [Arabidopsis thaliana] gi|332010815|gb|AED98198.1| GATA transcription factor 5 [Arabidopsis thaliana] gi|332010816|gb|AED98199.1| GATA transcription factor 5 [Arabidopsis thaliana] Length = 339 Score = 212 bits (540), Expect = 2e-52 Identities = 138/330 (41%), Positives = 173/330 (52%), Gaps = 13/330 (3%) 
Frame = +2 Query: 185 VEEVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNA 364 +E+ ALKSS ++ KT + ++++ F +A NG + Sbjct: 1 MEQAALKSSVRKEMALKTTSPVYEE-----FLAVTTAQNGFSVDDFSVDDLLDLSNDDVF 55 Query: 365 VVEDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESLEW 544 E+ + + Q E++ DDFGSLP SEL++P+DDL +LEW Sbjct: 56 ADEETDLKAQHEMVRVSSEEPNDDGDALRRSSDFSGCDDFGSLPTSELSLPADDLANLEW 115 Query: 545 LSHFVDDSFPEYS---LTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATPVQTKARSKRTR 715 LSHFV+DSF EYS LT T+ P + P V ++ CF +PV KARSKR R Sbjct: 116 LSHFVEDSFTEYSGPNLTGTPTEKPAWLTGDRKHPVTAVTEE-TCFKSPVPAKARSKRNR 174 Query: 716 AGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSG-----PGQTAESLFXXX 880 G+ WS G P PW +SG P T+E Sbjct: 175 NGLKVWSLGSSSSSGPSSSGSTSSSSSG-----PSSPW--FSGAELLEPVVTSER----P 223 Query: 881 XXXXXXXXXATESSGGGQ----QP-RRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSG 1045 + ES G+ QP R+CSHCGVQKTPQWRAGPMGAKTLCNACGVR+KSG Sbjct: 224 PFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSG 283 Query: 1046 RLLPEYRPACSPTFSSELHSNNHRKVLEMR 1135 RLLPEYRPACSPTFSSELHSN+HRKV+EMR Sbjct: 284 RLLPEYRPACSPTFSSELHSNHHRKVIEMR 313 >ref|XP_006572850.1| PREDICTED: uncharacterized protein LOC100783966 isoform X1 [Glycine max] Length = 308 Score = 211 bits (536), Expect = 6e-52 Identities = 130/315 (41%), Positives = 163/315 (51%) Frame = +2 Query: 191 EVALKSSFGPDLPPKTATYLHQQPQQQVFSDDCSAGNGQNAXXXXXXXXXXXXXXSNAVV 370 E ALKS++ ++ K + Q F+++ S NG + V Sbjct: 5 EAALKSNYRKEMTLKLSP--------QTFTEEVSVQNGTTCDDFFVNDLLDF----SHVE 52 Query: 371 EDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESLEWLS 550 E+PE+Q+ + DD+ S+P SEL+V +DDL LEWLS Sbjct: 53 EEPEQQEDTPCVSLQHENPSHEPCTFK--------DDYASVPTSELSVLADDLADLEWLS 104 Query: 551 HFVDDSFPEYSLTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATPVQTKARSKRTRAGVPG 730 HFV+DSF E+S +P P + +PE + F TPVQTKARSKRTR G+ Sbjct: 105 HFVEDSFSEFSAAFPTVTENPTACLKEAEPEPEIPVFS--FKTPVQTKARSKRTRNGLRV 162 Query: 731 WSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQTAESLFXXXXXXXXXXXXA 910 W FG P P L Y+ Q+ + L + Sbjct: 163 
WPFGSPSFTDSSSSSTTSSSSSSS----PSSPLLIYT---QSLDHLCSEPNTKKMKKKPS 215 Query: 911 TESSGGGQQPRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLLPEYRPACSPTFS 1090 +++ PRRCSHCGVQKTPQWR GP+G KTLCNACGVRFKSGRLLPEYRPACSPTFS Sbjct: 216 SDTLA----PRRCSHCGVQKTPQWRTGPLGPKTLCNACGVRFKSGRLLPEYRPACSPTFS 271 Query: 1091 SELHSNNHRKVLEMR 1135 SELHSN+HRKVLEMR Sbjct: 272 SELHSNHHRKVLEMR 286 >ref|XP_006280758.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] gi|565433824|ref|XP_006280759.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] gi|482549462|gb|EOA13656.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] gi|482549463|gb|EOA13657.1| hypothetical protein CARUB_v10026725mg [Capsella rubella] Length = 342 Score = 210 bits (534), Expect = 1e-51 Identities = 125/261 (47%), Positives = 146/261 (55%), Gaps = 4/261 (1%) Frame = +2 Query: 365 VVEDPEEQKQEELLENDXXXXXXXXXXXXXXXXXXKDDDFGSLPASELTVPSDDLESLEW 544 V + EE+++EE L +D D GSLP SEL+VP+DDL +LEW Sbjct: 72 VSSEEEEEEEEEELNDDGDALPRCI------------DFSGSLPTSELSVPADDLANLEW 119 Query: 545 LSHFVDDSFPEYS---LTYPVTKLPPMPAKSKTDPEVLVQKKPNCFATPVQTKARSKRTR 715 LSHFV+DSF EYS LT T+ P + P V + +CF +PV KARSKR R Sbjct: 120 LSHFVEDSFTEYSGPNLTGTPTEKPAWLTGDRKHP-VTPATQESCFKSPVPAKARSKRHR 178 Query: 716 AGVPGWSFGXXXXXXXXXXXXXXXXXXXXXXXXPCHPWLFYSGPGQTAESLFXXXXXXXX 895 GV WS G P PW + + + Sbjct: 179 NGVKAWSLGSSSSSGPSSSGSTSSSSSSSG---PSSPWFSGADLFEPMVASERPPFPKKH 235 Query: 896 XXXXATESSGGGQQP-RRCSHCGVQKTPQWRAGPMGAKTLCNACGVRFKSGRLLPEYRPA 1072 A + G QP RRCSHCGVQKTPQWRAGPMGAKTLCNACGVR+KSGRLLPEYRPA Sbjct: 236 KKRSAESAFCGQLQPQRRCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPA 295 Query: 1073 CSPTFSSELHSNNHRKVLEMR 1135 CSPTFSSELHSN+HRKV+EMR Sbjct: 296 CSPTFSSELHSNHHRKVMEMR 316