BLASTX nr result
ID: Sinomenium21_contig00005383
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00005383
         (1158 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261...   179   2e-42
gb|EXB38836.1| Putative GATA transcription factor 22 [Morus nota...   176   2e-41
ref|XP_007012845.1| GATA type zinc finger transcription factor f...   173   1e-40
ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus c...   171   4e-40
gb|ADL36695.1| GATA domain class transcription factor [Malus dom...   161   6e-37
ref|XP_006353530.1| PREDICTED: putative GATA transcription facto...   159   2e-36
ref|XP_004287558.1| PREDICTED: uncharacterized protein LOC101297...   157   1e-35
gb|ADL36692.1| GATA domain class transcription factor [Malus dom...   157   1e-35
ref|XP_007203151.1| hypothetical protein PRUPE_ppa024374mg [Prun...   156   2e-35
ref|XP_002279283.1| PREDICTED: putative GATA transcription facto...   156   2e-35
emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera]   156   2e-35
ref|XP_002866169.1| hypothetical protein ARALYDRAFT_495776 [Arab...   155   4e-35
ref|XP_006401276.1| hypothetical protein EUTSA_v10013793mg [Eutr...   154   5e-35
ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citr...   154   9e-35
ref|XP_007012281.1| GATA type zinc finger transcription factor f...   153   1e-34
ref|XP_004251667.1| PREDICTED: putative GATA transcription facto...   153   1e-34
ref|XP_006280600.1| hypothetical protein CARUB_v10026556mg [Caps...   152   2e-34
ref|NP_200497.1| GATA transcription factor 21 [Arabidopsis thali...
151 6e-34 ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like... 151 6e-34 ref|XP_002516445.1| conserved hypothetical protein [Ricinus comm... 151 6e-34 >ref|XP_002282173.1| PREDICTED: uncharacterized protein LOC100261004 [Vitis vinifera] gi|297738668|emb|CBI27913.3| unnamed protein product [Vitis vinifera] Length = 309 Score = 179 bits (455), Expect = 2e-42 Identities = 121/262 (46%), Positives = 141/262 (53%), Gaps = 6/262 (2%) Frame = -2 Query: 953 QPQLEA-SKDVLHGGSSDHEFPSSSSIPSMENNIDYDLKFSMIWKHDENKXXXXXXXXXX 777 QPQ EA K V GGS DH P++E+ D LK + IWK ++ Sbjct: 66 QPQQEAHDKFVFRGGSYDH--------PTLESESDNGLKLT-IWKTEDRNENHSENGSVK 116 Query: 776 XXXXSVXXXXXXXXXXXXXXXXSRVEGERLHKFQNQKIKPXXXXXXXXXXXXXXXXXXXN 597 + + F + K + Sbjct: 117 WMSSKMRVMQKMMISDQTGA---QKPSNTALNFGDHKQQSLPSETDYNSINSSNINSNNT 173 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVH----PNETTITK 429 IRVC+DCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A + P T TK Sbjct: 174 IRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAATANGTILPTNTAPTK 233 Query: 428 QSKVMNHKEKRSGKGYNITNYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKNSAFHRVF 249 HK+K+S G+ +++YKKRCKL + KLCFEDFTISL SKNSAFHRVF Sbjct: 234 TK--AKHKDKKSSNGH-VSHYKKRCKLAAAPSCE--TKKLCFEDFTISL-SKNSAFHRVF 287 Query: 248 PQDE-KEAAILLMALSCGLVHG 186 QDE KEAAILLMALSCGLVHG Sbjct: 288 LQDEIKEAAILLMALSCGLVHG 309 >gb|EXB38836.1| Putative GATA transcription factor 22 [Morus notabilis] Length = 335 Score = 176 bits (445), Expect = 2e-41 Identities = 94/142 (66%), Positives = 105/142 (73%), Gaps = 5/142 (3%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVH----PNETTITK 429 IRVC+DCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A + + T K Sbjct: 197 IRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAANGTILATDATTMK 256 Query: 428 QSKVMNHKEKRSGKGYNIT-NYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKNSAFHRV 252 S + KEK+ G + +KKRCKLT S R G K+CFED IS+ SKNSAF RV Sbjct: 257 SSTKVQRKEKKPKNGNGVVPQFKKRCKLTASPSR--GRKKICFEDLAISI-SKNSAFQRV 313 Query: 251 FPQDEKEAAILLMALSCGLVHG 186 FPQDEK+AAILLMALS GLVHG Sbjct: 314 
FPQDEKDAAILLMALSYGLVHG 335 >ref|XP_007012845.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] gi|508783208|gb|EOY30464.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 302 Score = 173 bits (439), Expect = 1e-40 Identities = 94/140 (67%), Positives = 107/140 (76%), Gaps = 3/140 (2%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSA---IVHPNETTITKQ 426 IRVC+DCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A + +TT T + Sbjct: 168 IRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAANGAIVAAQTTPTMK 227 Query: 425 SKVMNHKEKRSGKGYNITNYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKNSAFHRVFP 246 SKV + K KRS + KK+CK ++ + G KLCFED I ++SKNSAFHRVFP Sbjct: 228 SKVQD-KSKRSSNSGCVAQLKKKCKHSSQSQ---GRKKLCFEDLRI-ILSKNSAFHRVFP 282 Query: 245 QDEKEAAILLMALSCGLVHG 186 QDEKEAAILLMALS GLVHG Sbjct: 283 QDEKEAAILLMALSYGLVHG 302 >ref|XP_002514107.1| hypothetical protein RCOM_1046780 [Ricinus communis] gi|223546563|gb|EEF48061.1| hypothetical protein RCOM_1046780 [Ricinus communis] Length = 312 Score = 171 bits (434), Expect = 4e-40 Identities = 91/141 (64%), Positives = 105/141 (74%), Gaps = 4/141 (2%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSA----IVHPNETTITK 429 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA+AA A + +T K Sbjct: 177 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRALAAAQASANGTIFAPDTAAMK 236 Query: 428 QSKVMNHKEKRSGKGYNITNYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKNSAFHRVF 249 +KV N KEKR+ + +KKRCK T + G KLCFED + +++SKNSAF ++F Sbjct: 237 TNKVQN-KEKRTNNSH--LPFKKRCKFTAQS--RGSRKKLCFEDLSSTILSKNSAFQQLF 291 Query: 248 PQDEKEAAILLMALSCGLVHG 186 PQDEKEAAILLMALS GLVHG Sbjct: 292 PQDEKEAAILLMALSYGLVHG 312 >gb|ADL36695.1| GATA domain class transcription factor [Malus domestica] Length = 359 Score = 161 bits (407), Expect = 6e-37 Identities = 89/152 (58%), Positives = 102/152 (67%), Gaps = 15/152 (9%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQSKV 417 
IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A T+ S Sbjct: 208 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAASGTTLTVAAPSMK 267 Query: 416 MNHKEKRSGKG--YNITNYKKR--CKLTTSTPRPGGDNKLCFEDFTISLMSKNS------ 267 + + ++ K + +KKR KL++S G KLCFEDFTIS+ + +S Sbjct: 268 SSKVQPKANKSRVSSTVPFKKRPYNKLSSSPSSRGKSKKLCFEDFTISMKNNSSSGNPTA 327 Query: 266 -----AFHRVFPQDEKEAAILLMALSCGLVHG 186 A RVFPQDEKEAAILLMALSCGLVHG Sbjct: 328 ATTTTALQRVFPQDEKEAAILLMALSCGLVHG 359 >ref|XP_006353530.1| PREDICTED: putative GATA transcription factor 22-like [Solanum tuberosum] Length = 323 Score = 159 bits (403), Expect = 2e-36 Identities = 95/170 (55%), Positives = 106/170 (62%), Gaps = 33/170 (19%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVH------PNETTI 435 IRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRKARRA AA +A + ETT Sbjct: 155 IRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAAAAAAAATNNGTNFTSTETTT 214 Query: 434 TKQSKVMNHKEKRSGKGYN-ITNYKKRCKL---TTSTPRP--------GGDN-------- 315 T + KV K K + N + +KKRCK TT+TP P G + Sbjct: 215 TMKIKVQQQKHKITKVNTNHVVPFKKRCKFLSNTTTTPAPVPAPAPRVGSSSSSSSYNNN 274 Query: 314 -------KLCFEDFTISLMSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 186 LCFEDF ++L S N A HRVFPQDEKEAAILLMALS GLVHG Sbjct: 275 NDVQQKKNLCFEDFFVNL-SNNLAIHRVFPQDEKEAAILLMALSSGLVHG 323 >ref|XP_004287558.1| PREDICTED: uncharacterized protein LOC101297577 [Fragaria vesca subsp. 
vesca] Length = 357 Score = 157 bits (396), Expect = 1e-35 Identities = 97/196 (49%), Positives = 110/196 (56%), Gaps = 22/196 (11%) Frame = -2 Query: 707 RVEGERLHKFQNQKIKPXXXXXXXXXXXXXXXXXXXNIRVCSDCNTTKTPLWRSGPRGPK 528 RV H F+ QK+ P IRVCSDCNTTKTPLWRSGPRGPK Sbjct: 170 RVNFSASHNFEEQKLHPLSPLGTDSSYSTNP------IRVCSDCNTTKTPLWRSGPRGPK 223 Query: 527 SLCNACGIRQRKARRAMAATSAIVHPNETTI-TKQSKVMNHKEKRSGKGYNITNYKKRCK 351 SLCNACGIRQRKARRAMAA +A N TT+ + + M K K +KKRC Sbjct: 224 SLCNACGIRQRKARRAMAAAAAAA--NSTTLAVEAAPSMIKTSKVKLKDNKTIPFKKRCH 281 Query: 350 LTTSTPRPGGDN--KLCFEDFTISLMSKNS-------------------AFHRVFPQDEK 234 +P P G + KL FEDF++S M++NS F RVFPQDEK Sbjct: 282 KLAISPSPRGKSKTKLRFEDFSVSSMNQNSGTDPPPPPTTTTTTTTTTTTFQRVFPQDEK 341 Query: 233 EAAILLMALSCGLVHG 186 EAAILLMALSCGLV G Sbjct: 342 EAAILLMALSCGLVRG 357 >gb|ADL36692.1| GATA domain class transcription factor [Malus domestica] Length = 342 Score = 157 bits (396), Expect = 1e-35 Identities = 90/145 (62%), Positives = 103/145 (71%), Gaps = 8/145 (5%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTIT----- 432 IRVCSDC+TTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A + TT+T Sbjct: 202 IRVCSDCSTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAAAASGTTLTVAAPS 261 Query: 431 -KQSKVMNHKEKRSGKGYNITNYKKR--CKLTTSTPRPGGDNKLCFEDFTISLMSKNSAF 261 K SKV HK+ +S + +KKR KLT+S G KLCFE T + + +A Sbjct: 262 MKSSKV-QHKDNKSRVSSTVP-FKKRPYNKLTSSPSSRGKSKKLCFEAPTAA--AATTAL 317 Query: 260 HRVFPQDEKEAAILLMALSCGLVHG 186 RVFPQDE+EAAILLMALSCGLVHG Sbjct: 318 QRVFPQDEREAAILLMALSCGLVHG 342 >ref|XP_007203151.1| hypothetical protein PRUPE_ppa024374mg [Prunus persica] gi|462398682|gb|EMJ04350.1| hypothetical protein PRUPE_ppa024374mg [Prunus persica] Length = 297 Score = 156 bits (394), Expect = 2e-35 Identities = 90/154 (58%), Positives = 106/154 (68%), Gaps = 17/154 (11%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTIT----- 432 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A + TT+ Sbjct: 147 
IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAA--SGTTLAAAPSM 204 Query: 431 KQSKVMNHKEKRSGKGYNITNYKKR--CKLTTSTPRPG-GDNKLCFEDFTISLMSKNS-- 267 K + HK+ + +G + +KKR KL+++ P G KLCFEDF IS+ + +S Sbjct: 205 KSTSKAQHKDNKP-RGASTVPFKKRPYNKLSSTPPSKGRPPKKLCFEDFAISMDNNHSSS 263 Query: 266 -------AFHRVFPQDEKEAAILLMALSCGLVHG 186 + RVFPQDEKEAAILLMALSCGLVHG Sbjct: 264 ATTTTTTSLQRVFPQDEKEAAILLMALSCGLVHG 297 >ref|XP_002279283.1| PREDICTED: putative GATA transcription factor 22 [Vitis vinifera] gi|296081660|emb|CBI20665.3| unnamed protein product [Vitis vinifera] Length = 306 Score = 156 bits (394), Expect = 2e-35 Identities = 84/136 (61%), Positives = 96/136 (70%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQSKV 417 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A N T + + Sbjct: 172 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAA-ANGTAVGTEISP 230 Query: 416 MNHKEKRSGKGYNITNYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKNSAFHRVFPQDE 237 M K K + +N ++ KL P + KLCFEDFT S+ KNS F RVFP+DE Sbjct: 231 MKMKLPNKEKKMHTSNVGQQKKLCKPPCPPPTEKKLCFEDFTSSI-CKNSGFRRVFPRDE 289 Query: 236 KEAAILLMALSCGLVH 189 +EAAILLMALSC LV+ Sbjct: 290 EEAAILLMALSCDLVY 305 >emb|CAN63090.1| hypothetical protein VITISV_032017 [Vitis vinifera] Length = 211 Score = 156 bits (394), Expect = 2e-35 Identities = 84/136 (61%), Positives = 96/136 (70%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQSKV 417 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A N T + + Sbjct: 77 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAAA-ANGTAVGTEISP 135 Query: 416 MNHKEKRSGKGYNITNYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKNSAFHRVFPQDE 237 M K K + +N ++ KL P + KLCFEDFT S+ KNS F RVFP+DE Sbjct: 136 MKMKLPNKEKKMHTSNVGQQKKLCKPPCPPPTEKKLCFEDFTSSI-CKNSGFRRVFPRDE 194 Query: 236 KEAAILLMALSCGLVH 189 +EAAILLMALSC LV+ Sbjct: 195 EEAAILLMALSCDLVY 210 >ref|XP_002866169.1| hypothetical protein ARALYDRAFT_495776 [Arabidopsis lyrata subsp. 
lyrata] gi|297312004|gb|EFH42428.1| hypothetical protein ARALYDRAFT_495776 [Arabidopsis lyrata subsp. lyrata] Length = 396 Score = 155 bits (391), Expect = 4e-35 Identities = 89/172 (51%), Positives = 108/172 (62%), Gaps = 35/172 (20%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQS-- 423 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA A +A E + +S Sbjct: 226 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAAAAAAGDQEVVVASRSSQ 285 Query: 422 ----KVMNHKEKRS--GKGYN----ITNYKKRCKL-----------------------TT 342 K + +K+KRS G+ YN + K+CK+ TT Sbjct: 286 LLLKKKLQNKKKRSNGGEKYNLSPPVVAKAKKCKIREEDEVDMEAETMIARDLEISKSTT 345 Query: 341 STPRPGGDNKLCFEDFTISLMSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 186 S+ NKLCF+D TI ++SK+SA+ +VFPQDEKEAA+LLMALS G+VHG Sbjct: 346 SSNSSISSNKLCFDDLTI-MLSKSSAYQQVFPQDEKEAAVLLMALSYGMVHG 396 >ref|XP_006401276.1| hypothetical protein EUTSA_v10013793mg [Eutrema salsugineum] gi|557102366|gb|ESQ42729.1| hypothetical protein EUTSA_v10013793mg [Eutrema salsugineum] Length = 384 Score = 154 bits (390), Expect = 5e-35 Identities = 88/163 (53%), Positives = 105/163 (64%), Gaps = 26/163 (15%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQ--- 426 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA A +A + +Q Sbjct: 224 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAAAAAAGDQDVVAARQQLP 283 Query: 425 -SKVMNHKEKRSGKGYN----ITNYKKRCKL------------------TTSTPRPGGDN 315 K + +K+KR K YN + K+CK+ TTS+ N Sbjct: 284 VKKKLQNKKKRCDK-YNLSPPVVAKAKKCKIIEEEVPAMAAGDSEISKSTTSSDSSISSN 342 Query: 314 KLCFEDFTISLMSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 186 KLCF+D TI ++SK+SA+ +VFPQDEKEAAILLMALS G+VHG Sbjct: 343 KLCFDDLTI-MLSKSSAYQQVFPQDEKEAAILLMALSYGMVHG 384 >ref|XP_006451458.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] gi|568843031|ref|XP_006475428.1| PREDICTED: putative GATA transcription factor 22-like [Citrus sinensis] gi|557554684|gb|ESR64698.1| hypothetical protein CICLE_v10009004mg [Citrus clementina] Length = 306 Score = 154 bits (388), Expect = 9e-35 
Identities = 84/143 (58%), Positives = 95/143 (66%), Gaps = 6/143 (4%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTI-----T 432 IRVC+DCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAA +A N T + Sbjct: 168 IRVCADCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAA----NGTAVQLAADD 223 Query: 431 KQSKVMNHKEKRSGKGYNITNYKKRCKLTTSTPRPGGDNKLCFEDFTISLMSKN-SAFHR 255 S K R + +KKRCK +++P G FED T++L N SA R Sbjct: 224 TSSNKKKSKTPRPSNNNSCLPFKKRCKYNSNSPSRGKKKLCSFEDLTLNLSKNNSSALQR 283 Query: 254 VFPQDEKEAAILLMALSCGLVHG 186 VFPQ+EKEAAILLMALS GLVHG Sbjct: 284 VFPQEEKEAAILLMALSYGLVHG 306 >ref|XP_007012281.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] gi|508782644|gb|EOY29900.1| GATA type zinc finger transcription factor family protein, putative [Theobroma cacao] Length = 311 Score = 153 bits (387), Expect = 1e-34 Identities = 84/143 (58%), Positives = 96/143 (67%), Gaps = 7/143 (4%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQS-- 423 +RVCSDCNTT TPLWRSGPRGPKSLCNACGIRQRKARRAM A +A N + Sbjct: 174 VRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMEAAAAAAAENGAAAAADASS 233 Query: 422 ---KVMNHKEKRSGKGYNITNYKKRCKLTTSTP--RPGGDNKLCFEDFTISLMSKNSAFH 258 KV HKEK+S T++ +CK P P KLCF++F +SL SKNSA Sbjct: 234 MKIKVHIHKEKKSR-----TSHVAQCKKQVKPPYYSPQSQKKLCFKEFALSL-SKNSALQ 287 Query: 257 RVFPQDEKEAAILLMALSCGLVH 189 RVFPQD ++AAILLM LSCGLVH Sbjct: 288 RVFPQDVEDAAILLMELSCGLVH 310 >ref|XP_004251667.1| PREDICTED: putative GATA transcription factor 22-like [Solanum lycopersicum] Length = 326 Score = 153 bits (387), Expect = 1e-34 Identities = 95/179 (53%), Positives = 103/179 (57%), Gaps = 42/179 (23%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAI-VHPNE-------- 444 IRVCSDCNTTKTPLWRSGP+GPKSLCNACGIRQRKARRA AA +A PN Sbjct: 149 IRVCSDCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAAAAAAAASTTPNNGTNFTSTE 208 Query: 443 --TTITKQSKVMNHKEKRSGKGYN-ITNYKKRCKL----TTSTPRPG------------- 324 TT T + KV K K + N + +KKRCK TT P PG Sbjct: 209 
TTTTTTMKIKVQQQKHKITKVNANHVVPFKKRCKFLSSTTTPAPEPGLVPTPAPRVGSSS 268 Query: 323 -------------GDNKLCFEDFTISLMSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 186 K+CFEDF I+L S N A HRVFPQDEKEAAILLMALS LVHG Sbjct: 269 SSSFYNNNNNDVQQKKKICFEDFFINL-SNNLAIHRVFPQDEKEAAILLMALSSDLVHG 326 >ref|XP_006280600.1| hypothetical protein CARUB_v10026556mg [Capsella rubella] gi|482549304|gb|EOA13498.1| hypothetical protein CARUB_v10026556mg [Capsella rubella] Length = 395 Score = 152 bits (385), Expect = 2e-34 Identities = 87/167 (52%), Positives = 106/167 (63%), Gaps = 30/167 (17%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQ--- 426 +RVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA A +A E + + Sbjct: 230 VRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAAAAASGDQEVAVAARVQQ 289 Query: 425 ---SKVMNHKEKRS--GKGYN----ITNYKKRCKL------------------TTSTPRP 327 K + +K+KRS G+ YN + K+CK+ TTS+ Sbjct: 290 SPLKKKLQNKKKRSNGGEKYNLSPPVVAKAKKCKMVQAEEEETVAGDSEISKSTTSSNSS 349 Query: 326 GGDNKLCFEDFTISLMSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 186 NK CF+D TI ++SK+SA+ +VFPQDEKEAAILLMALS G+VHG Sbjct: 350 ISSNKFCFDDLTI-MLSKSSAYQQVFPQDEKEAAILLMALSYGMVHG 395 >ref|NP_200497.1| GATA transcription factor 21 [Arabidopsis thaliana] gi|71660831|sp|Q5HZ36.2|GAT21_ARATH RecName: Full=GATA transcription factor 21 gi|8809654|dbj|BAA97205.1| unnamed protein product [Arabidopsis thaliana] gi|109134121|gb|ABG25059.1| At5g56860 [Arabidopsis thaliana] gi|332009432|gb|AED96815.1| GATA transcription factor 21 [Arabidopsis thaliana] Length = 398 Score = 151 bits (381), Expect = 6e-34 Identities = 87/171 (50%), Positives = 106/171 (61%), Gaps = 34/171 (19%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQ--- 426 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA A +A E + + Sbjct: 229 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAAMAAAAAAGDQEVAVAPRVQQ 288 Query: 425 ---SKVMNHKEKRS--GKGYN----ITNYKKRCKL----------------------TTS 339 K + +K+KRS G+ YN + K+CK+ TTS Sbjct: 289 LPLKKKLQNKKKRSNGGEKYNHSPPMVAKAKKCKIKEEEEKEMEAETVAGDSEISKSTTS 
348 Query: 338 TPRPGGDNKLCFEDFTISLMSKNSAFHRVFPQDEKEAAILLMALSCGLVHG 186 + NK CF+D TI ++SK+SA+ +VFPQDEKEAA+LLMALS G+VHG Sbjct: 349 SNSSISSNKFCFDDLTI-MLSKSSAYQQVFPQDEKEAAVLLMALSYGMVHG 398 >ref|XP_006346565.1| PREDICTED: GATA transcription factor 21-like [Solanum tuberosum] Length = 222 Score = 151 bits (381), Expect = 6e-34 Identities = 84/140 (60%), Positives = 100/140 (71%), Gaps = 3/140 (2%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQSKV 417 IRVC+DCNTTKTPLWRSGP+GPKSLCNACGIRQRKARRAMAA + ++T + + KV Sbjct: 86 IRVCTDCNTTKTPLWRSGPKGPKSLCNACGIRQRKARRAMAAAANGKTDHQTAM--KIKV 143 Query: 416 MNHKEK--RSGKGYNITNYKKRCKL-TTSTPRPGGDNKLCFEDFTISLMSKNSAFHRVFP 246 HK + ++T +KKRCKL +S+ KL FED I+L S AF ++FP Sbjct: 144 QQHKPNITKVRTNNHVTPFKKRCKLGPSSSGTNNAPKKLGFEDLLINL-SNQLAFQQIFP 202 Query: 245 QDEKEAAILLMALSCGLVHG 186 QDEKEAAILLMALS GLVHG Sbjct: 203 QDEKEAAILLMALSSGLVHG 222 >ref|XP_002516445.1| conserved hypothetical protein [Ricinus communis] gi|223544265|gb|EEF45786.1| conserved hypothetical protein [Ricinus communis] Length = 186 Score = 151 bits (381), Expect = 6e-34 Identities = 86/140 (61%), Positives = 99/140 (70%), Gaps = 3/140 (2%) Frame = -2 Query: 596 IRVCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAATSAIVHPNETTITKQSKV 417 IRVCSDCNTT TPLWRSGPRGPKSLCNACGIRQRKARRAMAA +AI ET+ TK +KV Sbjct: 54 IRVCSDCNTTTTPLWRSGPRGPKSLCNACGIRQRKARRAMAAAAAIA--METSSTKAAKV 111 Query: 416 MNHKEKRSGKGYNITNYKKRCKLTTSTPRP---GGDNKLCFEDFTISLMSKNSAFHRVFP 246 KEK+S G+ + KK CK P P G K+ F++ +SL S NSA RVFP Sbjct: 112 ---KEKKSRTGH-ASQCKKLCKPPDHPPPPYNQGQKPKVSFKNLALSL-SNNSALQRVFP 166 Query: 245 QDEKEAAILLMALSCGLVHG 186 +D +EAA LLM LSCG +HG Sbjct: 167 EDVEEAATLLMELSCGFIHG 186