BLASTX nr result
ID: Sinomenium22_contig00022772
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00022772
         (1094 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002282225.1| PREDICTED: GATA transcription factor 9-like ...   286   1e-74
emb|CAN83570.1| hypothetical protein VITISV_041707 [Vitis vinifera]   286   1e-74
emb|CBI29927.3| unnamed protein product [Vitis vinifera]              281   4e-73
ref|XP_007030701.1| GATA transcription factor 9, putative [Theob...   268   3e-69
ref|XP_002325826.2| hypothetical protein POPTR_0019s04860g [Popu...   250   8e-64
ref|XP_002319169.2| hypothetical protein POPTR_0013s05610g [Popu...   248   3e-63
ref|XP_003548758.1| PREDICTED: GATA transcription factor 9-like ...   246   9e-63
ref|XP_007161927.1| hypothetical protein PHAVU_001G109500g [Phas...   244   4e-62
ref|XP_003554005.1| PREDICTED: GATA transcription factor 5-like ...   241   5e-61
gb|ACU17869.1| unknown [Glycine max]                                  238   4e-60
ref|XP_003624611.1| GATA transcription factor [Medicago truncatu...   237   6e-60
ref|XP_003518214.1| PREDICTED: GATA transcription factor 9-like ...   237   7e-60
ref|XP_003548060.1| PREDICTED: GATA transcription factor 9-like ...   233   8e-59
ref|XP_004493146.1| PREDICTED: GATA transcription factor 9-like ...   233   1e-58
ref|XP_007151835.1| hypothetical protein PHAVU_004G079200g [Phas...   233   1e-58
ref|XP_006574751.1| PREDICTED: GATA transcription factor 9-like ...   232   2e-58
ref|XP_006443395.1| hypothetical protein CICLE_v10021190mg [Citr...   231   3e-58
ref|XP_006826302.1| hypothetical protein AMTR_s00004p00074220 [A...   205   3e-50
ref|XP_003619010.1| GATA transcription factor [Medicago truncatu... 
198 3e-48 ref|XP_004141141.1| PREDICTED: GATA transcription factor 9-like ... 192 2e-46 >ref|XP_002282225.1| PREDICTED: GATA transcription factor 9-like [Vitis vinifera] Length = 299 Score = 286 bits (731), Expect = 1e-74 Identities = 163/304 (53%), Positives = 189/304 (62%), Gaps = 20/304 (6%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEP------DKPGGLST-SLDDLFPSENPESDVNLEWLSIFV 217 MDF R V + EY QE K GGLS SLDDLF ++N E DV+LEWLSIFV Sbjct: 1 MDFYREV----SVSGEYPQEQVPSTVCSKLGGLSAGSLDDLFSTQNTEVDVSLEWLSIFV 56 Query: 218 EDCLSSSGNCIPPPPNDLHLQHNSSPVPSKPC--LNKTHQNPPPKLQDFAVPAKPRTKRK 391 EDCLSS+GNC+P P N +++P PSKP + K Q P LQ+ +P K R+KRK Sbjct: 57 EDCLSSTGNCLPAPKNVA--SDSATPKPSKPLQSMQKPQQKPSSPLQNLVIPGKARSKRK 114 Query: 392 RXXXXXXXXXXXXXXXXXQAQ----ITSDPPLLHQAYWLADSELIVPKKE-------NKY 538 R + Q +SDPPLL QAYWLADSELIVPKKE N Sbjct: 115 RATTITTSFSNWVHHLNPENQNLHITSSDPPLLQQAYWLADSELIVPKKEESSSNNNNNN 174 Query: 539 STXXXXXXXXXXXXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRT 718 ++ EV+ V K N G+ + NGQ RRCTHCLAQRT Sbjct: 175 NSMVKEEEEVEEEEEEEEEEEETREEVEVEVEKGNKERWGNLEGSNGQPRRCTHCLAQRT 234 Query: 719 PQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSS 898 PQWRAGPLGPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRM++LSS Sbjct: 235 PQWRAGPLGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMAVLSS 294 Query: 899 VPTE 910 +P++ Sbjct: 295 IPSD 298 >emb|CAN83570.1| hypothetical protein VITISV_041707 [Vitis vinifera] Length = 620 Score = 286 bits (731), Expect = 1e-74 Identities = 163/304 (53%), Positives = 189/304 (62%), Gaps = 20/304 (6%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEP------DKPGGLST-SLDDLFPSENPESDVNLEWLSIFV 217 MDF R V + EY QE K GGLS SLDDLF ++N E DV+LEWLSIFV Sbjct: 322 MDFYREV----SVSGEYPQEQVPSTVCSKLGGLSAGSLDDLFSTQNTEVDVSLEWLSIFV 377 Query: 218 EDCLSSSGNCIPPPPNDLHLQHNSSPVPSKPC--LNKTHQNPPPKLQDFAVPAKPRTKRK 391 EDCLSS+GNC+P P N +++P PSKP + K Q P LQ+ +P K R+KRK Sbjct: 378 EDCLSSTGNCLPAPKNVA--SDSATPKPSKPLQSMQKPQQKPSSPLQNLVIPGKARSKRK 435 Query: 392 
RXXXXXXXXXXXXXXXXXQAQ----ITSDPPLLHQAYWLADSELIVPKKE-------NKY 538 R + Q +SDPPLL QAYWLADSELIVPKKE N Sbjct: 436 RATTITTSFSNWVHHLNPENQNLHITSSDPPLLQQAYWLADSELIVPKKEESSSNNNNNN 495 Query: 539 STXXXXXXXXXXXXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRT 718 ++ EV+ V K N G+ + NGQ RRCTHCLAQRT Sbjct: 496 NSMVKEEEEVEEEEEEEEEEEETREEVEVEVEKGNKERWGNLEGSNGQPRRCTHCLAQRT 555 Query: 719 PQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSS 898 PQWRAGPLGPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRM++LSS Sbjct: 556 PQWRAGPLGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMAVLSS 615 Query: 899 VPTE 910 +P++ Sbjct: 616 IPSD 619 >emb|CBI29927.3| unnamed protein product [Vitis vinifera] Length = 265 Score = 281 bits (718), Expect = 4e-73 Identities = 159/297 (53%), Positives = 181/297 (60%), Gaps = 13/297 (4%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEP------DKPGGLST-SLDDLFPSENPESDVNLEWLSIFV 217 MDF R V + EY QE K GGLS SLDDLF ++N E DV+LEWLSIFV Sbjct: 1 MDFYREV----SVSGEYPQEQVPSTVCSKLGGLSAGSLDDLFSTQNTEVDVSLEWLSIFV 56 Query: 218 EDCLSSSGNCIPPPPNDLHLQHNSSPVPSKP--CLNKTHQNPPPKLQDFAVPAKPRTKRK 391 EDCLSS+GNC+P P N +++P PSKP + K Q P LQ+ +P K R+KRK Sbjct: 57 EDCLSSTGNCLPAPKN--VASDSATPKPSKPLQSMQKPQQKPSSPLQNLVIPGKARSKRK 114 Query: 392 RXXXXXXXXXXXXXXXXXQAQ----ITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXX 559 R + Q +SDPPLL QAYWLADSELIVPKKE S Sbjct: 115 RATTITTSFSNWVHHLNPENQNLHITSSDPPLLQQAYWLADSELIVPKKEESSSNNNN-- 172 Query: 560 XXXXXXXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGP 739 NS + NGQ RRCTHCLAQRTPQWRAGP Sbjct: 173 -------------------------NNNSMVKEEEEGSNGQPRRCTHCLAQRTPQWRAGP 207 Query: 740 LGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE 910 LGPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRM++LSS+P++ Sbjct: 208 LGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMAVLSSIPSD 264 >ref|XP_007030701.1| GATA transcription factor 9, putative [Theobroma cacao] gi|508719306|gb|EOY11203.1| GATA transcription factor 9, putative [Theobroma cacao] Length = 302 Score = 268 bits (685), Expect = 3e-69 
Identities = 162/312 (51%), Positives = 189/312 (60%), Gaps = 28/312 (8%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEP------DKPGG-LSTS--LDDLFPSENPESDVNLEWLSI 211 MDFC+NV + EY QE K G L+T+ LDDLFP++N E D +LEWLSI Sbjct: 1 MDFCQNV----SVSGEYHQEQVLSSPCSKLGATLATTGTLDDLFPAQNTEVDKSLEWLSI 56 Query: 212 FVEDCLSSSGNCIPPPPNDLHLQHNSSPVPSKPCLN---KTHQNPPPKLQDFAVPAKPRT 382 FVEDCLSS+GNCIP ++Q+ S+ +KP + K Q PP LQ F VP K R+ Sbjct: 57 FVEDCLSSTGNCIPVAATT-NVQNKSTTTATKPAQSLQQKPQQIIPPSLQKFVVPGKARS 115 Query: 383 KRKRXXXXXXXXXXXXXXXXXQAQI----------TSDPPLLHQAYWLADSELIVPKKEN 532 KRKR Q+ +SDPPLL QAYWLADSELIVPKKE+ Sbjct: 116 KRKRVAATTLSKTKMNPFTSWSYQLNSHNQNLHLASSDPPLLQQAYWLADSELIVPKKED 175 Query: 533 KYSTXXXXXXXXXXXXXXXXXXXXKVVEVDGG----VMKEN--SASSGSSDWGNGQARRC 694 + K E++G V KE+ S S Q RRC Sbjct: 176 DSNNSSSNMRGNSETEES------KKEEMEGEKTVVVCKESLGSLEGNSGQQQQQQPRRC 229 Query: 695 THCLAQRTPQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVME 874 THCLAQRTPQWRAGPLGPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVME Sbjct: 230 THCLAQRTPQWRAGPLGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVME 289 Query: 875 MRMSILSSVPTE 910 MRM++LSS+P+E Sbjct: 290 MRMALLSSIPSE 301 >ref|XP_002325826.2| hypothetical protein POPTR_0019s04860g [Populus trichocarpa] gi|550316808|gb|EEF00208.2| hypothetical protein POPTR_0019s04860g [Populus trichocarpa] Length = 294 Score = 250 bits (638), Expect = 8e-64 Identities = 142/289 (49%), Positives = 170/289 (58%), Gaps = 13/289 (4%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLST---------SLDDLFPSENPESDVNLEWLSI 211 MDFC+NV + + + P ST LDDLF ++N E D ++EWLS+ Sbjct: 1 MDFCQNVTVSGEYHHQQEHVLASPPPCSTLTAAAASNSPLDDLFSAQNTEVDFSMEWLSV 60 Query: 212 FVEDCLSSSGNCIPPPPNDLHLQHNSSPVPSKPCLNKTH-QNPPPKLQDFAVPAKPRTKR 388 FVEDCLSS+GNC+P P +D + N+ P KP K Q P L+ A+P K R+KR Sbjct: 61 FVEDCLSSTGNCLPAPTSDAQ-KTNTEENPPKPLQQKPQDQENPSSLKKLAIPGKARSKR 119 Query: 389 KRXXXXXXXXXXXXXXXXXQA--QITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXX 562 +R QA SDPPLL Q +WLADSELI P K+ + Sbjct: 120 
RRTTGDRSRNPLTSWCYTNQAFNLACSDPPLLQQTHWLADSELITPIKDGSDNRGTDGEV 179 Query: 563 XXXXXXXXXXXXXX-KVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGP 739 KV+EV+ K+++ S SD G Q RRCTHCLAQRTPQWRAGP Sbjct: 180 QEKSGAEGDVEEELGKVLEVESSSSKDSTGSL-ESDNGQQQPRRCTHCLAQRTPQWRAGP 238 Query: 740 LGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMS 886 GPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM+ Sbjct: 239 SGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMA 287 >ref|XP_002319169.2| hypothetical protein POPTR_0013s05610g [Populus trichocarpa] gi|550325038|gb|EEE95092.2| hypothetical protein POPTR_0013s05610g [Populus trichocarpa] Length = 295 Score = 248 bits (633), Expect = 3e-63 Identities = 141/295 (47%), Positives = 166/295 (56%), Gaps = 19/295 (6%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEP---------DKPGGLSTSL-------DDLFPSENPESDV 190 MDFCRNV +EY Q+ K G + +L DD F ++N E D Sbjct: 1 MDFCRNV----TVSTEYHQQEKVLASPPPCSKLGAAAAALTATTSPLDDPFSAQNTEVDF 56 Query: 191 NLEWLSIFVEDCLSSSGNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPPKLQDFAVPA 370 + EWLS+FVEDCLSS+GNC+P P + + P Q P L+ +P Sbjct: 57 SSEWLSVFVEDCLSSTGNCLPAPTVEAQKPNTEENPPKNWQRKPQDQEDPSSLKKLVIPG 116 Query: 371 KPRTKRKRXXXXXXXXXXXXXXXXXQA---QITSDPPLLHQAYWLADSELIVPKKENKYS 541 K R+KR+R QA +SDPPLL Q YWLADSELI+P KE+ + Sbjct: 117 KSRSKRRRLTGDKTRNPLTSWCYTNQAFNNLTSSDPPLLQQTYWLADSELIMPIKEDSNN 176 Query: 542 TXXXXXXXXXXXXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTP 721 T KVV V G ++S S+ G Q RRCTHCLAQRTP Sbjct: 177 TDMDNEVQEESGVGVHDEDIGKVVAVVGSNGSKDSLGVLESNNGQQQPRRCTHCLAQRTP 236 Query: 722 QWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMS 886 QWRAGPLGPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM+ Sbjct: 237 QWRAGPLGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMA 291 >ref|XP_003548758.1| PREDICTED: GATA transcription factor 9-like [Glycine max] Length = 281 Score = 246 bits (629), Expect = 9e-63 Identities = 140/283 (49%), Positives = 160/283 (56%), Gaps = 4/283 (1%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD C+NV + E 
QQ S+SLDDLF ++N E DV LEWLS FVEDC SS Sbjct: 3 MDMCQNV----SVSGECQQVQVFAPSCSSSLDDLFSAQNTEVDVELEWLSEFVEDCFSSP 58 Query: 239 GNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXXXXXX 418 +C+ P S+ P L + Q P LQ+FAVP K R+KRKR Sbjct: 59 PSCVLVPVGVKTTSTKSTSTSINPSLKRPQQQNEPPLQNFAVPGKARSKRKRLSAPRTNK 118 Query: 419 XXXXXXXXX----QAQITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXX 586 + SDPPLL QAYWLADSELI+PK ++K Sbjct: 119 DPLSIWSHHLNPQNEALCSDPPLLKQAYWLADSELIMPKPKDKEEQQEEVVIMAKEDEEK 178 Query: 587 XXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGPLGPKTLCNA 766 K + + E GS+ RRCTHCLAQRTPQWRAGPLGPKTLCNA Sbjct: 179 VIINVSKEISFGDSELDE-----GSNGQQQPMPRRCTHCLAQRTPQWRAGPLGPKTLCNA 233 Query: 767 CGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILS 895 CGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRMS+ S Sbjct: 234 CGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMSVYS 276 >ref|XP_007161927.1| hypothetical protein PHAVU_001G109500g [Phaseolus vulgaris] gi|561035391|gb|ESW33921.1| hypothetical protein PHAVU_001G109500g [Phaseolus vulgaris] Length = 271 Score = 244 bits (624), Expect = 4e-62 Identities = 146/294 (49%), Positives = 173/294 (58%), Gaps = 10/294 (3%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 M+ C+NV + SE QE S+SLDDLF ++N E DV LEWLS FVEDC SS Sbjct: 3 MNMCQNV----SVSSECHQEQVFAPSCSSSLDDLFSAQNTEVDVELEWLSEFVEDCFSS- 57 Query: 239 GNCIPPPPNDLHLQHNSSPV----PSKPCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXX 406 PP L + + PS L + Q+ P LQ+FAVP K R+KRKR Sbjct: 58 -----PPSFVLGASGVKTTITSTDPSSGTLKRAQQHESP-LQNFAVPGKARSKRKRLSAP 111 Query: 407 XXXXXXXXXXXXXQAQ---ITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXX 577 Q ++SDPPLL QAYWLADSELI+PK + + Sbjct: 112 RTKDPLSIWSHHLNPQNEALSSDPPLLKQAYWLADSELIMPKPKEEQEVSKVDG------ 165 Query: 578 XXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQA---RRCTHCLAQRTPQWRAGPLGP 748 +VVE ++++ S + NGQ RRCTHCLAQRTPQWRAGPLGP Sbjct: 166 ---------EVVEKGVVIIRKESFGDSELEGSNGQQPMPRRCTHCLAQRTPQWRAGPLGP 216 Query: 749 KTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE 910 KTLCNACGVR+KSGRLLPEYRPAKSPTFVSY 
HSNSHKKVMEMRMS+ SS+ +E Sbjct: 217 KTLCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMSVYSSISSE 270 >ref|XP_003554005.1| PREDICTED: GATA transcription factor 5-like [Glycine max] Length = 274 Score = 241 bits (614), Expect = 5e-61 Identities = 143/292 (48%), Positives = 171/292 (58%), Gaps = 8/292 (2%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD C+NV + E QQ S+SLDDLF ++N E DV LEWLS FVEDC SS Sbjct: 3 MDMCQNV----SVSGECQQVQVFAPSCSSSLDDLFSAQNTEVDVELEWLSEFVEDCFSSP 58 Query: 239 GNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXXXXXX 418 +C+ P + S+ + S QN P LQ+FAVP K R+KRKR Sbjct: 59 PSCVLVPIG-VKTTSTSTNLSSGTLKRPQQQNESP-LQNFAVPGKARSKRKRLSAPRTNK 116 Query: 419 XXXXXXXXX----QAQITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXX 586 + SDPPLL QAYWLADSELI+PK +++ Sbjct: 117 DPLNIWSHHLNPQNESLCSDPPLLKQAYWLADSELIMPKPKDEEQEEVVTKEDE------ 170 Query: 587 XXXXXXKVVEVDGGVMKENSASSGSSDWGNGQ----ARRCTHCLAQRTPQWRAGPLGPKT 754 KV+ V + KE+ S + NGQ RRC+HCLAQRTPQWRAGPLGPKT Sbjct: 171 ------KVINV---MSKESFGDSELEEGSNGQQPMPTRRCSHCLAQRTPQWRAGPLGPKT 221 Query: 755 LCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE 910 LCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM++ S++ +E Sbjct: 222 LCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMAVFSTISSE 273 >gb|ACU17869.1| unknown [Glycine max] Length = 274 Score = 238 bits (606), Expect = 4e-60 Identities = 142/292 (48%), Positives = 170/292 (58%), Gaps = 8/292 (2%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD C+NV + E QQ S+SLDDLF ++N E DV LEWLS FVEDC SS Sbjct: 3 MDMCQNV----SVSGECQQVQVFAPSCSSSLDDLFSAQNTEVDVELEWLSEFVEDCFSSP 58 Query: 239 GNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXXXXXX 418 +C+ P + S+ + S QN P LQ+FAVP K R+KRKR Sbjct: 59 PSCVLVPIG-VKTTSTSTNLSSGTLKRPQQQNESP-LQNFAVPGKARSKRKRLSAPRTNK 116 Query: 419 XXXXXXXXX----QAQITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXX 586 + SDPPLL QAYWLADSELI+PK +++ Sbjct: 117 DPLNIWSHHLNPQNESLCSDPPLLKQAYWLADSELIMPKPKDEEQEEVVTKEDE------ 170 
Query: 587 XXXXXXKVVEVDGGVMKENSASSGSSDWGNGQ----ARRCTHCLAQRTPQWRAGPLGPKT 754 KV+ V + KE+ S + NGQ RRC+HCLAQR PQWRAGPLGPKT Sbjct: 171 ------KVINV---MSKESFGDSELEEGSNGQQPMPTRRCSHCLAQRAPQWRAGPLGPKT 221 Query: 755 LCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE 910 LCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM++ S++ +E Sbjct: 222 LCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMAVFSTIFSE 273 >ref|XP_003624611.1| GATA transcription factor [Medicago truncatula] gi|124365580|gb|ABN09814.1| Zinc finger, GATA-type [Medicago truncatula] gi|355499626|gb|AES80829.1| GATA transcription factor [Medicago truncatula] Length = 264 Score = 237 bits (605), Expect = 6e-60 Identities = 146/293 (49%), Positives = 172/293 (58%), Gaps = 12/293 (4%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD C+NV + E P LS+SLDDLF ++N E DV +EWLS+FVEDC SS Sbjct: 1 MDVCQNVQVSSECKQEKVLNPSC-SNLSSSLDDLFSAQNMEVDVGMEWLSVFVEDCFSSP 59 Query: 239 GNCIPPPPNDLHLQHNSSPVPSKPCLN----KTHQNPPPKLQDFAVPAKPRTKRKRXXXX 406 +C+ P + +Q+ +S V SKP K QN P FAVP K R+KRKR Sbjct: 60 QSCVLLPSS---VQNTTSTVSSKPSNTVKKPKQEQNESP----FAVPGKARSKRKRLSAP 112 Query: 407 XXXXXXXXXXXXX----QAQITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXX 574 + SDPPLL QAYWLADSEL+VPK E + + Sbjct: 113 RRPKDPLSILSNTLNPQNESLCSDPPLLKQAYWLADSELMVPKGEKEVT----------- 161 Query: 575 XXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQ----ARRCTHCLAQRTPQWRAGPL 742 K EV V KE G + NGQ RRCTHCL+QRTPQWRAGPL Sbjct: 162 ----------KDCEV---VEKERFDFEGFVN--NGQNPIPTRRCTHCLSQRTPQWRAGPL 206 Query: 743 GPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSV 901 GPKTLCNACGVR+KSGRLLPEYRPAKSPTFVS+ HSNSHKKVMEMRM+++SS+ Sbjct: 207 GPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSFLHSNSHKKVMEMRMNVVSSI 259 >ref|XP_003518214.1| PREDICTED: GATA transcription factor 9-like isoform X1 [Glycine max] Length = 280 Score = 237 bits (604), Expect = 7e-60 Identities = 143/295 (48%), Positives = 165/295 (55%), Gaps = 11/295 (3%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD 
CRNV SE QQE +LDDLF +N E D LEWLS+FVEDC SS Sbjct: 1 MDVCRNVS---VSSSECQQE-------LPTLDDLFSHQNTEVDFGLEWLSVFVEDCFSSR 50 Query: 239 GNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPP-----KLQDFAVPAKPRTKRKRXXX 403 +C+ P +Q S+ +KP Q P LQ+FAVP K R+KRKR Sbjct: 51 PSCLLAPGG---VQTTSTSTSTKPSSGTILQRPQQLSHHCPLQNFAVPGKARSKRKRKRL 107 Query: 404 XXXXXXXXXXXXXXQA------QITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXX 565 Q ++SDPPLL QAYWLADSELIVPKK++ Sbjct: 108 SAPRTTKHTLSTWSQHFSTQNDGVSSDPPLLKQAYWLADSELIVPKKKD-----VEQEEE 162 Query: 566 XXXXXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGPLG 745 + D G N+ S+ + + RRCTHCLAQRTPQWRAGPLG Sbjct: 163 EGVVVVVKKEKLGDYCDHDEGDEINNNNSNNDDNVQHPIPRRCTHCLAQRTPQWRAGPLG 222 Query: 746 PKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE 910 PKTLCNACGVRFKSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM ++ T+ Sbjct: 223 PKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMGVVGVFSTD 277 >ref|XP_003548060.1| PREDICTED: GATA transcription factor 9-like [Glycine max] Length = 279 Score = 233 bits (595), Expect = 8e-59 Identities = 140/288 (48%), Positives = 168/288 (58%), Gaps = 11/288 (3%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD CRN+ SE QQE +LDDLF +N E D +EWLS+FVEDC SS Sbjct: 3 MDVCRNIS---VSSSECQQE-------LPTLDDLFCHQNTEVDFGMEWLSVFVEDCFSSR 52 Query: 239 GNCIPPPPNDLHLQHNSSPVPSK----PCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXX 406 +C+ PP ++S PS P ++H P LQ+FAVP K R+KRKR Sbjct: 53 PSCLLPPSGGGVQTTSTSTKPSSGTIMPRPQQSHHCP---LQNFAVPGKARSKRKRLSAP 109 Query: 407 XXXXXXXXXXXXXQAQ----ITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXX 574 + ++SDPPLL QAYWLADSELIVPKK++ Sbjct: 110 RTTKHTLSTWSQHFSSQNDGVSSDPPLLKQAYWLADSELIVPKKKDVEQEEGVVVVVKKE 169 Query: 575 XXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQ---ARRCTHCLAQRTPQWRAGPLG 745 + +G + N+ ++ ++D N Q RRCTHCLAQRTPQWRAGPLG Sbjct: 170 KLGDYYD------DDEGDEVNNNNTNNNNND--NVQHPIPRRCTHCLAQRTPQWRAGPLG 221 Query: 746 PKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSI 889 PKTLCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM + Sbjct: 222 
PKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMGV 269 >ref|XP_004493146.1| PREDICTED: GATA transcription factor 9-like [Cicer arietinum] gi|502184174|ref|XP_004517300.1| PREDICTED: GATA transcription factor 9-like [Cicer arietinum] Length = 274 Score = 233 bits (594), Expect = 1e-58 Identities = 140/290 (48%), Positives = 172/290 (59%), Gaps = 7/290 (2%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD C+NV + + +QE STSLDDLF ++N E DV +EWLS+FVEDC SS Sbjct: 3 MDVCQNV----SVSDDCKQEKVL-STFSTSLDDLFSTQNMEVDVGMEWLSVFVEDCFSSP 57 Query: 239 GNCIPPPPNDLHLQHNSSP-VPSKPC--LNKTHQNPPPKLQDFAVPAKPRTKRKRXXXXX 409 +C+ P N + ++ + +KP + + +QN P LQ+F +P K R+KRKR Sbjct: 58 QSCVLLPSNVQNTTTTTTTTISTKPSNTMKRQNQNESP-LQNFTIPGKARSKRKRLSAPR 116 Query: 410 XXXXXXXXXXXX----QAQITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXX 577 I DPPLL QAYWLADSELIVPK E Sbjct: 117 TNKNPLSIWSNTLNPKNELIGCDPPLLKQAYWLADSELIVPKGEKNV------------V 164 Query: 578 XXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGPLGPKTL 757 + +E D V K + + S+ + RRCTHCL+QRTPQWRAGPLGPKTL Sbjct: 165 EKEIVVKKEEKIE-DDIVFKNENENENESENSSIPTRRCTHCLSQRTPQWRAGPLGPKTL 223 Query: 758 CNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPT 907 CNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMR ++ SS+PT Sbjct: 224 CNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMR-NMSSSIPT 272 >ref|XP_007151835.1| hypothetical protein PHAVU_004G079200g [Phaseolus vulgaris] gi|561025144|gb|ESW23829.1| hypothetical protein PHAVU_004G079200g [Phaseolus vulgaris] Length = 253 Score = 233 bits (593), Expect = 1e-58 Identities = 142/294 (48%), Positives = 167/294 (56%), Gaps = 14/294 (4%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENPESDVNLEWLSIFVEDCLSSS 238 MD CRNV + SE +LDDLF +N E D +EWLS+FVEDC SS Sbjct: 3 MDVCRNV----SVSSELP-----------TLDDLFSHQNMEVDFGMEWLSVFVEDCFSSK 47 Query: 239 GNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPPK---LQDFAVPAKPRTKRKRXXXXX 409 +C+ P +S P + T Q P LQ+F VP K R+KRKR Sbjct: 48 PSCLLAPTGVQTTSTSSKP-------SSTMQRPQQSYCHLQNFVVPGKARSKRKRLSAPR 
100 Query: 410 XXXXXXXXXXXXQAQIT-SDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXX 586 +++ SDPPLL QAYWLADSELIVPKK+++ Sbjct: 101 TKHTLSTWSHNFSSEVLCSDPPLLKQAYWLADSELIVPKKKDEQE--------------- 145 Query: 587 XXXXXXKVVEVDGGVMKENSAS-------SGSSDWGNGQ---ARRCTHCLAQRTPQWRAG 736 V+V+ V KE + SS+ NGQ RRCTHCL+QRTPQWRAG Sbjct: 146 --------VKVEIVVRKEKLGECCDEVEVNNSSNNNNGQHPIPRRCTHCLSQRTPQWRAG 197 Query: 737 PLGPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSS 898 PLGPKTLCNACGVR+KSGRLLPEYRPAKSPTFVSY HSNSHK+VMEMRMS+LSS Sbjct: 198 PLGPKTLCNACGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKRVMEMRMSVLSS 251 >ref|XP_006574751.1| PREDICTED: GATA transcription factor 9-like isoform X2 [Glycine max] Length = 281 Score = 232 bits (592), Expect = 2e-58 Identities = 143/296 (48%), Positives = 165/296 (55%), Gaps = 12/296 (4%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQEPDKPGGLSTSLDDLFPSENP-ESDVNLEWLSIFVEDCLSS 235 MD CRNV SE QQE +LDDLF +N E D LEWLS+FVEDC SS Sbjct: 1 MDVCRNVS---VSSSECQQE-------LPTLDDLFSHQNTQEVDFGLEWLSVFVEDCFSS 50 Query: 236 SGNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPP-----KLQDFAVPAKPRTKRKRXX 400 +C+ P +Q S+ +KP Q P LQ+FAVP K R+KRKR Sbjct: 51 RPSCLLAPGG---VQTTSTSTSTKPSSGTILQRPQQLSHHCPLQNFAVPGKARSKRKRKR 107 Query: 401 XXXXXXXXXXXXXXXQA------QITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXX 562 Q ++SDPPLL QAYWLADSELIVPKK++ Sbjct: 108 LSAPRTTKHTLSTWSQHFSTQNDGVSSDPPLLKQAYWLADSELIVPKKKD-----VEQEE 162 Query: 563 XXXXXXXXXXXXXXKVVEVDGGVMKENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGPL 742 + D G N+ S+ + + RRCTHCLAQRTPQWRAGPL Sbjct: 163 EEGVVVVVKKEKLGDYCDHDEGDEINNNNSNNDDNVQHPIPRRCTHCLAQRTPQWRAGPL 222 Query: 743 GPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE 910 GPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSY HSNSHKKVMEMRM ++ T+ Sbjct: 223 GPKTLCNACGVRFKSGRLLPEYRPAKSPTFVSYLHSNSHKKVMEMRMGVVGVFSTD 278 >ref|XP_006443395.1| hypothetical protein CICLE_v10021190mg [Citrus clementina] gi|568850806|ref|XP_006479088.1| PREDICTED: GATA transcription factor 5-like [Citrus sinensis] gi|557545657|gb|ESR56635.1| hypothetical protein CICLE_v10021190mg [Citrus 
clementina] Length = 321 Score = 231 bits (590), Expect = 3e-58 Identities = 141/321 (43%), Positives = 172/321 (53%), Gaps = 41/321 (12%) Frame = +2 Query: 59 MDFCRNVPEPLAFQSEYQQE---------PDKPGGLSTS--LDDLFPSENPESDVNLEWL 205 MDFCRNV +YQQ+ P L+ + LDDLFP+ E DV+LEWL Sbjct: 1 MDFCRNVA---VSGDQYQQDQVLSPSSIPPSSSSNLALADPLDDLFPAHTTEVDVSLEWL 57 Query: 206 SIFVEDCLSSSGNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQ------------NPPPKL 349 SIFVEDCLSSSG C+P ++L ++N++ + P Q +PPP L Sbjct: 58 SIFVEDCLSSSGICLPA--SELPTKNNAAATTAAPSPKPLQQQQQQKESTTTTPSPPPSL 115 Query: 350 QDFAVPAKPRTKRKRXXXXXXXXXXXXXXXXX----------------QAQITSDPPLLH 481 + F VP K R+KRKR Q + DPPLL Sbjct: 116 EKFVVPGKARSKRKRASSTAKLTQTSTSLSSLTTGCWTTTHNNHPADTQLFHSDDPPLLQ 175 Query: 482 --QAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXXXXXXXXKVVEVDGGVMKENSASS 655 QA+WLADS+LI PKKE + + E + + KE Sbjct: 176 VQQAFWLADSQLIFPKKETTNTNTINTNSNKKAKANDGDEEEEEAKEEETEIGKEVEVVQ 235 Query: 656 GSSDWGNGQARRCTHCLAQRTPQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPTFV 835 Q RRC+HCL+QRTPQWRAGPLGPKTLCNACGVR+KSGRLLPEYRPAKSPTFV Sbjct: 236 QQQQQQQ-QGRRCSHCLSQRTPQWRAGPLGPKTLCNACGVRYKSGRLLPEYRPAKSPTFV 294 Query: 836 SYKHSNSHKKVMEMRMSILSS 898 SY HSNSHKKV+EMRM+++ S Sbjct: 295 SYLHSNSHKKVLEMRMALMPS 315 >ref|XP_006826302.1| hypothetical protein AMTR_s00004p00074220 [Amborella trichopoda] gi|548830616|gb|ERM93539.1| hypothetical protein AMTR_s00004p00074220 [Amborella trichopoda] Length = 278 Score = 205 bits (521), Expect = 3e-50 Identities = 125/260 (48%), Positives = 141/260 (54%), Gaps = 8/260 (3%) Frame = +2 Query: 134 GLSTSLDDLFPSENP-----ESDVNLEWLSIFVEDCLSSSGNCIPPPPNDLHLQHNSSPV 298 G TSL++ P E E ++ LEWLS FVEDC S PP N P Sbjct: 44 GSFTSLEEFLPDEKSFSSAEEGNIGLEWLSEFVEDCFSGEIPATHPPVNHF-------PG 96 Query: 299 PSKPCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXXXXXXXXXXXXXXXQAQITSDPPLL 478 PS H P L +P K RTKR+R DPPLL Sbjct: 97 PS-------HMKP---LHPVQIPCKARTKRRRRTCHLPVPTPST-----DESAAYDPPLL 141 Query: 479 HQAYWLADSELIVPKK-ENKYSTXXXXXXXXXXXXXXXXXXXXKVVEVDGGV--MKENSA 649 HQAYWLA+SELI+ K EN+ S V E D GV + S Sbjct: 142 
HQAYWLAESELIIQTKPENEPSNGSTHILNESLDTNDSNQL---VTESDEGVTDLGSESL 198 Query: 650 SSGSSDWGNGQARRCTHCLAQRTPQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPT 829 + + Q RRCTHCLAQRTPQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSP+ Sbjct: 199 DRTNESCSSSQPRRCTHCLAQRTPQWRAGPLGPKTLCNACGVRFKSGRLLPEYRPAKSPS 258 Query: 830 FVSYKHSNSHKKVMEMRMSI 889 F+SYKHSNSHKKVMEMRM++ Sbjct: 259 FLSYKHSNSHKKVMEMRMAL 278 >ref|XP_003619010.1| GATA transcription factor [Medicago truncatula] gi|355494025|gb|AES75228.1| GATA transcription factor [Medicago truncatula] Length = 217 Score = 198 bits (504), Expect = 3e-48 Identities = 109/219 (49%), Positives = 131/219 (59%), Gaps = 1/219 (0%) Frame = +2 Query: 230 SSSGNCIPPPPNDLHLQHNSSPVPSKPCLNKTHQNPPPKLQDFAVPAKPRTKRKRXXXXX 409 SS +C+ P +++ +Q +++ PS + K Q LQ+F VP K R+KRKR Sbjct: 3 SSKPSCVIAPSSNVQIQESTNTKPSNT-MQKPQQQNQSYLQNFVVPGKARSKRKRLSAPS 61 Query: 410 XXXXXXXXXXXXQAQITSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXXX 589 + SDPPLL QAYWLADSELI PK E K S Sbjct: 62 TNIWSHSHLIS-DGNLISDPPLLKQAYWLADSELIAPKNEQKVSAVAYGDQKEAKRRVKK 120 Query: 590 XXXXXKVVEVDGGVMKENSASSGSSDWGN-GQARRCTHCLAQRTPQWRAGPLGPKTLCNA 766 +++V +NS + D + ARRCTHCL+QRTPQWRAGPLGPKTLCNA Sbjct: 121 ESYEVGIIQV------KNSENVNDDDEEHIPNARRCTHCLSQRTPQWRAGPLGPKTLCNA 174 Query: 767 CGVRFKSGRLLPEYRPAKSPTFVSYKHSNSHKKVMEMRM 883 CGVR+KSGRLLPEYRPAKSPTFVSY HSNSHKKV+EMRM Sbjct: 175 CGVRYKSGRLLPEYRPAKSPTFVSYLHSNSHKKVLEMRM 213 >ref|XP_004141141.1| PREDICTED: GATA transcription factor 9-like [Cucumis sativus] gi|449529527|ref|XP_004171751.1| PREDICTED: GATA transcription factor 9-like [Cucumis sativus] Length = 290 Score = 192 bits (489), Expect = 2e-46 Identities = 121/283 (42%), Positives = 150/283 (53%), Gaps = 20/283 (7%) Frame = +2 Query: 140 STSLDDLFPSENP---ESDVNLEWLSIFVEDCLSSSGNCIP-PPPNDLHLQHNSSPVPSK 307 S+++DD+ S + DV+LEWLS FVE+CLS+ G+ +P PPP+ L Q N+ P Sbjct: 27 SSTIDDILYSSQAMTMDVDVSLEWLSAFVEECLSTKGSTLPLPPPSQLSTQLNNPPTKPS 86 Query: 308 PCLNKTHQNPPPKLQDF-AVPAKPRTKRKRXXXXXXXXXXXXXXXXXQAQI--------- 457 + + F AVP K R+KR+R Q + Sbjct: 87 
SLSQLVPTSSNSQFAHFPAVPGKARSKRRRRTPSKMSVLPLISRRLRQLNLLQNKHSLQL 146 Query: 458 --TSDPPLLHQAYWLADSELIVPKKENKYSTXXXXXXXXXXXXXXXXXXXXKVVEVDGGV 631 ++DP LL Q YWLADSEL++P K + VD G Sbjct: 147 TTSTDPLLLQQTYWLADSELLLPPKAR---------------------GGEREKTVDMGQ 185 Query: 632 MK---ENSASSGSSDWGNGQARRCTHCLAQRTPQWRAGPLGPKTLCNACGVRF-KSGRLL 799 ++ ENS G G RRC+HC AQRTPQWR+GPLGPKTLCNACGVR+ KSGRLL Sbjct: 186 IETTVENSMKKQQQQ-GAGSGRRCSHCQAQRTPQWRSGPLGPKTLCNACGVRYKKSGRLL 244 Query: 800 PEYRPAKSPTFVSYKHSNSHKKVMEMRMSILSSVPTE*VGVFP 928 PEYRPA SPTFVS HSNSHK+VMEMRM SS + FP Sbjct: 245 PEYRPANSPTFVSLLHSNSHKRVMEMRMMNASSSTSTSTTTFP 287