BLASTX nr result
ID: Cnidium21_contig00007509
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00007509
         (1927 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CAC28528.1| GATA-1 zinc finger protein [Nicotiana tabacum]        157   1e-35
ref|XP_002328301.1| GATA zinc finger protein regulating nitrogen...   147   1e-32
ref|XP_002529940.1| GATA transcription factor, putative [Ricinus...   145   5e-32
ref|XP_003570342.1| PREDICTED: uncharacterized protein LOC100841...   140   1e-30
ref|XP_003543479.1| PREDICTED: GATA transcription factor 11-like...   140   2e-30

>emb|CAC28528.1| GATA-1 zinc finger protein [Nicotiana tabacum]
          Length = 305

 Score = 157 bits (397), Expect = 1e-35
 Identities = 101/259 (38%), Positives = 134/259 (51%), Gaps = 13/259 (5%)
 Frame = -2

Query: 1686 GYWDGFVNG---DDSFHNVINMLDFPLESVEGDKCVAEENWEA-QFPSLGPFSSEIVQGF 1519
            GY DG G D+ F +++N LDFPLES+E D E W+A + LGP + + F
Sbjct: 9    GYLDGIPTGPVVDEDFDDILNFLDFPLESLEEDGQGVE--WDASESKFLGPIPMDALMAF 66

Query: 1518 TPVFRSDFAEDVPFNFVQNNAGYAASDRKLLPVTEFSSTNFRLDSSHSRNPSLLQTPSQN 1339
            PV Q N G V ++N + + + + QT S
Sbjct: 67   PPV-------------PQGNIGNGR-------VKAEPNSNHPIKVTEGQGSGIFQTQSPV 106

Query: 1338 SVLESSSSCSAGKNLSTSSEFLVPVRARSKIPRSLTFSRWHLLSPLTTPK---KILCP-- 1174
            SVLESS+SCS GK++S + +PVR RSK PRS + W L+ P+++ + K C
Sbjct: 107  SVLESSNSCSGGKSISIKHDIAIPVRPRSKRPRSSALNPWILMPPISSTRFASKKTCDAR 166

Query: 1173 KGKEKKN----MQLSQHSNEFDMKEDSYQHMPNKRCMHCQAEQTPQWRAGPMGPHTLCNA 1006
            KGKEKK + + Q ++ K S Q K+C HCQ +TPQWR GP+GP TLCNA
Sbjct: 167  KGKEKKRKMSLLSVPQIADVTKKKTTSGQQFSFKKCTHCQVTKTPQWREGPLGPKTLCNA 226

Query: 1005 CGVCYRKRGLLPEYRLTAS 949
            CGV YR L PEYR AS
Sbjct: 227  CGVRYRSGRLFPEYRPAAS 245

 Score = 145 bits (365), Expect(2) = 1e-34
 Identities = 71/128 (55%), Positives = 87/128 (67%), Gaps = 14/128 (10%)
 Frame = -2

Query: 645 NTRNGEKKKELPDLSNDSEIMGFA-----THQPMQVKRCTHCQVTKTPQWREGPMGPRTL 481
           + R G++KK L + +I + Q K+CTHCQVTKTPQWREGP+GP+TL
Sbjct: 164 DARKGKEKKRKMSLLSVPQIADVTKKKTTSGQQFSFKKCTHCQVTKTPQWREGPLGPKTL 223

Query: 480 CNACGVRYRSGRLVPEYRPAASPTFVPSLHSNKHRKVIEMREKVV---------PLNIVE 328
           CNACGVRYRSGRL PEYRPAASPTFVP+LHSN HRKV+EMR+K + P N++
Sbjct: 224 CNACGVRYRSGRLFPEYRPAASPTFVPTLHSNSHRKVVEMRKKAIYGETSALEEPHNVIV 283

Query: 327 DGSRNVPA 304
           +G PA
Sbjct: 284 EGPPMSPA 291

 Score = 29.6 bits (65), Expect(2) = 1e-34
 Identities = 12/18 (66%), Positives = 15/18 (83%)
 Frame = -3

Query: 320 PAMSPPPEFVLLSSYVFN 267
           P MSP PEFV +SSY+F+
Sbjct: 286 PPMSPAPEFVPMSSYLFD 303

>ref|XP_002328301.1| GATA zinc finger protein regulating nitrogen assimilation [Populus trichocarpa]
 gi|222837816|gb|EEE76181.1| GATA zinc finger protein regulating nitrogen assimilation [Populus trichocarpa]
          Length = 301

 Score = 147 bits (370), Expect = 1e-32
 Identities = 68/98 (69%), Positives = 82/98 (83%)
 Frame = -2

Query: 645 NTRNGEKKKELPDLSNDSEIMGFATHQPMQVKRCTHCQVTKTPQWREGPMGPRTLCNACG 466
           ++R +KK+ L LS+ E M QP++ +RCTHCQVTKTPQWREGP+GP+TLCNACG
Sbjct: 204 SSRKQQKKRNLMLLSSAVE-MAPKMKQPVETRRCTHCQVTKTPQWREGPLGPKTLCNACG 262

Query: 465 VRYRSGRLVPEYRPAASPTFVPSLHSNKHRKVIEMREK 352
           VRYRSGRL+PEYRPAASPTFVP LHSN HRKV+EMR++
Sbjct: 263 VRYRSGRLLPEYRPAASPTFVPFLHSNSHRKVLEMRKQ 300

 Score = 119 bits (299), Expect = 2e-24
 Identities = 94/265 (35%), Positives = 129/265 (48%), Gaps = 28/265 (10%)
 Frame = -2

Query: 1659 DDSFHNVINMLDFPLESVE--GDKCVAEENWEAQFPSLGPFSSEIVQGFTPVFRSDFAED 1486
            D+ F + + DFPLE VE GD E+WE++F L P SS ++ F+ + A
Sbjct: 20   DNFFEDTLGCFDFPLEDVEPNGDD---GEDWESKFRHLEPPSSNLLTTFSTALCGEDASS 76

Query: 1485 VPFNFVQNNAGYAASDRKLLP----VTEFSSTNFR---LDSSHSRNPSLLQTPSQNSVLE 1327
            + N+ N+ + L E SS+ + SS S+ L Q S SVLE
Sbjct: 77   LEPNY--NSCSVLLNGSLQLKHWASSAEASSSRSKPILCRSSDSKYSHLFQATSPVSVLE 134

Query: 1326 SSSSCSAGKNLSTS-SEFLVPV-RARSKIPR----------SLTFSRWHLLSPLTTPK-- 1189
            SS S +N +T +F+ PV R RSK+PR + S+ S + P+
Sbjct: 135  SSGSSCPTENATTYYPKFVTPVKRPRSKLPRLRRHTFPFIPTACASKKFYCSASSDPELE 194

Query: 1188 -----KILCPKGKEKKNMQLSQHSNEFDMKEDSYQHMPNKRCMHCQAEQTPQWRAGPMGP 1024
            +IL K++K L S+ +M Q + +RC HCQ +TPQWR GP+GP
Sbjct: 195  YYNDEEILDSSRKQQKKRNLMLLSSAVEMAPKMKQPVETRRCTHCQVTKTPQWREGPLGP 254

Query: 1023 HTLCNACGVCYRKRGLLPEYRLTAS 949
            TLCNACGV YR LLPEYR AS
Sbjct: 255  KTLCNACGVRYRSGRLLPEYRPAAS 279

>ref|XP_002529940.1| GATA transcription factor, putative [Ricinus communis]
 gi|223530570|gb|EEF32448.1| GATA transcription factor, putative [Ricinus communis]
          Length = 323

 Score = 145 bits (365), Expect = 5e-32
 Identities = 66/92 (71%), Positives = 78/92 (84%)
 Frame = -2

Query: 630 EKKKELPDLSNDSEIMGFATHQPMQVKRCTHCQVTKTPQWREGPMGPRTLCNACGVRYRS 451
           +KKK+L LS E ++ P ++++CTHC+VTKTPQWREGPMGP+TLCNACGVRYRS
Sbjct: 209 QKKKDLMMLSCTVEKKKPSSEVPGEIRKCTHCEVTKTPQWREGPMGPKTLCNACGVRYRS 268

Query: 450 GRLVPEYRPAASPTFVPSLHSNKHRKVIEMRE 355
           GRL PEYRPAASPTFVP+LHSN HRKVIEMR+
Sbjct: 269 GRLFPEYRPAASPTFVPALHSNSHRKVIEMRK 300

 Score = 108 bits (269), Expect = 7e-21
 Identities = 94/270 (34%), Positives = 125/270 (46%), Gaps = 33/270 (12%)
 Frame = -2

Query: 1659 DDSFHNVINMLDFPLESVEGDKCVAEENWEAQFPSLGPFSSEIVQGFTPVFRSDFAED-V 1483
            DD F + + LD P E VE + + E+WE+QF L P S I+ FT ++D +
Sbjct: 16   DDFFDDALKYLDLPPEDVESNDAI--EDWESQFQQL-PTPSNILADFTSGICDQISKDSL 72

Query: 1482 PFNFVQNNAGYAASDRKLLPVTEF-SSTNFRL--DSSHSRNPSLLQTPSQNSVLESSSSC 1312
            + ++ + L E SS N L D S + L T S SVLESSSS
Sbjct: 73   KLEKSSVSCDESSQPKPWLRAAEAPSSRNIPLNYDPSEGKYSHLFWTSSPVSVLESSSSS 132

Query: 1311 SAGKNLST-SSEFLVPV-RARSKIPRSLTFSRWHLLSPLTTPK----------------- 1189
            S+ +N + +F PV R RSK PR + + LS PK
Sbjct: 133  SSAENSTVYHPKFAKPVKRPRSKCPRRRRCT-FPFLSTSYAPKNNPLGGSESESESESES 191

Query: 1188 --------KILCPKGKEKKNMQLSQHSNEFDMKEDSYQHMPN--KRCMHCQAEQTPQWRA 1039
            K+L K +K L S + K+ S + +P ++C HC+ +TPQWR
Sbjct: 192  ESESNPDEKMLNLAKKIQKKKDLMMLSCTVEKKKPSSE-VPGEIRKCTHCEVTKTPQWRE 250

Query: 1038 GPMGPHTLCNACGVCYRKRGLLPEYRLTAS 949
            GPMGP TLCNACGV YR L PEYR AS
Sbjct: 251  GPMGPKTLCNACGVRYRSGRLFPEYRPAAS 280

>ref|XP_003570342.1| PREDICTED: uncharacterized protein LOC100841640 [Brachypodium distachyon]
          Length = 416

 Score = 140 bits (353), Expect = 1e-30
 Identities = 63/102 (61%), Positives = 79/102 (77%), Gaps = 8/102 (7%)
 Frame = -2

Query: 624 KKELPDLSNDSEIMGFATHQ--------PMQVKRCTHCQVTKTPQWREGPMGPRTLCNAC 469
           KK P +++D+E A ++ P V+RCTHCQ+ KTPQWR GP+GP+TLCNAC
Sbjct: 302 KKPAPPVTSDAEGDADADYEEGGGSALPPGAVRRCTHCQIEKTPQWRAGPLGPKTLCNAC 361

Query: 468 GVRYRSGRLVPEYRPAASPTFVPSLHSNKHRKVIEMREKVVP 343
           GVRY+SGRL PEYRPAASPTFVP++HSN H+KV+EMR+KV P
Sbjct: 362 GVRYKSGRLFPEYRPAASPTFVPAIHSNSHKKVVEMRQKVAP 403

 Score = 77.0 bits (188), Expect = 2e-11
 Identities = 32/46 (69%), Positives = 36/46 (78%)
 Frame = -2

Query: 1086 KRCMHCQAEQTPQWRAGPMGPHTLCNACGVCYRKRGLLPEYRLTAS 949
            +RC HCQ E+TPQWRAGP+GP TLCNACGV Y+ L PEYR AS
Sbjct: 334  RRCTHCQIEKTPQWRAGPLGPKTLCNACGVRYKSGRLFPEYRPAAS 379

>ref|XP_003543479.1| PREDICTED: GATA transcription factor 11-like [Glycine max]
          Length = 327

 Score = 140 bits (352), Expect = 2e-30
 Identities = 63/100 (63%), Positives = 79/100 (79%)
 Frame = -2

Query: 645 NTRNGEKKKELPDLSNDSEIMGFATHQPMQVKRCTHCQVTKTPQWREGPMGPRTLCNACG 466
           N +KKK+ LS+D E+M ++ + ++C HC+VTKTPQWREGP+GP+TLCNACG
Sbjct: 208 NKLKKQKKKDSSLLSDDVEMMRSSSPESGSPRKCMHCEVTKTPQWREGPVGPKTLCNACG 267

Query: 465 VRYRSGRLVPEYRPAASPTFVPSLHSNKHRKVIEMREKVV 346
           VRYRSGRL PEYRPAASPTFV SLHSN H+KV+EMR + +
Sbjct: 268 VRYRSGRLFPEYRPAASPTFVASLHSNCHKKVVEMRSRAI 307

 Score = 102 bits (255), Expect = 3e-19
 Identities = 87/278 (31%), Positives = 125/278 (44%), Gaps = 33/278 (11%)
 Frame = -2

Query: 1683 YWDGFVNG--DDSFHNVINMLDFPLESVEGDKCVAEENWEAQFPSLGPFSSEIVQGFTPV 1510
            ++D NG D+ F +VIN DFPLE VE + EE+W+AQ L ++ +
Sbjct: 13   FFDNNFNGLSDEIFDDVINFFDFPLEDVEANG--VEEDWDAQLKCLEDPRVDVYTASSAG 70

Query: 1509 FRSDFAEDVPFNFVQNNAGYAASDRKLLPVTEFSSTN---------FRLDSSHSRNPSLL 1357
            + + P Q ++AS + P+ + + +S+ ++
Sbjct: 71   LCAKTQNEKP----QLGMKFSASGNGISPIKQLGKATGPVYGKTITHQNVTSNGKDLHQF 126

Query: 1356 QTPSQNSVLESSSSCSAGKNLSTSSEFLVPV-RARSKIPRSLTFSR-----WHLLSPL-- 1201
            QT + + V SS S+ S ++PV RARSK R +FS + L SP
Sbjct: 127  QTYTYSPVSVFESSSSSSVENSNFDRPVIPVKRARSKRQRPSSFSPLFSIPFILNSPAMQ 186

Query: 1200 -------------TTPKKILCPKGKEKKNMQLSQHSNEFDMKEDSYQHMPNKR-CMHCQA 1063
            T L K K++K S S++ +M S + R CMHC+
Sbjct: 187  NHQRIAAADSDFGTNVAGNLSNKLKKQKKKDSSLLSDDVEMMRSSSPESGSPRKCMHCEV 246

Query: 1062 EQTPQWRAGPMGPHTLCNACGVCYRKRGLLPEYRLTAS 949
            +TPQWR GP+GP TLCNACGV YR L PEYR AS
Sbjct: 247  TKTPQWREGPVGPKTLCNACGVRYRSGRLFPEYRPAAS 284