BLASTX nr result
ID: Catharanthus22_contig00040379
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00040379
         (355 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006347916.1| PREDICTED: GATA transcription factor 2-like ...   107   2e-21
ref|XP_004229778.1| PREDICTED: GATA transcription factor 2-like ...   105   8e-21
dbj|BAC98493.1| AG-motif binding protein-3 [Nicotiana tabacum]        100   2e-19
emb|CBI16598.3| unnamed protein product [Vitis vinifera]               84   2e-14
ref|XP_006375265.1| hypothetical protein POPTR_0014s05760g [Popu...    83   3e-14
gb|EOX96349.1| GATA transcription factor 2, putative [Theobroma ...    82   7e-14
ref|XP_002277959.1| PREDICTED: GATA transcription factor 2 [Viti...    82   1e-13
ref|XP_002511642.1| GATA transcription factor, putative [Ricinus...    80   4e-13
ref|XP_002301258.2| hypothetical protein POPTR_0002s14380g [Popu...    76   4e-12
ref|XP_006445338.1| hypothetical protein CICLE_v10021733mg [Citr...    75   1e-11
ref|XP_004248553.1| PREDICTED: GATA transcription factor 2-like ...    74   2e-11
ref|XP_006359592.1| PREDICTED: GATA transcription factor 2-like ...    73   3e-11
ref|XP_006402573.1| hypothetical protein EUTSA_v10006202mg [Eutr...    68   1e-09
ref|XP_006294606.1| hypothetical protein CARUB_v10023643mg [Caps...    67   2e-09
gb|ADL36699.1| GATA domain class transcription factor [Malus dom...    67   2e-09
ref|NP_182031.1| GATA transcription factor 2 [Arabidopsis thalia...    67   3e-09
gb|EMJ22222.1| hypothetical protein PRUPE_ppa014583m1g, partial ...    67   3e-09
gb|ADK63416.1| GATA type zinc finger protein [Brassica rapa]           65   1e-08
ref|XP_002880154.1| hypothetical protein ARALYDRAFT_903940 [Arab...    65   1e-08
ref|XP_002876563.1| zinc finger family protein [Arabidopsis lyra...    65   1e-08

>ref|XP_006347916.1| PREDICTED: GATA transcription factor 2-like [Solanum tuberosum]
          Length = 260

 Score = 107 bits (266), Expect = 2e-21
 Identities = 59/108 (54%), Positives = 77/108 (71%), Gaps = 2/108 (1%)
 Frame = +3

Query: 30  HHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSF 209
           +HHH P+    N+ AAG++ Y+  L P+  S+ DFTD++ VP+DD+ ELEWLSNFV DSF
Sbjct: 39  NHHHQPH--SHNSSAAGAANYYDALLPN--SSDDFTDNLCVPSDDVAELEWLSNFVEDSF 94

Query: 210 TEFPSSSITGTMNIRSETPS-NGSSRSKRFRST-VWTSESATTNSDFS 347
           + FP++S+TGTMNI S T S +G SRSKR RST  WTS    TN+  S
Sbjct: 95  SNFPANSVTGTMNISSNTASFHGRSRSKRSRSTSSWTSSLQNTNATTS 142

>ref|XP_004229778.1| PREDICTED: GATA transcription factor 2-like [Solanum lycopersicum]
          Length = 260

 Score = 105 bits (261), Expect = 8e-21
 Identities = 61/114 (53%), Positives = 79/114 (69%), Gaps = 2/114 (1%)
 Frame = +3

Query: 12  TDSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSN 191
           TDS+   HHH P+    N+ AAG + Y+  L P+  S+ DFTD++ VP+DD+ ELEWLSN
Sbjct: 36  TDSN---HHHQPH--SHNSSAAGPANYYDALLPN--SSDDFTDNLCVPSDDVAELEWLSN 88

Query: 192 FVHDSFTEFPSSSITGTMNIRSETPS-NGSSRSKRFRST-VWTSESATTNSDFS 347
           FV DSF+ FP++S+TGTMNI S T S +G SRSKR RST  WTS    +N+  S
Sbjct: 89  FVEDSFSNFPANSVTGTMNITSNTASFHGRSRSKRSRSTSSWTSSLQNSNATTS 142

>dbj|BAC98493.1| AG-motif binding protein-3 [Nicotiana tabacum]
          Length = 256

 Score = 100 bits (250), Expect = 2e-19
 Identities = 55/119 (46%), Positives = 77/119 (64%), Gaps = 3/119 (2%)
 Frame = +3

Query: 3   ATATDSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEW 182
           +T    D+ HHHH P+    +N +A ++ Y+  L P+   + DFTD++ VP+DD+ ELEW
Sbjct: 34  STTATPDSQHHHHQPH---SDNSSAATANYYDALLPNC--SDDFTDNLCVPSDDVAELEW 88

Query: 183 LSNFVHDSFTEFPSSSITGTMNIRSETPS--NGSSRSKRFRST-VWTSESATTNSDFSN 350
           LSNFV DSF+ FP++SITGTMN+ S + +  +  SRSKR RST  WTS     N+   N
Sbjct: 89  LSNFVEDSFSNFPTNSITGTMNLSSNSTASFHSRSRSKRSRSTSSWTSSLQNPNTTMKN 147

>emb|CBI16598.3| unnamed protein product [Vitis vinifera]
          Length = 255

 Score = 84.3 bits (207), Expect = 2e-14
 Identities = 39/79 (49%), Positives = 57/79 (72%), Gaps = 4/79 (5%)
 Frame = +3

Query: 108 PDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSITGTMNIRSETPSNGSSRS 287
           P+T+ + DFTDD+ VP+DD+ ELEWLSNFV DSF +FP + + GT+  R ++   G +RS
Sbjct: 57  PNTFHSADFTDDLCVPSDDVAELEWLSNFVDDSFADFPENELAGTVMARPDSSFPGRTRS 116

Query: 288 KRFRST----VWTSESATT 332
           KR R++    VWTS S+++
Sbjct: 117 KRSRASSTNKVWTSSSSSS 135

>ref|XP_006375265.1| hypothetical protein POPTR_0014s05760g [Populus trichocarpa]
 gi|550323584|gb|ERP53062.1| hypothetical protein POPTR_0014s05760g [Populus trichocarpa]
          Length = 251

 Score = 83.2 bits (204), Expect = 3e-14
 Identities = 48/105 (45%), Positives = 64/105 (60%)
 Frame = +3

Query: 15  DSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNF 194
           ++ +IHHHHFP           SS Y +  NP + S TDFTD + VP DD+ ELEWLS F
Sbjct: 47  ETSSIHHHHFP-----------SSTYIN--NPSSLS-TDFTDHLSVPTDDVAELEWLSQF 92

Query: 195 VHDSFTEFPSSSITGTMNIRSETPSNGSSRSKRFRSTVWTSESAT 329
           V DSF++FPS      +NI ++T     SRSKR R+T  T+ S++
Sbjct: 93  VEDSFSDFPS-----IINIPTDTSFCNKSRSKRSRATATTATSSS 132

>gb|EOX96349.1| GATA transcription factor 2, putative [Theobroma cacao]
          Length = 273

 Score = 82.0 bits (201), Expect = 7e-14
 Identities = 46/124 (37%), Positives = 73/124 (58%), Gaps = 7/124 (5%)
 Frame = +3

Query: 3   ATATDSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEW 182
           + ++ + + ++  FP      ++A+ SS+        ++S TDFT D+ +P+DD+ ELEW
Sbjct: 28  SASSSTASTNNDQFPPSEAPFSYASASSSSSSAAFHPSFS-TDFTHDLCLPSDDVAELEW 86

Query: 183 LSNFVHDSFTEFPSSSITGTMNIRSETPSNGSSRSKRFR-------STVWTSESATTNSD 341
           LS FV DSFT+FPS+SI GT+N R+++  +  +RSKR R       +T WT+ S
Sbjct: 87  LSQFVEDSFTDFPSNSIAGTLNPRNDSSFSSKARSKRSRAATAMKTTTTWTTMSEAAPPF 146

Query: 342 FSNS 353
             NS
Sbjct: 147 TGNS 150

>ref|XP_002277959.1| PREDICTED: GATA transcription factor 2 [Vitis vinifera]
          Length = 270

 Score = 81.6 bits (200), Expect = 1e-13
 Identities = 38/74 (51%), Positives = 53/74 (71%), Gaps = 4/74 (5%)
 Frame = +3

Query: 108 PDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSITGTMNIRSETPSNGSSRS 287
           P+T+ + DFTDD+ VP+DD+ ELEWLSNFV DSF +FP + + GT+  R ++   G +RS
Sbjct: 57  PNTFHSADFTDDLCVPSDDVAELEWLSNFVDDSFADFPENELAGTVMARPDSSFPGRTRS 116

Query: 288 KRFRST----VWTS 317
           KR R++    VWTS
Sbjct: 117 KRSRASSTNKVWTS 130

>ref|XP_002511642.1| GATA transcription factor, putative [Ricinus communis]
 gi|223548822|gb|EEF50311.1| GATA transcription factor, putative [Ricinus communis]
          Length = 235

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 38/73 (52%), Positives = 55/73 (75%), Gaps = 1/73 (1%)
 Frame = +3

Query: 123 ATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSITGTMNIRSETPSNG-SSRSKRFR 299
           +TDFTD + VP+DD+ ELEWLS FV DSF EFP + +TGT+N+RS+T  +G ++R KR +
Sbjct: 60  STDFTDHLSVPSDDVAELEWLSQFVDDSFIEFPPNLLTGTINVRSDTSFSGKAARRKRSK 119

Query: 300 STVWTSESATTNS 338
           +   T+ +A T+S
Sbjct: 120 AATTTATTAWTSS 132

>ref|XP_002301258.2| hypothetical protein POPTR_0002s14380g [Populus trichocarpa]
 gi|550345007|gb|EEE80531.2| hypothetical protein POPTR_0002s14380g [Populus trichocarpa]
          Length = 246

 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 43/104 (41%), Positives = 62/104 (59%)
 Frame = +3

Query: 15  DSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNF 194
           ++ +IHHHH        +F    + Y   +N  +  +TDFTD + VP+DD+ ELEWLS F
Sbjct: 42  ETSSIHHHH--------HFFPSPTTY---INNTSSLSTDFTDHLSVPSDDVAELEWLSQF 90

Query: 195 VHDSFTEFPSSSITGTMNIRSETPSNGSSRSKRFRSTVWTSESA 326
           + DSFT+FPS     T+NI ++T S   S SKR R+T   + S+
Sbjct: 91  MEDSFTDFPS-----TINIPTDTSSRIKSCSKRSRTTTTATSSS 129

>ref|XP_006445338.1| hypothetical protein CICLE_v10021733mg [Citrus clementina]
 gi|568875525|ref|XP_006490843.1| PREDICTED: GATA transcription factor 2-like [Citrus sinensis]
 gi|557547600|gb|ESR58578.1| hypothetical protein CICLE_v10021733mg [Citrus clementina]
          Length = 263

 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 41/100 (41%), Positives = 61/100 (61%)
 Frame = +3

Query: 54  SDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSI 233
           SD ++        F + NP    ++DFT D+ VP+DD+ ELEWLS FV DS  +FP++S+
Sbjct: 47  SDTDHLPQAQHQSFDSFNP----SSDFTGDLCVPSDDVAELEWLSQFVDDSCMDFPANSL 102

Query: 234 TGTMNIRSETPSNGSSRSKRFRSTVWTSESATTNSDFSNS 353
            GT+ +RS+T  +G  RSKR ++T   + + T N   S S
Sbjct: 103 AGTI-VRSDTSLSGRGRSKRSKATNSAANTTTWNWTSSES 141

>ref|XP_004248553.1| PREDICTED: GATA transcription factor 2-like [Solanum lycopersicum]
          Length = 256

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 47/111 (42%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
 Frame = +3

Query: 6   TATDSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWL 185
           TA D D ++HH+ P  +D     A +  Y+H       ++ DFTD + VP+DD+ ELEWL
Sbjct: 32  TAIDFD-LNHHYQPPPTDS---IADTGCYYHA----PPNSVDFTDKLCVPSDDVAELEWL 83

Query: 186 SNFVHDSFTEFPSSSITGTMNIRSETPS--NGSSRSKRFR---STVWTSES 323
           SNFV DS   FPS+++T TM   + T +  +  SRSKR R   ST W + S
Sbjct: 84  SNFVEDSSNNFPSNNLTQTMYHLNNTNTILHSKSRSKRSRNSNSTSWNTSS 134

>ref|XP_006359592.1| PREDICTED: GATA transcription factor 2-like [Solanum tuberosum]
          Length = 258

 Score = 73.2 bits (178), Expect = 3e-11
 Identities = 48/113 (42%), Positives = 65/113 (57%), Gaps = 7/113 (6%)
 Frame = +3

Query: 6   TATDSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWL 185
           TA D D  HH+  P     +N AA +  Y+  L     ++ DFTD + VP+DD+ ELEWL
Sbjct: 31  TAIDFDLNHHYQPP---PTDNIAA-AGCYYDALP----NSVDFTDKLCVPSDDVAELEWL 82

Query: 186 SNFVHDSFTEFPSSSITGTMNIRSETPS-----NGSSRSKRFR--STVWTSES 323
           SNFV D+   FPS+S+T TM   + T +     +  SRSKR R  +T WT+ S
Sbjct: 83  SNFVEDTSNNFPSNSLTQTMYHLNNTNNTTTILHSKSRSKRSRNSNTSWTTSS 135

>ref|XP_006402573.1| hypothetical protein EUTSA_v10006202mg [Eutrema salsugineum]
 gi|312282833|dbj|BAJ34282.1| unnamed protein product [Thellungiella halophila]
 gi|557103672|gb|ESQ44026.1| hypothetical protein EUTSA_v10006202mg [Eutrema salsugineum]
          Length = 247

 Score = 67.8 bits (164), Expect = 1e-09
 Identities = 36/79 (45%), Positives = 48/79 (60%)
 Frame = +3

Query: 66  NFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSITGTM 245
           NF + +S  FHT  P     TDFT D  VP+DD   LEWLS FV DSF+++P++ +  TM
Sbjct: 48  NFPSSASNSFHTSPPPLL--TDFTHDFCVPSDDAAHLEWLSRFVDDSFSDYPANPL--TM 103

Query: 246 NIRSETPSNGSSRSKRFRS 302
            +R E    G  RS+R R+
Sbjct: 104 TVRPEMSFTGKPRSRRSRA 122

>ref|XP_006294606.1| hypothetical protein CARUB_v10023643mg [Capsella rubella]
 gi|482563314|gb|EOA27504.1| hypothetical protein CARUB_v10023643mg [Capsella rubella]
          Length = 322

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 37/92 (40%), Positives = 50/92 (54%), Gaps = 1/92 (1%)
 Frame = +3

Query: 30  HHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSF 209
           HHHH P+ +D ++F                       DI VP+DD   LEWLS FV DSF
Sbjct: 111 HHHHLPSSADHHSFL---------------------HDICVPSDDAAHLEWLSQFVDDSF 149

Query: 210 TEFPSSSITGTM-NIRSETPSNGSSRSKRFRS 302
            +FP++ + GTM +++SET   G  RSKR R+
Sbjct: 150 ADFPANPLGGTMASVKSETSFPGKPRSKRSRA 181

>gb|ADL36699.1| GATA domain class transcription factor [Malus domestica]
          Length = 239

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 39/98 (39%), Positives = 58/98 (59%)
 Frame = +3

Query: 9   ATDSDAIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLS 188
           +TDS  +HHH  P      +   G++         TY  TDFT+++ VP+DD+ ELEWLS
Sbjct: 20  STDSMDLHHHPPPP-----DHLHGTTTTSLFAPATTY--TDFTNNLCVPSDDVAELEWLS 72

Query: 189 NFVHDSFTEFPSSSITGTMNIRSETPSNGSSRSKRFRS 302
            FV DSFT+FP++ +TG+ + ++E      SR +  RS
Sbjct: 73  RFVDDSFTDFPTTDLTGSASFQNEASFMFPSRVRTKRS 110

>ref|NP_182031.1| GATA transcription factor 2 [Arabidopsis thaliana]
 gi|62900344|sp|O49741.1|GATA2_ARATH RecName: Full=GATA transcription factor 2; Short=AtGATA-2
 gi|2959732|emb|CAA74000.1| homologous to GATA-binding transcription factors [Arabidopsis thaliana]
 gi|24030302|gb|AAN41321.1| putative GATA-type zinc finger transcription factor [Arabidopsis thaliana]
 gi|222423708|dbj|BAH19820.1| AT2G45050 [Arabidopsis thaliana]
 gi|225898595|dbj|BAH30428.1| hypothetical protein [Arabidopsis thaliana]
 gi|330255406|gb|AEC10500.1| GATA transcription factor 2 [Arabidopsis thaliana]
          Length = 264

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 36/94 (38%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
 Frame = +3

Query: 24  AIHHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHD 203
           + HHHH P+ +D ++F                       DI VP+DD   LEWLS FV D
Sbjct: 50  SFHHHHLPSSADHHSFL---------------------HDICVPSDDAAHLEWLSQFVDD 88

Query: 204 SFTEFPSSSITGTM-NIRSETPSNGSSRSKRFRS 302
           SF +FP++ + GTM ++++ET   G  RSKR R+
Sbjct: 89  SFADFPANPLGGTMTSVKTETSFPGKPRSKRSRA 122

>gb|EMJ22222.1| hypothetical protein PRUPE_ppa014583m1g, partial [Prunus persica]
          Length = 250

 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 31/60 (51%), Positives = 45/60 (75%)
 Frame = +3

Query: 123 ATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSITGTMNIRSETPSNGSSRSKRFRS 302
           ATDFT+D+ VP+DD+ ELEWLS FV DSFT+FP++++ G+ +  ++T S   SR +  RS
Sbjct: 66  ATDFTNDLCVPSDDVAELEWLSRFVDDSFTDFPTTNVFGSASFPNDTSSLFPSRVRTNRS 125

>gb|ADK63416.1| GATA type zinc finger protein [Brassica rapa]
          Length = 256

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 35/89 (39%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
 Frame = +3

Query: 30  HHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSF 209
           HHHH P+ +D +                      F  DI VP+DD   LEWLS FV DSF
Sbjct: 50  HHHHLPSSADHS----------------------FLHDICVPSDDAAHLEWLSQFVDDSF 87

Query: 210 TEFPSSSITGTM-NIRSETPSNGSSRSKR 293
            +FP++ + GTM ++++ET   G  RSKR
Sbjct: 88  ADFPANPLGGTMTSVKTETSFTGKPRSKR 116

>ref|XP_002880154.1| hypothetical protein ARALYDRAFT_903940 [Arabidopsis lyrata subsp. lyrata]
 gi|297325993|gb|EFH56413.1| hypothetical protein ARALYDRAFT_903940 [Arabidopsis lyrata subsp. lyrata]
          Length = 262

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 36/92 (39%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
 Frame = +3

Query: 30  HHHHFPNYSDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSF 209
           HHHH P+ +D ++F                       DI VP+DD   LEWLS FV DSF
Sbjct: 52  HHHHLPSSADHHSFL---------------------HDICVPSDDAAHLEWLSQFVDDSF 90

Query: 210 TEFPSSSITGTM-NIRSETPSNGSSRSKRFRS 302
            +FP++ + GTM + ++ET   G  RSKR R+
Sbjct: 91  ADFPANPLGGTMTSAKTETSFPGKPRSKRSRA 122

>ref|XP_002876563.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322401|gb|EFH52822.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 240

 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 39/83 (46%), Positives = 50/83 (60%)
 Frame = +3

Query: 54  SDKNNFAAGSSAYFHTLNPDTYSATDFTDDIRVPNDDMKELEWLSNFVHDSFTEFPSSSI 233
           S +N F   SSAY  T  P     TDFT D+ VP+DD   LEWLS FV DSF++FP++ +
Sbjct: 39  SSENPFNFPSSAY--TSPP---LLTDFTHDLCVPSDDAAHLEWLSRFVDDSFSDFPANPL 93

Query: 234 TGTMNIRSETPSNGSSRSKRFRS 302
             TM +R E    G  RS+R R+
Sbjct: 94  --TMTVRPEISFTGKPRSRRSRA 114