influxdb 🚀 - [Feature request] Drop field

On a side note: I also expected one of these inserts/updates

traffic,id=42 tx_pkt=null,rx_pkt=null 1459265806000000000
traffic,id=42 tx_pkt=,rx_pkt= 1459265806000000000

to delete the field key/value from the field set (in #2429 it was stated that TSM doesn't handle null values). Would that be a feasible feature request?

dmke on 29 Mar 2016

+1

gvohra on 25 May 2016

👎5 👍5

+1

PaulKuiper on 24 Jun 2016

👎5 👍4

+1

dstreppa on 9 Aug 2016

👎5 👍4

+1. I don't so much want to store nulls myself, but I'd love to have a way to signal "don't use this value". I store enriched data with my measurements, and if the enrichment changes, sometimes fields need to disappear.

retorquere on 22 Aug 2016

Another use case: I've been storing a field that I later realized can be approximated well enough from two other fields. I'd like to remove that field from the measurement.

While this isn't possible yet, are there some migration tools that can assist with "filtering" a database into a new one, sans undesired fields and tags?

dandv on 14 Sep 2016

👍6 👎2

+1 if a random field gets added to a measurement by accident I'd definitely like to be able to remove it, at the moment it stays even if that field has 0 non-null datapoints

drmclean on 16 Sep 2016

👍28

Until/if this gets implemented, what is the recommended workaround? SELECT INTO? How do you make sure that all wanted points were copied to the new measurement, if you have points containing NULLs, given than COUNT ignores nulls?

dandv on 25 Sep 2016

StackOverflow question: How to delete columns of a series in InfluxDB.

dandv on 26 Sep 2016

👍3

+1

superbool on 27 Sep 2016

👍5 👎4

Please stop commenting "+1" or similar non-comments. It becomes a little annoying and doesn't really add to the discussion.

If you want to share your reaction, hit the little icon:
add your raction

If you want to get notified about updates, click the subscribe button on the sidebar:

Thanks.

dmke on 5 Oct 2016

👍29 ❤8

adding onto what @dandv said: SELECT / INTO is not very efficient (been running for 10h for 15G of data with 16 processors and still not done). Would love to see this feature implemented.

Occasionally an input plugin will fail and insert bad data, which ruins your entire measurement.

anazia on 5 Oct 2016

👍16

One of the great attractions of InfluxDB for us was the free-form data storage: you can store any field/value pairs you like. The trap seems to be that once set it's impossible to remove field/value pairs from a measurement without recreating the entire measurement series using SELECT .. INTO.

Aside from being very slow, it seems that the INTO part must specify a different database/rp/measurement (i.e. one or more of those must be different to the source data), which creates another problem: you can't rename any of those, once written. So you either have to write a 'temporary' measurement then delete the original and copy the temporary back to the original name, or modify the application to use the new database/dp/measurement name.

Being able to DROP FIELD <fieldname> FROM MEASUREMENT <name> would save a lot of hassle when it's necessary to perform what should be a fairly simple repair.

JeremySTX on 29 Jun 2017

👍48 😕1

DROP TAG <tagname> FROM MEASUREMENT <name>
would be really useful too.

lqueryvg on 25 Oct 2017

👍15

If you drop all series which have the column (in my case only few series had an extra column due to an error) InfluxDB automatically removes that column. Show Field Keys no longer lists that column.

mvadu on 3 Nov 2017

+1

7c on 2 Feb 2018

👎3

This is kinda crippling. I'll do the following:

DROP SERIES WHERE "myTagName" = 'PeskyTagIWantToDelete'

and theres only like a 50/50 that it gets removed when I do

SHOW TAG VALUES WITH KEY "myTagName"

NateZimmer on 10 Mar 2018

😕1

@NateZimmer if you drop series, and restart influxdb. the fields will be gone.

geometrybase on 31 May 2018

Can we get a status update on this issue from the Influx team? This issue really is pretty crippling.

Dwood15 on 5 Jul 2018

👍18

I had a thought about this which may or may not be helpful ...
I have to assume that the lack of a DROP FIELD or DROP TAG operation and the lack of any sign of activity to provide such an operation is because it's hard to do, or is estimated to have significant performance impact on the data store. (How the performance impact could be worse than DROP SERIES and then recreating the series, I don't know!)

Anyway perhaps an alternative would be HIDE FIELD or HIDE TAG, which means "don't return this field (or tag) unless specifically requested in the query". So for example

HIDE FIELD battery_charge FROM meters

would add a flag to the field key for 'battery_charge' in 'meters' so that any future query such as

SELECT * FROM meters WHERE meter_id='grid1'

would return every field except the 'battery_charge' field, but

SELECT *,battery_charge FROM meters WHERE meter_id='grid1'

would return every field including the 'battery_charge' field, and
SELECT battery_charge FROM meters WHERE meter_id='grid1'
would return only the 'battery_charge' field.

I suppose it might be extended to the SERIES level but for our purposes having it at the MEASUREMENT level would be sufficient.

Just a suggestion.

JeremySTX on 30 Jul 2018

👍1

I keep running into the need for this because there's no way to effectively search for data points with certain fields to remove them manually, without first querying all the data.

shakefu on 31 Jul 2018

👍15 ❤1

Another example is with the new telegraf syslog receiver, which creates fields dynamically from structured data. To test this I did the following:

# echo '173 <165>1 2003-10-11T22:14:15.003Z mymachine.example.com evntslog - ID47 [exampleSDID@32473 iut="3" eventSource="Application" eventID="1011"] An application event log entry...' | nc 127.0.0.1 6514

It works fine, and generates dynamic fields from the structured data.

The trouble is, when I drop the test record, those dynamic fields live forever.

> delete from syslog where "appname" = 'evntslog'
> show field keys from syslog
name: syslog
fieldKey                      fieldType
--------                      ---------
exampleSDID@32473_eventID     string
exampleSDID@32473_eventSource string
exampleSDID@32473_iut         string
facility_code                 integer
message                       string
msgid                         string
procid                        string
severity_code                 integer
timestamp                     integer
version                       integer

And now select * from syslog... always includes these fat columns which never contain data :-(

Aside: if those extra values had been tags then it would have been very easy, because you can just drop the series: e.g. drop series where hostname='mymachine.example.com'

candlerb on 27 Sep 2018

👍3

I too need to delete an unnecessary field on Influxdb 1.6.4

jfcg on 13 Dec 2018

Yup would be good if we have this feature drop field and tag. Also, alter the tag and field names.

gowrisankar22 on 10 Feb 2019

Can you please at least put this feature in the roadmap? This can make an entire database useless when a field is written to with two different types by error.

etcinitd on 15 Feb 2019

👍15 ❤2

This is a feature that needs to be added ASAP. We experienced exactly what happened above (and it indeed does make the entire database useless). With no way to drop field, or alter table, you are shit out of luck when (mind you, with no human intervention) telegraf decided to create an extra field with a fieldvalue of float when it already had a field with the same name in the correct fieldvalue of integer. There is NO WAY to remove this extra field, making telegraf unable to write to DB, and no fix available. This is crazy.

z0rx0r on 4 Mar 2019

👍8

@geometrybase

if you drop series, and restart influxdb. the fields will be gone.

This seems not to work :(
Here is my log what I have tried so far:

> select * from water limit 1;
name: water
time                cons consumption count pulse sum    value
----                ---- ----------- ----- ----- ---    -----
1552147044000000000      0           1     1     345879
# Here you see: I want to get rid of the two fields "cons" and "value"

> SELECT "consumption","count","pulse","sum" INTO "water4" FROM "water" GROUP BY *

> select * from water4 limit 1;
name: water4
time                consumption count pulse sum
----                ----------- ----- ----- ---
1552147044000000000 0           1     1     345879

> drop measurement water;
> drop series from water;

root@home:~/influx# service influxdb restart
Stopping influxdb...
influxdb process was stopped [ OK ]
Starting influxdb...
influxdb process was started [ OK ]

> SELECT * INTO "water" FROM "water4" GROUP BY *

> select * from water limit 1;
name: water
time                cons consumption count pulse sum    value
----                ---- ----------- ----- ----- ---    -----
1552147044000000000      0           1     1     345879

pzystorm on 18 Mar 2019

@pzystorm I was able to get this to work (at least for my measurement) by flipping the order:

> drop series from <measurement>;
> drop measurement <measurement>;

nathanielatom on 19 Mar 2019

Could we at least hear from the developers about whether this is/is not on the roadmap?

nathanielatom on 19 Mar 2019

👍10

@pzystorm I was able to get this to work (at least for my measurement) by flipping the order:
> drop series from <measurement>;
> drop measurement <measurement>;

Flipping the order did not work for me. The only way to get rid of the fields was to copy into a new measurement and copy it back (as described by @pzystorm) thereby making the fields empty (not deleting them). Afterwards backup the whole db, drop the db, and restore the db. The whole procedure can take hours if you want to get rid of a single field (in my case because it was written once with a wrong datatype). Very very cumbersome...

drb-germany on 26 Mar 2019

👍3 😕1

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] on 23 Jul 2019

👎12

Wait, what? This is higly demanded feature and some stupid bot decides to close it just because there is no "recent activity"? What do you want us to do - talk about it all day? Will it make you implement it faster?

maraswrona on 23 Jul 2019

👍9

I'm happy to set up a bot that will request this feature once a week if that helps?

-------- Original message --------
From: maraswrona notifications@github.com
Date: 23/07/2019 10:31 (GMT+00:00)
To: influxdata/influxdb influxdb@noreply.github.com
Cc: Ivan Scattergood Ivan@ApogeeSoftware.co.uk, Manual manual@noreply.github.com
Subject: Re: [influxdata/influxdb] [Feature request] Drop field (#6150)

Wait, what? This is higly demanded feature and some stupid bot decides to close it just because there is no "recent activity"? What do you want us to do - talk about it all day? Will it make you implement it faster?

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHubhttps://github.com/influxdata/influxdb/issues/6150?email_source=notifications&email_token=ABZRCH4Y6MQQZQ5IRJ3QRDTQA3FYBA5CNFSM4B7LMRG2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD2SQNNY#issuecomment-514131639, or mute the threadhttps://github.com/notifications/unsubscribe-auth/ABZRCH5O65WOPEQGAWBNUVLQA3FYBANCNFSM4B7LMRGQ.

ivanscattergood on 23 Jul 2019

👍15 😄10 🎉3

I'm happy to set up a bot that will request this feature once a week if that helps?

not all heroes wear capes.

drmclean on 23 Jul 2019

I just had this issue. I accidentally added some data as fields instead of the expected tags. So I ended up with a tag and field with the same name. This, unfortunately, meant that the original tag started being returned as tagname_1 in queries, breaking stuff expecting it to be called tagname.

I deleted all measurements that had values applied to the erroneous field 'tagname' but still InfluxDB insisted that the fields existed.

Interestingly the Chronograf explore via flux stopped showing them in the schema though.

I restarted InfluxDB but that didn't help.

I then spotted the following post https://github.com/influxdata/influxdb/issues/10052 and updated to 1.7.8 but this did not fix the issue.

However, I then deleted the .idx files as suggested in that post and this removed the fields! Happy days.

But please give us DROP FIELD & DROP TAG.

andsee on 20 Sep 2019

👍9

Interesting. Complete radio silence from anyone in the development community.......

avrhack on 18 Oct 2019

For almost 4 years(

positron96 on 22 Nov 2019

Can someone from the development community at least summarise the steps and/or challenges for implementing this? Maybe someone else can help to implement it because it's a much needed feature.

etcinitd on 3 Dec 2019

👍5

Is Indflux a dead project? This issue is 4 years old and highly demanded ... still .. no reply from devs at all.

mmihalev on 10 Jan 2020

It's not dead. Last commit was just half an hour ago.

retorquere on 10 Jan 2020

Idk how similar this is but I had an issue with my redis telegraf input where it was giving errors on one of my linux servers saying that the filed rss_overhead_bytes already had a value so both values were dropping
Error:
So I checked

SHOW FIELD KEYS FROM "redis
repl_backlog_size integer
rss_overhead_bytes float
rss_overhead_bytes integer
rss_overhead_ratio float

And my fix was to drop the redis field.
I figured it would repopulate and it did

DROP FIELD KEYS FROM "redis

SHOW FIELD KEYS FROM "redis
repl_backlog_size integer
rss_overhead_bytes float
rss_overhead_ratio float

This resolved my issue. not sure if it helps with others but hope it does.

tmeralus on 16 Jan 2020

It would be nice to have this feature.

We use InfluxDB Cloud 1 and one of our developers (using the nginx telegraf plugin) pushed 87K unique fields to one of our DBs. This caused us to be upgraded to the next hosting tier when we dont need to be.

If we could remove the erroneous field then we can go back down to the previous tier we were on. The select INTO option is slow and a horrible workaround. Since there was so much data we ended up just removing the measurement all together.

leedhi on 18 May 2020

We experienced this issue as well and got 1d of data not being written by telegraf because of two fields with the same name but different type. The risk of not having the _drop field_ or equivalent feature is high - as written in one of the comments above - one small incident renders complete db useless and can trigger good business and monetary losses.

dbo-wizata on 27 Oct 2020

👍2

If you drop all series which have the column (in my case only few series had an extra column due to an error) InfluxDB automatically removes that column. Show Field Keys no longer lists that column.

I have not found this to be the case. I just inserted a row by accident and deleted it, but all the new columns remain.

JetForMe on 22 Nov 2020

Same deal, multiple types (by mistake) in same field cause impossibilities to write new values in a series
Dropping responsible series does not suffice to fix the issue. We are losing a lot of time and effort on this seemingly minor issue. Please address this.

tomaszG247 on 16 Dec 2020

👍1

Influxdb: [Feature request] Drop field

Most helpful comment

All 45 comments

Related issues