Hi,
I would like to know if there is a way to delete old/unused data easily ?
I know about TTL but there are 2 limitations of TTL:
- AFAIK, there is no TTL on counter columns - TTL need to be defined at write time, so it's too late for data already inserted.
I also could use a standard "delete" but it seems inappropriate for such a massive.
In some cases, I don't know the row key and would like to delete all the rows starting by, let's say, "1050#..."
Even better, I understood that columns are always inserted in C* with (name, value, timestamp). So is it possible to delete all the data inserted in some CF between 2 dates or data older than 1 month ?
Alain
Alain RODRIGUEZ's gravatar image asked Feb 11 2013 at 15:21 in Cassandra-User by Alain RODRIGUEZ

1 Answers

> So is it possible to delete all the data inserted in some CF between 2 dates or data older than 1 month ? No.
You need to issue row level deletes. If you don't know the row key you'll need to do range scans to locate them.
If you are deleting parts of wide rows consider reducing the min_compaction_level_threshold on the CF to 2
Cheers
----------------- Aaron Morton Freelance Cassandra Developer New Zealand
@aaronmorton http://www.thelastpickle.com
aaron morton's gravatar image answered Feb 12 2013 at 17:00 by aaron morton

Related Discussions