Etcd: Question： Why STM did not support range get yet

Created on 27 Feb 2017 · 22Comments · Source: etcd-io/etcd

I have read the STM code in etcdv3 , and gonna to use in my project.

But here is a big problem there: I need the range get in STM .

Why there is no range get in STM ? Can I add this feature with a pull request?

arefeature arequestion stale

Source

arthurkiller

👍3

All 22 comments

@xiang90

arthurkiller on 27 Feb 2017

@arthurkiller it's written that way because there's a potential for a Range to fetch a very large number of keys which can lead to a large number of false conflicts due to a huge readset and poor performance from very large transactional compare blocks. How do you intend to use STM?

heyitsanthony on 27 Feb 2017

@heyitsanthony thanks a lot ,dude. I need the STM to __handle the potential conflicts__ while doing the update in a parallel way. We think this should __let the user to decide__ if a large number false is acceptable.

I have also make up a STM-like facilities , it provide the isolation ability between the read committed & repeatable read Cause we really __need the range get and doesn't care the failure much__ . In this situation etcd still get a good performance

Etcd can be used in many situations and the __failure acceptable__ is one of them I think.

arthurkiller on 28 Feb 2017

@arthurkiller Would an extension to the STM interface that takes a range [begin, end) and returns a slice of key/value pairs like Range(begin, end string) [][2]string be enough?

heyitsanthony on 28 Feb 2017

@heyitsanthony Actually I only need the prefix option and don't care the implementation

arthurkiller on 28 Feb 2017

@heyitsanthony would like a PR ? I can do this

arthurkiller on 28 Feb 2017

@arthurkiller sure

heyitsanthony on 28 Feb 2017

@heyitsanthony I want to use the AVL tree / dictionary tree instead of the map(hash map). Cause Tree can handle the prefix matching problem with less cost. How do you think about this? This may add another dependency in the package

arthurkiller on 28 Feb 2017

@arthurkiller there's already an interval tree in pkg/adt/interval_tree.go. I think that kind of approach will be very complicated though. An easier way: when calling Range(begin, end string) [][2]string, the STM runtime would cache the result for the begin/end pair and that's it.

One problem with Range and STM (and why I'm hesitant to diverge from the traditional single-key model): suppose a transaction T1 makes a range request [a, d), retrieving {a, b}. Concurrently, another transaction T2 inserts "c" and commits, giving {a, b, c}; there's now a read conflict on T1. How do you intend to resolve it?

heyitsanthony on 28 Feb 2017

👍1

@heyitsanthony This makes me confused. Question is that Serilizable read should not allow phantom read, right?

arthurkiller on 2 Mar 2017

@heyitsanthony You mean T1 do the read {a,b,c} but only get the {a,b} revison and T2 write the {c} then T1 should get a conflict ???

arthurkiller on 2 Mar 2017

@heyitsanthony I see ... Seems snapshot can solve this?

arthurkiller on 2 Mar 2017

@heyitsanthony Should serializable read add the revision(global) check when commit? That's not a good way , cause we get a global RWLock over etcd version.But it can avoid phantom read ,right ?

arthurkiller on 2 Mar 2017

Should serializable read add the revision(global) check when commit? That's not a good way , cause we get a global RWLock over etcd version.But it can avoid phantom read ,right ?

That would catch the read conflict, but would have so many false conflits it would be effectively unusable. Plus, that kind of test isn't supported in the transaction API and probably shouldn't be; if the txn model is going to be extended it should include range tests (e.g., Range(a, b).modrev < r).

Can you give a concrete example of how you're planning on using etcd / what your data model is going to be? Maybe STM isn't the best fit.

heyitsanthony on 2 Mar 2017

@heyitsanthony Sure. We make up a config manager service over etcd. And here is a problem. We need to identify&manage a kind of keys(this key is the config key ,they may have a common prefix) get same prefix.We also defined some status while operating these keys.And those operation should be concurrent safety

Now I do this with a additional key to hold the revision while do the transections to isolate the transections.And it really take effect.But till I read the STM ,I realized my isolation facilities is just between the read-committed & serializable-read.

Maybe here is a another way to handle this

arthurkiller on 3 Mar 2017

@arthurkiller was this the revision of the entire etcd store? that would have a lot of conflicts...

Maybe use a distributed lock? The one in clientv3/concurrency/mutex.go can guard concurrent transactions with IsOwner().

heyitsanthony on 3 Mar 2017

@heyitsanthony Yep, seems STM is not suit for this kind of problem.
So, what kind of problem is suit for STM??

arthurkiller on 4 Mar 2017

@arthurkiller it's suitable for problems which can be solved with ordinary software transactional memory. If the algorithm accesses keys like addresses in memory, it's OK; range queries are not part of that model.

heyitsanthony on 4 Mar 2017

👍1

@heyitsanthony Thx dude. I will think about this.

arthurkiller on 6 Mar 2017

Reopening since the txn API can do this as of #8025 and people still ask for this STM extension.

heyitsanthony on 9 Aug 2017

Also interested in this :)

halseth on 9 Oct 2019

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.