SGS database consistency model?

Is the consistency model for the SGS globally sequenced?

Thus, if I peek object A on node X, then lock/write object A on node Y, then lock/write object A on node Z (which clearly sees the state from node Y), then attempt to peek object A on node X again, am I guaranteed to get the state written by node Y or later, or is there a chance I could get the earlier state I already peeked?

I.e., is the consistency semantic that a commit will always distribute state, even to peek/readers, or will the local node really only know that its view of the state will advance when it actually locks/writes? The latter has performance benefits, but implementation pains…

It's a bit simpler to explain, actually. These are the guarantees:

(1) Peeks are repeatable within a task. After the first peek, subsequent peeks will return the same object, EXCEPT:
(2) Doing a GET effectively resets the state, so after a GET all subsequent peeks will return the same object as the GET. If you are keeping references to the results of previous peeks, though, those remain the original object.
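
To make those two rules concrete, here is a toy Java sketch. The class and method names are made up for illustration (this is not the real SGS API); it just models the reference-identity behavior described above within a single task:

```java
import java.util.HashMap;
import java.util.Map;

// Toy model of one task's view of a GLO; names are illustrative, not the real SGS API.
class TaskView {
    private final Map<String, String> committed;                      // committed images in the store
    private final Map<String, StringBuilder> local = new HashMap<>(); // task-local copies

    TaskView(Map<String, String> committed) { this.committed = committed; }

    // (1) First PEEK copies the committed image; later PEEKs return that same copy.
    StringBuilder peek(String name) {
        return local.computeIfAbsent(name, n -> new StringBuilder(committed.get(n)));
    }

    // (2) GET makes a fresh copy and replaces the task-local view,
    // so all later PEEKs return the GET copy.
    StringBuilder get(String name) {
        StringBuilder copy = new StringBuilder(committed.get(name));
        local.put(name, copy);
        return copy;
    }
}

public class PeekGetDemo {
    public static void main(String[] args) {
        Map<String, String> store = new HashMap<>();
        store.put("A", "one");
        TaskView task = new TaskView(store);

        StringBuilder p1 = task.peek("A");
        System.out.println(task.peek("A") == p1); // true: peeks are repeatable within a task

        StringBuilder g = task.get("A");
        System.out.println(task.peek("A") == g);  // true: after GET, peeks return the GET copy
        System.out.println(p1 == g);              // false: an old peek reference stays the original object
    }
}
```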

Thanks for the explanation. That doesn’t, however, talk about the consistency model between different tasks.

Does a commit mean that any PEEK done physically after the commit completes will see the committed state, or can a physically later PEEK actually return data that is part of the pre-commit state? My guess is that, yes, a PEEK can actually return state that is out of date WRT the commit state of the object, and you’ll have to use GET if you want to ensure you have the “latest” state.

I’m assuming that successive GETs are synchronized and serial (and, in fact, you can’t GET an object at two places at physically the same time).

If I’m following your question, hplus, and understanding the SGS model correctly, then your answer should be that after the commit any PEEK will return the new object. Even though the game servers are running on multiple separate nodes, they are all drawing from the same central data repository (if part of the same “game world”). It is not that the information is being synced across multiple game server nodes and there may be a lag. They all go to the same source.

Hope that helps (and hope that was correct! ;D)

The way I understand Jeff:
If process A peeks GLO “blahblah” and sees the value “one”,
then process B gets GLO “blahblah” and also sees value “one”,
and process B then commits a new value “two” to GLO “blahblah”.

Now, AFTER this commit, if process A does another PEEK at “blahblah”, it will still see the old value “one”, because inside one process all PEEKs return the same value.

[quote]Peeks are repeatable within a task. After the first peek, subsequent peeks will return the same object, EXCEPT:
[/quote]
B, on the other hand, will read the new value “two”, regardless of PEEKing or GETting.

Hi

As I understand Jeff's other posts elsewhere, after process B does its get, nothing in any process can do a peek until the commit has happened, and at that point, it will get a different object because the commit has happened. It only returns the same GLO object if its data is unchanged.

Endolf

Okay guys… let me try to explain this clearly. I'm sorry it's so confusing… but any alternate models we could come up with that didn't have really vicious overhead were even more confusing…

There is always a value in the object store to be PEEKed. It is the last committed value of the object.

This does not change until a transaction that has done a GET on the object, and subsequently modified it, commits.
At that point there is a new value to be PEEKed in the object store.

Now, inside of a single transaction, what happens on GET and PEEK is kind of important to completing the story, so here's how it works:

When you PEEK the first time, you get a reference to an object that is a copy of that state in the object store. Subsequent PEEKs will return a reference to that same object. Even if another transaction comes in and commits a new value, you won't see it because you already have a peek-copy locally.

The one time this changes is when you do a GET. When you do a GET the first time, a new object representing a copy of the current state in the object store is made at the same time the GET-lock is taken.
After this, any subsequent GETs return a reference to that copy.
ALSO after that GET any subsequent PEEKs return a reference to the copy returned by GET… even if you did a PEEK before the GET.

That last line is the sort-of-tricky part. But it really does result in the code doing “what you want” in an intuitive manner the vast majority of the time.
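
A toy sketch of that timeline may help (plain maps standing in for the object store image and the task-local copies, not the real SGS classes):

```java
import java.util.HashMap;
import java.util.Map;

// Toy timeline of the PEEK/GET rules across two tasks (illustrative only).
public class CrossTaskDemo {
    public static void main(String[] args) {
        Map<String, String> store = new HashMap<>();   // committed images ("cold storage")
        store.put("blahblah", "one");

        // Task A peeks: it now holds a task-local copy of "one".
        Map<String, String> taskA = new HashMap<>();
        taskA.put("blahblah", store.get("blahblah"));

        // Task B does a GET (in the real system this also takes the write lock),
        // modifies its copy, and commits.
        String bCopy = store.get("blahblah");
        bCopy = "two";
        store.put("blahblah", bCopy);                  // commit: the store's image is now "two"

        // Task A peeks again: it still sees its own peek-copy, "one".
        System.out.println(taskA.get("blahblah"));     // one

        // A brand-new task peeks and sees the last committed value, "two".
        System.out.println(store.get("blahblah"));     // two

        // If task A now does a GET, it copies the current committed image ("two"),
        // and all of A's later peeks return that GET copy.
        taskA.put("blahblah", store.get("blahblah"));
        System.out.println(taskA.get("blahblah"));     // two
    }
}
```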

Cool, that clears up the in-instance things, but what about across instances of SGS on ‘the big backend’? If I peek from a task, and on another instance elsewhere in the cluster another task does a get, what happens if I peek again from the original task? I can't get a reference to an object that is on another machine unless you are using RMI or something along those lines, so do I get the same instance again? What happens across instances of SGS after the commit?

I know this is a complicated area; otherwise we would all have written code to do it ourselves :slight_smile:

Just trying to get it straight in my mind.

Endolf

Think of the ObjectStore as a “vault” or cold storage. Objects in the object store are just images. They don't exist as objects until pulled into a GLE.

In the case of a PEEK, a transaction-local copy of the object is created from the value in the GLE for use.
If another transaction does a PEEK, it creates its own transaction-local copy.

In the case of a GET, the same thing happens, but additionally a flag is set on the cold storage to make any other attempt to GET it block until the currently lock-holding transaction is complete.

It doesn't matter where the transactions live… one machine or many. Each gets its own copy.
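
A rough sketch of those mechanics, assuming GET simply takes a per-object lock that commit releases (a toy illustration, not the actual ObjectStore implementation):

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.locks.ReentrantLock;

// Toy "cold storage": committed images plus a per-object lock that GET takes and commit releases.
class ToyObjectStore {
    private final Map<String, String> images = new ConcurrentHashMap<>();
    private final Map<String, ReentrantLock> locks = new ConcurrentHashMap<>();

    void seed(String name, String image) { images.put(name, image); }

    // PEEK: never blocks, returns the last committed image (the real system hands back a copy).
    String peek(String name) {
        return images.get(name);
    }

    // GET: blocks until the per-object lock is free, then returns the current committed image.
    // The lock is held until the same transaction (here: thread) calls commit().
    String get(String name) {
        locks.computeIfAbsent(name, n -> new ReentrantLock()).lock();
        return images.get(name);
    }

    // Commit: install the new image and release the lock; this is the value future PEEKs see.
    void commit(String name, String newImage) {
        images.put(name, newImage);
        locks.get(name).unlock();
    }
}

public class GetLockDemo {
    public static void main(String[] args) throws InterruptedException {
        ToyObjectStore store = new ToyObjectStore();
        store.seed("A", "one");

        CountDownLatch lockTaken = new CountDownLatch(1);
        Thread writer = new Thread(() -> {
            String v = store.get("A");            // takes the GET lock
            lockTaken.countDown();
            try { Thread.sleep(200); } catch (InterruptedException ignored) {}
            store.commit("A", v + "+edit");       // commit releases the lock
        });
        writer.start();
        lockTaken.await();                        // writer now holds the GET lock

        System.out.println(store.peek("A"));      // "one": peeks never block, see the last commit
        System.out.println(store.get("A"));       // blocks until the writer commits, then "one+edit"
    }
}
```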

Does that make more sense?

Chat log with Jeff, because I still didn't get it fully :slight_smile:

HTH

Endolf

So the consistency is on a per-task basis – specifically, an SGS execution process cannot locally cache the value of a peek between tasks, because the consistency model says that a peek in any new task (after a possibly remote commit) must return the committed value. This is nice for the object programmer, but a drawback for someone attempting to implement a high-performance object store.

With the model as described, either each node cannot cache (slow), or each node needs to participate in a global two-phase commit procedure (quite complicated).

If the rule were changed so that a task may see an older version of the data unless it does a GET, then the implementation could be allowed to cache and use a non-synchronous cache-flush protocol, which would be higher performance and more robust (on the implementation side). The drawback would be that you could get an older peek value after a newer commit – but, really, if you’re running on a different node, synchronization isn’t guaranteed anyway.
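
Here's a sketch of the relaxed model I mean (hypothetical, and explicitly not what the SGS promises): each node keeps a local peek cache that is allowed to go stale, and only GET goes back to the authoritative store.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Sketch of the *relaxed* (hypothetical) model: PEEK may serve a stale node-local copy;
// only GET goes to the authoritative store. Not what SGS guarantees.
class RelaxedNodeCache {
    private final Map<String, String> authoritativeStore;            // the shared, committed state
    private final Map<String, String> localCache = new ConcurrentHashMap<>();

    RelaxedNodeCache(Map<String, String> authoritativeStore) {
        this.authoritativeStore = authoritativeStore;
    }

    // PEEK: serve the cached copy if we have one, even if a newer commit exists elsewhere.
    String peek(String name) {
        return localCache.computeIfAbsent(name, authoritativeStore::get);
    }

    // GET: always fetch the latest committed value and refresh the local cache.
    // (Locking is omitted; this only illustrates the visibility rule.)
    String get(String name) {
        String latest = authoritativeStore.get(name);
        localCache.put(name, latest);
        return latest;
    }

    // A lazy, non-synchronous cache flush could run in the background instead of on every commit.
    void flush() { localCache.clear(); }
}

public class RelaxedModelDemo {
    public static void main(String[] args) {
        Map<String, String> store = new ConcurrentHashMap<>();
        store.put("A", "one");

        RelaxedNodeCache nodeX = new RelaxedNodeCache(store);
        System.out.println(nodeX.peek("A")); // "one", now cached on node X

        store.put("A", "two");               // another node commits "two"
        System.out.println(nodeX.peek("A")); // still "one": a stale peek is allowed in this model
        System.out.println(nodeX.get("A"));  // "two": GET always sees the latest commit
    }
}
```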

Yup. The goal is to make it easy and simple for the programmer. We are doing everything we can to avoid compromising that.

That's why this is a labs project; we are solving really hard problems.

Actually there are other solutions, having to do with aggregating and migrating users around the back end according to usage patterns.

Yes, it IS rocket science. That's why I have rocket scientists on the project. 8)

[quote]Actually there are other solutions, having to do with aggregating and migrating users around the back end according to usage patterns.
[/quote]
You can distribute ownership, but ownership needs to be in one place at a time. Thus, ownership can migrate to the “most likely” node to want to next GET the object.

However, distributed ownership still needs (the moral equivalent of) a transaction monitor to arbitrate ownership. No two ways about it – anything less and you have a chance of ownership schizophrenia. And you really don't want that! (Speaking as someone who solved this problem five years ago.)
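
Concretely, the single-owner invariant looks something like this toy ownership directory (purely illustrative, nothing to do with the actual SGS back end): claims and migrations are arbitrated atomically, so an object has exactly one owner at a time.

```java
import java.util.concurrent.ConcurrentHashMap;

// Toy ownership directory: each object is owned by exactly one node at a time,
// and transfers are arbitrated atomically so two nodes can never both "win".
class OwnershipDirectory {
    private final ConcurrentHashMap<String, String> ownerOf = new ConcurrentHashMap<>();

    // Claim an unowned object; returns true only for the single winner.
    boolean claim(String objectId, String nodeId) {
        return ownerOf.putIfAbsent(objectId, nodeId) == null;
    }

    // Migrate ownership toward the node most likely to GET the object next.
    // The compare-and-set succeeds only if fromNode really is the current owner.
    boolean migrate(String objectId, String fromNode, String toNode) {
        return ownerOf.replace(objectId, fromNode, toNode);
    }

    String currentOwner(String objectId) { return ownerOf.get(objectId); }
}

public class OwnershipDemo {
    public static void main(String[] args) {
        OwnershipDirectory dir = new OwnershipDirectory();
        System.out.println(dir.claim("GLO:blahblah", "nodeX"));            // true: nodeX owns it
        System.out.println(dir.claim("GLO:blahblah", "nodeY"));            // false: only one owner at a time
        System.out.println(dir.migrate("GLO:blahblah", "nodeX", "nodeY")); // true: arbitrated hand-off
        System.out.println(dir.currentOwner("GLO:blahblah"));              // nodeY
    }
}
```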

There are other solutions, and hybrid solutions.

Objects are not the only mobile thing in the system.

I really cannot say more because the guys in research are still finalizing details and we haven't even begun to work through the patent process yet. When we have it all up and running in the big back end and the right patent apps in place, I am sure we'll give some talks and publish a paper or two.

But don't fall into the trap of thinking that because you found a solution it is the only solution. That's pretty much never true in computer science.

[quote=“Jeff,post:12,topic:27624”]Yes, it IS rocket science. That's why I have rocket scientists on the project.
[/quote]
Yeah, but rocket scientists don't necessarily make good programmers. :slight_smile:

(I know of a NASA group that had a poor understanding of modern computers and computer science. It was clear that their skills in the CS area were lacking when my development team had to bail them out.)

But I must admit what you guys are doing with the SGS is exciting and I’m sure you’ve got some smart guys on the team or you wouldn’t be as far along as you are. I only wish that I had the time to start a project that used the SGS so I could get in on the excitement.