Why isn't DSM for unstructured memory done today?

Question

Edit: Comments suggested, that DSM just faded out by being not used recently. What were the reasons for this, what are DSMs drawbacks? Literature lists many positive aspects like easy to port programms to, no marshalling and therefore faster and so on but also negative aspects like only usable in homogenous environments due to endianess and word size issues. So why is all data synchronizing done by databases and not DSM anymore? Has there been a historic comparison or study at the time when both ways existed concurrently?

Old text:

Ages ago, Djikstra invented IPC through mutexes which then somehow led to shared memory (SHM) in multics (which afaik had the necessary mmap first). Then computer networks came up and DSM (distributed SHM) was invented for IPC between computers. So DSM is basically a not prestructured memory region (like a SHM) that magically get's synchronized between computers without the applications programmer taking action. Implementations include Treadmarks (inofficially dead now) and CRL.

But then someone thought this is not the right way to do it and invented Linda & tuplespaces. Current implementations include JavaSpaces and GigaSpaces. Here, you have to structure your data into tuples. Other ways to achieve similar effects may be the use of a relational database or a key-value-store like RIAK. Although someone might argue, I don't consider them as DSM since there is no coherent memory region where you can put data structures in as you like but have to structure your data which can be hard if it is continuous and administration like locking can not be done for hard coded parts (=tuples, ...).

Why is there no DSM implementation today or am I just unable to find one?

I think you've answered your own question - it's been tried; it doesn't play out nicely, so people don't use it. — , Dec 19 '12 at 14:20
i'd like to have reference for that, i couldn't find something where someone actually said that, thats just my own interpretation. — sinned, Dec 19 '12 at 14:22
many times you _won't_ see an explicit reference to that effect. `Negative declarations` are reasonably rare in the programming community. First, making the declaration doesn't provide much benefit. Second, techniques and technology keep changing so that something that _was_ impossible may now be probable. — , Dec 19 '12 at 14:24
uhm... ok. maybe someone did a comparison of different technics for DSM (at that time when there have been several) where I can see what their drawbacks were? I can only find mostly positive aspects of DSM like easy to port programms to, no marshalling and therefore faster but only usable in homogenous environments due to endianess and word size issues, but if everyone uses windows or linux? — sinned, Dec 19 '12 at 14:35
You would have better odds of getting your question answered if you edited it to reflect the newer questions posed in your last comment. — , Dec 19 '12 at 14:43
I am hearing this question as why is unstructured DSM not common, and I think the answer is that unstructured buffer-style programming is simply uncommon in this day and age as people prefer type systems, even if they're dynamic like JSON, people find type systems preferable to stream buffers for sharing data. I think that's really all there is to it. More details regarding Buffer-Oriented programming here: https://sites.google.com/site/steveyegge2/allocation-styles — Jimmy Hoffa, Jan 02 '13 at 16:12

score 4 · Answer 1 · edited Jan 06 '13 at 21:29

4

DSM (Distributed Shared Memory) is a topic from Computer Architecture rather than software development; however, I have the answer:

Summary

DSM has never stopped being used. There is an evidence with enough additional information to prove the use of DSM today and an active research in that area.

Numbers

Only for year 2012, there are 173 new unique papers at IEEE Explore describing various applications of DSM.

Additional Resources

Many other recent papers can be found in the ACM Digital Library.

Sample Papers and DSM Implementations

Software distributed shared memory and its various implementations can be found here, i.e. this paper describes a particular SW DSM implementation.

Another example of a DSM paper.

edited Jan 06 '13 at 21:29

Michael Borgwardt

51,037
13
124
176

answered Dec 31 '12 at 22:25

Thanks alot. Especially your last link seems to be exactly what I'm looking for. Currently, I'm thinking about spending the money or not... Maybe I can find a free similar paper or I can get access to this one through my university.... – sinned Jan 05 '13 at 09:08
@sinner Great! I'm glad it helped. Universities often have an institutional membership for IEEE. Find out if your university is the case. It's the "Athens/Shibboleth" option. Click this the link near Username, Password and enter your institution name. – Jan 05 '13 at 18:07

score 2 · Answer 2 · answered Mar 19 '13 at 02:27

If I understand your question, it's why don't developers use DSM to build applications, preferring other abstractions? I think it's because of complexity.

I really think shared memory, in general, is a deceptively simple abstraction. It seems easy to understand but the particulars make it complex. Other abstractions, like message-passing, are actually simpler to write and deploy. When I was getting my Master's, Linda was still in common use, but most people developing distributed systems were using a variation of message passing or RPC.

Even within a server, where shared memory is available, developers often use sockets (Unix sockets or loopback) to communicate between processes. Using message passing, developers don't have to worry about concurrent access, obtaining locks, what to do if a process holding a lock fails, etc. Taking message passing from a single machine to multiple machines is fairly straightforward. However, as you can see from the number of papers, it is not as clear cut with DSM.

At another level, clustering that involves shared resources has kind of fallen out of favor. For example, single-system-image clustering is no longer a hot topic. Most modern operating systems have taken a "shared-nothing and we're fine with that" approach. There are some clustering toolkits available as add-ons to Linux but it's not 'mainstream.' (A lot of clustering add ons are really failover clustering and not clustering in the OpenVMS sense).

DSM is not abandoned, dead, or buried. I'm not saying the problems around DSM are insurmountable, or it is too complex to use in every circumstance or that it will never be as convenient as message passing. Some products use DSM under the hood to manage their clustering. I think Oracle's WebLogic app server can use some region of shared memory to facilitate their clustering, as an example. At the very least, it looks like it's still fertile ground for research if you decide to go for a PhD.

score 0 · Answer 3 · answered Mar 18 '13 at 21:44

One merely needs to revisit the "Dining Philosophers" problem to understand the difficulties encountered with distributed shared resource access. How does one synchronize symmetrical access to a shared resource without a common clock? The reason why the client/server architecture is so popular in distributed systems is because the architecture introduces an asymmetrical node that provides synchronized access to one or more shared resources; thereby, avoiding race conditions.

Why isn't DSM for unstructured memory done today?

3 Answers3

Summary

Numbers

Additional Resources

Sample Papers and DSM Implementations