On the other hand, from Steve’s perspective, sooner or later, Margo will have to take what is a free-form edge document and check it into a central protected repository and live with controls. She can then go on to the next production phase, which might be a rev 2 derivative of that original work, or perhaps something completely different. RIEDEL You certainly have to be careful. You’re moving against the trend here, which is toward decentralization. Corporations are encouraging people to work on the beach and at home. KLEIMAN Nothing I’ve said is in conflict with that. Essentially, the distilled intellectual property has to come back to the corporation at some point. SELTZER Sometimes it’s the process that’s absolutely critical. Did I steal the code or write it myself? That information is encapsulated only on my laptop. Regardless of whether I check it into Steve’s repository, when Mary’s company sues me because I stole her software, what you really care about is the creation process that did or did not happen on my laptop. BREWER I don’t think that’s the day-to-day problem of a storage administrator. What we’re talking about is whether the first goal is to know which of the copies you don’t want to lose, which is a different problem than copies leaking out to others. KLEIMAN I do think that the legal system still counts. Technology can’t make that obsolete. You still have a legal obligation to a company. You still have an obligation not to break the law. No matter what technology we come up with, someone will probably find a way of circumventing it, and that will require the legal system to fill in the gaps. That’s absolutely true with all the stuff on laptops that we don’t know how to control right now. SELTZER I also think it’s more than just copies that we need to be concerned with; it’s also derivative works, to use the copyright term. It’s “Oh, look: File A was an input to File B, which was an input to File C, and now I have File D, and that might actually be tainted because I can see the full path of how it got there.” CREEGER Maybe what we’re seeing here is that we need to intuit more semantics about the bits we are storing. A file is not just a bunch of bits; it has a history and fits in a context, and to solve these kinds of problems, companies are going to have to put processes and procedures in place to define the context of the storage objects they want to retain.

more queue: www.acmqueue.com

BAKER You can clamp down to some extent, but it’s the hidden-channel problem, even through processes that are not malicious. Say I’m on the beach and the only thing I’ve got is a non-company PDA and I have some ideas or I talk to somebody and I record something. It can be very hard to bring all these different sources into a comprehensive storage management policy. Storage has gotten so cheap; it’s in everything around us. It’s very easy to store bits in lots of places that may be hard to incorporate as part of an integrated system.

KLEIMAN There’s not just one answer to these problems. Look at what happens in the virus-scanning world. It’s very much a belt-and-suspenders approach. They do it on laptops, on storage systems, in networks, and on gateways. It’s a hard problem, no doubt about it.

There are a variety of technologies for outsourcing markets, such as China and India, where people who are working on a particular piece of source code for a particular company are restricted from copying that source code in any way, shape, or form. The software disables that.

Similar things are possible for the information proliferation issues we have been talking about. All these types of solutions have pros and cons and depend on what cost you are willing to pay. This is not just a technological issue or a storage issue; it’s a policy issue that also includes management and legal issues. BREWER In some ways it’s a triumph of the storage industry that we have moved from the main concern being how to store stuff to trying to manage the semantics of what we’re storing. CREEGER Again, what should a storage manager be doing in the next 18 to 24 months? KLEIMAN Today people are saving a lot of time, money, and energy doing server virtualization and storage virtualization. Those two combined are very powerful, and I think that’s the next two, three, or four years right there. GANGER And the products are available now. Multiple people over the course of time have talked about snapshots. If you’re running a decent-size IT operation, you should make sure that your servers have the capability of doing snapshots. BREWER On the security side, encryption. Sometimes there are limited areas where you can do the right kind of key management and hierarchies, but encryption is an established way in the storage realm to begin to protect the data in a comprehensive way. SELTZER Backup, archival, and disaster recovery are all vital functions, but they’re different functions and you

ACM QUEUE November/December 2008 39

References:

http://www.acmqueue.com

Archives