OLTP working set known today. You don’t need hundreds of thousands of dollars of enterprise “who-ha” if a few thousand dollars will do it.
With companies like Teradata and Netezza you have to ask if doing all these things to reorganize the data for DSS (Decision Support Systems) is even necessary anymore?
mache cReeGeR: For the poor IT managers out in Des Moines struggling to get more out of their existing IT infrastructure, you’re saying that they should really look at existing vendors that supply flash caches?
steVe KLeiman: No. I actually think that flash caches are a temporary solution. If you think about the problem, caches are great with disks because there is a benefit to aggregation. If I have a lot of disks on the network, I can get a better level of performance than I could from my own single disk dedicated to me because I have more arms working for me.
With DRAM-based caches, I get a benefit to aggregation because DRAM is so expensive it’s hard to dedicate it to any single node. Neither of these is true of network-based flash caches. You can only get a fraction of performance of flash by sticking it out over the network. I think flash migrates to both sides, to the host and to the storage system. It doesn’t exist by itself in the network.
mache cReeGeR: Are there products or architectures that people can take advantage of?
steVe KLeiman: Sure. I think for the next few years, cache will be an important thing. It’s an easy way to do things. Put some SSDs (Solid State Disks) into some of the caching products, or arrays, that people have and it’s easy. There’ll be a lot of people consuming SSDs. I’m just talking about the long term.
mache cReeGeR: This increases performance overall, but what about the other issue: power consumption?
steVe KLeiman: I’m a power consumption skeptic. People do all these architectures to power things down, but the lowest-power disk is the one you don’t own. Better you should get things into their most compressed form. What we’ve seen is that if you can remove all the copies that are out in the storage system and make it only one instance, you can eliminate a lot of storage that
you would otherwise have to power.
When there are hundreds of copies of
the same set of executables, that’s a lot
of savings.
maRGo seLtzeR: You’re absolutely
right, getting rid of duplication helps
reduce power. But that’s not inconsis-
tent; it’s a different kind of power man-
agement. If you look at the cost of stor-
age it’s not just the initial cost, but also
the long-term cost, such as manage-
ment and power. Power is a huge frac-
tion, and de-duplication is one way to
cut that down. Any kind of lower-power
device, of which flash memory is one
example, is going to be increasingly
more attractive to people as power be-
comes increasingly more expensive.
steVe KLeiman: I agree. Flash can
handle a lot of the very expensive,
high-power workloads—the heavy
random I/Os. But I am working on the
assumption that disks still exist. On
a dollar-per-gigabyte basis, there’s at
least a 5-to- 1 ratio between flash and
disks, long term.
maRGo seLtzeR: If it costs five times
more to buy a flash disk than a spin-
ning disk, how long do I have to use a
flash disk before I’ve made up that 5X
cost in power savings over spinning
disk?
steVe KLeiman: It’s a fair point. Flash consumes very little power when you are not accessing it. Given the way electricity costs are rising, the cost of power and cooling over a five-year life for even a “fat” drive can approach the raw cost of the drive. That’s still not 5X. The disk folks are working on lower-power operating and idle modes that can cut the power by half or more without adding more than a few seconds latency to access. So that improves things to only 50% over the raw cost of the drive.
Look at tape-based migration systems. The penalty for making a bad decision is really bad, because you have to go find a tape, stick it in the drive, and wait a minute or two. Spinning up a disk or set of disks is almost the same since it can take longer than 30 seconds. Generally those tape systems were successful where it was expected behavior that the time to first data access might be a minute. Obviously, the classic example is backup and restore, and that’s where we see spin-down mostly used today.
If you want to apply these ideas to
general-purpose, so-called “ unstructured” data, where it’s difficult to let people know that accessing this particular data set might have a significant delay, it’s hard to get good results. By the time the required disks have all spun up, the person who tried to access an old project file or follow a search hit is on the phone to IT. With the lower-power operating modes, the time to first access is reasonable and the power savings is significant. By the way, much of the growth in data over the past few years has been in unstructured data.
eRiK RieDeL: That’s where the key solutions are going to come from. Look at what the EPA is doing with their recent proposals for Energy Star in the data center. They address a whole series of areas where you need to think about power. They have a section about the power management features you have in your device. The way that it’s likely to be written is you can get an Energy Star label if you do two of the following five things, choosing between things like de-duplication, thin provisioning, or spin-down.
But if you look at the core part of the spec, there’s a section where they’re focused on idle power. Idle power is where we have a big problem in storage. The CPU folks can idle the CPU. If there is nothing to do then it goes idle. The problem is storage systems still have to store the data and be responsive when a data request comes in. That means time-to-data and time-to-ready are important. In those cases people really do need to know about their data. The best idle power for storage systems is to turn the whole thing off, but that doesn’t give people access to their data.
We’ve never been really careful because we haven’t had to be. You could just keep spending the watts and throwing in more equipment. When you start asking “What data am I actually using and how am I using it?” you have to do prediction.
steVe KLeiman: My point is that there is so much low-hanging fruit with de-duplication, compression, and lower-power operating modes before you have to turn the disk off that we can spend the next four or five years just doing that and save much more energy than spinning it down will do.
eRiK RieDeL: We are going to have to
References:
Archives