mache cReeGeR: What can people who
have to manage storage for a living take
from this conversation? What recom-
mendations can we make? What tech-
nologies do you see on the horizon that
would help them?
steVe Kleiman: Storage administra-
tors today have tremendous problems
that are not adequately solved by any
tools. They have home directories,
databases, LUNs. It’s not just one set
of bits on one set of drives; they’re all
over the place. They’ve got replicas and
perhaps have to manage mirroring re-
lationships between them. They have
to manage a disaster recovery scenario
and the server infrastructure on the
other site if the whole thing fails. They
have all these mechanisms for all these
data sets that they must process day-in
and day-out and they have to monitor
the whole thing to see if it’s working
correctly. Just being able to manage
that mess, the thousands of data sets
they have to deal with, is a big problem
that isn’t solved yet.
mache cReeGeR: Nobody’s in the busi-
ness of providing enterprise-level stor-
age infrastructure management?
steVe Kleiman: The people who have
solved it best in the past have been the
backup people. They actually give you
a data transfer mechanism that manages everything in the background and
they give you a GUI that allows you to
say, “I want to look for this particular
data set, I want to see how many copies
of it I have, and I want to restore that
particular thing.” Or “I want to know
that these many copies have been made
across this much time.”
Of course, the problem is that it’s all
getting blown up. So now, it’s not just,
“What copies do I have on tape? What
copies do I have in various locations
spread around the world? What mirroring relationships do I have?” The
trouble is that today it’s all managed in
someone’s head. I call it “death by mirroring.” It’s hard. We’ll sort it all out
eventually.
KiRK mcKusicK: What do you see as a
possible solution?
steVe Kleiman: Currently people are
building outrageous ad hoc system
scripts—Perl scripts and other types.
My company is working on this as are
lots of other people in the storage industry, but it’s more than a single box
problem. It’s managing across boxes,
even managing heterogeneously. We
have to understand that we’re solving
the convergence of QoS, replication,
disaster recovery, archive, and backup.
What we need is a unified UI for han-
dling all these functions, each of which
used to be handled for different rea-
sons by different mechanisms.
eRic BReWeR: That is a core issue.
How many copies do you have and why
do you have them? Every copy is serving
some purpose, whether as a backup, or
a replication for read throughput, or a
cache copy in flash. Because they’re au-
tomatically distributed you can’t keep
track of all these things. I think you
actually can manage the file system—
broadly speaking, storage system,
whereby you proactively assign how
many copies you have of something.
maRGo seltzeR: Users make copies
outside the scope of the storage admin-
istrator all the time.
eRiK RieDel: Because the amount
of data and what it’s used for both increase constantly, you have to get the
machines to help the users tag content
with metadata—to help them know
what the data is, what the copy is for,
where it came from, why they have it,