Mac storage boldly enters the 1990s

By | August 22, 2011, 1:35pm PDT

Summary: Apple’s set the Wayback Machine to warp factor 9: OS X Lion now has a Logical Volume Manager - great technology in 1990. Today? Not so much.

What is an LVM?
PC users know that system disk drives show up as a C: drive or a Macintosh HD - or whatever you choose to call it - on their system. These are physical drives or physical volumes.

Logical Volume Managers virtualize these physical drives. The LVM takes the attached physical blocks on 1 or more disks and presents a virtual disk to the OS. The virtual disk walks like a disk and quacks like a disk, but the underlying storage is hidden from the OS.

So what’s hiding underneath? It can be a physical disk, a disk partition or a RAID stripe of multiple disks.

LVMs can be handy tools for storage administrators, but like any paradigm that relies on managing discrete units, LVMs don’t scale. Ask any sysadmin struggling to update an Excel spreadsheet of all the LVs on a SAN.

Enter the new millennium
Around 2001 architects saw the scale problem and asked: why pretend we have physical or virtual disks at all? Why not treat the underlying storage as a big pool of blocks?

That’s what architects of Google’s BigTable, Amazon’s Dynamo, Sun’s ZFS, Oracle’s BTRFS and Basho’s Bitcask, among others, all decided to do. Which is why LVMs are so last millennium.

These newer storage tools treat the underlying disks as a pool of blocks. Set a policy for replication - there’s no RAID in the common sense - and you can add drives as needed.

That’s how the 21st century does it. Too bad the Mac file system guys didn’t get the memo.

The Storage Bits take
For the average user with 1 or 2 disks managing the physical volumes isn’t a big deal. But the era of Big Data isn’t only in data centers: I have 16 external disks attached to my new iMac - plus several more that plug into a drive dock.

But the new LVM in Mac OS isn’t the biggest problem: the lack of data integrity in HFS+, the current Mac filesystem, is more worrisome. Video editors worry about not being able to transfer clips to the latest version of Final Cut Pro. How about losing your clips to silent data corruption?

If you’re wondering what the post-Steve Apple looks like, the continuing hacking of HFS+ is a sorry case in point. Steve doesn’t care about file systems - they’re plumbing - and thus the normal corporate sins of sloth, inertia and risk-avoidance kick in.

Apple was moving towards the modern ZFS 3 years ago - see Apple announces ZFS on Snow Leopard - but didn’t move fast enough to nail down a license before Sun went up for sale.

After the dust cleared Apple had no license for ZFS and discontinued the project. So the hacking of HFS+ has continued long past its expected demise.

Data integrity is one area where Microsoft has the chance to whack Mac OS. While NTFS isn’t much better than HFS+ - see How Microsoft puts your data at risk - the NTFS team has been hard at work to fix the biggest problems. I expect to see announcements from them by the end of the year.

Comments welcome, of course. Data integrity should be Job 1 for storage folks, just as computational correctness should be for CPU vendors. But in the PC world, “good enough” has been good enough.

Kick off your day with ZDNet's daily e-mail newsletter. It's the freshest tech news and opinion, served hot. Get it.

Topics

Robin Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small.

Disclosure

Robin Harris

Robin Harris is a president of TechnoQWAN, a consulting and analyst firm in northern Arizona. He also writes StorageMojo.com, a blog which accepts advertising from companies in the storage industry, and has a 25 year history with IT vendors. He has many industry contacts, many of whom are friends and all of whom he has opinions about. Robin has relationships with many companies in the technology industry. Every company he writes about may have sought to influence his opinion through carefully-crafted marketing messages and self-serving white papers, gifts ranging from desk calendars, t-shirts, lunches and trips as well as analyst or consulting assignments. He also invests in some technology companies. He may accept payment for services in stock as well. Robin discloses financial investments in or client relationships with companies named in Storage Bits. To help readers sort out the gold from the dross in his writings, Robin tries to communicate his reasons as clearly as he can. If you agree, you are intelligent and discerning. If you disagree, well, you disagree. In all cases, Robin encourages readers to subject everything they read, see or hear on the internet or from politicians to some simple questions: * What assumptions are implicit in the world view and judgments of the author? * What, if any, is the factual basis for the opinions the author expresses? * Is it reasonable, logical and clear? Your critical faculties: use ‘em or lose ‘em!

Biography

Robin Harris

Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small. He introduced a couple of multi-billion dollar storage products (DLT, the first Fibre Channel array) to market, as well as a many smaller ones. Earlier he spent 10 years marketing servers and networks. After leaving corporate life he founded TechnoQWAN, a consulting and analyst firm. He also developed StorageMojo into one of the top storage industry blogs.

Robin writes, consults, coaches and lives among the mountains of northern Arizona.

40
Comments

Join the conversation!

Just In

So you think it is fair that Robin critiques HFS for the same reason which
DeRSSS 25th Aug
@goff256: ... related to Ext4, but somehow Ext4 never gets critiqued?
0 Votes
+ -
LOL.

Once again, Apple playing catch up.
0 Votes
+ -
With whom?
Richard Flude 22nd Aug
LVM on a consumer desktop is following whom.

We've been using this technology on servers for sometime, and as robin points out it is useful for a number of tech savy end user but it's introduction to consumer OS isn't 1990s.

Data integrity is very important. It's always a trade off between demands of the market: ease of use, cost, performance, etc. apple missed their chance with ZFS, what's next?
0 Votes
+ -
@Droid101 Another troll... and look, it worked!
0 Votes
+ -
You're right Robin, but at the same time consider that Macs are very much "consumer" focused which isn't the usual purview of LVM deployments. While there are exceptions, no one is going to roll a massive fleet of Mac OS X servers were the need for an LVM critical. Yes, there's some exception somewhere but I'm talking about the mainstream. Not even Apple with its new massive data center in North Carolina is "Mac OS X only". Hell, Apple canned its Xserve line of servers. Yes you have a lot of drives but you aren't the average user of Mac OS X. Not to mention that a lay person likely doesn't have the inclination and wherewithal to bother. For the consumer I would suggest a Drobo.

-M
0 Votes
+ -
Contributr
@betelgeuse68 - I'm very interested in seeing how the 3rd party ZFS on Mac project develops. It could be better than Drobo since ZFS doesn't require a hardware controller.
@R Harris: is that because Linus ridiculously bashed HFS+ while his own file system, as Ken Hess mentioned, is dramatically behind in the most of key aspects.
0 Votes
+ -
No mention ox Ext4
Michael Alan Goff 23rd Aug
Because this is an article about LVM in OSX.

It has nothing to do with Linux.
0 Votes
+ -
RE: Mac storage boldly enters the 1990s
Michael Alan Goff 23rd Aug
Likely for a comparison of what a few other people do and why LVM isn't so good.
@goff256: ... mentioned, yet only HFS+ is critiqued.

Do you think this is coincidental?
0 Votes
+ -
RE: Mac storage boldly enters the 1990s
Michael Alan Goff 24th Aug
It isn't a coincidence, it's an article of criticism about that particular thing.
@goff256: ... related to Ext4, but somehow Ext4 never gets critiqued?
0 Votes
+ -
@betelgeuse68 " no one is going to roll a massive fleet of Mac OS X servers"

Yep, your 100% right on that one! LOL
0 Votes
+ -
Robin - you forgot Netapp (WAFL) file system - by far the first ones to virtualize pooled storage and commercially more successful (by far) than any of the names you drop happy
0 Votes
+ -
Contributr
@jsensarma Didn't NetApp buy Spinnaker? Why was that?
0 Votes
+ -
@jsensarma
I think you'll find that virtualized storage on physical media has been around since at least the 1960's - long before PC's or WAFL (patented in 1998). IBM's VM operating system was built around the concept of virtualized machines, virtualized peripherals and virtualized (logical) storage.
horse for years. Apparently, you don't understand mathematics and things like checksums and error correction logarithms.

Here's a fun little experiment. Take a kilobyte text document. Write a tool that reads and writes it randomly across your disk continuously. Let it run for 3 months 24x7. Get back to us on how many characters get garbled.
0 Votes
+ -
Data Corruption is Real and Scary
Jeff X Brown 23rd Aug
Today's 3TB consumer drives have a 1 in 4 chance of a non-recoverable bit error on a full drive. So if you put eight of those in a RAID array, you need dual parity (RAID6) just to compensate for the manufacturer's claimed error rate on the drive. Enterprise drives claim one tenth that error rate but that's not what most people are using for personal or small business RAID systems. Most Drobos are stuffed with Western Digital consumer drives. This is closer to the ragged edge of disaster than anybody realizes.
0 Votes
+ -
@Jeff X Brown Yes, and that's a conscious choice by the industry and consumers to choose storage density and capacity over reliability. If we didn't make disks at such high densities the error rate would go down to a point where RAID5 would be sufficient again.

Of course, there exist technologies such as RAID6 and ZFS which build in ever greater redundancies, and that's how the industry is tackling the issue. RAID5 is now essentially obsolete.
0 Votes
+ -
Contributr
@baggins_z
Oh. And you think HFS+ has checksums?
0 Votes
+ -
No problems with HFS ....
kundza 23rd Aug
I've been using RAID 5 & 6 for years with 7 Mac OS X Servers using AFP. Over 100TB's of PS, Final Cut, etc. files. Never one data corruption. Just lucky? I don't think so. Maybe it just works as it is suppose to if you configure it properly and use the right kinds of hard drives. I'll check out the ZFS project when it is released as I like the idea of it. Not because I have any corruption issues.
0 Votes
+ -
@kundza
I do not think you had no data corruption because of right kind hard drives or luck. You are genius man. Peace!
0 Votes
+ -
@kundza ...ditto.
0 Votes
+ -
RE: Mac storage boldly enters the 1990s
Tater_Tot Updated - 23rd Aug
Why should I worry about this data corruption bogeyman when these ultra-capacity, ultra-cheap hard disks keep dropping like flies. You're putting the cart farrrr in front of the horse. A SAN admin doesn't have time for volume-managing spreadsheets, he's too busy replacing failed disks.
ZFS is far superior and a good match for OS Xs underlying OS.
0 Votes
+ -
What to buy today?
pwatson 23rd Aug
If it is possible to do without a sales pitch, what would a serious consumer buy today that would improve their data storage?

What are the required specs that are in the "affordable" range?

Which brands offer controllers, subsystems, and disks that would meet these requirements?
0 Votes
+ -
Contributr
@pwatson
In general I prefer mirroring (RAID 1) disks to RAID 5/6 for consumers and myself. I just bought a couple of WD's My Book Studio II's that have 4 interfaces for future proofing, RAID 0 & 1, low power consumption and upgradeable drives. Get 'em with 3TB drives and it's a nice package.
0 Votes
+ -
RE: Mac storage boldly enters the 1990s
john_gillespie@... Updated - 23rd Aug
So Apple didn't get a ZFS license deal in time. So they should just copy it, license be damned, that is what MicroSoft would do. Then let the lawyers fight it out, but the time it winds through our 1800s style court system it will be meaningless as there will be something better.
0 Votes
+ -
It is like everything else of late from apple,the market machine grinds out more junk and people led like lemmings are led down a path of SO WHAT! Just like the way they handled the virus threats of late " tell the people we do not have any problem" and they will believe us. So just more CRAP!
0 Votes
+ -
RE: Mac storage boldly enters the 1990s
john_gillespie@... 23rd Aug
@bsmi021@...
Yeah ... I'm sticking to my HP tablet running Windows.
Maybe the reason everything seems to be about Apple is because they are producing something instead of running for cover.
This is yet another non-issue, blown out of proportion for the sake of having an article to write. Yes, it would have been nice if Apple was able to incorporate zfs as the new file system, but from what I've seen with using zfs with a FreeNAS server? Zfs has pretty big memory requirements to operate reliably. I'd say that a typical Mac would need to allocate a good 1GB of the total RAM just to handle the filesystem in a typical 1 or 2 hard drive configuration.

If you add a bunch of extra drives, those requirements increase with the amount of storage space ... so you might need 6GB or more of memory to handle a fault-tolerate zfs array of 6 or 7 hard drives. How realistic is THAT on a typical iMac?
0 Votes
+ -
This is a nice post, but if reffers to an article from 2007 and which is in his turn based on a thesis from 2005. So, how current is this data? Are we being misled by obsolete information?

As a sysadmin I follow storage with great interest and it looks like to me that Linux ext4 either solves or greatly aleviates those problems, while the soom to be released brtfs wold be a real solution and a better alternative to zfs. At the same time, it looks to me that Windows and MacOS had no real progress in this are. Am I right?

It wold be very nice if Robin Harris wrote an update on the state of data corruption on modern file systems and current / next OS releases.
0 Votes
+ -
Contributr
@fernando@... Good question. I haven't seen new papers on this topic, but there haven't been any major updates to any of the file systems in question, so I conclude that the paper's data is still relevant.
0 Votes
+ -
This blog is a little contradictory.

First we're criticizing Apple for their use of LVMs, but then "For the average user with 1 or 2 disks managing the physical volumes isn?t a big deal."

And then we criticize Apple for their continued hacking and muddling with HFS+ and note this is an opportunity for MS to beat Apple, but then "But in the PC world, ?good enough? has been good enough."

So you're calling out Apple on something that 99% of users don't care about, let alone have any clue about and basically has, and continues to provide an adequate solution?

This is a perfect example of every generation of the Apple bashing crowd. You had the MSCE's in the 90's joined by the Linux crowd in the nils, and now we have the Android joining the mix.

The techie Apple bashing crowd doesn't get the fact you represent about .001% of the population. You're like some gear head trying to sell a soccer mom an SUV, going on and on about cylinders, horsepower and drive trains when all she cares about is how it looks and if it will get her and the 3 kids to practice and back with enough comfort and amenities that she doesn't go crazy with the three of them fighting in the back seat.

The average user doesn't care about being open, LVMs, kernals, file systems, multitasking, etc. The average person cares about their computer/smartphone doing what they want it to do with the least amount of effort and stress as possible. Apple gets that where others don't. Apple will correct their storage and file system shortcomings soon enough, but aside from the techies, will anyone even notice? Apple's correct to focus their strategy on the cloud. As both mobile and residential bandwidth increase and the market continues it's shift towards mobility, the need to "have 16 external disks attached to my new iMac - plus several more that plug into a drive dock" becomes even further relegated to niche markets. Again, what's more important to the average user, having access to their data from anywhere they may be, or their OS's file system or approach to local storage?
0 Votes
+ -
Contributr
@piousmonk
Humans are very poor at estimating power functions - which is how storage capacities increase - so many more people will care in 5 years.

Plus there is no evidence that Apple is "correcting their storage and file system shortcomings soon enough" - in fact they've been going backwards since 2008.
0 Votes
+ -
@R Harris

There's little evidence of ANYTHING Apple is working on prior to release.

And let's not forget, with each new product release/discontinuation, Apple's focus on putting the burden of storage on the cloud becomes more and more evident. That feeds into the "just works" philosophy. Let the iCloud engineers worry about storage and file systems and just give the user a simple and painless interface to access their data (without chaining 16 external drives to their portable device).

I'll say it again, in your terms. Techies are very poor at estimating the average user's desire to give a rat's arse about all those things that techies spend their mental time and energy thinking about and debating. That desire is only going to decrease in 5 years, not increase.
0 Votes
+ -
@piousmonk
Well, the OS runs from the filesystem, which means that the FS still needs to be reliable. So, your argument about the user not caring doesn't mean it's not a problem -- a problem that, if not fixed, can eventually crash the OS. At that point, how will Apple's "just works" experience feel to the user? Even systems that don't store user data, like the locked-down Chromebook, still need a reliable FS for the OS to reliably function.
0 Votes
+ -
@Bit-Smacker

I don't disagree, but where's the evidence that it's unreliable and will crash the OS? And I'm not talking theoretical evidence, I'm talking real Apple users screaming in droves that they've suffered data loss or their Macs have become unstable? I haven't heard that, have you? So it would seem the only way it would drastically decline is if Apple's future HFS+ hacks degrade it.

From reading NewEggs reviews on hard drive, it seems that the failure rate of consumer hard drives is a much bigger risk to a user's data than the fact Apple uses HFS+.
0 Votes
+ -
Lion is Apple's Windows Vista
jeff.hochberg@... 24th Aug
Lion is a hot mess. Thankfully Apple only charges $29.99 for you to bend over and take it. My Macs were upgraded to Lion only to revert back to the solid Snow Leopard 10.6.8 release. I need stability, not to be a beta tester for Apple.

Colleagues at work are having mixed experiences. Some love it, some have such serious issues they can't even boot their computer.

Lion was rushed out the door and is a marginal improvement over Snow Leopard at best. Snow Leopard was in theory a marginal improvement over Leopard, but in reality it is probably the most solid OS release in Apple's history.

What happened with Lion? Is this what to expect in a post-Steve Jobs era?
0 Votes
+ -
I haven't had any filesystem problems in many years. We still have one legacy Windows 2000 Server that reliably chugs away on SP1, with no FS issues.

So, if you are saying that the current version of NTFS isn't much better than HFS+ (which has to be better than the version of NTFS that our old 2k server is running), I don't see this as being much of an issue.

In fact, I would guess that people would likely suffer worse from malware infestations, hard drive failures, or data corruption from improper system shutdowns before having an HFS+ bug-related problem.

The key to loss mitigation is to run a regular back-up of your data and make sure it can be reliably restored. But, few actually run back-ups (and even fewer test them), and they expect the FS to somehow fend off all unknown and unexpected data corruption events.

Join the conversation!

Formatting +
BB Codes - Note: HTML is not supported in forums
  • [b] Bold [/b]
  • [i] Italic [/i]
  • [u] Underline [/u]
  • [s] Strikethrough [/s]
  • [q] "Quote" [/q]
  • [ol][*] 1. Ordered List [/ol]
  • [ul][*] · Unordered List [/ul]
  • [pre] Preformat [/pre]
  • [quote] "Blockquote" [/quote]
ie8 fix

The best of ZDNet, delivered

ZDNet Newsletters

Get the best of ZDNet delivered straight to your inbox

Facebook Activity

White Papers, Webcasts, & Resources
ie8 fix