Home > File Systems, Linux, Microsoft, SCSI, Solaris > Opinion: On the Future of Data Storage and RAID Technologies

Opinion: On the Future of Data Storage and RAID Technologies

July 15th, 2009

Please note that this is only a personal opinion of mine as I have been observing the growth and various decline of storage concepts within the data storage industry. The views of the reader may differ from my own which is why I would invite you to please post your opinions as a comment to this post.

One of the most volatile and yet needed industries is the data storage industry. As computing technologies become more cloud centric and rely upon the web for business, productivity, education to even recreation, there is a constant push to increase capacities but even more so increase I/O throughput. As a result of recent demands, our approach with these technologies need to be re-evaluated. The primary focus of this article is on the future of data storage concepts and the limited life and functionality of RAID.

Back in 1987 when the idea of RAID was first conceived, the goal or vision was to be able to scale multiple drives into a single volume which was represented to a host as such while also offering a form of redundancy with a more sensitive magnetic platter-based disk technology. Flash forward to the present and we are still reliant upon the same technologies. Is that because RAID is so perfect or have we just grown too comfortable and are too afraid of change?

Hardware Vs. Software RAID 

There was a time when processing power was limited and it became advantageous to utilize external methods for creating and managing arrays of data storage, but as time progressed, this approach became increasingly insignificant. At least that is to say for the Small-to-Medium sized Business (SMB). For the last decade, a lot of efforts have been placed toward increasing the reliability, stability and enhanced features with the software-based RAID. This has slowly been eating away at the hardware vendors. Although it has been rarely noticeable.

These software implementations are integrated with methods of Logical Volume Management (with built in redundancy via RAID 1-6), Load Balancing/Multipathing capabilities, data encryption, along with the abilities to utilize incremental snapshot(s) over designated volumes. These software implementations include dynamic resizing, quota/permission management, enhanced copy-on-write file systems that perform very well along with routine checksums to correct noisy and silent data corruption; almost all of which can be managed while volumes are on-line. Some of these volume managers have the capability to export iSCSI & FCoE targets and can also be tuned to support FC targets.

To name a few you have ZFS (an all-in-one solution), Btrfs (still in development and under test), device-mapper / LVM2 / multipath-tools, mdadm, DRBD, etc. The list goes on. What is to stop an SMB from setting up an array of JBODS and (if more redundancy is needed) cluster a couple of Solaris / OpenSolaris or Linux servers to manage their software RAID while also exporting it via a file server or into a SAN? Note that Lustre support for ZFS is still in development. Realistically most entry-level modular external RAID solutions don’t run on the latest and greatest of hardware components (as they are intended for a limited purpose and not to provide other hosting services). You will most likely achieve much greater performance with the software approach while also utilizing a much more efficient virtual memory manager (for enhanced caching) alongside a finely tuned schedular.

On the enterprise end of computing you will find some very impressive storage solutions that are intended to take the workload of the enterprise environment. Such companies as Hitachi Data Systems (HDS) have been doing an excellent job with providing high-quality and well performing storage solutions that are also easily manageable. Other companies have resorted to being a little creative in order to gain some market share with the SMB and larger companies. Such notable companies are NetAppData Domain to even Cleversafe.

Earlier I found an interesting link differentiating the positives and negatives of both hardware and software RAID implementations. It should be noted that times have changed and some of the key points highlighted are no longer an issue. For instance, under the category /boot partition, this seems to no longer be an issue with at least ZFS.

Enter the SSD

In more recent years, the Flash-based Solid State Drive (SSD) has been entering into enterprise markets. This is a result from such notable providers as Sun Microsystems, etc. Currently the percentage in SSD usage in the enterprise is somewhat minimal as their is a limit in maximum capacities for the drives. This may soon change as in Q3 of 2009, PureSilicon will release their Nitro 1TB SSD drive. The throughput and performance speeds seem very optimal in arenas where greater speeds are needed, but the technology introduces additional handicaps (in the form of write operations and a limited cell life) which most environments and some manufacturers have a difficult time in accomodating to. To combat the limited cell life, vendors have implemented their own method of wear leveling, transparent to the host. With this concept, the same data cell, when accessed and written to multiple times will not get written to the exact location but instead, through an “intelligent” built in firmware the data will get written to another cell on the drive. To the operating system, it is still the same “sector” location. While there is very little latency in seeking performance (sequential and random), write operations take a huge hit, especially with smaller I/O transfer sizes, when typically the flash medium erase/rewrite a 128K page at a time.

SSD Tuning

With the recent hype of Flash-based SSDs, many vendors and UNIX/Linux distributions have been writing file systems tuned to perform extremely well on SSDs (and limit the impact of these handicaps). For example, Sun Microsystem’s ZFS (available on Solaris, OpenSolaris, MacOS X [read-only], FreeBSD and Linux [over FUSE]) had recently added tunable support for SSDs in their release versions for Solaris & OpenSolaris, while the development of Btrfs for Linux has done the same. In contrast the Microsoft developed NTFS does not offer such features or functionality. In fact the file system has remained somewhat unchanged over the course of the years and is just as inferior now as it was when it was first released as a replacement to the FAT series of file systems. I wrote an entire post explaining why the NTFS file system is not well suited for today’s methods of computing here.

In recent releases it should be noted that Microsoft’s Windows 7 has been tuned for SSDs that are to be provided on netbooks. What this means, I do not know? And by tuned, this is still unclear. You can read some of that information here. The only reason for the lack of changes in NTFS is to preserve backwards compatibility. This approach limits the ability to update a current existing server’s (if not running Windows 7) NTFS module if it needed to serve backend storage utilizing SSD media.

The Impact on RAID Technologies

As SSDs become more popular the advantages to using RAID are reduced, where the only benefits are gained from a simple stripe in a RAID 0 or mirroring to a backup array within a SAN or other form of network using RAID 01 (not to be confused with a RAID 10); just in case access to the first fails for whatever reason. This is where DRBD would come in real handy. As I briefly mentioned earlier, the whole concept of this form of redundancy was dependent upon the problematic nature of a magnetic disk device; where failures were imminent. And for those who are concerned with a method of error detection for both silent and noisy data corruptions, the majority of RAID implementations (both hardware and software) do not validate the data like the ZFS or Btrfs checksum implementation.

Changes in Protocol Layers?

With the popularity of SSD technologies growing and its costs reducing, the one drawback that is setting manufacturers and consumers back are the limitations offered by the protocols that they are working with. Today, Fibre Channel, SAS and SATA are not capable of handling full SSD speeds and serve only as a bottleneck to the technology. There have been recent attempts from vendors as Fusion-io to even PureSilicon to rely on other protocol interfaces such as PCI Express (PCI-E). Capable of handling up to 1 GB per second, it only seems natural for these vendors to move in that direction. I anticipate that shortly, others will follow. Fibre Channel and SAS may continue to serve the SAN (and with the appropriate load balancing mechanisms configured, it will perform well) but when it comes to the drive within the chassis, I expect to see more PCI Express in the near future. But who knows, with the recent drop in prices for 10Gb Ethernet or the supported high throughput offered from Infiniband, things may be moving toward another direction altogether.

In conclusion, I predict that in five years time we will start to see some huge and very interesting changes. I am looking forward to it.

Categories: File Systems, Linux, Microsoft, SCSI, Solaris Tags:
  1. July 17th, 2009 at 13:21 | #1

    Here is an update on btrfs.

  2. July 19th, 2009 at 01:43 | #2

    Good article. You may also look at GlusterFS (GNU GPLv3 License).

  3. July 19th, 2009 at 08:36 | #3

    Anand,

    Thank you for your post. GlusterFS is another great example of where the future of data storage is headed.

  4. July 25th, 2009 at 15:39 | #4

    Great short overview. The storage world IS changing — and the future promises dramatically better price / performance and the end to vendor lock-in.

    One of the underlying forces is Moore’s Law. Processors today are 800-900x more powerful than they were in the mid 90s when the leading legacy file systems and leading NAS and SAN solutions were designed. As a result the approaches taken then, such as a heavy reliance on hardware RAID for data integrity, no longer make sense today. Today you can do BETTER, with true end to end data integrity checks, for example via file systems such as ZFS that leverage today and tomorrow’s multicore processors.

    Regarding Gluster, we agree that storage users want to save time and are tired of working around the limitations of legacy systems. Nobody has time to deal with 16TB maximum volume sizes, for example, when data is growing by 65% per year ON AVERAGE, meaning we estimate that at least 25% of users by volume are seeing their data storage at least double every year. However, I think we all know hosting companies that have gone down the clustered file system path only to find themselves with fewer points of failure and a higher administrative burden. Higher risk + more ownership costs =/ happy customers :)

    This is where ZFS based solutions like NexentaStor come in. Thanks to years spent developing software written around ZFS to improve ease of use we at Nexenta are seeing tremendous uptake by customers who like the complete lack of file size, volume size, etc., limitations of ZFS and yet also understand that clustered file systems are generally not ready for prime time.

  5. July 25th, 2009 at 16:01 | #5

    Evan,

    I am grateful and honored that you found interest in my material. I thank you very much for your comments. I agree with your response 100%. Until the recent hype of the past few years, there has been very little change in the approach to data management, specifically the file system/volume manager. Fortunately enough, more recent projects such as ZFS are slowly changing that, as others start following this trend. I am excited to see what else the future may bring.

    Petros

  6. Stephen Goncalves
    August 11th, 2009 at 12:11 | #6

    Good read. However one point I would explore under the SSD realm is how data is written to the disks. The thing to consider is SAS/Fiber vs SATA. I know the world loves SATA. No sure why except for cost. SATA, and even SAS 1.0 does not have the RAS features that Fiber has. ( SAS 2.0 fixes that ) SATA does not have anything like the RAS features that are part of SAS 2.0 and Fiber. So the same question applies to SSD. There is only one company in the world that verifies what was written to the disk. Please correct me if I am wrong. ( be polite we are all professionals ) Hence IBM, EMC, and HP? are all using the same company STEC. There are some better technologies that are coming out in the next 2 years that are going to be exciting. When SSD’s are 10K dollars hard to see them in SMB. For SMB the sales and use has always been simplicity. If you make it easy to use and recover data. SMB will buy. Otherwise complexity will scare the market away. Also there will have to be some kind of acceptance rate. If a small company loses it’s data it will not survive in this economy.

  7. August 11th, 2009 at 13:40 | #7

    Stephen,

    Thank you for your comments. I am glad you brought up the topic of RAS. RAS is a very interesting set of technologies with a wide array of features. Back when I was a developer at Xyratex, we were beginning to implement RAS functionality in some of the product line, initially for diagnostic and basic management purposes. And this technology truly applies to enterprise class computing.

    While not directly defined in the protocol’s specification, RAS can be implemented in a storage array consisting of a SATA backend and a Fibre/SAS 2.0 frontend configured in a SAN or DAS. It would be up to the array’s controller in how it handles its implementation of RAS. On top of that and to emphasize availability, these SATA drives are configured with a dual channel MUX (it is defined as a single channel point-to-point protocol), so that it can run in a clustered (dual controller) environment.

    Also, you are correct that the SMB arena desires something simple and affordable. In most cases, high performance is not all that necessary as opposed to redundancy (as you pointed out); which is why prices need to really drop and capacities need to really increase before we start seeing SSDs in here. I guess we will see soon enough where the SSD will take us. Thank you again for your comments.

  8. Bernard Bonnell
    September 11th, 2009 at 12:43 | #8

    I have been dealing with these issues daily as I work with very small businesses (http://www.datasentinel.com/privacy/technical.php

Comments are closed.