As business users increasingly turn to high-performance hardware, IBM is adding features to its high-performance file systems to help push supercomputing more into the mainstream.
IBM plans to release a new version of its General Parallel File System (GPFS) on Friday that offers improved file management capabilities. The file system can search across multiple systems, up to 1,000 nodes in parallel, said Scott Handy, vice president of marketing and strategy for IBM Power Systems.
Handy said that in a test, IBM scanned 1 billion files using GPFS to show off its capabilities to customers in fields such as financial services and retail who deal with massive amounts of unstructured files. The scan was completed in just over two and a half hours; Handy said IBM is now working to shorten that to one hour.
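To put those figures in perspective, the short calculation below converts them into an approximate scan rate. It is a back-of-the-envelope sketch, not an IBM benchmark: the function name is made up for illustration, and the 2.5-hour figure rounds down "just over two and a half hours," so the real rate would have been slightly lower.

```python
def files_per_second(total_files: int, hours: float) -> float:
    """Average scan rate implied by a file count and an elapsed time in hours."""
    return total_files / (hours * 3600)


TOTAL_FILES = 1_000_000_000  # 1 billion files, as reported

# Assumption: treat "just over two and a half hours" as exactly 2.5 hours.
current_rate = files_per_second(TOTAL_FILES, 2.5)
# IBM's stated goal of finishing the same scan in one hour.
target_rate = files_per_second(TOTAL_FILES, 1.0)

print(f"reported test: about {current_rate:,.0f} files per second")
print(f"one-hour goal: about {target_rate:,.0f} files per second")
print(f"speedup needed: {target_rate / current_rate:.1f}x")
```

Under that assumption, the test works out to roughly 111,000 files per second, and the one-hour goal would require roughly 278,000 files per second, about 2.5 times faster.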
The update to GPFS, now at Version 3.2, includes policy-based file management that will allow a user to tell the system how to store and search files. For instance, this upgrade will allow a user to stipulate that files saved in a certain format are to be stored on a particular kind of disk.
What that will mean, Handy said, is that users can take a tiered approach to how they distribute data. A user can write a policy telling the system to store certain kinds of data on its fastest and most expensive disk, with other types of data going to lower-cost systems where performance isn't as critical.
That capability would allow users to save money because they could use lower-cost storage where it's appropriate, he said.
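The tiered approach Handy describes can be illustrated with a small sketch. This is not GPFS's policy syntax or any IBM interface; the rule names, file extensions and pool names below are all hypothetical, chosen only to show the idea of routing files to a class of storage based on their format.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass(frozen=True)
class PlacementRule:
    """One hypothetical policy rule: files with these suffixes go to this pool."""
    name: str
    suffixes: tuple
    pool: str


# Hypothetical policy: rules are checked in order, first match wins.
RULES = [
    PlacementRule("hot-databases", (".db", ".idx"), "fast-disk"),
    PlacementRule("media-archive", (".mp4", ".tif"), "low-cost-disk"),
]
DEFAULT_POOL = "low-cost-disk"  # anything unmatched goes to cheaper storage


def choose_pool(filename: str, rules=RULES, default: str = DEFAULT_POOL) -> str:
    """Return the storage pool a file would be placed in under the rules."""
    suffix = PurePosixPath(filename).suffix.lower()
    for rule in rules:
        if suffix in rule.suffixes:
            return rule.pool
    return default


for name in ("trades/2007.DB", "scans/receipt.tif", "notes.txt"):
    print(name, "->", choose_pool(name))
```

In this sketch, only the database and index files land on the fast, expensive pool; everything else falls through to lower-cost storage, which is where the savings Handy mentions would come from.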
The file system runs on IBM System p and System x hardware and supports AIX as well as some versions of Red Hat and SUSE Linux.