Big Data Application Framework Gets Update, SQL Interface

Building on last month's release of Apache Hadoop 2.2, big data application platform specialist Concurrent today released a new version of Cascading, its big data application framework.

Concurrent also announced the general availability of Cascading Lingual 1.0, an open source project that provides a comprehensive ANSI SQL interface.

Cascading is a stand-alone open source Java application framework designed as an alternative API to MapReduce. Cascading gives Java developers the capability to build big data applications on Hadoop using their existing skillset.

"I created Cascading in anger after having used MapReduce once in my life and vowing never to use it again," says Chris Wensel, creator of Cascading and founder and CTO of Concurrent.

The latest release, Cascading 2.5, adds support for Hadoop 2.2, including the new YARN architecture introduced in that version of Hadoop. Apache Hadoop YARN (Yet Another Resource Negotiator) serves as the Hadoop operating system, taking what was a single-use data platform for batch processing and evolving it into a multi-use platform that enables batch, interactive, online and stream processing.

YARN acts as the primary resource manager and mediator of access to data stored in Hadoop Distributed File System (HDFS), giving enterprises the capability to store data in a single place and then interact with it in multiple ways, simultaneously, with consistent levels of service.

Enterprises can now use Cascading to leverage Java, legacy SQL and predictive modeling investments for a single big data processing application.

Migration Path to Hadoop 2

Gary Nakamura, CEO of Concurrent, says that Cascading doesn't leverage YARN specifically, but does enable users to seamlessly migrate their applications to Hadoop 2 and take advantage of YARN. Domain-specific languages (DSLs) built on Cascading, like Scalding, Cascalog and PyCascading, also migrate seamlessly to Hadoop 2. Similarly, Cascading will support Apache Tez when it takes its place in the Hadoop stack.

Concurrent has also added performance improvements for complex join operations and optimizations to dynamically partition and store processed data more efficiently on HDFS.

In addition to Cascading, Concurrent announced the immediate availability of Cascading Lingual 1.0, intended to help enterprises that have already invested heavily in business intelligence (BI) tools like Pentaho, Jaspersoft and Cognos, and in the training to go with them, to quickly access their data on Hadoop. Lingual allows users to utilize their existing SQL skills and systems to create and run applications on Hadoop.

Concurrent's Wensel says Lingual empowers just about anyone familiar with SQL to instantly work with data stored on Hadoop using their JDBC-compliant BI or desktop tool of choice.

"Cascading is an important component to the big data application development ecosystem, and Lingual is another step forward in making it significantly easier to build big data apps," says Steve McPherson, group manager, Amazon Elastic MapReduce (EMR) at Amazon Web Services (AWS).

"Now, Amazon Elastic MapReduce customers can leverage Lingual to integrate disparate data stores on Amazon Web Services with services such as Amazon S3 and Amazon Redshift, and they can process the data and store it in Amazon EMR through one standard ANSI SQL statement," McPherson says. "This makes it easier for customers to query data with their favorite BI tool."

Thor Olavsrud covers IT Security, Big Data, Open Source, Microsoft Tools and Servers for CIO.com.


