Google is developing an open source architecture to improve its Search Appliance's ability to index data in content and document management systems and collaboration platforms.
The new open source framework will allow Google and others to create modules that link the Search Appliance natively with systems like EMC's Documentum, OpenText's LiveLink, IBM's Lotus Notes and Microsoft's Sharepoint.
Currently, Google partners have created custom-built connectors for these types of content systems using the Search Appliance's application programming interfaces (APIs). However, the new Content Connector Framework, as it is currently known inside Google, will simplify the creation of these modules and allow them to work more effectively than current ones thanks to this common connecting platform residing in the actual Search Appliance.
"We want to make that even easier to do, so that it's just a plug-and-play, much like we can do today with databases, web servers and file servers: plug it in, point it at it and we index it," says Matthew Glotzbach, product management director at Google's Enterprise unit.
As an open source framework, the Content Connector Framework will be available for anyone to examine, use and improve, he says. "We don't view the plumbing between one system and another as proprietary or as our strategic advantage," he says. "Our view is that if you build a good architectural infrastructure and you open it up, so that everyone can use it and contribute to it, maybe someone will find some optimisations and can contribute them back into the project."
This new framework will build on last year's introduction of the Onebox for Enterprise functionality, which enhanced the way the Search Appliance indexes data in business applications from vendors like SAP and Oracle.
IGoogle also plans to improve the scalability of the Search Appliance, which can now index a maximum of 30 million documents. Some customers with intensive enterprise search needs have several of these search appliances running side by side in parallel. The downside of this is that the customer doesn't get a single unified index across the devices. However, in a future release, Google will provide the ability to daisy-chain them so that together they act as a single system, he says.
Glotzbach declined to say when the open source framework and the scalability improvements will be delivered.