The Complete Magazine on Open Source

Cloudera expands open source in data science through new workbench

 

data science workbench with open source language support

Cloudera has developed a workbench for data science on its enterprise model. Called Cloudera Data Science Workbench, the new development enables a self-service data science platform at scale using open source technologies.

Cloudera Data Science Workbench is based on Sense.io, the data science startup that Cloudera acquired last year. The Workbench supports open source languages such as Python, R and Scala and works with Apache Spark and Apache Hadoop. This mix provides data scientists and IT professionals a native way to accelerate analytics projects right from their exploration stage to final production.

“With Data Science Workbench, Cloudera is helping IT groups and data scientists work together, bringing more users to shared environments in a way that delivers both flexibility and compliance,” said Charles Zedlewski, senior vice president of products, Cloudera.

Data scientists accessing the Cloudera Workbench can use any of their favorite libraries and frameworks from Web browser. The tool also lets them access the data stored in Hadoop clusters and collaborate on any certain project while working in teams. IT professionals, on the other hand, can easily set up Workbench for their data science teams and leverage the out of the box support for full Hadoop security. It can be deployed on-premise or in the cloud based on requirement.

Palo Alto, California-based Cloudera promises a secure environment for data science teams in addition to the combination of open source innovations along with customers trust. The Workbench is also touted to reduce time to market for AI applications.