Sponsored Links
-->

Tuesday, May 29, 2018

Google Launches Cloud Dataproc, A Managed Spark And Hadoop Big ...
src: beta.techcrunch.com

Google Cloud Dataproc (Cloud Dataproc) is a cloud-based managed Spark and Hadoop service offered on Google Cloud Platform. Cloud Dataproc utilizes many Google Cloud Platform technologies such as Google Compute Engine and Google Cloud Storage to offer fully managed clusters running popular data processing frameworks such as Apache Hadoop and Apache Spark.


Video Google Cloud Dataproc



Design

Cloud Dataproc is a Platform as a service (PaaS) product designed to combine the Spark and Hadoop frameworks with many common cloud computing patterns. Cloud Dataproc separates compute and storage, which is a relatively common design for many cloud Hadoop offerings. Cloud Dataproc utilizes Google Compute Engine virtual machines for compute and Google Cloud Storage for file storage. Cloud Dataproc has a set of control and integration mechanisms that coordinate the lifecycle, management, and coordination of clusters. Cloud Dataproc is integrated with the YARN application manager to make managing and using clusters easier.

Cloud Dataproc includes many open source packages used for data processing, including items from the Spark and Hadoop ecosystem, and open source tools to connect these frameworks with other Google Cloud Platform products.


Maps Google Cloud Dataproc



History

Cloud Dataproc was released as a publicly available beta service on September 23, 2015 and entered public general availability on February 22, 2016.


Create Hadoop Cluster on Google Cloud Platform | My Big Data World
src: weidongzhou.files.wordpress.com


Similar products

  • Amazon Elastic MapReduce (EMR)
  • Databricks
  • Microsoft HDInsight
  • Qubole

Google Cloud Tutorial - Hadoop | Spark Multinode Cluster ...
src: i.ytimg.com


See also

  • Apache Spark.
  • Apache Hadoop
  • Apache Hive
  • Apache Pig

Create Hadoop Cluster on Google Cloud Platform | My Big Data World
src: weidongzhou.files.wordpress.com


External links

  • Official website.
  • Cloud Dataproc release notes
  • Google Cloud Platform site
  • Cloud Dataproc questions on Stack Overflow
  • Cloud Dataproc discussion list

Create Hadoop Cluster on Google Cloud Platform | My Big Data World
src: weidongzhou.files.wordpress.com


References

Source of article : Wikipedia