2.1.x-*-arm images
support only the installed components and the following
optional components (the remaining 2.1 optional components and all
initialization actions are unsupported):
Apache Hive WebHCat
Docker
Zeppelin: Supported only by 2.1.77+-ubuntu20-arm images.
Zookeeper (installed in HA clusters; optional component in non-HA clusters)
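
The support rules above can be expressed as a small pre-flight check. The following is a minimal sketch, not part of any Dataproc tooling: the function name `unsupported_on_arm`, the uppercase component labels, and the assumption that image version strings look like `2.1.77-ubuntu20-arm` are illustrative choices made for this example.

```python
import re

# Optional components the text lists as supported on 2.1.x-*-arm images.
# The uppercase labels are names chosen for this sketch (an assumption).
ARM_OPTIONAL = {"HIVE_WEBHCAT", "DOCKER", "ZEPPELIN", "ZOOKEEPER"}

# Assumed version string shape: 2.1.<patch>-<os>-arm, e.g. 2.1.77-ubuntu20-arm.
_IMAGE_RE = re.compile(r"^2\.1\.(\d+)-([a-z0-9]+)-arm$")


def unsupported_on_arm(image_version, components):
    """Return the requested components that the rules above exclude."""
    match = _IMAGE_RE.match(image_version)
    if match is None:
        raise ValueError(f"not a 2.1.x-*-arm image version: {image_version!r}")
    patch, os_name = int(match.group(1)), match.group(2)

    rejected = []
    for name in components:
        key = name.upper()
        if key not in ARM_OPTIONAL:
            # Every other 2.1 optional component is unsupported on ARM.
            rejected.append(name)
        elif key == "ZEPPELIN" and not (os_name == "ubuntu20" and patch >= 77):
            # Zeppelin requires a 2.1.77 or later ubuntu20-arm image.
            rejected.append(name)
    return rejected
```

For example, `unsupported_on_arm("2.1.76-ubuntu20-arm", ["Zeppelin", "Docker"])` reports only Zeppelin, because that image predates 2.1.77. Initialization actions are not modeled here, since the text states that none are supported on ARM images.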
The HBase and Druid components are not supported in 2.1 and later image
versions.
The source code for image 2.1 libraries that are licensed under Reciprocal
and Restricted licenses is available at the
/usr/local/share/google/dataproc/third-party-sources path on
Dataproc cluster VMs.
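
As a minimal sketch of how to inspect that location from a cluster VM, the helper below lists whatever is stored under the path. It assumes the path is a directory, which the text does not state explicitly. The function name `list_third_party_sources` is hypothetical, and the function returns an empty list when the path is absent (for example, when run off-cluster).

```python
from pathlib import Path

# Path quoted in the text; treating it as a directory is an assumption.
SOURCES_DIR = Path("/usr/local/share/google/dataproc/third-party-sources")


def list_third_party_sources(root=SOURCES_DIR):
    """Return the sorted entry names under root, or [] if root is not a directory."""
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(entry.name for entry in root.iterdir())


if __name__ == "__main__":
    for name in list_third_party_sources():
        print(name)
```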
Last updated 2025-05-02 UTC.