bayudwiyansatria

Here are 4 public repositories matching this topic...

Apache Hadoop. Apache Hadoop is a collection of open-source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. Originally designed for co…

  • Updated Oct 7, 2021
  • Java

Apache Spark Libraries. Apache Spark's architectural foundation is the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines and maintained in a fault-tolerant way. The DataFrame API was released as an abstraction on top of the RDD, followed by the Dataset API. In Spark 1.x, the…

  • Updated Aug 3, 2020
  • Java
