CSE 512 : Distributed Database Systems
Updated Jun 23, 2022 - Python
Exploring Global Fishing Watch public data with SedonaDB & GeoParquet
Implemented spatial hotspot analysis on the NYC Yellow Cab taxi trip records using a Spark cluster set up on AWS EC2 instances. The aim was to analyse a large dataset with the distributed cluster-computing framework Apache Spark and its spatial extension, Apache Sedona.
Spatial join implemented as a MapReduce program on top of Apache Spark, using the Apache Sedona spatial extension.
The spatial table format for the spatial lakehouse
Notebook to accompany the "Hands-On With Havasu & GeoParquet" livestream
Dockerised PySpark Apache Sedona examples.
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.