ClickHouse® is a real-time analytics database management system
Updated Feb 23, 2025 - C++
A distributed, fast open-source graph database featuring horizontal scalability and high availability
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba
YTsaurus is a scalable and fault-tolerant open-source big data platform.
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
A @ClickHouse fork that supports high-performance vector search and full-text search.
🏅 State-of-the-art learned data structure that enables fast lookup, predecessor, range searches and updates in arrays of billions of items using orders of magnitude less space than traditional indexes
oneAPI Data Analytics Library (oneDAL)
Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE)
Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C99, C++17, Python 3, Java, GoLang 🗄️
High-performance Terrain and Hydrology Analysis
An open source, standard data file format for graph data storage and retrieval.
A distributed block-based data storage and compute engine