Apache Drill
Developer(s) | Apache Software Foundation |
---|---|
Stable release | 0.8.0 / March 31, 2015 |
Development status | Active |
Operating system | Cross-platform |
Website |
drill |
Apache Drill is an open-source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets. Drill is the open source version of Google's Dremel system which is available as an infrastructure service called Google BigQuery. One explicitly stated design goal is that Drill is able to scale to 10,000 servers or more and to be able to process petabytes of data and trillions of records in seconds. Drill is an Apache top-level project.[1]
See also
- Cloud computing
- Big data
- Data Intensive Computing
References
- ↑ "The Apache Software Foundation Announces Apache™ Drill™ as a Top-Level Project". Retrieved 2014-12-02.
Papers
Some papers influenced the birth and design. Here is a partial list:
- 2005 From Databases to Dataspaces: A New Abstraction for Information Management, the authors highlight the need for storage systems to accept all data formats and to provide APIs for data access that evolve based on the storage system’s understanding of the data.
- 2010 Dremel: Interactive Analysis of Web-Scale Datasets
External links
|