An open source big data framework from the Apache Software Foundation. Spark is used to analyze huge amounts of real-time data in RAM in contrast to Hadoop (another Apache project), which continuously writes to the storage drive. As a result, Spark is generally many times faster. Because Spark does not have its own distributed file system, it is often used in conjunction with the Hadoop Distributed File System. See
Hadoop and
big data.