Getting Started
The easiest way to learn the metalus library is to run an existing example. The following instructions will run the application example. This example will load an example order file, split each record into different DataFrames containing product, customer, credit card and order data. The data will be written to a Mongo data store (named application_examples) in different collections. The example requires metalus-common, metalus-mongo, metalus-examples step libraries and the self-contained metalus application jar.
Build Metalus
Below are the basic build commands required to build the Metalus libraries:
Spark Version | Scala Version | Command |
---|---|---|
2.4 | 2.11 | mvn clean install |
3.0 | 2.12 | mvn -P spark_3.0 clean install |
Mongo Setup
This example requires a local Mongo instance to be running. A free version can be downloaded here.
Spark Setup
Download the desired version of Spark and unpack. Metalus supports versions 2.4 or 3.0.
Start the spark server from the unpacked directory with the following commands:
```shell script
cd
Stopping Spark requires the following commands:
```shell script
cd <SPARK_INSTALL_DIR>
sbin/stop-slave.sh
sbin/stop-master.sh
The console will be available here: http://localhost:8080/
Run the application
Commands to run this application are available here