You are viewing the RapidMiner Go documentation for version 9.8.
RapidMiner Go
RapidMiner Go (AM) is a Spring Boot application.
Configuration
There are two levels of configuration for RapidMiner Go:
- Spring profiles
- environment variables
Using Spring profiles
As RapidMiner Go is a Spring Boot application, it can be configured using the SPRING_PROFILES_ACTIVE
environment variable in docker-compose-services.yml.
This is a comma-separated list of the active Spring profiles.
Available profiles:
Spring profile | Description |
---|---|
db-postgresql | RapidMiner Go uses PostgreSQL database |
on-prem | On-prem installation |
The default value is db-postgresql,on-prem.
This means that the default database is PostgreSQL.
The on-prem profile means that RapidMiner Go uses an on-prem license for the whole product.
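As a minimal sketch of how the profile list might be set in docker-compose-services.yml: the service name `rapidminer-go` below is a hypothetical placeholder (use the name from your own compose file), and the value shown is simply the documented default.

```yaml
services:
  rapidminer-go:   # hypothetical service name, not taken from the documentation
    environment:
      # Comma-separated list of active Spring profiles (documented default)
      SPRING_PROFILES_ACTIVE: "db-postgresql,on-prem"
```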
Using environment variables
On top of these profiles, you can also specify environment variables. These override the default values set by the profiles.
Default environment variables (the minimum required with the above profiles):
Environment variable name | Description |
---|---|
DB_HOST | docker service for DB |
BASE_URL | external base URL used on the frontend - e.g. https://localhost:30000 |
RMID_BASE_URL | internal docker service URL for RMID - e.g. http://rapidminer-automodel-rmid:8080 |
AMQ_URL | AMQ URL |
AMQ_USERNAME | AMQ username |
AMQ_PASSWORD | AMQ password |
AUTH_SECRET | authentication secret used between services, i.e. AM and RMID |
AUTOMODELER_FRONTEND_TRACKING_ENABLED | enable event tracking service (internal analytics) |
JOB_QUEUE | AMQ job queue name |
JOB_STATUS_QUEUE | AMQ status queue name |
JOB_COMMAND_TOPIC | AMQ topic name |
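The required variables above could be laid out in docker-compose-services.yml roughly as follows. This is only a sketch: the service name is hypothetical, the two URLs are the examples given in the table, and every value in angle brackets is a placeholder that must be replaced with the value from your own deployment.

```yaml
services:
  rapidminer-go:   # hypothetical service name, not taken from the documentation
    environment:
      DB_HOST: "<db-service-name>"            # placeholder: docker service for the DB
      BASE_URL: "https://localhost:30000"     # example value from the table above
      RMID_BASE_URL: "http://rapidminer-automodel-rmid:8080"   # example value from the table above
      AMQ_URL: "<amq-url>"                    # placeholder
      AMQ_USERNAME: "<amq-username>"          # placeholder
      AMQ_PASSWORD: "<amq-password>"          # placeholder
      AUTH_SECRET: "<shared-secret>"          # placeholder: must match the secret used by RMID
      AUTOMODELER_FRONTEND_TRACKING_ENABLED: "false"   # assumed boolean, quoted as a string
      JOB_QUEUE: "<job-queue-name>"           # placeholder
      JOB_STATUS_QUEUE: "<status-queue-name>" # placeholder
      JOB_COMMAND_TOPIC: "<command-topic-name>"   # placeholder
```

Quoting every value keeps the YAML parser from interpreting booleans or numbers before they reach the container.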
Further useful environment variables:
Environment variable name | Description |
---|---|
AUTOMODELER_FRONTEND_CLOUDAPPS_DROPBOXAPPID | Dropbox application id for importing files from cloud |
AUTOMODELER_FRONTEND_CLOUDAPPS_ONEDRIVEAPPID | Microsoft OneDrive application id for importing files from cloud |
AUTOMODELER_FRONTEND_GOOGLEANALYTICSID | Google Analytics id |
SPRING_SERVLET_MULTIPART_MAX_FILE_SIZE | Maximum data file size; must be given in bytes, up to 200MB |
AUTOMODELER_FILESTORE_MIN_ROW_LIMIT | Minimum number of data rows for more reliable predictions (default 50) |
AUTOMODELER_EXECUTION_TASK_LIMIT_PER_USER | The number of parallel tasks that can be submitted by a single user (default 4). A task can be an analysis or a prediction. A task consists of multiple jobs. |
AUTOMODELER_EXECUTION_QUEUE_LIMIT_PER_USER | The number of jobs per user that JobContainers will pick up concurrently. This number should be less than or equal to the number of JobContainer instances running simultaneously (default 1) |
AUTOMODELER_EXECUTION_MAXIMUM_TOTAL_RESULT_SIZE | Maximum size of the RapidMiner result set, given in MB |
AUTOMODELER_DATA_IMPORT_MINIMUM_ATTRIBUTE_COUNT | Minimum number of columns required for training |
AUTOMODELER_DATA_IMPORT_MAXIMUM_ATTRIBUTE_COUNT | Maximum number of columns accepted for training |
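For illustration, a few of these optional settings might be added to the same environment section as shown below. The service name is again hypothetical, the three limits are the documented defaults, and the file size assumes that 200MB means 200 * 1024 * 1024 bytes (the documentation does not say whether a binary or decimal megabyte is meant).

```yaml
services:
  rapidminer-go:   # hypothetical service name, not taken from the documentation
    environment:
      # 200 * 1024 * 1024 = 209715200 bytes (assumes binary megabytes)
      SPRING_SERVLET_MULTIPART_MAX_FILE_SIZE: "209715200"
      AUTOMODELER_FILESTORE_MIN_ROW_LIMIT: "50"          # documented default
      AUTOMODELER_EXECUTION_TASK_LIMIT_PER_USER: "4"     # documented default
      AUTOMODELER_EXECUTION_QUEUE_LIMIT_PER_USER: "1"    # documented default; keep at or below the JobContainer count
```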
Licensing
The application depends on the license file in the licenses/rapidminer-go-on-prem directory. If this file is not present, uploading data will fail. This folder is automatically mounted into the file system of every RapidMiner Go and Job Container instance, so there is no need to copy it manually.