Architecture
The Machine Learning model is broken up into a number of key components, derived from freely-available helper apps within Akumen. These components are:
- Delimited Data Connector: retrieves the data from the web and automatically decomposes it into an Akumen-consumable resultset;
- Customised PETL: takes the resultset and performs a few modifications (mostly as an example of how heavier transformations can be performed);
- Auto-SKLearn: uses the resultset with some configuration items to perform automated machine learning on the dataset and build a trained model; and
- SKLearn-Executor: loads the trained model and, given a correct input set, returns the answer.
Not every machine learning application needs these four components: some can be three. If you’re training a model to determine accuracy and not execute on an ongoing basis, you can use two.