Binary Classification
Home
Contact Support
Search…
Binary Classification
How to use?
Example
Configuration
Release Notes
Powered By
GitBook
Configuration
This section explains each of the properties in the configuration blade.
General
Property
Description
Collection
This associates the Agent to a specific Collection (default to that of the current Data Stream).
Refer to
Collections and Stream Hosts
to understand more about Collections.
Training
Property
Description
Learning Algorithm
The algorithm used to train the model.
Dataset
The training file used to train the algorithm.
Has Header
Tick to auto-populate the feature names from the first row of the training file, or the features are named column0, column1, etc.
Separator Character
The character used to separate data values in the training file.
Features (Name, Type, and Variable Type)
The grid of features is categorized by name, data type, and variable type.
Name: Feature name
Type: Data type of the feature
Variable Type: options are
feature
(default),
exclude
(ignore certain features from training), or
class variable
(the output).
Algorithm Parameters
The following list contains definitions of all parameters, but the combination of parameters used by each algorithm differs.
Property
Description
L1 Regularization
L1 penalty equal to the absolute value of the magnitude of coefficients.
L2 Regularization
L2 penalty which is equal to the square of the magnitude of coefficients.
Learning Rate
The amount of change in model in each iteration for better prediction.
Number of Interations
The number of times algorithms parameters are updated in a single run.
Number of Trees
The total number of trees generated on training data.
Number of Leaves
The number of leaves per tree in the model.
Minimum Example Count Per Leaf
The minimum number of samples required to be at a leaf node.
Model Options
Property
Description
Cross-Validation Folds
Indicates the number of times the fitting procedure is executed on the training data.
Deterministic Seed
Set this to a whole number for repeatable/deterministic results across multiple trainings.
Before configuring the Agent, please ensure that its input endpoint is connected to a parent Agent which will be sending data to it.
Input Mapping
Each parent Agent attribute is mapped against a feature in the training file so that test data matches the training data.
Refer to
Setup Input Mappings
for step-by-step instructions.
Endpoints
Name
Description
Input
This endpoint is used to receive data from the parent Agent.
Output
Events received from the parent Agent are assigned a class value and made available to this endpoint.
How to use? - Previous
Example
Next - How to use?
Release Notes
Last modified
5mo ago
Copy link
Outline
General
Training
Algorithm Parameters
Model Options
Input Mapping
Endpoints