Representation of a single cluster.
A model representing results of the K-Means clustering algorithm.
A model representing results of the K-Means clustering algorithm. Each cluster is a vector in a fixed dimensional space, and the transformer assigns the input row to its closest cluster using the Euclidean (L2) distance.
The clusters should all be of the same dimension, which should be equal to the number of input features.
The clusters of the model. These should have distinct names.
A seq of numeric feature descriptions describing the input to the model. * @param identifier Used to identify this model when in a collection of models. Should be simple characters, so it can be used in a feature name.
Representation of the classical linear regression model y = intercept + a_0 x_0 + a_1 x_1 + ...
Representation of the classical linear regression model y = intercept + a_0 x_0 + a_1 x_1 + ... + a_n x_n.
The length of the coefficients must match the length of the inputFeatures.
We use java.lang.Double for the type of the coefficients, because the scala Double type information is lost by scala/Gson and the deserialization fails badly for edge cases (e.g. Double.NaN).
Vector of coefficients of the model. Must match the length of inputFeatures.
Description of the (numeric) input features. Must match the length of coefficients.
Intercept of the model (defaults to 0).
Name used to identify the dependent feature in an evaluation dataset.
Used to identify this model when in a collection of models. Should be simple characters, so it can be used in a feature name.
Applies a Linear Regression model specified by the coefficients and intercept to a row of numeric data.
Applies a Linear Regression model specified by the coefficients and intercept to a row of numeric data.
Note that in the input row is wrapped in CastedDoubleSeq, so the input elements must be castable as java.lang.Number.
Represents a SingleLogisticRegression to be used as one of several in a MultiLogisticRegressionModel.
Represents a SingleLogisticRegression to be used as one of several in a MultiLogisticRegressionModel.
We use java.lang.Double for the type of the numeric values, because the scala Double type information is lost by scala/Gson and the deserialization fails badly for edge cases (e.g. Double.NaN).
The dependent value that the coefficients correspond to.
The coefficients for the single Logistic Regression model.
The constant term, that is added to the dot product of the feature coefficient vectors.
Representation of a single cluster.
We use java.lang.Double for the type of the numeric values, because the scala Double type information is lost by scala/Gson and the deserialization fails badly for edge cases (e.g. Double.NaN).
Name used to distinguish this cluster from others in the same model.
A vector representation of the cluster in orthogonal coordinates.