SUPPORT_VECTOR_MACHINE

The SUPPORT_VECTOR_MACHINE node is used to train a support vector machine model for classification tasks.It takes two dataframes of label and feature from labelled training data and a dataframe of unlabelled input data.Params:kernel : 'linear' | 'poly' | 'rbf' | 'sigmoid' | 'precomputed'Specifies the kernel type to be used in the algorithm. For detailed information about kernel types: https://scikit-learn.org/stable/modules/svm.html#kernel-functionsReturns:out : DataFrameThe predictions for the input data.

Python Code

from flojoy import flojoy, DataFrame, Matrix
import pandas as pd
import numpy as np
from sklearn import svm, preprocessing
from typing import Literal


@flojoy(deps={"scikit-learn": "1.2.2"})
def SUPPORT_VECTOR_MACHINE(
    train_feature: DataFrame | Matrix,
    train_label: DataFrame | Matrix,
    input_data: DataFrame | Matrix,
    kernel: Literal["linear", "poly", "rbf", "sigmoid", "precomputed"] = "linear",
) -> DataFrame:
    """The SUPPORT_VECTOR_MACHINE node is used to train a support vector machine model for classification tasks.

    It takes two dataframes of label and feature from labelled training data and a dataframe of unlabelled input data.

    Parameters
    ----------
    kernel : 'linear' | 'poly' | 'rbf' | 'sigmoid' | 'precomputed'
        Specifies the kernel type to be used in the algorithm.
        For detailed information about kernel types:
        https://scikit-learn.org/stable/modules/svm.html#kernel-functions

    Returns
    -------
    DataFrame
        The predictions for the input data.
    """

    le = preprocessing.LabelEncoder()

    if isinstance(train_feature, DataFrame):
        train = train_feature.m.to_numpy()
        col = train_label.m.to_numpy()
        target_name = train_label.m.columns.values[0]

    else:
        train = train_feature.m
        col = train_label.m
        target_name = "target"

    X = train
    Y = le.fit_transform(col)

    clf = svm.SVC(kernel=kernel)
    clf.fit(X, Y)

    if isinstance(input_data, DataFrame):
        input_arr = input_data.m.to_numpy()
    else:
        input_arr = input_data.m

    prediction = clf.predict(input_arr)
    prediction = le.inverse_transform(prediction)
    prediction = pd.DataFrame({target_name: prediction})
    return DataFrame(df=prediction)

Find this Flojoy Block on GitHub

Example

Having problem with this example app? Join our Discord community and we will help you out!

In this example, the SUPPORT_VECTOR_MACHINE is passed the iris dataset, split into two parts. The training data contains 120 labels examples, while the input dataset contains 30 samples with the labels removed.

This data is read from disk with two READ_CSV nodes, and the output predictions made by the classifier are visualised in a TABLE.