ml_grid.util.synthetic_data_generator

Generates synthetic time-series data for testing purposes.

Attributes

columns

Functions

generate_time_series(→ pandas.DataFrame)

Generates a synthetic time-series DataFrame.

Module Contents

ml_grid.util.synthetic_data_generator.columns
ml_grid.util.synthetic_data_generator.generate_time_series(num_clients: int, num_rows_per_client: int) → pandas.DataFrame

Generates a synthetic time-series DataFrame.

This function creates a DataFrame covering multiple clients, each with a sequence of rows ordered in time. Feature values are random and the target is binary; the two are linked by a simple association in which feature values are biased upwards for one of the two target classes.

Parameters:
  • num_clients (int) – The number of unique clients to generate.

  • num_rows_per_client (int) – The number of time-series rows for each client.

Returns:

A sorted DataFrame containing the synthetic time-series data.

Return type:

pandas.DataFrame
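
Example:

The behaviour described above can be illustrated with a minimal, standard-library sketch. This is not the ml_grid implementation: the helper name sketch_time_series, the column names (client_id, timestamp, feature_0 ..., target), the num_features, bias and seed parameters, and the choice of target == 1 as the upward-biased class are all assumptions made for illustration. It returns a list of row dictionaries rather than a DataFrame; passing that list to pandas.DataFrame would give a comparable table.

```python
import random
from datetime import datetime, timedelta


def sketch_time_series(num_clients, num_rows_per_client,
                       num_features=3, bias=0.5, seed=0):
    """Hypothetical stand-in for generate_time_series (illustration only)."""
    rng = random.Random(seed)          # seeded so the sketch is reproducible
    start = datetime(2024, 1, 1)       # arbitrary start date (assumption)
    rows = []
    for client in range(num_clients):
        for step in range(num_rows_per_client):
            target = rng.randint(0, 1)  # binary target
            row = {
                "client_id": client,
                "timestamp": start + timedelta(days=step),
                "target": target,
            }
            for i in range(num_features):
                # Assumption: features are shifted upwards by `bias`
                # when target == 1, giving a simple feature/target link.
                row[f"feature_{i}"] = rng.random() + bias * target
            rows.append(row)
    # Sort by client, then by time within each client.
    rows.sort(key=lambda r: (r["client_id"], r["timestamp"]))
    return rows


rows = sketch_time_series(num_clients=4, num_rows_per_client=50)
print(len(rows))  # one row per client per time step: 4 * 50
```

With the real function, the equivalent call would be generate_time_series(num_clients=4, num_rows_per_client=50), which by the same reasoning should yield num_clients × num_rows_per_client rows, assuming one row is emitted per client per time step.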