ExpDataFrameGroupBy Class API

Overview

ExpDataFrameGroupBy is the pd-explain class for grouped data. It extends the functionality of the pd-explain ExpDataFrame class so that group-by operations applied to a DataFrame can be explained along with their results. The methods below are the per-group aggregations it exposes.

Methods

count()

Compute count of each group, excluding missing values.

Returns:

Count of values within each group.
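
The counting behavior can be sketched with a plain pandas group-by. This is a minimal illustration that assumes ExpDataFrameGroupBy.count() returns the same values as the pandas GroupBy method of the same name; the DataFrame and its column names are made up for the example.

```python
import pandas as pd

# Hypothetical data: group "a" has one missing score.
df = pd.DataFrame({
    "team": ["a", "a", "b", "b", "b"],
    "score": [1.0, None, 3.0, 4.0, 5.0],
})

# count() tallies the non-missing values in each group.
counts = df.groupby("team").count()
print(counts)
# Group "a" has 1 valid score, group "b" has 3.
```

Note that the missing value in group "a" is not counted, so the result reflects valid observations rather than group size.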

mean(numeric_only=lib.no_default, engine='cython', engine_kwargs=None)

Compute mean of groups, excluding missing values.

Parameters:
  • numeric_only (bool | lib.NoDefault) – Optional. Include only float, int, boolean columns.

  • engine (str) – Optional. Engine for computation.

  • engine_kwargs (dict[str, bool] | None) – Optional. Engine-specific keyword arguments.

Returns:

Mean value for each group.
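
As a rough illustration of numeric_only and of how missing values are skipped, the sketch below uses a plain pandas group-by. It assumes ExpDataFrameGroupBy.mean() follows the pandas GroupBy semantics; the data and column names are hypothetical.

```python
import pandas as pd

# Hypothetical data with one numeric and one text column.
df = pd.DataFrame({
    "team": ["a", "a", "b"],
    "score": [1.0, None, 3.0],
    "label": ["x", "y", "z"],
})

# numeric_only=True restricts the result to numeric columns,
# so the text column "label" is left out.
means = df.groupby("team").mean(numeric_only=True)
print(means)
# The missing score in group "a" is excluded, so its mean is 1.0.
```

Passing numeric_only=True explicitly avoids depending on the default, which has differed between pandas versions.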

median(numeric_only=lib.no_default)

Compute median of groups, excluding missing values.

Parameters:
  • numeric_only (bool | lib.NoDefault) – Optional. Include only float, int, boolean columns.

Returns:

Median of values within each group.
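
A small sketch of the per-group median, again using plain pandas and assuming ExpDataFrameGroupBy.median() matches the pandas GroupBy result. The values are invented for the example.

```python
import pandas as pd

# Hypothetical data: group "a" has an outlier, group "b" has a missing value.
df = pd.DataFrame({
    "team": ["a", "a", "a", "b", "b", "b"],
    "score": [1.0, 2.0, 100.0, 5.0, None, 7.0],
})

medians = df.groupby("team").median(numeric_only=True)
print(medians)
# Group "a": middle of (1, 2, 100) is 2.0.
# Group "b": the missing value is skipped, leaving (5, 7) with median 6.0.
```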

sum(numeric_only=lib.no_default, min_count=0, engine=None, engine_kwargs=None)

Compute sum of group values.

Parameters:
  • numeric_only (bool | lib.NoDefault) – Optional. Include only float, int, boolean columns.

  • min_count (int) – Optional. The required number of valid values to perform the operation. If fewer than min_count non-NA values are present, the result is NA.

  • engine (str | None) – Optional. Engine for computation.

  • engine_kwargs (dict[str, bool] | None) – Optional. Engine-specific keyword arguments.

Returns:

Computed sum of values within each group.
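
The effect of min_count is easiest to see on a group with no valid values. The following sketch uses plain pandas and assumes ExpDataFrameGroupBy.sum() behaves like the pandas GroupBy method; the data is hypothetical.

```python
import pandas as pd

# Hypothetical data: every score in group "b" is missing.
df = pd.DataFrame({
    "team": ["a", "a", "b"],
    "score": [1.0, 2.0, None],
})
grouped = df.groupby("team")

# Default min_count=0: an all-missing group sums to 0.0.
default_sum = grouped.sum()

# min_count=1: at least one valid value is required, otherwise the result is NaN.
strict_sum = grouped.sum(min_count=1)

print(default_sum)
print(strict_sum)
```

Setting min_count=1 is a common way to tell an empty group apart from a group whose values genuinely add up to zero.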

min(numeric_only=False, min_count=-1)

Compute min of group values.

Parameters:
  • numeric_only (bool) – Optional. Include only float, int, boolean columns.

  • min_count (int) – Optional. The required number of valid values to perform the operation. If fewer than min_count non-NA values are present, the result is NA.

Returns:

Computed min of values within each group.

max(numeric_only=False, min_count=-1)

Compute max of group values.

Parameters:
  • numeric_only (bool) – Optional. Include only float, int, boolean columns.

  • min_count (int) – Optional. The required number of valid values to perform the operation. If fewer than min_count non-NA values are present, the result is NA.

Returns:

Computed max of values within each group.
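
The min and max aggregations, including the min_count threshold, can be sketched as follows. The example uses plain pandas and assumes the ExpDataFrameGroupBy methods return the same values as their pandas GroupBy counterparts; the data is invented.

```python
import pandas as pd

# Hypothetical data: group "b" has only one valid score.
df = pd.DataFrame({
    "team": ["a", "a", "b", "b"],
    "score": [1.0, 4.0, 7.0, None],
})
grouped = df.groupby("team")

lowest = grouped.min()    # smallest valid value per group
highest = grouped.max()   # largest valid value per group

# Require at least two valid values; group "b" has one, so it becomes NaN.
lowest_strict = grouped.min(min_count=2)

print(lowest)
print(highest)
print(lowest_strict)
```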