Skip to content

How to make transformers work properly on multindex dataframes with object/category dtypes #5943

Answered by fkiraly
tiloye asked this question in Q&A
Discussion options

You must be logged in to vote

an mtype is a specification for input format, e.g., pd.DataFrame with pd.MultiIndex where the last index is an integer or time index, and no columns are object type.

See the datatypes tutorial for more info.

Thanks for pointing out that this is missing in the glossary, I will add it.

From your output, it seems that indeed the problem is that you have object dtypes (dtypes are column types in pandas), which is not permitted. We are currently working on extending support for categorical types, see here: #5886

There is also a longer design discussion and project towards ensuring categorical types can be dealt with throughout the pipeline, @yarnabrina is also heavily involved. We are looking …

Replies: 3 comments 7 replies

Comment options

You must be logged in to vote
3 replies
@tiloye
Comment options

@fkiraly
Comment options

@fkiraly
Comment options

Comment options

You must be logged in to vote
4 replies
@tiloye
Comment options

@fkiraly
Comment options

Answer selected by tiloye
@tiloye
Comment options

@fkiraly
Comment options

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
module:transformations transformations module: time series transformation, feature extraction, pre-/post-processing module:datatypes datatypes module: data containers, checkers & converters
2 participants