You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Being able to use numpy arrays in R dataframes would
Provide very nice ergonomics for managing nd-arrays with metadata (e.g., frames of a video stored as one ndarray, with miscellaneous columns of metadata for each frame)
Open the door to tracing + compiling operations in the context of a dataframe / dplyr (e.g., with jax, tensorflow, or similar).
This would need to an implementation of vctrs::vec_proxy() for numpy arrays that takes advantage of ALTREP that avoids materializing R atomic vectors where it doesn't make sense (e.g., if the numpy array is type int8, we don't want to materialize an int32 R atomic vector if we can avoid it).
(from Dewey Dunnington in slack)
Looking at nanoarrow_buffer may be helpful
Being able to use numpy arrays in R dataframes would
This would need to an implementation of
vctrs::vec_proxy()
for numpy arrays that takes advantage of ALTREP that avoids materializing R atomic vectors where it doesn't make sense (e.g., if the numpy array is type int8, we don't want to materialize an int32 R atomic vector if we can avoid it).(from Dewey Dunnington in slack)
Looking at nanoarrow_buffer may be helpful
https://arrow.apache.org/docs/format/CanonicalExtensions.html#fixed-shape-tensor
The text was updated successfully, but these errors were encountered: