-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Issues: thu-ml/tianshou
Clearer separation between the trainer and the algorithm and ...
#1034
opened Jan 24, 2024 by
maxhuettenrauch
Open
1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Python Bug: lambda function refers only one environment
#1155
opened May 28, 2024 by
maguro27
5 of 9 tasks
How to save the log which axis is each epoch not epoch's steps?
enhancement
Feature that is not a new algorithm or an algorithm enhancement
#1154
opened May 28, 2024 by
hems111318136
9 tasks
Support dict observation spaces in highlevel api
enhancement
Feature that is not a new algorithm or an algorithm enhancement
refactoring
No change to functionality
#1152
opened May 24, 2024 by
MischaPanch
Poetry update the torch versioned from cuda (2.0.1+cu118) to cpu (2.1.1) defaultly on Windows
build/test
#1145
opened May 11, 2024 by
coolermzb3
6 of 9 tasks
Document effects of the relations between buffer size, num workers and episode length
documentation
#1143
opened May 10, 2024 by
MischaPanch
How can I make action sampling within the range specified by my environment when using onpolicy_trainer?
question
Further information is requested
#1142
opened May 9, 2024 by
lidaken
Extend benchmark with mujoco v4 envs
documentation
experiment-eval
Issues about evaluation: plots, stats, multiprocessing etc.
#1140
opened May 6, 2024 by
MischaPanch
Does Tianshou truly supports MARL out of the box?
MARL
Temporary label to group all things MARL
question
Further information is requested
#1137
opened May 5, 2024 by
Legendorik
4 of 9 tasks
Use Altair inside a notebook to display benchmark results
documentation
good first issue
Good for newcomers
#1136
opened May 4, 2024 by
MischaPanch
how to run RL using multi-nodes in cluster
documentation
question
Further information is requested
#1133
opened May 2, 2024 by
HYB777
Adjust locations of setting the policy in train/eval mode
bug
Something isn't working
refactoring
No change to functionality
#1122
opened Apr 24, 2024 by
maxhuettenrauch
Provide a devcontainer, base GH actions off it
build/test
minor
Requires small changes to be fixed
#1118
opened Apr 17, 2024 by
MischaPanch
Should we use the new schedule-free optimizer?
optimization
Performance optimization (throughout, memory, processing speed)
#1115
opened Apr 15, 2024 by
MischaPanch
Should we use torch.compile?
optimization
Performance optimization (throughout, memory, processing speed)
#1114
opened Apr 15, 2024 by
MischaPanch
Revisit "warm-up" phase in examples
algorithm enhancement
Not quite a new algorithm, but an enhancement to algo. functionality
#1112
opened Apr 14, 2024 by
MischaPanch
Use Atari-5 for future benchmarking of discrete RL
build/test
discussion
Discussion of a typical issue
#1110
opened Apr 12, 2024 by
nuance1979
4 of 9 tasks
Batch: remove Changes in public interfaces. Includes small changes or changes in keys
good first issue
Good for newcomers
minor
Requires small changes to be fixed
refactoring
No change to functionality
is_empty
breaking changes
#1108
opened Apr 12, 2024 by
MischaPanch
Don't pass envpool envs where vectorenvs are needed
bug
Something isn't working
good first issue
Good for newcomers
refactoring
No change to functionality
#1096
opened Apr 3, 2024 by
MischaPanch
Re-examine the whole state story for RNNs
refactoring
No change to functionality
RNN
Temporary label to group all things RNN
tentative
Up to discussion, may be dismissed
#1095
opened Apr 3, 2024 by
MischaPanch
Re-examine the need of utils.net.common.DataParallelNet
refactoring
No change to functionality
tentative
Up to discussion, may be dismissed
#1094
opened Apr 3, 2024 by
MischaPanch
Reduce duplication between examples/atari/atari_network and examples/vizdoom/network
good first issue
Good for newcomers
refactoring
No change to functionality
#1092
opened Apr 3, 2024 by
MischaPanch
Better interfaces and names for Actor, Critic, Net and other classes
refactoring
No change to functionality
#1091
opened Apr 3, 2024 by
MischaPanch
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.