Skip to content

Pull requests: apple/axlearn

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Gradient Accumulation in Axlearn
#465 opened May 13, 2024 by apoorvtintin Loading…
Te flash attention
#464 opened May 13, 2024 by kelvin-zou Draft
Perf optimize with unroll=8
#461 opened May 13, 2024 by kelvin-zou Loading…
add more ctc-loss related summaries
#457 opened May 12, 2024 by yqwangustc Loading…
Simplify cli subprocess handling.
#451 opened May 8, 2024 by markblee Loading…
Add new model config for smaller tests
#450 opened May 8, 2024 by jesus-orozco Loading…
chore(axlearn): Fix typos
#420 opened Apr 24, 2024 by tony Loading…
Dataflow changes
#384 opened Mar 25, 2024 by jiya-zhang Draft
Print step time for each step
#361 opened Mar 9, 2024 by samos123 Loading…
expose max_decode_len and eos_token_id in decoding
#328 opened Feb 17, 2024 by gyin94 Loading…
Ulimit try
#260 opened Dec 20, 2023 by wwu137 Draft
Wwang48 rm
#229 opened Dec 6, 2023 by iamweiwang Draft
Update lora's param_partition_spec.
#224 opened Dec 5, 2023 by JianyuWangV Loading…
Create CONTRIBUTING.md
#149 opened Oct 28, 2023 by 0Armaan025 Loading…
Update README.md
#145 opened Oct 25, 2023 by CrypticRevenger Loading…
ProTip! What’s not been updated in a month: updated:<2024-04-14.