fix(opd): score teacher logprobs at rollout temperature, not 0#2085
Open
EazyReal wants to merge 1 commit into
Open
fix(opd): score teacher logprobs at rollout temperature, not 0#2085EazyReal wants to merge 1 commit into
EazyReal wants to merge 1 commit into
background
wait
wait-all
cancel
parallel
Loading