Skip to content

fix(opd): score teacher logprobs at rollout temperature, not 0#2085

Open
EazyReal wants to merge 1 commit into
THUDM:mainfrom
EazyReal:opd-teacher-temperature
Open

fix(opd): score teacher logprobs at rollout temperature, not 0#2085
EazyReal wants to merge 1 commit into
THUDM:mainfrom
EazyReal:opd-teacher-temperature

Commits

Commits on Jun 15, 2026