The way to calculate the log_probs #18

@laurenlong

Hi,
Since the model's output at each token position is the distribution over the next token, shouldn't the log_probs calculation be shifted by one position?
I mean "diff_logits[range(diff_logits.shape[0]-1), continue_ids[1:]].sum().item()"
instead of "log_probs = diff_logits[range(diff_logits.shape[0]), continue_ids].sum().item()".
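
To make the alignment question concrete, here is a minimal sketch using NumPy as a stand-in for the tensors in the snippet above. Everything in it is an assumption for illustration: the toy vocabulary, the names `continuation_log_prob`, `prefix_len` and `input_ids`, and the hand-built `log_probs` array in which row t puts probability 0.9 on the true token at position t + 1. How `diff_logits` is sliced in the repository is not shown in this issue, so the sketch only shows which pairing of rows and token ids is consistent with next-token prediction.

```python
import numpy as np

def continuation_log_prob(log_probs, prefix_len, input_ids):
    """Sum log P(token_t | tokens before t) over the continuation tokens.

    log_probs: (T, V) array; row t is the distribution over the token
    at position t + 1 (the usual causal-LM convention).
    """
    continue_ids = input_ids[prefix_len:]
    # Row t predicts token t + 1, so continuation tokens at positions
    # prefix_len .. T-1 are scored by rows prefix_len-1 .. T-2.
    rows = log_probs[prefix_len - 1 : -1]
    return rows[np.arange(rows.shape[0]), continue_ids].sum()

# Toy data (hypothetical): 5-token vocabulary, 2 prefix tokens, 3 continuation tokens.
vocab = 5
input_ids = np.array([0, 3, 1, 4, 2])
prefix_len = 2
T = len(input_ids)

# Row t gives 0.9 to the true next token and 0.025 to each other token.
probs = np.full((T, vocab), 0.1 / (vocab - 1))
for t in range(T - 1):
    probs[t, input_ids[t + 1]] = 0.9
probs[T - 1] = 1.0 / vocab  # last row predicts past the sequence end; made uniform here
log_probs = np.log(probs)

continue_ids = input_ids[prefix_len:]

# Aligned: rows start at the last prefix position.
aligned = continuation_log_prob(log_probs, prefix_len, input_ids)

# Misaligned: rows start at the first continuation position,
# so each row is paired with the token it was conditioned on.
rows_from_continuation = log_probs[prefix_len:]
n = rows_from_continuation.shape[0]
misaligned = rows_from_continuation[np.arange(n), continue_ids].sum()

# The shift proposed in this issue, applied to the same rows: correctly
# paired, but the first continuation token is left unscored.
shifted = rows_from_continuation[np.arange(n - 1), continue_ids[1:]].sum()

print(aligned, misaligned, shifted)
```

In this toy setup, `aligned` scores all three continuation tokens against the rows that predict them, `misaligned` pairs each row with the wrong token, and `shifted` matches rows and tokens correctly but covers only two of the three tokens. So whether the proposed change is needed depends on where the rows of `diff_logits` start: if they already begin at the last prefix position, the original indexing is aligned; if they begin at the first continuation position, the shift is needed and the first continuation token would also need the last prefix row.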
