INDEX
    Explanations

    the word "latest" followed by a number

    references to recent updates or news

    Negative Logits
    Token       Logit
    "erved"     -0.79
    "avery"     -0.78
    "alties"    -0.73
    "velt"      -0.73
    "ivas"      -0.69
    "utenant"   -0.69
    " Rahman"   -0.69
    "hovah"     -0.68
    "bara"      -0.66
    "dinand"    -0.65
    Positive Logits
    Token             Logit
    " incarnation"    1.06
    " iteration"      1.05
    " edition"        1.04
    " installment"    1.04
    " update"         0.91
    " editions"       0.85
    " episode"        0.82
    " latest"         0.81
    " developments"   0.81
    " batch"          0.80
    Activation Density: 0.018%

    No Known Activations