INDEX
    Explanations

    references to confidence and self-assurance

    New Auto-Interp
    Negative Logits
    ozem
    -0.17
    ardon
    -0.17
    ward
    -0.15
    ewise
    -0.14
     WaitForSeconds
    -0.14
    //{{
    -0.14
    .Middle
    -0.14
    .Msg
    -0.14
    gren
    -0.14
    WARD
    -0.14
    POSITIVE LOGITS
    /conf
    0.19
     Vak
    0.17
    uchs
    0.16
    otto
    0.16
    nest
    0.15
     confidence
    0.15
    vale
    0.15
    icy
    0.15
    y
    0.15
    forth
    0.15
    Act Density 0.012%

    No Known Activations