INDEX
    Explanations

    expressions related to personal beliefs and reflections on experiences

    New Auto-Interp
    Negative Logits
    Occasionally
    -0.75
     Slightly
    -0.72
     Occasionally
    -0.70
    SOME
    -0.69
     occasionally
    -0.69
    Slightly
    -0.68
    somewhere
    -0.68
    slightly
    -0.66
    Vidite
    -0.66
     tantôt
    -0.64
    POSITIVE LOGITS
     much
    1.92
     too
    1.51
    much
    1.50
    Much
    1.34
     Much
    1.28
     muito
    1.22
     many
    1.22
     TOO
    1.21
    too
    1.21
     allzu
    1.15
    Act Density 0.330%

    No Known Activations