INDEX
    Explanations

    the word "almost" in various contexts

    New Auto-Interp
    Negative Logits
    ear
    -0.17
    eer
    -0.16
    uled
    -0.16
    orde
    -0.15
     g
    -0.15
     ear
    -0.14
     Jamal
    -0.14
    d
    -0.14
    ninger
    -0.14
    e
    -0.14
    POSITIVE LOGITS
    lied
    0.15
    .infinity
    0.15
    estone
    0.15
    縮
    0.14
     prostituer
    0.14
    _axis
    0.14
    iglia
    0.14
    adio
    0.14
    exo
    0.14
    売
    0.14
    Act Density 0.016%

    No Known Activations