INDEX
    Explanations

    phrases indicating uncertainty or vagueness

    vague references to unspecified objects or concepts

    New Auto-Interp
    Negative Logits
    arest
    -0.64
    adapt
    -0.62
    raid
    -0.61
    ENDED
    -0.58
    fw
    -0.58
    selves
    -0.57
    én
    -0.57
    ruck
    -0.56
    schild
    -0.56
    pload
    -0.55
    POSITIVE LOGITS
     else
    1.22
     thereof
    1.10
     alike
    1.03
     similar
    0.99
     like
    0.86
     analogous
    0.86
     fancy
    0.86
    Else
    0.84
    abouts
    0.82
     nonsense
    0.81
    Act Density 0.084%

    No Known Activations