INDEX
    Explanations

    occurrences of the word "things" and its variations in the text

    New Auto-Interp
    Negative Logits
    のである
    -0.40
    mitos
    -0.40
    之际
    -0.38
     yanında
    -0.38
    енча
    -0.38
     nedenle
    -0.37
     CEM
    -0.37
    nically
    -0.36
     Entfernung
    -0.36
     visualisation
    -0.36
    POSITIVE LOGITS
    Things
    1.21
     Things
    1.18
     things
    1.13
     THINGS
    1.04
    things
    0.97
    THINGS
    0.94
     cosas
    0.93
     coisas
    0.83
     dingen
    0.80
     Dinge
    0.76
    Act Density 0.010%

    No Known Activations