Explanations
occurrences of the word "things" and its variations in the text
things change
Negative Logits
Token            Logit
のである         -0.40
mitos            -0.40
之际             -0.38
yanında          -0.38
енча             -0.38
nedenle          -0.37
CEM              -0.37
nically          -0.36
Entfernung       -0.36
visualisation    -0.36
Positive Logits
Token            Logit
Things           1.21
Things           1.18
things           1.13
THINGS           1.04
things           0.97
THINGS           0.94
cosas            0.93
coisas           0.83
dingen           0.80
Dinge            0.76
Activations Density: 0.010%