INDEX
Explanations
occurrences of the word "head" and its variants in different contexts
New Auto-Interp
Negative Logits
aber
-0.18
dorf
-0.17
chter
-0.16
antino
-0.16
hawk
-0.15
670
-0.14
AGON
-0.14
anh
-0.14
OKIE
-0.14
Interr
-0.14
POSITIVE LOGITS
strong
0.26
long
0.25
lining
0.24
liner
0.24
-spin
0.24
gear
0.23
sets
0.23
lined
0.23
bands
0.23
lamp
0.23
Activations Density 0.014%