Explanations
mentions of locations being someone's home
instances of the word "home" in various contexts
Negative Logits
sha            -0.67
uku            -0.66
haircut        -0.61
stride         -0.59
bite           -0.57
entle          -0.57
perspective    -0.55
asm            -0.55
Stephenson     -0.54
ussian         -0.54
Positive Logits
coming         0.91
quartered      0.84
liest          0.83
brew           0.81
chool          0.79
grown          0.79
opathy         0.78
lier           0.78
imately        0.77
oided          0.75
Activations Density: 0.040%