INDEX
Explanations
words related to a direction or place, specifically referring to the "back."
instances of the word "back" in various contexts
New Auto-Interp
Negative Logits
fuss
-0.65
Aires
-0.62
tein
-0.61
mble
-0.61
BUS
-0.60
faint
-0.59
Osc
-0.59
urious
-0.56
IFIED
-0.56
LINE
-0.54
POSITIVE LOGITS
door
1.17
back
1.11
wards
1.06
lash
1.06
dated
1.01
gam
0.94
tracking
0.92
)=(
0.91
doors
0.91
hoe
0.90
Activations Density 0.015%