INDEX
Explanations
specific references to something, possibly objects or actions
references to the word "those" in various contexts
New Auto-Interp
Negative Logits
ob
-0.77
ILY
-0.73
¨
-0.71
iness
-0.71
onis
-0.70
achus
-0.69
BW
-0.67
maker
-0.67
shapeshifter
-0.67
manship
-0.66
POSITIVE LOGITS
pesky
1.10
kinds
0.94
sorts
0.86
fateful
0.81
damned
0.74
sights
0.73
nifty
0.73
aforementioned
0.73
darn
0.70
same
0.69
Activations Density 0.066%