INDEX
Explanations
the word "total" with very strong activation
occurrences of the word "total" in various contexts
New Auto-Interp
Negative Logits
Lazarus
-0.74
heid
-0.69
fle
-0.68
Dwell
-0.68
rium
-0.67
yer
-0.65
pell
-0.65
pher
-0.64
resonate
-0.64
bub
-0.64
POSITIVE LOGITS
itarian
0.84
elimination
0.76
totals
0.75
total
0.73
idad
0.72
total
0.72
esse
0.70
TOTAL
0.69
izen
0.68
carnage
0.68
Activations Density 0.018%