INDEX
Explanations
specific mentions of individuals
mentions of the name "Walker."
Negative Logits
ritic     -0.83
Seym      -0.83
н         -0.76
rul       -0.75
Lumpur    -0.75
undai     -0.75
eon       -0.74
opal      -0.72
romeda    -0.72
sembly    -0.72
Positive Logits
Walker    0.86
Walker    0.82
inson     0.81
esque     0.79
stown     0.79
ball      0.76
mania     0.75
chairs    0.70
ville     0.69
man       0.69
Activations Density 0.010%