Explanations
entities associated with criminal acts
the word "who" in various contexts
Negative Logits
Token         Logit
³³³³          -0.80
Bound         -0.71
BACK          -0.68
Delicious     -0.67
Glob          -0.65
Processing    -0.65
Around        -0.65
Dos           -0.64
Lions         -0.64
Anything      -0.63
Positive Logits
Token         Logit
soever         1.11
oping          1.02
accompanies    0.99
preceded       0.96
oped           0.95
attended       0.90
oversaw        0.89
ever           0.89
attends        0.89
resided        0.88
Activations Density: 0.168%