INDEX
Explanations
descriptions of crime events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
906
+0.19
0.6%
946
+0.13
0.4%
1533
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
946
+0.19
0.07
736
+0.13
0.06
184
+0.11
0.03
Negative Logits
vogli
-0.83
ecru
-0.82
Facile
-0.80
bandung
-0.78
swarovski
-0.76
Février
-0.76
credere
-0.75
dimenti
-0.74
tupperware
-0.73
cushi
-0.72
POSITIVE LOGITS
suddenly
0.52
sudden
0.51
rushed
0.50
panic
0.47
yelling
0.46
rush
0.46
evacuated
0.46
immediately
0.46
seemed
0.46
ötzlich
0.46
Activations Density 0.455%