INDEX
Explanations
mentions of law enforcement and security measures in a community settings
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
946
+0.09
0.3%
2045
+0.09
0.3%
1336
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2045
+0.09
0.06
658
+0.09
0.06
1336
+0.09
0.05
Negative Logits
rhum
-0.56
veu
-0.55
transfé
-0.55
tarte
-0.55
parma
-0.54
doman
-0.54
igno
-0.54
gouver
-0.54
vache
-0.53
xenia
-0.52
POSITIVE LOGITS
stationed
0.63
tasked
0.61
trained
0.59
specializing
0.56
hired
0.54
felicity
0.53
fulness
0.53
dedicated
0.52
capable
0.51
bufio
0.50
Activations Density 0.510%