INDEX
Explanations
descriptions of violent events or encounters
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
906
+0.12
0.4%
946
+0.11
0.3%
690
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
946
+0.12
0.05
736
+0.11
0.05
724
+0.11
0.04
Negative Logits
Wicidata
-0.71
Chambres
-0.69
Visite
-0.68
himo
-0.64
Gennaio
-0.62
maniere
-0.61
Portail
-0.61
veste
-0.60
lampa
-0.60
Câ
-0.60
POSITIVE LOGITS
disreg
1.07
stratigraph
1.05
intersper
1.01
disagre
0.97
encomp
0.95
quitted
0.95
antem
0.94
tolerably
0.94
louder
0.92
depic
0.92
Activations Density 0.390%