INDEX
Explanations
phrases related to conflicts, political or social tensions, and communal interactions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.14
0.4%
680
+0.11
0.3%
1942
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
680
+0.14
0.06
776
+0.11
0.06
1811
+0.10
0.05
Negative Logits
IndentedString
-0.54
ઊ
-0.49
addContainerGap
-0.45
AnchorTagHelper
-0.44
PushMatrix
-0.43
PLWABN
-0.43
RectangleBorder
-0.43
AnchorStyles
-0.43
IBinder
-0.43
Κα
-0.42
POSITIVE LOGITS
spion
0.82
utop
0.82
glan
0.81
stoff
0.81
kompati
0.80
plak
0.80
kosme
0.79
minimalis
0.77
abnorm
0.76
kön
0.75
Activations Density 0.346%