INDEX
Explanations
mentions of historical events and social movements
Neuron Alignment
Index   Value   % of L₁
1510    +0.09   0.3%
1604    +0.09   0.3%
1350    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1604    +0.09      0.04
1892    +0.09      0.04
1510    +0.09      0.04
Negative Logits
jaya       -1.18
haup       -1.10
bandung    -1.10
Minang     -1.02
roul       -1.01
!...       -1.01
aen        -0.95
tanga      -0.95
!!</       -0.94
jati       -0.94
Positive Logits
others       0.72
other        0.66
his          0.64
its          0.64
their        0.61
him          0.59
outros       0.57
fellow       0.57
IsMutable    0.55
anyone       0.54
Activations Density: 0.213%