INDEX
Explanations
phrases related to public events and activism
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1013
+0.21
0.6%
678
+0.15
0.5%
453
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
678
+0.21
0.08
1013
+0.15
0.07
81
+0.09
0.02
Negative Logits
automne
-0.68
ControllerAdvice
-0.65
autunno
-0.61
urm
-0.61
tamaños
-0.60
iub
-0.55
--;
-0.53
appuntamento
-0.53
montón
-0.52
locu
-0.52
POSITIVE LOGITS
pamph
1.13
philosophic
1.09
Gorb
1.06
Simult
1.04
emphat
1.03
Keny
1.02
Kün
1.01
hcm
1.00
depic
0.97
philo
0.97
Activations Density 0.556%