Explanations
political discussions and perspectives
Neuron Alignment
Index   Value   % of L₁
1577    +0.11   0.4%
2034    +0.11   0.3%
814     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1265    +0.11      0.03
814     +0.11      0.02
699     +0.10      0.03
Negative Logits
occupe        -0.64
Consig        -0.64
Lma           -0.61
estime        -0.59
Buona         -0.59
reconnaît     -0.59
ouvre         -0.59
assiste       -0.58
recru         -0.55
déclare       -0.55
Positive Logits
AppColors     0.73
jaya          0.66
SUDOC         0.64
tsi           0.63
umo           0.61
siena         0.61
actéristique  0.61
actéris       0.60
strto         0.59
saba          0.59
Activations Density: 0.115%