Explanations
events related to political or historical conspiracies
Neuron Alignment
Index   Value   % of L₁
1276    +0.11   0.3%
1862    +0.09   0.2%
438     +0.08   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1276    +0.11      0.05
1677    +0.09      0.04
555     +0.08      0.03
Negative Logits
<bos>             -0.87
Paglinawan        -0.62
womit             -0.56
astéroïdes        -0.54
expandindo        -0.53
worauf            -0.50
ozof              -0.49
<>",              -0.49
TokenNameLPAREN   -0.47
WebVitals         -0.46
Positive Logits
alas              0.65
malheureusement   0.63
Pamph             0.62
réservé           0.60
unfortunately     0.57
désert            0.57
imprimée          0.57
sadly             0.57
préparé           0.56
imprimé           0.56
Activations Density: 0.245%