INDEX
Explanations
text related to political and social issues, focusing on themes like freedom, government, and education
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1577
+0.28
0.9%
1842
+0.18
0.6%
1343
+0.16
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1577
+0.28
0.10
184
+0.18
0.03
1842
+0.16
0.07
Negative Logits
shenan
-2.84
reluct
-2.71
disagre
-2.61
encomp
-2.54
depic
-2.48
intersper
-2.43
uninten
-2.41
unspeak
-2.41
impra
-2.39
apprehen
-2.38
POSITIVE LOGITS
Obrázky
0.95
RegressionTest
0.93
ostavi
0.91
SEDS
0.90
aarrggbb
0.88
Himo
0.87
AllMovie
0.87
رشف
0.86
CiNii
0.85
ISTAT
0.84
Activations Density 1.083%