INDEX
Explanations
^words related to financial or political controversy
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
990
+0.12
0.4%
347
+0.11
0.3%
1820
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
990
+0.12
0.07
347
+0.11
0.05
1820
+0.09
0.05
Negative Logits
vogliamo
-0.66
trovo
-0.62
abbiano
-0.62
picuous
-0.61
BIBSYS
-0.60
dovre
-0.58
avrebbero
-0.56
facciamo
-0.56
succede
-0.56
vogli
-0.55
POSITIVE LOGITS
unce
0.72
trefoil
0.62
of
0.60
squa
0.59
unden
0.58
frow
0.58
macrop
0.56
pite
0.56
friable
0.56
tolerably
0.55
Activations Density 0.369%