INDEX
Explanations
locations and political events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
528
+0.11
0.3%
1334
+0.10
0.3%
341
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
528
+0.11
0.05
341
+0.10
0.05
1334
+0.09
0.05
Negative Logits
akut
-0.69
kanton
-0.67
Temos
-0.65
antik
-0.64
Qualquer
-0.63
balkon
-0.62
Portanto
-0.61
minimalis
-0.61
kriminal
-0.61
tetten
-0.60
POSITIVE LOGITS
Souha
0.84
purtroppo
0.82
affatto
0.73
dovre
0.73
Messieurs
0.72
scattata
0.71
occorre
0.69
Și
0.69
altrett
0.68
poichè
0.68
Activations Density 0.173%