INDEX
Explanations
phrases related to government policies and decisions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1904
+0.09
0.2%
1443
+0.08
0.2%
1523
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
284
+0.09
0.07
509
+0.08
0.06
384
+0.07
0.05
Negative Logits
nicolas
-0.74
roberto
-0.74
ricardo
-0.74
purcha
-0.70
aen
-0.69
jorge
-0.69
eduardo
-0.69
alberto
-0.68
discogs
-0.67
sergio
-0.67
POSITIVE LOGITS
unless
0.75
until
0.74
unless
0.72
until
0.69
except
0.63
except
0.62
RTLR
0.60
逅
0.56
putAll
0.55
Until
0.54
Activations Density 0.445%