INDEX
Explanations
legal and political terms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1177
+0.12
0.4%
1919
+0.11
0.3%
1870
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.12
0.08
862
+0.11
0.05
802
+0.09
0.06
Negative Logits
paradigma
-0.69
colch
-0.68
psicologia
-0.66
rimb
-0.66
cristian
-0.65
masaj
-0.63
feltro
-0.62
balon
-0.61
rosas
-0.61
antropo
-0.61
POSITIVE LOGITS
reluct
1.72
disagre
1.64
unwarran
1.58
pamph
1.53
apprehen
1.52
inev
1.52
emphat
1.51
desir
1.50
depic
1.49
impra
1.46
Activations Density 0.394%