INDEX
Explanations
positive mentions or sentiments towards people, events, or actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
674
+0.09
0.3%
555
+0.08
0.2%
1445
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1677
+0.09
0.06
1507
+0.08
0.04
1276
+0.08
0.05
Negative Logits
Añ
-0.65
sappi
-0.65
dichi
-0.64
masaj
-0.62
surfact
-0.60
Áng
-0.60
khong
-0.59
apparti
-0.59
aquare
-0.58
utop
-0.57
POSITIVE LOGITS
unspeak
1.06
tolerably
1.02
gaily
0.97
intrigu
0.96
nobly
0.96
apprehen
0.95
vainly
0.92
endeavouring
0.92
reconno
0.91
plenti
0.90
Activations Density 0.238%