INDEX
Explanations
discussions about philosophical and political concepts like laws and beliefs
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
581
+0.08
0.2%
1978
+0.08
0.2%
1510
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
832
+0.08
0.05
862
+0.08
0.02
1919
+0.07
0.05
Negative Logits
ché
-0.68
idr
-0.60
buta
-0.60
ù
-0.59
mao
-0.59
zom
-0.59
alm
-0.58
piment
-0.58
º
-0.57
ria
-0.57
POSITIVE LOGITS
anymore
0.67
znál
0.66
ever
0.59
ogóle
0.57
jemals
0.56
fared
0.56
">...
0.52
whether
0.52
=[]
0.52
perchance
0.50
Activations Density 0.440%