INDEX
Explanations
specific names and terms related to movies, individuals, and concepts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1978
+0.12
0.3%
1150
+0.09
0.3%
394
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
324
+0.12
0.04
227
+0.09
0.07
764
+0.09
0.05
Negative Logits
autorytatywna
-0.93
脚注の使い方
-0.85
تضيفلها
-0.84
Autoritní
-0.79
exé
-0.79
IndentedString
-0.78
مرئيه
-0.77
étu
-0.76
quegli
-0.74
GEBURTSDATUM
-0.73
POSITIVE LOGITS
Grath
0.50
姆斯
0.50
variant
0.49
that
0.47
UpdatedBy
0.47
çünkü
0.47
existence
0.47
incarnation
0.47
iteration
0.46
wzór
0.46
Activations Density 0.501%