INDEX
Explanations
references to sports and athletes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
964
+0.18
0.7%
1842
+0.16
0.6%
394
+0.15
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
184
+0.18
0.02
1842
+0.16
0.06
227
+0.15
0.09
Negative Logits
<bos>
-0.97
appunt
-0.58
chequer
-0.58
appartamento
-0.58
podjela
-0.56
bezeichneter
-0.56
قایناقلار
-0.54
gynhyrchwyd
-0.54
ModelAdmin
-0.54
ristor
-0.54
POSITIVE LOGITS
McLaugh
0.65
unspeak
0.65
Juf
0.62
apprehen
0.61
Bengt
0.60
Henk
0.60
exorbit
0.59
HomeController
0.59
Dov
0.59
Traité
0.58
Activations Density 1.135%