INDEX
Explanations
baseball-related terms and statistics
Neuron Alignment
Index   Value   % of L₁
599     +0.17   0.5%
764     +0.13   0.4%
394     +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
227     +0.17      0.12
599     +0.13      0.09
1870    +0.12      0.05
Negative Logits
parsedMessage   -0.63
Apesar          -0.59
Fußball         -0.55
собенности      -0.54
Preço           -0.52
Manbalar        -0.52
kasarigan       -0.51
Ainda           -0.50
حوالہ           -0.49
<bos>           -0.49
Positive Logits
reluct      1.42
accla       1.39
shenan      1.36
increa      1.35
affor       1.31
madonna     1.29
peppa       1.29
indestru    1.28
depic       1.27
excru       1.27
Activations Density: 1.152%