INDEX
Explanations
terms related to mixed martial arts, wrestling, and video games
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
764
+0.15
0.5%
998
+0.13
0.4%
1438
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
764
+0.15
0.07
1438
+0.13
0.05
227
+0.11
0.07
Negative Logits
effe
-1.58
squa
-1.57
aen
-1.56
secon
-1.54
desir
-1.50
inev
-1.49
mef
-1.47
illi
-1.47
embodi
-1.46
wien
-1.46
POSITIVE LOGITS
<bos>
0.69
省市镇
0.62
Winaray
0.55
WebVitals
0.55
界
0.54
AutoScaleMode
0.54
today
0.53
expandindo
0.52
قایناقلار
0.52
autoreleasepool
0.51
Activations Density 0.564%