INDEX
Explanations
game-related terms and concepts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1557
+0.16
0.6%
200
+0.15
0.6%
506
+0.13
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1557
+0.16
0.02
200
+0.15
0.02
1387
+0.13
0.02
Negative Logits
Banjar
-0.69
pank
-0.65
kram
-0.63
kark
-0.61
banan
-0.59
makro
-0.59
sark
-0.58
Teks
-0.57
kasa
-0.56
kac
-0.56
POSITIVE LOGITS
Sim
1.35
Sim
1.31
sim
1.29
SIM
1.27
sim
1.18
SIM
1.17
sims
1.12
simulation
1.11
Simulation
1.10
sims
1.06
Activations Density 0.099%