INDEX
Explanations
video game attributes and stats, such as movement speed and mana regeneration
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.15
0.4%
856
+0.13
0.4%
1870
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
536
+0.15
0.03
764
+0.13
0.03
1343
+0.09
0.03
Negative Logits
TestBed
-0.66
Todavía
-0.64
Varios
-0.64
Obrigada
-0.63
Será
-0.61
Hvor
-0.61
Obrigado
-0.61
Possui
-0.60
Muito
-0.59
está
-0.59
POSITIVE LOGITS
?...
1.47
!...
1.43
fluo
1.41
impra
1.41
cabrio
1.39
intermitt
1.39
suscep
1.38
embodi
1.37
vespa
1.37
strick
1.37
Activations Density 0.101%