Explanations
descriptions related to video game controls and virtual reality experiences
Neuron Alignment
Index   Value   % of L₁
906     +0.12   0.3%
1842    +0.11   0.3%
1819    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1876    +0.12      0.03
2044    +0.11      0.06
946     +0.09      0.04
Negative Logits
reluct     -1.73
increa     -1.71
volunte    -1.68
encomp     -1.64
affor      -1.64
disagre    -1.63
accla      -1.61
embra      -1.60
suscep     -1.57
inev       -1.56
Positive Logits
<bos>            0.79
AddTagHelper     0.70
hyrchwyd         0.68
while            0.65
RTDA             0.63
simultaneously   0.63
monitor          0.63
watching         0.63
wijl             0.61
Personendaten    0.61
Activations Density: 0.458%