INDEX
Explanations
phrases related to sitting or being stationary
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1671
+0.11
0.4%
1323
+0.11
0.4%
1480
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1323
+0.11
0.03
1480
+0.11
0.03
1671
+0.10
0.04
Negative Logits
kase
-0.66
lemp
-0.65
maksi
-0.63
uhr
-0.62
kela
-0.61
kaos
-0.61
kasa
-0.60
osal
-0.59
Præ
-0.59
kram
-0.59
POSITIVE LOGITS
sit
1.28
sitting
1.17
sat
1.08
sits
1.07
sit
1.06
Sit
1.05
Sit
1.04
sitting
1.04
Sitting
0.96
SIT
0.95
Activations Density 0.096%