Explanations
activity related to a time-related process involving monitoring and physical actions
Neuron Alignment
Index   Value   % of L₁
1343    +0.14   0.4%
690     +0.09   0.3%
394     +0.08   0.2%
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
862     +0.14           0.04
714     +0.09           0.05
460     +0.08           0.04
Negative Logits
fuf         -1.68
?...        -1.57
guarante    -1.57
cannes      -1.56
increa      -1.54
purcha      -1.54
thut        -1.54
!...        -1.53
fta         -1.52
desir       -1.52
Positive Logits
turned      0.71
fully       0.66
removed     0.66
still       0.65
relenting   0.65
replaced    0.62
opened      0.62
Když        0.61
Cuidado     0.61
Pokud       0.60
Activations Density 0.510%