INDEX
Explanations
timestamps in a particular format
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.21
0.6%
1967
+0.11
0.3%
630
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
620
+0.21
0.04
1343
+0.11
0.04
2002
+0.10
0.04
Negative Logits
intersper
-0.71
gaily
-0.69
tolerably
-0.68
unspeak
-0.67
disagre
-0.65
beaute
-0.65
apprehen
-0.63
friable
-0.63
shenan
-0.59
gratify
-0.58
POSITIVE LOGITS
ideolog
0.66
meras
0.64
sedia
0.61
solidar
0.60
maksi
0.58
notor
0.54
alkoh
0.54
robus
0.53
verdura
0.53
nastro
0.52
Activations Density 0.073%