Explanations
timestamps in a specific format
Neuron Alignment
Index   Value   % of L₁
1343    +0.19   0.6%
612     +0.15   0.5%
814     +0.13   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1343    +0.19      0.06
612     +0.15      0.01
1036    +0.13      0.01
Negative Logits
betweenstory   -0.52
unworthy       -0.51
gyhoeddwyd     -0.48
gratify        -0.46
shrill         -0.45
pharynx        -0.45
quivering      -0.45
TBI            -0.44
cnpj           -0.44
ztő            -0.43
Positive Logits
aen       0.93
alkoh     0.92
inder     0.88
meis      0.87
sii       0.85
kosme     0.84
handels   0.82
ciment    0.82
franz     0.81
gmbh      0.81
Activations Density 0.292%