INDEX
Explanations
dates in a specific format
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1416
+0.13
0.4%
568
+0.12
0.4%
1602
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
499
+0.13
0.04
382
+0.12
0.04
776
+0.12
0.04
Negative Logits
Punj
-0.70
épu
-0.62
éprou
-0.60
Bux
-0.59
retrouvé
-0.57
Skład
-0.56
Lari
-0.55
soulign
-0.54
Bekasi
-0.54
détru
-0.53
POSITIVE LOGITS
PHOSPH
0.50
áng
0.47
odori
0.47
arbonato
0.46
SpringBoot
0.46
sente
0.46
dita
0.43
Ress
0.43
orias
0.43
tā
0.42
Activations Density 0.196%