INDEX
Explanations
references to monetary amounts or financial figures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
43
+0.11
0.6%
442
+0.11
0.6%
330
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
442
+0.11
0.04
379
+0.11
0.05
317
+0.11
0.04
Negative Logits
$/
-1.55
Rate
-1.53
¿½
-1.45
nickel
-1.45
fare
-1.40
buff
-1.39
%.
-1.38
ages
-1.37
lop
-1.36
manship
-1.35
POSITIVE LOGITS
usions
1.76
ubottu
1.52
conceived
1.52
ensively
1.49
inks
1.48
ensen
1.41
invented
1.41
ece
1.40
ocarcin
1.39
npmjs
1.36
Activations Density 0.356%