INDEX
Explanations
instances of the word "oil" and related terms
Neuron Alignment
Index   Value   % of L₁
1296    +0.14   0.6%
1895    +0.12   0.5%
1233    +0.12   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1296    +0.14      0.04
1464    +0.12      0.03
1233    +0.12      0.03
Negative Logits
meis             -0.72
poff             -0.71
fei              -0.71
foon             -0.71
mme              -0.71
fep              -0.70
aen              -0.70
fte              -0.70
autorytatywna    -0.67
myn              -0.65
Positive Logits
oil     1.54
Oil     1.36
Oil     1.35
OIL     1.32
oil     1.32
oils    1.20
Oils    1.01
OIL     1.00
油      0.86
油      0.83
Activations Density 0.052%