INDEX
Explanations
references related to fuel and fuel-related activities such as fueling, subsidies, and different types of fuels
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1839
+0.12
0.4%
67
+0.12
0.4%
528
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1464
+0.12
0.02
67
+0.12
0.02
1507
+0.11
0.02
Negative Logits
racon
-0.76
fortn
-0.72
benevol
-0.68
evoc
-0.67
intrigu
-0.66
philosophic
-0.65
Machia
-0.65
pamph
-0.65
accla
-0.64
dismant
-0.63
POSITIVE LOGITS
fuel
1.44
fuel
1.34
Fuel
1.28
Fuel
1.27
fuels
1.25
FUEL
1.18
fuels
1.00
FUEL
0.99
Fuels
0.97
fueled
0.85
Activations Density 0.071%