Explanations
descriptions of food and recipes, specifically related to lime and lemon flavors
Neuron Alignment
Index   Value   % of L₁
453     +0.16   0.5%
1042    +0.08   0.2%
1535    +0.08   0.2%
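The "% of L₁" column is not defined on this page. One plausible reading, stated here as an assumption, is that it reports each neuron's absolute weight as a share of the L₁ norm (the sum of absolute values) of the whole weight vector. A minimal sketch of that reading follows; the function name `l1_share` and the sample vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the vector's L1 norm.

    Assumption: "% of L1" means absolute weight divided by the sum of
    absolute weights. The dashboard itself does not confirm this.
    """
    total = sum(abs(w) for w in weights)
    if total == 0:
        raise ValueError("L1 norm is zero; share is undefined")
    return abs(weights[index]) / total


# Hypothetical toy vector, not the feature's real weights.
toy_weights = [3.0, -1.0]
share = l1_share(toy_weights, 0)
print(f"{share:.1%}")
```

Under this reading, a value of +0.16 at 0.5% would imply a total L₁ norm of roughly 32, though the rounded figures in the table only allow an approximate check.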
Correlated Neurons
Index   P. Corr.   Cos Sim.
453     +0.16      0.05
1417    +0.08      0.03
1160    +0.08      0.03
Negative Logits
reluct     -2.35
indestru   -2.30
encomp     -2.22
accla      -2.19
milf       -2.19
depic      -2.18
disagre    -2.15
increa     -2.13
shenan     -2.12
excru      -2.08
Positive Logits
language   0.71
process    0.69
sport      0.69
period     0.66
date       0.65
type       0.63
portu      0.63
relation   0.62
range      0.62
mode       0.62
Activations Density: 0.186%
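The page does not say how the density figure is computed. A common convention, assumed here rather than confirmed by the source, is the fraction of sampled tokens on which the feature's activation is above zero. The sketch below illustrates that convention; `activation_density` and its `threshold` parameter are hypothetical names.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: density counts strictly positive activations over all
    sampled tokens. The dashboard does not state its exact definition.
    """
    if not activations:
        raise ValueError("no activations provided")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy sample: one active token out of four.
sample = [0.0, 0.0, 1.5, 0.0]
print(f"{activation_density(sample):.3%}")
```

Under this convention, a density of 0.186% would correspond to the feature firing on roughly 2 tokens per 1,000 sampled.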