INDEX
Explanations
terms related to weight, particularly in the context of weight management and health
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.16
0.9%
172
+0.13
0.7%
232
+0.11
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
172
+0.16
0.03
232
+0.13
0.02
426
+0.11
0.03
Negative Logits
thood
-1.86
uginosa
-1.65
mind
-1.56
Majesty
-1.55
Spacewatch
-1.55
áĢº
-1.52
chell
-1.50
ureus
-1.50
patient
-1.48
opes
-1.48
POSITIVE LOGITS
ĻĤ
1.94
loader
1.78
ģ
1.68
ily
1.65
wise
1.63
achusetts
1.60
spectrometer
1.49
tag
1.49
spectrom
1.48
ier
1.46
Activations Density 0.021%