INDEX
Explanations
references to eating actions
references to the act of eating
Negative Logits
PV          -0.78
Aus         -0.76
Archangel   -0.71
TEXTURE     -0.67
PBS         -0.67
Stellar     -0.66
McKenna     -0.65
20439       -0.64
Kabul       -0.64
Stur        -0.64
Positive Logits
eat         1.15
eaten       1.12
oleon       1.06
eating      1.01
foods       0.99
habits      0.99
ate         0.96
meals       0.95
Eat         0.95
eater       0.92
Activations Density: 0.012%