INDEX
Explanations
mentions of the word 'lunch'
references to food items, specifically those related to "lunch."
Negative Logits
wcs          -0.80
âĹ¼          -0.74
ministry     -0.73
lling        -0.69
religious    -0.67
Offic        -0.65
ns           -0.65
VICE         -0.64
visual       -0.64
cc           -0.63
Positive Logits
atters       0.90
GOODMAN      0.87
arted        0.82
osate        0.81
ifact        0.80
icago        0.80
anging       0.77
alos         0.76
ucks         0.76
icer         0.76
Activations Density: 0.016%