Explanations
mentions of hunger strikes
references to hunger or hunger strikes
Negative Logits
agall      -0.77
inion      -0.72
20439      -0.71
ational    -0.69
struct     -0.66
Offic      -0.66
oral       -0.66
ioxide     -0.64
iate       -0.64
INO        -0.64
Positive Logits
hunger     0.98
hungry     0.98
strikers   0.91
Hunger     0.83
frenzy     0.81
fruit      0.80
meals      0.80
thirsty    0.80
striker    0.79
wolves     0.78
Activations Density 0.071%