INDEX
Explanations
phrases related to helping and assisting others
phrases referring to individuals or groups in need or facing challenges
New Auto-Interp
Negative Logits
kefeller
-0.69
zeb
-0.67
Alive
-0.66
Slim
-0.65
carnage
-0.65
bloodshed
-0.62
convincing
-0.62
uphem
-0.60
manslaughter
-0.57
Mandal
-0.56
POSITIVE LOGITS
might
1.16
may
1.11
otherwise
1.09
want
1.05
need
1.01
might
0.99
wish
0.97
want
0.97
wished
0.96
rely
0.96
Activations Density 0.213%