INDEX
Explanations
phrases related to offering or needing help
references to needing help or assistance
New Auto-Interp
Negative Logits
itect
-0.75
Democr
-0.65
azar
-0.65
atom
-0.64
imore
-0.63
querade
-0.60
ð
-0.60
Seym
-0.60
pire
-0.59
ignore
-0.58
POSITIVE LOGITS
lessly
1.43
assistance
1.27
help
1.19
rescuing
1.08
reminding
1.01
repairs
0.98
clarification
0.97
reinforcements
0.96
advice
0.95
HELP
0.92
Activations Density 0.082%