INDEX
Explanations
instances of the word "help" and its variations to identify discussions of support and assistance
New Auto-Interp
Negative Logits
.obtain
-0.15
à¸²à¸ł
-0.14
lover
-0.14
ilet
-0.14
mse
-0.14
rouch
-0.14
GLE
-0.14
gens
-0.14
appointment
-0.13
quip
-0.13
POSITIVE LOGITS
å¿Ļ
0.20
Äijỡ
0.20
with
0.18
effort
0.17
out
0.17
yro
0.16
efforts
0.15
usk
0.15
NECT
0.15
with
0.15
Activations Density 0.052%