INDEX
Explanations
instances of the word "help" and its variations, indicating a focus on assistance or support
New Auto-Interp
Negative Logits
quelcon
-0.82
يبة
-0.80
Wally
-0.76
Sten
-0.74
anair
-0.73
eries
-0.73
.*")]
-0.72
elektron
-0.71
hå
-0.71
Wally
-0.70
POSITIVE LOGITS
help
1.61
help
1.45
HELP
1.44
helps
1.40
helping
1.40
HELP
1.37
Help
1.36
Helps
1.33
Helping
1.33
Help
1.31
Activations Density 0.085%