INDEX
Explanations
phrases related to assistance or support
instances of the word "help" in various contexts
New Auto-Interp
Negative Logits
Creed
-0.70
ratio
-0.67
attraction
-0.66
piv
-0.64
Ov
-0.62
division
-0.61
wearing
-0.61
gradient
-0.60
fusion
-0.59
tub
-0.59
POSITIVE LOGITS
help
4.10
Help
2.30
helps
1.73
Help
1.70
HELP
1.65
help
1.51
support
1.39
guide
1.23
Helpful
1.15
helpful
1.11
Activations Density 0.018%