INDEX
Explanations
requests or mentions of needing help or assistance
phrases related to the need for assistance or support
New Auto-Interp
Negative Logits
nown
-0.69
umbered
-0.67
azar
-0.66
adr
-0.65
aroo
-0.65
uyomi
-0.65
lamm
-0.65
agate
-0.63
Lines
-0.62
amaru
-0.62
POSITIVE LOGITS
HELP
0.82
urgently
0.78
refres
0.77
reinforcement
0.75
sust
0.74
assurance
0.74
patience
0.74
manpower
0.71
assurances
0.71
help
0.69
Activations Density 0.183%