INDEX
Explanations
phrases related to seeking or offering assistance
instances of the word "help."
Negative Logits
theless     -0.76
Pict        -0.75
é¾          -0.69
ross        -0.67
Bellev      -0.63
ategory     -0.63
andom       -0.63
Viet        -0.61
rall        -0.61
aval        -0.61
Positive Logits
fully       0.95
Desk        0.83
des         0.82
meet        0.76
enza        0.75
full        0.75
counselors  0.72
ful         0.72
broker      0.71
aid         0.71
Activations Density 0.028%
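
For readers unfamiliar with the density figure, the following is a minimal sketch of how such a number could be computed. It assumes density means the fraction of sampled token positions on which the feature's activation is above zero; the page does not state its exact definition. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 25,000 token positions, 7 of which activate the feature.
sample = [0.0] * 24993 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.1, 5.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this assumed definition, a density of 0.028% would correspond to the feature firing on roughly 28 out of every 100,000 tokens.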