INDEX
Explanations
phrases containing the word "worry."
expressions of reassurance to alleviate concern
Negative Logits
arb      -0.83
avorite  -0.79
aw       -0.76
urd      -0.75
arth     -0.74
eg       -0.67
idel     -0.64
ĻĤ       -0.63
rown     -0.61
ibur     -0.61
Positive Logits
fret        0.87
anymore     0.70
llor        0.69
ANCE        0.69
Variable    0.68
lessly      0.67
!.          0.67
ABOUT       0.66
whatsoever  0.64
DISTR       0.64
Activations Density: 0.036%