INDEX
Explanations
words related to disturbing or unsettling situations
instances of the word "disturbing" and its context within the text
Negative Logits
cellence    -0.79
oned        -0.76
gs          -0.73
oled        -0.72
arta        -0.72
eva         -0.72
ophers      -0.71
tes         -0.69
ht          -0.68
bid         -0.68
Positive Logits
disturbing      0.91
disturb         0.81
sexist          0.75
undermin        0.73
urnal           0.71
creeps          0.69
indecent        0.69
pornographic    0.67
unsettling      0.67
alarming        0.66
Activations Density 0.011%