INDEX
Explanations
phrases where the text mentions that no one is involved or responsible
the phrase "no one" and its variations, indicating a focus on themes of isolation, neglect, or lack of attention to issues
New Auto-Interp
Negative Logits
Launch
-0.61
Rounds
-0.59
Contin
-0.59
qqa
-0.58
rn
-0.58
ciation
-0.57
lihood
-0.57
pread
-0.56
edged
-0.56
hip
-0.55
POSITIVE LOGITS
xious
1.18
matter
1.14
one
1.08
sane
1.07
one
1.05
amount
0.96
longer
0.93
sooner
0.88
doubt
0.87
reputable
0.87
Activations Density 0.067%