INDEX
Explanations
instances where the concept of bothering or not bothering someone is mentioned
instances of the word "bother" and its variations, indicating a focus on lack of effort or concern
New Auto-Interp
Negative Logits
arb
-0.77
arta
-0.77
INAL
-0.77
oiler
-0.75
ramer
-0.74
UE
-0.74
ophe
-0.73
sung
-0.72
inia
-0.69
odes
-0.68
POSITIVE LOGITS
bother
1.17
bothering
1.06
some
0.90
crow
0.89
bothered
0.82
MENTS
0.80
bothers
0.71
tamp
0.68
fulness
0.65
Ivanka
0.64
Activations Density 0.016%