INDEX
Explanations
sentences indicating a lack of concern or indifference towards certain topics or situations
expressions of indifference or apathy towards various topics
Negative Logits
oward        -0.73
PG           -0.71
scripts      -0.69
arist        -0.69
visor        -0.68
ciplinary    -0.66
uns          -0.66
Fig          -0.66
Pacific      -0.66
idelines     -0.66
Positive Logits
anymore      1.32
whatsoever   1.03
necessarily  0.81
nor          0.76
anybody      0.74
any          0.71
anything     0.71
blinking     0.70
EVER         0.68
slightest    0.67
Activations Density 0.075%
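
As a rough illustration of what a density figure like this could mean, the sketch below assumes that activation density is the fraction of token positions on which the feature fires above a threshold. That definition, the function name activation_density, and the sample data are all assumptions made for illustration; the page itself does not say how the number was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition (not stated in the source): a position counts as
    active when its activation is strictly greater than `threshold`.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 3 active positions out of 4000 tokens.
sample = [0.0] * 3997 + [1.2, 0.4, 2.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this assumed definition, a density of 0.075% would correspond to the feature being active on roughly 3 of every 4000 tokens.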