INDEX
Explanations
phrases related to erratic behavior and mental instability
references to psychological or emotional states and behaviors
New Auto-Interp
Negative Logits
keyes
-0.73
ction
-0.72
otin
-0.68
respective
-0.67
riad
-0.67
addons
-0.66
ISS
-0.65
Available
-0.64
oor
-0.63
Remove
-0.61
POSITIVE LOGITS
himself
1.16
Himself
0.90
herself
0.84
his
0.82
His
0.81
wandered
0.80
enjoys
0.79
evidently
0.79
His
0.78
confessed
0.77
Activations Density 1.002%