Explanations
words related to absurdity, madness, and hypocrisy
themes related to absurdity and irrationality
Negative Logits
ergy          -0.80
ellar         -0.70
iable         -0.66
vals          -0.65
Scientist     -0.65
amins         -0.64
arters        -0.63
vice          -0.63
eric          -0.62
izations      -0.62

Positive Logits
inflicted      0.86
fulness        0.86
tremend        0.81
abound         0.81
ulence         0.80
iness          0.79
awaiting       0.79
perpetrated    0.77
imaginable     0.73
ously          0.72

Activations Density 0.080%
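
The page does not define how the density figure is computed. A minimal sketch, assuming it is the percentage of sampled token positions on which the feature has a nonzero activation, is shown below. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 10,000 token positions, 8 of which activate the feature.
sample = [0.0] * 9992 + [1.5] * 8
print(f"{activation_density(sample):.3f}%")  # prints 0.080%
```

Under that assumption, a density of 0.080% would mean the feature fires on roughly 8 of every 10,000 tokens.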