INDEX
Explanations
the word "mock" and variations of it
references to "mock" and its variations, indicating a focus on satirical or imitative themes
New Auto-Interp
Negative Logits
cryst
-0.82
Horizon
-0.75
metic
-0.71
arrang
-0.69
edin
-0.68
compan
-0.68
omen
-0.65
ettel
-0.64
violet
-0.63
OA
-0.63
POSITIVE LOGITS
Mock
1.12
ument
0.92
eries
0.89
mock
0.88
ito
0.83
ety
0.80
atory
0.79
eting
0.78
mocking
0.77
ery
0.76
Activations Density 0.030%