INDEX
Explanations
references to simulated or imitation versions of something
references to "mockumentaries" or similar formats
New Auto-Interp
Negative Logits
Horizon
-0.74
cryst
-0.72
ettel
-0.67
violet
-0.66
pins
-0.64
ItemThumbnailImage
-0.64
OA
-0.63
arrang
-0.63
omen
-0.63
compan
-0.62
POSITIVE LOGITS
Mock
1.00
ument
0.98
eries
0.89
ito
0.84
ingly
0.83
atory
0.82
ery
0.81
eting
0.79
mock
0.77
tails
0.75
Activations Density 0.029%