Explanations
references to mainstream culture and its elements
Negative Logits
Token         Logit
alles         -0.19
toy           -0.16
ts            -0.16
toi           -0.16
MainMenu      -0.16
anus          -0.15
MainWindow    -0.15
canf          -0.15
inia          -0.15
genic         -0.15
Positive Logits
Token         Logit
stay          0.49
frame         0.35
stream        0.34
line          0.32
enance        0.31
lining        0.30
frames        0.29
-stream       0.28
steam         0.27
Stay          0.26
Activations Density: 0.043%