INDEX
Explanations
references to dark or mysterious themes
New Auto-Interp
Negative Logits
kson
-0.77
berman
-0.73
ciation
-0.70
utic
-0.70
oples
-0.69
Virtual
-0.68
agine
-0.67
UTERS
-0.67
aii
-0.65
onent
-0.65
POSITIVE LOGITS
hound
1.18
horse
1.00
bolt
0.86
mere
0.86
croft
0.83
dark
0.82
moon
0.81
lord
0.81
urnal
0.81
crow
0.80
Activations Density 0.743%