Explanations
phrases related to written scripts
mentions of scripts or screenplays
Negative Logits
obby      -0.78
theless   -0.75
iscopal   -0.69
evil      -0.65
ternity   -0.63
Ath       -0.62
Charity   -0.60
ktop      -0.60
hm        -0.59
hate      -0.59
Positive Logits
writers      1.08
writer       1.04
urally       1.03
ural         1.03
writing      0.95
script       0.88
uring        0.87
screenplay   0.86
synopsis     0.86
wright       0.86
Activations Density: 0.009%