INDEX
Explanations
mentions of writing tools or instruments
New Auto-Interp
Negative Logits
enance
-0.77
ndum
-0.77
Marxism
-0.66
tt
-0.64
Normandy
-0.63
Athe
-0.60
OST
-0.59
@@@@
-0.59
Giuliani
-0.59
Benef
-0.57
POSITIVE LOGITS
ultimate
1.53
itent
1.50
cil
1.45
insula
1.32
elope
1.26
alties
1.26
manship
1.18
nington
1.11
cill
1.04
ryn
1.01
Activations Density 0.075%