INDEX
Explanations
words related to leniency, toughness, and draconian measures
terms related to legal leniency and strictness in enforcement
New Auto-Interp
Negative Logits
rift
-0.83
Rus
-0.79
thora
-0.76
wash
-0.75
Sphere
-0.74
plane
-0.70
Corpse
-0.70
Canterbury
-0.69
ieu
-0.68
Feast
-0.68
POSITIVE LOGITS
vetting
1.04
stringent
1.02
toug
0.98
restrictive
0.95
punitive
0.94
stricter
0.90
tougher
0.89
harsher
0.88
sentencing
0.88
deported
0.87
Activations Density 0.021%