INDEX
Explanations
words related to a lack of strictness, especially in terms of regulations or enforcement
terms related to regulatory oversights and their associated laxities
Negative Logits
hung       -0.85
flies      -0.80
mitt       -0.71
cry        -0.70
stroke     -0.69
dress      -0.67
waves      -0.67
humans     -0.66
bearing    -0.66
Seeking    -0.66
Positive Logits
lax        1.43
glers      0.90
ative      0.89
iencies    0.87
atives     0.86
ãĥ¼ãĥĨ     0.85
acies      0.82
undermin   0.81
ometer     0.80
keyes      0.78
Activations Density: 0.009%