INDEX
Explanations
phrases related to enforcing rules or regulations
phrases related to enforcement actions or regulatory measures
New Auto-Interp
Negative Logits
asus
-0.82
NetMessage
-0.73
Columb
-0.70
FORE
-0.69
nown
-0.66
assies
-0.66
lime
-0.65
Hop
-0.63
fuck
-0.63
apter
-0.63
POSITIVE LOGITS
enforce
1.06
compliance
1.04
enforcement
0.94
enforced
0.86
ments
0.85
stricter
0.84
enforcing
0.83
conformity
0.82
atively
0.81
adherence
0.79
Activations Density 0.022%