INDEX
Explanations
instances of the word "impose" and its variations, indicating a focus on authority and regulations
New Auto-Interp
Negative Logits
rb
-0.17
uge
-0.17
ÐĹем
-0.15
ji
-0.15
SV
-0.15
ØŃÙĬ
-0.15
orts
-0.14
ben
-0.14
Ïĥμ
-0.14
zilla
-0.14
POSITIVE LOGITS
èĸ¦
0.16
mÃŃn
0.14
ê½
0.14
.conditions
0.14
erli
0.14
inent
0.13
.weather
0.13
`<
0.13
iments
0.13
CKER
0.13
Activations Density 0.013%