INDEX
Explanations
phrases related to authority or actions taken against something or someone
terms related to government enforcement actions or restrictions
New Auto-Interp
Negative Logits
cius
-0.71
lled
-0.67
chn
-0.66
UNCH
-0.64
FORM
-0.64
zza
-0.64
astern
-0.63
CE
-0.63
WC
-0.62
lda
-0.61
POSITIVE LOGITS
s
2.31
ski
1.30
sburg
1.25
sb
1.21
sand
1.18
sie
1.16
sf
1.13
sburgh
1.08
ship
1.08
sg
1.07
Activations Density 0.133%