INDEX
Explanations
information related to government actions and policies, particularly concerning health and safety regulations
New Auto-Interp
Negative Logits
hari
-0.80
KT
-0.78
ording
-0.78
nels
-0.76
obin
-0.75
holes
-0.75
sson
-0.74
lehem
-0.74
anges
-0.72
stad
-0.71
POSITIVE LOGITS
own
1.38
newest
1.26
biggest
1.13
flagship
1.10
finest
1.05
namesake
1.04
inability
1.04
foray
1.01
demise
1.01
predecessor
1.01
Activations Density 1.799%