INDEX
Explanations
words related to rules, regulations, and policies
rules, restrictions, or regulations in various contexts
New Auto-Interp
Negative Logits
nesota
-0.82
atform
-0.75
bernatorial
-0.72
ibliography
-0.72
cellaneous
-0.66
resurrection
-0.65
circumst
-0.64
aukee
-0.64
ospels
-0.64
Preview
-0.63
POSITIVE LOGITS
quotas
1.09
forcing
0.98
forbid
0.97
depri
0.96
refusing
0.96
forbidden
0.91
discriminatory
0.91
restricting
0.90
ration
0.89
blacklist
0.88
Activations Density 1.330%