Explanations
phrases related to rules, policies, and procedures
words related to rules, policies, and structures of authority
Negative Logits
Token    Logit
pload    -0.78
nen      -0.71
Citiz    -0.71
glim     -0.69
BUS      -0.69
ÃŃn      -0.68
plet     -0.67
ãĥij      -0.66
star     -0.66
Defin    -0.65
Positive Logits
Token       Logit
ropy        0.73
XIV         0.65
-----       0.63
lain        0.63
relating    0.62
":["        0.62
utra        0.61
XIII        0.58
---         0.58
ulhu        0.58
Activation Density: 0.255%