INDEX
Explanations
actions related to establishment and provision of services, regulations, or changes
New Auto-Interp
Negative Logits
jal
-0.16
Tort
-0.15
238
-0.15
elder
-0.14
chas
-0.14
utomation
-0.14
ruz
-0.13
olars
-0.13
utoff
-0.13
Ore
-0.13
POSITIVE LOGITS
isini
0.15
MBER
0.15
etat
0.14
ãĥ³ãĤ°
0.14
Ske
0.14
ocument
0.14
ç·
0.14
rawn
0.13
olo
0.13
itional
0.13
Activations Density 0.009%