INDEX
Explanations
references to legal or official matters, often related to terms such as "Offender" or "Official."
New Auto-Interp
Negative Logits
ãĤ§
-0.90
ãĤ©
-0.73
Jiu
-0.67
bund
-0.66
ãĥŃ
-0.66
ãĤ£
-0.65
ãĥ¥
-0.62
ISM
-0.62
zeb
-0.61
ãĥ£
-0.60
POSITIVE LOGITS
ense
1.22
ices
1.22
ensive
1.08
enders
1.03
erence
1.03
enses
0.97
ered
0.95
ences
0.95
enger
0.94
rey
0.94
Activations Density 0.059%