Explanations
references to laws and legal terminology
Negative Logits
Token        Logit
ká           -0.16
ity          -0.16
497          -0.16
де           -0.15
olini        -0.15
459          -0.15
ت            -0.15
ately        -0.15
kle          -0.14
Riley        -0.14
Positive Logits
Token        Logit
fully        0.32
rence        0.24
fulness      0.24
-ab          0.21
yer          0.21
/reg         0.20
renc         0.19
enforcement  0.19
Enforcement  0.19
fare         0.19
Activations Density: 0.053%