INDEX
Explanations
references to legal requests and governmental transparency
Negative Logits
pow          -0.16
(describing  -0.15
ç²           -0.14
mitt         -0.14
icha         -0.14
Borg         -0.14
èº           -0.14
ÏĢοÏĤ        -0.14
loyal        -0.14
izz          -0.13
Positive Logits
FO         0.39
Freedom    0.31
FO         0.27
Freedom    0.27
requester  0.23
freedom    0.22
request    0.22
PIO        0.21
requests   0.20
Fo         0.20
Activations Density 0.036%