INDEX
Explanations
governmental institutions and official departments
references to governmental and law enforcement entities
New Auto-Interp
Negative Logits
?",
-0.68
?),
-0.68
predomin
-0.64
"))
-0.60
pired
-0.59
uably
-0.58
successfully
-0.57
)",
-0.56
efully
-0.55
enjoys
-0.54
POSITIVE LOGITS
.
1.08
.'
0.89
.[
0.87
spokeswoman
0.87
spokesman
0.86
.
0.86
*.
0.83
.�
0.83
reports
0.83
.*
0.80
Activations Density 0.219%