INDEX
Explanations
statements made by authorities or officials
instances of reported speech or statements by authorities
New Auto-Interp
Negative Logits
theless
-0.64
fixation
-0.61
76561
-0.59
talk
-0.55
Bastard
-0.55
ggles
-0.52
milo
-0.51
obscurity
-0.51
estern
-0.50
haun
-0.48
POSITIVE LOGITS
.
1.01
*.
0.83
.–
0.82
.[
0.82
.?
0.81
.;
0.79
.*
0.78
.(
0.77
.�
0.74
.—
0.72
Activations Density 0.092%