INDEX
Explanations
entities, organizations, and events mentioned in news articles
statements and reports made by authorities or officials
New Auto-Interp
Negative Logits
âĸijâĸij
-0.74
\\\\\\\\
-0.68
oil
-0.61
pill
-0.58
ardless
-0.58
lor
-0.58
otal
-0.57
some
-0.56
rafted
-0.56
TABLE
-0.55
POSITIVE LOGITS
.
0.66
dism
0.61
doms
0.60
enz
0.60
."
0.59
indo
0.58
reon
0.57
yk
0.57
srf
0.57
adding
0.56
Activations Density 0.200%