INDEX
Explanations
organizations or authorities making official statements
statements made by official organizations or departments
New Auto-Interp
Negative Logits
haun
-0.85
impe
-0.73
teammates
-0.70
himself
-0.70
assassinated
-0.69
wives
-0.67
Redditor
-0.67
classmates
-0.66
UGH
-0.66
incumb
-0.66
POSITIVE LOGITS
statement
1.03
broch
0.87
guidelines
0.85
filings
0.85
summary
0.84
report
0.81
Publication
0.80
statement
0.80
Statement
0.79
latest
0.78
Activations Density 0.187%