INDEX
Explanations
words related to statements or claims being made
statements or comments made by individuals in a reporting context
New Auto-Interp
Negative Logits
EDIT
-0.71
irs
-0.66
ktop
-0.64
ï¸
-0.64
Justice
-0.62
soType
-0.60
xtap
-0.60
gencies
-0.60
terness
-0.60
GOODMAN
-0.60
POSITIVE LOGITS
"[
0.91
"â̦
0.88
omin
0.75
bluntly
0.72
'[
0.72
aloud
0.72
"#
0.70
publicly
0.68
underestimated
0.66
Saddam
0.65
Activations Density 0.389%