INDEX
Explanations
statements made by official authorities
references to government or administrative officials
New Auto-Interp
Negative Logits
ï¸
-0.77
Horses
-0.75
rums
-0.73
esville
-0.70
effects
-0.66
ĸļ
-0.66
ocene
-0.66
bows
-0.63
vasive
-0.63
Spect
-0.62
POSITIVE LOGITS
dom
1.01
ially
0.87
doms
0.83
tasked
0.83
overseeing
0.78
sanctioned
0.75
stationed
0.73
ials
0.71
itarian
0.71
ulty
0.70
Activations Density 0.032%