INDEX
Explanations
phrases related to government officials and their actions
references to authorities or officials
New Auto-Interp
Negative Logits
oise
-0.70
OPE
-0.68
ãĤ¦
-0.67
ocene
-0.67
esville
-0.64
oons
-0.63
Interstitial
-0.63
Beg
-0.61
ãĥ¤
-0.60
[|
-0.60
POSITIVE LOGITS
overseeing
1.10
briefed
1.05
tasked
0.98
doms
0.94
responsible
0.89
familiar
0.87
stationed
0.86
involved
0.81
complicit
0.79
investigating
0.79
Activations Density 0.066%