INDEX
Explanations
Mentions of specific individuals, particularly those in positions of leadership or governance
Negative Logits
ort       -0.16
ides      -0.15
Palmer    -0.15
etas      -0.14
ever      -0.14
Orr       -0.14
amento    -0.14
ui        -0.13
etrics    -0.13
ext       -0.13
Positive Logits
ظÙģ        0.15
ocket      0.15
оÑĤÑĮ       0.15
chwitz     0.15
missions   0.14
itou       0.14
phia       0.14
bytesRead  0.14
.capture   0.14
esan       0.14
Activations Density 0.019%