INDEX
Explanations
references to political figures and events, particularly those involving controversy or conflict
Negative Logits
skelet        -0.60
PAX           -0.57
Citiz         -0.56
Prometheus    -0.55
antioxid      -0.54
entitle       -0.54
palate        -0.53
Antar         -0.53
Amer          -0.53
VID           -0.52
Positive Logits
ervative      0.71
ervatives     0.70
Jinping       0.70
enei          0.63
hler          0.62
Äĩ            0.60
hedral        0.60
vous          0.60
tsy           0.59
®             0.59
Activations Density 15.656%