INDEX
Explanations
expressions of social critique or commentary regarding societal issues
Negative Logits
inos      -0.15
æ¯        -0.15
σκευ      -0.14
istrov    -0.14
ordo      -0.14
gears     -0.14
Locator   -0.13
illard    -0.13
argin     -0.13
.Clock    -0.13
Positive Logits
dyn        0.19
LK         0.18
Congress   0.18
VIP        0.18
chor       0.17
communal   0.17
Congress   0.16
dyn        0.16
Encounter  0.16
dynasty    0.15
Activations Density 0.380%