INDEX
Explanations
terms related to anti-establishment attitudes or movements
New Auto-Interp
Negative Logits
¥
-0.17
":[-
-0.15
pek
-0.14
pit
-0.14
238
-0.14
Wiley
-0.14
metam
-0.14
ur
-0.13
Duty
-0.13
sem
-0.13
POSITIVE LOGITS
ilib
0.19
ÅĽÄĩ
0.16
(er
0.15
chal
0.15
(?
0.15
hton
0.15
OLT
0.15
anth
0.15
mall
0.15
-"
0.15
Activations Density 0.199%