INDEX
Explanations
terminology related to various 'isms' and ideologies, particularly focusing on political and social movements
New Auto-Interp
Negative Logits
pad
-0.17
able
-0.16
gether
-0.15
cott
-0.15
aghan
-0.15
oom
-0.14
let
-0.14
ing
-0.14
own
-0.14
who
-0.14
POSITIVE LOGITS
adil
0.16
perature
0.15
ality
0.15
anness
0.15
pora
0.15
atically
0.15
CF
0.14
é¡Ķ
0.14
èĢħçļĦ
0.14
ometrics
0.14
Activations Density 0.103%