INDEX
Explanations
words and phrases related to resistance and social justice issues
New Auto-Interp
Negative Logits
TA
-0.17
rapy
-0.15
ta
-0.14
kav
-0.14
rencont
-0.14
žen
-0.14
igy
-0.14
коÑĤ
-0.13
ille
-0.13
emies
-0.13
POSITIVE LOGITS
instead
0.20
rung
0.16
instead
0.16
ék
0.15
increasingly
0.15
ãĥ³ãĥĢ
0.15
lic
0.15
iju
0.14
VML
0.14
Instead
0.14
Activations Density 0.466%