INDEX
Explanations
terms related to advocacy and activism
New Auto-Interp
Negative Logits
ken
-0.21
pond
-0.17
owo
-0.16
lements
-0.15
ks
-0.15
bac
-0.14
ugg
-0.14
uji
-0.14
æµģ
-0.14
nd
-0.14
POSITIVE LOGITS
for
0.18
against
0.18
atively
0.17
Inch
0.16
141
0.16
778
0.16
Against
0.15
amins
0.15
forth
0.14
../../../
0.14
Activations Density 0.032%