INDEX
Explanations
topics related to free speech and religious rights
New Auto-Interp
Negative Logits
loggedin
-0.17
_SCRIPT
-0.16
ucha
-0.15
æĺĵ
-0.14
uild
-0.14
egin
-0.14
_dot
-0.14
_dma
-0.14
eya
-0.14
u
-0.14
POSITIVE LOGITS
freedom
0.51
freedoms
0.45
Freedom
0.44
Freedom
0.42
liberty
0.38
fre
0.36
Ñģвоб
0.33
FRE
0.32
liberties
0.31
èĩªçͱ
0.30
Activations Density 0.268%