INDEX
Explanations
concepts related to freedom and expression
Negative Logits
Mortar            -0.67
Kanpo             -0.66
оригіналу         -0.65
rungsseite        -0.65
يكب               -0.65
CreateTagHelper   -0.64
дописавши         -0.63
noses             -0.62
mortar            -0.61
briefcase         -0.61
Positive Logits
freedom           1.78
Freedom           1.72
Freedom           1.63
freedom           1.52
FREEDOM           1.51
freedoms          1.42
liberty           1.24
EDOM              1.13
liberté           1.12
bebas             1.11
Activations Density: 0.085%