Explanations
words related to freedom and its various contexts
Negative Logits
AndEndTag     -0.85
OGND          -0.71
onAttach      -0.64
AttributeSet  -0.64
oneph         -0.61
enumi         -0.59
himo          -0.55
cokinetic     -0.55
trưng         -0.54
ueuse         -0.54
Positive Logits
Free    1.16
Free    1.00
free    0.98
libre   0.93
Fre     0.93
FREE    0.88
FREE    0.88
FRE     0.86
Fre     0.85
フリー  0.84
Activations Density 0.124%