INDEX
Explanations
concepts related to individual freedom and autonomy
New Auto-Interp
Negative Logits
cplusplus
-0.15
rique
-0.15
ä¸įè¶³
-0.15
wap
-0.14
imary
-0.14
umlu
-0.13
.Peek
-0.13
à¥Īà¤ł
-0.13
inecraft
-0.13
Exclusive
-0.13
POSITIVE LOGITS
freedom
0.82
liberty
0.69
Freedom
0.66
freedoms
0.65
Freedom
0.62
èĩªçͱ
0.58
fre
0.57
independence
0.57
Ñģвоб
0.54
libert
0.52
Activations Density 0.402%