INDEX
Explanations
references to influential figures in various fields such as philosophy, music, and activism
Negative Logits
ewise    -0.17
_Tis     -0.15
_Lean    -0.15
arus     -0.15
bies     -0.14
ensi     -0.14
radu     -0.13
ÌĨ       -0.13
asan     -0.13
oui      -0.13
Positive Logits
extra    0.19
known    0.16
who      0.14
edition  0.14
ocracy   0.14
@s       0.14
let      0.14
Fra      0.14
Pub      0.14
Edition  0.13
Activations Density: 0.074%