INDEX
Explanations
concepts related to life and vitality
Negative Logits
Token       Logit
PREF        -0.16
ster        -0.15
cast        -0.15
sterile     -0.15
oker        -0.15
ipi         -0.15
igu         -0.14
achie       -0.14
afür        -0.14
ican        -0.14
Positive Logits
Token       Logit
life        0.17
ãĥ³ãĥĶ      0.16
↵↵          0.15
.life       0.15
_hdl        0.15
618         0.14
æ´»         0.14
life        0.14
Kür         0.14
alive       0.14
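Some tokens in the lists above (`ãĥ³ãĥĶ`, `æ´»`) look like the display form that GPT-2 style byte-level BPE tokenizers use for multi-byte UTF-8 text. The sketch below assumes that encoding, which the page itself does not confirm, and maps such strings back to readable characters. The function names are hypothetical; the table construction follows the publicly documented GPT-2 byte-to-unicode scheme. Tokens that do not fit the scheme are returned unchanged.

```python
def build_display_decoder():
    """Inverse of the GPT-2 byte-to-unicode display table (assumed encoding)."""
    # Bytes that the scheme displays as themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_to_char = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to a code point at 256 and above.
    shift = 0
    for b in range(256):
        if b not in byte_to_char:
            byte_to_char[b] = chr(256 + shift)
            shift += 1
    return {ch: b for b, ch in byte_to_char.items()}


_DECODER = build_display_decoder()


def decode_display_token(token: str) -> str:
    """Decode a byte-level display token; return it unchanged if it does not fit."""
    try:
        return bytes(_DECODER[ch] for ch in token).decode("utf-8")
    except (KeyError, UnicodeDecodeError):
        # Not a byte-level display string (for example "afür" or "↵↵").
        return token


for tok in ["æ´»", "ãĥ³ãĥĶ", "afür", "↵↵", "life"]:
    print(repr(tok), "->", repr(decode_display_token(tok)))
```

If the assumption holds, `æ´»` decodes to 活 (a character meaning "alive" or "active") and `ãĥ³ãĥĶ` to the katakana ンピ. The first of these would be consistent with the "life and vitality" explanation.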
Activations Density: 0.114%
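As a rough illustration of what a density figure like this could mean, the sketch below assumes it is the fraction of sampled tokens on which the feature activation exceeds zero. That definition, the function name `activation_density`, and the synthetic sample are all assumptions for illustration, not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation is strictly above `threshold`."""
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic data: 114 active tokens out of 100,000, chosen only so the
# result matches the format of the figure reported above.
sample = [0.0] * 99_886 + [1.0] * 114
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.114% would mean the feature fires on roughly one token in every 877.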