INDEX
Explanations
references to various types of lifestyles and personal life experiences
New Auto-Interp
Negative Logits
isoft
-0.16
Král
-0.15
panse
-0.15
spb
-0.15
ateria
-0.14
soft
-0.14
ernen
-0.14
emos
-0.14
_REASON
-0.14
idden
-0.14
POSITIVE LOGITS
life
0.20
жизни
0.20
-life
0.20
life
0.18
-Life
0.17
lives
0.17
living
0.15
dag
0.14
_life
0.14
839
0.14
Activations Density 0.147%