INDEX
Explanations
verbs related to life and existence
phrases related to the concept of living
New Auto-Interp
Negative Logits
sonian
-0.86
ippi
-0.80
ession
-0.74
reg
-0.72
ery
-0.71
enei
-0.69
essee
-0.68
elight
-0.68
pex
-0.66
intendent
-0.65
POSITIVE LOGITS
lihood
1.04
vic
0.96
happily
0.81
liness
0.75
comfortably
0.75
Forever
0.74
stead
0.74
uate
0.73
peacefully
0.72
dangerously
0.72
Activations Density 0.036%