Explanations
phrases indicating concern for the well-being of oneself and others
phrases related to personal relationships and connections
Negative Logits
igun                  -0.65
etting                -0.64
代                    -0.63
geoning               -0.62
ettel                 -0.61
Contin                -0.59
guiActiveUnfocused    -0.59
imal                  -0.59
dayName               -0.58
Gleaming              -0.58
Positive Logits
others      1.21
yours       1.07
theirs      0.93
ours        0.88
everyone    0.88
your        0.88
anyone      0.85
our         0.84
my          0.82
your        0.81
Activations Density 0.110%
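
For readers unfamiliar with the density figure, the following is a minimal sketch of one common way such a number can be computed: the share of token positions on which the feature's activation is above zero, expressed as a percentage. The function name `activation_density`, the zero threshold, and the sample data are all assumptions made for illustration; the page does not state how its value was derived.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the fraction of token positions on which
    the feature fires at all. The dashboard does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 10,000 token positions, 11 of which activate the feature.
sample = [0.0] * 9989 + [1.5] * 11
print(f"{activation_density(sample):.3f}%")  # prints 0.110%
```

Under this reading, a density of 0.110% corresponds to the feature firing on roughly 11 of every 10,000 tokens.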