INDEX
Explanations
phrases indicating a sense of loss or absence
New Auto-Interp
Negative Logits
idelines
-0.83
eds
-0.78
itures
-0.77
hips
-0.76
breaks
-0.75
Topic
-0.73
gemony
-0.72
imentary
-0.69
hesis
-0.69
reements
-0.69
POSITIVE LOGITS
encountering
0.99
owning
0.98
seeing
0.96
having
0.95
knowing
0.93
watching
0.89
discovering
0.86
witnessing
0.85
childbirth
0.81
being
0.81
Activations Density 0.083%