INDEX
Explanations
phrases indicating a change in relationship dynamics or personal circumstances
the repetition of the word "anymore" indicating a sense of loss or change
New Auto-Interp
Negative Logits
eer
-0.70
maximum
-0.66
erest
-0.65
pta
-0.64
stood
-0.60
ortment
-0.60
gging
-0.59
ured
-0.59
ighth
-0.58
gged
-0.58
POSITIVE LOGITS
than
0.86
adays
0.80
;)
0.74
:-)
0.73
ONSORED
0.71
:)
0.70
:(
0.69
!!!!!
0.69
!!!
0.69
!!
0.69
Activations Density 0.030%