INDEX
Explanations
short prompts starting with the word "Remember"
the repeated phrase "Remember," indicating a focus on nostalgia or recalling past events
New Auto-Interp
Negative Logits
Mehran
-0.73
elle
-0.70
goo
-0.68
Constructed
-0.66
hung
-0.66
irlf
-0.65
sels
-0.63
elled
-0.63
Beast
-0.63
Juda
-0.63
POSITIVE LOGITS
remember
0.92
remembering
0.82
Recall
0.74
forgetting
0.73
ably
0.72
remember
0.71
forgotten
0.70
remembered
0.70
remembrance
0.68
theless
0.67
Activations Density 0.018%