INDEX
Explanations
phrases related to completing tasks or achieving goals
phrases related to enjoyment or appreciation of experiences
New Auto-Interp
Negative Logits
.''
-0.79
]."
-0.78
."[
-0.78
thereto
-0.77
)."
-0.75
).[
-0.74
.''.
-0.73
''.
-0.72
thereby
-0.71
."
-0.68
POSITIVE LOGITS
FANTASY
0.84
Patreon
0.75
spoilers
0.72
Spoiler
0.69
Tags
0.67
spoiler
0.67
Nerd
0.67
disclaimer
0.66
nutshell
0.66
reenshots
0.64
Activations Density 2.469%