INDEX
Explanations
mentions of specific names or usernames along with corresponding text
social media engagement and sharing prompts
New Auto-Interp
Negative Logits
etheless
-0.85
thrust
-0.77
hement
-0.76
displacement
-0.71
apprehended
-0.70
shun
-0.67
submar
-0.67
shove
-0.66
penal
-0.66
tackling
-0.66
POSITIVE LOGITS
CHAPTER
1.00
³³³³
0.96
Anonymous
0.91
Follow
0.90
Original
0.90
QUEST
0.89
FAQ
0.88
robe
0.88
EGIN
0.87
EDIT
0.87
Activations Density 0.118%