INDEX
Explanations
mentions of someone going all out or to the extreme in their actions or behavior
expressions of intensity or absoluteness
New Auto-Interp
Negative Logits
ourses
-0.90
án
-0.83
SPONSORED
-0.80
furthermore
-0.79
iffs
-0.79
enes
-0.77
ells
-0.76
roups
-0.76
perhaps
-0.76
atches
-0.76
POSITIVE LOGITS
underdog
0.88
spoiler
0.76
Ninja
0.75
badass
0.75
pirate
0.75
renaissance
0.75
boring
0.74
plagiar
0.73
crappy
0.73
cheesy
0.73
Activations Density 0.370%