INDEX
Explanations
personal pronouns and short verb phrases, potentially indicating dialogue
expressions of uncertainty and indecision in dialogue
New Auto-Interp
Negative Logits
uably
-0.55
ufact
-0.54
ordes
-0.52
agric
-0.52
issance
-0.51
AMD
-0.50
municip
-0.50
orsi
-0.50
tnc
-0.50
respective
-0.49
POSITIVE LOGITS
fuckin
0.86
fucking
0.81
uh
0.71
fucked
0.70
fuck
0.68
eeee
0.67
gonna
0.67
funny
0.66
goddamn
0.65
kinda
0.64
Activations Density 1.658%