INDEX
Explanations
expressions of strong emotions or opinions
the word "totally" and its variations, indicating strong emphasis or agreement
New Auto-Interp
Negative Logits
rers
-0.84
åº
-0.76
ently
-0.75
lings
-0.73
sburg
-0.73
mere
-0.72
issance
-0.71
pring
-0.70
llular
-0.69
llor
-0.69
POSITIVE LOGITS
understandable
0.75
unrelated
0.71
forget
0.68
freaking
0.68
obliter
0.67
annihil
0.65
surprised
0.65
allergic
0.65
MIA
0.64
unexpected
0.64
Activations Density 0.021%