INDEX
Explanations
words related to physical actions, intense emotions, and descriptions of scenes
punctuation and its patterns within the text
New Auto-Interp
Negative Logits
OND
-0.77
UF
-0.74
venth
-0.68
":"/
-0.66
emaker
-0.66
ONSORED
-0.66
":[{"-0.65
®
-0.65
enary
-0.65
pec
-0.64
POSITIVE LOGITS
albeit
1.03
culminating
0.92
lest
0.91
including
0.83
huh
0.79
alas
0.79
namely
0.77
though
0.75
plung
0.74
eh
0.74
Activations Density 0.832%