INDEX
Explanations
mentions of hyperbolic or exaggerated language and descriptions
prefixes and suffixes related to hyperactivity or excessive states
Negative Logits
baum        -0.70
pherd       -0.66
ADRA        -0.65
ometimes    -0.61
Dialogue    -0.61
ABE         -0.60
hement      -0.59
emouth      -0.58
ulhu        -0.57
Folder      -0.57
Positive Logits
ulic        0.92
opter       0.86
emic        0.85
olic        0.78
agog        0.76
ilic        0.74
ersive      0.72
uating      0.71
inem        0.70
olester     0.68
Activations Density: 0.136%