INDEX
Explanations
highly intensified or exaggerated words
phrases related to heightened emotions or states of hyperactivity
Negative Logits
ORED       -0.73
Jaw        -0.71
Geneva     -0.69
oured      -0.68
goodbye    -0.68
DERR       -0.68
Forever    -0.67
Jinn       -0.66
OME        -0.63
wiser      -0.62
Positive Logits
bole         1.58
visor        1.43
visors       1.23
dimension    1.20
bol          1.16
vent         1.15
links        1.09
tro          1.08
active       1.07
dimensional  1.05
Activations Density: 0.013%