INDEX
Explanations
phrases related to superhuman abilities or qualities
instances of the word "super."
New Auto-Interp
Negative Logits
casualty
-0.75
RF
-0.68
Jays
-0.67
Dres
-0.67
DPR
-0.62
burned
-0.62
rounded
-0.61
FG
-0.61
drip
-0.60
empt
-0.60
POSITIVE LOGITS
super
3.94
Super
2.68
super
1.80
SUPER
1.78
Super
1.57
sup
1.54
SUP
1.54
uper
1.47
special
1.23
hyper
1.15
Activations Density 0.011%