INDEX
Explanations
phrases related to having fun and enjoyment
New Auto-Interp
Negative Logits
ypse
-0.15
ç§
-0.15
fade
-0.15
_ALPHA
-0.14
497
-0.14
ouch
-0.14
̧
-0.14
odyn
-0.14
167
-0.14
LEAN
-0.14
POSITIVE LOGITS
fun
0.53
FUN
0.44
fun
0.41
Fun
0.40
Fun
0.38
FUN
0.37
blast
0.35
_fun
0.35
.fun
0.34
fun
0.34
Activations Density 0.043%