INDEX
Explanations
references to laughter or laughing
instances of the word "laugh" and its variations, indicating a focus on humor and laughter
Negative Logits
struct    -0.73
ģ         -0.71
des       -0.71
area      -0.67
Stra      -0.65
Prec      -0.64
di        -0.63
mi        -0.63
Dri       -0.63
abling    -0.62
Positive Logits
laugh     3.95
chuckle   2.83
laughs    2.10
laughter  2.10
laughing  1.99
smile     1.91
joke      1.84
grin      1.79
Laugh     1.77
laugh     1.75
Activations Density 0.010%
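
The page does not say how the density figure is computed. One common reading, assumed here, is the fraction of token positions on which the feature's activation is above zero, so 0.010% would mean roughly 1 active position per 10,000 tokens. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means the share of token positions where the
    feature fires at all. The source page does not confirm this definition.
    """
    values = list(activations)
    if not values:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in values if a > threshold)
    return active / len(values)


# Hypothetical data: one active position among 10,000 tokens.
sample = [0.0] * 9_999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.010%
```

Under this assumption, a feature this sparse fires on very few tokens, which fits a feature tied to a narrow set of words such as "laugh" and its variants.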