Explanations
references to dancing or dance-related activities
references to dance and its various forms
Negative Logits
Token               Logit
orters              -0.69
ython               -0.66
agar                -0.64
################    -0.64
iary                -0.63
urion               -0.61
krit                -0.60
umenthal            -0.60
ayson               -0.60
arah                -0.59
Positive Logits
Token               Logit
floor               1.16
hall                1.05
flo                 0.94
chore               0.86
Dance               0.83
ballet              0.82
dancers             0.81
dancer              0.80
goers               0.80
dancing             0.79
Activations Density: 0.045%
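
To make the density figure concrete, here is a minimal sketch of how such a value could be computed. It assumes that activation density means the fraction of token positions where the feature's activation is above zero; the page itself does not define the term. The function name `activation_density` and the sample data are hypothetical and chosen only so that the result matches the 0.045% shown above.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" is read as the share of token positions on which
    the feature fires. This definition is not stated in the source page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 9 of 20,000 token positions.
sample = [0.0] * 19_991 + [1.2] * 9
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.045%
```

Under this reading, a density of 0.045% corresponds to roughly 4.5 active positions per 10,000 tokens, which fits a feature tied to a narrow topic such as dance.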