INDEX
Explanations
mentions of the word "cheek"
references to cheek and facial features
New Auto-Interp
Negative Logits
mosquit
-0.72
FORM
-0.69
exponential
-0.67
ENCE
-0.66
ENC
-0.63
olig
-0.63
redistributed
-0.62
unsustainable
-0.62
CTR
-0.61
mosqu
-0.61
POSITIVE LOGITS
bones
1.53
bone
1.30
cheek
1.02
pieces
1.02
piece
0.97
beat
0.92
Bone
0.86
poke
0.86
bone
0.85
bags
0.82
Activations Density 0.016%