INDEX
Explanations
words related to the tongue
references to the word "tongue" and its variations in various contexts
New Auto-Interp
Negative Logits
ded
-0.91
irements
-0.80
rity
-0.77
ding
-0.75
Parenthood
-0.73
Ct
-0.71
ividual
-0.70
rises
-0.70
Forestry
-0.70
Occupations
-0.69
POSITIVE LOGITS
tongue
1.18
tongues
0.90
lips
0.86
lip
0.83
sey
0.80
mouth
0.79
aware
0.77
ice
0.76
poke
0.76
piece
0.75
Activations Density 0.008%