INDEX
Explanations
words with 'fl' followed by other letters, potentially related to food items
the presence of the substring "fl" in various contexts
New Auto-Interp
Negative Logits
hood
-0.66
Skydragon
-0.64
Beware
-0.64
RAL
-0.63
MENTS
-0.62
pleas
-0.61
messenger
-0.60
illas
-0.60
Nightmares
-0.60
MENT
-0.59
POSITIVE LOGITS
oyd
1.32
owers
1.28
uffy
1.28
avored
1.27
oppy
1.23
ickr
1.20
avour
1.16
orescent
1.10
urry
1.10
utters
1.09
Activations Density 0.025%