INDEX
Explanations
occurrences of the letter 'F' in various contexts
New Auto-Interp
Negative Logits
lore
-0.17
loat
-0.17
actor
-0.16
faithful
-0.15
loe
-0.15
abad
-0.15
isher
-0.15
EMA
-0.15
indo
-0.15
elts
-0.15
POSITIVE LOGITS
akah
0.17
asher
0.15
duct
0.15
ash
0.15
Bund
0.14
ickou
0.14
omba
0.14
å®ļçļĦ
0.14
comb
0.14
tit
0.13
Activations Density 0.023%