INDEX
Explanations
instances of the letter 'f' in various contexts
Negative Logits
ypy     -0.19
wert    -0.17
VERS    -0.16
ire     -0.15
vala    -0.15
itz     -0.15
oring   -0.15
throp   -0.15
PEG     -0.15
ires    -0.14

Positive Logits
ic      0.19
ase     0.17
oment   0.16
iche    0.16
achu    0.16
het     0.15
ilde    0.15
ision   0.15
etic    0.15
idel    0.15
Activations Density: 0.013%