INDEX
Explanations
occurrences of the prefix "re-" indicating repetition or return
Negative Logits
blunt    -0.16
kaar     -0.15
tal      -0.15
nid      -0.15
antee    -0.15
gros     -0.15
ansa     -0.15
Albert   -0.14
rep      -0.14
Yar      -0.14
Positive Logits
chts     0.27
ih       0.21
ise      0.20
iseum    0.19
chn      0.19
kon      0.19
bell     0.19
cht      0.19
edere    0.18
chte     0.18
Activations Density 0.006%