Explanations
repeated instances of the substring "ines" in the text
Negative Logits
rett      -0.18
wa        -0.17
asis      -0.16
ka        -0.15
erson     -0.15
Alo       -0.15
re        -0.15
sim       -0.15
du        -0.15
land      -0.14
Positive Logits
ooth       0.15
Ör         0.14
ukkan      0.14
aira       0.14
TokenName  0.14
ília       0.14
%+         0.14
ampie      0.14
วน         0.14
ael        0.14
Activations Density 0.009%