INDEX
Explanations
occurrences of a specific character or name in the text
New Auto-Interp
Negative Logits
eyse
-0.19
est
-0.16
ease
-0.16
eh
-0.16
anja
-0.16
ông
-0.15
tsy
-0.15
an
-0.15
typings
-0.14
atz
-0.14
POSITIVE LOGITS
ermo
0.24
eros
0.21
acker
0.20
rock
0.20
Th
0.19
eron
0.19
eres
0.18
oms
0.18
wait
0.18
ê·¹
0.18
Activations Density 0.016%