INDEX
Explanations
instances of the word "so" indicating emphasis or conclusion
Negative Logits
core      -0.16
нице      -0.16
prd       -0.16
kaar      -0.15
едь       -0.15
tal       -0.15
panies    -0.15
ẻ         -0.14
lator     -0.14
dale      -0.14
Positive Logits
-called   0.33
forth     0.22
oner      0.21
aken      0.19
far       0.19
ething    0.19
써        0.19
forth     0.19
far       0.19
hn        0.19
Activations Density 0.087%