INDEX
Explanations
repeated mentions of the word "sam" or its variations, indicating a focus on character names or identifiers
Negative Logits
Token         Logit
ÂĽ            -0.18
anel          -0.17
iba           -0.15
ansa          -0.14
isclosed      -0.14
ULSE          -0.14
ust           -0.13
Confidence    -0.13
Murdoch       -0.13
usta          -0.13
Positive Logits
Token         Logit
970           0.17
ex            0.16
bou           0.15
inos          0.14
à§įà¦         0.14
fv            0.14
723           0.14
ório          0.14
dana          0.14
urger         0.14
Activations Density 0.023%