INDEX
Explanations
references to mystery and detective literature
New Auto-Interp
Negative Logits
ivec
-0.15
æ²ĸ
-0.14
apsed
-0.14
troub
-0.14
γοÏħ
-0.14
otron
-0.14
æľĭ
-0.14
Mellon
-0.14
elig
-0.14
.Flag
-0.13
POSITIVE LOGITS
Herc
0.40
Christie
0.35
Hastings
0.28
Ag
0.25
Miss
0.22
Orient
0.22
Inspector
0.21
Belgian
0.20
Miss
0.20
Belg
0.20
Activations Density 0.002%