INDEX
Explanations
references to historical figures and events
New Auto-Interp
Negative Logits
ÏĬ
-0.17
Dante
-0.16
ÑĨо
-0.16
bish
-0.15
Nim
-0.15
ãĤ·ãĥ£ãĥ«
-0.15
Anglic
-0.15
Galaxy
-0.15
_TAC
-0.15
izza
-0.15
POSITIVE LOGITS
Spartan
0.28
Athens
0.27
hop
0.25
Lesbian
0.25
Athen
0.25
Marathon
0.24
hop
0.22
Maced
0.21
Hop
0.21
Syracuse
0.20
Activations Density 0.031%