INDEX
Explanations
the word "God" in various contexts
occurrences of the word "od."
Negative Logits
Ago        -0.74
Citadel    -0.64
silence    -0.61
shorth     -0.61
▬          -0.60
Quit       -0.59
Tempest    -0.58
Goat       -0.58
Jihad      -0.57
mu         -0.57
Positive Logits
yssey      1.15
od         1.13
sworth     1.12
opter      1.10
odon       1.09
iamond     1.08
unn        1.03
amn        1.00
ont        0.99
ata        0.99
Activations Density 0.012%