INDEX
Explanations
variations of the word "zigzag."
Negative Logits
oir       -0.17
Ekon      -0.16
thren     -0.16
hani      -0.15
inverse   -0.15
ÏĦιο      -0.15
gaard     -0.14
Kim       -0.14
inverse   -0.14
goods     -0.14
Positive Logits
zag       0.24
Zag       0.19
zap       0.19
mund      0.18
zig       0.17
zag       0.17
duino     0.16
Nun       0.15
zig       0.15
-z        0.15
Activations Density 0.012%