INDEX
Explanations
occurrences of the word "bush."
New Auto-Interp
Negative Logits
gers
-0.22
Marca
-0.18
ees
-0.17
stadt
-0.17
ean
-0.16
ози
-0.15
strup
-0.15
ces
-0.14
yectos
-0.14
zig
-0.14
POSITIVE LOGITS
nell
0.29
wick
0.27
ido
0.27
fires
0.23
IDO
0.22
fire
0.22
wh
0.21
craft
0.21
els
0.20
-league
0.20
Activations Density 0.011%