INDEX
Explanations
references to bats and related actions or objects
variations of the word "bat" in different contexts
New Auto-Interp
Negative Logits
LV
-0.68
éĸ
-0.65
zanne
-0.63
eous
-0.61
mble
-0.60
naire
-0.59
xia
-0.59
taining
-0.59
SPACE
-0.57
ately
-0.57
POSITIVE LOGITS
tered
1.22
allion
1.15
avia
1.07
hing
1.01
ched
0.96
iatus
0.96
cher
0.95
tell
0.94
ches
0.94
chers
0.93
Activations Density 0.030%