INDEX
Explanations
references to the term "Wolf" at different levels of specificity, possibly related to a specific topic or entity named "Wolf"
occurrences of the word "Wolf" in various contexts
the word "Wolf" regardless of context
Negative Logits
bably      -0.91
pora       -0.81
acterial   -0.80
ntil       -0.77
acists     -0.77
thritis    -0.73
ursed      -0.73
tradem     -0.73
senal      -0.72
llular     -0.72
Positive Logits
Wolf       1.29
enstein    1.26
hound      1.13
Wolf       1.13
sburg      1.06
Wolves     1.03
pack       0.98
owitz      0.98
Fenrir     0.95
bats       0.94
Activations Density 0.007%