INDEX
Explanations
references to the name "Wolf", particularly in different contexts such as names, places, and titles
occurrences of the word "Wolf" in various contexts
Negative Logits
bably      -0.92
nces       -0.80
conflic    -0.79
ntil       -0.78
pora       -0.77
senal      -0.76
tradem     -0.76
ngth       -0.75
acists     -0.74
unnecess   -0.73
Positive Logits
enstein    1.29
hound      1.17
sburg      1.15
Wolf       1.11
Wolf       1.03
pack       1.01
Wolves     0.98
owitz      0.98
ram        0.96
bats       0.92
Activations Density 0.011%