INDEX
Explanations
mentions of the word "Hunt" or variations of it
variations of the word "hunting."
New Auto-Interp
Negative Logits
subtract
-0.66
Birch
-0.64
Colossus
-0.64
compliment
-0.63
connecting
-0.62
AGE
-0.62
Mour
-0.62
oral
-0.62
toe
-0.61
chloride
-0.61
POSITIVE LOGITS
ters
1.26
cheon
1.21
ches
1.08
ting
1.05
kered
1.01
sts
1.01
eteenth
1.01
zees
0.99
ãĤ§
0.98
ger
0.98
Activations Density 0.008%