INDEX
Explanations
mentions of deer
references to deer
New Auto-Interp
Negative Logits
ppard
-0.74
ochemical
-0.70
acist
-0.65
anco
-0.64
hran
-0.64
sid
-0.64
uyomi
-0.63
inx
-0.63
colo
-0.63
Mellon
-0.62
POSITIVE LOGITS
deer
1.14
hunter
0.96
hunters
0.93
herds
0.86
beetle
0.84
hunting
0.83
hunter
0.79
herd
0.76
carc
0.76
Rapids
0.75
Activations Density 0.008%