INDEX
Explanations
mentions of dogs, particularly sled dogs
references to dogs and related topics
Negative Logits
accumulated      -0.71
installations    -0.67
arta             -0.66
ibal             -0.66
ista             -0.65
afer             -0.64
srf              -0.64
itent            -0.62
idences          -0.61
airo             -0.61
Positive Logits
dog              1.52
dogs             1.43
ertodd           1.01
loo              0.93
Dog              0.93
ĪĴ               0.91
wig              0.89
bowl             0.79
cake             0.78
cat              0.78
Activations Density 0.013%