INDEX
Explanations
phrases related to cows
the presence of the word "cow" in various contexts
New Auto-Interp
Negative Logits
Extend
-0.81
mble
-0.68
mith
-0.66
UTC
-0.65
eways
-0.64
Flavoring
-0.64
Spac
-0.63
ovsky
-0.63
satell
-0.63
arial
-0.62
POSITIVE LOGITS
boys
1.43
riter
1.34
ards
1.33
ritten
1.32
girl
1.12
bell
1.09
rie
1.01
ork
0.99
arding
0.98
ard
0.98
Activations Density 0.030%