INDEX
Explanations
the word "watch" in various contexts
mentions of the verb "watch."
New Auto-Interp
Negative Logits
interstitial
-0.74
misunderstanding
-0.72
ãĤ¨ãĥ«
-0.69
pse
-0.66
hemorrh
-0.66
activation
-0.66
phi
-0.65
ãĥ´ãĤ¡
-0.64
xual
-0.64
ctrl
-0.63
POSITIVE LOGITS
tower
1.28
watch
1.15
Watching
1.09
watches
1.06
Watch
1.04
watch
1.01
watching
0.94
dogs
0.91
Watch
0.88
watched
0.88
Activations Density 0.014%