INDEX
Explanations
words related to computer mice
instances of the word "mouse" along with related terminology
New Auto-Interp
Negative Logits
namese
-0.82
rating
-0.82
rators
-0.77
orian
-0.76
ivals
-0.76
Buffy
-0.73
inyl
-0.73
ivism
-0.73
rates
-0.72
ieves
-0.70
POSITIVE LOGITS
cursor
1.09
pointer
1.00
pad
0.95
wheel
0.84
pox
0.83
Mouse
0.82
cules
0.78
flies
0.76
mouse
0.76
lette
0.75
Activations Density 0.036%