INDEX
Explanations
instances of the word "sitting" and its variations
New Auto-Interp
Negative Logits
xual
-0.73
iler
-0.73
ctr
-0.68
selage
-0.66
acco
-0.65
CLSID
-0.65
iling
-0.64
hiba
-0.62
usable
-0.62
visual
-0.62
POSITIVE LOGITS
atop
1.09
ducks
1.01
comfortably
0.94
uate
0.89
idle
0.80
dormant
0.79
une
0.78
DOWN
0.77
silently
0.77
quietly
0.75
Activations Density 0.016%