INDEX
Explanations
phrases related to the action of shutting something
instances of the word "shut" in various contexts
Negative Logits
ertodd      -0.75
lihood      -0.72
omething    -0.72
Values      -0.68
endors      -0.68
Fine        -0.67
agher       -0.66
ampton      -0.65
ordan       -0.65
abund       -0.65
Positive Logits
tered       1.43
tering      1.25
tle         1.02
ters        1.00
downs       0.97
down        0.95
tun         0.89
down        0.89
outs        0.88
terness     0.88
Activations Density 0.016%