INDEX
Explanations
instances of the word "shut" in various contexts
occurrences of the word "shut"
Negative Logits
Token         Logit
ertodd        -0.77
Values        -0.75
omething      -0.75
Values        -0.73
Occupations   -0.72
lihood        -0.69
ITNESS        -0.69
enegger       -0.67
abund         -0.66
Fine          -0.65
Positive Logits
Token         Logit
tered         1.44
tering        1.26
tle           1.06
ters          1.00
downs         0.97
down          0.93
lock          0.92
shut          0.92
down          0.89
outs          0.87
Activations Density 0.015%