INDEX
Explanations
words related to supervision, oversight, or management
terms related to supervision and accomplices in various contexts
New Auto-Interp
Negative Logits
bed
-0.70
gif
-0.69
DEN
-0.68
lights
-0.68
REDACTED
-0.66
ARB
-0.65
cloth
-0.64
boat
-0.64
Shed
-0.63
rad
-0.63
POSITIVE LOGITS
ising
1.86
ises
1.85
ise
1.73
isons
1.68
ised
1.65
isions
1.65
ices
1.59
isance
1.52
isers
1.52
isable
1.51
Activations Density 0.077%