INDEX
Explanations
The neuron activates on tokens related to authentication/authorization (i.e. those containing the “auth” stem).
components related to authentication processes.
New Auto-Interp
Negative Logits
pile
-0.10
radius
-0.08
Dar
-0.07
trabal
-0.07
Pier
-0.07
�
-0.07
smallest
-0.07
competency
-0.07
Rocky
-0.07
percentile
-0.07
POSITIVE LOGITS
auth
0.09
TH
0.09
auth
0.09
Authentication
0.08
th
0.08
.authentication
0.08
uth
0.08
(auth
0.07
_auth
0.07
authentic
0.07
Activations Density 0.011%