INDEX
Explanations
mentions of professionals with the title "Assistant" at various organizations
references to individuals with the title "Assistant" or similar roles
New Auto-Interp
Negative Logits
yt
-0.77
stamped
-0.67
spir
-0.67
myth
-0.66
ra
-0.64
convention
-0.64
stra
-0.63
lived
-0.62
sands
-0.62
purity
-0.61
POSITIVE LOGITS
Assistant
3.73
Assistant
3.34
assistant
2.55
Associate
1.89
assistants
1.87
Deputy
1.83
Assist
1.62
Acting
1.62
Inspector
1.57
Coordinator
1.53
Activations Density 0.017%