INDEX
Explanations
words related to human face or facial features
the word "fac" and its variations, indicating a focus on facial features or conditions
New Auto-Interp
Negative Logits
went
-0.72
ettle
-0.68
Pole
-0.67
oshi
-0.67
itter
-0.65
Orion
-0.64
usha
-0.63
Nib
-0.63
linux
-0.61
adder
-0.61
POSITIVE LOGITS
fac
3.86
Fac
2.39
fac
2.21
Fac
2.13
facade
1.36
FAC
1.34
facial
1.17
fa
1.15
voic
1.10
facilit
1.07
Activations Density 0.018%