INDEX
Explanations
phrases related to human faces
references to faces or the concept of a "face" in various contexts
New Auto-Interp
Negative Logits
æ©Ł
-0.88
icult
-0.77
icultural
-0.72
RY
-0.66
>>\
-0.65
ighting
-0.65
iculture
-0.64
ary
-0.63
CAST
-0.63
reat
-0.62
POSITIVE LOGITS
plate
0.96
faces
0.94
plant
0.93
face
0.93
face
0.86
BOOK
0.86
offs
0.83
faces
0.82
hog
0.81
beard
0.80
Activations Density 0.031%