Explanations
references to faces or facial features
references to "face" in various contexts
Negative Logits
Token        Logit
æ©Ł          -0.87
icult        -0.73
iculture     -0.71
icultural    -0.71
CAST         -0.69
RY           -0.67
ary          -0.65
ighting      -0.65
icut         -0.65
ally         -0.65
Positive Logits
Token        Logit
plate        0.96
plant        0.96
BOOK         0.96
face         0.86
offs         0.85
face         0.81
faces        0.81
hog          0.81
plates       0.80
faces        0.79
Activations Density: 0.037%