INDEX
Explanations
references to faces and face-related concepts
face-to-face interaction
New Auto-Interp
Negative Logits
Albany
-0.60
expandindo
-0.56
Tikang
-0.54
Shrewsbury
-0.52
CTP
-0.51
oporosis
-0.50
Tivoli
-0.50
testens
-0.50
錦
-0.50
Brewers
-0.48
POSITIVE LOGITS
Face
1.33
face
1.33
Face
1.23
FACE
1.20
face
1.20
FACE
1.13
Faces
0.98
faces
0.97
Faces
0.96
faces
0.90
Activations Density 0.015%