INDEX
Explanations
references to car features and specifications
New Auto-Interp
Negative Logits
InputText
-0.38
face
-0.37
foreground
-0.36
Ow
-0.35
FACE
-0.33
vorg
-0.33
Gabel
-0.33
Mask
-0.32
face
-0.32
Face
-0.32
POSITIVE LOGITS
rear
1.95
rear
1.66
Rear
1.62
Rear
1.59
REAR
1.46
tail
1.32
posterior
1.22
posteriore
1.17
trasero
1.16
Tail
1.14
Activations Density 0.459%