INDEX
    Explanations

    references to car features and specifications

    New Auto-Interp
    Negative Logits
    InputText
    -0.38
     face
    -0.37
     foreground
    -0.36
    Ow
    -0.35
     FACE
    -0.33
     vorg
    -0.33
     Gabel
    -0.33
     Mask
    -0.32
    face
    -0.32
     Face
    -0.32
    POSITIVE LOGITS
     rear
    1.95
    rear
    1.66
     Rear
    1.62
    Rear
    1.59
     REAR
    1.46
     tail
    1.32
     posterior
    1.22
     posteriore
    1.17
     trasero
    1.16
     Tail
    1.14
    Act Density 0.459%

    No Known Activations