INDEX
    Explanations

    mentions of cars and automotive-related terms

    New Auto-Interp
    Negative Logits
     Tiefen
    -0.85
    )");
    
    -0.85
    ]")]
    -0.83
     Radu
    -0.80
    tals
    -0.79
    ]").
    -0.79
     '\\;'
    -0.76
    {}{}
    -0.75
     hydrauli
    -0.75
    }".
    -0.75
    POSITIVE LOGITS
     car
    1.70
     Car
    1.58
    Car
    1.54
    car
    1.53
     cars
    1.52
     CAR
    1.45
     Cars
    1.40
    CAR
    1.40
    Cars
    1.33
    cars
    1.33
    Act Density 0.089%

    No Known Activations