INDEX
    Explanations

    instances related to driving, vehicles, and car accidents

    mentions of drivers in various contexts

    Negative Logits
    Token            Logit
    "iversal"        -0.80
    "aeper"          -0.79
    "yss"            -0.78
    "achu"           -0.77
    " Flavoring"     -0.76
    "ertain"         -0.73
    "ropolitan"      -0.72
    "reme"           -0.72
    "pta"            -0.72
    "orkshire"       -0.71
    Positive Logits
    Token            Logit
    " drivers"       0.94
    " driver"        0.93
    " pige"          0.90
    "driving"        0.84
    "driver"         0.83
    "less"           0.83
    " bees"          0.80
    " driving"       0.78
    " Drivers"       0.78
    "wings"          0.77
    Act Density 0.016%

    No Known Activations