    Explanations

    mentions of roads or road-related incidents and safety measures

    Negative Logits
    Token          Logit
    "arians"       -0.85
    "ividual"      -0.82
    "illian"       -0.76
    "irements"     -0.68
    " Hots"        -0.67
    "tle"          -0.67
    "rator"        -0.65
    "emort"        -0.65
    "uates"        -0.64
    "ropolitan"    -0.64
    Positive Logits
    Token          Logit
    "ways"         1.27
    "blocks"       1.21
    "trip"         1.18
    "block"        1.04
    "side"         1.01
    "show"         0.98
    "map"          0.94
    "hog"          0.91
    "fare"         0.89
    "runner"       0.88
    Activation Density: 0.026%

    No Known Activations