INDEX
    Explanations

    words related to fires and emergencies

    mentions of fires or fire-related incidents

    New Auto-Interp
    Negative Logits
    xus
    -0.83
    laus
    -0.74
     Vide
    -0.74
     Freed
    -0.73
     Lans
    -0.72
    atem
    -0.72
     Virtue
    -0.72
    sembly
    -0.70
    amaru
    -0.68
    VIDIA
    -0.68
    POSITIVE LOGITS
    flies
    1.16
    storm
    1.14
     exting
    1.10
    proof
    1.07
    balls
    1.02
    storms
    1.02
    fighting
    1.00
    fly
    1.00
    trap
    0.98
    fight
    0.96
    Act Density 0.029%

    No Known Activations