INDEX
    Explanations

    phrases related to counting or quantifying

    references to audiences or demographics in discussions

    New Auto-Interp
    Negative Logits
     sunset
    -0.59
     Collider
    -0.56
     Diver
    -0.54
     Versus
    -0.54
     Simulation
    -0.53
     Moonlight
    -0.52
     Chimera
    -0.51
     Vulkan
    -0.51
     Lethal
    -0.51
     Archdemon
    -0.50
    POSITIVE LOGITS
    selves
    0.69
     whom
    0.68
    who
    0.64
    heastern
    0.64
    rosse
    0.63
    dinand
    0.63
    anners
    0.62
    irs
    0.61
     alike
    0.61
    hari
    0.59
    Act Density 1.944%

    No Known Activations