INDEX
    Explanations

    specific actions or objects related to physical altercations or conflicts

    mentions of specific objects, individuals, or significant topics within a narrative

    New Auto-Interp
    Negative Logits
    ĪĴ
    -0.53
    ¿½
    -0.52
     Emin
    -0.51
     Leilan
    -0.49
     confir
    -0.48
     conclud
    -0.47
     surpr
    -0.46
     Rampage
    -0.46
     Aires
    -0.45
    ãĥ´
    -0.44
    POSITIVE LOGITS
     badge
    0.56
     onto
    0.52
     hostage
    0.48
     button
    0.45
     Seal
    0.45
    pes
    0.45
     reins
    0.45
    onto
    0.45
     curse
    0.45
     securely
    0.44
    Act Density 1.656%

    No Known Activations