INDEX
    Explanations

    terms related to news articles reporting on events or situations involving people

    references to individuals involved in incidents or events, particularly regarding crime and investigation

    Head Attr Weights
    0:0.06
    1:0.04
    2:0.06
    3:0.09
    4:0.02
    5:0.24
    6:0.07
    7:0.06
    8:0.11
    9:0.05
    10:0.11
    11:0.05
    Negative Logits
     latt
    -1.16
    ゴン
    -1.11
     Gil
    -1.01
     burner
    -1.01
     Potato
    -1.01
    plet
    -1.01
     Cannes
    -1.00
     mush
    -0.97
     Bliss
    -0.97
     Mell
    -0.96
    Positive Logits
    ardless
    1.07
     shouldn
    1.07
    Crime
    1.06
    nob
    1.05
    Net
    1.05
    facebook
    1.04
    seless
    1.04
    BER
    1.03
    rac
    1.00
    eren
    1.00
    Act Density 0.237%

    No Known Activations