INDEX
    Explanations

    mentions of wars and military conflicts

    New Auto-Interp
    Negative Logits
    <bos>
    -0.62
    ConstraintMaker
    -0.51
    -0.50
    -0.50
    刺客
    -0.50
    -0.48
    称号
    -0.48
     huelga
    -0.48
     nakalista
    -0.48
     nev
    -0.47
    POSITIVE LOGITS
     swarovski
    1.26
     hairc
    1.25
     war
    1.21
     Darío
    1.18
     WAR
    1.16
     unwarran
    1.14
     unlaw
    1.14
     nutella
    1.13
     War
    1.11
    War
    1.11
    Act Density 0.091%

    No Known Activations