INDEX
    Explanations

    the word "war" in various contexts

    occurrences of the term "war."

    New Auto-Interp
    Negative Logits
    sembly
    -0.82
    essee
    -0.81
    ĸļ
    -0.77
    htaking
    -0.76
    aminer
    -0.74
    İĭ
    -0.73
    afort
    -0.69
    Ħ¢
    -0.69
     tremend
    -0.69
     vulnerable
    -0.67
    POSITIVE LOGITS
    rior
    1.41
    riors
    1.36
    fare
    1.35
    war
    1.04
    ring
    1.01
    lords
    0.89
    ney
    0.87
    fighter
    0.87
    hammer
    0.84
    bucks
    0.84
    Act Density 0.006%

    No Known Activations