INDEX
    Explanations

    mentions of political conflicts and international events related to war and refugee crises

    Negative Logits
    ''.             -0.53
    ".              -0.50
    thood           -0.50
    $.              -0.49
     ".             -0.46
     boil           -0.45
    EStreamFrame    -0.45
    .''.            -0.45
    '.              -0.44
     bluff          -0.44

    Positive Logits
     meanwhile      0.57
     countered      0.56
     wrote          0.54
     reacted        0.53
     commented      0.53
     Fr             0.53
     echoed         0.52
    WARN            0.51
     Vital          0.50
     Ori            0.49
    Activation Density: 0.733%

    No Known Activations