INDEX
    Explanations

    topics related to societal issues and public opinion

    New Auto-Interp
    Negative Logits
    areth
    -0.14
    aze
    -0.14
    ango
    -0.14
    usalem
    -0.13
    èĢĮ
    -0.13
    azes
    -0.13
     whereas
    -0.13
     omdat
    -0.13
    yled
    -0.13
    ******/
    -0.13
    POSITIVE LOGITS
     же
    0.23
     here
    0.22
     wasn
    0.17
     therefore
    0.16
    å¦ĤæŃ¤
    0.16
     zde
    0.15
    lamaz
    0.15
    here
    0.15
     then
    0.15
     cannot
    0.15
    Act Density 0.498%

    No Known Activations