INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Greece
    -0.09
     Kurdish
    -0.09
     restore
    -0.08
     bids
    -0.08
    .LE
    -0.08
    .restore
    -0.08
     restored
    -0.08
    ANGO
    -0.08
    .MODE
    -0.07
     restoring
    -0.07
    POSITIVE LOGITS
     Afrika
    0.11
     Afr
    0.11
     apartheid
    0.09
     harassment
    0.09
     bad
    0.08
    Afr
    0.08
     voed
    0.08
     ûnder
    0.08
    áne
    0.08
     Tijd
    0.08
    Act Density 0.014%

    No Known Activations