INDEX
    Explanations

    political references or news articles

    Negative Logits
    Token            Logit
    "<bos>"          -0.97
    " pinulongan"    -0.67
    " Normdatei"     -0.58
    "WriteBarrier"   -0.56
    " NUKAT"         -0.55
    "انيف"           -0.53
    "OGND"           -0.52
    "الإنجليزية"     -0.50
    " ffilmiau"      -0.50
    "<eos>"          -0.49
    Positive Logits
    Token            Logit
    " emphat"        1.33
    " accla"         1.07
    " practition"    1.04
    " Khart"         0.98
    " maneu"         0.97
    " reluct"        0.97
    " volunte"       0.94
    " fta"           0.94
    " embra"         0.94
    " disagre"       0.93
    Activation Density: 1.499%

    No Known Activations