INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     competition
    -0.08
     tedious
    -0.08
     nasty
    -0.08
     handige
    -0.08
    Casino
    -0.08
     ringtone
    -0.08
     Competition
    -0.08
     Casino
    -0.08
     repe
    -0.07
     troubles
    -0.07
    POSITIVE LOGITS
     silhouettes
    0.11
     силу
    0.11
     depicting
    0.10
     depict
    0.10
     depicted
    0.10
     depicts
    0.09
    -outline
    0.09
    พระ
    0.09
     continents
    0.09
    0.09
    Act Density 0.028%

    No Known Activations