INDEX
    Explanations
    Negative Logits
        " handset"       -0.08
        " Silicon"       -0.08
        " Weihnachts"    -0.08
        "entscheidung"   -0.08
        "comfort"        -0.07
        " silicon"       -0.07
        " школь"         -0.07
        " Komfort"       -0.07
        " genotype"      -0.07
        " Wayne"         -0.07
    Positive Logits
        " marched"       0.09
        " flutter"       0.09
        " rouges"        0.08
        "mixed"          0.08
        " warfare"       0.08
        " під"           0.08
        "्ञ"             0.08
        " stiffness"     0.08
        " crossed"       0.08
        "战争"           0.08
    Activation Density: 0.003%

    No Known Activations