INDEX
    Explanations
    Negative Logits
    " pleaſure"         -0.98
    " autorytatywna"    -0.90
    " Governor"         -0.90
    "sizeCache"         -0.86
    "Governor"          -0.84
    " gubern"           -0.84
    " utafitiHapana"    -0.80
    "RenderAtEndOf"     -0.79
    " penguins"         -0.79
    " Governors"        -0.78
    Positive Logits
    "ed"      0.90
    "ing"     0.84
    "age"     0.65
    "e"       0.65
    "ages"    0.60
    "ING"     0.58
    "ie"      0.56
    "ee"      0.55
    "hr"      0.55
    "al"      0.54
    Activation Density: 0.040%

    No Known Activations