INDEX
    Explanations
    Negative Logits
    Token         Logit
    "aughed"      -0.79
    "emaker"      -0.68
    "imov"        -0.68
    "henko"       -0.68
    "ashtra"      -0.64
    "rosse"       -0.62
    "essee"       -0.61
    "urat"        -0.60
    " Trouble"    -0.60
    "76561"       -0.60
    Positive Logits
    Token           Logit
    "adays"         0.98
    "abouts"        0.92
    " afternoon"    0.87
    " morning"      0.85
    "days"          0.76
    "lights"        0.74
    " evening"      0.71
    " marks"        0.70
    "tics"          0.69
    "here"          0.69
    Activation Density: 0.390%

    No Known Activations