INDEX
    Explanations

    days of the week

    Negative Logits
    Token       Logit
    "acus"      -0.93
    "emort"     -0.86
    "emale"     -0.82
    "inav"      -0.78
    "anooga"    -0.73
    "ethical"   -0.72
    "icidal"    -0.72
    "umbn"      -0.71
    "itars"     -0.70
    "icide"     -0.70

    Positive Logits
    Token       Logit
    "pring"     1.00
    "dream"     0.97
    "days"      0.92
    " ago"      0.90
    "ilver"     0.82
    "trip"      0.81
    "DAY"       0.79
    "hops"      0.75
    "care"      0.74
    "hift"      0.74

    Activation Density: 11.434%

    No Known Activations