INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     grotes
    0.62
     člove
    0.60
     grotesque
    0.59
     cowardly
    0.59
     vicious
    0.58
     ignorant
    0.57
     hypoc
    0.55
     sly
    0.55
     dehuman
    0.54
     savag
    0.54
    POSITIVE LOGITS
     Saturday
    0.89
     Workshop
    0.88
     scheduled
    0.87
     sessions
    0.84
     Workshops
    0.80
     afternoon
    0.79
     Thursday
    0.78
    schedule
    0.78
     Session
    0.78
    Saturday
    0.77
    Act Density 0.049%

    No Known Activations