INDEX
    Explanations

    names of companies and specific events or locations

    mentions of specific brands, notably "Nestlé," and references to the holiday "Easter."

    New Auto-Interp
    Negative Logits
    istic
    -0.83
    istically
    -0.77
    istics
    -0.61
    ist
    -0.60
    ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
    -0.60
    ights
    -0.58
    554
    -0.57
     nom
    -0.57
    mith
    -0.56
    xxxxxxxx
    -0.56
    POSITIVE LOGITS
    lé
    1.43
    dale
    1.10
    led
    0.98
    lings
    0.98
    lies
    0.94
     Bunny
    0.90
    lements
    0.89
    ding
    0.89
     Eggs
    0.88
    ea
    0.86
    Act Density 0.091%

    No Known Activations