    NEGATIVE LOGITS
    " бюдж"          -0.07
    " pj"            -0.07
    " resurgence"    -0.07
    " meilleure"     -0.07
    "Combine"        -0.06
    " proletariat"   -0.06
    "MMMM"           -0.06
    " 유지"          -0.06
    "Decoration"     -0.06
    "gin"            -0.06
    POSITIVE LOGITS
    " bartender"     0.07
    " Bar"           0.06
    " cafe"          0.06
    "537"            0.06
    " Tavern"        0.06
    " bist"          0.06
    "aim"            0.06
    " tavern"        0.06
    " Cafe"          0.06
    " bar"           0.06
    Activation Density: 0.011%

    No Known Activations