INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     integrates
    -0.08
    ouders
    -0.08
    /stretchr
    -0.08
     combat
    -0.07
    occupation
    -0.07
    Particle
    -0.07
    illit
    -0.07
     uncon
    -0.07
     daytime
    -0.07
    focus
    -0.07
    POSITIVE LOGITS
     Extras
    0.09
     مورد
    0.09
     Libre
    0.09
     Grafik
    0.09
     그래
    0.08
     Gere
    0.08
    .yaml
    0.08
     Ре
    0.08
     Liberia
    0.08
    Dal
    0.08
    Act Density 0.001%

    No Known Activations