INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.08
    ca
    -0.07
    -0.06
    -0.06
    Atl
    -0.06
    efa
    -0.06
     собі
    -0.06
     Соб
    -0.06
    Defense
    -0.06
    повід
    -0.06
    POSITIVE LOGITS
    ILog
    0.07
     TypeScript
    0.07
     TODO
    0.06
     wellness
    0.06
    comment
    0.06
    SCRI
    0.06
    ива
    0.06
    .character
    0.06
     APPRO
    0.06
     flavorful
    0.06
    Act Density 0.003%

    No Known Activations