INDEX
    Explanations

    code parameters

    New Auto-Interp
    Negative Logits
    sus
    -0.08
    reinterpret
    -0.07
    N
    -0.07
    -0.07
     Lum
    -0.07
    Checker
    -0.07
     supplements
    -0.07
    -0.07
    Lum
    -0.07
    anju
    -0.07
    POSITIVE LOGITS
     ondernemer
    0.09
     expressão
    0.09
     schade
    0.09
    (worker
    0.08
     apresentação
    0.08
     apresentações
    0.08
     оны
    0.08
     bri
    0.08
     bestuurder
    0.08
     Meetings
    0.08
    Act Density 0.003%

    No Known Activations