INDEX
    Explanations

    Writing/Coding

    New Auto-Interp
    Negative Logits
     GER
    -0.08
     çev
    -0.08
     Grace
    -0.08
     vejo
    -0.07
    -0.07
     intraven
    -0.07
     ympär
    -0.07
     Hall
    -0.07
     Abr
    -0.07
     compañero
    -0.07
    POSITIVE LOGITS
     manually
    0.15
     manual
    0.13
     tedious
    0.13
     вруч
    0.13
     cumbersome
    0.11
    Manual
    0.11
     Manual
    0.11
    .manual
    0.11
    manual
    0.11
     явно
    0.11
    Act Density 0.023%

    No Known Activations