    NEGATIVE LOGITS
    Token               Logit
    "_press"            -0.08
    "Driven"            -0.07
    [no token text]     -0.07
    " Mosk"             -0.07
    "-spacing"          -0.07
    "ulsive"            -0.07
    "cement"            -0.07
    " presses"          -0.07
    [no token text]     -0.07
    " Spir"             -0.07
    POSITIVE LOGITS
    Token               Logit
    " tasks"            0.10
    " tarefas"          0.09
    " tareas"           0.09
    "(tasks"            0.08
    " tâches"           0.08
    " impossible"       0.08
    "ndares"            0.08
    ".tasks"            0.08
    " claiming"         0.08
    " ngoại"            0.08
    Activation Density: 0.004%

    No Known Activations