INDEX
    Explanations

    Writing/creation

    The neuron detects mentions of student misbehavior or disciplinary problems (e.g. bullying, bullies, troublemakers).

    New Auto-Interp
    Negative Logits
    inet
    -0.06
     isn
    -0.06
    -0.06
    angement
    -0.06
    reement
    -0.06
     couldn
    -0.06
    –
    -0.06
     made
    -0.06
     fileType
    -0.06
     moeten
    -0.06
    POSITIVE LOGITS
    гар
    0.08
     [])↵↵
    0.07
    Settings
    0.06
    0.06
    0.06
    ắn
    0.06
    CAT
    0.06
    PER
    0.06
     عباس
    0.06
     üzerinden
    0.06
    Act Density 0.256%

    No Known Activations