INDEX
    Explanations

    terms related to workplace health and safety regulations

    New Auto-Interp
    Negative Logits
     sur
    -0.53
    addGap
    -0.51
    בות
    -0.50
    <eos>
    -0.50
    ....
    -0.50
     im
    -0.48
     tra
    -0.47
    oney
    -0.47
     von
    -0.46
     Datenschutz
    -0.46
    POSITIVE LOGITS
     Jefus
    0.98
     itſelf
    0.93
     Efq
    0.92
     myſelf
    0.91
     Theſe
    0.88
     becauſe
    0.88
    0.85
    TagMode
    0.85
    ыгана
    0.84
     leſs
    0.83
    Act Density 0.212%

    No Known Activations