INDEX
    Explanations

    phrases related to being "off," indicating disconnection or disengagement

    New Auto-Interp
    Negative Logits
     Италијани
    -0.65
    KURZBESCHREIBUNG
    -0.61
    uxxxx
    -0.57
     betrekking
    -0.56
    />,
    -0.49
     HasFactory
    -0.49
     frasi
    -0.49
    UnsafeEnabled
    -0.48
    __.
    -0.47
    geslacht
    -0.47
    POSITIVE LOGITS
     off
    1.09
     Off
    1.06
    Off
    1.05
    off
    1.02
     OFF
    0.90
    OFF
    0.79
     offs
    0.70
    offs
    0.65
    オフ
    0.63
     オフ
    0.60
    Act Density 0.071%

    No Known Activations