INDEX
    Explanations

    concepts related to training and mentoring processes

    New Auto-Interp
    Negative Logits
     wards
    -0.17
    .self
    -0.16
    ìĽĥ
    -0.15
    ADB
    -0.14
    .assert
    -0.14
    buz
    -0.14
    Timestamp
    -0.14
    BIN
    -0.14
    anium
    -0.14
    endi
    -0.14
    POSITIVE LOGITS
     ihm
    0.22
     him
    0.20
     treat
    0.19
    escort
    0.18
    oints
    0.17
     ihn
    0.17
     escort
    0.17
     gently
    0.16
    oint
    0.16
    ãģĭãģij
    0.16
    Act Density 0.371%

    No Known Activations