INDEX
    Explanations

    references to time spent, particularly in hours

    New Auto-Interp
    Negative Logits
    ione
    -0.18
    chor
    -0.18
    éį
    -0.15
     Pride
    -0.14
    WXYZ
    -0.14
    adla
    -0.14
    ience
    -0.14
     Bray
    -0.14
    داد
    -0.13
    ji
    -0.13
    POSITIVE LOGITS
    oÅĽci
    0.14
    neys
    0.14
     dint
    0.13
    شتÙĩ
    0.13
    ouser
    0.13
    esian
    0.13
    _sensitive
    0.13
    edik
    0.13
     Hess
    0.13
    esk
    0.13
    Act Density 0.012%

    No Known Activations