INDEX
    Explanations

    instances of struggle or difficulty in various contexts

    New Auto-Interp
    Negative Logits
    Ậ
    -0.15
    è±Ĭ
    -0.14
    redo
    -0.14
    _mE
    -0.14
    _detach
    -0.14
    ko
    -0.14
    елениÑı
    -0.14
    avin
    -0.14
     resumes
    -0.13
    丰
    -0.13
    POSITIVE LOGITS
     even
    0.23
    proper
    0.22
     proper
    0.21
     Proper
    0.20
     ever
    0.19
     due
    0.19
     properly
    0.18
     anymore
    0.18
     progress
    0.17
     certain
    0.17
    Act Density 0.093%

    No Known Activations