INDEX
    Explanations

    expressions of critique or discussion surrounding performance and conditions

    New Auto-Interp
    Negative Logits
    ูà¹Ī
    -0.18
     “â̦
    -0.15
     attendee
    -0.15
     beden
    -0.14
    -0.14
     ç³
    -0.13
    ÅĻel
    -0.13
     “[
    -0.13
    chos
    -0.12
    #Region
    -0.12
    POSITIVE LOGITS
     [
    0.23
     important
    0.17
     my
    0.17
     normal
    0.17
     things
    0.16
     complicated
    0.16
     maybe
    0.16
     tranqu
    0.16
     maths
    0.16
     I
    0.15
    Act Density 0.027%

    No Known Activations