INDEX
    Explanations

    elements related to film criticism and artistic evaluation

    New Auto-Interp
    Negative Logits
    azor
    -0.17
    igh
    -0.17
    vey
    -0.16
    ãģ°
    -0.16
    icut
    -0.14
     Heller
    -0.14
    ظ
    -0.14
     Ders
    -0.14
    vy
    -0.14
    olic
    -0.14
    POSITIVE LOGITS
    ern
    0.32
    tern
    0.28
    ERN
    0.28
    fern
    0.26
    bern
    0.24
    TERN
    0.24
    enden
    0.21
    erne
    0.20
    ndern
    0.20
    ende
    0.20
    Act Density 0.017%

    No Known Activations