INDEX
    Explanations

    specific terms and concepts related to film, education, and cultural institutions

    New Auto-Interp
    Negative Logits
    yleft
    -0.16
    5
    -0.15
    3
    -0.15
    2
    -0.15
    7
    -0.14
    łĢ
    -0.14
    6
    -0.14
     contr
    -0.14
     Sez
    -0.14
    9
    -0.14
    POSITIVE LOGITS
    -,
    0.30
    unter
    0.27
    ver
    0.27
    -/
    0.27
    gesch
    0.27
    -
    0.26
    vere
    0.26
    bes
    0.25
    geb
    0.25
    bere
    0.25
    Act Density 0.046%

    No Known Activations