INDEX
    Explanations

    references to glass and related materials

    New Auto-Interp
    Negative Logits
    etic
    -0.17
    sten
    -0.16
    estro
    -0.15
    ektor
    -0.15
    529
    -0.14
    temp
    -0.14
    ots
    -0.14
    tem
    -0.14
    ä¿Ĺ
    -0.14
    hai
    -0.14
    POSITIVE LOGITS
    gow
    0.26
    (es
    0.21
    ses
    0.19
    mates
    0.18
    work
    0.18
    glass
    0.18
    wort
    0.18
    boro
    0.18
    door
    0.17
    æĪ¸
    0.17
    Act Density 0.017%

    No Known Activations