INDEX
    Explanations

    nouns related to books and educational material

    New Auto-Interp
    Negative Logits
    tram
    -0.17
    oka
    -0.16
    cz
    -0.16
     Donovan
    -0.16
    ones
    -0.15
    oods
    -0.15
    iet
    -0.15
    sect
    -0.15
    arts
    -0.14
    ts
    -0.14
    POSITIVE LOGITS
    osu
    0.18
     neler
    0.17
    едÑĮ
    0.15
    amba
    0.15
    리ìķĦ
    0.14
    ifter
    0.14
     dikke
    0.14
    ilos
    0.14
    dum
    0.14
    Tuy
    0.14
    Act Density 0.215%

    No Known Activations