INDEX
    Explanations

    references to age or generations

    New Auto-Interp
    Negative Logits
    undi
    -0.19
    baugh
    -0.17
    allet
    -0.16
    ential
    -0.15
    obic
    -0.15
    xic
    -0.15
    rescia
    -0.15
    ELS
    -0.14
     Philosophy
    -0.14
    abb
    -0.14
    POSITIVE LOGITS
    اÙĨÙĩ
    0.16
    -wide
    0.16
    antro
    0.15
    liness
    0.15
    /world
    0.14
     ton
    0.14
    eton
    0.14
    ÙĨب
    0.14
     pret
    0.14
    ando
    0.14
    Act Density 0.147%

    No Known Activations