INDEX
    Explanations

    references to specific time periods and locations

    New Auto-Interp
    Negative Logits
     Blasio
    -0.16
    avis
    -0.15
    ivas
    -0.15
    quila
    -0.15
    廳
    -0.14
    peq
    -0.14
    leÅŁtir
    -0.14
    nda
    -0.14
    Rp
    -0.14
     Beste
    -0.14
    POSITIVE LOGITS
    resi
    0.17
    izzo
    0.16
     concepts
    0.14
    akk
    0.14
    Concept
    0.14
     Clem
    0.14
    allis
    0.14
    515
    0.14
    around
    0.14
     tolerant
    0.14
    Act Density 0.198%

    No Known Activations