INDEX
    Explanations

    terms related to historical and cultural experiences

    New Auto-Interp
    Negative Logits
    ucha
    -0.16
    enso
    -0.16
    è¼
    -0.14
     seper
    -0.14
    roscope
    -0.14
    heiro
    -0.14
    ubl
    -0.14
     fitte
    -0.14
     bele
    -0.14
    obra
    -0.13
    POSITIVE LOGITS
    ò
    0.24
    ì
    0.24
     protagonist
    0.22
     possibile
    0.21
     pubb
    0.21
    ìm
    0.20
     preval
    0.20
     tras
    0.19
    itÃł
    0.19
     protagonists
    0.19
    Act Density 0.678%

    No Known Activations