INDEX
    Explanations

    terms related to historical and geographical contexts, especially concerning governance and cultural heritage

    Negative Logits
     none
    -0.14
    Tube
    -0.13
    Thunk
    -0.13
    iras
    -0.13
     ragaz
    -0.13
    ADF
    -0.13
    IDA
    -0.13
     електрон
    -0.13
    umbn
    -0.13
     once
    -0.13
    Positive Logits
     en
    0.34
     die
    0.28
     wa
    0.21
     Die
    0.21
     tez
    0.20
    Die
    0.19
     mét
    0.19
     want
    0.19
    ,en
    0.19
     met
    0.19
    Act Density 0.043%

    No Known Activations