INDEX
    Explanations

    locations and dates mentioned in the text

    New Auto-Interp
    Negative Logits
    λÎŃον
    -0.17
    loff
    -0.15
    оже
    -0.15
    axter
    -0.15
    leigh
    -0.15
    lea
    -0.14
     nue
    -0.14
    ailer
    -0.14
    bero
    -0.14
    ĵ°
    -0.14
    POSITIVE LOGITS
    ANI
    0.22
    omi
    0.18
     tar
    0.17
     seins
    0.16
    oli
    0.16
     Tar
    0.15
    .unlock
    0.15
    erve
    0.15
    Tar
    0.15
    UIT
    0.14
    Act Density 0.005%

    No Known Activations