INDEX
    Explanations

    mentions of significant historical figures and events in a narrative context

    Negative Logits
    ostel  -0.15
    ä¸Ģ页  -0.15
    oref  -0.15
     rencont  -0.15
    erval  -0.14
    athan  -0.14
    ozo  -0.14
     Sloan  -0.14
    utta  -0.14
    enheim  -0.14
    Positive Logits
    ToLocal  0.15
    /world  0.15
     ang  0.15
    ÐŁÐļ  0.14
    546  0.14
    але  0.13
    Ðİ  0.13
    عÛĮ  0.13
    _PK  0.13
    ;base  0.13
    Act Density 0.094%

    No Known Activations