INDEX
    Explanations

    references to global issues or phenomena

    Negative Logits
    Token           Logit
    "ository"       -1.91
    "rir"           -1.70
    " arbitrary"    -1.59
    " sacrifice"    -1.58
    "fficient"      -1.53
    "lla"           -1.52
    " died"         -1.50
    "apest"         -1.49
    "heets"         -1.48
    "lessly"        -1.48
    Positive Logits
    Token           Logit
    "ķ"             2.49
    "Ĺ"             2.31
    "Ł"             2.25
    "ĵ"             2.17
    "ĺ"             2.15
    "Ħ"             2.13
    "£"             2.03
    "°"             2.03
    "µ"             1.94
    "isation"       1.93
    Activation Density: 0.067%

    No Known Activations