INDEX
    Explanations

    historical and religious figures' names

    historical figures and events

    Negative Logits
    Token        Logit
    "tools"      -0.72
    "malink"     -0.70
    "ratom"      -0.70
    "feature"    -0.70
    "rador"      -0.70
    "Wisconsin"  -0.68
    "machine"    -0.68
    "REAM"       -0.67
    "FU"         -0.67
    "LOAD"       -0.67
    Positive Logits
    Token        Logit
    " Galile"    1.22
    " XVI"       1.20
    " Augustus"  1.19
    " Herod"     1.18
    " VIII"      1.16
    " Tud"       1.15
    " Claud"     1.13
    " Romans"    1.13
    " XIV"       1.12
    " XII"       1.12
    Activation Density: 0.244%

    No Known Activations