INDEX
    Explanations

    words related to historical and cultural significance

    New Auto-Interp
    Negative Logits
     Pwr
    -0.68
     tumble
    -0.62
     itch
    -0.62
     sake
    -0.60
    landish
    -0.57
     dism
    -0.57
    llah
    -0.57
     cop
    -0.56
     speedy
    -0.54
     spirited
    -0.54
    POSITIVE LOGITS
    ansk
    0.96
    heed
    0.86
    anne
    0.86
    achev
    0.84
    emort
    0.83
    cious
    0.81
    thening
    0.80
    uli
    0.78
    uania
    0.77
    alties
    0.75
    Act Density 0.024%

    No Known Activations