INDEX
    Explanations

    references to historical figures and events

    New Auto-Interp
    Negative Logits
    (DBG
    -0.15
    bourne
    -0.14
     synonym
    -0.14
     Dün
    -0.14
     Nisan
    -0.14
    NSS
    -0.14
     Ñĥли
    -0.14
    NullOr
    -0.14
    ãĥ³ãĥĶ
    -0.14
    starter
    -0.13
    POSITIVE LOGITS
     King
    0.33
     Emperor
    0.32
     Queen
    0.32
     Emp
    0.28
     Count
    0.28
     Princess
    0.23
     Prince
    0.23
     emperor
    0.23
    Queen
    0.23
    King
    0.23
    Act Density 0.212%

    No Known Activations