INDEX
    Explanations

    words related to political figures and events

    special characters or non-standard typographical elements

    New Auto-Interp
    Negative Logits
     mathemat
    -0.86
    undai
    -0.81
     horizont
    -0.81
     charism
    -0.80
     myster
    -0.78
    eatures
    -0.78
     simultane
    -0.77
     scrut
    -0.76
     pestic
    -0.75
     incorpor
    -0.74
    POSITIVE LOGITS
    ãĥĥãĥī
    0.96
    âĹ¼
    0.95
    MQ
    0.90
    ï¸ı
    0.88
    deg
    0.86
    é¾į
    0.81
    rd
    0.81
    horn
    0.81
    Īè
    0.79
    UE
    0.76
    Act Density 0.031%

    No Known Activations