INDEX
    Explanations

    names of historical figures

    New Auto-Interp
    Negative Logits
     dotnet
    0.34
     IHDA
    0.34
     élytres
    0.32
    🟣
    0.31
    ROID
    0.31
     arrondies
    0.31
    0.30
     ډاونلوډ
    0.29
     ब्लूटूथ
    0.29
     astrocyte
    0.29
    POSITIVE LOGITS
     ibn
    0.47
     Gandhi
    0.39
     Nietzsche
    0.38
     Freud
    0.38
    Gandhi
    0.37
     Hitler
    0.37
     Ibn
    0.35
     Tolstoy
    0.35
     Tolkien
    0.34
     Marx
    0.34
    Act Density 0.048%

    No Known Activations