INDEX
    Explanations

    names of people or characters

    proper nouns, particularly names of individuals

    New Auto-Interp
    Negative Logits
    theless
    -0.75
    ModLoader
    -0.74
     underwater
    -0.73
    âĶĢâĶĢ
    -0.71
    anwhile
    -0.69
    etheless
    -0.68
    LEASE
    -0.65
     Melania
    -0.65
    ãĥŁ
    -0.64
     Galileo
    -0.64
    POSITIVE LOGITS
    atz
    1.02
    ovich
    1.01
    lett
    1.00
    itz
    1.00
    acci
    0.98
    zen
    0.97
    inger
    0.95
    inski
    0.95
    owski
    0.93
    ansky
    0.93
    Act Density 0.306%

    No Known Activations