INDEX
    Explanations

    adjectives indicating strong beliefs or loyalty

    words that indicate strong personal beliefs or unwavering support

    New Auto-Interp
    Negative Logits
    ammy
    -1.05
    hops
    -0.91
    nesota
    -0.83
    ombies
    -0.74
    ammers
    -0.72
    uden
    -0.72
    NetMessage
    -0.72
    ovember
    -0.70
    APH
    -0.70
    anders
    -0.68
    POSITIVE LOGITS
    ly
    1.10
    ness
    0.85
    wart
    0.83
     supporter
    0.76
     staunch
    0.76
    nesses
    0.74
    ELY
    0.71
    ity
    0.70
     exponent
    0.70
    kowski
    0.69
    Act Density 0.027%

    No Known Activations