INDEX
    Explanations

    adjectives describing strong beliefs or characteristics

    terms that signal strong beliefs or positions

    New Auto-Interp
    Negative Logits
    daq
    -0.82
    REDACTED
    -0.77
    iership
    -0.71
    Mind
    -0.69
    oping
    -0.68
    nesota
    -0.68
    iosity
    -0.65
    xiety
    -0.64
    ither
    -0.61
    Report
    -0.60
    POSITIVE LOGITS
    glers
    0.85
    ãĤ¡
    0.83
    ����
    0.76
    char
    0.76
    aneers
    0.75
    sonian
    0.74
    \\\\
    0.72
    ãĥİ
    0.71
    essential
    0.71
    ãĤ©
    0.70
    Act Density 0.058%

    No Known Activations