INDEX
    Explanations

    phrases related to strong actions or impactful events

    phrases related to social and moral issues

    Negative Logits
    Token       Logit
     Oaks       -0.63
     Gideon     -0.63
    200000      -0.61
     Robbie     -0.60
    iage        -0.58
     LH         -0.58
    ilan        -0.57
    ertility    -0.56
     Kro        -0.56
     Liberty    -0.55
    Positive Logits
    Token       Logit
    »           2.00
    âĢ          1.84
    ''          1.79
     âĢ         1.68
    ãĢį         1.67
    ãĢ          1.61
    ''.         1.55
    [/          1.53
    ¨           1.47
    </          1.46
    Activation Density: 0.688%

    No Known Activations