INDEX
    Explanations

    Phrases about beliefs and values, especially individual beliefs, societal values, and the juxtaposition of differing beliefs and values.

    Negative Logits
    " osal"      -0.83
    " hcm"       -0.73
    " rispond"   -0.69
    " dovre"     -0.69
    " encomp"    -0.69
    " interro"   -0.69
    " vogli"     -0.67
    " ridu"      -0.67
    " allarg"    -0.66
    " pessi"     -0.65
    Positive Logits
    " beliefs"       0.81
    "values"         0.62
    " values"        0.60
    " convictions"   0.57
    "Values"         0.56
    " EINVAL"        0.56
    " thoughts"      0.56
    " attitudes"     0.55
    " principles"    0.55
    "VALUES"         0.53
    Activation Density: 0.327%

    No Known Activations