INDEX
    Explanations

    terms and phrases related to community interactions and social norms

    New Auto-Interp
    Negative Logits
    <sup>
    -0.49
    -0.49
     F
    -0.48
     isso
    -0.47
     Be
    -0.46
    openModal
    -0.45
     '
    -0.44
     Car
    -0.44
     E
    -0.43
    ↵↵
    -0.42
    POSITIVE LOGITS
    enumi
    0.87
    enderror
    0.79
     uſed
    0.75
    CloseOperation
    0.75
     henvisninger
    0.75
     purpoſe
    0.73
     ſmall
    0.72
    rrggbb
    0.71
    AutoresizingMask
    0.70
    ValueStyle
    0.70
    Act Density 0.301%

    No Known Activations