INDEX
    Explanations

    phrases expressing opinions or evaluations about technology and personal responsibility

    New Auto-Interp
    Negative Logits
    NamedQueries
    -0.86
    ']):
    -0.82
    évaluateur
    -0.81
    ')):
    -0.81
    windowFixed
    -0.80
    "]}
    -0.75
    ']],
    -0.73
    inSlope
    -0.73
    ')(
    -0.73
    '}),
    -0.73
    POSITIVE LOGITS
    What
    0.60
    <eos>
    0.59
     What
    0.59
     Why
    0.54
    You
    0.54
    новниш
    0.54
    Why
    0.53
    what
    0.53
     You
    0.52
     They
    0.52
    Act Density 0.042%

    No Known Activations