INDEX
Explanations
phrases expressing opinions or evaluations about technology and personal responsibility
New Auto-Interp
Negative Logits
NamedQueries
-0.86
']):
-0.82
évaluateur
-0.81
')):
-0.81
windowFixed
-0.80
"]}
-0.75
']],
-0.73
inSlope
-0.73
')(
-0.73
'}),
-0.73
POSITIVE LOGITS
What
0.60
<eos>
0.59
What
0.59
Why
0.54
You
0.54
новниш
0.54
Why
0.53
what
0.53
You
0.52
They
0.52
Activations Density 0.042%