INDEX
Explanations
phrases indicating evaluation or judgment of individuals and groups
evaluation or judgement
deeming or considering someone/something
New Auto-Interp
Negative Logits
ModelExpression
-0.67
tagHelperRunner
-0.64
CloseOperation
-0.61
avowed
-0.60
Apparently
-0.59
NameInMap
-0.58
absten
-0.58
credited
-0.57
기도
-0.56
Alleg
-0.56
POSITIVE LOGITS
superior
0.73
"
0.66
“
0.64
worthy
0.62
worthy
0.61
inferior
0.59
worth
0.59
suitable
0.58
worth
0.58
'
0.56
Activations Density 0.663%