INDEX
Explanations
phrases related to evaluation or comparison
phrases indicating criteria or expectations related to performance or evaluations
New Auto-Interp
Negative Logits
alion
-0.91
itional
-0.70
alysed
-0.69
gradation
-0.69
orah
-0.67
gered
-0.66
oche
-0.65
rection
-0.64
orage
-0.64
ackets
-0.63
POSITIVE LOGITS
considering
0.83
rookies
0.72
someone
0.72
understatement
0.71
rookie
0.68
sandwic
0.62
fledgling
0.62
hindsight
0.62
oneself
0.60
modern
0.59
Activations Density 0.314%