INDEX
Explanations
instances of recognizing or evaluating the quality of work or performance
New Auto-Interp
Negative Logits
Tang
-0.79
NetMessage
-0.75
fty
-0.71
ater
-0.66
raltar
-0.65
eros
-0.63
isks
-0.63
ities
-0.62
ories
-0.61
Phi
-0.61
POSITIVE LOGITS
portraying
0.94
manship
0.91
ethic
0.86
illustrating
0.86
capturing
0.83
protecting
0.81
showcasing
0.81
combating
0.81
keeping
0.80
educating
0.79
Activations Density 0.016%