INDEX
Explanations
articles related to evaluations or assessments
New Auto-Interp
Negative Logits
somewhat
-0.20
quite
-0.17
slightly
-0.16
sorts
-0.15
inel
-0.15
franch
-0.14
anus
-0.14
very
-0.14
rather
-0.14
iT
-0.14
POSITIVE LOGITS
treat
0.21
contrast
0.19
contrast
0.19
difference
0.19
Difference
0.17
undertaking
0.17
accomplishment
0.16
ride
0.16
feat
0.16
ìĿį
0.16
Activations Density 0.061%