INDEX
Explanations
adjectives describing the characteristics of things or entities
sentences affirming the existence or status of subjects
Negative Logits
oice      -0.71
wipes     -0.68
Laughs    -0.67
Leave     -0.66
congr     -0.66
Fix       -0.65
Nope      -0.65
Yeah      -0.64
laughs    -0.63
stop      -0.62
Positive Logits
characterized    1.31
regarded         1.25
commonly         1.22
distinguished    1.19
contrasted       1.17
therefore        1.15
considered       1.13
often            1.12
depicted         1.12
usually          1.11
Activations Density: 0.315%