INDEX
Explanations
expressions of desire or intention
New Auto-Interp
Negative Logits
ARA
-0.80
CrossRef
-0.80
faptul
-0.68
Flores
-0.68
magasiner
-0.67
ooks
-0.66
IMS
-0.66
Torres
-0.66
JNIEnv
-0.66
matron
-0.64
POSITIVE LOGITS
want
1.68
WANT
1.67
wants
1.57
wants
1.57
Wants
1.57
want
1.54
wanted
1.53
WANT
1.46
Want
1.42
wanting
1.35
Activations Density 0.097%