INDEX
Explanations
verbs indicating future actions or possibilities
modal verbs indicating uncertainty or possibility
New Auto-Interp
Negative Logits
adolesc
-0.60
Citiz
-0.57
pione
-0.56
metic
-0.56
WithNo
-0.56
olini
-0.54
newcomer
-0.53
ãĤ¶
-0.52
corrid
-0.51
asus
-0.51
POSITIVE LOGITS
.
1.41
!.
1.32
!
1.31
.[
1.21
.(
1.20
;
1.20
;)
1.18
:)
1.16
!!!!
1.15
.--
1.15
Activations Density 0.427%