INDEX
Explanations
modal verbs indicating potential actions or conditions
New Auto-Interp
Negative Logits
odable
-0.17
shouldBe
-0.15
EMPLARY
-0.15
erin
-0.15
anden
-0.15
emu
-0.15
lém
-0.15
kenin
-0.15
anus
-0.14
Æ¡
-0.14
POSITIVE LOGITS
nor
0.29
mot
0.24
bot
0.22
net
0.21
Nor
0.21
note
0.21
Note
0.20
ot
0.20
.note
0.19
n
0.19
Activations Density 0.103%