INDEX
Explanations
occurrences of various verbs and adverbs that indicate actions or states relating to necessity and intention
New Auto-Interp
Negative Logits
yat
-0.16
ç´Ģ
-0.16
eph
-0.15
ynes
-0.15
yle
-0.15
lings
-0.15
arget
-0.14
just
-0.14
Cons
-0.14
Norm
-0.14
POSITIVE LOGITS
idlo
0.18
CHIP
0.17
CHIP
0.16
qed
0.16
.scalablytyped
0.15
iband
0.15
OTOS
0.15
idth
0.15
Retrie
0.15
ÑĤон
0.15
Activations Density 0.002%