INDEX
Explanations
the occurrence of the verb "have"
New Auto-Interp
Negative Logits
las
-0.21
ni
-0.19
la
-0.18
itself
-0.17
mer
-0.16
toll
-0.16
lang
-0.16
na
-0.16
nu
-0.15
nar
-0.15
POSITIVE LOGITS
aad
0.15
DDD
0.15
èĵ
0.14
Ù쨹
0.14
addin
0.14
ety
0.14
aed
0.14
indow
0.14
aket
0.14
eds
0.13
Activations Density 0.102%