INDEX
Explanations
modal verbs indicating necessity or conditional scenarios
New Auto-Interp
Negative Logits
иÑģполÑĮзÑĥ
-0.14
виÑĩай
-0.12
ÌĪ
-0.12
.learn
-0.12
learn
-0.12
unsch
-0.12
_NB
-0.11
indo
-0.11
ÌĨ
-0.11
zoekt
-0.11
POSITIVE LOGITS
be
0.32
mean
0.31
consist
0.30
include
0.30
involve
0.29
occur
0.29
contain
0.28
entail
0.28
exist
0.28
happen
0.27
Activations Density 0.781%