INDEX
Explanations
phrases indicating obligation or requirement
required to follow
New Auto-Interp
Negative Logits
vecind
-0.47
idées
-0.46
logros
-0.43
pieles
-0.43
Idee
-0.43
descubrió
-0.42
democracia
-0.42
nubes
-0.42
joven
-0.41
otras
-0.41
POSITIVE LOGITS
Must
0.81
must
0.75
Must
0.75
must
0.73
phải
0.66
MUST
0.62
MUST
0.62
Stiff
0.60
faudra
0.59
harus
0.58
Activations Density 0.022%