INDEX
Explanations
phrases indicating frequency or habitual actions
New Auto-Interp
Negative Logits
IntoConstraints
-0.59
deinit
-0.53
Schild
-0.50
IContainer
-0.49
shiro
-0.45
瘩
-0.44
PreExecute
-0.43
diatas
-0.43
ladesh
-0.42
autonomie
-0.42
POSITIVE LOGITS
often
1.13
often
1.05
Often
1.02
Often
1.01
frequently
0.91
frequently
0.88
oftentimes
0.87
spesso
0.85
kerap
0.84
souvent
0.82
Activations Density 0.014%