INDEX
Explanations
modal verbs and related phrases indicating necessity or capability
New Auto-Interp
Negative Logits
present
-0.59
auto
-0.57
lein
-0.53
A
-0.52
gen
-0.50
final
-0.50
brainly
-0.48
choice
-0.48
一旁的
-0.48
Poppins
-0.47
POSITIVE LOGITS
kann
0.86
Sucesor
0.82
können
0.75
expandindo
0.69
referrerpolicy
0.66
sollte
0.66
darf
0.65
müssen
0.65
muß
0.65
kommt
0.64
Activations Density 0.033%