INDEX
Explanations
modal verbs indicating potential actions or permissions
Negative Logits
Token    Logit
(blank)  -0.57
-        -0.57
F        -0.55
All      -0.52
astic    -0.50
.        -0.49
’        -0.48
1        -0.48
Jan      -0.48
2        -0.47
Positive Logits
Token            Logit
цездатний        0.92
المناصب          0.90
becauſe          0.89
Biôgrafia        0.85
ainfi            0.85
onAnimation      0.83
་་               0.82
Misa             0.81
DebuggerNonUser  0.80
حياته            0.80
Activations Density 0.263%