INDEX
Explanations
gerunds and actions related to maintaining stability and balance in systems
New Auto-Interp
Negative Logits
ویکیپدی
-0.65
Diwedd
-0.63
propOrder
-0.61
Personensuche
-0.60
TagMode
-0.59
intenant
-0.59
ainfi
-0.55
increí
-0.55
înc
-0.53
verwijspagina
-0.52
POSITIVE LOGITS
while
0.50
しながら
0.45
的同时
0.44
while
0.43
'\\;'
0.42
simultaneously
0.42
同时
0.40
sambil
0.40
しつつ
0.39
dabei
0.38
Activations Density 0.453%