Explanations
phrases indicating assistance and professional support in various services
Negative Logits
achi            -0.16
buie            -0.15
uchen           -0.14
imbus           -0.14
ceive           -0.14
yang            -0.14
StateManager    -0.14
تاب             -0.14
indow           -0.13
_Component      -0.13
Positive Logits
guide           0.26
walks           0.25
help            0.25
walk            0.24
walk            0.24
walked          0.23
suggest         0.21
Walk            0.21
walkers         0.21
recommend       0.20
Activations Density: 0.156%