INDEX
Explanations
phrases indicating effectiveness or success in processes or actions
Negative Logits
ServiceImpl    -0.15
adiator        -0.14
lesh           -0.14
kus            -0.14
اÙĩ            -0.14
obus           -0.14
æŀĿ            -0.14
patient        -0.14
ÙħÛĮÙĨ         -0.14
ong            -0.13
Positive Logits
wonders        0.32
magic          0.26
magic          0.24
miracles       0.22
wonder         0.22
best           0.20
/help          0.20
Magic          0.20
Wonder         0.19
Wonder         0.18
Activations Density: 0.044%