INDEX
Explanations
infinitive verbs indicating actions or decisions
New Auto-Interp
Negative Logits
batis
-0.18
ongyang
-0.17
vero
-0.15
ability
-0.15
ocate
-0.14
بتÙĪØ§ÙĨ
-0.14
intl
-0.14
berhasil
-0.14
getClient
-0.14
decisions
-0.14
POSITIVE LOGITS
arak
0.18
rather
0.17
instead
0.17
.lu
0.16
iglia
0.16
513
0.16
rather
0.15
pursuing
0.15
agra
0.14
alla
0.14
Activations Density 0.102%