INDEX
Explanations
common function words and prepositions in the text
New Auto-Interp
Negative Logits
ograd
-0.17
using
-0.14
ÙĪØ«
-0.14
kra
-0.14
اÙĦØ£ÙĨ
-0.14
elves
-0.13
obj
-0.13
roje
-0.13
اÙĦات
-0.13
amina
-0.12
POSITIVE LOGITS
hap
0.16
uteur
0.15
Wal
0.15
Steven
0.15
zon
0.14
ihan
0.14
OfDay
0.14
anager
0.14
eline
0.14
baugh
0.14
Activations Density 0.069%