INDEX
Explanations
prepositions and conjunctions
prepositions of origin
New Auto-Interp
Negative Logits
faſt
-0.55
pleaſure
-0.51
enfans
-0.47
itſelf
-0.47
inſ
-0.46
preſent
-0.45
raiſ
-0.45
ſelves
-0.45
tranſ
-0.44
ſy
-0.43
POSITIVE LOGITS
من
1.71
من
1.09
FROM
0.91
FROM
0.84
From
0.84
ومن
0.83
dari
0.83
from
0.82
מן
0.82
From
0.81
Activations Density 0.000%