Explanations
specific suffix patterns in words, particularly those ending with -e or -ple
Negative Logits
Token           Logit
localctx        -0.58
članak          -0.57
Darum           -0.53
出版年          -0.52
INARY           -0.51
DEF             -0.50
vPvB            -0.49
województwie    -0.48
iles            -0.47
jini            -0.46
Positive Logits
Token           Logit
ath             0.64
ats             0.63
atst            0.59
تضيفلها         0.58
aling           0.52
alth            0.51
aten            0.51
asun            0.50
ATS             0.50
ather           0.50
Activations Density: 0.325%