INDEX
Explanations
references to ritual practices or observances
New Auto-Interp
Negative Logits
iero
-0.16
allas
-0.15
ille
-0.15
Ñİн
-0.15
غر
-0.15
.learn
-0.15
Nest
-0.15
AMS
-0.14
ILLE
-0.14
Temper
-0.14
POSITIVE LOGITS
het
0.19
elig
0.17
ewise
0.17
appro
0.17
ums
0.16
ital
0.16
antt
0.16
ITAL
0.15
acial
0.15
wand
0.15
Activations Density 0.029%