INDEX
Explanations
the presence of a specific special character or symbol
New Auto-Interp
Negative Logits
lá»iji
-0.14
aviours
-0.12
ÑĪÑĤÑĥ
-0.12
servis
-0.11
draul
-0.11
اختص
-0.11
дÑĢом
-0.11
contri
-0.10
Bylo
-0.10
stav
-0.10
POSITIVE LOGITS
flowers
0.29
grapes
0.29
oranges
0.28
roses
0.27
cotton
0.27
grains
0.27
wheat
0.26
coconut
0.26
tomatoes
0.26
berries
0.26
Activations Density 0.100%