INDEX
Explanations
phrases indicating possession or relatedness to groups or items
New Auto-Interp
Negative Logits
Personendaten
-1.08
IVEREF
-0.86
تقاوى
-0.73
GEBURTSDATUM
-0.73
оригіналу
-0.70
ſta
-0.69
Tembelea
-0.68
enderror
-0.67
AddTagHelper
-0.67
HFILL
-0.67
POSITIVE LOGITS
Các
0.33
nossos
0.31
Nuestros
0.31
taciones
0.31
niitä
0.31
infatti
0.30
of
0.30
dintre
0.29
chúng
0.29
ônico
0.29
Activations Density 0.013%