INDEX
Explanations
references to ownership or possession
New Auto-Interp
Negative Logits
ویکیپدی
-0.58
they
-0.56
opatra
-0.55
gevens
-0.54
Vallarta
-0.54
mità
-0.54
kommune
-0.54
őben
-0.53
profondeur
-0.53
crossroads
-0.52
POSITIVE LOGITS
ünün
0.93
ⓧ
0.92
ின்
0.92
its
0.88
lerinin
0.84
idän
0.84
Whose
0.80
ുടെ
0.79
নের
0.77
manın
0.77
Activations Density 0.095%