INDEX
Explanations
possessive forms or references to ownership
New Auto-Interp
Negative Logits
religieuses
-0.77
adaptées
-0.72
للاسماء
-0.71
dedans
-0.68
ainfi
-0.67
complètes
-0.67
convention
-0.66
koning
-0.66
ReusableCell
-0.65
SIGINT
-0.65
POSITIVE LOGITS
own
0.87
ின்
0.83
ⓧ
0.73
ുടെ
0.73
its
0.72
my
0.71
երի
0.70
ünün
0.69
의
0.68
main
0.68
Activations Density 0.137%