INDEX
Explanations
references to specific ethnicities or cultural identities
proper nouns and named entities referring to geographic locations and ethnic groups.
New Auto-Interp
Negative Logits
propOrder
-0.70
ModelExpression
-0.53
للمعارف
-0.52
AccessorTable
-0.49
-------
-0.47
comps
-0.47
makeConstraints
-0.45
compr
-0.44
IndentedString
-0.44
Vidite
-0.44
POSITIVE LOGITS
Albanian
0.88
Albania
0.81
Kosovo
0.73
Tirana
0.72
alban
0.69
Albania
0.68
Alban
0.68
Alban
0.66
🇽
0.54
Gj
0.52
Activations Density 0.022%