INDEX
Explanations
names and terms related to specific individuals or entities in a context, particularly related to Malaysian politics
New Auto-Interp
Negative Logits
‘
-0.69
able
-0.67
ette
-0.65
اس
-0.62
"]}
-0.61
Castor
-0.59
”]
-0.58
amic
-0.57
atic
-0.56
a
-0.56
POSITIVE LOGITS
Cah
1.32
Kuh
1.20
Réponses
1.16
Lah
1.13
Ruh
1.12
Tah
1.11
Coh
1.09
AH
1.08
Ahh
1.06
IAH
1.05
Activations Density 0.099%