INDEX
Explanations
references to cultural or religious identities and their associated beliefs
New Auto-Interp
Negative Logits
ostavi
-0.62
الرياضيه
-0.61
AssemblyCulture
-0.59
(!__
-0.58
رشف
-0.57
defaultstate
-0.57
}],
-0.54
ValueStyle
-0.53
NewGuid
-0.52
farwyddwr
-0.50
POSITIVE LOGITS
klaim
0.65
supposedly
0.62
misleading
0.57
umpad
0.56
misled
0.56
ofus
0.55
deceptive
0.53
deluded
0.53
naut
0.52
tkinter
0.52
Activations Density 0.711%