INDEX
Explanations
concepts related to treatment, behavior, and ethics in interpersonal and societal contexts
Describes actions or behaviors, often negative
how things are done or treated
New Auto-Interp
Negative Logits
يتيمه
-0.69
ⓧ
-0.68
AntiForgeryToken
-0.61
fillType
-0.61
IndexPath
-0.61
IsContent
-0.59
IsMutable
-0.56
Naissance
-0.53
yorsunuz
-0.53
ungsbedingungen
-0.52
POSITIVE LOGITS
differently
1.46
accordingly
0.94
according
0.94
incorrectly
0.93
diffé
0.85
differ
0.79
how
0.78
wrong
0.77
correctly
0.76
wrongly
0.73
Activations Density 0.401%