INDEX
Explanations
references to self-harm or suicidal ideation
New Auto-Interp
Negative Logits
ſelf
-0.45
themſelves
-0.42
natives
-0.36
felves
-0.36
TRAIT
-0.36
للمعارف
-0.35
ſelves
-0.35
himſelf
-0.35
ſtand
-0.35
Ubicación
-0.34
POSITIVE LOGITS
__*/
0.49
GetEnumerator
0.45
قایناقلار
0.44
DataContract
0.44
COMMIT
0.42
zarchiwizowane
0.42
いしい
0.42
しゃれ
0.41
CreateTagHelper
0.40
Referencer
0.40
Activations Density 0.084%