INDEX
Explanations
terms related to health risks and medical conditions
New Auto-Interp
Negative Logits
myſelf
-0.73
seamnă
-0.73
―――――
-0.73
Jefus
-0.72
Majefty
-0.71
Monfieur
-0.71
Diſ
-0.70
iſt
-0.69
raiſ
-0.68
himſelf
-0.67
POSITIVE LOGITS
misalnya
1.81
например
1.75
bijvoorbeeld
1.62
például
1.60
beispielsweise
1.53
Например
1.35
například
1.28
Например
1.24
مث
1.22
example
1.21
Activations Density 0.418%