INDEX
Explanations
phrases or concepts related to health challenges and deficiencies
New Auto-Interp
Negative Logits
}}$\\
-0.41
')";
-0.40
للاسماء
-0.40
}{$\-0.40
yake
-0.40
//{
-0.40
illaume
-0.39
measure
-0.38
Numerade
-0.38
'])){
-0.38
POSITIVE LOGITS
begge
0.79
respectively
0.78
这两个
0.73
ambos
0.70
respectively
0.69
beiden
0.68
båda
0.67
どちらも
0.67
respectivamente
0.65
beide
0.64
Activations Density 0.429%