INDEX
Explanations
negative or problematic phrases related to health conditions
New Auto-Interp
Negative Logits
tal
-0.52
'
-0.49
الاتحاد
-0.48
es
-0.48
cul
-0.48
ฟ
-0.46
end
-0.46
-0.45
kal
-0.45
.
-0.45
POSITIVE LOGITS
sizeCache
1.12
]")]
1.04
")));
1.02
referenties
1.01
الرياضيه
1.01
__":
1.00
complexContent
0.98
__":
0.98
脚注の使い方
0.98
انجليز
0.97
Activations Density 0.177%