Explanations
terms associated with adverse health effects and medical conditions
a shift, change, or opposite
contrary outcomes or negations
Negative Logits
IsContent         -0.68
脚注の使い方        -0.58
'\\;'             -0.53
EXTERN            -0.50
clickable         -0.48
importanza        -0.47
dám               -0.47
asiun             -0.47
différence        -0.47
withIdentifier    -0.47
Positive Logits
malah             1.27
justru            1.23
反而               1.21
むしろ             1.08
かえ               1.00
anzi              0.97
worsen            0.91
наоборот          0.89
逆に               0.89
instead           0.85
Activations Density: 0.379%