INDEX
Explanations
instances of impactful language that convey strong emotions or significant concepts
that intensify a negative situation
making worse or more profound
Negative Logits
Paglinawan       -0.59
verifyException  -0.58
aarrggbb         -0.58
nahilalakip      -0.54
PostInfinity     -0.52
atguigu          -0.50
rolla            -0.48
azaki            -0.48
findpost         -0.47
ORTS             -0.47
Positive Logits
further      2.07
further      1.91
FURTHER      1.75
Further      1.72
Further      1.70
FURTHER      1.60
exacerbate   1.46
worsened     1.44
worsen       1.39
更に         1.39
Activations Density 0.560%