Explanations
adding to or further intensifying
Negative Logits
prevent           0.70
replacement       0.69
prevention        0.67
preventing        0.67
ancipation        0.66
revolutionized    0.64
பண்ப              0.64
replacement       0.64
urndata           0.63
後に              0.63
Positive Logits
further           2.34
exacerbate        2.19
further           2.15
Further           2.15
exacerb           2.13
Further           2.09
FURTHER           2.02
进一步            1.95
exacerbated       1.83
reinforce         1.71
Activations Density: 0.410%