INDEX
Explanations
phrases related to downplaying or playing down something
variations of the verb "play" and its derivatives
Negative Logits
spot     -0.87
Creat    -0.71
vic      -0.69
ni       -0.68
ust      -0.66
pour     -0.65
ney      -0.64
iph      -0.64
aghan    -0.63
ci       -0.63
Positive Logits
enance       0.75
theless      0.71
ements       0.71
overlook     0.68
uate         0.67
innocence    0.66
Mask         0.65
hee          0.65
INTON        0.65
rarily       0.63
Activations Density 0.030%