INDEX
Explanations
instances of the phrase "pay attention."
Negative Logits
_CAPACITY        -0.15
.dw              -0.15
jedn             -0.15
xba              -0.15
ADVERTISEMENT    -0.14
.kode            -0.14
âĢŀJ              -0.14
olars            -0.14
oux              -0.14
anchise          -0.14
Positive Logits
811              0.15
610              0.15
395              0.15
Listening        0.15
nid              0.14
trad             0.13
assi             0.13
aller            0.13
nings            0.13
otherapy         0.13
Activations Density: 0.013%