INDEX
Explanations
instances of public indifference or lack of attention towards significant events or individuals
New Auto-Interp
Negative Logits
itas
-0.16
PTS
-0.15
atcher
-0.15
mini
-0.15
Attach
-0.15
_attachments
-0.14
itten
-0.14
ÙĪÙĪ
-0.14
zwar
-0.14
Attach
-0.14
POSITIVE LOGITS
still
0.25
still
0.23
Still
0.23
STILL
0.22
noch
0.22
Still
0.21
somehow
0.20
åį´
0.19
853
0.17
åį»
0.17
Activations Density 0.164%