Explanations
variations of the word "breaking" and related phrases indicating urgent news updates
Negative Logits
crack       -0.15
ixo         -0.15
emean       -0.15
/inet       -0.14
иÑģ         -0.14
olen        -0.14
emark       -0.14
rani        -0.14
TTY         -0.14
ÏĢÏģοÏĤ      -0.14
Positive Logits
uur         0.15
ucid        0.15
uir         0.14
Resp        0.14
ichel       0.14
elt         0.14
summoned    0.14
iteli       0.14
à¥įशन       0.14
ìĪł         0.13
Activation Density: 0.007%