INDEX
Explanations
phrases related to current events and crises in different locations
themes related to legal actions and social conflicts involving authorities
New Auto-Interp
Negative Logits
assuming
-0.76
âĶĢâĶĢâĶĢâĶĢ
-0.74
":"/
-0.73
Principle
-0.70
Encyclopedia
-0.70
emphasis
-0.69
onom
-0.68
_.
-0.65
laughs
-0.65
Reviewer
-0.65
POSITIVE LOGITS
collided
1.40
clashed
1.32
crashed
1.25
vanished
1.20
collapsed
1.19
stormed
1.18
exploded
1.17
plummeted
1.14
surged
1.14
derailed
1.14
Activations Density 0.442%