INDEX
Explanations
terms related to tragedies and their aftermaths
New Auto-Interp
Negative Logits
fg
-0.15
strate
-0.14
ube
-0.14
plus
-0.14
getattr
-0.14
COVID
-0.14
©
-0.14
thus
-0.14
Ø«ÙĬر
-0.13
tales
-0.13
POSITIVE LOGITS
event
0.32
incident
0.31
äºĭä»¶
0.29
incident
0.25
evento
0.24
episode
0.24
affair
0.24
event
0.22
attack
0.22
crisis
0.22
Activations Density 0.244%