Explanations
Instances of potential accusations, or of actions attributed to specific entities such as countries, organizations, or individuals
Phrases that reference actions taken by a specific subject
Negative Logits
Token        Logit
meat         -0.71
edin         -0.71
é¾įåĸļ士     -0.71
heimer       -0.70
ãĥ´ãĤ¡       -0.70
ãĤ¦ãĤ¹       -0.69
resil        -0.68
elvet        -0.68
borgh        -0.67
redits       -0.67
Positive Logits
Token            Logit
virtue           1.07
politicians      0.93
products         0.92
successive       0.89
policymakers     0.86
omission         0.86
individuals      0.84
whistleblowers   0.83
laws             0.80
ministers        0.80
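Token lists like the ones above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and then sorting the resulting per-token scores. Whether this page computes its values that way is an assumption. The sketch below uses hypothetical names (`logit_effects`, `top_and_bottom`) and toy numbers that are not taken from this feature.

```python
def logit_effects(decoder_direction, unembedding, vocab):
    """Score each vocabulary token by projecting the feature's decoder
    direction through the unembedding matrix (assumed method).

    decoder_direction: list of floats, length d_model
    unembedding: d_model rows, each a list of vocab-size floats
    vocab: list of token strings, one per unembedding column
    """
    effects = []
    for col, token in enumerate(vocab):
        score = sum(d * row[col] for d, row in zip(decoder_direction, unembedding))
        effects.append((token, score))
    return effects


def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative (token, score) pairs."""
    ranked = sorted(effects, key=lambda pair: pair[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative


# Toy example: 2-dimensional model, 3-token vocabulary (all values made up).
toy_vocab = ["a", "b", "c"]
toy_direction = [1.0, -0.5]
toy_unembedding = [
    [1.0, 0.0, -1.0],
    [0.0, 1.0, 0.0],
]
toy_effects = logit_effects(toy_direction, toy_unembedding, toy_vocab)
toy_positive, toy_negative = top_and_bottom(toy_effects, k=1)
```

With real weights, `k=10` would give two ten-row lists in the same shape as the tables above.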
Activation density: 0.149%
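A density figure of this kind is usually read as the fraction of sampled tokens on which the feature fires at all. The sketch below assumes that definition, with "fires" meaning an activation strictly above a threshold of zero. The function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: 1000 token positions, the feature fires on 2 of them.
sample = [0.0] * 1000
sample[17] = 3.2
sample[512] = 0.4
sample_density = activation_density(sample)  # 2 / 1000
```

Under this reading, a density of 0.149% corresponds to the feature firing on roughly 1 token in 670.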