INDEX
Explanations
phrases related to international affairs and political statements
connections between entities or actions in statements
New Auto-Interp
Negative Logits
agen
-0.75
lux
-0.74
neath
-0.73
̶
-0.71
dayName
-0.71
amel
-0.69
DX
-0.69
agger
-0.68
icult
-0.67
DragonMagazine
-0.66
POSITIVE LOGITS
vowed
1.12
thereby
1.11
urged
0.99
insisted
0.97
thus
0.91
demanded
0.90
consequently
0.87
encourages
0.87
recommends
0.86
warned
0.85
Activations Density 0.335%