INDEX
Explanations
words related to regulations being rolled back
phrases related to reversing policies or actions
Negative Logits
legged          -0.70
án              -0.67
ussen           -0.66
Warning         -0.66
wyn             -0.65
oled            -0.64
raq             -0.64
acht            -0.64
Eye             -0.63
SELECT          -0.62
Positive Logits
globalization   0.72
ively           0.66
disco           0.65
misunderstand   0.64
Prohibition     0.63
declines        0.61
answering       0.60
theless         0.60
overc           0.59
interruption    0.59
Activations Density: 0.228%