INDEX
Explanations
phrases related to political news and government policies
New Auto-Interp
Negative Logits
audi
-0.70
amiya
-0.63
overe
-0.62
ilial
-0.58
Neurolog
-0.56
behalf
-0.55
CRE
-0.55
deficits
-0.55
aring
-0.54
oster
-0.54
POSITIVE LOGITS
traction
0.93
foothold
0.90
mileage
0.77
bearings
0.77
chy
0.73
FREE
0.70
footing
0.69
ãĥĸ
0.69
attention
0.69
started
0.67
Activations Density 2.201%