INDEX
Explanations
mention of actions related to funding and financial allocation
terms related to defunding and watering regulations
New Auto-Interp
Negative Logits
robat
-0.75
idad
-0.73
士
-0.71
hift
-0.69
drawer
-0.68
ioned
-0.66
Pike
-0.66
olson
-0.66
Domain
-0.64
inquest
-0.64
POSITIVE LOGITS
pling
0.74
mentation
0.73
adow
0.72
xual
0.70
hovah
0.68
OHN
0.68
bler
0.68
watering
0.67
Hort
0.67
elled
0.65
Activations Density 0.023%