INDEX
Explanations
mentions of the United States government
mentions of the government, particularly the United States government
New Auto-Interp
Negative Logits
ranging
-0.69
tein
-0.66
ok
-0.65
omatic
-0.65
cheon
-0.62
var
-0.62
Bey
-0.62
Frequency
-0.61
ranch
-0.61
unci
-0.59
POSITIVE LOGITS
ÃŃs
0.94
orate
0.74
's
0.74
itself
0.71
eers
0.69
subsidized
0.69
Äĩ
0.69
Agency
0.67
ischer
0.67
subsid
0.66
Activations Density 0.167%