INDEX
Explanations
expletives and profanity
phrases related to confusion and absurdity in various situations
Negative Logits
Cosponsors   -0.77
Qiao         -0.68
Regulatory   -0.67
Fiscal       -0.65
Ernst        -0.64
Referred     -0.64
Citation     -0.63
Thomson      -0.62
HHS          -0.62
Deutsche     -0.62
Positive Logits
fuck         1.03
shit         0.99
haha         0.87
crap         0.87
shit         0.85
;)           0.84
itty         0.83
fuckin       0.82
bitch        0.81
fuck         0.80
Activations Density: 0.715%