INDEX
Explanations
words related to profanity and derogatory language
expressions related to frustration or strong emotions
New Auto-Interp
Negative Logits
Advisory
-0.77
Preview
-0.75
Directorate
-0.73
advisory
-0.71
CLR
-0.71
Cosponsors
-0.70
Proceedings
-0.68
Assistance
-0.68
Associated
-0.68
Municipal
-0.67
POSITIVE LOGITS
aaaa
1.08
fuckin
1.08
aaa
1.01
agra
1.00
illy
0.99
;)
0.97
oooo
0.96
fucking
0.94
fuck
0.93
eeee
0.93
Activations Density 0.322%