INDEX
Explanations
profane language
instances of strong profanity and expressions of frustration
New Auto-Interp
Negative Logits
olor
-0.81
raft
-0.73
ward
-0.70
�
-0.69
rift
-0.69
lus
-0.63
Analysis
-0.63
alyst
-0.62
Reporter
-0.60
holder
-0.60
POSITIVE LOGITS
fucking
3.70
goddamn
3.29
fuckin
2.90
freaking
2.53
godd
2.39
damn
2.20
FUCK
2.19
fucked
1.92
Godd
1.88
shitty
1.87
Activations Density 0.025%