INDEX
Explanations
instances of personal attacks and derogatory language in comments
New Auto-Interp
Negative Logits
balleur
-0.44
houſe
-0.43
ſelf
-0.43
obé
-0.37
richTextPanel
-0.37
officier
-0.37
mystérie
-0.36
侥
-0.36
MessageTagHelper
-0.36
ſeveral
-0.36
POSITIVE LOGITS
criticism
1.58
criticisms
1.50
criticize
1.48
criticizing
1.47
critici
1.38
criticise
1.34
Criticism
1.34
criticized
1.34
critique
1.33
Criticism
1.31
Activations Density 0.691%