INDEX
Explanations
phrases related to expressing strong opinions or critiques
phrases related to controversial political issues
New Auto-Interp
Negative Logits
Processing
-0.86
Twins
-0.76
CrossRef
-0.70
ITNESS
-0.67
Procedures
-0.66
luck
-0.65
Caption
-0.65
Composite
-0.64
0004
-0.64
Browse
-0.64
POSITIVE LOGITS
vehemently
1.21
bashing
1.17
denouncing
1.13
anti
1.11
condemn
1.11
denounce
1.10
opposing
1.10
criticizing
1.08
staunch
1.08
advoc
1.07
Activations Density 0.502%