INDEX
Explanations
expressions of approval or disapproval towards political figures or actions
phrases related to public approval and disapproval ratings
Negative Logits
orage     -0.66
GV        -0.64
fixme     -0.64
Roche     -0.62
udic      -0.61
mith      -0.61
UNCH      -0.61
AAF       -0.60
Codex     -0.59
esis      -0.59
Positive Logits
rating        1.00
ratings       0.99
rating        0.92
renheit       0.85
Rating        0.83
rated         0.81
Gallup        0.76
polarization  0.76
disapprove    0.75
watching      0.75
Activations Density: 0.125%