INDEX
Explanations
phrases related to political and social issues
references to political themes and community engagement
New Auto-Interp
Negative Logits
guyen
-0.67
zens
-0.66
cumulative
-0.62
earchers
-0.59
GoldMagikarp
-0.57
à¨
-0.55
elve
-0.54
erenn
-0.53
irteen
-0.53
always
-0.52
POSITIVE LOGITS
.:
1.17
.[
1.15
.#
1.13
.
1.12
.?
1.07
.(
1.06
!.
0.99
.]
0.95
?:
0.93
.):
0.93
Activations Density 0.770%