INDEX
Explanations
dates and numerical values
numeric values and special characters
New Auto-Interp
Negative Logits
conservatives
-0.62
tweaked
-0.62
DJs
-0.60
Robot
-0.60
Dele
-0.60
broadcast
-0.59
strategically
-0.59
disruptive
-0.59
activists
-0.58
targeted
-0.58
POSITIVE LOGITS
«
1.12
^^
1.03
âĸł
0.95
^
0.94
Si
0.93
»
0.90
£
0.88
tion
0.87
»
0.86
sic
0.85
Activations Density 0.151%