INDEX
Explanations
elements related to political commentary and advice
New Auto-Interp
Negative Logits
.bat
-0.15
stad
-0.15
rompt
-0.14
ÑĤеÑĢÑĢиÑĤ
-0.14
ediator
-0.14
Specialist
-0.14
upply
-0.14
šov
-0.14
reesome
-0.14
ITTER
-0.14
POSITIVE LOGITS
focus
0.38
focus
0.34
Focus
0.33
attention
0.33
.focus
0.31
-focus
0.30
Focus
0.30
_focus
0.29
focusing
0.27
Attention
0.26
Activations Density 0.266%