INDEX
Explanations
information related to politics, media coverage, and public figures
New Auto-Interp
Negative Logits
)).
-0.94
"))
-0.93
''.
-0.92
]).
-0.91
?".
-0.88
".
-0.87
`.
-0.86
'.
-0.84
"}
-0.83
.''.
-0.78
POSITIVE LOGITS
sprawling
0.62
relentlessly
0.61
crammed
0.59
famously
0.58
rundown
0.57
longtime
0.57
bloated
0.57
vague
0.54
myriad
0.54
sleek
0.54
Activations Density 1.799%