INDEX
Explanations
entities related to political figures
references to specific individuals, particularly notable figures or hosts in media contexts
Negative Logits
ï¸ı         -0.84
ngth        -0.78
andowski    -0.71
BIL         -0.69
photos      -0.68
furt        -0.67
_-          -0.65
pent        -0.64
ahime       -0.62
plac        -0.62
Positive Logits
ichick      0.86
Blasio      0.69
Reilly      0.68
Roses       0.68
cipled      0.65
bath        0.65
cker        0.64
Raven       0.61
Wool        0.60
ertodd      0.60
Activations Density 0.082%