Explanations
phrases related to political figures and government positions
references to political figures and their roles or actions
Negative Logits
fibers          -0.70
wells           -0.69
mats            -0.69
Scrib           -0.69
selves          -0.69
shards          -0.65
electroly       -0.64
ponds           -0.62
intersection    -0.61
seams           -0.61
Positive Logits
veto            0.97
ij士             0.90
appoint         0.88
assassinated    0.84
vetoed          0.84
appointing      0.81
Cabinet         0.79
Downing         0.79
assad           0.78
appointed       0.77
Activations Density 0.639%