INDEX
Explanations
mentions of declining to comment on a topic
instances of the phrase "declined to comment."
Negative Logits
corn      -0.71
Finch     -0.69
tail      -0.67
locked    -0.66
friend    -0.64
nic       -0.63
rip       -0.63
course    -0.62
bey       -0.62
eries     -0.60
Positive Logits
ariat          0.97
specifics      0.86
directly       0.77
onym           0.76
substant       0.76
acknow         0.75
publicly       0.75
anonymously    0.75
ature          0.74
amera          0.73
Activations Density 0.025%