INDEX
Explanations
phrases related to reflection or representation
New Auto-Interp
Negative Logits
queue
-0.78
contend
-0.77
sites
-0.72
jan
-0.70
headlined
-0.66
efer
-0.65
opher
-0.64
jet
-0.64
BILL
-0.63
parse
-0.63
POSITIVE LOGITS
ively
0.92
ational
0.86
sentiments
0.83
eternity
0.81
orical
0.80
orically
0.76
iveness
0.74
matically
0.74
atively
0.74
purity
0.73
Activations Density 0.864%