INDEX
Explanations
opinions or views expressed in political op-eds
New Auto-Interp
Negative Logits
Beautiful
-0.65
Illum
-0.61
Dign
-0.60
GOODMAN
-0.59
Arabian
-0.58
Kens
-0.58
Flavoring
-0.57
Pyth
-0.57
Awakens
-0.57
EntityItem
-0.57
POSITIVE LOGITS
comment
0.81
ansion
0.78
reply
0.75
gres
0.75
Ops
0.74
opt
0.72
pos
0.71
etric
0.71
ops
0.71
doc
0.71
Activations Density 0.027%