INDEX
Explanations
names of people or places described in news articles
instances of citations or references to individuals in parentheses
New Auto-Interp
Negative Logits
regret
-0.71
lull
-0.70
adjustments
-0.69
deem
-0.69
treat
-0.67
incre
-0.66
dece
-0.65
toll
-0.65
overnight
-0.64
skyrocket
-0.63
POSITIVE LOGITS
pictured
1.53
left
1.53
Photo
1.49
Picture
1.41
center
1.37
Screenshot
1.37
bottom
1.36
Courtesy
1.35
above
1.32
photo
1.31
Activations Density 0.057%