INDEX
Explanations
mentions of books or articles and their authors
New Auto-Interp
Negative Logits
customs
-0.83
Customs
-0.68
broom
-0.68
crane
-0.67
animate
-0.67
wink
-0.67
ambulance
-0.66
valve
-0.66
whim
-0.66
bucks
-0.65
POSITIVE LOGITS
essays
1.41
articles
1.16
essay
1.15
published
1.12
excerpts
1.09
insightful
1.07
blogs
1.07
1.06
reprinted
1.05
Understanding
1.05
Activations Density 2.254%