INDEX
Explanations
mentions of historical events and documentation
mentions of historical events and their contexts
New Auto-Interp
Negative Logits
Fax
-0.76
nesty
-0.73
amazon
-0.72
Safe
-0.71
waivers
-0.69
Tweet
-0.68
Beware
-0.67
Ratings
-0.67
safe
-0.64
cheap
-0.63
POSITIVE LOGITS
eras
1.46
history
1.39
history
1.37
era
1.31
epoch
1.28
chronological
1.25
historical
1.24
period
1.23
histories
1.22
historians
1.20
Activations Density 0.725%