INDEX
Explanations
news report excerpts mentioning specific locations, events, and people
New Auto-Interp
Negative Logits
nesday
-0.62
destro
-0.60
undermin
-0.60
predec
-0.56
agre
-0.55
advertisement
-0.55
enegger
-0.54
assum
-0.54
omever
-0.54
scrut
-0.54
POSITIVE LOGITS
âĵĺ
1.11
itars
0.79
Profile
0.72
âĺħ
0.69
Joined
0.64
ensis
0.62
Died
0.61
Joined
0.58
Released
0.58
(?,
0.58
Activations Density 0.480%