INDEX
Explanations
news-related information like countries, organizations, events, and political figures
mentions of newsworthy events and reports
Negative Logits
Token        Logit
thous        -0.46
uyomi        -0.42
pse          -0.41
ulla         -0.40
chuk         -0.40
emed         -0.38
referen      -0.38
warr         -0.38
succumbed    -0.37
iterations   -0.36
Positive Logits
Token          Logit
largeDownload  0.47
DRAGON         0.45
CLOSE          0.44
DRAG           0.43
rians          0.41
ettel          0.40
press          0.40
PRESS          0.40
ktop           0.39
pin            0.39
Activations Density: 3.847%