INDEX
Explanations
text related to sports events and news
mentions of media outlets or references to images in articles
New Auto-Interp
Negative Logits
ttle
-0.73
arantine
-0.70
sic
-0.64
etime
-0.60
quarantine
-0.60
regate
-0.59
iversary
-0.59
weed
-0.58
stalk
-0.57
pillar
-0.56
POSITIVE LOGITS
Gleaming
0.80
ccording
0.76
largeDownload
0.71
millenn
0.70
NOR
0.69
Morning
0.69
士
0.68
plains
0.67
Emb
0.66
Jer
0.65
Activations Density 0.105%