INDEX
Explanations
descriptions of events or incidents mentioned in articles
references to academic articles and publications
New Auto-Interp
Negative Logits
tsy
-0.75
accommodation
-0.72
Cooldown
-0.65
toast
-0.65
ah
-0.63
drains
-0.61
ahs
-0.61
obe
-0.60
ime
-0.59
reperto
-0.58
POSITIVE LOGITS
titled
1.09
headlined
1.07
vertisement
1.03
uggest
1.01
lished
0.93
itled
0.91
purported
0.85
authored
0.85
published
0.85
entitled
0.84
Activations Density 0.384%