INDEX
Explanations
mentions of specific names, particularly those related to politics and medicine
proper nouns related to individuals and entities, particularly in a political or economic context
New Auto-Interp
Negative Logits
kiss
-0.87
Rays
-0.66
born
-0.66
HERO
-0.63
Austral
-0.63
Quote
-0.62
friend
-0.62
Lange
-0.60
bearer
-0.60
burst
-0.60
POSITIVE LOGITS
owell
3.84
orr
1.64
umption
1.55
atel
1.39
uming
1.15
ennett
1.09
atton
1.03
agher
1.00
ermott
0.99
umers
0.95
Activations Density 0.030%