INDEX
Explanations
names of people or entities
specific names, brands, or entities mentioned in the text
New Auto-Interp
Negative Logits
choke
-0.83
bottleneck
-0.71
gorilla
-0.70
stripping
-0.70
handc
-0.69
striking
-0.68
ierrez
-0.68
______
-0.66
brunt
-0.66
disabled
-0.66
POSITIVE LOGITS
Politics
1.04
Daily
1.02
conom
0.98
Magazine
0.97
Online
0.96
Media
0.95
Week
0.94
Blog
0.94
WN
0.94
NET
0.92
Activations Density 0.362%