INDEX
Explanations
proper nouns related to technology and individuals in news or entertainment
references to organizations and notable individuals
New Auto-Interp
Negative Logits
etheless
-0.98
nce
-0.70
GoldMagikarp
-0.69
utory
-0.66
Cosponsors
-0.64
}{-0.64
":"","
-0.63
general
-0.63
norm
-0.63
ends
-0.63
POSITIVE LOGITS
intact
1.00
hooked
0.90
installed
0.88
handy
0.85
locked
0.82
ready
0.82
attached
0.82
removed
0.81
hostage
0.81
gone
0.78
Activations Density 0.617%