INDEX
Explanations
proper nouns or named entities
instances of the word "identified" in various contexts
New Auto-Interp
Negative Logits
issance
-0.78
ights
-0.75
enjoyment
-0.73
imeo
-0.71
eries
-0.69
stead
-0.67
iths
-0.67
idth
-0.66
ework
-0.64
grain
-0.64
POSITIVE LOGITS
Cosponsors
0.87
surn
0.82
alias
0.77
redacted
0.77
Forensic
0.77
deceased
0.75
names
0.72
onyms
0.72
rified
0.71
Identified
0.70
Activations Density 0.078%