INDEX
Explanations
proper nouns related to political figures, specifically Barack Obama
references to acknowledgement or recognition
Negative Logits
Debor      -0.75
)].        -0.69
conflic    -0.68
senal      -0.67
Tib        -0.66
eleph      -0.66
Seym       -0.65
warr       -0.64
livest     -0.63
dough      -0.62
Positive Logits
enzie      1.19
ack        1.03
intosh     1.02
ACK        0.97
acks       0.96
acked      0.95
ademic     0.93
luster     0.89
nesses     0.86
les        0.84
Activations Density 0.011%