INDEX
Explanations
dates and events in a news context
instances of high-impact events or their consequences
Negative Logits
dilig        -0.84
metic        -0.82
nodd         -0.82
oun          -0.78
incent       -0.75
cigar        -0.74
aditional    -0.74
¥ŀ           -0.74
warr         -0.73
deflation    -0.71
Positive Logits
She          2.78
Her          2.52
she          2.26
Ms           2.05
Mrs          1.87
she          1.66
Woman        1.66
Women        1.66
She          1.61
her          1.60
Activations Density: 0.319%