Explanations
events or incidents happening in specific locations
instances of events or actions occurring
Negative Logits
arta        -0.84
heed        -0.72
been        -0.72
ogie        -0.71
aka         -0.70
veland      -0.68
tan         -0.68
ailable     -0.63
omorph      -0.63
omorphic    -0.62
Positive Logits
abruptly      0.78
initially     0.70
nesday        0.65
yesterday     0.64
earlier       0.64
Yanukovych    0.64
briefly       0.63
teasp         0.63
last          0.62
SourceFile    0.62
Activations Density 0.366%