INDEX
Explanations
specific mentions of years or dates
key details related to historical events and dates
Negative Logits
Token       Logit
TeX         -0.35
20439       -0.33
è»          -0.32
NES         -0.31
ibly        -0.31
Gay         -0.31
agents      -0.30
ciation     -0.30
Nusra       -0.30
Ô           -0.30
Positive Logits
Token            Logit
volatile         0.29
branded          0.28
differentiate    0.26
fair             0.25
roach            0.25
Diablo           0.24
axy              0.24
retina           0.24
packed           0.24
differentiated   0.23
Activations Density: 4.436%