INDEX
Explanations
results related to actions and events with a narrative style
instances of numerical values or measurements related to significant actions or events
New Auto-Interp
Negative Logits
Pand
-0.63
Xi
-0.62
charm
-0.60
poisoned
-0.59
ideal
-0.57
eco
-0.57
vot
-0.56
tones
-0.56
popul
-0.56
Cyborg
-0.56
POSITIVE LOGITS
Eventually
1.13
SPONSORED
1.13
³³³³
1.08
Then
0.97
Later
0.96
Suddenly
0.96
Upon
0.94
Advertisement
0.92
PHOTOS
0.91
Soon
0.90
Activations Density 0.438%