INDEX
Explanations
dates and significant years in historical contexts
sentence endings or conclusions
New Auto-Interp
Negative Logits
selfie
-0.79
pit
-0.77
pudding
-0.76
inki
-0.76
emoji
-0.75
neigh
-0.75
vape
-0.74
minion
-0.73
purse
-0.72
arse
-0.72
POSITIVE LOGITS
Eventually
1.48
Later
1.42
Afterwards
1.39
Shortly
1.29
Ultimately
1.29
Then
1.25
Unfortunately
1.22
Soon
1.21
During
1.20
Ironically
1.18
Activations Density 0.593%