INDEX
Explanations
dates and locations mentioned in a specific format
various forms of punctuation and their associated contexts in textual data
Negative Logits
lando        -0.83
Schwartz     -0.80
zona         -0.76
favor        -0.76
avorable     -0.74
ospons       -0.74
favors       -0.73
ASHINGTON    -0.71
DOE          -0.71
someday      -0.70
Positive Logits
apologise     1.12
Scotland      1.09
apologised    1.05
mould         1.01
fibre         1.01
centres       1.00
realise       0.99
recognise     0.99
Premiership   0.98
humour        0.98
Activations Density: 0.291%