INDEX
Explanations
specific mentions of academic journals, publishers, and institutions, particularly focusing on names and locations
terms related to political events and elections
New Auto-Interp
Negative Logits
apologise
-0.83
organis
-0.74
realise
-0.72
Firstly
-0.72
Firstly
-0.69
analyse
-0.65
recognise
-0.65
organising
-0.63
centres
-0.63
realised
-0.62
POSITIVE LOGITS
]).
0.72
}.
0.72
.).
0.71
)).
0.66
attRot
0.65
afterward
0.64
>.
0.64
]),
0.63
asio
0.62
.]
0.62
Activations Density 1.270%