Explanations
temporal references and indicators of time in the text
Negative Logits
éry         -0.18
otta        -0.16
Appropri    -0.15
opus        -0.15
aus         -0.15
elman       -0.14
anne        -0.14
atten       -0.14
anan        -0.14
amon        -0.14
Positive Logits
orate               0.18
FromClass           0.16
ExecutionContext    0.15
vest                0.14
ndern               0.14
ethod               0.14
StatusLabel         0.14
ãĥķãĥĪ              0.13
setC                0.13
icolor              0.13
Activations Density: 0.305%