INDEX
Explanations
the preposition 'a'
articles and determiners in the text
New Auto-Interp
Negative Logits
reports
-0.70
overs
-0.66
sorts
-0.66
Atkins
-0.66
livious
-0.65
Airl
-0.64
OTUS
-0.63
EVs
-0.63
Agg
-0.61
proceedings
-0.61
POSITIVE LOGITS
lder
1.20
uras
1.09
ria
1.06
ñ
1.00
merce
0.97
ld
0.97
ctors
0.96
ces
0.95
ctions
0.95
sembly
0.95
Activations Density 0.144%