INDEX
Explanations
instances of conjunctions and references to percentages or numerical data
New Auto-Interp
Negative Logits
rients
-0.16
ultan
-0.15
Giang
-0.15
erus
-0.14
fcc
-0.14
(KP
-0.14
èά
-0.14
ddit
-0.13
rapy
-0.13
báºŃc
-0.13
POSITIVE LOGITS
emi
0.14
shi
0.14
arem
0.14
Filed
0.14
previously
0.14
Cons
0.14
äd
0.14
leg
0.14
aren
0.14
recht
0.14
Activations Density 0.074%