INDEX
Explanations
occurrences of the term "Math" and related mathematical terminology
New Auto-Interp
Negative Logits
ted
-0.17
pand
-0.16
adge
-0.16
uzz
-0.15
ông
-0.15
ắc
-0.14
ean
-0.14
ÅĽcie
-0.14
agina
-0.14
chner
-0.14
POSITIVE LOGITS
ews
0.36
ieu
0.35
ew
0.29
ias
0.28
iesen
0.27
eson
0.27
ilde
0.25
ur
0.23
usalem
0.23
ilda
0.22
Activations Density 0.009%