INDEX
Explanations
instances of the abbreviation "Gr" followed by a number, which likely indicates grades or scores
Negative Logits
an       -0.32
at       -0.19
iance    -0.17
anio     -0.17
anca     -0.15
a        -0.15
anou     -0.15
straint  -0.14
corr     -0.14
anlar    -0.14
Positive Logits
imes     0.21
uber     0.20
instead  0.19
ims      0.18
indle    0.18
imal     0.18
Gr       0.18
uner     0.17
реб      0.17
illo     0.17
Activations Density: 0.008%