INDEX
Explanations
references to attendance and related identifiers in code
New Auto-Interp
Negative Logits
986
-0.20
Polic
-0.17
ennen
-0.16
haled
-0.15
shar
-0.15
barr
-0.15
illet
-0.14
alma
-0.14
-strokes
-0.14
929
-0.13
POSITIVE LOGITS
angkan
0.25
akan
0.22
ang
0.21
angan
0.21
ahan
0.20
arkan
0.19
askan
0.18
unya
0.18
engan
0.18
annya
0.17
Activations Density 0.019%