INDEX
Explanations
occurrences of the letter "A" in various contexts
Negative Logits
el          -0.23
els         -0.17
aura        -0.15
elic        -0.15
mh          -0.15
irie        -0.15
pone        -0.15
abei        -0.15
incident    -0.15
lia         -0.15
Positive Logits
partir      0.26
través      0.26
pes         0.25
fort        0.24
princip     0.23
isl         0.23
fin         0.23
prob        0.22
continu     0.22
port        0.21
Activations Density 0.010%