INDEX
Explanations
occurrences of the letter 'a' or 'i' as single characters
New Auto-Interp
Negative Logits
finity
-0.07
LING
-0.07
å£
-0.07
ibus
-0.06
dob
-0.06
ract
-0.06
ury
-0.06
qv
-0.06
het
-0.06
antan
-0.06
POSITIVE LOGITS
ãĤ¤ãĤ¯
0.07
äº
0.07
TA
0.06
Pk
0.06
.EventType
0.06
ewire
0.06
adece
0.06
Ont
0.06
.mixin
0.06
è¯Ŀ
0.06
Activations Density 0.058%