Explanations
occurrences of the letter 'a' in various contexts
Negative Logits
Token         Logit
vig           -0.78
investi       -0.75
compri        -0.72
impact        -0.72
kari          -0.70
vider         -0.68
men           -0.68
ven           -0.67
ves           -0.67
opis          -0.67
Positive Logits
Token         Logit
A             1.25
getA          1.25
A             1.06
brancas       1.01
aA            1.01
QApplication  0.99
dA            0.89
vermelhas     0.89
ansatte       0.88
OA            0.86
Activations Density 0.395%