INDEX
Explanations
repetitive patterns of the letter 'a' in various contexts
New Auto-Interp
Negative Logits
queles
-0.99
quilo
-0.84
quela
-0.71
affrontare
-0.57
oignon
-0.50
również
-0.50
alcuni
-0.49
alcune
-0.49
appareil
-0.49
aprile
-0.48
POSITIVE LOGITS
priori
0.64
few
0.63
lot
0.61
OGND
0.55
aVar
0.54
posteriori
0.52
bit
0.52
href
0.50
little
0.47
więc
0.46
Activations Density 0.478%