INDEX
Explanations
instances of the word "por" indicating reasons or purposes
New Auto-Interp
Negative Logits
еÑĢп
-0.15
ilha
-0.15
urtle
-0.14
ustil
-0.14
ripsi
-0.14
lech
-0.14
NÄĽkter
-0.14
ksam
-0.14
endencies
-0.14
аниÑĨ
-0.14
POSITIVE LOGITS
means
0.16
line
0.16
ras
0.16
atch
0.16
ridge
0.16
courtesy
0.16
ro
0.16
ret
0.16
zych
0.15
erro
0.15
Activations Density 0.016%