INDEX
Explanations
quantifiers and references to amounts or numbers
New Auto-Interp
Negative Logits
-0.66
HORE
-0.62
esternos
-0.61
haustible
-0.61
Queryable
-0.57
Cyfarwyddwr
-0.57
Ivoire
-0.57
harapkan
-0.56
cuerdo
-0.56
annica
-0.55
POSITIVE LOGITS
GOTREF
0.64
OGND
0.64
HomeAsUpEnabled
0.51
localVar
0.50
dominant
0.47
Coordenadas
0.47
<eos>
0.46
ναι
0.46
expandindo
0.45
'&:
0.45
Activations Density 0.491%