INDEX
Explanations
instances of numerical values and quantities related to objects or entities
Negative Logits
Token       Logit
same        -0.23
stesso      -0.21
próp        -0.18
own         -0.18
owns        -0.18
own         -0.18
OWN         -0.17
Own         -0.17
queues      -0.17
misma       -0.17
Positive Logits
Token       Logit
certain     0.28
ión         0.28
importante  0.28
important   0.28
important   0.27
icamente    0.26
peque       0.25
ida         0.25
iversit     0.24
irse        0.23
Activations Density: 0.032%