INDEX
Explanations
proper nouns or specific terminology often associated with scientific contexts
New Auto-Interp
Negative Logits
navideña
-0.75
alemana
-0.74
asiática
-0.71
chilena
-0.71
navideño
-0.67
británica
-0.66
colgante
-0.64
inglesa
-0.61
compartida
-0.61
sintética
-0.60
POSITIVE LOGITS
eo
0.85
o
0.81
ono
0.80
oro
0.79
evo
0.76
edo
0.75
iro
0.74
ano
0.74
ho
0.73
olo
0.73
Activations Density 1.064%