INDEX
Explanations
abstract nouns, specific items
New Auto-Interp
Negative Logits
associés
0.82
र्में
0.80
ucionales
0.80
ள்ளார்
0.77
Pathways
0.77
uttore
0.77
刓
0.76
partenaires
0.75
associée
0.75
Economía
0.75
POSITIVE LOGITS
thing
1.07
moment
0.81
heart
0.81
loose
0.78
idea
0.78
few
0.74
serious
0.74
heavy
0.72
matter
0.72
hour
0.72
Activations Density 0.000%