INDEX
Explanations
references to small-scale entities or concepts
New Auto-Interp
Negative Logits
uvo
-0.68
obligé
-0.68
גרת
-0.67
ISSUED
-0.66
Réponses
-0.66
lusso
-0.65
eraard
-0.64
suivantes
-0.64
touristique
-0.64
Autorizaciones
-0.64
POSITIVE LOGITS
Small
1.70
SMALL
1.66
Small
1.65
small
1.65
small
1.57
SMALL
1.51
smal
1.48
Smal
1.32
Kleine
1.16
kleinen
1.10
Activations Density 0.076%