INDEX
Explanations
references to varying quantities or conditions pertaining to groups or categories
Followed by adjectives or regions
introducing examples or specific cases
New Auto-Interp
Negative Logits
EconPapers
-0.61
kmale
-0.55
Попис
-0.48
醐
-0.48
surla
-0.45
balleur
-0.44
ValueStyle
-0.43
Халык
-0.41
anch
-0.40
Климаты
-0.38
POSITIVE LOGITS
certaines
0.64
egyes
0.61
certain
0.60
Certain
0.59
sommige
0.59
Certain
0.59
sometimes
0.57
บาง
0.56
によっては
0.56
vissa
0.56
Activations Density 0.332%