Explanations
terms related to the concept of "non" or absence
Negative Logits
Token         Logit
horaire       -0.73
doulou        -0.70
vettor        -0.69
héri          -0.69
urbain        -0.69
littéraire    -0.68
scuro         -0.67
rése          -0.66
DockStyle     -0.66
rumahnya      -0.65
Positive Logits
Token         Logit
non           3.13
Non           2.99
Non           2.95
non           2.89
NON           2.71
NON           2.49
非            2.30
nons          1.87
非            1.86
Nons          1.80
Activations Density: 0.076%