INDEX
Explanations
terms related to non-compliance and specific medical conditions
New Auto-Interp
Negative Logits
neuve
-0.75
ſche
-0.74
ainfi
-0.72
Beſ
-0.68
purpoſe
-0.67
lámpara
-0.67
Monfieur
-0.67
malheureux
-0.66
Theſe
-0.66
efficaces
-0.66
POSITIVE LOGITS
Non
0.91
non
0.87
NON
0.84
Non
0.81
nong
0.75
nons
0.73
Nons
0.71
non
0.70
NON
0.67
非
0.62
Activations Density 0.160%