INDEX
Explanations
mentions of symptoms or experiences related to medical conditions
New Auto-Interp
Negative Logits
Ju
-0.49
Thus
-0.49
L
-0.49
J
-0.48
fä
-0.46
сво
-0.45
’
-0.45
Thus
-0.45
pri
-0.45
del
-0.44
POSITIVE LOGITS
yours
1.36
your
1.18
contigo
1.12
your
1.11
ของคุณ
1.02
Yours
0.99
you
0.98
Your
0.98
youre
0.97
suggestion
0.95
Activations Density 0.677%