INDEX
Explanations
comparative adjectives indicating levels of health or wellbeing
New Auto-Interp
Negative Logits
</i>
-0.73
low
-0.63
</b>
-0.62
low
-0.62
kom
-0.60
Low
-0.59
high
-0.58
oman
-0.57
rendah
-0.56
useCallback
-0.56
POSITIVE LOGITS
weile
0.94
correctes
0.91
wiſe
0.91
shewn
0.90
Efq
0.86
Оста
0.85
daß
0.84
^(@)
0.83
myſelf
0.83
iſt
0.83
Activations Density 0.042%