INDEX
Explanations
common verbs and expressions related to personal agency and experiences
New Auto-Interp
Negative Logits
eivät
-0.69
Never
-0.63
never
-0.63
Never
-0.60
nobody
-0.60
مشين
-0.58
nobody
-0.57
indisponible
-0.57
never
-0.56
no
-0.55
POSITIVE LOGITS
queryInterface
0.82
כן
0.71
ьаж
0.70
indeed
0.69
matchCondition
0.68
pushFollow
0.66
continence
0.64
таки
0.61
ίο
0.61
Suara
0.60
Activations Density 0.121%