INDEX
Explanations
phrases indicating direction or progression toward a conclusion or end
New Auto-Interp
Negative Logits
Calvo
-0.74
t
-0.71
fatica
-0.70
5
-0.69
foglal
-0.67
T
-0.65
печа
-0.63
cy
-0.63
riuscito
-0.63
</em>
-0.62
POSITIVE LOGITS
toward
2.19
towards
2.15
toward
2.12
Towards
2.11
Toward
2.09
Towards
2.06
towards
2.04
Toward
1.93
hacia
1.46
TOW
1.38
Activations Density 0.056%