INDEX
Explanations
phrases centered around sitting and being stationary
Negative Logits
uolo               -0.83
⋙                  -0.73
Valerio            -0.70
})();              -0.65
'])){              -0.62
neod               -0.61
Durata             -0.61
modo               -0.61
ThroughAttribute   -0.61
tory               -0.61
Positive Logits
sit                1.45
Sit                1.41
SIT                1.40
Sit                1.39
Sitting            1.29
SIT                1.27
sits               1.26
sitting            1.25
sit                1.19
Sitting            1.18
Activations Density: 0.065%