INDEX
Explanations
expressions of personal struggle or difficulty
New Auto-Interp
Negative Logits
åĴ²
-0.17
oldem
-0.15
aden
-0.15
obao
-0.14
instability
-0.14
suite
-0.14
keit
-0.14
objectManager
-0.14
pol
-0.14
_suite
-0.14
POSITIVE LOGITS
stuck
0.39
struggle
0.34
struggling
0.28
struggled
0.26
struggles
0.26
unsure
0.25
Unsure
0.24
strugg
0.23
having
0.23
uncertain
0.23
Activations Density 0.143%