INDEX
Explanations
expressions of frustration and challenges in adapting to new situations
Negative Logits
agg        -0.14
rof        -0.14
munition   -0.14
irq        -0.14
Advance    -0.14
Previous   -0.13
artık      -0.13
últ        -0.13
.Last      -0.13
yıldır     -0.13
Positive Logits
initial    0.68
initially  0.63
initial    0.59
Initial    0.54
Initial    0.54
inicial    0.52
Initially  0.52
初         0.50
_initial   0.49
最初       0.49
Activations Density 0.322%