INDEX
Explanations
words denoting change, particularly transitions or shifts in context or situation
Negative Logits
enfance      -0.53
tenuta       -0.51
hjemme       -0.51
ritratto     -0.49
ソリン       -0.48
riun         -0.47
témoins      -0.47
ølge         -0.47
memeriksa    -0.47
écnicas      -0.46
Positive Logits
shift        1.12
switch       1.10
shifted      1.03
shifting     1.00
Shifting     0.98
Switch       0.95
shifts       0.94
Shift        0.94
shift        0.93
switched     0.93
Activations Density: 0.512%