INDEX
Explanations
elements related to negotiation and decision-making
New Auto-Interp
Negative Logits
occorre
-0.56
quelquefois
-0.55
muß
-0.53
désolés
-0.47
doskona
-0.45
poderá
-0.44
erforderlich
-0.44
señalado
-0.43
imidlertid
-0.43
parms
-0.43
POSITIVE LOGITS
idk
0.84
Anyways
0.83
shitty
0.81
Anyways
0.79
fucked
0.79
lmao
0.79
anyways
0.77
fucking
0.77
fuckin
0.73
tryna
0.73
Activations Density 1.871%