INDEX
Explanations
expressions of honesty and frankness
honestly expressing doubt or opinion
New Auto-Interp
Negative Logits
nemlig
-0.60
FunctionFlags
-0.59
surla
-0.56
namelijk
-0.55
appunto
-0.53
たしか
-0.51
pium
-0.51
cass
-0.50
Signalez
-0.49
確かに
-0.49
POSITIVE LOGITS
felt
0.47
feels
0.47
agak
0.45
probably
0.44
could
0.44
honesty
0.43
couldn
0.42
sorprender
0.41
prefier
0.41
honestly
0.41
Activations Density 0.005%