INDEX
Explanations
phrases indicating uncertainty or questioning
New Auto-Interp
Negative Logits
<%=
-0.51
ว์
-0.51
iecie
-0.51
đại
-0.51
ศาสตร์
-0.50
Peters
-0.50
P
-0.48
IMDG
-0.48
mezzo
-0.48
дые
-0.48
POSITIVE LOGITS
frankly
0.93
honestly
0.86
Honestly
0.86
Honestly
0.80
honestly
0.77
Frankly
0.76
really
0.74
really
0.73
ScopeManager
0.73
admit
0.71
Activations Density 0.127%