INDEX
Explanations
phrases related to decision-making and outcomes
New Auto-Interp
Negative Logits
برÛĮ
-0.16
wan
-0.15
RegexOptions
-0.15
loat
-0.15
ĵåIJį
-0.13
ffa
-0.13
lasted
-0.13
æķ¦
-0.13
rone
-0.13
edin
-0.13
POSITIVE LOGITS
boiled
0.47
boil
0.47
boils
0.42
boiling
0.40
bo
0.38
Bo
0.38
rev
0.35
-bo
0.32
.bo
0.31
_bo
0.31
Activations Density 0.132%