INDEX
Explanations
phrases related to decision-making and choices
Negative Logits
одо                 -0.17
isay                -0.15
ITO                 -0.15
iven                -0.14
————————————————    -0.14
161                 -0.14
Yield               -0.14
обов                -0.14
jur                 -0.14
.try                -0.14
Positive Logits
then        0.22
then        0.22
çĦ¶åIJİ     0.18
Then        0.17
Then        0.17
load        0.17
pack        0.17
za          0.16
åį          0.15
THEN        0.15
Activations Density 0.156%