INDEX
Explanations
phrases related to uncertainty or decision-making processes
phrases that express being "in" or "out" of various situations or contexts
Negative Logits
ector       -0.69
elf         -0.69
uania       -0.68
aler        -0.66
enium       -0.64
arling      -0.64
endants     -0.63
urst        -0.62
doomed      -0.61
md          -0.61
Positive Logits
peoples      0.91
vain         0.81
circulation  0.80
escap        0.76
anybody      0.74
your         0.73
your         0.72
YOUR         0.70
limbo        0.70
our          0.69
Activations Density: 0.261%