INDEX
Explanations
words related to distractions and efforts to divert attention
terms related to distraction and its effects
Negative Logits
ept           -0.64
holm          -0.63
Numbers       -0.62
iders         -0.61
fred          -0.61
anyahu        -0.60
iton          -0.60
kov           -0.60
bloc          -0.59
conditional   -0.59
Positive Logits
attention     0.80
aline         0.75
distract      0.75
distracting   0.73
distractions  0.72
uyomi         0.71
icult         0.70
ingly         0.69
ibility       0.66
staff         0.65
Activations Density: 0.058%