INDEX
Explanations
phrases related to continuing or reading further
phrases related to ongoing actions or instructions to read further
New Auto-Interp
Negative Logits
oun
-0.66
olit
-0.61
opol
-0.58
mith
-0.58
Corp
-0.58
mart
-0.57
ciples
-0.56
knife
-0.54
lain
-0.54
é¾
-0.54
POSITIVE LOGITS
reading
0.89
scrolling
0.76
Reading
0.72
clicking
0.67
submitting
0.62
Below
0.61
...]
0.61
READ
0.60
trending
0.59
ceive
0.59
Activations Density 0.013%