INDEX
Explanations
phrases related to continuing or reading additional information
references to the act of continuing to read
New Auto-Interp
Negative Logits
ĪĴ
-0.87
oult
-0.77
ño
-0.76
»Ĵ
-0.75
estern
-0.72
opard
-0.72
alm
-0.71
ascal
-0.67
Ĭ±
-0.67
oint
-0.64
POSITIVE LOGITS
...]
0.76
WATCHED
0.72
Expand
0.71
toggle
0.71
comprehension
0.70
Below
0.69
aloud
0.67
ahead
0.65
âĨĴ
0.64
entials
0.62
Activations Density 0.015%