INDEX
Explanations
phrases associated with reading
instances of the word "read" and its variants
New Auto-Interp
Negative Logits
ortium
-0.64
iership
-0.60
vre
-0.59
penter
-0.58
Launcher
-0.58
ño
-0.57
uncture
-0.57
apolis
-0.56
udic
-0.55
restart
-0.55
POSITIVE LOGITS
aloud
1.32
just
1.02
dress
0.93
ahead
0.89
mitt
0.83
printed
0.81
written
0.81
\\\\\\\\
0.80
letter
0.78
comprehension
0.76
Activations Density 0.026%