INDEX
Explanations
instances where the text mentions looking at or examining something closely
phrases that instruct the reader to examine or consider something
New Auto-Interp
Negative Logits
HCR
-0.69
¯¯¯¯
-0.66
Balt
-0.65
burgh
-0.64
await
-0.64
bery
-0.62
blown
-0.61
Jr
-0.61
ARP
-0.60
EMS
-0.60
POSITIVE LOGITS
yourself
0.75
0.68
correctly
0.66
lucky
0.65
iegel
0.64
perpend
0.63
minded
0.61
math
0.61
maths
0.61
unlucky
0.58
Activations Density 0.179%