INDEX
Explanations
instances where someone is recalling or reacting to new information
instances of the phrases related to hearing and learning new information
New Auto-Interp
Negative Logits
ometimes
-0.80
except
-0.75
ccording
-0.73
umerous
-0.72
entirely
-0.70
unless
-0.68
COMPLE
-0.67
unparalleled
-0.67
aples
-0.67
actly
-0.66
POSITIVE LOGITS
my
0.67
puberty
0.61
Wings
0.59
Deadline
0.59
this
0.58
Pist
0.57
goodbye
0.57
Sloan
0.56
Flower
0.56
graduation
0.56
Activations Density 0.252%