INDEX
Explanations
phrases related to previous knowledge or predictions
occurrences of the word "knew."
Negative Logits
otion          -0.77
orio           -0.73
phrine         -0.72
adies          -0.72
pmwiki         -0.70
ItemTracker    -0.70
otos           -0.68
adish          -0.68
psey           -0.67
pex            -0.66
Positive Logits
beforehand     1.07
instinctively  0.94
nothing        0.76
nothing        0.74
lege           0.73
ledged         0.72
footed         0.70
bones          0.69
firsthand      0.67
ledge          0.67
Activations Density 0.064%