INDEX
Explanations
instances of the word "when"
instances of the word "when."
Negative Logits
agin      -0.76
gan       -0.72
rolet     -0.71
zzi       -0.70
yan       -0.65
gem       -0.65
hole      -0.64
idan      -0.64
ictive    -0.64
aking     -0.63
Positive Logits
soever        1.39
confronted    0.82
irlf          0.81
faced         0.80
contrasted    0.76
compared      0.76
comparing     0.74
ŃĶ            0.68
©¶æ           0.68
pressed       0.67
Activations Density: 0.122%